Word Recognition Patents (Class 704/251)
-
Patent number: 11544473Abstract: The present invention allows for the capture and sentiment analysis of text the customer inputs into a chat, but never actually sends to the customer service representative (ghost text). The system captures this ghost text with a ghost capture system (GCS) software module. The GCS module analyzes the ghost text to generate metadata. The ghost text and metadata are used by a sentiment analysis engine to apply appropriate sentiment to the ghost text. The sentiment and ghost text are routed to a customer service representative (CSR). This provides the customer service agent with additional detail and information about a customer's emotions during a text chat conversation, allowing the CSR to determine a court of interaction not only based on the customer's response, but also based on the ghost text and the sentiment from the ghost text.Type: GrantFiled: May 19, 2021Date of Patent: January 3, 2023Assignee: VERINT AMERICAS INC.Inventor: Michael Johnston
-
Patent number: 11545142Abstract: A method includes receiving audio data encoding an utterance, processing, using a speech recognition model, the audio data to generate speech recognition scores for speech elements, and determining context scores for the speech elements based on context data indicating a context for the utterance. The method also includes executing, using the speech recognition scores and the context scores, a beam search decoding process to determine one or more candidate transcriptions for the utterance. The method also includes selecting a transcription for the utterance from the one or more candidate transcriptions.Type: GrantFiled: March 24, 2020Date of Patent: January 3, 2023Assignee: Google LLCInventors: Ding Zhao, Bo Li, Ruoming Pang, Tara N. Sainath, David Rybach, Deepti Bhatia, Zelin Wu
-
Patent number: 11526666Abstract: A programmable device such as a smart phone allows a user an opportunity to make final corrections to textual data in a message after the user has instructed the device to send the message, but before transmittal of the message. The opportunity is temporary, to avoid impeding the flow of communication, and the textual data is transmitted unmodified if the opportunity to modify it is not accepted. Modifications made during the opportunity period may be used to adapt an autocorrect functionality of the programmable device.Type: GrantFiled: October 5, 2018Date of Patent: December 13, 2022Assignee: Apple Inc.Inventors: Mehul K. Sanghavi, Swati J. Deo
-
Patent number: 11520610Abstract: Embodiments described herein are generally directed towards systems and methods relating to a crowd-sourced digital assistant and system. In particular, embodiments facilitate the intuitive creation and distribution of action datasets that include computing events or tasks that can be reproduced when an associated command, stored in an action dataset, is determined received by a digital assistant device. The digital assistant device described herein can generate new action datasets, on-board new action datasets, and receive new action datasets or updates to existing action datasets. Each digital assistant device in the described system can participate in the building of action datasets, so as to crowd-source a dialect that can be understood by a digital assistant device.Type: GrantFiled: May 18, 2018Date of Patent: December 6, 2022Assignee: PELOTON INTERACTIVE INC.Inventors: Rajat Mukherjee, Kiran Bindhu Hemaraj, Matan Levi
-
Patent number: 11514916Abstract: A server for supporting speech recognition of a device and an operation method of the server. The server and method identify a plurality of estimated character strings from the first character string and obtain a second character string, based on the plurality of estimated character strings, and transmit the second character string to the device. The first character string is output from a speech signal input to the device, via speech recognition.Type: GrantFiled: August 13, 2020Date of Patent: November 29, 2022Assignee: SAMSUNG ELECTRONICS CO., LTD.Inventors: Chanwoo Kim, Sichen Jin, Kyungmin Lee, Dhananjaya N. Gowda, Kwangyoun Kim
-
Patent number: 11507907Abstract: Systems for optimized forecasting are provided. In some examples, data associated with strategy of one or more business units may be received. The strategy data may include identification of projects or goals. In some examples, industry trend data may be received and may include data associated with in-demand job skills and the like. An instruction to capture user data may be transmitted to one or more user devices of an employee user. The instruction may cause activation of one or more sensors or data capture devices. The captured user data may be received and analyzed to determine a competency of the user. Based on the strategy data, industry data and determined competency, one or more deficiencies between the resources needed to meet the business unit strategy data and the available resources may be identified. Based on the identified deficiency, one or more actions for execution may be identified and executed.Type: GrantFiled: December 9, 2020Date of Patent: November 22, 2022Assignee: Bank of America CorporationInventors: Sandeep Kumar Chauhan, Madhuri Aniruddha Deshpande, Moses Salagala, Jagadish Reddy
-
Patent number: 11503383Abstract: Aspects of the subject disclosure may include, for example, applying first data associated with a first content item to a model to generate first classification characteristics, analyzing the first classification characteristics to generate a first marker, wherein the first marker delineates a first location of inventory within the first content item, selecting a first creative to populate a portion of the inventory, and populating, based on the selecting, the portion of the inventory with the first creative. Other embodiments are disclosed.Type: GrantFiled: May 13, 2021Date of Patent: November 15, 2022Assignee: AT&T Intellectual Property I, L.P.Inventors: Binny Asarikuniyil, Megha Venugopal
-
Patent number: 11495208Abstract: In some embodiments, recognition results produced by a speech processing system (which may include two or more recognition results, including a top recognition result and one or more alternative recognition results) based on an analysis of a speech input, are evaluated for indications of potential errors. In some embodiments, the indications of potential errors may include discrepancies between recognition results that are meaningful for a domain, such as medically-meaningful discrepancies. The evaluation of the recognition results may be carried out using any suitable criteria, including one or more criteria that differ from criteria used by an ASR system in determining the top recognition result and the alternative recognition results from the speech input. In some embodiments, a recognition result may additionally or alternatively be processed to determine whether the recognition result includes a word or phrase that is unlikely to appear in a domain to which speech input relates.Type: GrantFiled: October 23, 2017Date of Patent: November 8, 2022Assignee: Nuance Communications, Inc.Inventors: William F. Ganong, III, Raghu Vemula, Robert Fleming
-
Patent number: 11495218Abstract: Systems and processes for providing a virtual assistant service are provided. In accordance with one or more examples, a method includes receiving, from an accessory device communicatively coupled to the first electronic device, a representation of a speech input representing a user request. The method further includes detecting a second electronic device and transmitting, from the first electronic device, a representation of the user request and data associated with the detected second electronic device to a third electronic device. The method further includes receiving, from the third electronic device, a determination of whether a task is to be performed by the second electronic device in accordance with the user request; and in accordance with a determination that a task is to be performed by the second electronic device, requesting the second electronic device to performed the task in accordance with the user request.Type: GrantFiled: August 31, 2018Date of Patent: November 8, 2022Assignee: Apple Inc.Inventors: Brandon J. Newendorp, Anumita Biswas, Gagan A. Gupta, Benjamin S. Phipps, Kisun You
-
Patent number: 11488587Abstract: Disclosed is a regional-features-based speech recognition method, including learning speech features by region using speech data classified by region category, and recognizing input speech using an acoustic model and a language model generated through classification of a region category for the input speech and the learning. A user may use a dialect recognition service that is improved using learning based on artificial intelligence (AI) and enhanced mobile broadband (eMBB), ultra-reliable and low latency communications (URLLC), and massive machine-type communications (mMTC) techniques of 5G mobile communication.Type: GrantFiled: March 18, 2020Date of Patent: November 1, 2022Assignee: LG ELECTRONICS INC.Inventor: Seonyeong Park
-
Patent number: 11482222Abstract: A method and apparatus for determining a unique wake word for devices within an incident. One system includes an electronic computing device comprising a transceiver and an electronic processor communicatively coupled to the transceiver. The electronic processor is configured to receive a notification indicative of an occurrence of an incident and one or more communication devices present at the incident, determine contextual information associated with the incident and the one or more communication devices, and identify one or more wake words based on the contextual information. The electronic processor is further configured to determine a phonetic distance for each pair of wake words included in the one or more wake words, and select a unique wake word from the one or more wake words for each communication device of the one or more communication devices based on the determined phonetic distance.Type: GrantFiled: March 12, 2020Date of Patent: October 25, 2022Assignee: MOTOROLA SOLUTIONS, INC.Inventors: Sean Regan, Maryam Eneim, Melanie King, Manoj Prasad Nagendra Prasad
-
Patent number: 11475875Abstract: In one aspect, a computerized method useful for implementing a language neutral virtual assistant including the step of providing a language detector. The language detector comprises one or more trained language classifiers. With language detector identifying a language of an incoming message from a user to an artificially intelligent (AI) personal assistant. The method includes the step of receiving an incoming message to the AI personal assistant. The method includes the step of normalizing the incoming message, wherein the normalizing the incoming message comprises a set of spelling corrections and a set of grammar corrections. The method includes the step of translating the incoming message to a specified language with a specified encoding process and a specified decoding process. The method includes the step of providing an AI personal assistant engine that comprise an artificial intelligence which conducts a conversation via auditory or textual methods.Type: GrantFiled: October 27, 2019Date of Patent: October 18, 2022Inventors: Sriram Chakravarthy, Madhav Vodnala, Balakota Srinivas Vinnakota, Ram Menon
-
Patent number: 11475894Abstract: This application discloses a method and apparatus for processing audio information, a storage medium, and an electronic apparatus. The method includes: detecting that a segment of audio information is being received on a client, a first portion of audio information in the segment of audio information having been currently received on the client; obtaining first information, second information, and third information based on the first portion of audio information that has been currently received, the first information including text information corresponding to the first portion of audio information, the second information including information that meets a target condition and that corresponds to the first information, and the third information including information to be pushed to the client, which is obtained based on a keyword in the first information; and displaying the first information, the second information, and the third information on the client.Type: GrantFiled: June 19, 2020Date of Patent: October 18, 2022Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITEDInventor: Longbin Li
-
Patent number: 11465290Abstract: A robot capable of conversation with another robot and a method of controlling the same are disclosed. The robot includes a main body having a first region corresponding to a human face and rotatable in left-right direction directions, a signal generator generating a first data signal to be transmitted to a listener robot and a first robot voice signal corresponding to the first data signal, a communication unit transmitting the first data signal to an external server, a speaker outputting the first robot voice signal, and a controller controlling a rotation direction of the main body such that the first region is directed toward the listener robot at a time point adjacent to a transmission time of the first data signal and controlling the speaker to output the first robot voice signal after the rotation direction of the robot is controlled, wherein the listener robot receives the first data signal transmitted from the external server and is controlled to operate based on the first data signal.Type: GrantFiled: August 29, 2019Date of Patent: October 11, 2022Assignee: LG ELECTRONICS INC.Inventors: Ji Yoon Park, Jungkwan Son
-
Patent number: 11457214Abstract: Quantization matrix can be used to adjust quantization of transform coefficients at different frequencies. In one embodiment, a single fixed parametric model, such as a polynomial is used to represent a quantization matrix. Modulation of bit cost and complexity is achieved by specifying only the n first polynomial coefficients, the remaining ones being implicitly set to zero or other default values. One form of the single fixed polynomial is a fully developed polynomial in (x, y), where x, y indicate the coordinates of a given coefficient in a quantization matrix, with terms ordered by increasing exponent. Since higher exponents are the last ones, reducing the number of polynomial coefficients reduces the degree of the polynomial, hence its complexity. The polynomial coefficients can be symmetrical in x and y, and thus reducing the number of polynomial coefficients that need to be signaled in the bitstream.Type: GrantFiled: August 8, 2019Date of Patent: September 27, 2022Assignee: InterDigital VC Holdings France, SASInventors: Philippe De Lagrange, Ya Chen, Edouard Francois
-
Patent number: 11455990Abstract: An electronic device is disclosed. The electronic device comprises: a voice input unit; a storage unit for storing a first text according to a first transcript format and at least one second text obtained by transcribing the first text in a second transcript format; and a processor for, when a voice text converted from a user voice input through the voice input unit corresponds to a preset instruction, executing a function according to the preset instruction. The processor executes a function according to a preset instruction when the preset instruction includes a first text and a voice text is a text in which the first text of the preset instruction has been transcribed into a second text of a second transcript format.Type: GrantFiled: November 23, 2018Date of Patent: September 27, 2022Assignee: SAMSUNG ELECTRONICS CO., LTD.Inventor: Jaesung Kwon
-
Patent number: 11455984Abstract: A method and system of reducing noise associated with telephony-based activities occurring in shared workspaces is provided. An end-user may lower their own voice to a whisper or other less audible or intelligible utterances and submit such low-quality audio signals to an automated speech recognition system via a microphone. The words identified by the automated speech recognition system are provided to a speech synthesizer, and a synthesized audio signal is created artificially that carries the content of the original human-produced utterances. The synthesized audio signal is significantly more audible and intelligible than the original audio signal. The method allows customer support agents to speak at barely audible levels yet be heard clearly by their customers.Type: GrantFiled: October 28, 2020Date of Patent: September 27, 2022Assignee: United Services Automobile Association (USAA)Inventors: Justin Dax Haslam, Donnette L. Moncrief Brown, Eric David Schroeder, Ravi Durairaj, Deborah Janette Schulz
-
Patent number: 11443736Abstract: [Problem] Provided is a presentation support system that makes it possible to give effective presentations, for both presentations by machines and normal presenters. [Solution] The presentation support system included: a display unit 3; a material storage unit 5 that stores a presentation material and a plurality of keywords; an audio storage unit 7; an audio analysis unit 9 that analyzes a term contained in a presentation; a keyword order adjustment unit 11 that analyzes an order of appearance of a plurality of keywords contained in the audio analyzed by the audio analysis unit and changes the order of the plurality of keywords on the basis of the order of appearance; and a display control unit 13 that controls content displayed in the display unit 3.Type: GrantFiled: September 9, 2020Date of Patent: September 13, 2022Assignee: Interactive Solutions Corp.Inventor: Kiyoshi Sekine
-
Patent number: 11430442Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for contextual hotwords are disclosed. In one aspect, a method, during a boot process of a computing device, includes the actions of determining, by a computing device, a context associated with the computing device. The actions further include, based on the context associated with the computing device, determining a hotword. The actions further include, after determining the hotword, receiving audio data that corresponds to an utterance. The actions further include determining that the audio data includes the hotword. The actions further include, in response to determining that the audio data includes the hotword, performing an operation associated with the hotword.Type: GrantFiled: October 12, 2020Date of Patent: August 30, 2022Assignee: Google LLCInventors: Christopher Thaddeus Hughes, Ignacio Lopez Moreno, Aleksandar Kracun
-
Patent number: 11427156Abstract: A method for generating an output for controlling a vehicular function of a vehicle includes providing (i) a vehicular sensing device having at least one illumination source operable to backlight a plurality of icons, each icon representative of a respective vehicle function, and (ii) a plurality of sensors, each sensor having a respective field of sensing associated with a respective icon of the plurality of icons. With the vehicular sensing device disposed at a vehicle, and with the at least one illumination source activated to backlight the plurality of icons, the backlit icons are viewable at an exterior portion of the vehicle, and the sensors sense movement of a person's hand or foot in a field of sensing of one of the sensors, and a controller generates an output to control the vehicular function that is represented by the respective backlit icon associated with that sensor.Type: GrantFiled: January 11, 2021Date of Patent: August 30, 2022Assignee: MAGNA MIRRORS OF AMERICA, INC.Inventors: Justin E. Sobecki, David P. O'Connell, Kenneth C. Peterson
-
Patent number: 11431642Abstract: Systems and processes for operating an intelligent automated assistant are provided. In one example process, an event associated with an audio input is detected with a first process. In accordance with a detection of the event, a delay value associated with an electronic device is determined. The delay value corresponds to a time required to determine, with a second process, whether the audio input includes a spoken trigger. In accordance with a determination that the delay value exceeds a threshold, the delay value is broadcast during a first advertising session, and determination is made, during a second advertising session, whether the electronic device is to respond to the audio input. In accordance with a determination that the threshold is not exceeded, a determination is made, during the first advertising session, whether the electronic device is to respond to the audio input or wait for the second advertising session.Type: GrantFiled: October 13, 2020Date of Patent: August 30, 2022Assignee: Apple Inc.Inventor: Kurt Piersol
-
Patent number: 11423892Abstract: An electronic device is disclosed. The electronic device comprises: a voice input unit; a storage unit for storing a first text according to a first transcript format and at least one second text obtained by transcribing the first text in a second transcript format; and a processor for, when a voice text converted from a user voice input through the voice input unit corresponds to a preset instruction, executing a function according to the preset instruction. The processor executes a function according to a preset instruction when the preset instruction includes a first text and a voice text is a text in which the first text of the preset instruction has been transcribed into a second text of a second transcript format.Type: GrantFiled: November 23, 2018Date of Patent: August 23, 2022Assignee: SAMSUNG ELECTRONICS CO., LTD.Inventor: Jaesung Kwon
-
Patent number: 11417343Abstract: A speaker identification system (“system”) automatically assigns a speaker to voiced segments in a call. The system identifies one or more speakers in a call using one or more speaker-identification parameters. The system processes the call to determine one or more speaker-identification parameters, such as a transcript of the call, a facial image of the speaker, a scene image, which is an image of a scene in which the speaker is located during the call, or textual data associated with the call such as names of the speaker or an organization that are retrieved from the scene images or video data of the call. The system analyzes one or more of the speaker-identification parameters and determines the identity of the speaker. The system then identifies the voice segments associated with the identified speaker and marks the voice segments with the identity of the speaker.Type: GrantFiled: July 2, 2018Date of Patent: August 16, 2022Assignee: ZOOMINFO CONVERSE LLCInventors: Raphael Cohen, Erez Volk, Russell Levy, Micha Yochanan Breakstone, Orgad Keller, Ilana Tuil, Amit Ashkenazi
-
Patent number: 11417321Abstract: A device for changing a speech recognition sensitivity for speech recognition can include a memory and a processor configured to obtain a first plurality of speech data input at different times, apply a pre-trained speech recognition model to the first plurality of speech data at a plurality of different speech recognition sensitivities, obtain a first speech recognition sensitivity from among the plurality of different speech recognition sensitivities based on the pre-trained speech recognition model and the plurality of different speech recognition sensitivities, the first speech recognition sensitivity corresponding to an optimal speech recognition sensitivity at which a speech recognition success rate of the speech recognition model satisfies a set first recognition success rate criterion, and change a setting of the speech recognition sensitivity based on the first speech recognition sensitivity obtained from among the plurality of different speech recognition sensitivities.Type: GrantFiled: April 24, 2020Date of Patent: August 16, 2022Assignee: LG ELECTRONICS INC.Inventors: Sang Won Kim, Joonbeom Lee
-
Patent number: 11417331Abstract: The present disclosure provides a method for controlling a terminal, including the following operations: obtaining recognition results corresponding to control signals after receiving the control signals, and determining whether control instructions corresponding to the recognition results conflict, each control signal comprising at least one of a voice signal or a gesture signal; determining a credibility of each control instruction in response to a determination that there exists conflict among control instructions; and sending the control instruction with highest credibility to a control terminal. The present disclosure further provides a device for controlling a terminal and a computer readable storage medium. When control instructions are received and there exists conflict among control instructions, the control instruction with the highest credibility is sent to the control terminal after the credibility of each control instructions is determined, thereby avoiding settings from conflict.Type: GrantFiled: March 6, 2020Date of Patent: August 16, 2022Assignees: GD MIDEA AIR-CONDITIONING EQUIPMENT CO., LTD., MIDEA GROUP CO., LTD.Inventors: Zhicai Ou, Weiying Li
-
Patent number: 11409961Abstract: This disclosure describes techniques and architectures for evaluating conversations. In some instances, conversations with users, virtual assistants, and others may be analyzed to identify potential risks within a language model that is employed by the virtual assistants and other entities. The potential risks may be evaluated by administrators, users, systems, and others to identify potential issues with the language model that need to be addressed. This may allow the language model to be improved and enhance user experience with the virtual assistants and others that employ the language model.Type: GrantFiled: October 10, 2019Date of Patent: August 9, 2022Inventors: Cynthia Freeman, Ian Beaver
-
Patent number: 11410660Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for voice recognition. In one aspect, a method includes the actions of receiving a voice input; determining a transcription for the voice input, wherein determining the transcription for the voice input includes, for a plurality of segments of the voice input: obtaining a first candidate transcription for a first segment of the voice input; determining one or more contexts associated with the first candidate transcription; adjusting a respective weight for each of the one or more contexts; and determining a second candidate transcription for a second segment of the voice input based in part on the adjusted weights; and providing the transcription of the plurality of segments of the voice input for output.Type: GrantFiled: April 1, 2020Date of Patent: August 9, 2022Assignee: Google LLCInventors: Petar Aleksic, Pedro J. Moreno Mengibar
-
Patent number: 11410648Abstract: The present disclosure is generally related to a data processing system to selectively invoke applications for execution. A data processing system can receive an input audio signal and can parse the input audio signal to identify a command. The data processing system can identify a first functionality of a first digital assistant application hosted on the data processing system in the vehicle and a second functionality of a second digital assistant application accessible via a client device. The data processing system can determine that one of the first functionality or the second functionality supports the command. The data processing system can select one of the first digital assistant application or the second digital assistant application based on the determination. The data processing system invoke one of the first digital assistant application or the second digital assistant application based on the selection.Type: GrantFiled: October 3, 2017Date of Patent: August 9, 2022Assignee: GOOGLE LLCInventors: Haris Ramic, Vikram Aggarwal, Moises Morgenstern Gali, Brandon Stuut
-
Patent number: 11405522Abstract: The present technology relates to an information processing device, and an information processing method, each of which enables to reduce a confirmation load put on a user before a task is executed. The information processing device according to one embodiment of the present technology has the feature of, on the basis of relationship between a first cost required in a case where execution of a predetermined task has been a mistake and a second cost that is allowed by a user for the predetermined task that has been executed by mistake, calculating a confirmation degree of confirming the user as to whether or not to execute the predetermined task, and performing the confirmation by contents corresponding to the calculated degree. The present technology can be applied to an agent apparatus that operates using a voice UI.Type: GrantFiled: April 13, 2018Date of Patent: August 2, 2022Assignee: SONY CORPORATIONInventor: Katsuyoshi Kanemoto
-
Patent number: 11404049Abstract: In non-limiting examples of the present disclosure, systems, methods and devices for integrating speech-to-text transcription in a productivity application are presented. A request to access a real-time speech-to-text transcription of an audio signal that is being received by a second device is sent by a first device. The real-time speech-to-text transcription may be surfaced in a transcription pane of a productivity application on the first device. A request to translate the transcription to a different language may be received. The transcription may be translated in real-time and surfaced in the transcription pane. A selection of a word in the surfaced transcription may be received. A request to drag the word from the transcription pane and drop it in a window in the productivity application outside of the transcription pane may be received. The word may be surfaced in the window in the productivity application outside of the transcription pane.Type: GrantFiled: December 9, 2019Date of Patent: August 2, 2022Assignee: MICROSOFT TECHNOLOGY LICENSING, LLCInventors: Dana Minh Nguyen, Rohail Mustafa Syed, Alisa Marilyn Bacon, William Duncan Lewis, Michael Tholfsen, Carly Larsson
-
Patent number: 11403469Abstract: The present invention makes it possible to generate a paraphrastic sentence that has a similar meaning to the original sentence despite a local word/phrase difference, or a non-paraphrastic sentence that is not a paraphrase despite having a similar meaning to the original sentence in terms of the entire sentence. An estimation unit 22 estimates a word deletion probability for each of words constituting an input sentence, by using a positive example model that has been trained based on a positive example constituted by a sentence and a paraphrastic sentence of the sentence, and is used to generate a paraphrastic sentence by deleting a word, or by using a negative example model that has been trained based on a negative example constituted by the sentence and a non-paraphrastic sentence of the sentence, and is used to generate a non-paraphrastic sentence by deleting a word.Type: GrantFiled: July 23, 2019Date of Patent: August 2, 2022Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATIONInventors: Itsumi Saito, Kyosuke Nishida, Hisako Asano, Junji Tomita
-
Patent number: 11397856Abstract: A token is extracted from a Natural Language input. A phonetic pattern is computed corresponding to the token, the phonetic pattern including a sound pattern that represents a part of the token when the token is spoken. New data is created from data of the phonetic pattern, the new data including a syllable sequence corresponding to the phonetic pattern. A state of a data storage device is changed by storing the new data in a matrix of syllable sequences corresponding to the token. An option is selected that corresponds to the token by executing a fuzzy matching algorithm using a processor and a memory, the selecting of the option is based on a syllable sequence in the matrix.Type: GrantFiled: November 26, 2019Date of Patent: July 26, 2022Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Sean M. Fuoco, John M. Ganci, Jr., Craig M. Trim, Jie Zeng
-
Patent number: 11393464Abstract: Apparatuses, methods and storage medium associated with a spoken dialogue system are disclosed herein. In embodiments, an apparatus for natural machine conversing with a user may comprise a listening component to detect a keyword that denotes start of a conversation; a dialogue engine to converse with the user during the conversation; and a controller to selectively activate or cause to be activated one of the listening component or the dialogue component, and to pass control to the activated listening component or the activated dialogue engine, based at least in part on a state of the conversation. Other embodiments may be disclosed or claimed.Type: GrantFiled: June 6, 2019Date of Patent: July 19, 2022Assignee: Intel CorporationInventors: Lavinia A. Danielescu, Shawn C. Nikkila, Robert J. Firby, Beth Ann Hockey
-
Patent number: 11392665Abstract: A computer-implemented method, system, and computer program product for analyzing readability of a communication intended for a target audience includes: analyzing the communication to determine a first readability measure associated with the communication; determining a second readability measure associated with the target audience based on one or more historical communications previously transmitted or received by the target audience; and generating a readability feedback signal for the communication based on the first readability measure and the second readability measure.Type: GrantFiled: August 1, 2019Date of Patent: July 19, 2022Assignee: International Business Machines CorporationInventors: Adam Pilkington, Graham Charters, Gordon Hutchison, Tim Mitchell
-
Patent number: 11386233Abstract: The present disclosure provides a method, system, and device for distributing a software release. To illustrate, based on one or more files for distribution as a software release, a release bundle is generated that includes release bundle information, such as, for each file of the one or more files, a checksum, meta data, or both. One or more other aspects of the present disclosure further provide sending the release bundle to a node device. After receiving the release bundle at the node device, the node device receives and stores at least one file at a transaction directory. After verification that each of the one or more files is present/available at the node device, the one or more files may be provided to a memory of a node device and meta data included in the release bundle information may be applied to the one or more files transferred to the memory.Type: GrantFiled: April 30, 2019Date of Patent: July 12, 2022Assignee: JFrog, Ltd.Inventor: Yoav Landman
-
Patent number: 11386890Abstract: A system is provided for reducing friction during user interactions with a natural language processing system, such as voice assistant systems. The system determines a pre-trained model using dialog session data corresponding to multiple user profiles. The system determines a fine-tuned model using the pre-trained model and a fine-tuning dataset that corresponds to a particular task, such as query rewriting. The system uses the fine-tuned model to process a user input and determine an alternative representation of the input that can result in a desired response from the natural language processing system.Type: GrantFiled: February 11, 2020Date of Patent: July 12, 2022Assignee: Amazon Technologies, Inc.Inventors: Xing Fan, Zheng Chen, Yuan Ling, Lambert Leo Mathias, Chenlei Guo
-
Patent number: 11386919Abstract: The present disclosure provides methods and systems that may be used for providing quality control for audio samples. The audio samples may be speech samples of a user. The user may be participating in an audio interview.Type: GrantFiled: December 31, 2020Date of Patent: July 12, 2022Assignee: AC Global Risk, Inc.Inventor: James A. Kane
-
Patent number: 11386910Abstract: Various technologies described herein pertain to active noise cancellation in the interior of a vehicle. In exemplary embodiments, a microphone mounted on the vehicle outputs an audio signal indicative of noise emitted by a noise source. A computing system of the vehicle determines a position of the noise source based upon sensor signals output by sensors mounted on the vehicle. The computing system further determines a position of a passenger in the vehicle based upon a sensor mounted inside the vehicle. The computing system generates a complementary signal that is configured to attenuate the noise based upon the audio signal, the position of the noise source, and the position of the passenger. The complementary signal is then output by way of a speaker in the interior of the vehicle.Type: GrantFiled: May 11, 2020Date of Patent: July 12, 2022Assignee: GM CRUISE HOLDINGS LLCInventors: Marko Tintor, Matt Fornero
-
Patent number: 11380306Abstract: Expansion of intent classification data utilizing batch utterance scheduling, by a processor in a computing environment. A set of unlabeled examples for intent processing is received by an intent builder iteratively defining an intent. The set of examples are separated into a first subset processed according to a first model and a second subset processed according to a second model. The first subset is incorporated into the intent builder during a building iteration and scheduling a first batch processing of the second subset processed according to the second model based on a scheduling criteria. The first batch processing of the second subset is initiated once the scheduling criteria is satisfied. Upon completion of the first batch processing, results of the completion are used to influence additional examples retrieved from the first subset and the second subset during a subsequent building iteration by the intent builder.Type: GrantFiled: October 31, 2019Date of Patent: July 5, 2022Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Neil Rohit Mallinar, Rajendra G Ugrani, Ayush Gupta
-
Patent number: 11380303Abstract: A method for voice call analysis and classification includes intercepting a voice call session between an initiating device and a recipient device. Voice call data exchanged between the initiating device and the recipient device during the voice call session is transformed into a predefined data format. The transformed voice call data is analyzed to determine one or more attributes of the intercepted voice call. One or more features associated with the intercepted voice call session are identified based on the determined one or more attributes. The intercepted voice call is classified using the identified one or more features.Type: GrantFiled: January 22, 2021Date of Patent: July 5, 2022Assignee: AO Kaspersky LabInventors: Nikolay A. Churaev, Andrey I. Golubev
-
Patent number: 11373652Abstract: A method includes obtaining, by data processing hardware, a plurality of non-watermarked speech samples. Each non-watermarked speech does not include an audio watermark sample. The method includes, from each non-watermarked speech sample of the plurality of non-watermarked speech samples, generating one or more corresponding watermarked speech samples that each include at least one audio watermark. The method includes training, using the plurality of non-watermarked speech samples and corresponding watermarked speech samples, a model to determine whether a given audio data sample includes an audio watermark, and after training the model, transmitting the trained model to a user computing device.Type: GrantFiled: May 14, 2020Date of Patent: June 28, 2022Assignee: Google LLCInventors: Alexander H. Gruenstein, Taral Pradeep Joglekar, Vijayaditya Peddinti, Michiel A. u. Bacchiani
-
Patent number: 11373644Abstract: Techniques for implementing multiple wakeword detectors on a single device are described. A digital signal processor (DSP) of the device may implement a wakeword detection component to detect when captured speech includes a wakeword. A companion application installed on the device may implement a wakeword detection component trained using speech of a user of the device. In response to determining that the user spoke the wakeword, the companion application may send audio data representing the speech and data corresponding to the user to at least one server(s) for processing. Further, the device may receive captured speech and captured image data corresponding to the captured speech and determine a representation of a user in the captured image data. If the device determines the user is represented in the image data, audio data representing the speech may be sent to at least one server(s) for processing.Type: GrantFiled: July 21, 2020Date of Patent: June 28, 2022Assignee: Amazon Technologies, Inc.Inventors: Deepak Yavagal, Ajith Prabhakara, John Gray
-
Patent number: 11361756Abstract: In one aspect, a playback device includes at least one microphone configured to detect sound. The playback detects sound via the one or more microphones and determines whether (i) the detected sound includes a voice input, (ii) the detected sound excludes background speech, and (iii) the voice input includes a command keyword. In response to the determining, the playback device performs a playback function corresponding to the command keyword.Type: GrantFiled: June 12, 2019Date of Patent: June 14, 2022Assignee: Sonos, Inc.Inventors: Connor Smith, John Tolomei, Kurt Soto
-
Patent number: 11358063Abstract: Multimedia content to be played on a multimedia player device can be received. Whether the multimedia content contains audience-inappropriate content can be determined. Replacement content corresponding to the audience-inappropriate content can be generated. The generated replacement content can be caused to play on the multimedia player device in lieu of the audience-inappropriate content.Type: GrantFiled: March 6, 2020Date of Patent: June 14, 2022Assignee: International Business Machines CorporationInventors: Maryam Ashoori, Anamika Dayaram Singh, Priti Ashvin Shah
-
Patent number: 11355120Abstract: In some examples, a software agent executing on a server receives a communication comprising a first utterance from a customer and predicts, using an intent classifier, a first intent of the first utterance. Based on determining that the first intent is order-related, the software agent predicts, using a dish classifier, a cart delta vector based at least in part on the first utterance and modifies a cart associated with the customer based on the cart delta vector. The software agent predicts, using a dialog model, a first dialog response based at least in part on the first utterance and provides the first dialog response to the customer using a text-to-speech converter.Type: GrantFiled: October 1, 2021Date of Patent: June 7, 2022Assignee: ConverseNowAIInventors: Zubair Talib, Rahul Aggarwal, Vinay Kumar Shukla, Pranav Nirmal Mehra, Vrajesh Navinchandra Sejpal, Akshay Labh Kayastha, Yuganeshan A J, German Kurt Grin, Fernando Ezequiel Gonzalez, Julia Milanese, Matias Grinberg
-
Patent number: 11355136Abstract: A computer includes a processor and a memory storing instructions executable by the processor to identify an occupant in a passenger cabin of a vehicle, detect a position of a head of the occupant relative to the passenger cabin, apply a first filter to speech from the occupant based on the position of the head, generate a second filter, apply the second filter to the speech, adjust the second filter based on a difference between the speech of the occupant filtered by the second filter and a prestored profile of the occupant, and perform an operation using the speech filtered by the first filter and the second filter.Type: GrantFiled: January 11, 2021Date of Patent: June 7, 2022Assignee: Ford Global Technologies, LLCInventors: Scott Andrew Amman, Cynthia M. Neubecker, Pietro Buttolo, Joshua Wheeler, Brian Bennie
-
Patent number: 11354754Abstract: Certain aspects of the present disclosure provide techniques for selecting a response to a self-support query. One example method generally includes receiving an audio stream query including spoken content from a user recorded by a mobile device and determining a set of paralinguistic features from the spoken content. The method further includes estimating an emotional state of the user based on the set of paralinguistic features and identifying subject matter of the spoken content in the audio stream query. The method further includes determining two or more query responses corresponding to the subject matter to present to the user and transmitting at least one query response to the mobile device.Type: GrantFiled: February 12, 2020Date of Patent: June 7, 2022Assignee: INTUIT, INC.Inventors: Benjamin Indyk, Igor A. Podgorny, Raymond Chan
-
Patent number: 11348578Abstract: A method of controlling a battery-powered remote controller to decrease a duty cycle to allow continued operations despite the quantity of the battery is bad determines a drop in voltage of the battery in standby mode as voltage of the battery is being read. When receiving a command to activate a voice function, determining whether the drop in voltage in standby mode is greater than or equal to a preset value. If yes, the method then determines whether the drop in voltage falls in a preset range. If yes, the method regulates a duty cycle of the pulse signal activating the voice function, and activates the voice function as required. A remote controller and a non-transitory storage medium are also provided.Type: GrantFiled: July 22, 2019Date of Patent: May 31, 2022Assignee: Nanning FuLian FuGui Precision Industrial Co., Ltd.Inventors: Huang-Yu Chiang, Chung-Chih Yeh
-
Patent number: 11347802Abstract: Methods and systems for generation of a database schema compliant search query based on a natural language input are described herein. Natural language input may be received from a computing device. The natural language input may be associated with multiple search requests to a database. The natural language input may be parsed into a plurality of segments. The plurality of segments may be, for example, one or more words of a text string. At least one identifier for the plurality of segments may be associated with one or more confidence values. The natural language input may be converted into a single search query based on the confidence values and/or on a set of rules. The single search query may be initiated with respect to the database. The single search query may fetch content more efficiently than the multiple search requests.Type: GrantFiled: July 24, 2019Date of Patent: May 31, 2022Assignee: Citrix Systems, Inc.Inventors: Shiv Prasad Khillar, Saifulla Shaik, Nagendra Tank
-
Patent number: 11338211Abstract: An application execution unit 110 generates a game image. A message generation unit 112 generates a notification message. An image processing unit 118 generates a distribution image including the game image. A distribution processing unit 126 distributes the distribution image to one or more information processing terminals through a shared server. A setting unit 114 allows a user to set whether or not the notification message is included in the distribution image so as to be visually recognizable, and registers setting contents in a storage apparatus.Type: GrantFiled: November 22, 2018Date of Patent: May 24, 2022Assignee: SONY INTERACTIVE ENTERTAINMENT INC.Inventors: Masahiro Fujihara, Kiyobumi Matsunaga