Speech Controlled System Patents (Class 704/275)
  • Patent number: 11404057
    Abstract: An interactive voice adapter for adaptive voice routing may establish a real-time communication session between a voice communication client and a text communication client and the voice adapter may receive the audio stream and the text information. The voice adapter may obtain adapted natural language text corresponding to the natural language audio by selectively accessing a speech-to-text service based on a selection criteria. The voice adapter may obtain adapted natural language audio corresponding to the natural language text by selectively accessing a text-to-speech service based on the selection criteria. The voice adapter may communicate the adapted natural language text to the text communication client and the adapted natural language audio to the voice communication client.
    Type: Grant
    Filed: June 5, 2018
    Date of Patent: August 2, 2022
    Assignee: ACCENTURE GLOBAL SOLUTIONS LIMITED
    Inventors: Artur Liashenko, Augusto Gugliotta, Laetitia Cailleteau, Juris Stürainis, Ankur Banerjee
  • Patent number: 11405466
    Abstract: Systems and processes for operating an intelligent automated assistant are provided. In one example process, a first instance of a digital assistant operating on a first electronic device receives a natural-language speech input indicative of a user request. The first electronic device obtains a set of data corresponding to a second instance of the digital assistant on a second electronic device, and updates one or more settings of the first instance of the digital assistant based on the received set of data. The first instance of the digital assistant performs one or more tasks based on the updated one or more settings and provides an output indicative of whether the one or more tasks are performed.
    Type: Grant
    Filed: May 5, 2020
    Date of Patent: August 2, 2022
    Assignee: Apple Inc.
    Inventors: Benjamin S. Phipps, Gennaro Frazzingaro, Karl F. Schramm
  • Patent number: 11403065
    Abstract: Characteristics of a speaker are estimated using speech processing and machine learning. The characteristics of the speaker are used to automatically customize a user interface of a client device for the speaker.
    Type: Grant
    Filed: December 29, 2020
    Date of Patent: August 2, 2022
    Assignee: Google LLC
    Inventors: Eugene Weinstein, Ignacio L. Moreno
  • Patent number: 11397558
    Abstract: Embodiments of the present invention provide systems, methods, and computer storage media directed to optimizing engagement with a display during digital assistant-performed operations in response to a received command. The digital assistant generates an overlay having user interface elements that present information determined to be relevant to a user based on the received command and contextual data. The overlay is presented over the underlying operations performed on corresponding applications to mask the visible steps of the operations being performed. In this way, the digital assistant optimizes display resources that are typically rendered useless during the processing of digital assistant-performed operations.
    Type: Grant
    Filed: March 26, 2018
    Date of Patent: July 26, 2022
    Assignee: PELOTON INTERACTIVE, INC.
    Inventor: Rajat Mukherjee
  • Patent number: 11397507
    Abstract: Examples of systems and methods for voice-based navigation in one or more virtual areas that define respective persistent virtual communication contexts are described. These examples enable communicants to use voice commands to, for example, search for communication opportunities in the different virtual communication contexts, enter specific ones of the virtual communication contexts, and bring other communicants into specific ones of the virtual communication contexts. In this way, these examples allow communicants to exploit the communication opportunities that are available in virtual areas, even when hands-based or visual methods of interfacing with the virtual areas are not available.
    Type: Grant
    Filed: April 7, 2020
    Date of Patent: July 26, 2022
    Assignee: Sococo, Inc.
    Inventor: David Van Wie
  • Patent number: 11393133
    Abstract: A machine learning system is accessed. The machine learning system is used to translate content into a representative icon. The machine learning system is used to manipulate emoji. The machine learning system is used to process an image of an individual. The machine learning processing includes identifying a face of the individual. The machine learning processing includes classifying the face to determine facial content using a plurality of image classifiers. The classifying includes generating confidence values for a plurality of action units for the face. The facial content is translated into a representative icon. The translating the facial content includes summing the confidence values for the plurality of action units. The representative icon comprises an emoji. A set of emoji can be imported. The representative icon is selected from the set of emoji. The emoji selection is based on emotion content analysis of the face.
    Type: Grant
    Filed: March 19, 2020
    Date of Patent: July 19, 2022
    Assignee: Affectiva, Inc.
    Inventors: Rana el Kaliouby, May Amr Fouad, Abdelrahman N. Mahmoud, Seyedmohammad Mavadati, Daniel McDuff
  • Patent number: 11393472
    Abstract: An apparatus and method for executing a voice command in an electronic device. In an exemplary embodiment, a voice signal is detected and speech thereof is recognized. When the recognized speech contains a wakeup command, a voice command mode is activated, and a signal containing at least a portion of the detected voice signal is transmitted to a server. The server generates a control signal or a result signal corresponding to the voice command, and transmits the same to the electronic device. The device receives and processes the control or result signal, and awakens. Thereby, voice commands are executed without the need for the user to physically touch the electronic device.
    Type: Grant
    Filed: May 18, 2020
    Date of Patent: July 19, 2022
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Subhojit Chakladar, Sang-Hoon Lee, Hee-Woon Kim
  • Patent number: 11393477
    Abstract: Techniques for a natural language processing (NLP) system to implement more than one assistant are described. The NLP system may receive a natural language input from a device. The NLP system may also receive one or more signals representing one or more assistants to be implemented with respect to the natural language input. The NLP system may intelligently select an assistant to be invoked with respect to the natural language input. Once the assistant is selected, the NLP system may cause content, output to a user, to have characteristics specific to the assistant.
    Type: Grant
    Filed: September 24, 2019
    Date of Patent: July 19, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Munir Mahmood, Leopold Bushkin, Alexander Thomas Loeb, Michael Schwartz, Mohammed Arif, Rongzhou Shen, Vikram Kumar Gundeti, Shemyla Anwar, Yaser Khan, Edward Page Foyle, Bo Li
  • Patent number: 11394755
    Abstract: Two or more computing devices involved in a software based conference are determined. Each computing device of the two or more computing devices has an associated user. An input from a computing device of the two or more computing devices is received. A user action for a user associated with a computing device of the two or more computing devices from the input is determined. Whether the user completed the user action within a threshold is determined. The user is alerted of the user action.
    Type: Grant
    Filed: June 7, 2021
    Date of Patent: July 19, 2022
    Assignee: International Business Machines Corporation
    Inventors: Neerju Neerju, Mukesh Muraleedharan Nair, Jeremy R. Fox, Zachary A. Silverstein
  • Patent number: 11393262
    Abstract: In order to quickly respond to a question for vehicle information from a vehicle user at a remote location and offer the user a sense of relief, a vehicle management system of the present invention comprises a telematics communication unit which is mounted on a vehicle and acquires vehicle information and, a vehicle information server which receives from the telematics communication unit the vehicle information and a time at which the vehicle information is acquired, stores the received vehicle information and time, and transmits the stored vehicle information and time to a speech processing system as a response to a question when receiving the question about the vehicle information from the speech processing system.
    Type: Grant
    Filed: July 18, 2019
    Date of Patent: July 19, 2022
    Assignee: HONDA MOTOR CO., LTD.
    Inventor: Yuji Nishikawa
  • Patent number: 11394862
    Abstract: A voice input apparatus inputs voice and performs control to, in a case where a second voice instruction for operating the voice input apparatus is input in a fixed period after a first voice instruction for enabling operations by voice on the voice input apparatus is input, execute processing corresponding to the second voice instruction. The voice input apparatus, in a case where it is estimated that a predetermined user issued the second voice instruction, executes processing corresponding to the second voice instruction when the second voice instruction is input, even in a case where the first voice instruction is not input.
    Type: Grant
    Filed: February 1, 2021
    Date of Patent: July 19, 2022
    Assignee: CANON KABUSHIKI KAISHA
    Inventors: Daiyu Ueno, Maiki Okuwaki
  • Patent number: 11393469
    Abstract: A vehicle-mounted device operation system, includes: an operating part that is located in a vehicle cabin, and configured to be subjected to operation for manually operating a vehicle-mounted device; a sound collector that is located in the vehicle cabin, and configured to collect speech of an occupant; and a processor configured to determine whether a command to the vehicle-mounted device is included in a content of the speech of the occupant collected by the sound collector, activate the vehicle-mounted device according to the command, when the processor determines that the command to the vehicle-mounted device is included in the content of the speech of the occupant, and highlight the operating part for the vehicle-mounted device activated by the processor.
    Type: Grant
    Filed: December 2, 2019
    Date of Patent: July 19, 2022
    Assignee: TOYOTA JIDOSHA KABUSHIKI KAISHA
    Inventors: Yuki Kozono, Shu Nakajima, Takeshi Nawata
  • Patent number: 11393467
    Abstract: An electronic device includes a voice receiving unit configured to receive a voice input, a first communication unit configured to communicate with an external device having a voice recognition function, and a control unit. The control unit receives a notification indicating whether the external device is ready to recognize the voice input, via the first communication unit. In a case where the notification indicates that the external device is not ready to recognize the voice input, the control unit controls the external device to be ready to recognize the voice input via the first communication unit when a predetermined voice input including a phrase corresponding to the external device is received through the voice receiving unit.
    Type: Grant
    Filed: October 22, 2019
    Date of Patent: July 19, 2022
    Assignee: Canon Kabushiki Kaisha
    Inventor: Shunji Fujita
  • Patent number: 11382703
    Abstract: The invention relates to an operation-assistance system for guiding a medical auxiliary instrument (20), which can be inserted in an operating site (12) of a patient body (10) via an operation opening (11), and can be moved in a controlled manner. The system comprises a kinematic robot (3, 4, 5) that receives the medical auxiliary instrument (20) on the free end thereof by means of an auxiliary instrument holding device (6), and can be moved in a motor-controlled manner in order to guide the medical auxiliary instrument (20) in the operating site (12), by means of control signals (SS) generated by a control unit (CU). At least one voice control routine (SSR) is implemented in the control unit (CU), by means of which different voice commands (SB, SB1, SB2) are detected and evaluated and associated control signals (SS) are determined in accordance.
    Type: Grant
    Filed: January 29, 2018
    Date of Patent: July 12, 2022
    Assignee: Aktormed GmbH
    Inventor: Robert Geiger
  • Patent number: 11386886
    Abstract: An embodiment provides a method, including: obtaining, using a processor, contextual information relating to an information handling device; adjusting, using a processor, an automated speech recognition engine using the contextual information; receiving, at an audio receiver of the information handling device, user speech input; and providing, using a processor, recognized speech based on the user speech input received and the contextual information adjustment to the automated speech recognition engine. Other aspects are described and claimed.
    Type: Grant
    Filed: January 28, 2014
    Date of Patent: July 12, 2022
    Assignee: Lenovo (Singapore) Pte. Ltd.
    Inventors: Rod D. Waltermann, Mark Evan Cohen
  • Patent number: 11380331
    Abstract: In one example, a method includes method comprising: receiving audio data generated by a microphone of a current computing device; identifying, based on the audio data, one or more computing devices that each emitted a respective audio signal in response to speech reception being activated at the current computing device; and selecting either the current computing device or a particular computing device from the identified one or more computing devices to satisfy a spoken utterance determined based on the audio data.
    Type: Grant
    Filed: March 7, 2022
    Date of Patent: July 5, 2022
    Assignee: GOOGLE LLC
    Inventor: Jian Wei Leong
  • Patent number: 11380325
    Abstract: An agent device includes one or more agent controllers configured to provide a service including causing an output device to output a response of voice according to a voice of an occupant which is collected in a vehicle interior of a vehicle, a receiver configured to receive an input from the occupant, and a starting method setter configured to change or add a starting method of the agent controller on the basis of content received by the receiver.
    Type: Grant
    Filed: March 12, 2020
    Date of Patent: July 5, 2022
    Assignee: HONDA MOTOR CO., LTD.
    Inventors: Masaki Kurihara, Shinichi Kikuchi, Shinya Yasuhara, Yusuke Oi, Hiroshi Honda
  • Patent number: 11381531
    Abstract: Systems and methods for an interactive communications system capable of generating a response to conversational input are provided. The interactive communications system analyzes the conversational input to determine relevant topics of discussion. The interactive communications system further determines which of the relevant topics of discussion can potentially lead to an unwanted end to a conversation. The interactive communications system redirects the conversation by providing responses to the conversational input that are intended simply to avoid the unwanted end to the conversation.
    Type: Grant
    Filed: March 23, 2021
    Date of Patent: July 5, 2022
    Assignee: Disney Enterprises, Inc.
    Inventors: Raymond Scanlon, Douglas Fidaleo
  • Patent number: 11375287
    Abstract: Systems and methods for providing a viewer with relevant commentary for a live video. For example, a media guidance application may receive, during playback of a live video, a request for clarification regarding an aspect (e.g., a play, a score, a player, a strategy, etc.) of the live video. In response to receiving the request, the media guidance application may identify the aspect and identify videos generated by other viewers explaining the aspect. The media guidance application may further select one of the videos based on a preference of the viewer, and cause a user device to play back the selected video to the viewer.
    Type: Grant
    Filed: February 12, 2020
    Date of Patent: June 28, 2022
    Assignee: Rovi Guides, Inc.
    Inventors: Mario Miguel Sanchez, Dylan Matthew Wondra, Jean Michelle Somlo, Michaela Schlocker Logan, William L. Thomas
  • Patent number: 11373049
    Abstract: Training and/or using a multilingual classification neural network model to perform a natural language processing classification task, where the model reuses an encoder portion of a multilingual neural machine translation model. In a variety of implementations, a client device can generate a natural language data stream from a spoken input from a user. The natural language data stream can be applied as input to an encoder portion of the multilingual classification model. The output generated by the encoder portion can be applied as input to a classifier portion of the multilingual classification model. The classifier portion can generate a predicted classification label of the natural language data stream. In many implementations, an output can be generated based on the predicted classification label, and a client device can present the output.
    Type: Grant
    Filed: August 26, 2019
    Date of Patent: June 28, 2022
    Assignee: GOOGLE LLC
    Inventors: Melvin Jose Johnson Premkumar, Akiko Eriguchi, Orhan Firat
  • Patent number: 11367436
    Abstract: In one example of the disclosure, a communication apparatus includes a first microphone. The communication apparatus is to be wirelessly and contemporaneously connected to a set of microphones including the first microphone. The communication apparatus is to receive microphone data from each microphone of the set of microphones, wherein the microphone data is indicative of a user spoken phrase captured by the set of microphones. The communication apparatus is to establish based on the received microphone data a selected microphone from among the set of microphones.
    Type: Grant
    Filed: September 27, 2016
    Date of Patent: June 21, 2022
    Assignee: Hewlett-Packard Development Company, L.P.
    Inventors: David H. Hanes, John Michael Main, Jon R. Dory
  • Patent number: 11367435
    Abstract: An interface device and method of use, comprising audio and image inputs; a processor for determining topics of interest, and receiving information of interest to the user from a remote resource; an audio-visual output for presenting an anthropomorphic object conveying the received information, having a selectively defined and adaptively alterable mood; an external communication device adapted to remotely communicate at least a voice conversation with a human user of the personal interface device. Also provided is a system and method adapted to receive logic for, synthesize, and engage in conversation dependent on received conversational logic and a personality.
    Type: Grant
    Filed: April 20, 2017
    Date of Patent: June 21, 2022
    Assignee: Poltorak Technologies LLC
    Inventor: Alexander Poltorak
  • Patent number: 11368415
    Abstract: The present disclosure relates to an intelligent, adaptable, and trainable bot that orchestrates automation, event data integration, and application programming interfaces across multiple applications. The technology may include receiving event data describing events from distributed software applications and processing the event data describing the events to generate notifications, the event data being received based on execution of a software recipe. The bot may transmit the notifications for display to a user using a conversational interface and receive a command from the user via the conversational interface, the command including a requested operation respective to at least one delivered notification. In response to receiving the command, the method may generate recommendations for additional commands respective to the at least one notification based on metadata associated with an event corresponding to the at least one notification.
    Type: Grant
    Filed: November 18, 2020
    Date of Patent: June 21, 2022
    Assignee: Workato, Inc.
    Inventors: Gautham Viswanathan, Harish Shetty, Bhaskar Roy, Konstantin Tikhonov, Alexey Pikin
  • Patent number: 11367432
    Abstract: A method for generating final transcriptions representing numerical sequences of utterances in a written domain includes receiving audio data for an utterance containing a numeric sequence, and decoding, using a sequence-to-sequence speech recognition model, the audio data for the utterance to generate, as output from the sequence-to-sequence speech recognition model, an intermediate transcription of the utterance. The method also includes processing, using a neural corrector/denormer, the intermediate transcription to generate a final transcription that represents the numeric sequence of the utterance in a written domain. The neural corrector/denormer is trained on a set of training samples, where each training sample includes a speech recognition hypothesis for a training utterance and a ground-truth transcription of the training utterance. The ground-truth transcription of the training utterance is in the written domain.
    Type: Grant
    Filed: March 26, 2020
    Date of Patent: June 21, 2022
    Assignee: Google LLC
    Inventors: Charles Caleb Peyser, Hao Zhang, Tara N. Sainath, Zelin Wu
  • Patent number: 11363128
    Abstract: A method on a mobile device for processing an audio input is described. A trigger for the audio input is received. At least one parameter is determined for an audio processor based on at least one input characteristic for the audio input. The audio input is routed to the audio processor with the at least one parameter.
    Type: Grant
    Filed: December 4, 2019
    Date of Patent: June 14, 2022
    Assignee: GOOGLE TECHNOLOGY HOLDINGS LLC
    Inventors: Kazuhiro Ondo, Michael P. Labowicz, Hideki Yoshino
  • Patent number: 11361755
    Abstract: A computer-implemented conversational agent engages in a natural language conversation with a user, interpreting the natural language conversation by parsing and tokenizing utterances in the natural language conversation. Based on interpreting, a set of utterances in the natural language conversation to be recorded as a macro is determined. The macro is stored in a database with an associated macro identifier. Replaying of the macro executes a function specified in the set of utterances.
    Type: Grant
    Filed: December 4, 2019
    Date of Patent: June 14, 2022
    Assignee: International Business Machines Corporation
    Inventors: Martin Hirzel, Louis Mandel, Avraham E. Shinnar, Jerome Simeon, Mandana Vaziri
  • Patent number: 11355104
    Abstract: Systems and methods for determining that artificial commands, in excess of a threshold value, are detected by multiple voice activated electronic devices is described herein. In some embodiments, numerous voice activated electronic devices may send audio data representing a phrase to a backend system at a substantially same time. Text data representing the phrase, and counts for instances of that text data, may be generated. If the number of counts exceeds a predefined threshold, the backend system may cause any remaining response generation functionality that particular command that is in excess of the predefined threshold to be stopped, and those devices returned to a sleep state. In some embodiments, a sound profile unique to the phrase that caused the excess of the predefined threshold may be generated such that future instances of the same phrase may be recognized prior to text data being generated, conserving the backend system's resources.
    Type: Grant
    Filed: October 18, 2019
    Date of Patent: June 7, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Colin Wills Wightman, Naresh Narayanan, Daniel Robert Rashid
  • Patent number: 11356727
    Abstract: Circuit integrated with voice wake-up function, television and voice control method. The circuit integrated with voice wake-up function comprises television mainboard and far-field voice module. The far-field voice module comprises: microphone unit for collecting voice simulation signal; analog-digital conversion unit for converting voice simulation signal to digital signal; main control unit for processing converted digital signal and outputting same to television mainboard; and voice wake-up unit for processing voice simulation signal and outputting wake-up signal to television mainboard. Microphone unit is connected to analog-digital conversion unit and voice recognition unit, respectively; analog-digital conversion unit is connected to main control unit and television mainboard, respectively; and television mainboard is connected to main control unit and voice wake-up unit, respectively.
    Type: Grant
    Filed: May 28, 2019
    Date of Patent: June 7, 2022
    Assignee: KONKA GROUP CO., LTD.
    Inventors: Qin Wang, Chunmeng Ye, Minqiang Lin
  • Patent number: 11356730
    Abstract: Devices and methods for routing content are provided herein. In some embodiments, a method for routing content include receiving audio data representing a command from a first electronic device, determining content that is associated with the command, sending responsive audio data to the first electronic device, and sending instructions to the second electronic device to output the content associated with the command. In some embodiments, a method for routing contents includes determining a state of the second electronic device and sending instructions to output the content to a selected one of the first and second electronic devices based on the state of the second electronic device.
    Type: Grant
    Filed: June 25, 2021
    Date of Patent: June 7, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Soniya Jobanputra, Marcello Typrin, Mallory Trudell
  • Patent number: 11355112
    Abstract: A system may include first and second speech-processing systems with corresponding first and second wakewords. An utterance may contain two or more wakewords. The system determines which speech-processing system to use to perform further audio processing and to determine a response to the utterance.
    Type: Grant
    Filed: March 3, 2020
    Date of Patent: June 7, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Kunal Dewan Pahwa, Patrick Sheehy
  • Patent number: 11348161
    Abstract: Technology that facilitates prediction of order-fulfillment abeyance are disclosed. Exemplary implementations may: obtain order details of an inchoate order from an orderer; predict that the inchoate order, upon submission, would be have its fulfillment held in abeyance; and in response to the abeyance prediction, disable submission of the inchoate order with the obtained order details.
    Type: Grant
    Filed: October 22, 2019
    Date of Patent: May 31, 2022
    Assignee: Dell Products L.P.
    Inventor: Venkata Chandra Sekar Rao
  • Patent number: 11348597
    Abstract: A network validation system is described which may perform operations such as generating, analyzing, verifying, correcting, recommending, and deploying language, symbols, etc., such as domain specific language, configured to allow users to express their intent on the configuration and operation of a network, such as a cloud-based network. The network validation system may provide domain specific language that includes rules, statements, symbols, data, etc., configured to convey the intent of users on the configuration and operation of networks for purposes such as configuring and/or validating communication paths, testing or setting associated network object configurations, and may be employed to report violations in such configurations relative to user intent of the one or more users. The network validation system may also be employed to monitor such domain specific language and generate telemetry signaling, for example, that a rule has or has not been violated, actions a user may take, etc.
    Type: Grant
    Filed: November 21, 2019
    Date of Patent: May 31, 2022
    Assignee: Oracle International Corporation
    Inventors: Peter J. Hill, Jagwinder Brar, Yogesh Sreenivasan
  • Patent number: 11350185
    Abstract: A device configured to receive a video request that includes animation instructions for a video scene. The animation instructions identify one or more animations associated with the video scene. The device is further configured to identify a first animation from the one or more animations associated with the video scene and to determine that the first animation is configured for text-to-audio. The device is further configured to identify text associated with the first animation and to convert the text associated with the first animation into an audio sample. The device is further configured to associate the audio sample with an animation identifier for the first animation in an audio sample buffer. The device is further configured to associate a timestamp with a source scene identifier for the video scene and the animation identifier for the first animation in the video timing map.
    Type: Grant
    Filed: December 13, 2019
    Date of Patent: May 31, 2022
    Assignee: Bank of America Corporation
    Inventor: Shankar Sangoli
  • Patent number: 11348590
    Abstract: The present disclosure provides methods and devices for registering a voiceprint and authenticating a voiceprint. The method for registering a voiceprint includes performing a frame alignment operation on a registration character string inputted by a user in voice to extract first acoustic features of each first character constituting the registration character string; calculating a first posterior probability of the first acoustic features of each first character in a global Gaussian Mixture Model (GMM) model to perform a Baum-Welch (BW) statistic; extracting first vector features of each first character through a preset vector feature extractor configured for multi-character; and stitching the first vector features of each first character sequentially, to obtain a registration voiceprint model of the user.
    Type: Grant
    Filed: August 24, 2016
    Date of Patent: May 31, 2022
    Assignee: BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD.
    Inventors: Chao Li, Bengu Wu
  • Patent number: 11347782
    Abstract: Embodiments of the present disclosure disclose an Internet text mining-based method and apparatus for judging the validity of a point of interest. An implementation of the method includes: determining a search word set for indicating a to-be-detected point of interest; performing a search by using a determined search word as a search keyword, to obtain a description information set for describing the to-be-detected point of interest; and inputting a name of the to-be-detected point of interest and description information in the description information set into a pre-established validity discriminant model, to obtain a status label for indicating validity of the to-be-detected point of interest. This implementation enables timely discovery of invalid POI information. Thus, more accurate information are provided for users, user needs are met, and user experience is improved.
    Type: Grant
    Filed: July 10, 2019
    Date of Patent: May 31, 2022
    Assignee: Baidu Online Network Technology (Beijing) Co., Ltd.
    Inventors: Jizhou Huang, Yaming Sun
  • Patent number: 11341962
    Abstract: An interface device and method of use, comprising audio and image inputs; a processor for determining topics of interest, and receiving information of interest to the user from a remote resource; an audio-visual output for presenting an anthropomorphic object conveying the received information, having a selectively defined and adaptively alterable mood; an external communication device adapted to remotely communicate at least a voice conversation with a human user of the personal interface device. Also provided is a system and method adapted to receive logic for, synthesize, and engage in conversation dependent on received conversational logic and a personality.
    Type: Grant
    Filed: April 20, 2017
    Date of Patent: May 24, 2022
    Assignee: Poltorak Technologies LLC
    Inventor: Alexander Poltorak
  • Patent number: 11341967
    Abstract: A method for a voice interaction is provided according to embodiments of the disclosure, the method belonging to the field of smart devices. The method may include: receiving an external input; checking a current time in response to the external input; calling a voice program; and raising a question according to the current time and the called voice program, and playing the called voice program. The method and apparatus for a voice interaction may perform more immersive interaction with a user, thereby improving the user experience.
    Type: Grant
    Filed: March 6, 2020
    Date of Patent: May 24, 2022
    Assignee: BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD.
    Inventors: Dongli Liu, Xiaocheng Dai, Jian Peng
  • Patent number: 11335338
    Abstract: A speech recognition system comprises: an input, for receiving an input signal from at least one microphone; a first buffer, for storing the input signal; a noise reduction block, for receiving the input signal and generating a noise reduced input signal; a speech recognition engine, for receiving either the input signal output from the first buffer or the noise reduced input signal from the noise reduction block; and a selection circuit for directing either the input signal output from the first buffer or the noise reduced input signal from the noise reduction block to the speech recognition engine.
    Type: Grant
    Filed: September 27, 2019
    Date of Patent: May 17, 2022
    Assignee: Cirrus Logic, Inc.
    Inventors: John Paul Lesso, Robert James Hatfield
  • Patent number: 11335334
    Abstract: There is provided an information processing device and an information processing method that enable the intention of a speech of a user to be estimated more accurately. The information processing device includes: a detection unit configured to detect a breakpoint of a speech of a user on the basis of a result of recognition that is to be obtained during the speech of the user; and an estimation unit configured to estimate an intention of the speech of the user on the basis of a result of semantic analysis of a divided speech sentence obtained by dividing a speech sentence at the detected breakpoint of the speech. The present technology can be applied, for example, to a speech dialogue system.
    Type: Grant
    Filed: October 19, 2018
    Date of Patent: May 17, 2022
    Assignee: SONY CORPORATION
    Inventors: Hiro Iwase, Shinichi Kawano, Yuhei Taki, Kunihito Sawai
  • Patent number: 11334383
    Abstract: A method, apparatus, computer system, and computer program product for processing requests. Overlapping requests are received by a computer system from users using a shared client device. The overlapping requests are requests for which responses have not been sent to the shared client device. Priorities for the overlapping requests are determined by the computer system based on a set of priority considerations for the overlapping requests and using request information derived from the overlapping requests in which the request information includes at least one of an emotional state or an urgency. The overlapping requests are processed by the computer system based on the priorities determined for the overlapping requests.
    Type: Grant
    Filed: April 24, 2019
    Date of Patent: May 17, 2022
    Assignee: International Business Machines Corporation
    Inventors: Sarbajit K. Rakshit, Martin G. Keen, James E. Bostick, John M. Ganci, Jr.
  • Patent number: 11330521
    Abstract: The application discloses an intelligent device wake-up method, an intelligent device and a computer-readable storage medium. A specific implementation is: obtaining wake-up voice sent by a user; when determining that a current wake-up mode is a group wake-up mode, recognizing volume information corresponding to obtained wake-up voice, and determining wake-up delay time according to the volume information; when determining that the wake-up delay time is different from wake-up delay time corresponding to other intelligent device in the group, and no response information sent by other intelligent device in the group is obtained within the wake-up delay time, performing a wake-up process and playing response information when the wake-up delay time is over. Thus, it can ensure that only one intelligent device responds to the wake-up voice sent out by the user at any time, thereby avoiding the situation that multiple intelligent devices respond at the same time.
    Type: Grant
    Filed: April 2, 2020
    Date of Patent: May 10, 2022
    Assignees: Baidu Online Network Technology (Beijing) Co., Ltd., Shanghai Xiaodu Technology Co., Ltd.
    Inventor: Dehong Yu
  • Patent number: 11327642
    Abstract: A system and method for organizing and representing in a single display, using temporal and locational relationships, multiple selected pieces of information that may exist in different embodiments and that may be related to one or more past, present or future events.
    Type: Grant
    Filed: April 29, 2019
    Date of Patent: May 10, 2022
    Assignee: Priority 5 Holdings Inc.
    Inventors: Charles Q. Miller, Allen D. Bierbaum, Aron L. Bierbaum
  • Patent number: 11328725
    Abstract: An apparatus for recognizing a voice in a vehicle includes an input device for receiving a voice command and a controller. The controller: determines whether a number of the voice commands is at least two; determines whether the voice commands are able to be executed based on a preset priority, when the number of the voice commands is at least two; calculates an execution sequence of the voice commands based on the determination result; and allows operations corresponding to the voice commands to be executed based on the calculated execution sequence of the voice commands. When the plurality of voice commands is input, the operations corresponding to the voice commands are executed in an optimized manner.
    Type: Grant
    Filed: August 12, 2020
    Date of Patent: May 10, 2022
    Assignees: HYUNDAI MOTOR COMPANY, KIA MOTORS CORPORATION
    Inventor: Seong Soo Yae
  • Patent number: 11327646
    Abstract: A method for modifying visual aspects of a keyboard in response to a user typing on the keyboard. The method includes one or more computer processors receiving a first character input to an input device. The method further includes determining a plurality of words that begin with the first received character. The method further includes ranking the determined plurality of words. The method further includes selecting a word from among the ranked plurality of words based on a first set of criteria. The method further includes determining a sequence of one or more characters after the received first character that correspond to the selected word. The method further includes modifying one or more respective characteristics of input elements of the input device that correspond to the sequence of characters of the selected word.
    Type: Grant
    Filed: September 27, 2019
    Date of Patent: May 10, 2022
    Assignee: International Business Machines Corporation
    Inventors: Richard V. Tran, Heidi Lagares-Greenblatt, Kevin David Hite
  • Patent number: 11322153
    Abstract: A conversation interaction method and apparatus, and a computer-readable storage medium are provided. The method includes: converting a speech to be recognized into a first text; inputting the first text into a semantic analysis model, to obtain intention information and slot information of the first text; and inputting the intention information and the slot information of the first text into a conversation state machine, to obtain interaction information corresponding to the first text. By using a semantic analysis model, intention information and slot information of a first text are obtained directly from the first text. The process in the existing technology, where a semantic analysis model needs to be used immediately after a language model, is avoided, thereby shortening processing time and making it possible to respond faster to a user. Further, by using the above scheme, calculation complexity and the cost of a whole system are reduced.
    Type: Grant
    Filed: February 21, 2020
    Date of Patent: May 3, 2022
    Assignees: BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD., SHANGHAI XIAODU TECHNOLOGY CO. LTD.
    Inventors: Yunfei Xu, Guoguo Chen
  • Patent number: 11322152
    Abstract: Power consumption for a computing device may be managed by one or more keywords. For example, if an audio input obtained by the computing device includes a keyword, a network interface module and/or an application processing module of the computing device may be activated. The audio input may then be transmitted via the network interface module to a remote computing device, such as a speech recognition server. Alternately, the computing device may be provided with a speech recognition engine configured to process the audio input for on-device speech recognition.
    Type: Grant
    Filed: June 17, 2019
    Date of Patent: May 3, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Kenneth John Basye, Hugh Evan Secker-Walker, Tony David, Reinhard Kneser, Jeffrey Penrod Adams, Stan Weidner Salvador, Mahesh Krishnamoorthy
  • Patent number: 11321116
    Abstract: The electronic device with one or more processors and memory receives an input of a user. The electronic device, in accordance with the input, identifies a respective task type from a plurality of predefined task types associated with a plurality of third party service providers. The respective task type is associated with at least one third party service provider for which the user is authorized and at least one third party service provider for which the user is not authorized. In response to identifying the respective task type, the electronic device sends a request to perform at least a portion of a task to a third party service provider of the plurality of third party service providers that is associated with the respective task type.
    Type: Grant
    Filed: June 22, 2021
    Date of Patent: May 3, 2022
    Assignee: Apple Inc.
    Inventors: Thomas R. Gruber, Christopher D. Brigham, Adam J. Cheyer, Daniel Keen, Kenneth Kocienda
  • Patent number: 11321386
    Abstract: Systems and methods are described herein for automatically changing the priority of a media asset using a continuous listening device. The system may receive an audio clip of a conversation a user, and then determine whether that conversation relates to any of the programs recorded or scheduled to be recorded on a storage device associated with the user. In response to determining that the media asset does relate to one of the programs recorded or scheduled to be recorded on the storage device, a user profile may be consulted to determine past instances of the user discussing the media asset, and, if a measure of the total number of instances the user discussed the media asset meets a threshold measure, the priority of the media asset may be updated.
    Type: Grant
    Filed: May 1, 2019
    Date of Patent: May 3, 2022
    Assignee: Rovi Guides, Inc.
    Inventors: Michael McCarty, Glen E. Roe
  • Patent number: 11314481
    Abstract: Systems and methods for enabling voice-based interactions with electronic devices can include a data processing system maintaining a plurality of device action data sets and a respective identifier for each device action data set. The data processing system can receive, from an electronic device, an audio signal representing a voice query and an identifier. The data processing system can identify, using the identifier, a device action data set. The data processing system can identify a device action from device action data set based on content of the audio signal. The data processing system can then identify, from the device action dataset, a command associated with the device action and send the command to the for execution device for execution.
    Type: Grant
    Filed: May 7, 2018
    Date of Patent: April 26, 2022
    Assignee: GOOGLE LLC
    Inventors: Bo Wang, Venkat Kotla, Chad Yoshikawa, Chris Ramsdale, Pravir Gupta, Alfonso Gomez-Jordana, Kevin Yeun, Jae Won Seo, Lantian Zheng, Sang Soo Sung
  • Patent number: 11315565
    Abstract: A multi-party conversational agent includes a computing platform having a hardware processor and a memory storing a software code. The hardware processor is configured to execute the software code to identify a first predetermined expression for conversing with a group of people, and to have a group conversation, using the first predetermined expression, with at least some members of the group. The hardware processor is configured to further execute the software code to identify, while having the group conversation, a second predetermined expression for having a dialogue with at least one member of the group, and to interrupt the group conversation to have the dialogue, using the second predetermined expression, with the at least one member of the group.
    Type: Grant
    Filed: April 3, 2020
    Date of Patent: April 26, 2022
    Assignee: Disney Enterprises, Inc.
    Inventors: James R. Kennedy, Victor R. Martinez Palacios