Patents Examined by Abul K. Azad
  • Patent number: 11487503
    Abstract: The present invention discloses an interactive control method executed during instant video communication between a user and one or more other users. The method comprises: monitoring video information collected by a camera during the instant video communication between the user and the one or more other users; performing recognition on the video information after acquiring the video information, to acquire user behavior data inputted by the user in a preset manner; determining whether the user behavior data comprises preset trigger information; when it is determined that the user behavior data comprises the preset trigger information, further determining whether the user behavior data comprises a preset gesture action; and when it is determined that the user behavior data comprises the preset gesture action, determining an operation instruction corresponding to the preset gesture action in a preset operation instruction set, and performing an event corresponding to the operation instruction.
    Type: Grant
    Filed: June 11, 2020
    Date of Patent: November 1, 2022
    Assignee: BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD.
    Inventor: Feng Li
  • Patent number: 11475885
    Abstract: Methods for mapping intents to utterances using a three-tiered system is provided. Methods may include receiving a plurality of predetermined action-topic pairs and a plurality of predetermined intents. Methods may include mapping the plurality of predetermined action-topic pairs to the plurality of predetermined intents via a one-to-many mapping. Methods may include receiving a linguistic utterance at a first tier of the three-tiered system. Methods may include translating the linguistic utterance at the first tier of the three-tiered system. Methods may include mapping the textual representation to one or more action-topic pairs included in the plurality of action-topic pairs. The mapping may be executed at the second tier of the three-tiered system. Methods may include identifying one or more intents that correlate to the textual representation. The identifying may be executed at the third tier. The identifying may be based on the mapping between the action-topics pairs and the predetermined intents.
    Type: Grant
    Filed: May 26, 2020
    Date of Patent: October 18, 2022
    Assignee: Bank of America Corporation
    Inventors: Isaac Persing, Emad Noorizadeh
  • Patent number: 11475890
    Abstract: Training and/or utilizing a single neural network model to generate, at each of a plurality of assistant turns of a dialog session between a user and an automated assistant, a corresponding automated assistant natural language response and/or a corresponding automated assistant action. For example, at a given assistant turn of a dialog session, both a corresponding natural language response and a corresponding action can be generated jointly and based directly on output generated using the single neural network model. The corresponding response and/or corresponding action can be generated based on processing, using the neural network model, dialog history and a plurality of discrete resources. For example, the neural network model can be used to generate a response and/or action on a token-by-token basis.
    Type: Grant
    Filed: June 24, 2020
    Date of Patent: October 18, 2022
    Assignee: GOOGLE LLC
    Inventors: Arvind Neelakantan, Daniel Duckworth, Ben Goodrich, Vishaal Prasad, Chinnadhurai Sankar, Semih Yavuz
  • Patent number: 11465290
    Abstract: A robot capable of conversation with another robot and a method of controlling the same are disclosed. The robot includes a main body having a first region corresponding to a human face and rotatable in left-right direction directions, a signal generator generating a first data signal to be transmitted to a listener robot and a first robot voice signal corresponding to the first data signal, a communication unit transmitting the first data signal to an external server, a speaker outputting the first robot voice signal, and a controller controlling a rotation direction of the main body such that the first region is directed toward the listener robot at a time point adjacent to a transmission time of the first data signal and controlling the speaker to output the first robot voice signal after the rotation direction of the robot is controlled, wherein the listener robot receives the first data signal transmitted from the external server and is controlled to operate based on the first data signal.
    Type: Grant
    Filed: August 29, 2019
    Date of Patent: October 11, 2022
    Assignee: LG ELECTRONICS INC.
    Inventors: Ji Yoon Park, Jungkwan Son
  • Patent number: 11461560
    Abstract: A device includes a memory adapted to store a list in a file or database comprising a plurality of vocabulary words in a first language and, for each vocabulary word, a corresponding word in a second language, a display device, and a processor. The processor is adapted to receive a plurality of words in the first language, select one or more words among the plurality of words, based on one or more predetermined criteria, translate, match or equate the one or more selected words from the first language to words of the second language, and cause the display device to display the plurality of words, wherein one or more first words that are in the plurality of words and are not among the one or more selected words which are displayed in the first language and one or more second words that are in the plurality of words and are among the one or more selected words are displayed in the second language.
    Type: Grant
    Filed: September 14, 2018
    Date of Patent: October 4, 2022
    Inventor: Robert F. Deming, Jr.
  • Patent number: 11430458
    Abstract: A Unified Speech and Audio Codec (USAC) that may process a window sequence based on mode switching is provided. The USAC may perform encoding or decoding by overlapping between frames based on a folding point when mode switching occurs. The USAC may process different window sequences for each situation to perform encoding or decoding, and thereby may improve a coding efficiency.
    Type: Grant
    Filed: March 31, 2020
    Date of Patent: August 30, 2022
    Assignees: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE, KAWNGWOON UNIVERSITY INDUSTRY-ACADEMIC COLLABORATION FOUNDATION
    Inventors: Seungkwon Beack, Tae Jin Lee, Min Je Kim, Kyeongok Kang, Dae Young Jang, Jeongil Seo, Jin Woo Hong, Chieteuk Ahn, Ho Chong Park, Young-cheol Park
  • Patent number: 11417354
    Abstract: In accordance with an example embodiment of the present invention, disclosed is a method and an apparatus for voice activity detection (VAD). The VAD comprises creating a signal indicative of a primary VAD decision and determining hangover addition. The determination on hangover addition is made in dependence of a short term activity measure and/or a long term activity measure. A signal indicative of a final VAD decision is then created.
    Type: Grant
    Filed: February 18, 2020
    Date of Patent: August 16, 2022
    Assignee: TELEFONAKTIEBOLAGET LM ERICSSON (PUBL)
    Inventor: Martin Sehlstedt
  • Patent number: 11417331
    Abstract: The present disclosure provides a method for controlling a terminal, including the following operations: obtaining recognition results corresponding to control signals after receiving the control signals, and determining whether control instructions corresponding to the recognition results conflict, each control signal comprising at least one of a voice signal or a gesture signal; determining a credibility of each control instruction in response to a determination that there exists conflict among control instructions; and sending the control instruction with highest credibility to a control terminal. The present disclosure further provides a device for controlling a terminal and a computer readable storage medium. When control instructions are received and there exists conflict among control instructions, the control instruction with the highest credibility is sent to the control terminal after the credibility of each control instructions is determined, thereby avoiding settings from conflict.
    Type: Grant
    Filed: March 6, 2020
    Date of Patent: August 16, 2022
    Assignees: GD MIDEA AIR-CONDITIONING EQUIPMENT CO., LTD., MIDEA GROUP CO., LTD.
    Inventors: Zhicai Ou, Weiying Li
  • Patent number: 11410651
    Abstract: Network source identification via audio signals is provided. A system receives data packets with an input audio signal from a client device. The system identifies a request. The system selects a digital component provided by a digital component provider device. The system identifies audio chimes stored in memory of the client device. The system matches, based on a policy, an identifier of the digital component provider device to a first audio chime stored in the memory of the client device. The system determines, based on a characteristic of the first audio chime, a configuration to combine the digital component with the first audio chime. The system generates an action data structure with the digital component, an indication of the first audio chime, and the configuration. The system transmits the action data structure to the client device to cause the client device to generate an output audio signal.
    Type: Grant
    Filed: April 30, 2020
    Date of Patent: August 9, 2022
    Assignee: GOOGLE LLC
    Inventor: Peter Kraker
  • Patent number: 11410643
    Abstract: A computer-implemented method of responding to a conversational event. The method comprises enacting, by a conversational computing interface, an initial computer-executable plan based on a conversational event received by the conversational computing interface, wherein the initial computer-executable plan is configured to output an initial value based on the conversational event. The method further comprises selecting, by the conversational computing interface, an extended computer-executable plan based on determining that the initial value is insufficient for generating an extended description responsive to the conversational event. The method further comprises enacting, by the conversational computing interface, the extended computer-executable plan to output additional information beyond what the initial computer-executable plan is configured to output, the additional information sufficient for generating the extended description responsive to the conversational event.
    Type: Grant
    Filed: October 18, 2019
    Date of Patent: August 9, 2022
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Jacob Daniel Andreas, Jayant Sivarama Krishnamurthy, Alan Xinyu Guo, Andrei Vorobev, John Philip Bufe, III, Jesse Daniel Eskes Rusak, Yuchen Zhang
  • Patent number: 11410664
    Abstract: An apparatus for estimating an inter-channel time difference between a first channel signal and a second channel signal, includes: a calculator for calculating a cross-correlation spectrum for a time block from the first channel signal in the time block and the second channel signal in the time block; a spectral characteristic estimator for estimating a characteristic of a spectrum of the first channel signal or the second channel signal for the time block; a smoothing filter for smoothing the cross-correlation spectrum over time using the spectral characteristic to obtain a smoothed cross-correlation spectrum; and a processor for processing the smoothed cross-correlation spectrum to obtain the inter-channel time difference.
    Type: Grant
    Filed: February 19, 2020
    Date of Patent: August 9, 2022
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Stefan Bayer, Eleni Fotopoulou, Markus Multrus, Guillaume Fuchs, Emmanuel Ravelli, Markus Schnell, Stefan Doehla, Wolfgang Jaegers, Martin Dietz, Goran Markovic
  • Patent number: 11404062
    Abstract: Provided is a voice assistance system with proactive routines that couples a remote server and respective user voice interactive devices to deliver a complete experience to the end user of the device. The user devices can be managed by groups and/or associated entities who manage voice services for their users. For example, the entities can provide pre-configured voice routines that perform actions on behalf of their users. The voice assistance system can also allow users to customize these routines to improve day to day operation. In addition, external services and/or providers can be linked to the system and allowed to define routines that have external system dependencies. Avoiding and managing conflicts in this environment becomes quite challenging. Some approaches use execution queues and priority, others invoke time slices and limitations on assignment of routines to time slices to resolve these issues, among other examples.
    Type: Grant
    Filed: July 26, 2021
    Date of Patent: August 2, 2022
    Assignee: LifePod Solutions, Inc.
    Inventors: Nirmalya K. De, Alan R. Bugos, Dale M. Smith, Stuart R. Patterson, Jonathan E. Gordon
  • Patent number: 11393470
    Abstract: Disclosed are a method for providing a speech recognition service and a speech recognition apparatus, which may perform speech recognition by executing an artificial intelligence (AI) algorithm and/or a machine learning algorithm, which are mounted therein, so that a speech recognition apparatus and a server may communicate with each other in a 5G communication environment. The method and the speech recognition apparatus provide a response based on a user's intention analysis with respect to the ambiguous utterance of the user.
    Type: Grant
    Filed: January 10, 2020
    Date of Patent: July 19, 2022
    Assignee: LG ELECTRONICS INC.
    Inventor: Da Hae Kim
  • Patent number: 11393473
    Abstract: Described herein is a system for device arbitration using characteristics of audio data captured by multiple devices. The system determines a feature vector corresponding to each device that captured an audio input, where the feature vector includes the audio energy levels, spectral data corresponding to the audio data and centroid data corresponding to the audio data. The feature vectors are processed using a trained component to select a device for further processing.
    Type: Grant
    Filed: May 18, 2020
    Date of Patent: July 19, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Alexandra Fenster, Ragini Rajendra Prasad
  • Patent number: 11393466
    Abstract: An electronic device and method are disclosed herein. The electronic device includes a communication interface, a processor and a memory. The processor implements the method, including detecting a login to a first user account through a communication interface, identifying electronic medical record (EMR) data stored in a memory corresponding to the first user account based at least in part on a result of the detected login, generate first utterance data for output through a user device based at least in part on the stored EMR data, wherein the first utterance data is generated before any data associated with utterance is received from the user device; and transmitting the generated first utterance data to the user device through the communication interface for output by the user device.
    Type: Grant
    Filed: October 22, 2019
    Date of Patent: July 19, 2022
    Assignee: Samsung Electronics Co., Ltd.
    Inventor: Joohyun Kim
  • Patent number: 11386887
    Abstract: This disclosure proposes systems and methods for processing natural language inputs using data associated with multiple language recognition contexts (LRC). A system using multiple LRCs can receive input data from a device, identify a first identifier associated with the device, and further identify second identifiers associated with the first identifier and representing candidate users of the device. The system can access language processing data used for natural language processing for the LRCs corresponding to each of the first and second identifiers, and process the input data using the language processing data at one or more stages of automatic speech recognition, natural language understanding, entity resolution, and/or command execution. User recognition can reduce the number of candidate users, and thus the amount of data used to process the input data. Dynamic arbitration can select from between competing hypotheses representing the first identifier and a second identifier, respectively.
    Type: Grant
    Filed: March 23, 2020
    Date of Patent: July 12, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Da Teng, Adrian Evans, Naresh Narayanan
  • Patent number: 11380332
    Abstract: Provided is an information processing apparatus capable of reliably delivering a message to a third party desired by a user. Provided is an information processing apparatus including an acquisition unit configured to acquire information including a sound message, and a recognition unit configured to recognize a sender of the sound message, a destination of a message included is the sound message, and content of the message from the information acquired by the acquisition unit, in which the recognition unit generates information for inputting the destination of the message is a case where the destination cannot be uniquely specified.
    Type: Grant
    Filed: January 31, 2018
    Date of Patent: July 5, 2022
    Assignee: Sony Mobile Communications Inc.
    Inventors: Manabu Kii, Hidehiro Komatsu, Atsushi Ishihara, Yu Shigeta, Junji Itoyama
  • Patent number: 11380334
    Abstract: Methods and Systems for Interactive Online Language Learning in a Pandemic-aware World.
    Type: Grant
    Filed: October 21, 2019
    Date of Patent: July 5, 2022
    Inventors: Norman Abramovitz, Jonathan Stiebel
  • Patent number: 11373634
    Abstract: An electronic device secures diversity of a user utterance with respect to a content name when a user searches a content through a display device by utilizing a voice. A method by an electronic device includes steps of receiving input of a user voice, acquiring a keyword related to a content included in the user voice, and acquiring at least one modified keyword based on the keyword, acquiring a plurality of search results corresponding to the keyword and the at least one modified keyword, comparing the keyword and the modified keyword with the plurality of search results and acquiring a content name corresponding to the keyword, and updating a database of content names based on the keyword, the modified keyword, and the final content name.
    Type: Grant
    Filed: October 30, 2019
    Date of Patent: June 28, 2022
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Jiwon Yoo, Jihun Park
  • Patent number: 11367444
    Abstract: A search is performed based on a voice input combined with user selection of entities displayed on a display screen as well as real-world entities. A voice input is received from the user by a media device, as well as a selection of a first entity being displayed on the media device. A conjunction spoken in the voice input triggers the media device to wait for selection of a second entity before performing the search. After receiving selection of the second entity, a search query is constructed based on the voice input, the first entity, and the second entity. The search query is transmitted to a database and, in response, the media device receives at least one identifier of a least one content item. The at least one identifier is then generated for display to the user.
    Type: Grant
    Filed: January 7, 2020
    Date of Patent: June 21, 2022
    Assignee: ROVI GUIDES, INC.
    Inventors: Susanto Sen, Charishma Chundi