Patents Examined by Huyen X. Vo
  • Patent number: 11152005
    Abstract: A method of converting speech to text comprises receiving an audio recording from an input device comprising speech of a plurality of speakers. Extracting from the audio recording, a speaker audio recording comprising recorded audio of an individual speaker. Selecting, based on a characteristic of the speaker audio recording, a speech to text engine and a dictionary. Configuring the speech to text engine with the dictionary and executing a first conversion process to convert a first portion of the speaker audio recording to produce a first transcript. Evaluating a performance metric of the conversion process against a quality metric to reconfigure the speech to text engine and execute a second conversion process to convert a second portion of the speaker audio recording to produce a second transcript. Combining the first transcript and the second transcript to produce a transcript of the speaker audio recording.
    Type: Grant
    Filed: September 11, 2019
    Date of Patent: October 19, 2021
    Assignee: VIQ Solutions Inc.
    Inventor: Malcolm Macallum
  • Patent number: 11145301
    Abstract: A system and method establishes a communication connection between a first device of a first user and a second device of a second user. Request data corresponding to a request to establish a communication connection with a second user is received, and a user profile associated with the second user is determined. One or more sensors of the second device receive input data corresponding to the environment of the second device, and an identity of the second user is determined based thereon. The communication connection is established and, based on the identity, the second device tracks movement of the second user in the environment.
    Type: Grant
    Filed: September 25, 2018
    Date of Patent: October 12, 2021
    Assignee: Amazon Technologies, Inc.
    Inventors: Shambhavi Sathyanarayana Rao, Anna Chen Santos, Tony Roy Hardie
  • Patent number: 11133000
    Abstract: A portable gateway device for use with a Building Management System (BMS) enables voice command control of BMS devices. The portable gateway device comprises a Wi-Fi module, a serial communications interface, and a data conversion module. The Wi-Fi module is configured to enable communication with a user device via Wi-Fi. The portable gateway device is configured to receive a voice command spoken by a user of the user device. The serial communications interface is configured to enable communication with a bus connected to a BMS device. The data conversion module is configured to translate the voice command received from the user device into a control action associated with the BMS device. The portable gateway device is configured to provide the control action to the BMS device via the serial communications interface. The control action affects a state or condition of the BMS device.
    Type: Grant
    Filed: October 12, 2018
    Date of Patent: September 28, 2021
    Assignee: Johnson Controls Tyco IP Holdings LLP
    Inventors: Sumit Kumar, Pramod Balbhim Kolhapure, Sachin Yashwant Pate, Suraj Sunil Lawand, Ankur Thareja, Shyam M. Sunder
  • Patent number: 11127402
    Abstract: The present disclosure relates generally to a system and method for voice development frameworks. Certain cloud-based systems may be embodied in a multi-instance or multi-tenant framework, and may provide for certain computing systems and resources. For example, the cloud-based systems may provide for data repositories and the creation of executable objects, e.g., Flow Designer objects that include voice commands. In certain embodiments, visual development tools, including a Flow Designer system, may be used to create the executable objects, including voice command objects. For example, the Flow Designer system may enable the non-technical personnel to use natural language to more easily create and visualize objects and processes that automate certain tasks.
    Type: Grant
    Filed: May 3, 2019
    Date of Patent: September 21, 2021
    Assignee: ServiceNow, Inc.
    Inventors: Santosh Kumar Das, Gagan deep, Sumit Rathi, Ashita Narayan, Chakradhar Narasimha Jillellamudi, Raghavan Muthuraman
  • Patent number: 11128714
    Abstract: A method of using voice commands from a mobile device to remotely access and control a computer. The method includes receiving audio data from the mobile device at the computer. The audio data is decoded into a command. A software program that the command was provided for is determined. At least one process is executed at the computer in response to the command. Output data is generated at the computer in response to executing at least one process at the computer. The output data is transmitted to the mobile device.
    Type: Grant
    Filed: June 9, 2020
    Date of Patent: September 21, 2021
    Assignee: Voice Tech Corporation
    Inventor: Todd R. Smith
  • Patent number: 11120405
    Abstract: The present disclosure generally relates to interview training and providing interview feedback. An exemplary method comprises: at an electronic device that is in communication with a display and one or more input devices: receiving, via the one or more input devices, media data corresponding to a user's responses to a plurality of prompts; analyzing the media data; and while displaying, on the display, a media representation of the media data, displaying a plurality of analysis representations overlaid on the media representation, wherein each of the plurality of analysis representations is associated with an analysis of content located at a given time in the media representation and is displayed in coordination with the given time in the media representation.
    Type: Grant
    Filed: September 17, 2020
    Date of Patent: September 14, 2021
    Assignee: Korn Ferry
    Inventors: Thom Steinhoff, Panos S. Stamus, Bryan Ackermann, John Deyto
  • Patent number: 11120794
    Abstract: Systems and methods for maintaining voice assistant persistence across multiple network microphone devices are described. In one example, first and second NMDs each identify a wake word based on detected sound, and are each transitioned from an inactive state to an active state in which the NMD captures and transmits sound data over a network interface. The first NMD is selected over the second NMD to output a first response, and both NMDs remain in the active state to further capture and transmit sound data. After further capturing and transmitting of sound data, the second NMD is selected over the first NMD to output a second response. After a predetermined time, one or both of the NMDs are transitioned back to the inactive state. The selection of one NMD over another for outputting a response can be based at least in part on user location information.
    Type: Grant
    Filed: May 3, 2019
    Date of Patent: September 14, 2021
    Assignee: Sonos, Inc.
    Inventors: Connor Kristopher Smith, Paul Bates
  • Patent number: 11120803
    Abstract: A building automation system (BAS) is configured to support a variety of different natural language processing (NLP) service providers with minimal or no redesign effort. The BAS includes an event handler configured to receive an external request from a service provider. The external request is associated with a voice input uttered by a user. The BAS further includes an abstraction layer configured to receive the external request from the event handler and generate an internal request based on the external request. The BAS further includes an intent processor configured to receive the internal request from the abstraction layer and an intent handler in communication with the intent processor and configured to perform an action in accordance with the internal request.
    Type: Grant
    Filed: January 11, 2019
    Date of Patent: September 14, 2021
    Assignee: Johnson Controls Tyco IP Holdings LLP
    Inventors: Daniel Mellenthin, Gerald A. Asp, Joseph M. Mueller
  • Patent number: 11107482
    Abstract: The present disclosure relates to systems and methods for speech signal processing on a signal to transcribe speech. In one implementation, the system may include a memory storing instructions and a processor configured to execute the instructions. The instructions may include instructions to receive the signal, determine if at least a portion of data in the signal is missing, and when at least a portion of data is missing: process the signal using a hidden Markov model to generate an output; using the output, calculate a set of possible contents to fill a gap due to the missing data portion, with each possible content having an associated probability; based on the associated probabilities, select one of the set of possible contents; and using the selected possible content, update the signal.
    Type: Grant
    Filed: December 5, 2019
    Date of Patent: August 31, 2021
    Assignee: RingCentral, Inc.
    Inventors: Xiaoming Li, Ehtesham Khan, Santosh Panattu Sethumadhavan
  • Patent number: 11107487
    Abstract: An apparatus and method are disclosed for processing an audio signal. The apparatus includes an input interface, a digital filterbank having an analysis part and a synthesis part, a first phase shifter, a spectral envelope adjuster, a second phase shifter, and an output interface. The first phase shifter and the second phase shifter reduce a complexity of the digital filterbank, which includes both analysis and synthesis filters that are complex-exponential modulated versions of a prototype filter.
    Type: Grant
    Filed: October 28, 2019
    Date of Patent: August 31, 2021
    Assignee: Dolby International AB
    Inventor: Per Ekstrand
  • Patent number: 11107041
    Abstract: The present disclosure generally relates to interview training and providing interview feedback. An exemplary method comprises: at an electronic device that is in communication with a display and one or more input devices: receiving, via the one or more input devices, media data corresponding to a user's responses to a plurality of prompts; analyzing the media data; and while displaying, on the display, a media representation of the media data, displaying a plurality of analysis representations overlaid on the media representation, wherein each of the plurality of analysis representations is associated with an analysis of content located at a given time in the media representation and is displayed in coordination with the given time in the media representation.
    Type: Grant
    Filed: July 8, 2019
    Date of Patent: August 31, 2021
    Assignee: Korn Ferry
    Inventors: Thom Steinhoff, Panos S. Stamus, Bryan Ackermann, John Deyto
  • Patent number: 11100923
    Abstract: Systems and methods for media playback via a media playback system include capturing sound data via a network microphone device and identifying a candidate wake word in the sound data. Based on identification of the candidate wake word in the sound data, the system selects a first wake-word engine from a plurality of wake-word engines. Via the first wake-word engine, the system analyzes the sound data to detect a confirmed wake word, and, in response to detecting the confirmed wake word, transmits a voice utterance of the sound data to one or more remote computing devices associated with a voice assistant service.
    Type: Grant
    Filed: September 28, 2018
    Date of Patent: August 24, 2021
    Assignee: Sonos, Inc.
    Inventors: Joachim Fainberg, Daniele Giacobello, Klaus Hartung
  • Patent number: 11100934
    Abstract: A method and an apparatus for voiceprint creation and registration, comprising: prompting to create a voiceprint and register when a device is enabled for a first time (101); using a text-related training method to create a voiceprint model for a user (102); generating an ID for the user (103); and prompting the user to input user ID-related data; storing the ID for the user and the voiceprint model correspondingly in a voiceprint registration database (104). The problems in the prior art that the technology of the voiceprint creation and registration method has a high learning cost and is more disturbing to the user may be avoided. The voiceprint creation process may cover various scenes, the voiceprint creation may guide the user in all stages, or the voiceprint creation is separated from registration through a frequency to minimize user's disturbance, and after the user is guided to register the voiceprint, the speech interaction product may provide personalized service to the user based on the voiceprint.
    Type: Grant
    Filed: November 30, 2017
    Date of Patent: August 24, 2021
    Assignees: BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD., SHANGHAI XIAODU TECHNOLOGY CO. LTD.
    Inventors: Wenyu Wang, Yuan Hu
  • Patent number: 11094315
    Abstract: A determination unit (7) determines whether or not a specific passenger in a car has spoken, on the basis of sound data collected in the car. A control unit (8) activates an in-car communication function, when it is determined by the determination unit (7) that the specific passenger has spoken.
    Type: Grant
    Filed: March 17, 2017
    Date of Patent: August 17, 2021
    Assignee: MITSUBISHI ELECTRIC CORPORATION
    Inventor: Gen Nishikawa
  • Patent number: 11093714
    Abstract: The present disclosure is directed to optimizing transfer learning for neural networks by creating a dynamic transfer network configuration through gated architecture. In some embodiments, transfer learning implements multiple parameter sharing schemes across a source task and a target task. The gating architecture can learn the optimal parameter sharing schemes as the neural network is trained. In some embodiments, the system can be used in named entity recognition applications where the training data is limited.
    Type: Grant
    Filed: March 5, 2019
    Date of Patent: August 17, 2021
    Assignee: Amazon Technologies, Inc.
    Inventor: Parminder Bhatia
  • Patent number: 11080488
    Abstract: A non-transitory computer-readable recording medium stores therein an output control program that causes a computer to execute a process including: receiving a phoneme string for a text having a plurality of sentences; determining a sentence corresponding to a specific phoneme or a phoneme string included in the received phoneme string; referring to a storage that stores therein co-occurrence information on sentences for words in association with the words and determining a word the co-occurrence information on the determined sentence of which satisfies a standard among the words; changing the specific phoneme or the phoneme string included in the received phoneme string to the determined word to generate a text corresponding to the received phoneme string; and outputting the generated text.
    Type: Grant
    Filed: February 21, 2019
    Date of Patent: August 3, 2021
    Assignee: FUJITSU LIMITED
    Inventors: Masahiro Kataoka, Masao Ideuchi, Tamana Kobayashi
  • Patent number: 11074913
    Abstract: Various embodiments are provided for understanding user sentiment in a dialog system in a computing environment by a processor. A sentiment of a user may be detected according to a sentiment analysis and user feedback during a dialog with the user. One or more reasons for the sentiment of the user may be identified. Behavior of the dialog system may be adjusted according to the one or more reasons.
    Type: Grant
    Filed: January 3, 2019
    Date of Patent: July 27, 2021
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Oznur Alkan, Adi I. Botea, Elizabeth Daly, Matthew Davis, Christian Muise
  • Patent number: 11062702
    Abstract: Disclosed herein are system, apparatus, article of manufacture, method and/or computer program product embodiments, and/or combinations and sub-combinations thereof, for providing voice control using multiple digital assistants. In some embodiments, a voice platform operates to receive a voice input from a user. The voice platform selects a digital assistant from a plurality of digital assistants based on a trigger word. The voice platform then generates an intent from the voice input using the selected digital assistant. The voice platform then transmits the intent to a media device for processing.
    Type: Grant
    Filed: July 11, 2018
    Date of Patent: July 13, 2021
    Assignee: Roku, Inc.
    Inventors: Anthony John Wood, David Stern, Gregory Mack Garner
  • Patent number: 11048293
    Abstract: An electronic device includes a speaker, a microphone, a communication circuit, a processor operatively connected to the speaker, the microphone, and the communication circuit, and a memory operatively connected to the processor. The memory stores instructions that, when executed, cause the processor to receive a user input to activate an intelligent system, to determine at least part of a duration to receive a user utterance via the microphone, based at least partly on a state of the electronic device, to receive a first user utterance via the microphone after receiving the user input, to transmit first data associated with the first user utterance to an external server via the communication circuit, and to receive a first response from the external server via the communication circuit. The first response is generated based at least partly on the first data.
    Type: Grant
    Filed: July 19, 2018
    Date of Patent: June 29, 2021
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Ho Seon Shin, Chui Min Lee, Seung Yeol Lee, Seong Min Je
  • Patent number: 11050499
    Abstract: Methods and systems for collecting and analyzing an audience response are provided. An example method commences with determining that a media playing device has played a question within a media stream. The method further includes recording, via an acoustic sensor, an ambient acoustic signal for a pre-determined time interval. The method further includes detecting a presence of a voice of a user in the ambient acoustic signal, providing the ambient acoustic signal to a remote computing system, and performing, by the remote computing system, speech recognition of the ambient acoustic signal to obtain a text response. The method further includes adding, by the remote computing system, the text response to a set of text responses and analyzing the set of text responses to obtain statistics concerning text results. The method then continues with providing, by the remote computing system, the statistics to a provider of the media stream.
    Type: Grant
    Filed: March 12, 2021
    Date of Patent: June 29, 2021
    Assignee: INSTREAMATIC, INC.
    Inventor: Stanislav Tushinskiy