Patents Examined by Shaun A Roberts

Knowledge-grounded dialogue system and method for language learning

Patent number: 11989524

Abstract: The present invention addresses three problems in dialogue technology for language learning and in knowledge-grounded dialogue technology: the difficulty of constructing a dialogue corpus, ensuring the accuracy of system utterances, and evaluating user utterances. The disclosed system and method help a learner by constructing a language learning dialogue corpus from passages and exercises commonly used in language education and on learning sites, training a dialogue model and a dialogue evaluation model with that corpus, and allowing a user and the system to hold a dialogue on the basis of a given passage. This is expected to make it possible to implement a dialogue system for language learning that can perform evaluation and whose domain (learning content) can be expanded easily.

Type: Grant

Filed: October 19, 2021

Date of Patent: May 21, 2024

Assignee: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE

Inventor: Jinxia Huang

Voice-controlled media play in smart media environment

Patent number: 11990126

Abstract: A method is implemented to move media content display between two media output devices. A server system determines, in a voice message recorded by an electronic device, a media transfer request that includes a user voice command to transfer media content to a destination media output device and a user voice designation of the destination media output device. The server system then obtains from a source cast device instant media play information, including information of a media play application, the media content that is being played, and a temporal position. The server system further identifies, in a user domain, a destination cast device coupled to the destination media output device, and sends to the destination cast device a media play request including the instant media play information, thereby enabling the destination cast device to execute the media play application and play the media content from the temporal position.

Type: Grant

Filed: May 23, 2022

Date of Patent: May 21, 2024

Assignee: Google LLC

Inventors: Raunaq Shah, Matt Van Der Staay

Truncateable predictive coding

Patent number: 11978460

Abstract: A method, system, and computer program to encode and decode a channel coherence parameter applied on a frequency band basis, where the coherence parameters of each frequency band form a coherence vector. The coherence vector is encoded and decoded using a predictive scheme followed by a variable bit rate entropy coding.

Type: Grant

Filed: August 3, 2022

Date of Patent: May 7, 2024

Assignee: TELEFONAKTIEBOLAGET LM ERICSSON (PUBL)

Inventors: Erik Norvell, Fredrik Jansson

Generating topic-specific language models

Patent number: 11978439

Abstract: Speech recognition may be improved by generating and using a topic specific language model. A topic specific language model may be created by performing an initial pass on an audio signal using a generic or basis language model. A speech recognition device may then determine topics relating to the audio signal based on the words identified in the initial pass and retrieve a corpus of text relating to those topics. Using the retrieved corpus of text, the speech recognition device may create a topic specific language model. In one example, the speech recognition device may adapt or otherwise modify the generic language model based on the retrieved corpus of text.

Type: Grant

Filed: December 20, 2022

Date of Patent: May 7, 2024

Assignee: TiVo Corporation

Inventors: David F. Houghton, Seth Michael Murray, Sibley Verbeck Simon

Detection of attachment problem of apparatus being worn by user

Patent number: 11977617

Abstract: The disclosure aims to prevent false determinations caused by the attachment condition of an apparatus that transmits and receives an acoustic signal, and to perform accurate personal authentication. A personal authentication device includes: a personal authentication means that authenticates an individual by using first information at least including an acoustic characteristic and a feature amount extracted from the acoustic characteristic, where the acoustic characteristic is calculated from an acoustic signal that propagates through the head of the user and is detected by an apparatus attached to the head of the user for transmitting and receiving the acoustic signal; an attachment trouble rule storage means that stores an attachment trouble rule for detecting an attachment trouble with the apparatus; and an attachment trouble detection means that detects a trouble with the attachment state of the apparatus when the first information satisfies the attachment trouble rule.

Type: Grant

Filed: November 29, 2022

Date of Patent: May 7, 2024

Assignee: NEC CORPORATION

Inventors: Takayuki Arakawa, Takafumi Koshinaka

Stereo signal encoding method and apparatus using a residual signal encoding parameter

Patent number: 11978463

Abstract: A stereo signal encoding method includes: obtaining a residual signal encoding parameter of a current frame of a stereo signal based on downmixed signal energy and residual signal energy of each of M sub-bands of the current frame, where the residual signal encoding parameter indicates whether to encode residual signals of the M sub-bands; determining whether to encode the residual signals based on the residual signal encoding parameter; and encoding the residual signals when it is determined that the residual signals need to be encoded.

Type: Grant

Filed: August 11, 2022

Date of Patent: May 7, 2024

Assignee: HUAWEI TECHNOLOGIES CO., LTD.

Inventors: Bin Wang, Zexin Liu, Haiting Li
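
The decision flow in the abstract above (derive a parameter from per-sub-band energies, then decide whether to encode the residual signals) can be sketched in Python. This is only an illustration under stated assumptions: the abstract does not say how the parameter is computed, so the residual-to-downmix energy ratio, the threshold value, and the function names `residual_encoding_parameter` and `should_encode_residual` are all hypothetical.

```python
from typing import Sequence


def residual_encoding_parameter(
    downmix_energy: Sequence[float], residual_energy: Sequence[float]
) -> float:
    """Assumed parameter: total residual energy over total downmix energy
    across the M sub-bands of the current frame (not the patented formula)."""
    if len(downmix_energy) != len(residual_energy):
        raise ValueError("both inputs must cover the same M sub-bands")
    total_downmix = sum(downmix_energy)
    total_residual = sum(residual_energy)
    if total_downmix == 0.0:
        # Avoid division by zero: any residual energy dominates a silent downmix.
        return float("inf") if total_residual > 0.0 else 0.0
    return total_residual / total_downmix


def should_encode_residual(parameter: float, threshold: float = 0.1) -> bool:
    """Assumed rule: encode the residual signals only when the parameter
    exceeds a (hypothetical) threshold."""
    return parameter > threshold


# Usage: two sub-bands, residual energy is a quarter of the downmix energy.
param = residual_encoding_parameter([4.0, 4.0], [1.0, 1.0])
encode = should_encode_residual(param, threshold=0.2)
```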

Inaudible frequency transmission in interactive content

Patent number: 11978446

Abstract: Systems and methods may be used to detect an inaudible signal associated with a first audible signal of an audio input. The inaudible signal may include a frequency signature. The frequency signature may be associated with an electronic device type. The systems and methods may activate a response monitor. The response monitor may be activated for a predetermined time. The response monitor may be activated responsive to the frequency signature. The systems and methods may determine a content characteristic of the first audible signal based on the inaudible signal. The systems and methods may include generating a message. The message may be based on the content characteristic. The systems and methods may include transmitting the message. The message may be transmitted on a condition that a second audible signal corresponds to the message and is received within the predetermined time.

Type: Grant

Filed: April 20, 2021

Date of Patent: May 7, 2024

Assignee: Rovi Guides, Inc.

Inventors: David D. Shoop, Dylan M. Wondra

Systems and methods for covariance smoothing

Patent number: 11972767

Abstract: Methods and systems for improving signal processing by smoothing the covariance matrix of a multi-channel signal by setting a forgetting factor based on the bins of a band. A method and system for resetting the smoothing based on transient detection is also disclosed. A method and system for resampling for the smoothing during a banding transition is also disclosed.

Type: Grant

Filed: July 31, 2020

Date of Patent: April 30, 2024

Assignee: Dolby Laboratories Licensing Corporation

Inventors: David S. McGrath, Stefanie Brown, Juan Felix Torres
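
As a rough illustration of the idea in the abstract above, the following Python sketch applies recursive (exponential) smoothing to a per-band covariance matrix, with a forgetting factor that depends on how many frequency bins the band contains and a reset when a transient is detected. The specific rule in `forgetting_factor`, the `target_bins` parameter, and all function names are assumptions made for this example; the abstract does not give the actual formula.

```python
from typing import List, Optional

Matrix = List[List[float]]


def forgetting_factor(num_bins: int, target_bins: int = 8) -> float:
    """Hypothetical rule: bands with few bins get a small factor (heavy
    smoothing); bands with at least target_bins bins are not smoothed."""
    return min(1.0, num_bins / target_bins)


def smooth_covariance(
    previous: Optional[Matrix],
    current: Matrix,
    num_bins: int,
    transient: bool = False,
) -> Matrix:
    """Return a*current + (1 - a)*previous, element by element.

    The smoothing state is reset to the current matrix on the first frame
    or when a transient has been detected.
    """
    if previous is None or transient:
        return [row[:] for row in current]
    a = forgetting_factor(num_bins)
    return [
        [a * c + (1.0 - a) * p for c, p in zip(cur_row, prev_row)]
        for cur_row, prev_row in zip(current, previous)
    ]


# Usage: a two-channel band with 4 bins gives a forgetting factor of 0.5.
prev = [[1.0, 0.0], [0.0, 1.0]]
cur = [[3.0, 2.0], [2.0, 3.0]]
smoothed = smooth_covariance(prev, cur, num_bins=4)
```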

Lexicon development via shared translation database

Patent number: 11972227

Abstract: A speech translation system and methods for cross-lingual communication that enable users to easily improve and customize the content and usage of the system. The methods include, in response to receiving an utterance including a first term associated with a field, translating the utterance into a second language. The methods further include, in response to receiving an indication to add the first term associated with the field to a first recognition lexicon, adding the first term and the determined translation to a first machine translation module and to a shared database for a community associated with the field of the first term, wherein the first term added to the shared database is accessible by the community.

Type: Grant

Filed: December 7, 2021

Date of Patent: April 30, 2024

Assignee: Meta Platforms, Inc.

Inventors: Alexander Waibel, Ian R. Lane

Decoding communications with token sky maps

Patent number: 11960845

Abstract: Embodiments relate to decoding communications with token sky maps. At least one electronic communication including emoticons having a non-original meaning is received. A candidate meaning is determined for the emoticons having the non-original meaning in the at least one electronic communication based at least in part on token neighborhood distribution structures. The candidate meaning for the emoticons having the non-original meaning is caused to be displayed on at least one device.

Type: Grant

Filed: October 20, 2021

Date of Patent: April 16, 2024

Assignee: International Business Machines Corporation

Inventors: Ziqiumin Wang, Qing Lu, Wei Jun Zheng, Xiao Feng Ji, Yuan Jin

Determining position values for transformer models

Patent number: 11954448

Abstract: Embodiments of the present disclosure include systems and methods for determining position values for training data that is used to train transformer models. In some embodiments, a set of input data for training a transformer model is received. The set of input data comprises a set of tokens. Based on an offset value, a set of successive position values for the set of tokens is determined. Each position value in the set of successive position values represents a position of a token in the set of tokens relative to other tokens in the set of tokens. A set of training data is generated to comprise the set of tokens and the set of successive position values. The transformer model is trained using the set of training data.

Type: Grant

Filed: July 21, 2020

Date of Patent: April 9, 2024

Assignee: Microsoft Technology Licensing, LLC

Inventors: Andy Wagner, Tiyasa Mitra, Marc Tremblay
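
The position-value step described in the abstract above can be sketched in a few lines of Python. This is a minimal illustration, not the patented method: the function names `position_values` and `build_training_example` are hypothetical, and the abstract does not say how the offset value is chosen, so it is simply passed in as an argument here.

```python
from typing import List, Tuple


def position_values(tokens: List[str], offset: int) -> List[int]:
    """Assign successive position values to the tokens, starting at offset.

    Each value encodes a token's position relative to the other tokens in
    the same set, shifted by the offset.
    """
    return [offset + index for index in range(len(tokens))]


def build_training_example(tokens: List[str], offset: int) -> List[Tuple[str, int]]:
    """Pair every token with its position value to form one training example."""
    return list(zip(tokens, position_values(tokens, offset)))


# Usage: three tokens with an offset of 5 receive positions 5, 6, and 7.
example = build_training_example(["the", "cat", "sat"], offset=5)
```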

Implementations for voice assistant on devices

Patent number: 11935535

Abstract: An electronic device configures a device-agnostic voice assistant library for execution on the electronic device based on the electronic device having a first device type. The electronic device also selects an implementation for the voice assistant library. After the configuring, the electronic device receives a verbal input from a user. It extracts request information from the verbal input by processing the verbal input using the voice assistant library executing on the device. It transmits a request to a remote system, the request including the extracted request information. The electronic device receives a response to the request. The response is generated by the remote system in accordance with the extracted request information. The electronic device performs an operation in accordance with the response by one or more voice processing modules of the configured voice assistant library.

Type: Grant

Filed: June 3, 2022

Date of Patent: March 19, 2024

Assignee: Google LLC

Inventors: Kenneth Mixter, Raunaq Shah

Subband block based harmonic transposition

Patent number: 11935555

Abstract: The present document relates to audio source coding systems which make use of a harmonic transposition method for high frequency reconstruction (HFR), as well as to digital effect processors, e.g. exciters, where generation of harmonic distortion adds brightness to the processed signal, and to time stretchers where a signal duration is prolonged with maintained spectral content. A system and method configured to generate a time stretched and/or frequency transposed signal from an input signal is described. The system comprises an analysis filterbank configured to provide an analysis subband signal from the input signal, wherein the analysis subband signal comprises a plurality of complex valued analysis samples, each having a phase and a magnitude. Furthermore, the system comprises a subband processing unit configured to determine a synthesis subband signal from the analysis subband signal using a subband transposition factor Q and a subband stretch factor S.

Type: Grant

Filed: March 30, 2023

Date of Patent: March 19, 2024

Assignee: DOLBY INTERNATIONAL AB

Inventor: Lars Villemoes

User-perceived latency while maintaining accuracy

Patent number: 11929076

Abstract: Disclosed speech recognition techniques improve user-perceived latency while maintaining accuracy by: receiving an audio stream, in parallel, by a primary (e.g., accurate) speech recognition engine (SRE) and a secondary (e.g., fast) SRE; generating, with the primary SRE, a primary result; generating, with the secondary SRE, a secondary result; appending the secondary result to a word list; and merging the primary result into the secondary result in the word list. Combining output from the primary and secondary SREs into a single decoder as described herein improves user-perceived latency while maintaining or improving accuracy, among other advantages.

Type: Grant

Filed: December 1, 2022

Date of Patent: March 12, 2024

Assignee: Microsoft Technology Licensing, LLC

Inventors: Hosam Adel Khalil, Emilian Stoimenov, Christopher Hakan Basoglu, Kshitiz Kumar, Jian Wu

Implementations for voice assistant on devices

Patent number: 11922941

Abstract: An electronic device stores a voice assistant library for execution on the electronic device based on the electronic device having a first device type. The electronic device receives a verbal input from a user. It extracts request information from the verbal input by processing the verbal input using the voice assistant library executing on the device. It transmits a request to a remote system. The electronic device receives a response to the request. The response is generated by the remote system. The electronic device performs an operation in accordance with the response by one or more voice-processing modules of the configured voice assistant library.

Type: Grant

Filed: July 25, 2023

Date of Patent: March 5, 2024

Assignee: Google LLC

Inventors: Kenneth Mixter, Raunaq Shah

Voice to text conversion based on third-party agent content

Patent number: 11922945

Abstract: Implementations relate to dynamically, and in a context-sensitive manner, biasing voice to text conversion. In some implementations, the biasing of voice to text conversions is performed by a voice to text engine of a local agent, and the biasing is based at least in part on content provided to the local agent by a third-party (3P) agent that is in network communication with the local agent. In some of those implementations, the content includes contextual parameters that are provided by the 3P agent in combination with responsive content generated by the 3P agent during a dialog that: is between the 3P agent, and a user of a voice-enabled electronic device; and is facilitated by the local agent. The contextual parameters indicate potential feature(s) of further voice input that is to be provided in response to the responsive content generated by the 3P agent.

Type: Grant

Filed: March 23, 2023

Date of Patent: March 5, 2024

Assignee: GOOGLE LLC

Inventors: Barnaby James, Bo Wang, Sunil Vemuri, David Schairer, Ulas Kirazci, Ertan Dogrultan, Petar Aleksic

Audio-visual speech separation

Patent number: 11894014

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for audio-visual speech separation. A method includes: obtaining, for each frame in a stream of frames from a video in which faces of one or more speakers have been detected, a respective per-frame face embedding of the face of each speaker; processing, for each speaker, the per-frame face embeddings of the face of the speaker to generate visual features for the face of the speaker; obtaining a spectrogram of an audio soundtrack for the video; processing the spectrogram to generate an audio embedding for the audio soundtrack; combining the visual features for the one or more speakers and the audio embedding for the audio soundtrack to generate an audio-visual embedding for the video; determining a respective spectrogram mask for each of the one or more speakers; and determining a respective isolated speech spectrogram for each speaker.

Type: Grant

Filed: September 22, 2022

Date of Patent: February 6, 2024

Assignee: Google LLC

Inventors: Inbar Mosseri, Michael Rubinstein, Ariel Ephrat, William Freeman, Oran Lang, Kevin William Wilson, Tali Dekel, Avinatan Hassidim

Methods and systems for optimized selection of data features for a neuro-linguistic cognitive artificial intelligence system

Patent number: 11875784

Abstract: Techniques are disclosed to optimize feature selection in generating betas for a feature dictionary of a neuro-linguistic Cognitive AI System. A machine learning engine receives a sample vector of input data to be analyzed by the neuro-linguistic Cognitive AI System. The neuro-linguistic Cognitive AI System is configured to generate multiple betas for each of a plurality of sensors. The machine learning engine identifies a sensor specified in the sample vector and selects optimization parameters for generating betas based on the identified sensor.

Type: Grant

Filed: November 30, 2020

Date of Patent: January 16, 2024

Assignee: Intellective Ai, Inc.

Inventors: Gang Xu, Tao Yang, Ming-Jung Seow

Spectral shape estimation from MDCT coefficients

Patent number: 11862180

Abstract: A method, decoder, and program code for controlling a concealment method for a lost audio frame are provided. A first audio frame and a second audio frame of the received audio signal are decoded to obtain modified discrete cosine transform (MDCT) coefficients. Values of a first spectral shape, based upon the MDCT coefficients decoded from the first audio frame, and values of a second spectral shape, based upon the MDCT coefficients decoded from the second audio frame, are determined, the spectral shapes each comprising a number of sub-bands. The values of the spectral shapes and the frame energies of the first audio frame and second audio frame are transformed into representations of FFT based spectral analyses. A transient condition is detected based on the representations of the FFTs. Responsive to detecting the transient condition, the concealment method is modified by selectively adjusting a spectrum magnitude of a substitution frame spectrum.

Type: Grant

Filed: February 20, 2020

Date of Patent: January 2, 2024

Assignee: TELEFONAKTIEBOLAGET LM ERICSSON (PUBL)

Inventors: Martin Sehlstedt, Jonas Svedberg

Virtual assistant domain functionality

Patent number: 11836453

Abstract: Aspects include methods, systems, and computer-program products providing virtual assistant domain functionality. A natural language query including one or more words is received. A collection of natural language modules is accessed. The natural language modules in the collection are configured to process sets of natural language queries. A natural language module, from the collection of natural language modules, is identified to interpret the natural language query. An interpretation of the natural language query is computed using the identified natural language module. A response to the natural language query is returned using the computed interpretation.

Type: Grant

Filed: July 22, 2021

Date of Patent: December 5, 2023

Assignee: SoundHound, Inc.

Inventors: Kamyar Mohajer, Keyvan Mohajer, Bernard Mont-Reynaud, Pranav Singh
