Patents Examined by Shaun Roberts
  • Patent number: 11978463
    Abstract: A stereo signal encoding method includes: obtaining a residual signal encoding parameter of a current frame of a stereo signal based on downmixed signal energy and residual signal energy of each of M sub-bands of the current frame, where the residual signal encoding parameter indicates whether to encode residual signals of the M sub-bands; determining whether to encode the residual signals based on the residual signal encoding parameter; and encoding the residual signals when it is determined that the residual signals need to be encoded.
    Type: Grant
    Filed: August 11, 2022
    Date of Patent: May 7, 2024
    Assignee: HUAWEI TECHNOLOGIES CO., LTD.
    Inventors: Bin Wang, Zexin Liu, Haiting Li
  • Patent number: 11978460
    Abstract: A method, system, and computer program to encode and decode a channel coherence parameter applied on a frequency band basis, where the coherence parameters of each frequency band form a coherence vector. The coherence vector is encoded and decoded using a predictive scheme followed by a variable bit rate entropy coding.
    Type: Grant
    Filed: August 3, 2022
    Date of Patent: May 7, 2024
    Assignee: TELEFONAKTIEBOLAGET LM ERICSSON (PUBL)
    Inventors: Erik Norvell, Fredrik Jansson
  • Patent number: 11977617
    Abstract: Provided is to prevent a false determination due to an attachment condition of an apparatus that transmits and receives an acoustic signal, and perform accurate personal authentication. A personal authentication device includes: a personal authentication means that authenticates an individual by using first information at least including an acoustic characteristic calculated from an acoustic signal propagating through the head of the user, which is detected by an apparatus being attached on a head of a user for transmitting and receiving the acoustic signal, and a feature amount extracted from the acoustic characteristic; an attachment trouble rule storage means that stores an attachment trouble rule for detecting an attachment trouble with the apparatus; and an attachment trouble detection means that detects a trouble with an attachment state of the apparatus when the first information satisfies the attachment trouble rule.
    Type: Grant
    Filed: November 29, 2022
    Date of Patent: May 7, 2024
    Assignee: NEC CORPORATION
    Inventors: Takayuki Arakawa, Takafumi Koshinaka
  • Patent number: 11978439
    Abstract: Speech recognition may be improved by generating and using a topic specific language model. A topic specific language model may be created by performing an initial pass on an audio signal using a generic or basis language model. A speech recognition device may then determine topics relating to the audio signal based on the words identified in the initial pass and retrieve a corpus of text relating to those topics. Using the retrieved corpus of text, the speech recognition device may create a topic specific language model. In one example, the speech recognition device may adapt or otherwise modify the generic language model based on the retrieved corpus of text.
    Type: Grant
    Filed: December 20, 2022
    Date of Patent: May 7, 2024
    Assignee: TiVo Corporation
    Inventors: David F. Houghton, Seth Michael Murray, Sibley Verbeck Simon
  • Patent number: 11978446
    Abstract: Systems and methods may be used to detect an inaudible signal associated with a first audible signal of an audio input. The inaudible signal may include a frequency signature. The frequency signature may be associated with an electronic device type. The systems and methods may activate a response monitor. The response monitor may be activated for a predetermined time. The response monitor may be activated responsive to the frequency signature. The systems and methods may determine a content characteristic of the first audible signal based on the inaudible signal. The systems and methods may include generating a message. The message may be based on the content characteristic. The systems and methods may include transmitting the message. The message may be transmitted on a condition that a second audible signal corresponds to the message and is received within the predetermined time.
    Type: Grant
    Filed: April 20, 2021
    Date of Patent: May 7, 2024
    Assignee: Rovi Guides, Inc.
    Inventors: David D. Shoop, Dylan M. Wondra
  • Patent number: 11972767
    Abstract: Methods and systems for improving signal processing by smoothing the covariance matrix of a multi-channel signal by setting a forgetting factor based on the bins of a band. A method and system for resetting the smoothing based on transient detection is also disclosed. A method and system for resampling for the smoothing during a banding transition is also disclosed.
    Type: Grant
    Filed: July 31, 2020
    Date of Patent: April 30, 2024
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: David S. McGrath, Stefanie Brown, Juan Felix Torres
  • Patent number: 11972227
    Abstract: A speech translation system and methods for cross-lingual communication that enable users to improve and customize content and usage of the system and easily. The methods include, in response to receiving an utterance including a first term associated with a field, translating the utterance into a second language. In response to receiving an indication to add the first term associated with the field to a first recognition lexicon, adding the first term associated with the field and the determined translation to a first machine translation module and to a shared database for a community associated with the field of the first term associated with the field, wherein the first term associated with the field added to the shared database is accessible by the community.
    Type: Grant
    Filed: December 7, 2021
    Date of Patent: April 30, 2024
    Assignee: Meta Platforms, Inc.
    Inventors: Alexander Waibel, Ian R. Lane
  • Patent number: 11960845
    Abstract: Embodiments relate to decoding communications with token sky maps. At least one electronic communication including emoticons having a non-original meaning is received. A candidate meaning is determined for the emoticons having the non-original meaning in the at least one electronic communication based at least in part on token neighborhood distribution structures. The candidate meaning for the emoticons having the non-original meaning is caused to be displayed on at least one device.
    Type: Grant
    Filed: October 20, 2021
    Date of Patent: April 16, 2024
    Assignee: International Business Machines Corporation
    Inventors: Ziqiumin Wang, Qing Lu, Wei Jun Zheng, Xiao Feng Ji, Yuan Jin
  • Patent number: 11954448
    Abstract: Embodiments of the present disclosure include systems and methods for determining position values for training data that is used to train transformer models. In some embodiments, a set of input data for training a transformer model is received. The set of input data comprises a set of tokens. Based on an offset value, a set of successive position values for the set of tokens is determined. Each position value in the set of successive position values represents a position of a token in the set of tokens relative to other tokens in the set of tokens. A set of training data is generated to comprise the set of tokens and the set of successive position values. The transformer model is trained using the set of training data.
    Type: Grant
    Filed: July 21, 2020
    Date of Patent: April 9, 2024
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Andy Wagner, Tiyasa Mitra, Marc Tremblay
  • Patent number: 11935535
    Abstract: An electronic device configures a device-agnostic voice assistant library for execution on the electronic device based on the electronic device having a first device type. The electronic device also selects an implementation for the voice assistant library. After the configuring, the electronic device receives a verbal input from a user. It extracts request information from the verbal input by processing the verbal input using the voice assistant library executing on the device. It transmits a request to a remote system, the request including the extracted request information. The electronic device receives a response to the request. The response is generated by the remote system in accordance with the extracted request information. The electronic device performs an operation in accordance with the response by one or more voice processing modules of the configured voice assistant library.
    Type: Grant
    Filed: June 3, 2022
    Date of Patent: March 19, 2024
    Assignee: Google LLC
    Inventors: Kenneth Mixter, Raunaq Shah
  • Patent number: 11935555
    Abstract: The present document relates to audio source coding systems which make use of a harmonic transposition method for high frequency reconstruction (HFR), as well as to digital effect processors, e.g. exciters, where generation of harmonic distortion add brightness to the processed signal, and to time stretchers where a signal duration is prolonged with maintained spectral content. A system and method configured to generate a time stretched and/or frequency transposed signal from an input signal is described. The system comprises an analysis filterbank configured to provide an analysis subband signal from the input signal; wherein the analysis subband signal comprises a plurality of complex valued analysis samples, each having a phase and a magnitude. Furthermore, the system comprises a subband processing unit configured to determine a synthesis subband signal from the analysis subband signal using a subband transposition factor Q and a subband stretch factor S.
    Type: Grant
    Filed: March 30, 2023
    Date of Patent: March 19, 2024
    Assignee: DOLBY INTERNATIONAL AB
    Inventor: Lars Villemoes
  • Patent number: 11929076
    Abstract: Disclosed speech recognition techniques improve user-perceived latency while maintaining accuracy by: receiving an audio stream, in parallel, by a primary (e.g., accurate) speech recognition engine (SRE) and a secondary (e.g., fast) SRE; generating, with the primary SRE, a primary result; generating, with the secondary SRE, a secondary result; appending the secondary result to a word list; and merging the primary result into the secondary result in the word list. Combining output from the primary and secondary SREs into a single decoder as described herein improves user-perceived latency while maintaining or improving accuracy, among other advantages.
    Type: Grant
    Filed: December 1, 2022
    Date of Patent: March 12, 2024
    Assignee: Microsoft Technology Licensing, LLC.
    Inventors: Hosam Adel Khalil, Emilian Stoimenov, Christopher Hakan Basoglu, Kshitiz Kumar, Jian Wu
  • Patent number: 11922941
    Abstract: An electronic device stores a voice assistant library for execution on the electronic device based on the electronic device having a first device type. The electronic device receives a verbal input from a user. It extracts request information from the verbal input by processing the verbal input using the voice assistant library executing on the device. It transmits a request to a remote system. The electronic device receives a response to the request. The response is generated by the remote system. The electronic device performs an operation in accordance with the response by one or more voice-processing modules of the configured voice assistant library.
    Type: Grant
    Filed: July 25, 2023
    Date of Patent: March 5, 2024
    Assignee: Google LLC
    Inventors: Kenneth Mixter, Raunaq Shah
  • Patent number: 11922945
    Abstract: Implementations relate to dynamically, and in a context-sensitive manner, biasing voice to text conversion. In some implementations, the biasing of voice to text conversions is performed by a voice to text engine of a local agent, and the biasing is based at least in part on content provided to the local agent by a third-party (3P) agent that is in network communication with the local agent. In some of those implementations, the content includes contextual parameters that are provided by the 3P agent in combination with responsive content generated by the 3P agent during a dialog that: is between the 3P agent, and a user of a voice-enabled electronic device; and is facilitated by the local agent. The contextual parameters indicate potential feature(s) of further voice input that is to be provided in response to the responsive content generated by the 3P agent.
    Type: Grant
    Filed: March 23, 2023
    Date of Patent: March 5, 2024
    Assignee: GOOGLE LLC
    Inventors: Barnaby James, Bo Wang, Sunil Vemuri, David Schairer, Ulas Kirazci, Ertan Dogrultan, Petar Aleksic
  • Patent number: 11894014
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for audio-visual speech separation. A method includes: obtaining, for each frame in a stream of frames from a video in which faces of one or more speakers have been detected, a respective per-frame face embedding of the face of each speaker; processing, for each speaker, the per-frame face embeddings of the face of the speaker to generate visual features for the face of the speaker; obtaining a spectrogram of an audio soundtrack for the video; processing the spectrogram to generate an audio embedding for the audio soundtrack; combining the visual features for the one or more speakers and the audio embedding for the audio soundtrack to generate an audio-visual embedding for the video; determining a respective spectrogram mask for each of the one or more speakers; and determining a respective isolated speech spectrogram for each speaker.
    Type: Grant
    Filed: September 22, 2022
    Date of Patent: February 6, 2024
    Assignee: Google LLC
    Inventors: Inbar Mosseri, Michael Rubinstein, Ariel Ephrat, William Freeman, Oran Lang, Kevin William Wilson, Tali Dekel, Avinatan Hassidim
  • Patent number: 11875784
    Abstract: Techniques are disclosed to optimize feature selection in generating betas for a feature dictionary of a neuro-linguistic Cognitive AI System. A machine learning engine receives a sample vector of input data to be analyzed by the neuro-linguistic Cognitive AI System. The neuro-linguistic Cognitive AI System is configured to generate multiple betas for each of a plurality of sensors. The machine learning engine identifies a sensor specified in the sample vector and selects optimization parameters for generating betas based on the identified sensor.
    Type: Grant
    Filed: November 30, 2020
    Date of Patent: January 16, 2024
    Assignee: Intellective Ai, Inc.
    Inventors: Gang Xu, Tao Yang, Ming-Jung Seow
  • Patent number: 11862180
    Abstract: A method, decoder, and program code for controlling a concealment method for a lost audio frame is provided. A first audio frame and a second audio frame of the received audio signal are decoded to obtain modified discrete cosine transform (MDCT) coefficients. Values of a first spectral shape based upon the MDCT coefficients decoded from the first audio frame decoded and values of a second spectral shape based upon MDCT coefficients decoded from the second audio frame decoded are determined, the spectral shapes each comprising a number of sub-bands. The values of the spectral shapes and frame energies of the first audio frame and second audio frame are transformed into representations of FFT based spectral analyses. A transient condition is detected based on the representations of the FFTs. Responsive to detecting the transient condition, the concealment method is modified by selectively adjusting a spectrum magnitude of a substitution frame spectrum.
    Type: Grant
    Filed: February 20, 2020
    Date of Patent: January 2, 2024
    Assignee: TELEFONAKTIEBOLAGET LM ERICSSON (PUBL)
    Inventors: Martin Sehlstedt, Jonas Svedberg
  • Patent number: 11836453
    Abstract: Aspects include methods, systems, and computer-program products providing virtual assistant domain functionality. A natural language query including one or more words is received. A collection of natural language modules is accessed. The collection natural language modules are configured to process sets of natural language queries. A natural language module, from the collection of natural language modules, is identified to interpret the natural language query. An interpretation of the natural language query is computed using the identified natural language module. A response to the natural language query is returned using the computed interpretation.
    Type: Grant
    Filed: July 22, 2021
    Date of Patent: December 5, 2023
    Assignee: SoundHound, Inc.
    Inventors: Kamyar Mohajer, Keyvan Mohajer, Bernard Mont-Reynaud, Pranav Singh
  • Patent number: 11823714
    Abstract: In exemplary embodiments of the present invention systems and methods are provided to implement and facilitate cross-fading, interstitials and other effects/processing of two or more media elements in a personalized media delivery service so that each client or user has a consistent high quality experience. The effects or crossfade processing can occur on the broadcast, publisher or server-side, but can still be personalized to a specific user, thus still allowing a personalized experience for each individual user, in a manner where the processing burden is minimized on the downstream side or client device. This approach enables a consistent user experience, independent of client device capabilities, both static and dynamic.
    Type: Grant
    Filed: February 22, 2022
    Date of Patent: November 21, 2023
    Assignee: Sirius XM Radio Inc.
    Inventors: Raymond Lowe, Christopher Ward
  • Patent number: 11823700
    Abstract: A system for removing noise from an audio signal is described. For example, noise caused by content playing in the background during a voice command or phone call may be removed from the audio signal representing the voice command or phone call. By removing noise, the signal to noise ratio of the audio signal may be improved.
    Type: Grant
    Filed: June 9, 2021
    Date of Patent: November 21, 2023
    Assignee: Comcast Cable Communications, LLC
    Inventor: George Thomas Des Jardins