Patents Examined by Shaun A Roberts
-
Patent number: 11989524Abstract: The present invention solves difficulties in constructing a dialogue corpus, ensures the accuracy of a system utterance, and evaluates a user utterance in a dialogue technology for language learning and a knowledge-grounded dialogue technology, in which a system and method is capable of helping a learner in language learning by constructing a language learning dialogue corpus using passages and exercises commonly used in language education and learning sites, training a dialogue model and a dialogue evaluation model with the language learning dialogue corpus, and allowing a user and a system to have a dialogue on the basis of a given passage. It is expected that it will be possible to implement a dialogue system for language learning that is capable of performing evaluation and easily expanding a domain (expansion of learning content).Type: GrantFiled: October 19, 2021Date of Patent: May 21, 2024Assignee: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTEInventor: Jinxia Huang
-
Patent number: 11990126Abstract: A method is implemented to move media content display between two media output devices. A server system determines in a voice message recorded by an electronic device a media transfer request that includes a user voice command to transfer media content to a destination media output device and a user voice designation of the destination media output device. The server system then obtains from a source cast device instant media play information including information of a media play application, the media content that is being played, and a temporal position. The server system further identifies a destination cast device associated in a user domain coupled to the destination media output device, and sends to the destination cast device a media play request including the instant media play information, thereby enabling the destination cast device to execute the media play application for playing the media content from the temporal location.Type: GrantFiled: May 23, 2022Date of Patent: May 21, 2024Assignee: Google LLCInventors: Raunaq Shah, Matt Van Der Staay
-
Patent number: 11978460Abstract: A method, system, and computer program to encode and decode a channel coherence parameter applied on a frequency band basis, where the coherence parameters of each frequency band form a coherence vector. The coherence vector is encoded and decoded using a predictive scheme followed by a variable bit rate entropy coding.Type: GrantFiled: August 3, 2022Date of Patent: May 7, 2024Assignee: TELEFONAKTIEBOLAGET LM ERICSSON (PUBL)Inventors: Erik Norvell, Fredrik Jansson
-
Patent number: 11978439Abstract: Speech recognition may be improved by generating and using a topic specific language model. A topic specific language model may be created by performing an initial pass on an audio signal using a generic or basis language model. A speech recognition device may then determine topics relating to the audio signal based on the words identified in the initial pass and retrieve a corpus of text relating to those topics. Using the retrieved corpus of text, the speech recognition device may create a topic specific language model. In one example, the speech recognition device may adapt or otherwise modify the generic language model based on the retrieved corpus of text.Type: GrantFiled: December 20, 2022Date of Patent: May 7, 2024Assignee: TiVo CorporationInventors: David F. Houghton, Seth Michael Murray, Sibley Verbeck Simon
-
Patent number: 11977617Abstract: Provided is to prevent a false determination due to an attachment condition of an apparatus that transmits and receives an acoustic signal, and perform accurate personal authentication. A personal authentication device includes: a personal authentication means that authenticates an individual by using first information at least including an acoustic characteristic calculated from an acoustic signal propagating through the head of the user, which is detected by an apparatus being attached on a head of a user for transmitting and receiving the acoustic signal, and a feature amount extracted from the acoustic characteristic; an attachment trouble rule storage means that stores an attachment trouble rule for detecting an attachment trouble with the apparatus; and an attachment trouble detection means that detects a trouble with an attachment state of the apparatus when the first information satisfies the attachment trouble rule.Type: GrantFiled: November 29, 2022Date of Patent: May 7, 2024Assignee: NEC CORPORATIONInventors: Takayuki Arakawa, Takafumi Koshinaka
-
Patent number: 11978463Abstract: A stereo signal encoding method includes: obtaining a residual signal encoding parameter of a current frame of a stereo signal based on downmixed signal energy and residual signal energy of each of M sub-bands of the current frame, where the residual signal encoding parameter indicates whether to encode residual signals of the M sub-bands; determining whether to encode the residual signals based on the residual signal encoding parameter; and encoding the residual signals when it is determined that the residual signals need to be encoded.Type: GrantFiled: August 11, 2022Date of Patent: May 7, 2024Assignee: HUAWEI TECHNOLOGIES CO., LTD.Inventors: Bin Wang, Zexin Liu, Haiting Li
-
Patent number: 11978446Abstract: Systems and methods may be used to detect an inaudible signal associated with a first audible signal of an audio input. The inaudible signal may include a frequency signature. The frequency signature may be associated with an electronic device type. The systems and methods may activate a response monitor. The response monitor may be activated for a predetermined time. The response monitor may be activated responsive to the frequency signature. The systems and methods may determine a content characteristic of the first audible signal based on the inaudible signal. The systems and methods may include generating a message. The message may be based on the content characteristic. The systems and methods may include transmitting the message. The message may be transmitted on a condition that a second audible signal corresponds to the message and is received within the predetermined time.Type: GrantFiled: April 20, 2021Date of Patent: May 7, 2024Assignee: Rovi Guides, Inc.Inventors: David D. Shoop, Dylan M. Wondra
-
Patent number: 11972767Abstract: Methods and systems for improving signal processing by smoothing the covariance matrix of a multi-channel signal by setting a forgetting factor based on the bins of a band. A method and system for resetting the smoothing based on transient detection is also disclosed. A method and system for resampling for the smoothing during a banding transition is also disclosed.Type: GrantFiled: July 31, 2020Date of Patent: April 30, 2024Assignee: Dolby Laboratories Licensing CorporationInventors: David S. McGrath, Stefanie Brown, Juan Felix Torres
-
Patent number: 11972227Abstract: A speech translation system and methods for cross-lingual communication that enable users to improve and customize content and usage of the system and easily. The methods include, in response to receiving an utterance including a first term associated with a field, translating the utterance into a second language. In response to receiving an indication to add the first term associated with the field to a first recognition lexicon, adding the first term associated with the field and the determined translation to a first machine translation module and to a shared database for a community associated with the field of the first term associated with the field, wherein the first term associated with the field added to the shared database is accessible by the community.Type: GrantFiled: December 7, 2021Date of Patent: April 30, 2024Assignee: Meta Platforms, Inc.Inventors: Alexander Waibel, Ian R. Lane
-
Patent number: 11960845Abstract: Embodiments relate to decoding communications with token sky maps. At least one electronic communication including emoticons having a non-original meaning is received. A candidate meaning is determined for the emoticons having the non-original meaning in the at least one electronic communication based at least in part on token neighborhood distribution structures. The candidate meaning for the emoticons having the non-original meaning is caused to be displayed on at least one device.Type: GrantFiled: October 20, 2021Date of Patent: April 16, 2024Assignee: International Business Machines CorporationInventors: Ziqiumin Wang, Qing Lu, Wei Jun Zheng, Xiao Feng Ji, Yuan Jin
-
Patent number: 11954448Abstract: Embodiments of the present disclosure include systems and methods for determining position values for training data that is used to train transformer models. In some embodiments, a set of input data for training a transformer model is received. The set of input data comprises a set of tokens. Based on an offset value, a set of successive position values for the set of tokens is determined. Each position value in the set of successive position values represents a position of a token in the set of tokens relative to other tokens in the set of tokens. A set of training data is generated to comprise the set of tokens and the set of successive position values. The transformer model is trained using the set of training data.Type: GrantFiled: July 21, 2020Date of Patent: April 9, 2024Assignee: Microsoft Technology Licensing, LLCInventors: Andy Wagner, Tiyasa Mitra, Marc Tremblay
-
Patent number: 11935535Abstract: An electronic device configures a device-agnostic voice assistant library for execution on the electronic device based on the electronic device having a first device type. The electronic device also selects an implementation for the voice assistant library. After the configuring, the electronic device receives a verbal input from a user. It extracts request information from the verbal input by processing the verbal input using the voice assistant library executing on the device. It transmits a request to a remote system, the request including the extracted request information. The electronic device receives a response to the request. The response is generated by the remote system in accordance with the extracted request information. The electronic device performs an operation in accordance with the response by one or more voice processing modules of the configured voice assistant library.Type: GrantFiled: June 3, 2022Date of Patent: March 19, 2024Assignee: Google LLCInventors: Kenneth Mixter, Raunaq Shah
-
Patent number: 11935555Abstract: The present document relates to audio source coding systems which make use of a harmonic transposition method for high frequency reconstruction (HFR), as well as to digital effect processors, e.g. exciters, where generation of harmonic distortion add brightness to the processed signal, and to time stretchers where a signal duration is prolonged with maintained spectral content. A system and method configured to generate a time stretched and/or frequency transposed signal from an input signal is described. The system comprises an analysis filterbank configured to provide an analysis subband signal from the input signal; wherein the analysis subband signal comprises a plurality of complex valued analysis samples, each having a phase and a magnitude. Furthermore, the system comprises a subband processing unit configured to determine a synthesis subband signal from the analysis subband signal using a subband transposition factor Q and a subband stretch factor S.Type: GrantFiled: March 30, 2023Date of Patent: March 19, 2024Assignee: DOLBY INTERNATIONAL ABInventor: Lars Villemoes
-
Patent number: 11929076Abstract: Disclosed speech recognition techniques improve user-perceived latency while maintaining accuracy by: receiving an audio stream, in parallel, by a primary (e.g., accurate) speech recognition engine (SRE) and a secondary (e.g., fast) SRE; generating, with the primary SRE, a primary result; generating, with the secondary SRE, a secondary result; appending the secondary result to a word list; and merging the primary result into the secondary result in the word list. Combining output from the primary and secondary SREs into a single decoder as described herein improves user-perceived latency while maintaining or improving accuracy, among other advantages.Type: GrantFiled: December 1, 2022Date of Patent: March 12, 2024Assignee: Microsoft Technology Licensing, LLC.Inventors: Hosam Adel Khalil, Emilian Stoimenov, Christopher Hakan Basoglu, Kshitiz Kumar, Jian Wu
-
Patent number: 11922941Abstract: An electronic device stores a voice assistant library for execution on the electronic device based on the electronic device having a first device type. The electronic device receives a verbal input from a user. It extracts request information from the verbal input by processing the verbal input using the voice assistant library executing on the device. It transmits a request to a remote system. The electronic device receives a response to the request. The response is generated by the remote system. The electronic device performs an operation in accordance with the response by one or more voice-processing modules of the configured voice assistant library.Type: GrantFiled: July 25, 2023Date of Patent: March 5, 2024Assignee: Google LLCInventors: Kenneth Mixter, Raunaq Shah
-
Patent number: 11922945Abstract: Implementations relate to dynamically, and in a context-sensitive manner, biasing voice to text conversion. In some implementations, the biasing of voice to text conversions is performed by a voice to text engine of a local agent, and the biasing is based at least in part on content provided to the local agent by a third-party (3P) agent that is in network communication with the local agent. In some of those implementations, the content includes contextual parameters that are provided by the 3P agent in combination with responsive content generated by the 3P agent during a dialog that: is between the 3P agent, and a user of a voice-enabled electronic device; and is facilitated by the local agent. The contextual parameters indicate potential feature(s) of further voice input that is to be provided in response to the responsive content generated by the 3P agent.Type: GrantFiled: March 23, 2023Date of Patent: March 5, 2024Assignee: GOOGLE LLCInventors: Barnaby James, Bo Wang, Sunil Vemuri, David Schairer, Ulas Kirazci, Ertan Dogrultan, Petar Aleksic
-
Patent number: 11894014Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for audio-visual speech separation. A method includes: obtaining, for each frame in a stream of frames from a video in which faces of one or more speakers have been detected, a respective per-frame face embedding of the face of each speaker; processing, for each speaker, the per-frame face embeddings of the face of the speaker to generate visual features for the face of the speaker; obtaining a spectrogram of an audio soundtrack for the video; processing the spectrogram to generate an audio embedding for the audio soundtrack; combining the visual features for the one or more speakers and the audio embedding for the audio soundtrack to generate an audio-visual embedding for the video; determining a respective spectrogram mask for each of the one or more speakers; and determining a respective isolated speech spectrogram for each speaker.Type: GrantFiled: September 22, 2022Date of Patent: February 6, 2024Assignee: Google LLCInventors: Inbar Mosseri, Michael Rubinstein, Ariel Ephrat, William Freeman, Oran Lang, Kevin William Wilson, Tali Dekel, Avinatan Hassidim
-
Patent number: 11875784Abstract: Techniques are disclosed to optimize feature selection in generating betas for a feature dictionary of a neuro-linguistic Cognitive AI System. A machine learning engine receives a sample vector of input data to be analyzed by the neuro-linguistic Cognitive AI System. The neuro-linguistic Cognitive AI System is configured to generate multiple betas for each of a plurality of sensors. The machine learning engine identifies a sensor specified in the sample vector and selects optimization parameters for generating betas based on the identified sensor.Type: GrantFiled: November 30, 2020Date of Patent: January 16, 2024Assignee: Intellective Ai, Inc.Inventors: Gang Xu, Tao Yang, Ming-Jung Seow
-
Patent number: 11862180Abstract: A method, decoder, and program code for controlling a concealment method for a lost audio frame is provided. A first audio frame and a second audio frame of the received audio signal are decoded to obtain modified discrete cosine transform (MDCT) coefficients. Values of a first spectral shape based upon the MDCT coefficients decoded from the first audio frame decoded and values of a second spectral shape based upon MDCT coefficients decoded from the second audio frame decoded are determined, the spectral shapes each comprising a number of sub-bands. The values of the spectral shapes and frame energies of the first audio frame and second audio frame are transformed into representations of FFT based spectral analyses. A transient condition is detected based on the representations of the FFTs. Responsive to detecting the transient condition, the concealment method is modified by selectively adjusting a spectrum magnitude of a substitution frame spectrum.Type: GrantFiled: February 20, 2020Date of Patent: January 2, 2024Assignee: TELEFONAKTIEBOLAGET LM ERICSSON (PUBL)Inventors: Martin Sehlstedt, Jonas Svedberg
-
Patent number: 11836453Abstract: Aspects include methods, systems, and computer-program products providing virtual assistant domain functionality. A natural language query including one or more words is received. A collection of natural language modules is accessed. The collection natural language modules are configured to process sets of natural language queries. A natural language module, from the collection of natural language modules, is identified to interpret the natural language query. An interpretation of the natural language query is computed using the identified natural language module. A response to the natural language query is returned using the computed interpretation.Type: GrantFiled: July 22, 2021Date of Patent: December 5, 2023Assignee: SoundHound, Inc.Inventors: Kamyar Mohajer, Keyvan Mohajer, Bernard Mont-Reynaud, Pranav Singh