Patents Examined by Brian L. Albertalli

Speech recognition using on-the-fly-constrained language model per utterance

Patent number: 11984125

Abstract: Presented herein are techniques for augmenting a speech recognition engine. According to the disclosed techniques, audio data is obtained as part of an automatic speech recognition session. Speech hints are also obtained as part of the automatic speech recognition session. A dynamic language model is generated from the speech hints for use during the automatic speech recognition session. A combined language model is then generated from the dynamic language model and a static language model. Finally, the audio data is converted to text using the combined language model as part of the automatic speech recognition session.

Type: Grant

Filed: June 29, 2021

Date of Patent: May 14, 2024

Assignee: CISCO TECHNOLOGY, INC.

Inventors: Rishabh Gupta Yadav, Kareem Nassar, Sylvain Le Groux, Matthew James Ceravolo
Audio signal processing method and system for noise mitigation of a voice signal measured by a bone conduction sensor, a feedback sensor and a feedforward sensor

Patent number: 11978468

Abstract: An audio signal processing method includes measuring a voice signal, wherein the measurement performed by an audio system including first through third sensors. Measuring the voice signal produces first through third audio signals by the first through third sensors, respectively. The audio signal processing method further includes: producing an output signal by using the first audio signal, the second audio signal and the third audio signal, wherein the output signal corresponds to: the first audio signal below a first crossing frequency, the second audio signal between the first crossing frequency and a second crossing frequency, the third audio signal above the second crossing frequency. The first crossing frequency is lower than or equal to the second crossing frequency, wherein the first crossing frequency and the second crossing frequency are different for at least some operating conditions of the audio system.

Type: Grant

Filed: April 6, 2022

Date of Patent: May 7, 2024

Assignee: Analog Devices International Unlimited Company

Inventors: Stijn Robben, Abdel Yussef Hussenbocus, Jean-Marc Luneau
Electronic device and method for controlling the electronic device

Patent number: 11967325

Abstract: Disclosed are an electronic device capable of efficiently performing speech recognition and natural language understanding and a method for controlling thereof. The electronic device includes: a microphone; a non-volatile memory configured to store virtual assistant model data comprising data that is classified according to a plurality of domains and data that is commonly used for the plurality of domains; a volatile memory; and a processor configured to: based on receiving, through the microphone, a trigger input to perform speech recognition for a user speech, initiate loading the virtual assistant model data from the non-volatile memory into the volatile memory, load, into the volatile memory, first data from among the data classified according to the plurality of domains and, while loading the first data into the volatile memory, load at least a part of the data commonly used for the plurality of domains into the volatile memory.

Type: Grant

Filed: December 30, 2022

Date of Patent: April 23, 2024

Assignee: SAMSUNG ELECTRONICS CO., LTD.

Inventors: Saebom Jang, Hyeonmok Ko, Kyenghun Lee, Kunal Sharma, Raghavendra Hanumantasetty Ramasetty
Methods and devices for generation and processing of modified audio bitstreams

Patent number: 11967330

Abstract: Described herein is a method for generating a modified bitstream on a source device, wherein the method includes the steps of: a) receiving, by a receiver, a bitstream including coded media data; b) generating, by an embedder, payload of additional media data and embedding the payload in the bitstream for obtaining, as an output from the embedder, a modified bitstream including the coded media data and the payload of the additional media data; and c) outputting the modified bitstream to a sink device. Described is further a method for processing said modified bitstream on a sink device. Described are moreover a respective source device and sink device as well as a system of a source device and a sink device and respective computer program products.

Type: Grant

Filed: August 13, 2020

Date of Patent: April 23, 2024

Assignees: DOLBY INTERNATIONAL AB, DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Christof Fersch, Daniel Fischer, Leon Terentiv, Gregory John McGarry
Systems and methods for a computerized interactive voice companion

Patent number: 11967338

Abstract: Systems and methods for a computerized interactive voice companion include functionality that receives audio of a user's voice as the user is speaking; detects a tone and/or other relevant aspects associated with the content of the user's voice based on the audio of the user's voice as the user is speaking and determines, as the user is speaking, a response to the user speaking based on the detected tone and/or other relevant aspects associated with the content of the user's voice of the user's voice. The computerized interactive voice companion system, then orally or visually provides the response to the user automatically in real-time as a reply to the user speaking. The system may then continue the conversation based on continuing to detect the mood of the user as they speak and basing responses on this, as well as other recent user behavior detected to be relevant to the conversation.

Type: Grant

Filed: October 27, 2020

Date of Patent: April 23, 2024

Assignee: DISH NETWORK TECHNOLOGIES INDIA PRIVATE LIMITED

Inventor: Rangu Kr
Control of a motion tracking system by user thereof

Patent number: 11960791

Abstract: A method for controlling a motion tracking system, including the steps of: digitally processing sound waves detected by a plurality of microphones so as to detect a voice of a user and estimate a first direction of the user; digitally processing electromagnetic waves captured by antennas so as to detect data packets transmitted to a computing apparatus by sensors and estimate second directions of each sensor; digitally averaging the second directions so as to provide an average direction for the sensors; digitally computing a difference between the first direction and the average direction; and starting to digitally track motion of the user based on measurements of each sensor when the computed difference does not exceed a predetermined difference threshold.

Type: Grant

Filed: October 29, 2020

Date of Patent: April 16, 2024

Assignee: SWORD HEALTH, S.A.

Inventors: Márcio Filipe Moutinho Colunas, José Carlos Coelho Alves, Luís António Correia de Oliveira, Luís Ungaro Pinto Coelho, Virgílio António Ferro Bento
Voice instructed machine authoring of electronic documents

Patent number: 11941345

Abstract: A computer-implemented process is programmed to process a source input, determine text enhancements, and present the text enhancements to apply to the sentences dictated from the source input. A text processor may use machine-learning models to process an audio input to generate sentences in a presentable format. An audio input can be processed by an automatic speech recognition model to generate electronic text. The electronic text may be used to generate sentence structures using a normalization model. A comprehension model may be used to identify instructions associated with the sentence structures and generate sentences based on the instructions and the sentence structures. An enhancement model may be used to identify enhancements to apply to the sentences. The enhancements may be presented alongside sentences generated by the comprehension model to provide the user an option to select either the enhancements or the sentences.

Type: Grant

Filed: October 26, 2021

Date of Patent: March 26, 2024

Assignee: Grammarly, Inc.

Inventors: Timo Mertens, Vipul Raheja, Chad Mills, Ihor Skliarevskyi, Ignat Blazhko, Robyn Perry, Nicholas Bern, Dhruv Kumar, Melissa Lopez
System and method for virtual assistant execution of ambiguous command

Patent number: 11935529

Abstract: Techniques for virtual assistant execution of ambiguous commands is provided. A voice instruction from a user may be received at a virtual assistant. The voice instruction may request the virtual assistant to perform a command. The command that is most likely being requested by the voice instruction from the user is identified. An ordered set of actions to execute when performing the command may be retrieved. Each action of the ordered set of actions may indicate if the action is reversible. Each action of the ordered set of actions may be executed in order until a not reversible action is reached or no further actions are in the ordered set of actions.

Type: Grant

Filed: June 15, 2021

Date of Patent: March 19, 2024

Assignee: MOTOROLA SOLUTIONS, INC.

Inventors: Ying Bin Tan, Chew How Lim, Yih Farn Ghee, Joe Yie Chong
Method, device, and computer readable storage medium for text-to-speech synthesis using machine learning on basis of sequential prosody feature

Patent number: 11929059

Abstract: The present disclosure relates to a text-to-speech synthesis method using machine learning based on a sequential prosody feature. The text-to-speech synthesis method includes receiving input text, receiving a sequential prosody feature, and generating output speech data for the input text reflecting the received sequential prosody feature by inputting the input text and the received sequential prosody feature to an artificial neural network text-to-speech synthesis model.

Type: Grant

Filed: August 27, 2020

Date of Patent: March 12, 2024

Assignee: NEOSAPIENCE, INC.

Inventors: Taesu Kim, Younggun Lee
Interactive voice system for conveyor control

Patent number: 11915694

Abstract: A voice control interactive system and method to provide a hands-free operation for the operator to monitor and control multiple conveyors in a warehouse. The system comprises a first computing device and a second computing device. The first computing device receives an audio signal generated by a second computing device and generates a control signal and a response signal in response to the audio signal. The audio signal comprises information relating to a verbal command spoken by an operator associated with the second computing device. The response signal comprises information relating to a response for the verbal command, wherein the information is generated based on a location of the second computing device. The control signal comprises information to control a conveyor.

Type: Grant

Filed: February 25, 2021

Date of Patent: February 27, 2024

Assignee: Intelligrated Headquarters, LLC

Inventors: Jason-David Nitzberg, Timothy R. Williams, Zachary Reott, Sang Pheng, Lori A. Pike, Jason A. Johnson, Jeffrey P. Pike
Systems and methods for utility-preserving deep reinforcement learning-based text anonymization

Patent number: 11907666

Abstract: Various embodiments of a system and associated method for anonymization of text without losing semantic utility of text by extracting a latent embedding representation of content with respect to a given task and by learning an optimal strategy for text embedding manipulation to satisfy both privacy and utility requirements are disclosed herein. In particular, the system balances private attribute obfuscation with retained semantic utility.

Type: Grant

Filed: November 16, 2021

Date of Patent: February 20, 2024

Assignee: Arizona Board of Regents on Behalf of Arizona State University

Inventors: Ahmadreza Mosallanezhad, Ghazaleh Beigi, Huan Liu
Independent control of avatar location and voice origination location within a virtual collaboration space

Patent number: 11902766

Abstract: An illustrative collaboration space provider system provides a virtual collaboration session that allows for audio communication between a user and one or more other users virtually located within a virtual collaboration space. The user is represented by an avatar located at an avatar location within the virtual collaboration space. The collaboration space provider system receives user input from the user, the user input representative of a voice origination location that is within the virtual collaboration space and is distinct from the avatar location. During the virtual collaboration session, the collaboration space provider system simulates propagation within the virtual collaboration space of a voice communication spoken by the user. The propagation of the voice communication is simulated to originate from the voice origination location and not from the avatar location. Corresponding methods and systems are also disclosed.

Type: Grant

Filed: July 30, 2021

Date of Patent: February 13, 2024

Assignee: Verizon Patent and Licensing Inc.

Inventors: Samuel Charles Mindlin, Kunal Jathal, Shan Anis, David Skuratowicz
Wake-word detection suppression

Patent number: 11900937

Abstract: Example techniques involve suppressing a wake word response to a local wake word. An example implementation involves a playback device receiving audio content for playback by the playback device and providing a sound data stream representing the received audio content to a voice assistant service (VAS) wake-word engine and a local keyword engine. The playback device plays back a first portion of the audio content and detects, via the local keyword engine, that a second portion of the received audio content includes sound data matching one or more particular local keywords. Before the second portion of the received audio content is played back, the playback device disables a local keyword response of the local keyword engine to the one or more particular local keywords and then plays back the second portion of the audio content via one or more speakers.

Type: Grant

Filed: July 1, 2022

Date of Patent: February 13, 2024

Assignee: Sonos, Inc.

Inventor: Jonathan P. Lang
Conditionally assigning various automated assistant function(s) to interaction with a peripheral assistant control device

Patent number: 11893309

Abstract: In response to a user interacting with a tangible peripheral assistant control device (e.g., depressing a button of the device), causing an automated assistant to perform one or more actions. The action(s) performed can be based on input previously provided by the user in configuring the peripheral assistant control device. The action(s) performed in response to interaction with the peripheral assistant control device can vary based on one or more conditions, such as which user is currently active, where the peripheral assistant control device is currently located (which can optionally be inferred based on which of multiple assistant computing devices the button is paired with), and/or the current state of one or more smart devices and/or other devices (e.g., as determined based on a device topology). A utility of the peripheral assistant control device can be automatically extended beyond what was specifically requested by a user during configuration.

Type: Grant

Filed: March 10, 2022

Date of Patent: February 6, 2024

Assignee: GOOGLE LLC

Inventors: Tomer Amarilio, Yuzhao Ni, Bryan Allen, Norbert Tydingco, Will Donnelly, Feng Yuan, Nathaniel Nesiba, Anurag Jain, Jacky Cheung, Ronghui Zhu, Chunya Hua, Gregory Kielian
LPC residual signal encoding/decoding apparatus of modified discrete cosine transform (MDCT)-based unified voice/audio encoding device

Patent number: 11887612

Abstract: Disclosed is an LPC residual signal encoding/decoding apparatus of an MDCT based unified voice and audio encoding device. The LPC residual signal encoding apparatus analyzes a property of an input signal, selects an encoding method of an LPC filtered signal, and encode the LPC residual signal based on one of a real filterbank, a complex filterbank, and an algebraic code excited linear prediction (ACELP).

Type: Grant

Filed: August 25, 2022

Date of Patent: January 30, 2024

Assignee: Electronics and Telecommunications Research Institute

Inventors: Seung Kwon Beack, Tae Jin Lee, Min Je Kim, Kyeongok Kang, Dae Young Jang, Jin Woo Hong, Jeongil Seo, Chieteuk Ahn, Hochong Park, Young-Cheol Park
System and method for machine-learning based extraction of information from documents

Patent number: 11886820

Abstract: A method and system are provided for training a machine-learning (ML) system/module and to provide an ML model. In one embodiment, a method includes using a labeled entities set to train a machine learning (ML) system, to obtain an ML model, and using the trained ML model to predict labels for entities in an unlabeled entities set, yielding a machine-labeled entities set. One or more individual ML models may be trained and used in this way, where each individual ML model corresponds to a respective document source. The document sources can be identified via classification of a corpus of documents. The prediction of labels provides a respective confidence score for each machine-labeled entity. The method also includes selecting from the machine-labeled entities set, a subset of machine-labeled entities having a respective confidence score at least equal to a threshold confidence score; and updating the labeled entities set by adding thereto the selected subset of machine-labeled entities.

Type: Grant

Filed: October 6, 2020

Date of Patent: January 30, 2024

Assignee: Genpact Luxembourg S.à r.l. II

Inventors: Sreekanth Menon, Prakash Selvakumar, Sudheesh Sudevan
Protection against voice misappropriation in a voice interaction system

Patent number: 11881218

Abstract: Prevention of voice misappropriation in voice interaction/response systems. The system relies on telemetry data, including thermal data of components to determine whether a received voice command was made by actual voice. If the voice command is determined to have been made by an actual voice, a response to the command is generated and transmitted, otherwise if the voice command is determined to have likely not been made by an actual voice (e.g., artificial means replicating a voice, such as a laser or the like), no response to the command is transmitted or action taken with respect to the command.

Type: Grant

Filed: July 12, 2021

Date of Patent: January 23, 2024

Assignee: BANK OF AMERICA CORPORATION

Inventor: Steven Mark DiMaria
Audio encoder for encoding a multichannel signal and audio decoder for decoding an encoded audio signal

Patent number: 11881225

Abstract: Audio encoder for encoding a multichannel signal is shown. The audio encoder includes a downmixer for downmixing the multichannel signal to obtain a downmix signal, a linear prediction domain core encoder for encoding the downmix signal, wherein the downmix signal has a low band and a high band, wherein the linear prediction domain core encoder is configured to apply a bandwidth extension processing for parametrically encoding the high band, a filterbank for generating a spectral representation of the multichannel signal, and a joint multichannel encoder configured to process the spectral representation including the low band and the high band of the multichannel signal to generate multichannel information.

Type: Grant

Filed: January 13, 2022

Date of Patent: January 23, 2024

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventors: Sascha Disch, Guillaume Fuchs, Emmanuel Ravelli, Christian Neukam, Konstantin Schmidt, Conrad Benndorf, Andreas Niedermeier, Benjamin Schubert, Ralf Geiger
Generating author vectors

Patent number: 11868724

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating author vectors. One of the methods includes obtaining a set of sequences of words, the set of sequences of words comprising a plurality of first sequences of words and, for each first sequence of words, a respective second sequence of words that follows the first sequence of words, wherein each first sequence of words and each second sequence of words has been classified as being authored by a first author; and training a neural network system on the first sequences and the second sequences to determine an author vector for the first author, wherein the author vector characterizes the first author.

Type: Grant

Filed: March 14, 2022

Date of Patent: January 9, 2024

Assignee: GOOGLE LLC

Inventors: Quoc V. Le, Brian Patrick Strope
Generative language model for few-shot aspect-based sentiment analysis

Patent number: 11853706

Abstract: Sentiment analysis is a task in natural language processing. The embodiments are directed to using a generative language model to extract an aspect term, aspect category and their corresponding polarities. The generative language model may be trained as a single, joint, and multi-task model. The single-task generative language model determines a term polarity from the aspect term in the sentence or a category polarity from an aspect category in the sentence. The joint-task generative language model determines both the aspect term and the term polarity or the aspect category and the category polarity. The multi-task generative language model determines the aspect term, term polarity, aspect category and category polarity of the sentence.

Type: Grant

Filed: September 8, 2021

Date of Patent: December 26, 2023

Assignee: salesforce.com, inc.

Inventors: Ehsan Hosseini-Asl, Wenhao Liu

1 2 3 4 5 … next