Patents Examined by Michael N. Opsasnick

Two-dimensional smoothing of post-filter masks

Patent number: 11282531

Abstract: A method includes receiving multiple samples of time-domain data that includes noise, computing a first two-dimensional (2D) time-frequency representation of the time domain data, and processing the first time-frequency representation using a time-frequency noise reduction mask to generate a second, noise-reduced time-frequency representation of the time domain data. The method also includes generating a time domain output based on the noise-reduced time-frequency representation.

Type: Grant

Filed: February 3, 2020

Date of Patent: March 22, 2022

Assignee: Bose Corporation

Inventors: Ankita D. Jain, Cristian Marius Hera, Elie Bou Daher
Text-to-speech modeling

Patent number: 11282524

Abstract: A device may receive a set of audio data files corresponding to a set of calls, wherein the set of audio data files includes digital representations of one or more segments of respective calls of the set of calls, and wherein the set of calls includes audio data relating to a particular industry. The device may receive a set of transcripts corresponding to the set of audio data files. The device may determine a plurality of text-audio pairs within the set of calls, wherein a text-audio pair, of the plurality of text-audio pairs, comprises: a digital representation of a segment a call of the set of calls, and a corresponding excerpt of text from the set of transcripts. The device may train, using a machine learning process, an industry-specific text-to-speech model, tailored for the particular industry, based on the plurality of text-audio pairs.

Type: Grant

Filed: September 25, 2020

Date of Patent: March 22, 2022

Assignee: Capital One Services, LLC

Inventor: Abhishek Dube
Method and device for processing audio signal using audio filter having non-linear characteristics to prevent receipt of echo signal

Patent number: 11276414

Abstract: An electronic device includes an audio input module, an audio output module, and a processor. The processor is configured to provide a first signal and a second signal into which a first audio signal is processed, output the first audio signal through the audio output module, acquire an external audio signal comprising the first audio signal of the electronic device, acquire a first output value through a first input channel of an audio filter, acquire a second output value through a second input channel of the audio filter, and provide a second audio signal, based at least on a first difference value between the magnitude value corresponding to the first frequency of the external audio signal and the first output value and a second difference value between the magnitude value corresponding to the second frequency of the external audio signal and the second output value.

Type: Grant

Filed: August 30, 2018

Date of Patent: March 15, 2022

Assignee: Samsung Electronics Co., Ltd.

Inventors: Jaemo Yang, Hangil Moon, Soonho Baek, Beak-Kwon Son, Kiho Cho, Chulmin Choi
Soft decision audio decoding system

Patent number: 11262975

Abstract: A soft decision audio decoding system for preserving audio continuity in a digital wireless audio receiver is provided that deduces the likelihood of errors in a received digital signal, based on generated hard bits and soft bits. The soft bits may be utilized by a soft audio decoder to determine whether the digital signal should be decoded or muted. The soft bits may be generated based on a degree of closeness of a detected phase trajectory to known legal phase trajectories determined from the running the phase trajectory through a soft-output Viterbi algorithm. The value of the soft bits may indicate confidence in the strength of the hard bit generation. The soft decision audio decoding system may infer errors and decode perceptually acceptable audio without requiring error detection, as in conventional systems, as well as have low latency and improved granularity.

Type: Grant

Filed: June 5, 2020

Date of Patent: March 1, 2022

Assignee: Shure Acquisition Holdings, Inc.

Inventor: Robert Mamola
Device identification through dialog

Patent number: 11257490

Abstract: Particular embodiments described herein provide for an electronic device that can be configured to receive a verbal command to active a device with an unknown label, derive a probable device and a label for the probable device, activate the probable device, determine that the activated probable device is the same device to be activated by the verbal command, and store the label and a description for the device. In some examples, the label is associated with the description.

Type: Grant

Filed: April 1, 2016

Date of Patent: February 22, 2022

Assignee: Intel Corporation

Inventors: Robert James Firby, Jesus Gonzalez Marti, Jose Gabriel De Amores Carredano, Martin Henk Van Den Berg, Maria Pilar Manchon Portillo, Guillermo Perez, Steven Thomas Holmes
Speech coding using content latent embedding vectors and speaker latent embedding vectors

Patent number: 11257507

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating discrete latent representations of input audio data. Only the discrete latent representation needs to be transmitted from an encoder system to a decoder system in order for the decoder system to be able to effectively to decode, i.e., reconstruct, the input audio data.

Type: Grant

Filed: January 17, 2020

Date of Patent: February 22, 2022

Assignee: DeepMind Technologies Limited

Inventors: Cristina Garbacea, Aaron Gerard Antonius van den Oord, Yazhe Li, Sze Chie Lim, Alejandro Luebs, Oriol Vinyals, Thomas Chadwick Walters
Natural language processing with an N-gram machine

Patent number: 11256866

Abstract: The present disclosure provides systems and methods that perform machine-learned natural language processing. A computing system can include a machine-learned natural language processing model that includes an encoder model trained to receive a natural language text body and output a knowledge graph and a programmer model trained to receive a natural language question and output a program. The computing system can include a computer-readable medium storing instructions that, when executed, cause the processor to perform operations. The operations can include obtaining the natural language text body, inputting the natural language text body into the encoder model, receiving, as an output of the encoder model, the knowledge graph, obtaining the natural language question, inputting the natural language question into the programmer model, receiving the program as an output of the programmer model, and executing the program on the knowledge graph to produce an answer to the natural language question.

Type: Grant

Filed: October 25, 2017

Date of Patent: February 22, 2022

Assignee: Google LLC

Inventors: Ni Lao, Jiazhong Nie, Fan Yang
Audio quality enhancement system

Patent number: 11250874

Abstract: A language proficiency analyzer automatically evaluates a person's language proficiency by analyzing that person's oral communications with another person. The analyzer first enhances the quality of an audio recording of a conversation between the two people using a neural network that automatically detects loss features in the audio and adds those loss features back into the audio. The analyzer then performs a textual and audio analysis on the improved audio. Through textual analysis, the analyzer uses a multi-attention network to determine how focused one person is on the other and how pleased one person is with the other. Through audio analysis, the analyzer uses a neural network to determine how well one person pronounced words during the conversation.

Type: Grant

Filed: May 21, 2020

Date of Patent: February 15, 2022

Assignee: Bank of America Corporation

Inventors: MadhuSudhanan Krishnamoorthy, Harikrishnan Rajeev
Predictive injection of conversation fillers for assistant systems

Patent number: 11245646

Abstract: In one embodiment, a method includes, by one or more computing systems, receiving, from a client system associated with a first user, a first user input from the first user, identifying one or more entities referenced by the first user input, determining a classification of the first user input based on a machine-learning classifier model, generating several candidate conversational fillers based on the classification of the first user input and the one or more identified entities, wherein each candidate conversational filler references at least one of the one or more identified entities, ranking the candidate conversational fillers based on a relevancy of the candidate conversational filler to the first user input and a decay model hysteresis, and sending instructions for presenting a top-ranked candidate conversational filler as an initial response to the first user.

Type: Grant

Filed: November 15, 2018

Date of Patent: February 8, 2022

Assignee: Facebook, Inc.

Inventors: Emmanouil Koukoumidis, Michael Robert Hanson, Mohsen M Agsen
Alternative output response based on context

Patent number: 11240057

Abstract: One embodiment provides a method in which an audible command is first received from a user. Subsequent to receipt of the audible command, one or more sensors may detect certain contextual data associated with the user's physical surroundings. An embodiment may then determine, using the data, whether a default output response associated with the audible command is appropriate with respect to the user's physical surroundings. If the default output response is determined not to be appropriate, an embodiment may thereafter provide an alternative output response that is appropriate with respect to the user's surroundings. Other aspects are described and claimed.

Type: Grant

Filed: March 15, 2018

Date of Patent: February 1, 2022

Assignee: Lenovo (Singapore) Pte. Ltd.

Inventors: John Carl Mese, Russell Speight VanBlon, Nathan J. Peterson
System and method of video capture and search optimization for creating an acoustic voiceprint

Patent number: 11227603

Abstract: Systems and method of diarization of audio files use an acoustic voiceprint model. A plurality of audio files are analyzed to arrive at an acoustic voiceprint model associated to an identified speaker. Metadata associate with an audio file is used to select an acoustic voiceprint model. The selected acoustic voiceprint model is applied in a diarization to identify audio data of the identified speaker.

Type: Grant

Filed: April 14, 2020

Date of Patent: January 18, 2022

Assignee: Verint Systems Ltd.

Inventors: Omer Ziv, Ran Achituv, Ido Shapira, Jeremie Dreyfuss
Transliteration of text entry across scripts

Patent number: 11227110

Abstract: Embodiments are disclosed for transliterating text entries across different script systems. A method according to some embodiments includes steps of: receiving an input string in a first script system input using a keyboard; segmenting, using a probabilistic model, the input string into phonemes that correspond to characters or sets of characters in a second script system; converting the phonemes in the first script system into the characters or sets of characters in the second script system, the characters or sets of characters forming a word or a word prefix in the second script system; and outputting the word or the word prefix in the second script system.

Type: Grant

Filed: March 27, 2020

Date of Patent: January 18, 2022

Assignee: FACEBOOK, INC.

Inventors: Juan Miguel Pino, Stanislav Funiak, Mridul Malpani, Gaurav Lochan
Systems and methods for generating a response based on task-independent conversational responses or task-specific responses

Patent number: 11227581

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating responses using task-independent conversational systems are provided. In one example method, a response to a user text input is generated by updating a state of the conversation based on the user text input, generating a conversational (task-independent) output, and determining whether to provide a conversational response based on the conversational output, or to additionally generate a task-specific output.

Type: Grant

Filed: May 8, 2019

Date of Patent: January 18, 2022

Assignee: botbotbotbot Inc.

Inventor: Antoine Raux
Text translation using contextual information related to text objects in translated language

Patent number: 11222373

Abstract: In an example embodiment, text is received at an ecommerce service from a first user, the text in a first language and pertaining to a first listing on the ecommerce service. Contextual information about the first listing may be retrieved. The text may be translated to a second language. Then, a plurality of text objects, in the second language, similar to the translated text may be located in a database, each of the text objects corresponding to a listing. Then, the plurality of text objects similar to the translated text may be ranked based on a comparison of the contextual information about the first listing and contextual information stored in the database for the listings corresponding to the plurality of text objects similar to the translated text. At least one of the ranked plurality of text objects may then be translated to the first language.

Type: Grant

Filed: September 24, 2019

Date of Patent: January 11, 2022

Assignee: eBay Inc.

Inventor: Yan Chelly
Systems and methods for training devices to recognize sound patterns

Patent number: 11222625

Abstract: Systems and methods for training a control panel to recognize user defined and preprogrammed sound patterns are provided. Such systems and methods can include the control panel operating in a learning mode, receiving initial ambient audio from a region, and saving the initial ambient audio as an audio pattern in a memory device of the control panel. Such systems and methods can also include the control panel operating in an active mode, receiving subsequent ambient audio from the region, using an audio classification model to make an initial determination as to whether the subsequent ambient audio matches or is otherwise consistent with the audio pattern, determining whether the initial determination is correct, and when the control panel determines that the initial determination is incorrect, modifying or updating the audio classification model for improving the accuracy in detecting future consistency with the audio pattern.

Type: Grant

Filed: April 15, 2019

Date of Patent: January 11, 2022

Assignee: Ademco Inc.

Inventors: Pradyumna Sampath, Ramprasad Yelchuru, Purnaprajna R. Mangsuli
Text entity extraction method for extracting text from target text based on combination probabilities of segmentation combination of text entities in the target text, apparatus, and device, and storage medium

Patent number: 11222178

Abstract: A text entity extraction method, apparatus, and storage medium are provided. The method includes determining candidate text entities in a target text. Portions of the candidate text entities are combined to generate candidate segmentation combinations corresponding to the target text, the candidate text entities in each candidate segmentation combination being different. A combination probability corresponding to each candidate segmentation combination is calculated, where the combination probability is a probability that grammar is correct when the target text uses the candidate segmentation combination. A target segmentation combination corresponding to the target text is determined according to the combination probabilities. A text entity is extracted from the target text according to the target segmentation combination.

Type: Grant

Filed: May 29, 2019

Date of Patent: January 11, 2022

Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LTD

Inventors: Hengyao Bao, Ke Su, Yi Chen, Mengliang Rao
Using a virtual assistant to store a personal voice memo and to obtain a response based on a stored personal voice memo that is retrieved according to a received query

Patent number: 11211064

Abstract: The technology disclosed relates to retrieving a personal memo from a database. The method includes receiving, by a virtual assistant, a natural language utterance that expresses a request, interpreting the natural language utterance according to a natural language grammar rule for retrieving memo data from the natural language utterance, the natural language grammar rule recognizing query information, responsive to interpreting the natural language utterance, using the query information to query the database for a memo related to the query information, and providing, to a user, a response generated in dependence upon the memo related to the query information.

Type: Grant

Filed: January 23, 2019

Date of Patent: December 28, 2021

Assignee: SoundHound, Inc.

Inventors: Mara Selvaggi, Irina A Spiridonova, Karl Stahl
Dynamic gazetteers for personalized entity recognition

Patent number: 11189277

Abstract: In speech processing systems personalization is added in the Natural Language Understanding (NLU) processor by incorporating external knowledge sources of user information to improve entity recognition performance of the speech processing system. Personalization in the NLU is effected by incorporating one or more dictionaries of entries, or gazetteers, with information personal to a respective user, that provide the user's information to permit disambiguation of semantic interpretation for input utterances to improve quality of speech processing results.

Type: Grant

Filed: February 13, 2019

Date of Patent: November 30, 2021

Assignee: Amazon Technologies, Inc.

Inventors: Imre Attila Kiss, Arthur Richard Toth, Lambert Mathias
Systems and methods for voice-based initiation of custom device actions

Patent number: 11183182

Abstract: Systems and methods for enabling voice-based interactions with electronic devices can include a data processing system maintaining a plurality of device action data sets and a respective identifier for each device action data set. The data processing system can receive, from an electronic device, an audio signal representing a voice query and an identifier. The data processing system can identify, using the identifier, a device action data set. The data processing system can identify a device action from device action data set based on content of the audio signal. The data processing system can then identify, from the device action dataset, a command associated with the device action and send the command to the for execution device for execution.

Type: Grant

Filed: August 22, 2018

Date of Patent: November 23, 2021

Assignee: GOOGLE LLC

Inventors: Bo Wang, Venkat Kotla, Chad Yoshikawa, Chris Ramsdale, Pravir Gupta, Alfonso Gomez-Jordana, Kevin Yeun, Jae Won Seo, Lantian Zheng, Sang Soo Sung
Music service selection

Patent number: 11184704

Abstract: Methods and apparatus for identifying a music service based on a user command. A content type is identified from a received user command and a music service is selected that supports the content type. A selected music service can then transmit audio content associated with the content type for playback.

Type: Grant

Filed: February 3, 2020

Date of Patent: November 23, 2021

Assignee: Sonos, Inc.

Inventors: Simon Jarvis, Mark Plagge, Christopher Butts

prev … 2 3 4 5 6 7 8 9 10 … next