Patents by Inventor Monika Almudafar-Depeyrot

Monika Almudafar-Depeyrot has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Text-to-speech adapted by machine learning

Patent number: 11531819

Abstract: Machine learned models take in vectors representing desired behaviors and generate voice vectors that provide the parameters for text-to-speech (TTS) synthesis. Models may be trained on behavior vectors that include user profile attributes, situational attributes, or semantic attributes. Situational attributes may include age of people present, music that is playing, location, noise, and mood. Semantic attributes may include presence of proper nouns, number of modifiers, emotional charge, and domain of discourse. TTS voice parameters may apply per utterance and per word so as to enable contrastive emphasis.

Type: Grant

Filed: January 14, 2020

Date of Patent: December 20, 2022

Assignee: SoundHound, Inc.

Inventors: Bernard Mont-Reynaud, Monika Almudafar-Depeyrot
Text-to-Speech Adapted by Machine Learning

Publication number: 20220148566

Abstract: Machine learned models take in vectors representing desired behaviors and generate voice vectors that provide the parameters for text-to-speech (TTS) synthesis. Models may be trained on behavior vectors that include user profile attributes, situational attributes, or semantic attributes. Situational attributes may include age of people present, music that is playing, location, noise, and mood. Semantic attributes may include presence of proper nouns, number of modifiers, emotional charge, and domain of discourse. TTS voice parameters may apply per utterance and per word so as to enable contrastive emphasis.

Type: Application

Filed: January 20, 2022

Publication date: May 12, 2022

Applicant: SoundHound, Inc.

Inventors: Bernard Mont-Reynaud, Monika Almudafar-Depeyrot
SYSTEM AND METHOD FOR A LANGUAGE UNDERSTANDING CONVERSATIONAL SYSTEM

Publication number: 20200410983

Abstract: A virtual assistant device recognizes multiple wake-up phrases. In response to a particular wake-up phrase the device sends speech audio to either a default or a third party virtual assistant server. A virtual assistant server can receive speech audio and an indication of which of multiple wake-up phrases was used and, accordingly, send the speech audio, or text recognized from the speech audio using automatic speech recognition, to a third party server. A response from the third party server can be voice audio or text for the virtual assistant server to synthesize distinctively corresponding to the wake-up phrase.

Type: Application

Filed: September 16, 2020

Publication date: December 31, 2020

Applicant: SoundHound, Inc.

Inventors: Keyvan Mohajer, Mark Stevans, Monika Almudafar-Depeyrot
Integration of third party virtual assistants

Patent number: 10783872

Abstract: A speech-enabled dialog system responds to a plurality of wake-up phrases. Based on which wake-up phrase is detected, the system's configuration is modified accordingly. Various configurable aspects of the system include selection and morphing of a text-to-speech voice; configuration of acoustic model, language model, vocabulary, and grammar; configuration of a graphic animation; configuration of virtual assistant personality parameters; invocation of a particular user profile; invocation of an authentication function; and configuration of an open sound. Configuration depends on a target market segment. Configuration also depends on the state of the dialog system, such as whether a previous utterance was an information query.

Type: Grant

Filed: January 13, 2019

Date of Patent: September 22, 2020

Assignee: SoundHound, Inc.

Inventors: Monika Almudafar-Depeyrot, Keyvan Mohajer, Mark Stevans
Text-to-Speech Adapted by Machine Learning

Publication number: 20200151394

Abstract: Machine learned models take in vectors representing desired behaviors and generate voice vectors that provide the parameters for text-to-speech (TTS) synthesis. Models may be trained on behavior vectors that include user profile attributes, situational attributes, or semantic attributes. Situational attributes may include age of people present, music that is playing, location, noise, and mood. Semantic attributes may include presence of proper nouns, number of modifiers, emotional charge, and domain of discourse. TTS voice parameters may apply per utterance and per word so as to enable contrastive emphasis.

Type: Application

Filed: January 14, 2020

Publication date: May 14, 2020

Applicant: SoundHound, Inc.

Inventors: Bernard Mont-Reynaud, Monika Almudafar-Depeyrot
Parametric adaptation of voice synthesis

Patent number: 10586079

Abstract: Software-based systems perform parametric speech synthesis. TTS voice parameters determine the generated speech audio. Voice parameters include gender, age, dialect, donor, arousal, authoritativeness, pitch, range, speech rate, volume, flutter, roughness, breath, frequencies, bandwidths, and relative amplitudes of formants and nasal sounds. The system chooses TTS parameters based on one or more of: user profile attributes including gender, age, and dialect; situational attributes such as location, noise level, and mood; natural language semantic attributes such as domain of conversation, expression type, dimensions of affect, word emphasis and sentence structure; and analysis of target speaker voices. The system chooses TTS parameters to improve listener satisfaction or other desired listener behavior. Choices may be made by specified algorithms defined by code developers, or by machine learning algorithms trained on labeled samples of system performance.

Type: Grant

Filed: January 13, 2017

Date of Patent: March 10, 2020

Assignee: SoundHound, Inc.

Inventors: Monika Almudafar-Depeyrot, Bernard Mont-Reynaud
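
Illustration (not taken from the patent): the abstract above says that TTS parameters may be chosen "by specified algorithms defined by code developers" from attributes such as listener age and noise level. A minimal sketch of that rule-based case might look like the following. The `VoiceParams` class, the `choose_voice_params` function, the 0-to-1 noise scale, and every threshold and multiplier are hypothetical choices made for this example only.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class VoiceParams:
    """A small, hypothetical subset of TTS voice parameters (1.0 = neutral)."""
    speech_rate: float = 1.0
    volume: float = 1.0
    pitch: float = 1.0


def choose_voice_params(noise_level: float, listener_age: int) -> VoiceParams:
    """Pick voice parameters from one situational and one profile attribute.

    noise_level is assumed to be normalized to the range 0..1.
    The thresholds below are invented for illustration.
    """
    params = VoiceParams()
    if noise_level > 0.7:
        # Noisy surroundings: speak louder.
        params = replace(params, volume=1.5)
    if listener_age >= 70:
        # Older listener: slow the speech rate slightly.
        params = replace(params, speech_rate=0.85)
    return params
```

A machine-learned variant, which the abstract also mentions, would replace the hand-written rules with a model trained on labeled samples; that is outside the scope of this sketch.
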
INTEGRATION OF THIRD PARTY VIRTUAL ASSISTANTS

Publication number: 20190147850

Abstract: A virtual assistant device recognizes multiple wake-up phrases. In response to a particular wake-up phrase the device sends speech audio to either a default or a third party virtual assistant server. A virtual assistant server can receive speech audio and an indication of which of multiple wake-up phrases was used and, accordingly, send the speech audio, or text recognized from the speech audio using automatic speech recognition, to a third party server. A response from the third party server can be voice audio or text for the virtual assistant server to synthesize distinctively corresponding to the wake-up phrase.

Type: Application

Filed: January 13, 2019

Publication date: May 16, 2019

Applicant: SoundHound, Inc.

Inventors: Monika Almudafar-Depeyrot, Keyvan Mohajer, Mark Stevans
Virtual assistant configured by selection of wake-up phrase

Patent number: 10217453

Abstract: A speech-enabled dialog system responds to a plurality of wake-up phrases. Based on which wake-up phrase is detected, the system's configuration is modified accordingly. Various configurable aspects of the system include selection and morphing of a text-to-speech voice; configuration of acoustic model, language model, vocabulary, and grammar; configuration of a graphic animation; configuration of virtual assistant personality parameters; invocation of a particular user profile; invocation of an authentication function; and configuration of an open sound. Configuration depends on a target market segment. Configuration also depends on the state of the dialog system, such as whether a previous utterance was an information query.

Type: Grant

Filed: October 14, 2016

Date of Patent: February 26, 2019

Assignee: SoundHound, Inc.

Inventors: Mark Stevans, Monika Almudafar-Depeyrot, Keyvan Mohajer
PARAMETRIC ADAPTATION OF VOICE SYNTHESIS

Publication number: 20180182373

Abstract: Software-based systems perform parametric speech synthesis. TTS voice parameters determine the generated speech audio. Voice parameters include gender, age, dialect, donor, arousal, authoritativeness, pitch, range, speech rate, volume, flutter, roughness, breath, frequencies, bandwidths, and relative amplitudes of formants and nasal sounds. The system chooses TTS parameters based on one or more of: user profile attributes including gender, age, and dialect; situational attributes such as location, noise level, and mood; natural language semantic attributes such as domain of conversation, expression type, dimensions of affect, word emphasis and sentence structure; and analysis of target speaker voices. The system chooses TTS parameters to improve listener satisfaction or other desired listener behavior. Choices may be made by specified algorithms defined by code developers, or by machine learning algorithms trained on labeled samples of system performance.

Type: Application

Filed: January 13, 2017

Publication date: June 28, 2018

Applicant: SoundHound, Inc.

Inventors: Monika Almudafar-Depeyrot, Bernard Mont-Reynaud
VIRTUAL ASSISTANT CONFIGURED BY SELECTION OF WAKE-UP PHRASE

Publication number: 20180108343

Abstract: A speech-enabled dialog system responds to a plurality of wake-up phrases. Based on which wake-up phrase is detected, the system's configuration is modified accordingly. Various configurable aspects of the system include selection and morphing of a text-to-speech voice; configuration of acoustic model, language model, vocabulary, and grammar; configuration of a graphic animation; configuration of virtual assistant personality parameters; invocation of a particular user profile; invocation of an authentication function; and configuration of an open sound. Configuration depends on a target market segment. Configuration also depends on the state of the dialog system, such as whether a previous utterance was an information query.

Type: Application

Filed: October 14, 2016

Publication date: April 19, 2018

Inventors: Mark Stevans, Monika Almudafar-Depeyrot, Keyvan Mohajer
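
Illustration (not taken from the patent): the abstract above describes a dialog system whose configuration changes depending on which wake-up phrase is detected. One simple way to picture this is a lookup table keyed by wake-up phrase, as sketched below. The phrases, the `AssistantConfig` fields, and the `configure` function are all hypothetical and cover only a few of the configurable aspects the abstract lists.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AssistantConfig:
    """Hypothetical bundle of settings selected by a wake-up phrase."""
    tts_voice: str
    language_model: str
    user_profile: str
    requires_authentication: bool


# Invented wake-up phrases and settings, for illustration only.
CONFIG_BY_WAKE_PHRASE = {
    "hey assistant": AssistantConfig("neutral", "general", "default", False),
    "ok banker": AssistantConfig("formal", "finance", "account_holder", True),
}


def configure(wake_phrase: str) -> AssistantConfig:
    """Return the configuration associated with the detected wake-up phrase."""
    key = wake_phrase.strip().lower()
    if key not in CONFIG_BY_WAKE_PHRASE:
        raise KeyError(f"unrecognized wake-up phrase: {wake_phrase!r}")
    return CONFIG_BY_WAKE_PHRASE[key]
```

In this sketch the phrase alone determines the configuration; the abstract also describes dependence on the target market segment and on dialog state, which are not modeled here.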