Patents Assigned to SoundHound, Inc.
  • Publication number: 20220383869
    Abstract: A user specifies a natural language command to a device. Software on the device generates contextual metadata about the user interface of the device, such as data about all visible elements of the user interface, and sends the contextual metadata along with the natural language command to a natural language understanding engine. The natural language understanding engine parses the natural language query using a stored grammar (e.g., a grammar provided by a maker of the device) and as a result of the parsing identifies information about the command (e.g., the user interface elements referenced by the command) and provides that information to the device. The device uses that provided information to respond to the command.
    Type: Application
    Filed: May 27, 2021
    Publication date: December 1, 2022
    Applicant: SoundHound, Inc.
    Inventors: Utku YABAS, Philipp HUBERT, Karl STAHL
  • Publication number: 20220343014
    Abstract: A system and method are disclosed for fulfilling GDPR and other privacy requests in a client device system as well as a downstream service provider with which the client device system partners. In examples, the downstream service provider may be a voice assistant service provider providing voice recognition and language understanding capabilities to an upstream client device system.
    Type: Application
    Filed: April 22, 2021
    Publication date: October 27, 2022
    Applicant: SoundHound, Inc.
    Inventors: Kevin QIU, Evelyn JIANG, Matthias EICHSTAEDT, Warren S. HEIT
  • Patent number: 11461812
    Abstract: Original concepts obtained from a query may be augmented with additional concepts connected to the original concepts in a concept graph in response to determining that the original concepts did not match a sufficient number of bid functions. The augmented set of concepts may then be evaluated with respect to the bid functions to identify matching ad functions. This process may be repeated until a sufficient number of matching ad functions are found. A bid amount of the matching bid functions may be calculated, such as based on semantic information obtained as a result of the query. The bid amounts may further be based on environmental information. A bid function is selected based on the bid amounts and the content associated with the bid function is provided to the source of the query. The content may be selected based on the semantic information.
    Type: Grant
    Filed: September 16, 2019
    Date of Patent: October 4, 2022
    Assignee: SoundHound, Inc.
    Inventors: Keyvan Mohajer, Scott Halstvedt
  • Publication number: 20220262362
    Abstract: A system and method are disclosed for ignoring a wakeword received at a speech-enabled listening device when it is determined the wakeword is reproduced audio from an audio-playing device. Determination can be by detecting audio distortions, by an ignore flag sent locally between an audio-playing device and speech-enabled device, by and ignore flag sent from a server, by comparison of received audio played audio to a wakeword within an audio-playing device or a speech-enabled device, and other means.
    Type: Application
    Filed: May 4, 2022
    Publication date: August 18, 2022
    Applicant: SoundHound, Inc.
    Inventors: Hsuan Yang, Qìndí Zhang, Warren S. Heit
  • Patent number: 11410642
    Abstract: A system and method for creating an embedded phoneme map from a corpus of speech in accordance with a multiplicity of acoustic features of the speech. The embedded phoneme map is used to determine how to pronounce borrowed words from a lending language in the borrowing language, using the phonemes of the borrowing language that are closest to the phonemes of the lending language. The embedded phoneme map is also used to help linguists visualize the phonemes being pronounced by a speaker in real-time and to help non-native speakers practice pronunciation by displaying the differences between proper pronunciation and actual pronunciation for open-ended speech by the speaker.
    Type: Grant
    Filed: August 16, 2019
    Date of Patent: August 9, 2022
    Assignee: SOUNDHOUND, INC.
    Inventors: Serena Caterina Scuderi, Gioia Zoli, Sarah Beth Hotung
  • Patent number: 11392833
    Abstract: An audio processing system is described. The audio processing system uses a convolutional neural network architecture to process audio data, a recurrent neural network architecture to process at least data derived from an output of the convolutional neural network architecture, and a feed-forward neural network architecture to process at least data derived from an output of the recurrent neural network architecture. The feed-forward neural network architecture is configured to output classification scores for a plurality of sound units associated with speech. The classification scores indicate a presence of one or more sound units in the audio data. The convolutional neural network architecture has a plurality of convolutional groups arranged in series, where a convolutional group includes a combination of two data mappings arranged in parallel.
    Type: Grant
    Filed: February 13, 2020
    Date of Patent: July 19, 2022
    Assignee: SoundHound, Inc.
    Inventors: Maisy Wieman, Andrew Carl Spencer, Zìlì L{hacek over (i)}, Cristina Vasconcelos
  • Patent number: 11393463
    Abstract: A system and method are disclosed for setting up a communication link between a device or application and a system with a controller. The controller can collect and send information to the application. A user interfaces with the controller to access the functionality of the application through providing commands to the controller. The system allows the user to interface with multiple applications.
    Type: Grant
    Filed: April 19, 2019
    Date of Patent: July 19, 2022
    Assignee: SoundHound, Inc.
    Inventors: Timothy P. Stonehocker, Kathleen Worthington McMahon
  • Publication number: 20220223155
    Abstract: A system and method are disclosed capable of parsing a spoken utterance into a natural language request and a speech audio segment, where the natural language request directs the system to use the speech audio segment as a new wakeword. In response to this wakeword assignment directive, the system and method are further capable of immediately building a new wakeword spotter to activate the device upon matching the new wakeword in the input audio. Different approaches to promptly building a new wakeword spotter are described. Variations of wakeword assignment directives can make the new wakeword public or private. They can also add the new wakeword to earlier wakewords, or replace earlier wakewords.
    Type: Application
    Filed: March 30, 2022
    Publication date: July 14, 2022
    Applicant: SoundHound, Inc.
    Inventor: Bernard Mont-Reynaud
  • Publication number: 20220208192
    Abstract: A processing system detects a period of non-voice activity and compares its duration to a cutoff period. The system adapts the cutoff period based on parsing previously-recognized speech to determine, according to a model, such as a machine-learned model, the probability that the speech recognized so far is a prefix to a longer complete utterance. The cutoff period is longer when a parse of previously recognized speech has a high probability of being a prefix of a longer utterance.
    Type: Application
    Filed: March 18, 2022
    Publication date: June 30, 2022
    Applicant: SoundHound, Inc.
    Inventors: Patricia Pozon AGUAYO, Jennifer Hee Young ZHANG, Jonah PROBELL
  • Patent number: 11367448
    Abstract: A method of providing a platform for configuring device-specific speech recognition is provided. The method includes providing a user interface for developers to select a set of at least two acoustic models appropriate for a specific type of a device, receiving, from a developer, a selection of the set of the at least two acoustic models, and configuring a speech recognition system to perform device-specific speech recognition by using one acoustic model selected from the at least two acoustic models of the set.
    Type: Grant
    Filed: April 21, 2021
    Date of Patent: June 21, 2022
    Assignee: SOUNDHOUND, INC.
    Inventors: Keyvan Mohajer, Mehul Patel
  • Publication number: 20220188580
    Abstract: A system and a method are disclosed that calculate the center of a geographic region. A set of topological/geographical points is received. A set of clusters is determined. A weight for each cluster is computed. The highest weighted cluster is selected. The geographic region center is calculated using the selected cluster. The geographical points can include a key for each point and be filtered by an indicated key before calculating the center of a geographic location.
    Type: Application
    Filed: December 13, 2021
    Publication date: June 16, 2022
    Applicant: SoundHound, Inc.
    Inventor: Christophe PIERRET
  • Publication number: 20220189464
    Abstract: A system and method invoke virtual assistant action, which may comprise an argument. From audio, a probability of an intent is inferred. A probability of a domain and a plurality of variable values may also be inferred. Invoking the action is in response to the intent probability exceeding a threshold. Invoking the action may also be in response to the domain probability exceeding a threshold, a variable value probability exceeding a threshold, detecting an end of utterance, and a specific amount of time having elapsed. The intent probability may increase when the audio includes speech of words with the same meaning in multiple natural languages. Invoking the action may also be conditional on the variable value exceeding its threshold within a certain period of time of the intent probability exceeding its threshold.
    Type: Application
    Filed: March 3, 2022
    Publication date: June 16, 2022
    Applicant: SoundHound, Inc.
    Inventors: Sudharsan KRISHNASWAMY, Maisy WIEMAN, Jonah PROBELL
  • Publication number: 20220172729
    Abstract: A system and method are disclosed for achieving interoperability and access to a personal extension knowledge/preference database (PEKD) through interconnected voice verification systems. Devices from various different companies and systems can link to a voice verification system (VVS). Users can also enroll with the VSS so that the VSS can provide authentication of users by personal wake phrases. Thereafter users can access their PEKD from un-owned devices by speaking their wake phrase.
    Type: Application
    Filed: December 1, 2020
    Publication date: June 2, 2022
    Applicant: SoundHound, Inc.
    Inventors: Keyvan Mohajer, Warren S. Heit
  • Publication number: 20220165272
    Abstract: A computer-implemented method is provided to support a food ordering system for food items from a menu of a restaurant using natural language. Expressions made for ordering are used to recommend a food item that a user has a high probability of wanting to include in an order. The recommendation engine is trained using machine learning. Expressions are collected and parsed to identify words that might indicate food items offered by the restaurant. The words are provided to a restaurant owner to identify food items on a menu, with which the words are associated.
    Type: Application
    Filed: February 8, 2022
    Publication date: May 26, 2022
    Applicant: SoundHound, Inc.
    Inventors: Kamyar MOHAJER, Robert MACRAE
  • Publication number: 20220165257
    Abstract: Methods and systems for automatically generating sample phrases or sentences that a user can say to invoke a set of defined actions performed by a virtual assistant are disclosed. By enabling finetuned general-purpose natural language models, the system can generate potential and accurate utterance sentences based on extracted keywords or the input utterance sentence. Furthermore, domain-specific datasets can be used to train the pre-trained, general-purpose natural language models via unsupervised learning. These generated sentences can improve the efficiency of configuring a virtual assistant. The system can further optimize the effectiveness of a virtual assistant in understanding the user, which can enhance the user experience of communicating with it.
    Type: Application
    Filed: November 19, 2021
    Publication date: May 26, 2022
    Applicant: SoundHound, Inc.
    Inventors: Pranav SINGH, Keyvan MOHAJER, Yilun ZHANG
  • Publication number: 20220147510
    Abstract: Systems and methods are provided for natural language processing using neural network models and natural language virtual assistants. The system and method include receiving a natural language phrase including a word sequence, computing corresponding error probabilities that the words are errors, and for a word with a corresponding error probability above a threshold, then computing a replacement phrase with a low error probability to provide a response from the virtual assistant depending on the replacement phrase.
    Type: Application
    Filed: January 21, 2022
    Publication date: May 12, 2022
    Applicant: SoundHound, Inc.
    Inventors: Pranav Singh, Olivia Bettaglio
  • Publication number: 20220148566
    Abstract: Machine learned models take in vectors representing desired behaviors and generate voice vectors that provide the parameters for text-to-speech (TTS) synthesis. Models may be trained on behavior vectors that include user profile attributes, situational attributes, or semantic attributes. Situational attributes may include age of people present, music that is playing, location, noise, and mood. Semantic attributes may include presence of proper nouns, number of modifiers, emotional charge, and domain of discourse. TTS voice parameters may apply per utterance and per word as to enable contrastive emphasis.
    Type: Application
    Filed: January 20, 2022
    Publication date: May 12, 2022
    Applicant: SoundHound, Inc.
    Inventors: Bernard Mont-Reynaud, Monika Almudafar-Depeyrot
  • Patent number: 11328721
    Abstract: A system and method are disclosed for ignoring a wakeword received at a speech-enabled listening device when it is determined the wakeword is reproduced audio from an audio-playing device. Determination can be by detecting audio distortions, by an ignore flag sent locally between an audio-playing device and speech-enabled device, by and ignore flag sent from a server, by comparison of received audio played audio to a wakeword within an audio-playing device or a speech-enabled device, and other means.
    Type: Grant
    Filed: February 4, 2020
    Date of Patent: May 10, 2022
    Assignee: SoundHound, Inc.
    Inventors: Hsuan Yang, Qìndí Zhang, Warren S. Heit
  • Publication number: 20220139393
    Abstract: A driver interface for use within an automobile provides responses to voice commands issued for example by a driver of the automobile. The interface includes a camera and microphone for capturing image data such as gestures and audio data from the automobile driver. The image data and audio data are processed to extract image and linguistic features from the image and audio data, which image and linguistic features are processed to interpret and infer a meaning of the voice command.
    Type: Application
    Filed: December 10, 2021
    Publication date: May 5, 2022
    Applicant: SoundHound, Inc.
    Inventors: Zili Li, Cristina Vasconcelos
  • Publication number: 20220130390
    Abstract: Developers can configure custom acoustic models by providing audio files with custom recordings. The custom acoustic model is trained by tuning a baseline model using the audio files. Audio files may contain custom noise to apply to clean speech for training. The custom acoustic model is provided as an alternative to a standard acoustic model. Device developers can select an acoustic model by a user interface. Speech recognition is performed on speech audio using one or more acoustic models. The result can be provided to developers through the user interface, and an error rate can be computed and also provided.
    Type: Application
    Filed: January 11, 2022
    Publication date: April 28, 2022
    Applicant: SOUNDHOUND, INC.
    Inventors: Keyvan MOHAJER, Mehul PATEL