Patents Examined by Samuel G Neway

Modulation of packetized audio signals

Patent number: 11948572

Abstract: Modulating packetized audio signals in a voice activated data packet based computer network environment is provided. A system can receive audio signals detected by a microphone of a device. The system can parse the audio signal to identify trigger keyword and request, and generate a first action data structure. The system can identify a content item object based on the trigger keyword, and generate an output signal comprising a first portion corresponding to the first action data structure and a second portion corresponding to the content item object. The system can apply a modulation to the first or second portion of the output signal, and transmit the modulated output signal to the device.

Type: Grant

Filed: October 24, 2022

Date of Patent: April 2, 2024

Assignee: GOOGLE LLC

Inventors: Gaurav Bhaya, Robert Stets
Generating IoT-based notification(s) and provisioning of command(s) to cause automatic rendering of the IoT-based notification(s) by automated assistant client(s) of client device(s)

Patent number: 11948574

Abstract: Remote automated assistant component(s) generate client device notification(s) based on a received IoT state change notification that indicates a change in at least one state associated with at least one IoT device. The generated client device notification(s) can each indicate the change in state associated with the at least one IoT device, and can optionally indicate the at least one IoT device. Further, the remote automated assistant component(s) can identify candidate assistant client devices that are associated with the at least one IoT device, and determine whether each of the one or more of the candidate assistant client device(s) should render a corresponding client device notification.

Type: Grant

Filed: December 21, 2022

Date of Patent: April 2, 2024

Assignee: GOOGLE LLC

Inventors: David Roy Schairer, Sumer Mohammed, Mark Spates, IV, Prem Kumar, Chi Yeung Jonathan Ng, Di Zhu, Steven Clark
Learning data acquisition apparatus, model learning apparatus, methods and programs for the same

Patent number: 11942074

Abstract: A learning data acquisition device or the like, capable of acquiring learning data by superimposing noise data on clean voice data at an appropriate SN ratio, is provided. The learning data acquisition device includes a voice recognition influence degree calculation unit and a learning data acquisition unit. The voice recognition influence degree calculation unit calculates an influence degree on voice recognition accuracy caused by a change of a signal-to-noise ratio, based on a result of voice recognition on the kth noise superimposed voice data and a result of voice recognition on the k?1th noise superimposed voice data, where K is an integer of 2 or larger, k=2, 3, . . .

Type: Grant

Filed: January 29, 2020

Date of Patent: March 26, 2024

Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION

Inventors: Takaaki Fukutomi, Takashi Nakamura, Kiyoaki Matsui
Generating sentiment analysis of content

Patent number: 11934778

Abstract: Certain aspects of the present disclosure provide techniques for providing sentiment analysis of content. In order to determine the overall sentiment of content, a request is received by a sentiment analyzer, which then identifies a content identification number and retrieves comments associated with the content identification number. The sentiment analyzer pre-processes the comments, which includes removing all personal identifying information from the comments. The sentiment analyzer sends the pre-processed comments to a natural language processing service, and in turn, receives sentiment indications corresponding to the comments provided. Based on the sentiment scores, the sentiment analyzer generates a sentiment analysis and displays the sentiment analysis in the graphical user interface generated by the sentiment analyzer.

Type: Grant

Filed: August 11, 2021

Date of Patent: March 19, 2024

Assignee: Intuit, Inc.

Inventors: Harpreet Singh Hira, Abhay Dhundiraju Sastry, Priyadarshini Rajendran, Sanmathi Sathyanarayana Naga, Tak Yiu Daniel Li, Majo Paulose, Jasen Paul Stine, Darpan Sharma, Nicholas Allen McHenry
Machine reading between the lines

Patent number: 11928437

Abstract: Techniques for identifying one or more missing fragments within input text are disclosed. A discourse tree (DT) is generated for the input text (IT) received, the IT having any suitable number of sentence fragments. An indication that the IT is likely missing one or more sentence fragments may be identified based on determining that one or more rhetorical relationships of the DT matches one of a set of predefined rhetorical relationships. A query is generated one or more sentence fragments of the IT and executed against a knowledge base to obtain a set of search results. A most-relevance search result can be utilized to identify a set of candidate sentence fragments. A subset of those candidate sentence fragments can be identified based on comparing them to the sentence fragments provided in the IT, each candidate sentence fragment of the subset being implied but excluded from the IT.

Type: Grant

Filed: January 4, 2022

Date of Patent: March 12, 2024

Assignee: Oracle International Corporation

Inventor: Boris Galitsky
Digital assistant voice input integration

Patent number: 11915696

Abstract: A digital assistant supported on devices such as smartphones, tablets, personal computers, game consoles, etc. includes an extensibility client that exposes an interface and service that enables third party applications to be integrated with the digital assistant so the application user experiences are rendered using the native voice of the digital assistant. Specific voice inputs associated with a given application may be registered by developers using a manifest that is loaded when the application is launched on the device so that voice inputs from the device user can be mapped by the digital assistant extensibility client to the appropriate application as input events for consumption. In typical implementations, the manifest is arranged as a declarative document that streamlines application development and provides a seamless user experience by enabling customization of third party applications to integrate the digital assistant's voice and behaviors within the user experience of the application's domain.

Type: Grant

Filed: July 19, 2021

Date of Patent: February 27, 2024

Assignee: Microsoft Technology Licensing, LLC

Inventors: Derek Liddell, Francis Zhou, Cheng-Yi Yen
Multichannel audio speech classification

Patent number: 11900961

Abstract: Examples of the present disclosure describe systems and methods for multichannel audio speech classification. In examples, an audio signal comprising multiple audio channels is received at a processing device. Each of the audio channels in the audio signal is transcoded to a predefined audio format. For each of the transcoded audio channels, an average power value is calculated for one or more data windows in the audio signal. A correlation value is calculated between the average power value for each audio channel and the combined average power value of the other audio channels in the audio signal. Each of the correlation values (or an aggregated correlation value for the audio channels) is then compared against a threshold value to determine whether the audio signal is to be classified as a speech-based communication. Based on the classification, an action associated with the audio signal may be performed.

Type: Grant

Filed: May 31, 2022

Date of Patent: February 13, 2024

Assignee: Microsoft Technology Licensing, LLC

Inventors: Oron Nir, Inbal Sagiv, Maayan Yedidia, Fardau Van Neerden, Itai Norman
Detecting continuing conversations with computing devices

Patent number: 11893350

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for detecting a continued conversation are disclosed. In one aspect, a method includes the actions of receiving first audio data of a first utterance. The actions further include obtaining a first transcription of the first utterance. The actions further include receiving second audio data of a second utterance. The actions further include obtaining a second transcription of the second utterance. The actions further include determining whether the second utterance includes a query directed to a query processing system based on analysis of the second transcription and the first transcription or a response to the first query. The actions further include configuring the data routing component to provide the second transcription of the second utterance to the query processing system as a second query or bypass routing the second transcription.

Type: Grant

Filed: September 2, 2022

Date of Patent: February 6, 2024

Assignee: GOOGLE LLC

Inventors: Nathan David Howard, Gabor Simko, Andrei Giurgiu, Behshad Behzadi, Marcin M. Nowak-Przygodzki
Concept for encoding an audio signal and decoding an audio signal using speech related spectral shaping information

Patent number: 11881228

Abstract: According to an aspect of the present invention an encoder for encoding an audio signal has an analyzer configured for deriving prediction coefficients and a residual signal from a frame of the audio signal. The encoder has a formant information calculator configured for calculating a speech related spectral shaping information from the prediction coefficients, a gain parameter calculator configured for calculating a gain parameter from an unvoiced residual signal and the spectral shaping information and a bitstream former configured for forming an output signal based on an information related to a voiced signal frame, the gain parameter or a quantized gain parameter and the prediction coefficients.

Type: Grant

Filed: December 14, 2020

Date of Patent: January 23, 2024

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e. V.

Inventors: Guillaume Fuchs, Markus Multrus, Emmanuel Ravelli, Markus Schnell
Voice command processing for locked devices

Patent number: 11862174

Abstract: Techniques for processing voice commands from a locked device are described. A voice command received by a locked device is stored, a prompt requesting that the device be unlocked is generated, and the voice command is processed automatically after the device is unlocked. Thus, the system processes the voice command without the user repeating the voice command. In addition, the system may process certain voice commands even when the device is locked. For example, a whitelist filter compares an intent associated with the voice command to whitelisted intents from a whitelist database before the intent is dispatched to a speechlet, and intents included in the whitelist database are processed normally. Thus, the system performs certain voice commands while the device is locked, while other voice commands may be automatically processed after the device is unlocked without the user repeating the voice command.

Type: Grant

Filed: March 23, 2021

Date of Patent: January 2, 2024

Assignee: Amazon Technologies, Inc.

Inventors: Haitang Wang, Ankur Narendra Bhai Vachhani
Systems and methods for structure discovery and structure-based analysis in natural language processing models

Patent number: 11861321

Abstract: A regular expression prompt may be determined by combining a regular expression prompt template with input text from an input document. The regular expression prompt template may include a natural language instruction to identify one or more regular expressions from the input text and one or more fillable portions designated for filling with the input text. The regular expression prompt may be sent to a large language model for evaluation, and one or more regular expressions may be identified based on a response received from the large language model. The regular expressions may be used to disaggregate the input text, and the disaggregated text portions may be used to determine a structured document based on the input document. The structured document may be used to determine a response to a query of the input document.

Type: Grant

Filed: June 29, 2023

Date of Patent: January 2, 2024

Assignee: Casetext, Inc.

Inventors: Brian O'Kelly, Javed Qadrud-Din, Ryan Walker, Walter DeFoor, Pablo Arredondo
Method for detecting and classifying coughs or other non-semantic sounds using audio feature set learned from speech

Patent number: 11862188

Abstract: A method of detecting a cough in an audio stream includes a step of performing one or more pre-processing steps on the audio stream to generate an input audio sequence comprising a plurality of time-separated audio segments. An embedding is generated by a self-supervised triplet loss embedding model for each of the segments of the input audio sequence using an audio feature set, the embedding model having been trained to learn the audio feature set in a self-supervised triplet loss manner from a plurality of speech audio clips from a speech dataset. The embedding for each of the segments is provided to a model performing cough detection inference. This model generates a probability that each of the segments of the input audio sequence includes a cough episode. The method includes generating cough metrics for each of the cough episodes detected in the input audio sequence.

Type: Grant

Filed: October 21, 2021

Date of Patent: January 2, 2024

Assignee: Google LLC

Inventors: Jacob Garrison, Jacob Scott Peplinski, Joel Shor
Privacy mode based on speaker identifier

Patent number: 11854545

Abstract: Techniques for configuring a speech processing system with a privacy mode that is associated with the identity of a user that activated the privacy mode are described. A user may speak an indication to have the speech processing system activate a privacy mode. When such an indication is detected by the speech processing system, the speech processing system determines an identity of the user, determines a unique system identifier associated with the user, and generates a privacy mode flag. The speech processing system then associates the privacy mode flag with the user's unique system identifier. The privacy mode flag indicates to components of the speech processing system that any data related to processing of the user's utterances should not be sent to long term storage, thus causing various components of the system to delete data once the respective component is finished processing with respect to an utterance of the user.

Type: Grant

Filed: September 1, 2021

Date of Patent: December 26, 2023

Assignee: Amazon Technologies, Inc.

Inventor: Zhenhua Wang
Artificial intelligence-based wakeup word detection method and apparatus, device, and medium

Patent number: 11848008

Abstract: This application discloses an artificial intelligence-based (AI-based) wakeup word detection method performed by a computing device. The method includes: constructing, by using a preset pronunciation dictionary, at least one syllable combination sequence for self-defined wakeup word text inputted by a user; obtaining to-be-recognized speech data, and extracting speech features of speech frames in the speech data; inputting the speech features into a pre-constructed deep neural network (DNN) model, to output posterior probability vectors of the speech features corresponding to syllable identifiers; determine a target probability vector from the posterior probability vectors according to the syllable combination sequence; and calculate a confidence according to the target probability vector, and determine that the speech frames include the wakeup word text when the confidence is greater than or equal to a threshold.

Type: Grant

Filed: September 23, 2021

Date of Patent: December 19, 2023

Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventors: Jie Chen, Dan Su, Mingjie Jin, Zhenling Zhu
Machine translation system for entertainment and media

Patent number: 11847425

Abstract: A process receives, with a processor, audio corresponding to media content. Further, the process converts, with the processor, the audio to text. In addition, the process concatenates, with the processor, the text with one or more time codes. The process also parses, with the processor, the concatenated text into one or more text chunks according to one or more subtitle parameters. Further, the process automatically translates, with the processor, the parsed text from a first spoken language to a second spoken language. Moreover, the process determines, with the processor, if the language translation complies with the one or more subtitle parameters. Additionally, the process outputs, with the processor, the language translation to a display device for display of the one or more text chunks as one or more subtitles at one or more times corresponding to the one or more time codes.

Type: Grant

Filed: August 1, 2018

Date of Patent: December 19, 2023

Assignee: Disney Enterprises, Inc.

Inventor: Erika Doggett
Concept for switching of sampling rates at audio processing devices

Patent number: 11830511

Abstract: Audio decoder device for decoding a bitstream, the audio decoder device including: a predictive decoder for producing a decoded audio frame from the bitstream, wherein the predictive decoder includes a parameter decoder for producing one or more audio parameters for the decoded audio frame from the bitstream and wherein the predictive decoder includes a synthesis filter device for producing the decoded audio frame by synthesizing the one or more audio parameters for the decoded audio frame; a memory device including one or more memories, wherein each of the memories is configured to store a memory state for the decoded audio frame, wherein the memory state for the decoded audio frame of the one or more memories is used by the synthesis filter device for synthesizing the one or more audio parameters for the decoded audio frame; and a memory state resampling device configured to determine the memory state for synthesizing the one or more audio parameters for the decoded audio frame, which has a sampling rate, f

Type: Grant

Filed: August 5, 2022

Date of Patent: November 28, 2023

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventors: Stefan Doehla, Guillaume Fuchs, Bernhard Grill, Markus Multrus, Grzegorz Pietrzyk, Emmanuel Ravelli, Markus Schnell
Utilizing inflection to select a meaning of a word of a phrase

Patent number: 11816434

Abstract: A method executed by a computing device includes determining a set of identigens for each phrase word of a phrase to produce sets of identigens. A set of identigens of the sets of identigens represents one or more different meanings of a phrase word of the phrase. The method further includes obtaining inflection information for one or more phrase words of the phrase. The method further includes selecting an identigen of a first set of identigens based on the inflection information to produce a first identigen selection for the first set of identigens having a selected meaning of one or more different meanings of the first phrase word. The method further includes interpreting remaining sets of identigens of the sets of identigens to produce an entigen group so that the entigen group represents a most likely meaning interpretation of the phrase.

Type: Grant

Filed: August 19, 2021

Date of Patent: November 14, 2023

Assignee: entigenlogic LLC

Inventors: Frank John Williams, Stephen Emerson Sundberg, Ameeta Vasant Reed, Dennis Arlen Roberson, Thomas James MacTavish, Karl Olaf Knutson, Jessy Thomas, Niklas Josiah MacTavish, David Michael Corns, II, Andrew Chu, Kyle Edward Alberth, Ali Fattahian, Zachary John McCord, Ahmad Abdelqader Abunaser, Gary W. Grube
System and method for audio event detection in surveillance systems

Patent number: 11810435

Abstract: A method and system for detecting and localizing a target audio event in an audio clip is disclosed. The method and system use utilizes a hierarchical approach in which a dilated convolutional neural network to detect the presence of the target audio event anywhere in an audio clip based on high level audio features. If the target audio event is detected somewhere in the audio clip, the method and system further utilizes a robust audio vector representation that encodes the inherent state of the audio as well as a learned relationship between state of the audio and the particular target audio event that was detected in the audio clip. A bi-directional long short term memory classifier is used to model long term dependencies and determine the boundaries in time of the target audio event within the audio clip based on the audio vector representations.

Type: Grant

Filed: February 20, 2019

Date of Patent: November 7, 2023

Assignee: Robert Bosch GmbH

Inventors: Asif Salekin, Zhe Feng, Shabnam Ghaffarzadegan
Concept for encoding an audio signal and decoding an audio signal using deterministic and noise like information

Patent number: 11798570

Abstract: An encoder for encoding an audio signal has: an analyzer configured for deriving prediction coefficients and a residual signal from an unvoiced frame of the audio signal; a gain parameter calculator configured for calculating a first gain parameter information for defining a first excitation signal related to a deterministic codebook and for calculating a second gain parameter information for defining a second excitation signal related to a noise-like signal for the unvoiced frame; and a bitstream former configured for forming an output signal based on an information related to a voiced signal frame, the first gain parameter information and the second gain parameter information.

Type: Grant

Filed: March 17, 2020

Date of Patent: October 24, 2023

Assignee: Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.

Inventors: Guillaume Fuchs, Markus Multrus, Emmanuel Ravelli, Markus Schnell
Method for denoising voice data, device, and storage medium

Patent number: 11798573

Abstract: The present disclosure provides a method for denoising voice data, an electronic device, and a computer readable storage medium. The present disclosure relates to the technical field of artificial intelligence, such as Internet of Vehicles, smart cockpit, smart voice, and voice recognition. A specific embodiment of the method includes: receiving an input to-be-played first piece of voice data; and invoking, in response to not detecting a synthetic voice interruption signal in a process of playing the first piece of voice data, a preset first denoising algorithm to filter out noise data except for the first piece of voice data.

Type: Grant

Filed: May 25, 2022

Date of Patent: October 24, 2023

Assignee: APOLLO INTELLIGENT CONNECTIVITY (BEIJING) TECHNOLOGY CO., LTD.

Inventor: Rong Liu

prev 1 2 3 4 5 6 7 … next