Patents Examined by Edgar X Guerra-Erazo

Voice recognition system for use with a personal media streaming appliance

Patent number: 11935526

Abstract: A system and method for voice control of a media playback device is disclosed. The method includes receiving an instruction of a voice command, converting the voice command to text, transmitting the text command to the playback device, and having the playback device execute the command. An instruction may include a command to play a set of audio tracks, and the media playback device plays the set of audio tracks upon receiving the instruction.

Type: Grant

Filed: September 10, 2020

Date of Patent: March 19, 2024

Assignee: Spotify AB

Inventors: Daniel Bromand, Richard Mitic, Horia Jurcut, Jennifer Thom-Santelli, Henriette Cramer, Karl Humphreys, Robert Williams, Kurt Jacobson, Henrik Lindström
Generating multi-modal response(s) through utilization of large language model(s)

Patent number: 11907674

Abstract: Implementations relate to generating multi-modal response(s) through utilization of large language model(s) (LLM(s)). Processor(s) of a system can: receive natural language (NL) based input, generate a multi-modal response that is responsive to the NL based output, and cause the multi-modal response to be rendered. In some implementations, and in generating the multi-modal response, the processor(s) can process, using a LLM, LLM input (e.g., that includes at least the NL based input) to generate LLM output, and determine, based on the LLM output, textual content for inclusion in the multi-modal response and multimedia content for inclusion in the multi-modal response. In some implementations, the multimedia content can be obtained based on a multimedia content tag that is included in the LLM output and that is indicative of the multimedia content. In various implementations, the multimedia content can be interleaved between segments of the textual content.

Type: Grant

Filed: September 20, 2023

Date of Patent: February 20, 2024

Assignee: GOOGLE LLC

Inventors: Oscar Akerlund, Evgeny Sluzhaev, Golnaz Ghiasi, Thang Luong, Yifeng Lu, Igor Petrovski, Ágoston Weisz, Wei Yu, Rakesh Shivanna, Michael Andrew Goodman, Apoorv Kulshreshtha, Yu Du, Amin Ghafouri, Sanil Jain, Dustin Tran, Vikas Peswani, YaGuang Li
Information processing method, system, apparatus, electronic device and storage medium

Patent number: 11900945

Abstract: An information processing method, a system, an apparatus, an electronic device and a storage medium, where the method is applied to a client, and includes: receiving a transcript and a sentence identifier of the transcript sent by a service server; reading a local sentence identifier, and when the received sentence identifier is the same as the local sentence identifier, updating a displayed caption content corresponding to the local sentence identifier with the transcript. When the received sentence identifier of the client is the same as the local sentence identifier, the displayed caption content is replaced with the received transcript.

Type: Grant

Filed: March 21, 2022

Date of Patent: February 13, 2024

Assignee: BEIJING BYTEDANCE NETWORK TECHNOLOGY CO., LTD.

Inventors: Li Zhao, Xiao Han, Kojung Chen, Jian Tong
Aspect-based sentiment analysis

Patent number: 11886825

Abstract: Systems and methods for natural language processing are described. One or more embodiments of the present disclosure generate a word embedding for each word of an input phrase, wherein the input phrase indicates a sentiment toward an aspect term, compute a gate vector based on the aspect term, identify a dependency tree representing relations between words of the input phrase, generate a representation vector based on the dependency tree and the word embedding using a graph convolution network, wherein the gate vector is applied to a layer of the graph convolution network, and generate a probability distribution over a plurality of sentiments based on the representation vector.

Type: Grant

Filed: March 31, 2021

Date of Patent: January 30, 2024

Assignee: ADOBE, INC.

Inventors: Amir Pouran Ben Veyseh, Franck Dernoncourt
Method for extracting kansei adjective of product based on principal component analysis and explanation (PCA-E)

Patent number: 11868432

Abstract: A method for extracting a kansei adjective of a product based on principal component analysis and explanation (PCA-E) includes constructing a product kansei evaluation vector matrix through original kansei adjectives; performing dimensionality reduction through PCA; and determining, based on principal component load factors, kansei adjectives representing principal components. In this way, the kansei adjectives extracted are explanatory to help users understand the selected kansei adjectives and make accurate evaluation.

Type: Grant

Filed: June 2, 2023

Date of Patent: January 9, 2024

Assignee: SICHUAN UNIVERSITY

Inventors: Wu Zhao, Xin Guo, Miao Yu, Kai Zhang, Wei Jiang, Chong Jiang, Bing Lai, Yiwei Jiang, Jun Li, Bo Wu, Xingyu Chen
System and method for activation and deactivation of cued health assessment

Patent number: 11869635

Abstract: A system for activating a cued health assessment, which includes an audio receiver for receiving voice samples to measure one of a plurality of voice biomarkers, an audio processing module for extracting one of a plurality of biomarkers from the received voice samples, the audio processing module further classifies the received voice samples to one of plurality of predetermined health states according to the extracted biomarkers, and a voice sample scheduler for activating a cued health assessment module when the classified health state is a clinically actionable health state.

Type: Grant

Filed: April 12, 2021

Date of Patent: January 9, 2024

Assignee: Sonde Health, Inc.

Inventors: James D. Harper, Michael Chen
Self-supervised semantic shift detection and alignment

Patent number: 11853702

Abstract: Generate, for each of the words of a common vocabulary of first and second text corpora, a first word embedding vector in the first text corpus and a second word embedding vector in the second text corpus. Generate, for each word in a random sample of non-landmark words, an artificially shifted word embedding vector by modifying the first word embedding vector for that word. Train a machine learning classifier to predict whether an artificial shift has been injected for a given word, based on the artificially shifted word embedding vector and the second word embedding vector for the given word. Predict semantic shifts for at least a plurality of the words of the common vocabulary by providing the first word embedding vectors and the second word embedding vectors for at least the plurality of the words of the common vocabulary as input to the trained machine learning classifier.

Type: Grant

Filed: January 29, 2021

Date of Patent: December 26, 2023

Assignees: International Business Machines Corporation, RENSSELAER POLYTECHNIC INSTITUTE

Inventors: Pin-Yu Chen, Maurício Gruppi, Sibel Adali
Location-based responses to telephone requests

Patent number: 11854543

Abstract: A method for receiving processed information at a remote device is described. The method includes transmitting from the remote device a verbal request to a first information provider and receiving a digital message from the first information provider in response to the transmitted verbal request. The digital message includes a symbolic representation indicator associated with a symbolic representation of the verbal request and data used to control an application. The method also includes transmitting, using the application, the symbolic representation indicator to a second information provider for generating results to be displayed on the remote device.

Type: Grant

Filed: June 15, 2021

Date of Patent: December 26, 2023

Assignee: Google LLC

Inventors: Gudmundur Hafsteinsson, Michael J. LeBeau, Natalia Marmasse, Sumit Agarwal, Dipchand Nishar
Listening devices for obtaining metrics from ambient noise

Patent number: 11842723

Abstract: A device may receive audio data based on a capturing of sounds associated with a structure. The device may obtain a model associated with the structure. The model may have been trained to receive the audio data as input, determine a score that identifies a likelihood that a sound is present in the audio data, and identify the sound based on the score. The device may determine at least one parameter associated with the sound. The device may generate a metric based on the at least one parameter associated with the sound, and perform an action based on the metric.

Type: Grant

Filed: April 12, 2021

Date of Patent: December 12, 2023

Assignee: Capital One Services, LLC

Inventors: Michael Mossoba, Joshua Edwards, Abdelkadar M'hamed Benkreira, Austen Novis, Sophie Bermudez
Adaptive voice communication

Patent number: 11836415

Abstract: A system is provided for streaming media content in a vehicle. The system includes a personal media streaming appliance system configured to connect to a media delivery system and receive media content from the media delivery system at least via a cellular network. The personal media streaming appliance system operates to transmit a media signal representative to the received media content to a vehicle media playback system so that the vehicle media playback system operates to play the media content in the vehicle. Customized voice communications are generated based on receiving input, such as a user query and/or a media track change indication.

Type: Grant

Filed: November 20, 2020

Date of Patent: December 5, 2023

Assignee: Spotify AB

Inventors: Emma-Camelia Gosu, Johan Oskarsson, Daniel Bromand
Cross-lingual regularization for multilingual generalization

Patent number: 11829727

Abstract: Approaches for cross-lingual regularization for multilingual generalization include a method for training a natural language processing (NLP) deep learning module. The method includes accessing a first dataset having a first training data entry, the first training data entry including one or more natural language input text strings in a first language; translating at least one of the one or more natural language input text strings of the first training data entry from the first language to a second language; creating a second training data entry by starting with the first training data entry and substituting the at least one of the natural language input text strings in the first language with the translation of the at least one of the natural language input text strings in the second language; adding the second training data entry to a second dataset; and training the deep learning module using the second dataset.

Type: Grant

Filed: April 23, 2021

Date of Patent: November 28, 2023

Assignee: salesforce.com, inc.

Inventors: Jasdeep Singh, Nitish Shirish Keskar, Bryan McCann
Methods and systems for determining a wake word

Patent number: 11817104

Abstract: A user device (e.g., voice assistant device, voice enabled device, smart device, computing device, etc.) may receive/detect audio content (e.g., speech, etc.) that includes a wake word and/or words similar to a wake word. The user device may require a wake word, a portion of the wake word, or words similar to the wake word to be detected prior to interacting with a user. The user device may, based on characteristics of the audio content, determine if the audio content originates from an authorized user. The user device may decrease and/or increase scrutiny applied to wake word detection based on whether audio content originates from an authorized user.

Type: Grant

Filed: February 26, 2021

Date of Patent: November 14, 2023

Assignee: Comcast Cable Communications, LLC

Inventors: Hans Sayyadi, Nima Bina
Multi-user virtual assistant for verbal device control

Patent number: 11817092

Abstract: In one example, a method includes receiving audio data generated by one or more microphones of a computing device, the audio data representing a spoken utterance; identifying, based on the audio data, a user that provided the spoken utterance; identifying, based on the audio data, an automation action associated with one or more automation devices, the automation action corresponding to the spoken utterance; determining whether the identified user is authorized to cause performance of the identified automation action; and responsive to determining that the identified user is authorized to cause performance of the identified automation action, causing the one or more automation devices to perform the identified automation action.

Type: Grant

Filed: December 2, 2020

Date of Patent: November 14, 2023

Assignee: GOOGLE LLC

Inventors: Yuzhao Ni, David Roy Schairer
Display apparatus and method for question and answer

Patent number: 11817013

Abstract: A display apparatus and a method for questions and answers includes a display unit includes an input unit configured to receive user's speech voice; a communication unit configured to perform data communication with an answer server; and a processor configured to create and display one or more question sentences using the speech voice in response to the speech voice being a word speech, create a question language corresponding to the question sentence selected from among the displayed one or more question sentences, transmit the created question language to the answer server via the communication unit, and, in response to one or more answer results related to the question language being received from the answer server, display the received one or more answer results. Accordingly, the display apparatus may provide an answer result appropriate to a user's question intention although a non-sentence speech is input.

Type: Grant

Filed: November 13, 2020

Date of Patent: November 14, 2023

Assignee: SAMSUNG ELECTRONICS CO., LTD.

Inventor: Eun-sang Bak
Systems and methods for dynamically improving user intelligibility of synthesized speech in a work environment

Patent number: 11817078

Abstract: A method and apparatus that dynamically adjust operational parameters of a text-to-speech engine in a speech-based system are disclosed. A voice engine or other application of a device provides a mechanism to alter the adjustable operational parameters of the text-to-speech engine. In response to one or more environmental conditions, the adjustable operational parameters of the text-to-speech engine are modified to increase the intelligibility of synthesized speech.

Type: Grant

Filed: June 2, 2023

Date of Patent: November 14, 2023

Assignee: VOCOLLECT, INC.

Inventors: James Hendrickson, Debra Drylie Stiffey, Duane Littleton, John Pecorari, Arkadiusz Slusarczyk
Assertion detection in multi-labelled clinical text using scope localization

Patent number: 11809826

Abstract: For assertion detection from clinical text in a medical system, a model, such as a neural network, is trained to operate on multi-labeled clinical text. Using multi-task learning, both the scope and the class losses are minimized. As a result, a machine learning model can predict both the scope and class of clinical text for a patient where the clinical text is not limited to one class or a particular length.

Type: Grant

Filed: November 17, 2020

Date of Patent: November 7, 2023

Assignee: Siemens Healthcare GmbH

Inventors: Rajeev Bhatt Ambati, Oladimeji Farri, Ramya Vunikili
Systems and methods for dynamically improving user intelligibility of synthesized speech in a work environment

Patent number: 11810545

Abstract: A method and apparatus that dynamically adjust operational parameters of a text-to-speech engine in a speech-based system are disclosed. A voice engine or other application of a device provides a mechanism to alter the adjustable operational parameters of the text-to-speech engine. In response to one or more environmental conditions, the adjustable operational parameters of the text-to-speech engine are modified to increase the intelligibility of synthesized speech.

Type: Grant

Filed: May 7, 2020

Date of Patent: November 7, 2023

Assignee: VOCOLLECT, INC.

Inventors: James Hendrickson, Debra Drylie Stiffey, Duane Littleton, John Pecorari, Arkadiusz Slusarczyk
Local and cloud speech recognition

Patent number: 11804227

Abstract: Disclosed herein are system, apparatus, article of manufacture, method and/or computer program product embodiments, and/or combinations and sub-combinations thereof, for distributing the performance of speech recognition among a remote control device and a voice platform in the cloud. In some embodiments, the remote control device operates to receive a voice input from a user. The remote control device detects a trigger word in the voice input. The remote control device then processes the voice input. The remote control device then transmits the voice input to a voice platform based on the detecting in order to determine an intent associated with the voice input.

Type: Grant

Filed: May 21, 2021

Date of Patent: October 31, 2023

Assignee: Roku, Inc.

Inventors: Anthony John Wood, David Stern, Gregory Mack Garner
Optimization method for implementation of mel-frequency cepstral coefficients

Patent number: 11804238

Abstract: An optimization method for an implementation of mel-frequency cepstral coefficients is provided. The optimization method includes the following steps: performing a framing step, including using a 400×16 static random access memory to temporarily store a plurality of sampling points of a sound signal with overlap, and decomposing the sound signal into a plurality of frames. Each of the plurality of frames is 400 of the sampling points, there is an overlapping region between adjacent two of the plurality of frames, and the overlapping region includes 240 of the sampling points. The optimization method further includes performing a windowing step, which includes multiplying each of the plurality of frames by a window function in a bit-level design, and the optimization method includes performing a fast Fourier transform (FFT) step, which includes applying a 512 point FFT on a frame signal to obtain a corresponding frequency spectrum.

Type: Grant

Filed: October 29, 2021

Date of Patent: October 31, 2023

Assignee: REALTEK SEMICONDUCTOR CORP.

Inventors: Li-Li Tan, Zhi-Lin Wang, Xiao-Feng Cao, Xiao-Huan Li
Scalable dynamic class language modeling

Patent number: 11804218

Abstract: This document generally describes systems and methods for dynamically adapting speech recognition for individual voice queries of a user using class-based language models. The method may include receiving a voice query from a user that includes audio data corresponding to an utterance of the user, and context data associated with the user. One or more class models are then generated that collectively identify a first set of terms determined based on the context data, and a respective class to which the respective term is assigned for each respective term in the first set of terms. A language model that includes a residual unigram may then be accessed and processed for each respective class to insert a respective class symbol at each instance of the residual unigram that occurs within the language model. A transcription of the utterance of the user is then generated using the modified language model.

Type: Grant

Filed: February 10, 2021

Date of Patent: October 31, 2023

Assignee: Google LLC

Inventors: Justin Max Scheiner, Petar Aleksic

1 2 3 4 5 … next