Patents Examined by Edgar X Guerra-Erazo

Reducing perceived effects of non-voice data in digital speech

Patent number: 11990144

Abstract: Non-voice data is embedded in a voice bit stream that includes frames of voice bits by selecting a frame of voice bits to carry the non-voice data, placing non-voice identifier bits in a first portion of the voice bits in the selected frame, and placing the non-voice data in a second portion of the voice bits in the selected frame. The non-voice identifier bits are employed to reduce a perceived effect of the non-voice data on audible speech produced from the voice bit stream.

Type: Grant

Filed: July 28, 2021

Date of Patent: May 21, 2024

Assignee: Digital Voice Systems, Inc.

Inventor: John C. Hardwick
Identification and personalized protection of text data using shapley values

Patent number: 11983497

Abstract: Privacy, protection, and de-anonymization are issues of societal importance that are implicitly at the core of several key information systems, from electronic health records to online reviews. The system and method herein allows for an identification of an author of anonymous writing based on the text and structured data, subject to practical constraints on the intruder's amount of training data and effort using Shapley values.

Type: Grant

Filed: November 20, 2020

Date of Patent: May 14, 2024

Assignee: Drexel University

Inventors: Matthew John Schneider, Shawn Mankad
Contextualization of voice inputs

Patent number: 11979960

Abstract: Disclosed herein are example techniques to provide contextual information corresponding to a voice command. An example implementation may involve receiving voice data indicating a voice command, receiving contextual information indicating a characteristic of the voice command, and determining a device operation corresponding to the voice command. Determining the device operation corresponding to the voice command may include identifying, among multiple zones of a media playback system, a zone that corresponds to the characteristic of the voice command, and determining that the voice command corresponds to one or more particular devices that are associated with the identified zone. The example implementation may further involve causing the one or more particular devices to perform the device operation.

Type: Grant

Filed: November 17, 2021

Date of Patent: May 7, 2024

Assignee: Sonos, Inc.

Inventors: Jonathan P. Lang, Romi Kadri, Christopher Butts
Systems and methods for processing and presenting conversations

Patent number: 11978472

Abstract: A system for processing and presenting a conversation includes a sensor, a processor, and a presenter. The sensor is configured to capture an audio-form conversation. The processor is configured to automatically transform the audio-form conversation into a transformed conversation. The transformed conversation includes a synchronized text, wherein the synchronized text is synchronized with the audio-form conversation. The presenter is configured to present the transformed conversation including the synchronized text and the audio-form conversation. The presenter is further configured to present the transformed conversation to be navigable, searchable, assignable, editable, and shareable.

Type: Grant

Filed: March 23, 2021

Date of Patent: May 7, 2024

Assignee: Otter.ai, Inc.

Inventors: Yun Fu, Simon Lau, Kaisuke Nakajima, Julius Cheng, Gelei Chen, Sam Song Liang, James Mason Altreuter, Kean Kheong Chin, Zhenhao Ge, Hitesh Anand Gupta, Xiaoke Huang, James Francis McAteer, Brian Francis Williams, Tao Xing
Voice recognition system for use with a personal media streaming appliance

Patent number: 11935526

Abstract: A system and method for voice control of a media playback device is disclosed. The method includes receiving an instruction of a voice command, converting the voice command to text, transmitting the text command to the playback device, and having the playback device execute the command. An instruction may include a command to play a set of audio tracks, and the media playback device plays the set of audio tracks upon receiving the instruction.

Type: Grant

Filed: September 10, 2020

Date of Patent: March 19, 2024

Assignee: Spotify AB

Inventors: Daniel Bromand, Richard Mitic, Horia Jurcut, Jennifer Thom-Santelli, Henriette Cramer, Karl Humphreys, Robert Williams, Kurt Jacobson, Henrik Lindström
Generating multi-modal response(s) through utilization of large language model(s)

Patent number: 11907674

Abstract: Implementations relate to generating multi-modal response(s) through utilization of large language model(s) (LLM(s)). Processor(s) of a system can: receive natural language (NL) based input, generate a multi-modal response that is responsive to the NL based output, and cause the multi-modal response to be rendered. In some implementations, and in generating the multi-modal response, the processor(s) can process, using a LLM, LLM input (e.g., that includes at least the NL based input) to generate LLM output, and determine, based on the LLM output, textual content for inclusion in the multi-modal response and multimedia content for inclusion in the multi-modal response. In some implementations, the multimedia content can be obtained based on a multimedia content tag that is included in the LLM output and that is indicative of the multimedia content. In various implementations, the multimedia content can be interleaved between segments of the textual content.

Type: Grant

Filed: September 20, 2023

Date of Patent: February 20, 2024

Assignee: GOOGLE LLC

Inventors: Oscar Akerlund, Evgeny Sluzhaev, Golnaz Ghiasi, Thang Luong, Yifeng Lu, Igor Petrovski, Ágoston Weisz, Wei Yu, Rakesh Shivanna, Michael Andrew Goodman, Apoorv Kulshreshtha, Yu Du, Amin Ghafouri, Sanil Jain, Dustin Tran, Vikas Peswani, YaGuang Li
Information processing method, system, apparatus, electronic device and storage medium

Patent number: 11900945

Abstract: An information processing method, a system, an apparatus, an electronic device and a storage medium, where the method is applied to a client, and includes: receiving a transcript and a sentence identifier of the transcript sent by a service server; reading a local sentence identifier, and when the received sentence identifier is the same as the local sentence identifier, updating a displayed caption content corresponding to the local sentence identifier with the transcript. When the received sentence identifier of the client is the same as the local sentence identifier, the displayed caption content is replaced with the received transcript.

Type: Grant

Filed: March 21, 2022

Date of Patent: February 13, 2024

Assignee: BEIJING BYTEDANCE NETWORK TECHNOLOGY CO., LTD.

Inventors: Li Zhao, Xiao Han, Kojung Chen, Jian Tong
Aspect-based sentiment analysis

Patent number: 11886825

Abstract: Systems and methods for natural language processing are described. One or more embodiments of the present disclosure generate a word embedding for each word of an input phrase, wherein the input phrase indicates a sentiment toward an aspect term, compute a gate vector based on the aspect term, identify a dependency tree representing relations between words of the input phrase, generate a representation vector based on the dependency tree and the word embedding using a graph convolution network, wherein the gate vector is applied to a layer of the graph convolution network, and generate a probability distribution over a plurality of sentiments based on the representation vector.

Type: Grant

Filed: March 31, 2021

Date of Patent: January 30, 2024

Assignee: ADOBE, INC.

Inventors: Amir Pouran Ben Veyseh, Franck Dernoncourt
Method for extracting kansei adjective of product based on principal component analysis and explanation (PCA-E)

Patent number: 11868432

Abstract: A method for extracting a kansei adjective of a product based on principal component analysis and explanation (PCA-E) includes constructing a product kansei evaluation vector matrix through original kansei adjectives; performing dimensionality reduction through PCA; and determining, based on principal component load factors, kansei adjectives representing principal components. In this way, the kansei adjectives extracted are explanatory to help users understand the selected kansei adjectives and make accurate evaluation.

Type: Grant

Filed: June 2, 2023

Date of Patent: January 9, 2024

Assignee: SICHUAN UNIVERSITY

Inventors: Wu Zhao, Xin Guo, Miao Yu, Kai Zhang, Wei Jiang, Chong Jiang, Bing Lai, Yiwei Jiang, Jun Li, Bo Wu, Xingyu Chen
System and method for activation and deactivation of cued health assessment

Patent number: 11869635

Abstract: A system for activating a cued health assessment, which includes an audio receiver for receiving voice samples to measure one of a plurality of voice biomarkers, an audio processing module for extracting one of a plurality of biomarkers from the received voice samples, the audio processing module further classifies the received voice samples to one of plurality of predetermined health states according to the extracted biomarkers, and a voice sample scheduler for activating a cued health assessment module when the classified health state is a clinically actionable health state.

Type: Grant

Filed: April 12, 2021

Date of Patent: January 9, 2024

Assignee: Sonde Health, Inc.

Inventors: James D. Harper, Michael Chen
Self-supervised semantic shift detection and alignment

Patent number: 11853702

Abstract: Generate, for each of the words of a common vocabulary of first and second text corpora, a first word embedding vector in the first text corpus and a second word embedding vector in the second text corpus. Generate, for each word in a random sample of non-landmark words, an artificially shifted word embedding vector by modifying the first word embedding vector for that word. Train a machine learning classifier to predict whether an artificial shift has been injected for a given word, based on the artificially shifted word embedding vector and the second word embedding vector for the given word. Predict semantic shifts for at least a plurality of the words of the common vocabulary by providing the first word embedding vectors and the second word embedding vectors for at least the plurality of the words of the common vocabulary as input to the trained machine learning classifier.

Type: Grant

Filed: January 29, 2021

Date of Patent: December 26, 2023

Assignees: International Business Machines Corporation, RENSSELAER POLYTECHNIC INSTITUTE

Inventors: Pin-Yu Chen, Maurício Gruppi, Sibel Adali
Location-based responses to telephone requests

Patent number: 11854543

Abstract: A method for receiving processed information at a remote device is described. The method includes transmitting from the remote device a verbal request to a first information provider and receiving a digital message from the first information provider in response to the transmitted verbal request. The digital message includes a symbolic representation indicator associated with a symbolic representation of the verbal request and data used to control an application. The method also includes transmitting, using the application, the symbolic representation indicator to a second information provider for generating results to be displayed on the remote device.

Type: Grant

Filed: June 15, 2021

Date of Patent: December 26, 2023

Assignee: Google LLC

Inventors: Gudmundur Hafsteinsson, Michael J. LeBeau, Natalia Marmasse, Sumit Agarwal, Dipchand Nishar
Listening devices for obtaining metrics from ambient noise

Patent number: 11842723

Abstract: A device may receive audio data based on a capturing of sounds associated with a structure. The device may obtain a model associated with the structure. The model may have been trained to receive the audio data as input, determine a score that identifies a likelihood that a sound is present in the audio data, and identify the sound based on the score. The device may determine at least one parameter associated with the sound. The device may generate a metric based on the at least one parameter associated with the sound, and perform an action based on the metric.

Type: Grant

Filed: April 12, 2021

Date of Patent: December 12, 2023

Assignee: Capital One Services, LLC

Inventors: Michael Mossoba, Joshua Edwards, Abdelkadar M'hamed Benkreira, Austen Novis, Sophie Bermudez
Adaptive voice communication

Patent number: 11836415

Abstract: A system is provided for streaming media content in a vehicle. The system includes a personal media streaming appliance system configured to connect to a media delivery system and receive media content from the media delivery system at least via a cellular network. The personal media streaming appliance system operates to transmit a media signal representative to the received media content to a vehicle media playback system so that the vehicle media playback system operates to play the media content in the vehicle. Customized voice communications are generated based on receiving input, such as a user query and/or a media track change indication.

Type: Grant

Filed: November 20, 2020

Date of Patent: December 5, 2023

Assignee: Spotify AB

Inventors: Emma-Camelia Gosu, Johan Oskarsson, Daniel Bromand
Cross-lingual regularization for multilingual generalization

Patent number: 11829727

Abstract: Approaches for cross-lingual regularization for multilingual generalization include a method for training a natural language processing (NLP) deep learning module. The method includes accessing a first dataset having a first training data entry, the first training data entry including one or more natural language input text strings in a first language; translating at least one of the one or more natural language input text strings of the first training data entry from the first language to a second language; creating a second training data entry by starting with the first training data entry and substituting the at least one of the natural language input text strings in the first language with the translation of the at least one of the natural language input text strings in the second language; adding the second training data entry to a second dataset; and training the deep learning module using the second dataset.

Type: Grant

Filed: April 23, 2021

Date of Patent: November 28, 2023

Assignee: salesforce.com, inc.

Inventors: Jasdeep Singh, Nitish Shirish Keskar, Bryan McCann
Display apparatus and method for question and answer

Patent number: 11817013

Abstract: A display apparatus and a method for questions and answers includes a display unit includes an input unit configured to receive user's speech voice; a communication unit configured to perform data communication with an answer server; and a processor configured to create and display one or more question sentences using the speech voice in response to the speech voice being a word speech, create a question language corresponding to the question sentence selected from among the displayed one or more question sentences, transmit the created question language to the answer server via the communication unit, and, in response to one or more answer results related to the question language being received from the answer server, display the received one or more answer results. Accordingly, the display apparatus may provide an answer result appropriate to a user's question intention although a non-sentence speech is input.

Type: Grant

Filed: November 13, 2020

Date of Patent: November 14, 2023

Assignee: SAMSUNG ELECTRONICS CO., LTD.

Inventor: Eun-sang Bak
Methods and systems for determining a wake word

Patent number: 11817104

Abstract: A user device (e.g., voice assistant device, voice enabled device, smart device, computing device, etc.) may receive/detect audio content (e.g., speech, etc.) that includes a wake word and/or words similar to a wake word. The user device may require a wake word, a portion of the wake word, or words similar to the wake word to be detected prior to interacting with a user. The user device may, based on characteristics of the audio content, determine if the audio content originates from an authorized user. The user device may decrease and/or increase scrutiny applied to wake word detection based on whether audio content originates from an authorized user.

Type: Grant

Filed: February 26, 2021

Date of Patent: November 14, 2023

Assignee: Comcast Cable Communications, LLC

Inventors: Hans Sayyadi, Nima Bina
Systems and methods for dynamically improving user intelligibility of synthesized speech in a work environment

Patent number: 11817078

Abstract: A method and apparatus that dynamically adjust operational parameters of a text-to-speech engine in a speech-based system are disclosed. A voice engine or other application of a device provides a mechanism to alter the adjustable operational parameters of the text-to-speech engine. In response to one or more environmental conditions, the adjustable operational parameters of the text-to-speech engine are modified to increase the intelligibility of synthesized speech.

Type: Grant

Filed: June 2, 2023

Date of Patent: November 14, 2023

Assignee: VOCOLLECT, INC.

Inventors: James Hendrickson, Debra Drylie Stiffey, Duane Littleton, John Pecorari, Arkadiusz Slusarczyk
Multi-user virtual assistant for verbal device control

Patent number: 11817092

Abstract: In one example, a method includes receiving audio data generated by one or more microphones of a computing device, the audio data representing a spoken utterance; identifying, based on the audio data, a user that provided the spoken utterance; identifying, based on the audio data, an automation action associated with one or more automation devices, the automation action corresponding to the spoken utterance; determining whether the identified user is authorized to cause performance of the identified automation action; and responsive to determining that the identified user is authorized to cause performance of the identified automation action, causing the one or more automation devices to perform the identified automation action.

Type: Grant

Filed: December 2, 2020

Date of Patent: November 14, 2023

Assignee: GOOGLE LLC

Inventors: Yuzhao Ni, David Roy Schairer
Assertion detection in multi-labelled clinical text using scope localization

Patent number: 11809826

Abstract: For assertion detection from clinical text in a medical system, a model, such as a neural network, is trained to operate on multi-labeled clinical text. Using multi-task learning, both the scope and the class losses are minimized. As a result, a machine learning model can predict both the scope and class of clinical text for a patient where the clinical text is not limited to one class or a particular length.

Type: Grant

Filed: November 17, 2020

Date of Patent: November 7, 2023

Assignee: Siemens Healthcare GmbH

Inventors: Rajeev Bhatt Ambati, Oladimeji Farri, Ramya Vunikili

1 2 3 4 5 … next