Patents Examined by Vu B. Hang

Summarizing conversational speech

Patent number: 11842144

Abstract: Embodiments are directed to summarizing conversational speech. Conversation segments may be provided based on a conversation stream and segmentation models. Summarization models may be determined based on characteristics of the conversation segments. Summarization information may be generated for each of the conversation segments based on the summarization models such that the summarization information includes a text-based summarization of the conversation segment. Summarization profiles may be generated for the conversation segments based on the summarization information such that each summarization profile is associated with quality scores. Summarization models may be modified based on the summarization profiles and the associated quality scores such that the summarization profiles are updated based on the modified summarization models. Modified summarization models and the updated summarization profiles may be employed to provide reports to a user.

Type: Grant

Filed: March 6, 2023

Date of Patent: December 12, 2023

Assignee: Rammer Technologies, Inc.

Inventors: Toshish Arun Jawale, Sekhar Vallath, Pratik Abhaykumar Budruk
Control method and control apparatus for speech interaction, storage medium and system

Patent number: 11823662

Abstract: The present disclosure discloses a control method and a control apparatus for speech interaction. The detailed implementation solution of the control method for the speech interaction includes: collecting an audio signal; detecting a wake-up word in the audio signal to obtain a wake-up word result; and playing a prompt tone and/or executing a speech instruction in the audio signal based on the wake-up word result.

Type: Grant

Filed: January 26, 2021

Date of Patent: November 21, 2023

Assignee: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventors: Cong Gao, Saisai Zou, Jinfeng Bai, Lei Jia
Applied artificial intelligence technology for contextualizing words to a knowledge base using natural language processing

Patent number: 11816435

Abstract: Disclosed herein is an NLP system that is able to extract meaning from a natural language message using improved parsing techniques. Such an NLP system can be used in concert with an NLG system to interactively interpret messages and generate response messages in an interactive conversational stream. The parsing can include (1) named entity recognition that contextualizes the meanings of words in a message with reference to a knowledge base of named entities understood by the NLP and NLG systems, (2) syntactically parsing the message to determine a grammatical hierarchy for the named entities within the message, (3) reduction of recognized named entities into aggregations of named entities using the determined grammatical hierarchy and reduction rules to further clarify the message's meaning, and (4) mapping the reduced aggregation of named entities to an intent or meaning, wherein this intent/meaning can be used as control instructions for an NLG process.

Type: Grant

Filed: February 15, 2019

Date of Patent: November 14, 2023

Assignee: Narrative Science Inc.

Inventors: Maia Lewis Meza, Clayton Nicholas Norris, Michael Justin Smathers, Daniel Joseph Platt, Nathan D. Nichols
Identification of life events for virtual reality data and content collection

Patent number: 11810595

Abstract: This disclosure describes a solution to identify that a meaningful event has occurred in a person's life. Once identified, data and content related to the event can be collected and stored in a database. This data and content can be used to offer future virtual reality (VR) experiences and content. A person can be equipped with a smart device such as a smartphone, smart watch, or other wearable that can further be equipped with a tracking app that collects data from these devices and from other sources, for instance, over the internet. The tracking app can continuously collect such data and use various algorithms to make a determination as to whether the person is experiencing a meaningful life event.

Type: Grant

Filed: April 16, 2020

Date of Patent: November 7, 2023

Assignee: AT&T INTELLECTUAL PROPERTY I, L.P.

Inventors: Brittaney Zellner, Sameena Khan, Ryan Schaub, Barrett Kreiner, Ari Craine, Robert Koch
Assisted speech recognition

Patent number: 11810573

Abstract: Systems, apparatuses, and methods are described for assisting speech recognition processing. If speech recognition processing of speech input by an individual does not yield a recognized result, for example, if speech is from an individual with compromised speech, an indication may be sent to a device associated with another person that can assist. The other person may provide, via that device, additional input that indicates the meaning of the speech input. Based on this additional input, an assisted speech recognition result may be determined.

Type: Grant

Filed: April 23, 2021

Date of Patent: November 7, 2023

Assignee: Comcast Cable Communications, LLC

Inventor: Daniel Loftus
Language characteristic extraction device, named entity extraction device, extraction method, and program

Patent number: 11809820

Abstract: It is an object to successfully absorb a difference in characteristics to be taken into consideration between languages and implement common named entity extraction in a processing system.

Type: Grant

Filed: April 22, 2019

Date of Patent: November 7, 2023

Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION

Inventors: Kuniko Saito, Nozomi Kobayashi, Junji Tomita
Systems and methods for training a control system based on prior audio inputs

Patent number: 11804213

Abstract: Systems and methods are disclosed herein for training a control system based on prior audio inputs. The disclosed systems and methods receive a non-lexical or interjectional audio input. State change indications are also received and stored by the system within a predefined period of time starting from the time the system received the audio input. The system then receives a subsequent audio input. If the audio inputs of both the audio input and the subsequent audio input match, and contextual information for the audio input and the subsequent audio input match, the system stores a match association, comprising a confidence factor, for the subsequent audio input to the audio input in the associative data structure. If the confidence factor is greater than a preconfigured confidence level, the system executes one or more functions based on stored state change indications.

Type: Grant

Filed: July 1, 2021

Date of Patent: October 31, 2023

Assignee: Rovi Guides, Inc.

Inventors: Bryan James, Manik Malhotra
Conversation facilitation system for mitigating loneliness

Patent number: 11803708

Abstract: The present disclosure describes a conversation facilitation system for facilitating conversation-based social interactions to improve senior health, one or more operations and functions being efficiently achieved via this system comprise: receiving a dialog act of a conversation, applying natural language understanding (NLU) processing on the dialog act, computing a conversation metric, and generating a result of the conversation to conclude the conversation based on the conversation metric.

Type: Grant

Filed: August 16, 2021

Date of Patent: October 31, 2023

Assignee: CLEARCARE, INC.

Inventors: Geoffrey Nudd, David Cristman, John Taylor, Jonathan J. Hull
Speech-based selection of augmented reality content

Patent number: 11798550

Abstract: Aspects of the present disclosure involve a system comprising a computer-readable storage medium storing a program and method for displaying augmented reality content. The program and method provide for receiving, by a device, speech input to select augmented reality content for display, determining at least one keyword included in the speech input; identifying, from plural augmented reality content items, an augmented reality content item corresponding to the at least one keyword; and displaying the augmented reality content item with an image captured by a camera of the device.

Type: Grant

Filed: March 24, 2021

Date of Patent: October 24, 2023

Assignee: Snap Inc.

Inventors: Joseph Timothy Fortier, Celia Nicole Mourkogiannis, Evan Spiegel, Kaveh Anvaripour
Contextual justification for a virtual assistant response

Patent number: 11798532

Abstract: In an approach to providing contextual justification for a virtual assistant response, one or more computer processors receive a first voice command from a user. One or more computer processors determine one or more boundary conditions associated with the first voice command. Based on the one or more boundary conditions, one or more computer processors determine a first response to the first voice command and a contextual justification of the first response. One or more computer processors respond to the user with the response to the first voice command and the contextual justification of the response.

Type: Grant

Filed: June 7, 2021

Date of Patent: October 24, 2023

Assignee: International Business Machines Corporation

Inventors: Clement Decrop, Tushar Agrawal, Jeremy R. Fox, Sarbajit K. Rakshit, Raghuveer Prasad Nagar, Jagadesh Ramaswamy Hulugundi
Method and an equipment for configuring a service

Patent number: 11790905

Abstract: An equipment and a method for configuring a service on an equipment. A method includes receiving a first voice input from a user to configure an equipment with a service. The equipment is configured with a voice-bot to interact with the user. The method also includes validating the first voice input, initiating configuration of the service and outputting a first voice response based on the validation of the first voice input. The method includes receiving a second voice input from the user in response to the first voice response and validating the second voice input. The method includes outputting a second voice response based on the validation of the second voice input and configuring the service on the equipment based on the voice inputs from the user.

Type: Grant

Filed: December 7, 2020

Date of Patent: October 17, 2023

Assignee: CARRIER CORPORATION

Inventors: Karthikeyan Loganathan, Akil Vivek Jalisatgi
System with post-conversation representation, electronic device, and related methods

Patent number: 11790887

Abstract: System, electronic device, and related methods, in particular a method of operating a system comprising an electronic device is disclosed, the method comprising obtaining one or more audio signals including a first audio signal of a first conversation; determining first speaker metric data of a first speaker based on the first audio signal, the first speaker metric data including first primary speaker metric data; detecting a termination of the first conversation; in accordance with detecting the termination of the first conversation, determining a first post-conversation representation based on the first speaker metric data; and outputting, via the interface of the electronic device, the first post-conversation representation.

Type: Grant

Filed: November 22, 2021

Date of Patent: October 17, 2023

Inventors: Christian Lillelund, Anders Hvelplund, Ali Özkil, Florian Eyben
Image forming apparatus, control method therefor, and program

Patent number: 11770490

Abstract: An image forming apparatus includes a plurality of devices configured to perform different job processing, a control unit configured to control job processing performed by each device, a reception unit configured to receive an instruction for causing the control unit to shift to a state where the job processing is capable of being performed, and a power control unit configured to, when the control unit is shifted to a stand-by state in response to receiving the instruction, supply a power to a device specified based on a job processing function corresponding to an initial screen to be displayed.

Type: Grant

Filed: June 29, 2020

Date of Patent: September 26, 2023

Assignee: Canon Kabushiki Kaisha

Inventor: So Yokomizo
Lower latency speech processing

Patent number: 11756538

Abstract: Devices and techniques are generally described for pre-caching of speech processing feature data. In various examples, first data indicating source data is received from a first speech processing component. The source data may be used to generate first feature data. In various examples, a first request to process first input data is received. A second speech processing component may generate the source data during processing of the first input data. The first feature data may be generated using the source data. The first feature data may be sent to the first speech processing component. In some examples, the first speech processing component may store the first feature data in a first cache local to the first speech processing component.

Type: Grant

Filed: December 10, 2019

Date of Patent: September 12, 2023

Assignee: AMAZON TECHNOLOGIES, INC.

Inventors: Carl Joshua Dell, Timothy Kay Cheng, Scott G. LeBaron
Method and system for acoustic model conditioning on non-phoneme information features

Patent number: 11741943

Abstract: A method and system for acoustic model conditioning on non-phoneme information features for optimized automatic speech recognition is provided. The method includes using an encoder model to encode sound embedding from a known key phrase of speech and conditioning an acoustic model with the sound embedding to optimize its performance in inferring the probabilities of phonemes in the speech. The sound embedding can comprise non-phoneme information related to the key phrase and the following utterance. Further, the encoder model and the acoustic model can be neural networks that are jointly trained with audio data.

Type: Grant

Filed: April 7, 2021

Date of Patent: August 29, 2023

Assignee: SoundHound, Inc

Inventors: Zizu Gowayyed, Keyvan Mohajer
Speaker diarization using speaker embedding(s) and trained generative model

Patent number: 11735176

Abstract: Speaker diarization techniques that enable processing of audio data to generate one or more refined versions of the audio data, where each of the refined versions of the audio data isolates one or more utterances of a single respective human speaker. Various implementations generate a refined version of audio data that isolates utterance(s) of a single human speaker by generating a speaker embedding for the single human speaker, and processing the audio data using a trained generative model—and using the speaker embedding in determining activations for hidden layers of the trained generative model during the processing. Output is generated over the trained generative model based on the processing, and the output is the refined version of the audio data.

Type: Grant

Filed: March 29, 2021

Date of Patent: August 22, 2023

Assignee: GOOGLE LLC

Inventors: Ignacio Lopez Moreno, Luis Carlos Cobo Rus
Memory allocation for keyword spotting engines

Patent number: 11727919

Abstract: Network microphone devices configured to detect keywords can include microphones for capturing sound samples. Features can be extracted from the sound samples by storing the sound samples in a first portion of a dynamic-access memory block, performing first computations based on spectral coefficients of the sound samples using a second portion of the memory block, and storing results of the first computations as extracted features in a third portion of the memory block. The second and third portions of the memory block can be designated as temporary memory. The extracted features are then processed using a neural network by storing the extracted features in a fourth portion of the memory block, performing second computations on the extracted features using the temporary memory, the second computations comprising computing at least one layer of the neural network, and storing an output of the neural network as a classification in the temporary memory.

Type: Grant

Filed: May 19, 2021

Date of Patent: August 15, 2023

Assignee: Sonos, Inc.

Inventor: Hubert de Taffanel de La Jonquière
Pre-emptively initializing an automated assistant routine and/or dismissing a scheduled alarm

Patent number: 11727930

Abstract: Implementations set forth herein relate to initializing performance of an automated assistant routine and/or dismissing an alarm pre-emptively according to satisfaction of one or more conditions. A condition can be satisfied by a user acknowledging the alarm when the alarm is going off, or causing the alarm to be dismissed prior to a time at which the alarm was scheduled for. The user can cause the alarm to be dismissed pre-emptively by interacting with the automated assistant prior to the time the alarm was scheduled for and/or interacting with a device, which is known to the automated assistant, prior to the time that the alarm was scheduled for. In this way, actions that cause an alarm to be dismissed can be recognized and used to initialize other processes, such as an automated assistant routine, thereby reducing a number of inputs needed from a user.

Type: Grant

Filed: May 17, 2021

Date of Patent: August 15, 2023

Assignee: GOOGLE LLC

Inventors: Nevzat Topcu, Michael Andrew Goodman
Real time correction of accent in speech audio signals

Patent number: 11715457

Abstract: Systems and methods for real-time correction of an accent in a speech audio signal are provided. A method includes dividing the speech audio signal into a stream of input chunks, an input chunk from the stream of input chunks including a pre-defined number of frames of the speech audio signal, extracting, by an acoustic features extraction module from the input chunk and a context associated with the input chunk, acoustic features, the context is a pre-determined number of the frames preceding the input chunk in the stream; extracting, by a linguistic features extraction module from the input chunk and the context, linguistic features, receiving a speaker embedding for a human speaker, providing the speaker embedding, the acoustic features, and the linguistic features to a synthesis module to generate a melspectrogram with a reduced accent, providing the melspectrogram to a vocoder to generate an output chunk of an output audio signal.

Type: Grant

Filed: December 19, 2022

Date of Patent: August 1, 2023

Assignee: Intone Inc.

Inventors: Andrei Golman, Dmitrii Sadykov
Efficient streaming non-recurrent on-device end-to-end model

Patent number: 11715458

Abstract: An ASR model includes a first encoder configured to receive a sequence of acoustic frames and generate a first higher order feature representation for a corresponding acoustic frame in the sequence of acoustic frames. The ASR model also includes a second encoder configured to receive the first higher order feature representation generated by the first encoder at each of the plurality of output steps and generate a second higher order feature representation for a corresponding first higher order feature frame. The ASR model also includes a decoder configured to receive the second higher order feature representation generated by the second encoder at each of the plurality of output steps and generate a first probability distribution over possible speech recognition hypothesis. The ASR model also includes a language model configured to receive the first probability distribution over possible speech hypothesis and generate a rescored probability distribution.

Type: Grant

Filed: May 10, 2021

Date of Patent: August 1, 2023

Assignee: Google LLC

Inventors: Tara Sainath, Arun Narayanan, Rami Botros, Yanzhang He, Ehsan Variani, Cyril Allauzen, David Rybach, Ruoming Pang, Trevor Strohman

prev 1 2 3 4 5 6 … next