Patents Examined by Oluwadamilola M. Ogunbiyi

Personal assistant for facilitating interaction routines

Patent number: 11537947

Abstract: In one example, the present disclosure describes a device, computer-readable medium, and method for automatically learning and facilitating interaction routines involving at least one human participant. In one example, a method includes learning an interaction routine conducted between a human user and a second party, wherein the interaction routine comprises a series of prompts and responses designed to identify and deliver desired information, storing a template of the interaction routine based on the learning, wherein the template includes at least a portion of the series of prompts and responses, detecting, in the course of a new instance of the interaction routine, at least one prompt from the second party that requests a response from the human user, and using the template to provide a response to the prompt so that involvement of the human user in the new instance of the interaction routine is minimized.

Type: Grant

Filed: April 20, 2020

Date of Patent: December 27, 2022

Assignee: AT&T Intellectual Property I, L.P.

Inventors: Harry Blanchard, Lan Zhang, Gregory Pulz
Speech-to-text system

Patent number: 11532308

Abstract: Systems and methods for processing speech transcription in a speech processing system are disclosed. A first transcription of a first utterance is received. In response to receiving an indication of an erroneous transcribed word in the first transcription, a control circuitry automatically activates an audio receiver for receiving a second utterance. In response to receiving the second utterance, an audio file of the second utterance and an indication of a location of the erroneous transcribed word within the first transcription is transmitted to a speech recognition system for a second transcription of the second utterance. Subsequently, the erroneous transcribed word in the first transcription is replaced with a transcribed word from the second transcription.

Type: Grant

Filed: May 4, 2020

Date of Patent: December 20, 2022

Assignee: ROVI GUIDES, INC.

Inventors: Sukanya Agarwal, Vikram Makam Gupta
On-board agent system, on-board agent system control method, and storage medium

Patent number: 11508370

Abstract: An on-board agent system includes: a plurality of agent functional units, each of the plurality of agent functional units being configured to provide a service including outputting a response using voice to an output unit according to an utterance of an occupant of a vehicle; and a common operator configured to be shared by the plurality of agent functional units and provided in the vehicle, wherein, when an operation is executed on the common operator with an operation pattern set to correspond to each of the plurality of agent functional units, an agent functional unit corresponding to the operation pattern of the executed operation is activated.

Type: Grant

Filed: March 3, 2020

Date of Patent: November 22, 2022

Assignee: HONDA MOTOR CO., LTD.

Inventors: Sawako Furuya, Yoshifumi Wagatsuma, Hiroki Nakayama, Kengo Naiki, Yusuke Oi
Method and apparatus for processing speech

Patent number: 11488603

Abstract: Embodiments of the present disclosure provide a method and apparatus for processing a speech. The method may include: acquiring an original speech; performing speech recognition on the original speech, to obtain an original text corresponding to the original speech; associating a speech segment in the original speech with a text segment in the original text; recognizing an abnormal segment in the original speech and/or the original text; and processing a text segment indicated by the abnormal segment in the original text and/or the speech segment indicated by the abnormal segment in the original speech, to generate a final speech. A speech segment in the original speech is associated with a text segment in the original text to realize visual processing of the speech.

Type: Grant

Filed: December 11, 2019

Date of Patent: November 1, 2022

Assignee: Beijing Baidu Netcom Science and Technology Co., Ltd.

Inventors: Wanqi Tang, Jiamei Kang, Lixia Zeng, Yijing Zhou, Hanmei Xie, Lina Zhu
Voice control method, voice control device, and computer-executable non-volatile storage medium

Patent number: 11482218

Abstract: A voice control method, includes: acquiring a voice input information; recognizing the voice input information to obtain a voice command; based on the voice command, determining a control corresponding to the voice command by a test framework calling unit, where the test framework calling unit is not in an application program in which the control is coded; and executing a function corresponding to the control. A voice control device and a computer-executable non-volatile storage medium are further provided.

Type: Grant

Filed: January 22, 2019

Date of Patent: October 25, 2022

Assignee: Beijing BOE Technology Development Co., Ltd.

Inventor: Yingjie Li
Frame loss management in an FD/LPD transition context

Patent number: 11475901

Abstract: A method for decoding a digital signal encoded using predictive coding and transform coding, comprising the following steps: predictive decoding of a preceding frame of the digital signal, encoded by a set of predictive coding parameters; detecting the loss of a current frame of the encoded digital signal; generating by prediction, from at least one predictive coding parameter encoding the preceding frame, a frame for replacing the current frame; generating by prediction, from at least one predictive coding parameter encoding the preceding frame, an additional segment of digital signal; temporarily storing said additional segment of digital signal.

Type: Grant

Filed: February 5, 2020

Date of Patent: October 18, 2022

Assignee: ORANGE

Inventors: Julien Faure, Stephane Ragot
Computer apparatus and method implementing sound detection with an image capture system

Patent number: 11468904

Abstract: A computing device comprising a processor, the processor configured to: receive, from an image capture system, an image captured in an environment and image metadata associated with the image, the image metadata comprising an image capture time; receive a sound recognition message from a sound recognition module, the sound recognition message comprising (i) a sound recognition identifier indicating a target sound or scene that has been recognised based on captured audio data captured in the environment, and (ii) time information associated with the sound recognition identifier; detect that the target sound or scene occurred at a time that the image was captured based on the image metadata and the time information in the sound recognition message; and output a camera control command to said image capture system based on said detection.

Type: Grant

Filed: December 18, 2019

Date of Patent: October 11, 2022

Assignee: AUDIO ANALYTIC LTD

Inventors: Christopher James Mitchell, Sacha Krstulovic, Neil Cooper, Julian Harris
Audio encoder, audio decoder, method for encoding an audio information, method for decoding an audio information and computer program using a detection of a group of previously-decoded spectral values

Patent number: 11443752

Abstract: An audio decoder for providing a decoded audio information includes a arithmetic decoder for providing a plurality of decoded spectral values on the basis of an arithmetically-encoded representation of the spectral values and a frequency-domain-to-time-domain converter for providing a time-domain audio representation using the decoded spectral values. The arithmetic decoder is configured to select a mapping rule describing a mapping of a code value onto a symbol code in dependence on a context state. The arithmetic decoder is configured to determine or modify the current context state in dependence on a plurality of previously-decoded spectral values. The arithmetic decoder is configured to detect a group of a plurality of previously-decoded spectral values, which fulfill, individually or taken together, a predetermined condition regarding their magnitudes, and to determine the current context state in dependence on a result of the detection. An audio encoder uses similar principles.

Type: Grant

Filed: December 18, 2017

Date of Patent: September 13, 2022

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventors: Guillaume Fuchs, Vignesh Subbaraman, Nikolaus Rettelbach, Markus Multrus, Marc Gayer, Patrick Warmbold, Christian Griebel, Oliver Weiss
Autonomously motile device with speech commands

Patent number: 11417328

Abstract: An autonomously motile device may be controlled by speech received by a user device. A first speech-processing system associated with the user device may determine that audio data includes a representation of a command; a second speech-processing system associated with the autonomously motile device may determine that the command should be executed by the autonomously motile device. A network connection is established between the user device and the autonomously motile device, and a device manager authorizes execution of the command.

Type: Grant

Filed: December 9, 2019

Date of Patent: August 16, 2022

Assignee: Amazon Technologies, Inc.

Inventors: Anil Kumar Katta, Amy Marie Whitberg, Xiaoqing Jing, Swetha Bijoy, Swati S. Rao, Robert Franklin Ebert
Dialogue system, dialogue method, and storage medium

Patent number: 11417319

Abstract: According to one embodiment, a dialogue system includes a setting apparatus and a processing apparatus. The setting apparatus sets in advance a plurality of words that are in impossible combination relationships to each other. The processing apparatus acquires speech of a user, and when a speech recognition result of an object included in the speech includes a word combination included in the plurality of words that are in impossible combination relationships to each other, output a notification to the user that processing of the object cannot be carried out.

Type: Grant

Filed: February 20, 2018

Date of Patent: August 16, 2022

Assignee: Kabushiki Kaisha Toshiba

Inventors: Takami Yoshida, Kenji Iwata, Yuka Kobayashi, Masami Akamine
Initiating conversation monitoring system action based on conversational content

Patent number: 11417337

Abstract: Techniques for initiating system actions based on conversational content are disclosed. A system identifies a first conversational moment type. The first conversational moment type is defined by a first set of one or more conversational conditions. The system receives a user-selected action to be performed by the system in response to detecting conversational moments of the first conversational moment type. The system stores the user-selected action in association with the first conversational moment type. The system performs the user-selected action in response to detecting the conversational moments of the first conversational moment type.

Type: Grant

Filed: August 12, 2021

Date of Patent: August 16, 2022

Assignee: CRESTA INTELLIGENCE INC.

Inventor: Tianlin Shi
Maintainable and scalable pipeline for automatic speech recognition language modeling

Patent number: 11410658

Abstract: Audio data saved at the end of client interactions are sampled, analyzed for pauses in speech, and sliced into stretches of acoustic data containing human speech between those pauses. The acoustic data are accompanied by machine transcripts made by VoiceAI. A suitable distribution of data useful for training and testing are stipulated during data sampling by applying certain filtering criteria. The resulting datasets are sent for transcription by a human transcriber team. The human transcripts are retrieved, some post-transcription processing and cleaning are performed, and the results are added to datastores for training and testing an acoustic model.

Type: Grant

Filed: October 29, 2019

Date of Patent: August 9, 2022

Assignee: Dialpad, Inc.

Inventors: Eddie Yee Tak Ma, James Palmer, Kevin James, Etienne Manderscheid
Interactive system for hearing devices

Patent number: 11412333

Abstract: In an audio signal, one or more processing circuits recognize spoken content in a user's own speech signal using speech recognition and natural language understanding. The spoken content describes a listening difficulty of the user. The one or more processing circuits generate, based on the spoken content, one or more actions for hearing devices and feedback for the user. The one or more actions attempt to resolve the listening difficulty. Additionally, the one or more processing circuits convert the user feedback to verbal feedback using speech synthesis and transmit the one or more actions and the verbal feedback to the hearing devices via a body-worn device. The hearing devices are configured to perform the one or more actions and play back the verbal feedback to the user.

Type: Grant

Filed: November 15, 2018

Date of Patent: August 9, 2022

Assignee: Starkey Laboratories, Inc.

Inventors: Tao Zhang, Eric Durant, Dean G. Meyer, Martin McKinney, Matthew D. Kleffner, Dominic Perz, Karrie Recker
Dynamic skill endpoint

Patent number: 11410659

Abstract: This disclosure proposes systems and methods employing dynamic skill endpoints by allowing skills to register themselves with a language processing system. The language processing system allows the skill system to open a persistent network connection to the language processing system. This connection does not require the machine(s) running the skill system to have an Internet routable address; rather the skill system can contact the language processing system, which can remain at a static address, through any local routers or firewalls which may block connections from being initiated from outside the local area network. This registration opens the connection between the skill system and the language processing system. When the language processing system receives a skill invocation request indicating the skill, the language processing system can check its registry for a dynamic endpoint corresponding to the skill, and route the request over the network connection to the registered endpoint.

Type: Grant

Filed: March 30, 2020

Date of Patent: August 9, 2022

Assignee: Amazon Technologies, Inc.

Inventors: Veer Yuganter Singh, Saravana Prasad Stalin, Sabrina Chandrasekaran
Audio encoder and decoder using a frequency domain processor, a time domain processor, and a cross processing for continuous initialization

Patent number: 11410668

Abstract: An audio encoder for encoding an audio signal includes: a first encoding processor for encoding a first audio signal portion in a frequency domain, wherein the first encoding processor includes: a time frequency converter for converting the first audio signal portion into a frequency domain representation having spectral lines up to a maximum frequency of the first audio signal portion; a spectral encoder for encoding the frequency domain representation; a second encoding processor for encoding a second different audio signal portion in the time domain; a cross-processor for calculating, from the encoded spectral representation of the first audio signal portion, initialization data of the second encoding processor, so that the second encoding processing is initialized to encode the second audio signal portion immediately following the first audio signal portion in time in the audio signal; a controller configured for analyzing the audio signal and for determining, which portion of the audio signal is the firs

Type: Grant

Filed: March 1, 2019

Date of Patent: August 9, 2022

Inventors: Sascha Disch, Martin Dietz, Markus Multrus, Guillaume Fuchs, Emmanuel Ravelli, Matthias Neusinger, Markus Schnell, Benjamin Schubert, Bernhard Grill
Speech-to-analytics framework with support for large n-gram corpora

Patent number: 11404053

Abstract: An apparatus includes processor(s) to: generate a set of candidate n-grams based on probability distributions from an acoustic model for candidate graphemes of a next word most likely spoken following at least one preceding word spoken within speech audio; provide the set of candidate n-grams to multiple devices; provide, to each node device, an indication of which candidate n-grams are to be searched for within the n-gram corpus by each node device to enable searches for multiple candidate n-grams to be performed, independently and at least partially in parallel, across the node devices; receive, from each node device, an indication of a probability of occurrence of at least one candidate n-gram within the speech audio; based on the received probabilities of occurrence, identify the next word most likely spoken within the speech audio; and add the next word most likely spoken to a transcript of the speech audio.

Type: Grant

Filed: July 8, 2021

Date of Patent: August 2, 2022

Assignee: SAS INSTITUTE INC.

Inventors: Xiaozhuo Cheng, Xu Yang, Xiaolong Li, Biljana Belamaric Wilsey, Haipeng Liu, Jared Peterson
Communication terminal, sharing system, display control method, and non-transitory computer-readable medium

Patent number: 11398237

Abstract: A communication terminal is communicable with a conversion system. The communication terminal includes circuitry configured to: receive a selection of one of a first mode and a second mode, the first mode being a mode in which audio data obtained based on sound collected by a sound collecting device is converted into text data, the second mode being a mode in which audio data obtained based on sound to be output from a sound output device is converted into text data, the audio data being relating to content obtained during an event being conducted; transmit, to the conversion system, audio data corresponding to selected one of the first mode and the second mode; receive, from the conversion system, text data converted from the transmitted audio data; and control a display to display text based on the received text data.

Type: Grant

Filed: February 20, 2020

Date of Patent: July 26, 2022

Assignee: RICOH COMPANY, LTD.

Inventor: Masaaki Kagawa
Conversational system for recognizing, understanding, and acting on multiple intents and hypotheses

Patent number: 11393475

Abstract: A conversational system that recognizes, understands, and acts on multiple intents that may be explicit or implicit during conversations with humans. During a conversation, one or more utterances are received and processed through a plurality of machine learning algorithms to establish precise meanings, additional intentions, and alternative hypothesis. Using a combination of machine learning algorithms and datastores, conversations are interpreted as intended and may diverge where needed or desired, delivering a more useful, natural, and human-like dialogue between machines and people.

Type: Grant

Filed: January 13, 2021

Date of Patent: July 19, 2022

Assignee: ARTIFICIAL SOLUTIONS IBERIA S.L

Inventors: Eric Aili, Ramazan Gurbuz, Andreas Wieweg
System and method for controlling an application using natural language communication

Patent number: 11393463

Abstract: A system and method are disclosed for setting up a communication link between a device or application and a system with a controller. The controller can collect and send information to the application. A user interfaces with the controller to access the functionality of the application through providing commands to the controller. The system allows the user to interface with multiple applications.

Type: Grant

Filed: April 19, 2019

Date of Patent: July 19, 2022

Assignee: SoundHound, Inc.

Inventors: Timothy P. Stonehocker, Kathleen Worthington McMahon
Method and device for playing voice, electronic device, and storage medium

Patent number: 11379180

Abstract: A method for playing voice, which is applied to a webcast server, the method comprising: receiving voice data sent by at least one first electronic device for obtaining a voice data set, the first electronic device having a first preset authority, and the voice data set comprising at least one piece of the voice data; receiving audio-video data sent by a second electronic device, the second electronic device having a second preset authority, the audio-video data comprising the voice data selected for playback, wherein the voice data selected for playback comprises any voice data of the voice data set clicked for playback; pushing the audio-video data to each first electronic device.

Type: Grant

Filed: September 4, 2019

Date of Patent: July 5, 2022

Assignee: BEIJING DAJIA INTERNET INFORMATION TECHNOLOGY CO., LTD

Inventors: Yang Zhang, Meizhuo Li

prev 1 2 3 4 5 6 7 … next