Patents Examined by Pierre-Louis Desir

Language-agnostic multilingual modeling using effective script normalization

Patent number: 11615779

Abstract: A method includes obtaining a plurality of training data sets each associated with a respective native language and includes a plurality of respective training data samples. For each respective training data sample of each training data set in the respective native language, the method includes transliterating the corresponding transcription in the respective native script into corresponding transliterated text representing the respective native language of the corresponding audio in a target script and associating the corresponding transliterated text in the target script with the corresponding audio in the respective native language to generate a respective normalized training data sample.

Type: Grant

Filed: January 19, 2021

Date of Patent: March 28, 2023

Assignee: Google LLC

Inventors: Arindrima Datta, Bhuvana Ramabhadran, Jesse Emond, Brian Roark
Speech recognition using natural language understanding related knowledge via deep feedforward neural networks

Patent number: 11615785

Abstract: A framework ranks multiple hypotheses generated by one or more ASR engines for each input speech utterance. The framework jointly implements ASR improvement and NLU. It makes use of NLU related knowledge to facilitate the ranking of competing hypotheses, and outputs the top-ranked hypothesis as the improved ASR result together with the NLU results of the speech utterance. The NLU results include intent detection results and the slot filling results.

Type: Grant

Filed: May 5, 2020

Date of Patent: March 28, 2023

Inventors: Zhengyu Zhou, Xuchen Song
Corpus cleaning method and corpus entry system

Patent number: 11580299

Abstract: The present disclosure provides a corpus cleaning method and a corpus entry system. The method includes: obtaining an input utterance; generating a predicted value of an information amount of each word in the input utterance according to the context of the input utterance using a pre-trained general model; and determining redundant words according to the predicted value of the information amount of each word, and determining whether to remove the redundant words from the input utterance. In such a manner, the objectivity and accuracy of corpus cleaning can be improved.

Type: Grant

Filed: May 29, 2020

Date of Patent: February 14, 2023

Assignee: UBTECH ROBOTICS CORP LTD

Inventors: Li Ma, Youjun Xiong
System and method for detecting incorrect triple

Patent number: 11562133

Abstract: Provided is an incorrect triple detection system including a triple selector configured to select a target triple (subject, type, object) in a knowledge base, a sampler configured to create a sentence model by connecting object triples sharing entities included in the target triple, a model builder configured to embed the sentence model into a vector space to create a training entity vector and build an embedding model, and an incorrect triple detector configured to detect an incorrect triple by inputting a test triple into the embedding model.

Type: Grant

Filed: November 26, 2019

Date of Patent: January 24, 2023

Assignee: FOUNDATION OF SOONGSIL UNIV-INDUSTRY COOPERATION

Inventors: Young Tack Park, Wan Gon Lee, Jagvaral Batselem, Hyun Young Choi, Ji Houn Hong
Frictionless handoff of audio content playing using overlaid ultrasonic codes

Patent number: 11557303

Abstract: In a frictionless handoff of audio content playing, a client device listens for ultrasonic audio. The client hears a playing of a modified audio content by another client device, which includes audio content and an ultrasonic audio quick response (QR) code overlaid on the audio content. The ultrasonic audio QR code includes location information corresponding to a location in the audio content. The client device extracts the ultrasonic audio QR code from the modified audio content. After determining that the playing of the modified audio content has stopped, the client device receives a command to resume playing of the audio content on the client device. In, response to the command, the client device retrieves location information in a last extracted ultrasonic audio QR code and plays the audio content starting at a location in the audio content corresponding to the location information in the last extracted ultrasonic audio QR code.

Type: Grant

Filed: July 30, 2019

Date of Patent: January 17, 2023

Assignee: International Business Machines Corporation

Inventors: Andrew Hicks, Brendan Bull, Scott Robert Carrier, Dwi Sianto Mansjur
System and method for language-based service hailing

Patent number: 11545140

Abstract: Systems and methods are provided for language-based service hailing. Such system may comprise one or more processors and a memory storing instructions that, when executed by the one or more processors, cause the computing system to obtain a plurality of speech samples, each speech sample comprising one or more words spoken in a language, train a neural network model with the speech samples to obtain a trained model for determining languages of speeches, obtain a voice input, identify at least one language corresponding to the voice based at least on applying the trained model to the voice input, and communicate a message in the identified language.

Type: Grant

Filed: July 31, 2017

Date of Patent: January 3, 2023

Assignee: Beijing DiDi Infinity Technology and Development Co., Ltd.

Inventors: Fengmin Gong, Xiulin Li
Systems and methods for seamless application of autocorrection and provision of review insights through adapted user interface

Patent number: 11537789

Abstract: The present disclosure relates to processing operations configured to provide, through an adapted user interface of an application/service, seamless application of autocorrection for an electronic document and provision of review insights into the autocorrections. In addition to applying autocorrections, processing described herein selectively determines when to apply an autocorrection (e.g., while a user is entering an input or after) and further highlights autocorrections, through a user interface, in a manner that instantly lets a user know that an autocorrection has occurred and/or what type of autocorrection is being applied.

Type: Grant

Filed: May 23, 2019

Date of Patent: December 27, 2022

Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Hany Grees Gerges, Olivier Gauthier, Kaushik Ramaiah Narayanan
Method and apparatus for audio data processing

Patent number: 11538471

Abstract: Embodiments of the disclosure provide methods and apparatuses processing audio data. The method can include: acquiring audio data by an audio capturing device, determining feature information of an enclosure in which the audio capturing device is located, and reverberating the feature information into the audio data.

Type: Grant

Filed: January 31, 2019

Date of Patent: December 27, 2022

Assignee: Alibaba Group Holding Limited

Inventors: Shaofei Xue, Biao Tian
Acquiring speech features for predicting emotional severity of adverse events on individuals

Patent number: 11508396

Abstract: Systems and methods of related to a voice-based system used to determine the severity of emotional distress within an audio recording of an individual is provided. In one non-limiting example, a system comprises a computing device that is configured to receive an audio sample that includes an utterance of a user. Feature extraction is performed on the audio sample to extract a plurality of acoustic emotion features using a base model. Emotion level predictions are generated for an emotion type based at least in part on the acoustic emotion features provided to an emotion specific model. An emotion classification for the audio sample is determined based on the emotion level predictions. The emotion classification comprises the emotion type and a level associated with the emotion type.

Type: Grant

Filed: December 14, 2021

Date of Patent: November 22, 2022

Assignee: TQINTELLIGENCE, INC.

Inventors: Yared Alemu, Desmond Caulley, Ashutosh A. Joshi
Method for operating a motor vehicle having an operating device

Patent number: 11501767

Abstract: The invention relates to a method for operating a motor vehicle having an operating device, which includes a speech recognition and language determination device. A recognition of a voice input of a user of the motor vehicle, and a check as to whether a language of the voice input corresponds to the first operating language take place in a first operating mode with a first operating language. Depending on a result of the checking process, a confidence value is assigned to the voice input, which describes a probability with which the language of the voice input is the second operating language. Depending on the assigned confidence value, a query signal is generated, which describes a request, understandable in a second operating language, to the user for indicating the operating mode to be set or the operating language to be set. In response to a received operating signal, the operating mode to be set or the operating language to be set are set.

Type: Grant

Filed: November 28, 2017

Date of Patent: November 15, 2022

Assignee: Audi AG

Inventors: Christian Al Haddad, Stefan Maiwald
Systems and methods for improved accuracy of bullying or altercation detection or identification of excessive machine noise

Patent number: 11450327

Abstract: Systems and methods for identifying potential bullying are disclosed. In various aspects, a system for identifying potential bullying includes a sound detector configured to provide samples of sounds over time, a processor, and a memory storing instructions. The instructions, when executed by the processor, cause the system to determine that a noise event has occurred by processing the samples to determine that the sounds exceed a sound level threshold over a time period that exceeds a time period threshold, process the samples to provide frequency spectrum information of the noise event, determine whether the noise event is a potential bullying occurrence based on comparing the frequency spectrum information of the noise event and at least one frequency spectrum profile, and initiate a bullying notification in a case of determining that the noise event is a potential bullying occurrence.

Type: Grant

Filed: April 20, 2021

Date of Patent: September 20, 2022

Assignee: SOTER TECHNOLOGIES, LLC

Inventor: Cary Chu
Method for automatically translating raw data into real human voiced audio content

Patent number: 11430423

Abstract: A method for automatically translating raw data into real human voiced audio content is provided according to an embodiment of the present disclosure. The method may comprise ingesting data, separating the data into or associating the data with a data type, and creating a list of descriptive data associated with the data type. In some embodiments, the method further comprises compiling audio phrases types associated with the descriptive data, associating a pre-recorded audio file with each audio phrase, and merging a plurality of pre-recorded audio files to create a final audio file.

Type: Grant

Filed: April 17, 2019

Date of Patent: August 30, 2022

Assignee: Weatherology, LLC

Inventor: Derek Christopher Heit
Speech synthesis method and apparatus and computer readable storage medium using the same

Patent number: 11417316

Abstract: The present disclosure provides a speech synthesis method as well as an apparatus and a computer readable storage medium using the same. The method includes: obtaining a to-be-synthesized text, and extracting to-be-processed Mel spectrum features of the to-be-synthesized text through a preset speech feature extraction algorithm; inputting the to-be-processed Mel spectrum features into a preset ResUnet network model to obtain first intermediate features; performing an average pooling and a first down sampling on the to-be-processed Mel spectrum features to obtain second intermediate features; taking the second intermediate features and the first intermediate features output by the ResUnet network model as an input to perform a deconvolution and a first up sampling so as to obtain target Mel spectrum features corresponding to the to-be-processed Mel spectrum features; and converting the target Mel spectrum features into a target speech corresponding to the to-be-synthesized text.

Type: Grant

Filed: December 8, 2020

Date of Patent: August 16, 2022

Assignee: UBTECH ROBOTICS CORP LTD

Inventors: Dongyan Huang, Leyuan Sheng, Youjun Xiong
Keyboard instrument and method performed by computer of keyboard instrument

Patent number: 11417312

Abstract: A keyboard instrument includes at least one processor that determines a first pattern of intonation to be applied to a first time segment of a voice data on the basis of a first user operation on a first operation element, causes a first singing voice for the first time segment to be digitally synthesized from the first segment data in accordance with the determined first pattern of intonation, determines a second pattern of intonation to be applied to the second time segment of the voice data on the basis of a second user operation on a second operation element, and causes a second singing voice for the second time segment to be digitally synthesized from the second segment data in accordance with the determined second pattern of intonation.

Type: Grant

Filed: March 10, 2020

Date of Patent: August 16, 2022

Assignee: CASIO COMPUTER CO., LTD.

Inventor: Toshiyuki Tachibana
Methods and systems for voice and acupressure-based lifestyle management with smart devices

Patent number: 11410686

Abstract: In one aspect, a computerized method for implementing voice and acupressure-based lifestyle management includes the step of measuring a speed at which a user is speaking. A wearable device records the user's voice with a microphone and communicates a digital recording of the user's voice to a computer processor. The method includes the step of measuring a time spacing between a set of user's words and a length of the set of user's words. The method includes the step of determining at least one anomaly by comparing the digital recording of the user's voice with a benchmark recording of the user's voice. The method includes the step of alerting the user of the detected anomaly.

Type: Grant

Filed: July 2, 2019

Date of Patent: August 9, 2022

Assignee: VOECE, INC.

Inventor: Rashmi Panda
Speech synthesizer using artificial intelligence and method of operating the same

Patent number: 11398219

Abstract: Disclosed herein is a speech synthesizer using artificial intelligence including a memory, a communication processor configured to receive utterance information of words uttered by a user from a terminal, and a processor configured to acquire a plurality of utterance intonation phrase (IP) ratios respectively corresponding to a plurality of words uttered by the user based on the utterance information, compare a plurality of IP ratio tables respectively corresponding to a plurality of voice actors with the plurality of utterance IP ratios, acquire a plurality of non-utterance IP ratios respectively corresponding to a plurality of unuttered words based on a result of comparison, and generate a personalized synthesized speech model based on the plurality of utterance IP ratios and the plurality of non-utterance IP ratios.

Type: Grant

Filed: October 10, 2019

Date of Patent: July 26, 2022

Assignee: LG ELECTRONICS INC.

Inventor: Jonghoon Chae
Speech synthesizer using artificial intelligence, method of operating speech synthesizer and computer-readable recording medium

Patent number: 11393447

Abstract: A speech synthesizer using artificial intelligence includes a memory, a communication unit for receiving utterance information of words uttered by a user, and a processor for acquiring a plurality of utterance intonation phrase (IP) ratios respectively corresponding to a plurality of words uttered by the user based on the utterance information, acquiring a plurality of non-utterance IP ratios respectively corresponding to a plurality of unuttered words based on the utterance information and the plurality of utterance IP ratios, and generating a personalized synthesized speech model based on the plurality of utterance IP ratios and the plurality of non-utterance IP ratios. A plurality of classes indicating reading break of a word includes first to third classes. A minor class has a smallest count among the first to third classes. Each of the utterance and non-utterance IP ratios is a ratio in which a word is classified as the minor class.

Type: Grant

Filed: June 18, 2019

Date of Patent: July 19, 2022

Assignee: LG ELECTRONICS INC.

Inventor: Jonghoon Chae
Multi-channel signal encoding method, multi-channel signal decoding method, encoder, and decoder

Patent number: 11386907

Abstract: A multi-channel signal encoding method includes determining a downmixed signal of a first channel signal and a second channel signal, an initial reverberation gain parameter of the first channel signal and the second channel signal, determining a target reverberation gain parameter of the first channel signal and the second channel signal based on a correlation between the first channel signal and the downmixed signal, a correlation between the second channel signal and the downmixed signal, and the initial reverberation gain parameter, and quantizing the first channel signal and the second channel signal based on the downmixed signal and the target reverberation gain parameter, and writing a quantized first channel signal and a quantized second channel signal into a bitstream.

Type: Grant

Filed: September 27, 2019

Date of Patent: July 12, 2022

Assignee: HUAWEI TECHNOLOGIES CO., LTD.

Inventors: Zexin Liu, Lei Miao
System language switching method, readable storage medium, terminal device, and apparatus

Patent number: 11341329

Abstract: The present application relates to a system language switching method, a computer readable storage medium, a terminal device, and a device. The method includes first obtaining a preset image for setting a system language of a target terminal, then extracting text information in the image and determining a target language corresponding to the text information, and finally switching the system language of the target terminal to the target language. Through the present application, the user only needs to prepare an image for setting the system language of the target terminal in advance, for example, a piece of paper with Chinese written, and a system can obtain the text information on the image through the processes of image acquisition, text information extraction, and the like, determine that the text message is Chinese, and finally switch the system language of the target terminal to Chinese.

Type: Grant

Filed: January 31, 2018

Date of Patent: May 24, 2022

Assignee: PING AN TECHNOLOGY (SHENZHEN) CO., LTD.

Inventor: Jinsheng Cai
Virtual assistant interface for call routing

Patent number: 11343377

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for determining a transfer option for transferring a call. One of the methods include receiving, by a call assistant engine, a keyword related to information provided by a user to an agent during a call; generating, by the call assistant engine, follow-up questions to be displayed on a user device of the agent in an interactive format, the first follow-up question being generated based on the keyword, each of the following follow-up questions being generated based on an answer of the agent to the previous question; and determining, by the call assistant engine, based on answers of the agent to the follow-up questions, a transfer option for transferring the call.

Type: Grant

Filed: January 18, 2019

Date of Patent: May 24, 2022

Assignee: United Services Automobile Association (USAA)

Inventors: Philip Ryan Jensen, Everett Russell Freeman James, James Shamlin, Sheryl Lane Niemann, Shanna Limas, Samir Hojat

prev 1 2 3 4 5 6 7 8 … next