Patents Examined by Richemond Dorvil
-
Patent number: 12027162
Abstract: Teacher-student learning can be used to train a keyword spotting (KWS) model using augmented training instance(s). Various implementations include aggressively augmenting (e.g., using spectral augmentation) base audio data to generate augmented audio data, where one or more portions of the base instance of audio data can be masked in the augmented instance of audio data (e.g., one or more time frames can be masked, one or more frequencies can be masked, etc.). Many implementations include processing augmented audio data using a KWS teacher model to generate a soft label, and processing the augmented audio data using a KWS student model to generate predicted output. One or more portions of the KWS student model can be updated based on a comparison of the soft label and the generated predicted output.
Type: Grant
Filed: March 3, 2021
Date of Patent: July 2, 2024
Assignee: GOOGLE LLC
Inventors: Hyun Jin Park, Pai Zhu, Ignacio Lopez Moreno, Niranjan Subrahmanya
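The augmentation-and-distillation loop this abstract describes can be sketched as below. The function names, the uniform masking rule, and the cross-entropy distillation loss are illustrative assumptions, not the patented implementation:

```python
import math
import random

def spec_augment(features, n_time_masks=1, n_freq_masks=1, max_width=8, seed=0):
    """Aggressively augment a (time x freq) feature matrix by zeroing out
    random runs of time frames and random frequency bands."""
    rng = random.Random(seed)
    aug = [row[:] for row in features]
    t, f = len(aug), len(aug[0])
    for _ in range(n_time_masks):
        width = rng.randint(1, max_width)
        start = rng.randrange(max(1, t - width))
        for i in range(start, min(t, start + width)):
            aug[i] = [0.0] * f                      # mask whole time frames
    for _ in range(n_freq_masks):
        width = rng.randint(1, max_width)
        start = rng.randrange(max(1, f - width))
        for row in aug:
            for j in range(start, min(f, start + width)):
                row[j] = 0.0                        # mask a frequency band
    return aug

def distillation_loss(teacher_soft_label, student_probs, eps=1e-12):
    """Cross-entropy of the student's predicted distribution against the
    teacher's soft label; the student would be updated to reduce this."""
    return -sum(t * math.log(p + eps)
                for t, p in zip(teacher_soft_label, student_probs))

base = [[1.0] * 40 for _ in range(100)]     # 100 frames x 40 mel bins
augmented = spec_augment(base)
loss = distillation_loss([0.9, 0.1], [0.7, 0.3])
```

The key point is that the same augmented instance is fed to both models, so the student learns to match the teacher's soft label even when parts of the input are masked.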
-
Patent number: 12019999
Abstract: Implementations relate to determining a well-formed phrase to suggest to a user to submit in lieu of a phrase that is not well-formed. The suggestion is rendered via an interface that is provided to a client device of the user. Those implementations relate to determining that a phrase is not well-formed, identifying alternate phrases that are related to the not well-formed phrase, and scoring the alternate phrases to select one or more of them to render via the interface. Some of those implementations identify that the phrase is not well-formed based on occurrences of the phrase in documents whose creators' primary language is the language of the phrase.
Type: Grant
Filed: June 18, 2021
Date of Patent: June 25, 2024
Assignee: GOOGLE LLC
Inventors: Wangqing Yuan, David Kogan, Vincent Lacey, Guanglei Wang, Shaun Post, Bryan Christopher Horling, Michael Anthony Schuler
-
Patent number: 12019987
Abstract: Systems, apparatuses, methods, and computer program products are disclosed for distillation of a natural language processing (NLP) model. An example method includes receiving, by communications circuitry, a set of text data comprising a set of observations and predicting, by processing circuitry and using the NLP model, classifications for each observation in the text data. The example method further includes generating, by a model training engine, a balanced sampled data structure based on the predicted classifications for each observation in the text data and training, by the model training engine, a surrogate model using the balanced sampled data structure. The example method further includes identifying, by an interpreter and from the surrogate model, a set of most-influential tokens in the text data.
Type: Grant
Filed: April 28, 2021
Date of Patent: June 25, 2024
Assignee: Wells Fargo Bank, N.A.
Inventors: Ye Yu, Harsh Singhal, Wayne B. Shoumaker
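One plausible form of the "balanced sampled data structure" step is shown below; the downsample-to-rarest-class rule and all names are assumptions for illustration, not the claimed method:

```python
import random
from collections import defaultdict

def balanced_sample(observations, predicted_labels, seed=0):
    """Downsample each predicted class to the size of the rarest class so a
    surrogate model can be trained on a class-balanced data structure."""
    rng = random.Random(seed)
    by_label = defaultdict(list)
    for obs, label in zip(observations, predicted_labels):
        by_label[label].append(obs)
    n = min(len(items) for items in by_label.values())
    sampled = []
    for label, items in sorted(by_label.items()):
        for obs in rng.sample(items, n):            # equal count per class
            sampled.append((obs, label))
    return sampled

texts = ["good", "bad", "fine", "awful", "great", "poor", "ok"]
labels = ["pos", "neg", "pos", "neg", "pos", "neg", "pos"]
balanced = balanced_sample(texts, labels)
```

A surrogate (e.g., a simple linear model) trained on such a balanced set avoids skew from majority classes when the interpreter later ranks influential tokens.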
-
Patent number: 12020683
Abstract: A real-time name mispronunciation detection feature can give a speaker instant feedback any time they mispronounce another person's name in an online meeting. The feature can receive audio input of a speaker and obtain a transcript of the audio input; identify a name from the text of the transcript based on the names of meeting participants; and extract a portion of the audio input corresponding to the name identified from the text of the transcript. The feature can obtain a reference pronunciation for the name using a user identifier associated with the name, and can obtain a pronunciation score for the name based on a comparison between the reference pronunciation for the name and the portion of the audio input corresponding to the name. The feature can then determine whether the pronunciation score is below a threshold and, in response, notify the speaker of a pronunciation error.
Type: Grant
Filed: October 28, 2021
Date of Patent: June 25, 2024
Assignee: Microsoft Technology Licensing, LLC
Inventors: Tapan Bohra, Akshay Mallipeddi, Amit Srivastava, Ana Karen Parra
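A minimal sketch of the scoring-and-threshold step, assuming the comparison is an edit distance over phoneme sequences (the patent does not specify the comparison; the phoneme strings and threshold here are hypothetical):

```python
def edit_distance(a, b):
    """Levenshtein distance between two phoneme sequences."""
    dp = list(range(len(b) + 1))
    for i, x in enumerate(a, 1):
        prev, dp[0] = dp[0], i
        for j, y in enumerate(b, 1):
            prev, dp[j] = dp[j], min(dp[j] + 1, dp[j - 1] + 1,
                                     prev + (x != y))
    return dp[-1]

def pronunciation_score(reference_phonemes, spoken_phonemes):
    """1.0 for a perfect match; lower as the spoken phonemes diverge
    from the reference pronunciation."""
    dist = edit_distance(reference_phonemes, spoken_phonemes)
    return 1.0 - dist / max(len(reference_phonemes), len(spoken_phonemes), 1)

THRESHOLD = 0.8                       # illustrative cutoff
score = pronunciation_score(["R", "IH", "SH", "M", "OH", "N"],
                            ["R", "IH", "CH", "M", "AH", "N"])
notify_speaker = score < THRESHOLD    # two substituted phonemes -> notify
```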
-
Patent number: 12014143
Abstract: In various embodiments, a phrase grounding model automatically performs phrase grounding for a source sentence and a source image. The phrase grounding model determines that a first phrase included in the source sentence matches a first region of the source image based on the first phrase and at least a second phrase included in the source sentence. The phrase grounding model then generates a matched pair that specifies the first phrase and the first region. Subsequently, one or more annotation operations are performed on the source image based on the matched pair. Advantageously, the accuracy of the phrase grounding model is increased relative to prior-art solutions, where the interrelationships between phrases are typically disregarded.
Type: Grant
Filed: February 25, 2019
Date of Patent: June 18, 2024
Assignees: DISNEY ENTERPRISES, INC.; ETH Zürich (Eidgenössische Technische Hochschule Zürich)
Inventors: Pelin Dogan, Leonid Sigal, Markus Gross
-
Patent number: 12014730
Abstract: A voice processing method includes: collecting a voice signal by a microphone of an electronic device, and signal-processing the collected voice signal to obtain a first voice frame segment; performing voice recognition on the first voice frame segment to obtain a first recognition result; in response to the first recognition result not matching a target content and a plurality of tokens in the first recognition result meeting a preset condition, performing frame compensation on the first voice frame segment to obtain a second voice frame segment; and performing voice recognition on the second voice frame segment to obtain a second recognition result. A matching degree between the second recognition result and the target content is greater than a matching degree between the first recognition result and the target content.
Type: Grant
Filed: May 17, 2021
Date of Patent: June 18, 2024
Assignee: BEIJING XIAOMI MOBILE SOFTWARE CO., LTD.
Inventor: Xiangyan Xu
-
Patent number: 12001797
Abstract: A method and system for automatic topic detection in text may include receiving a text document of a corpus of documents and extracting one or more phrases from the document, based on one or more syntactic patterns. For each phrase, embodiments of the invention may: apply a word embedding neural network on one or more words of the phrase, to obtain one or more respective word embedding vectors; calculate a weighted phrase embedding vector; and compute a phrase saliency score, based on the weighted phrase embedding vector. Embodiments of the invention may subsequently produce one or more topic labels, representing one or more respective topics in the document, based on the computed phrase saliency scores, and may select one or more topic labels according to their relevance to the business domain of the corpus.
Type: Grant
Filed: May 12, 2021
Date of Patent: June 4, 2024
Inventors: Eyal Orbach, Avraham Faizakof, Arnon Mazza, Lev Haikin
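The phrase-embedding and saliency-scoring steps might look like the sketch below, assuming uniform word weights and cosine similarity to a document centroid as the saliency measure (both assumptions; the toy two-dimensional "embeddings" are purely illustrative):

```python
import math

def mean_vector(vectors):
    """Component-wise mean of a list of equal-length vectors."""
    return [sum(component) / len(vectors) for component in zip(*vectors)]

def cosine(u, v):
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu and nv else 0.0

def phrase_saliency(phrase, word_vectors, doc_centroid):
    """Embed a phrase as the (uniformly weighted) mean of its word vectors
    and score it by cosine similarity to the document centroid."""
    vecs = [word_vectors[w] for w in phrase.split() if w in word_vectors]
    if not vecs:
        return 0.0
    return cosine(mean_vector(vecs), doc_centroid)

word_vectors = {"topic": [1.0, 0.0], "detection": [0.9, 0.1],
                "banana": [0.0, 1.0]}
doc_centroid = mean_vector(list(word_vectors.values()))
scores = {p: phrase_saliency(p, word_vectors, doc_centroid)
          for p in ["topic detection", "banana"]}
```

A phrase aligned with the document's overall content scores higher than an off-topic one, which is the signal the topic labels are built from.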
-
Patent number: 11983488
Abstract: Disclosed herein are methods, systems, and computer-readable media for automatically generating and editing text. In an embodiment, a method may include receiving an input text prompt and receiving one or more user instructions. The method may also include accessing a language model based on the input text prompt and the one or more user instructions. The method may also include outputting, using the accessed language model, language model output text. The method may also include editing the input text prompt based on the language model and the one or more user instructions by replacing at least a portion of the input text prompt with the language model output text.
Type: Grant
Filed: March 14, 2023
Date of Patent: May 14, 2024
Assignee: OpenAI OpCo, LLC
Inventors: Raul Puri, Qiming Yuan, Alexander Paino, Nikolas Tezak, Nicholas Ryder
-
Patent number: 11972219
Abstract: This application discloses an intent recognition optimization processing method, apparatus, device, and storage medium, and relates to the field of internet technology. The specific implementation of the method includes: acquiring a first intent set and at least one original corpus; acquiring a first recognition result of each original corpus, where the first recognition result of any one of the original corpora includes a first intent corresponding to that corpus as recognized by the intent recognition model; acquiring a second recognition result of each original corpus, where the second recognition result of any one of the original corpora includes a second intent corresponding to that corpus obtained through artificial recognition; and performing optimization processing on the first intent set, according to the first recognition result and the second recognition result of each original corpus, to obtain a second intent set.
Type: Grant
Filed: December 30, 2020
Date of Patent: April 30, 2024
Assignees: Beijing Baidu Netcom Science Technology Co., Ltd.; BAIDU USA LLC
Inventors: Zeyu Ning, Xuchen Yao, Wenhao Fang, Bo Fu, Liqin Feng, Xiaomei Chu
-
Patent number: 11967327
Abstract: A method and a decoder device for generating a concealment audio subframe of an audio signal are provided. The method comprises generating frequency spectra on a subframe basis, where consecutive subframes of the audio signal have the property that the applied window shape of a first subframe of the consecutive subframes is a mirrored version or a time-reversed version of that of a second subframe of the consecutive subframes. Peaks of a signal spectrum of a previously received audio signal are detected for a concealment subframe, and a phase of each of the peaks is estimated. A time-reversed phase adjustment is derived based on the estimated phase and applied to the peaks of the signal spectrum to form time-reversed phase-adjusted peaks.
Type: Grant
Filed: June 4, 2020
Date of Patent: April 23, 2024
Assignee: Telefonaktiebolaget LM Ericsson (publ)
Inventors: Erik Norvell, Chamran Moradi Ashour
-
Patent number: 11961523
Abstract: Systems and methods are provided for optimizing and securing an enterprise voice service accessed by an external voice assistant device. An enterprise voice assistant installed on a client device acts as an enterprise voice service for an external voice assistant device. The enterprise voice assistant receives a voice query from the external voice assistant device. The voice query is processed using a machine learning model to extract an intent and at least one slot. The extracted intent and at least one slot are used to determine whether a response to the voice query can be generated using local enterprise data that was previously received and stored by the client device from a management server. The response is generated based on the determination by using the local enterprise data or by sending the extracted intent and at least one slot to, and receiving the response from, the management server.
Type: Grant
Filed: September 9, 2020
Date of Patent: April 16, 2024
Assignee: VMware, Inc.
Inventors: Suman Aluvala, Ramani Panchapakesan, Rohit Pradeep Shetty, Arjun Kochhar
-
Patent number: 11948566
Abstract: The present disclosure describes systems and methods for extensible search, content, and dialog management. Embodiments of the present disclosure provide a dialog system with a trained intent recognition model (e.g., a deep learning model) to receive and understand a natural language query from a user. In cases where intent is not identified for a received query, the dialog system generates one or more candidate responses that may be refined (e.g., using human-in-the-loop curation) to generate a response. The intent recognition model may be updated (e.g., retrained) accordingly. Upon receiving a subsequent query with similar intent, the dialog system may identify the intent using the updated intent recognition model.
Type: Grant
Filed: March 24, 2021
Date of Patent: April 2, 2024
Assignee: ADOBE INC.
Inventors: Oliver Brdiczka, Kyoung Tak Kim, Charat Maheshwari
-
Patent number: 11947925
Abstract: A user input in a source language is received. A set of contextual data is received. The user input is encoded into a user input feature vector. The set of contextual data is encoded into a context feature vector. The user input feature vector and the context feature vector are used to generate a fusion vector. An adaptive neural network is trained to identify a second context feature vector, based on the fusion vector. A second user input in the source language is received for translation into a target language. The adaptive neural network is used to determine, based on the second context feature vector, a second user input feature vector. The second user input feature vector is decoded, based on the source language and the target language, into a target language output. A user is notified of the target language output.
Type: Grant
Filed: May 21, 2020
Date of Patent: April 2, 2024
Assignee: International Business Machines Corporation
Inventors: Lei Mei, Kun Yan Yin, Yan Hu, Qi Ruan, Yan Feng Han
-
Patent number: 11948561
Abstract: A signal processing method determines whether or not a detected key-phrase is spoken by a wearer of a headphone. The method receives an accelerometer signal from an accelerometer in the headphone and receives a microphone signal from at least one microphone in the headphone. The method detects a key-phrase using the microphone signal and generates a voice activity detection (VAD) signal based on the accelerometer signal. The method determines whether the VAD signal indicates that the detected key-phrase is spoken by the wearer of the headphone. Responsive to determining that the VAD signal indicates that the detected key-phrase is spoken by the wearer of the headphone, the method triggers a virtual personal assistant (VPA).
Type: Grant
Filed: October 28, 2019
Date of Patent: April 2, 2024
Assignee: Apple Inc.
Inventors: Sorin V. Dusan, Sungyub D. Yoo, Dubravko Biruski
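The gating logic can be sketched as follows. The energy-threshold VAD and all thresholds are illustrative assumptions; the idea is simply that a headphone accelerometer picks up vibration only when the wearer themself is speaking:

```python
def vad_from_accelerometer(accel_frames, energy_threshold=0.05):
    """Flag frames whose mean-square accelerometer energy exceeds a
    threshold as wearer voice activity (the wearer's speech vibrates
    the headphone; a nearby talker's does not)."""
    return [sum(s * s for s in frame) / len(frame) > energy_threshold
            for frame in accel_frames]

def should_trigger_vpa(key_phrase_detected, vad_flags, min_active_frames=2):
    """Trigger the assistant only when the microphone detected the
    key-phrase AND the accelerometer VAD says the wearer was speaking."""
    return key_phrase_detected and sum(vad_flags) >= min_active_frames

quiet = [[0.01, -0.02, 0.01]] * 4       # wearer silent, key-phrase from TV
speaking = [[0.4, -0.5, 0.3]] * 4       # wearer actually talking
trigger_silent = should_trigger_vpa(True, vad_from_accelerometer(quiet))
trigger_talking = should_trigger_vpa(True, vad_from_accelerometer(speaking))
```

This is why the combination rejects false triggers: the key-phrase alone is not enough unless the VAD agrees.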
-
Patent number: 11942070
Abstract: A method, computer system, and computer program product for speech synthesis is provided. The present invention may include generating one or more final voiceprints. The present invention may include generating one or more voice clones based on the one or more final voiceprints. The present invention may include classifying the one or more voice clones into a grouping using a language model, wherein the language model is trained using manually classified uncloned voice samples. The present invention may include identifying a cluster within the grouping, wherein the cluster is identified by determining a difference between corresponding vectors of the one or more voice clones below a similarity threshold. The present invention may include generating a new archetypal voice by blending the one or more voice clones of the cluster where the difference between the corresponding vectors is below the similarity threshold.
Type: Grant
Filed: January 29, 2021
Date of Patent: March 26, 2024
Assignee: International Business Machines Corporation
Inventors: Aaron K. Baughman, Gray Franklin Cannon, Sara Perelman, Gary William Reiss, Corey B. Shelton
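The cluster-then-blend step might be sketched as below, assuming Euclidean distance between voiceprint vectors, a greedy grouping rule, and averaging as the blend; all three are illustrative choices, not the claimed method:

```python
def vector_distance(u, v):
    """Euclidean distance between two voiceprint vectors."""
    return sum((a - b) ** 2 for a, b in zip(u, v)) ** 0.5

def cluster_and_blend(voiceprints, similarity_threshold=0.5):
    """Greedily group voiceprint vectors whose pairwise distances are all
    below the threshold, then blend each cluster into one archetypal
    voice by averaging its members component-wise."""
    clusters = []
    for vp in voiceprints:
        for cluster in clusters:
            if all(vector_distance(vp, member) < similarity_threshold
                   for member in cluster):
                cluster.append(vp)
                break
        else:
            clusters.append([vp])          # no close cluster: start a new one
    return [[sum(component) / len(cluster) for component in zip(*cluster)]
            for cluster in clusters]

clones = [[0.0, 0.0], [0.1, 0.1], [5.0, 5.0]]
archetypes = cluster_and_blend(clones)
```

The first two clones sit within the threshold of each other and blend into one archetype; the distant third remains its own.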
-
Patent number: 11935517
Abstract: A speech decoding method is performed by a computer device, the speech including a current audio frame and a previous audio frame. The method includes: obtaining a target token corresponding to a smallest decoding score from a first token list including first tokens obtained by decoding the previous audio frame, each first token including a state pair and a decoding score, the state pair being used for characterizing a correspondence between a first state of the first token in a first decoding network corresponding to a low-order language model and a second state of the first token in a second decoding network corresponding to a differential language model; determining pruning parameters according to the target token and an acoustic vector of the current audio frame when the current audio frame is decoded; and decoding the current audio frame according to the first token list, the pruning parameters, and the acoustic vector.
Type: Grant
Filed: March 3, 2021
Date of Patent: March 19, 2024
Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
Inventors: Yiheng Huang, Xiaozheng Jian, Liqiang He
-
Patent number: 11935523
Abstract: There is provided automatic detection of pronunciation errors in spoken words utilizing a neural network model that is trained for a target phoneme. The target phoneme may be a phoneme in the English language. The pronunciation errors may be detected in English words.
Type: Grant
Filed: November 15, 2019
Date of Patent: March 19, 2024
Assignee: Master English Oy
Inventor: Aleksandr Diment
-
Patent number: 11928440
Abstract: Systems and methods for handling multilingual queries are provided. One example method includes receiving, at a computing device, an input, wherein the input comprises a multilingual query comprising at least a first source language and a second source language. The multilingual query is translated, word for word, into a destination language to produce a monolingual query, with the word order of the multilingual query and the word order of the monolingual query being the same. The monolingual query is processed using natural language processing to map the monolingual query to a natural language query in the destination language.
Type: Grant
Filed: August 25, 2020
Date of Patent: March 12, 2024
Assignee: Rovi Guides, Inc.
Inventors: Ajay Kumar Mishra, Jeffry Copps Robert Jose
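The order-preserving word-for-word step can be sketched as below. The tiny bilingual lexicons are hypothetical stand-ins for real dictionaries, and passing through unknown words is an assumption:

```python
# Hypothetical bilingual lexicons; a real system would use full dictionaries.
ES_TO_EN = {"películas": "movies", "de": "of", "acción": "action"}
FR_TO_EN = {"récents": "recent", "films": "movies"}

def translate_word(word, lexicons):
    """Look a word up in each source-language lexicon in turn; words already
    in the destination language (or unknown) pass through unchanged."""
    for lexicon in lexicons:
        if word in lexicon:
            return lexicon[word]
    return word

def to_monolingual(query, lexicons):
    """Translate a mixed-language query word for word, keeping the original
    word order, so NLP can map it to a destination-language query."""
    return " ".join(translate_word(w, lexicons) for w in query.split())

mono = to_monolingual("show películas de acción récents", [ES_TO_EN, FR_TO_EN])
```

The resulting monolingual string ("show movies of action recent") is ungrammatical but order-preserving, which is exactly why the downstream NLP step is needed to map it to a well-formed destination-language query.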
-
Patent number: 11929089
Abstract: An apparatus for processing a multichannel audio signal has a plurality of channel signals. The apparatus performs a time scale modulation of the multichannel audio signal and has a phase adaptor and a separator. The phase adaptor provides a processed signal by modifying a phase of a signal based on a combination of the channel signals. The separator provides separated signals based on the processed signal. A corresponding method is provided.
Type: Grant
Filed: October 31, 2018
Date of Patent: March 12, 2024
Assignee: FRAUNHOFER-GESELLSCHAFT ZUR FÖRDERUNG DER ANGEWANDTEN FORSCHUNG E.V.
Inventors: Christian Uhle, Michael Kratz, Paul Klose, Timothy Leonard, André Luvizotto, Sebastian Scharrer
-
Patent number: 11922141
Abstract: Systems and methods are disclosed for a voice/chatbot building system. The voice/chatbot builder may involve receiving an identified intent, receiving a task related to the identified intent, and receiving a response related to both the identified intent and the task. The identified intent, task, and response may form a first conversation. The first conversation may be linked to other conversations to establish contextual relationships among conversations and determine conversation priority. Voice/chatbot building may also train natural language processing machine learning algorithms.
Type: Grant
Filed: January 29, 2021
Date of Patent: March 5, 2024
Assignee: Walmart Apollo, LLC
Inventors: John Brian Moss, Don Bambico, Jason Charles Benesch, Snehasish Mukherjee
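The intent/task/response triple and the conversation links could be modeled as below; the class shape, the priority field, and all intent names are assumptions for illustration:

```python
from dataclasses import dataclass, field

@dataclass
class Conversation:
    """One bot conversation: an identified intent, a task for that intent,
    and a response. Links to other conversations establish contextual
    relationships and an ordering by priority."""
    intent: str
    task: str
    response: str
    linked: list = field(default_factory=list)

    def link(self, other, priority=0):
        """Attach another conversation; lower priority values come first."""
        self.linked.append((priority, other))
        self.linked.sort(key=lambda pair: pair[0])

order = Conversation("place_order", "collect_items",
                     "What would you like to buy?")
track = Conversation("track_order", "lookup_status",
                     "Your order is on the way.")
cancel = Conversation("cancel_order", "confirm_cancel", "Are you sure?")
order.link(track, priority=1)
order.link(cancel, priority=0)
next_conversation = order.linked[0][1]   # highest-priority linked conversation
```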