Patents Examined by Paras D Shah

Apparatus and method for inserting substitute words based on target characteristics

Patent number: 11853695

Abstract: Data processing apparatus comprises a data memory; a selection controller comprising a computer processor; and a digital interface between a control process implemented by the selection controller and a text handling process implemented by the computer processor or another processor; in which: the selection controller is configured to provide a text document from the data memory to the text handling process to identify one or more characteristics of words in the text document; the selection controller is configured to provide user selection of one or more of the words in the text document to be substituted and of one or more target characteristics; and the selection controller is configured to request from the text handling process a set of one or more substitute words for the selected words such that the substitute words comply with the selected one or more of the target characteristics.

Type: Grant

Filed: January 12, 2021

Date of Patent: December 26, 2023

Assignee: SONY CORPORATION

Inventor: Michael Anslow
System and method for providing voice assistant service

Patent number: 11848012

Abstract: Provided are an artificial intelligence (AI) system that utilizes a machine learning algorithm such as deep learning, etc., and an application of the AI system. A method performed by a device for providing a voice assistant service through a voice assistant program includes: receiving, from an external device, a character specialized model for the voice assistant program; receiving a user voice input including a request for a response of the voice assistant program and a word indicating a character; determining the character specialized model according to the word indicating the character; generating a response message to the request for the response of the voice assistant program, using the character specialized model; and outputting the generated response message.

Type: Grant

Filed: September 19, 2019

Date of Patent: December 19, 2023

Assignee: SAMSUNG ELECTRONICS CO., LTD.

Inventors: Inchul Hwang, Dohee Kang, Seungyoun Kim, Dongchul Ma, Minkyu Park, Daegeun Yi, Dohun Cha
Data-driven social media analytics application synthesis

Patent number: 11847417

Abstract: In some examples, data-driven social media analytics application synthesis may include generating, for each social media analytics application of a plurality of social media analytics applications, a corpus, performing term normalization, and generating a normalized corpus. An actor, an action and an object may be generated for each social media analytics application, which may be mapped into an embedding space. A semantic cohesion network may be generated for each social media analytics application, and a pair-wise semantic cohesion may be determined to identify semantically cohesive groups. A new social media analytics application may be synthesized based on the identified semantically cohesive groups.

Type: Grant

Filed: March 12, 2021

Date of Patent: December 19, 2023

Assignee: ACCENTURE GLOBAL SOLUTIONS LIMITED

Inventors: Janardan Misra, Vikrant Kaulgud, Sanjay Podder
Transcription analysis platform

Patent number: 11837214

Abstract: Various embodiments of the present disclosure evaluate transcription accuracy. In some implementations, the system normalizes a first transcription of an audio file and a baseline transcription of the audio file. The baseline transcription can be used as an accurate transcription of the audio file. The system can further determine an error rate of the first transcription by aligning each portion of the first transcription with the portion of the baseline transcription, and assigning a label to each portion based on a comparison of the portion of the first transcription with the portion of the baseline transcription.

Type: Grant

Filed: October 29, 2020

Date of Patent: December 5, 2023

Assignee: United Services Automobile Association (USAA)

Inventors: Michael J. Szentes, Carlos Chavez, Robert E. Lewis, Nicholas S. Walker
Systems and methods for accelerating automatic speech recognition based on compression and decompression

Patent number: 11830480

Abstract: Systems and methods are provided for automatic speech recognition. In the method, the system obtains a padded sequence by processing a plurality of acoustic signals. The system compresses the padded sequence by reducing the size of the padded sequence to obtain a compressed sequence. The system inputs the compressed sequence into a pre-trained encoder neural network to obtain an encoded sequence and then decompresses the encoded sequence by recovering the encoded sequence to an original sequential ordering. The system inputs the encoded sequence to a decoding module to obtain recognition texts.

Type: Grant

Filed: February 17, 2021

Date of Patent: November 28, 2023

Assignee: KWAI INC.

Inventors: Yongxiong Ren, Yang Liu, Heng Liu, Lingzhi Liu
Fast and robust unsupervised contextual biasing for speech recognition

Patent number: 11830477

Abstract: An automatic speech recognition (ASR) system that determines a textual representation of a word from a word spoken in a natural language is provided. The ASR system uses an acoustic model, a language model, and a decoder. When the ASR system receives a spoken word, the acoustic model generates word candidates for the spoken word. The language model determines an n-gram score for each word candidate. The n-gram score includes a base score and a bias score. The bias score is based on a logarithmic probability of the word candidate, where the logarithmic probability is derived using a class-based language model where the words are clustered into non-overlapping clusters according to word statistics. The decoder decodes a textual representation of the spoken word from the word candidates and the corresponding n-gram score for each word candidate.

Type: Grant

Filed: August 14, 2020

Date of Patent: November 28, 2023

Assignee: Salesforce, Inc.

Inventors: Young Mo Kang, Yingbo Zhou
Method and apparatus for speech interaction, and computer storage medium

Patent number: 11830482

Abstract: Embodiments of the present disclosure relate to a method and an apparatus for speech interaction, and a computer readable storage medium. The method may include determining text information corresponding to a received speech signal. The method also includes obtaining label information of the text information by labeling elements in the text information. In addition, the method further includes determining first intention information of the text information based on the label information. The method further includes determining a semantic of the text information based on the first intention information and the label information.

Type: Grant

Filed: June 8, 2020

Date of Patent: November 28, 2023

Assignee: BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD

Inventors: Zhen Wu, Yufang Wu, Hua Liang, Jiaxiang Ge, Xingyuan Peng, Jinfeng Bai, Lei Jia
Unified model for zero pronoun recovery and resolution

Patent number: 11822884

Abstract: A method, computer program, and computer system to recover a dropped pronoun is provided for receiving data corresponding to one or more input words and determining contextual representations for the received input word data. The dropped pronoun may be identified based on a probability value associated with the contextual representations, and a span associated with one or more of the received input words may and that corresponds to which of the input words the dropped pronoun refers may be determined.

Type: Grant

Filed: July 25, 2022

Date of Patent: November 21, 2023

Assignee: TENCENT AMERICA LLC

Inventor: Linfeng Song
Unsupervised keyword spotting and word discovery for fraud analytics

Patent number: 11810559

Abstract: Embodiments described herein provide for a computer that detects one or more keywords of interest using acoustic features, to detect or query commonalities across multiple fraud calls. Embodiments described herein may implement unsupervised keyword spotting (UKWS) or unsupervised word discovery (UWD) in order to identify commonalities across a set of calls, where both UKWS and UWD employ Gaussian Mixture Models (GMM) and one or more dynamic time-warping algorithms. A user may indicate a training exemplar or occurrence of call-specific information, referred to herein as “a named entity,” such as a person's name, an account number, account balance, or order number. The computer may perform a redaction process that computationally nullifies the import of the named entity in the modeling processes described herein.

Type: Grant

Filed: June 6, 2022

Date of Patent: November 7, 2023

Assignee: PINDROP SECURITY, INC.

Inventor: Hrishikesh Rao
Resampling output signals of QMF based audio codecs

Patent number: 11810584

Abstract: An apparatus for processing an audio signal includes a configurable first audio signal processor for processing the audio signal in accordance with different configuration settings to obtain a processed audio signal, wherein the apparatus is adapted so that different configuration settings result in different sampling rates of the processed audio signal. The apparatus furthermore includes n analysis filter bank having a first number of analysis filter bank channels, a synthesis filter bank having a second number of synthesis filter bank channels, a second audio processor being adapted to receive and process an audio signal having a predetermined sampling rate, and a controller for controlling the first number of analysis filter bank channels or the second number of synthesis filter bank channels in accordance with a configuration setting.

Type: Grant

Filed: February 10, 2021

Date of Patent: November 7, 2023

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventors: Markus Lohwasser, Manuel Jander, Max Neuendorf, Ralf Geiger, Markus Schnell, Matthias Hildenbrand, Tobias Chalupka
Device arbitration for digital assistant-based intercom systems

Patent number: 11810578

Abstract: Systems and processes for operating an intercom system via a digital assistant are provided. The intercom system is trigger-free, in that users communicate, in real-time, via devices without employing a trigger to speak. Acoustic fingerprints are employed to associate users with devices. Acoustic fingerprints include vector embeddings of speech input in an acoustic-feature vector space. Speech heard at multiple devices, as embedded in a fingerprint, may be clustered in the vector space, and the structure of the clusters is employed to associate users and devices. Based on the fingerprints, a device is mapped to a user, and the user employs that device to participate in a conversation, via the intercom service.

Type: Grant

Filed: October 16, 2020

Date of Patent: November 7, 2023

Assignee: Apple Inc.

Inventors: Benjamin S. Phipps, Sachin Kajarekar, Eugene Ray, Mahesh Ramaray Shanbhag, Kisun You, Patrick L. Coffman
Voice transmission compensation apparatus, voice transmission compensation method and program

Patent number: 11806213

Abstract: A speech transmission compensation apparatus that assists discrimination of speech heard by a user, includes: one or more computers each including a memory and a processor configured to: accept input of a speech signal, detect a specific type of sound in the speech signal, analyze an acoustic characteristic of the specific type of sound in the speech signal and output the acoustic characteristic; accept input of the acoustic characteristic being output by the memory and the processor, generate a vibration signal of a duration corresponding to the acoustic characteristic and output the vibration signal; and accept input of the vibration signal being output by the memory and the processor and provide the user with vibration for the duration on the basis of the vibration signal.

Type: Grant

Filed: April 30, 2020

Date of Patent: November 7, 2023

Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION

Inventors: Asuka Ono, Momoko Nakatani, Ai Nakane, Yoko Ishii
Resampling output signals of QMF based audio codecs

Patent number: 11804232

Abstract: An apparatus for processing an audio signal includes a configurable first audio signal processor for processing the audio signal in accordance with different configuration settings to obtain a processed audio signal, wherein the apparatus is adapted so that different configuration settings result in different sampling rates of the processed audio signal. The apparatus furthermore includes n analysis filter bank having a first number of analysis filter bank channels, a synthesis filter bank having a second number of synthesis filter bank channels, a second audio processor being adapted to receive and process an audio signal having a predetermined sampling rate, and a controller for controlling the first number of analysis filter bank channels or the second number of synthesis filter bank channels in accordance with a configuration setting.

Type: Grant

Filed: February 10, 2021

Date of Patent: October 31, 2023

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventors: Markus Lohwasser, Manuel Jander, Max Neuendorf, Ralf Geiger, Markus Schnell, Matthias Hildenbrand, Tobias Chalupka
Voice activity detection using zero crossing detection

Patent number: 11790931

Abstract: A first VAD system outputs a pulse stream for zero crossings in an audio signal. The pulse density of the pulse stream is evaluated to identify speech. The audio signal may have noise added to it before evaluating zero crossings. A second VAD system rectifies each audio signal sample and processes each rectified sample by updating a first statistic and evaluating the rectified sample per a first threshold condition that is a function of the first statistic. Rectified samples meeting the first threshold condition may be used to update a second statistic and the rectified sample evaluated per a second threshold condition that is a function of the second statistic. Rectified samples meeting the second threshold condition may be used to update a third statistic. The audio signal sample may be selected as speech if the second statistic is less than a downscaled third statistic.

Type: Grant

Filed: October 27, 2020

Date of Patent: October 17, 2023

Assignee: Ambiq Micro, Inc.

Inventor: Roger David Serwy
Resampling output signals of QMF based audio codecs

Patent number: 11790928

Abstract: An apparatus for processing an audio signal includes a configurable first audio signal processor for processing the audio signal in accordance with different configuration settings to obtain a processed audio signal, wherein the apparatus is adapted so that different configuration settings result in different sampling rates of the processed audio signal. The apparatus furthermore includes n analysis filter bank having a first number of analysis filter bank channels, a synthesis filter bank having a second number of synthesis filter bank channels, a second audio processor being adapted to receive and process an audio signal having a predetermined sampling rate, and a controller for controlling the first number of analysis filter bank channels or the second number of synthesis filter bank channels in accordance with a configuration setting.

Type: Grant

Filed: February 10, 2021

Date of Patent: October 17, 2023

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventors: Markus Lohwasser, Manuel Jander, Max Neuendorf, Ralf Geiger, Markus Schnell, Matthias Hildenbrand, Tobias Chalupka
Method and apparatus for processing audio signal

Patent number: 11790926

Abstract: A method and apparatus for processing an audio signal are disclosed. According to an example embodiment, a method of processing an audio signal may include acquiring a final audio signal for an initial audio signal using a plurality of neural network models generating output audio signals by encoding and decoding input audio signals, calculating a difference between the initial audio signal and the final audio signal in a time domain, converting the initial audio signal and the final audio signal into Mel-spectra, calculating a difference between the Mel-spectra of the initial audio signal and the final audio signal in a frequency domain, training the plurality of neural network models based on results calculated in the time domain and the frequency domain, and generating a new final audio signal distinguished from the final audio signal from the initial audio signal using the trained neural network models.

Type: Grant

Filed: January 22, 2021

Date of Patent: October 17, 2023

Assignees: Electronics and Telecommunications Research Institute, The Trustees of Indiana University

Inventors: Mi Suk Lee, Seung Kwon Beack, Jongmo Sung, Tae Jin Lee, Jin Soo Choi, Minje Kim, Kai Zhen
Weakly supervised and explainable training of a machine-learning-based named-entity recognition (NER) mechanism

Patent number: 11775763

Abstract: Systems and methods for weakly-supervised training a machine-learning model to perform named-entity recognition. All possible entity candidates and all possible rule candidates are automatically identified in an input data set of unlabeled text. An initial training of the machine-learning model is performed using labels assigned to entity candidates by a set of seeding rules as a first set of training data. The trained machine-learning model is then applied to the unlabeled text and a subset of rules from the rule candidates is identified that produces labels that most accurately match the labels assigned by the trained machine-learning model. The machine-learning model is then retrained using the labels assigned by the identified subset of rules as the second set of training data. This process is iteratively repeated to further refine and improve the performance of the machine-learning model for named-entity recognition.

Type: Grant

Filed: February 25, 2021

Date of Patent: October 3, 2023

Assignee: Robert Bosch GmbH

Inventors: Jiacheng Li, Haibo Ding, Zhe Feng
Artificial intelligence apparatus for recognizing speech of user and method for the same

Patent number: 11776544

Abstract: An embodiment of the present invention provides an artificial intelligence (AI) apparatus for recognizing a speech of a user, the artificial intelligence apparatus includes a memory to store a speech recognition model and a processor to obtain a speech signal for a user speech, to convert the speech signal into a text using the speech recognition model, to measure a confidence level for the conversion, to perform a control operation corresponding to the converted text if the measured confidence level is greater than or equal to a reference value, and to provide feedback for the conversion if the measured confidence level is less than the reference value.

Type: Grant

Filed: May 18, 2022

Date of Patent: October 3, 2023

Assignee: LG ELECTRONICS INC.

Inventors: Jaehong Kim, Hyoeun Kim, Hangil Jeong, Heeyeon Choi
Meaning inference from speech audio

Patent number: 11769488

Abstract: A system and method invoke virtual assistant action, which may comprise an argument. From audio, a probability of an intent is inferred. A probability of a domain and a plurality of variable values may also be inferred. Invoking the action is in response to the intent probability exceeding a threshold. Invoking the action may also be in response to the domain probability exceeding a threshold, a variable value probability exceeding a threshold, detecting an end of utterance, and a specific amount of time having elapsed. The intent probability may increase when the audio includes speech of words with the same meaning in multiple natural languages. Invoking the action may also be conditional on the variable value exceeding its threshold within a certain period of time of the intent probability exceeding its threshold.

Type: Grant

Filed: March 3, 2022

Date of Patent: September 26, 2023

Assignee: SoundHound AI IP, LLC

Inventors: Sudharsan Krishnaswamy, Maisy Wieman, Jonah Probell
Tool for assisting people with speech disorder

Patent number: 11763821

Abstract: Various tools are disclosed for providing assistive or augmentative means to enhance the fluency and accuracy of persons having speech disabilities. These technologies may automatically ascertain and dynamically improve the accuracy with which automatic speech recognition (ASR) systems recognize utterances of persons having impaired speech conditions. In an embodiment, digitized audio information about a speaker’s utterance is processed to determine a set of candidate words matching the utterance. From these candidate words, a set of concepts is determined using a finite state machine model. A pictogram representing each concept is identified and presented to the speaker so that the speaker may select the pictogram corresponding to the best match of his or her intended meaning associated with the utterance. An action corresponding to speaker’s selection then may be performed. For example, displaying or synthesizing speech from textual information describing the selected concept.

Type: Grant

Filed: June 27, 2019

Date of Patent: September 19, 2023

Assignee: Cerner Innovation, Inc.

Inventor: Douglas S. McNair

prev 1 2 3 4 5 6 7 8 9 … next