Patents Examined by Douglas Godbold
  • Patent number: 11705118
    Abstract: A system configured to process an incoming spoken utterance and to coordinate among multiple speechlet components to execute an action of the utterance, where the output of one speechlet may be used as the input to another speechlet to ultimately perform the action. The speechlets and intervening actions need not be expressly invoked by the utterance. Rather the system may determine how best to complete the action and may identify intermediate speechlets that may be provide input data to the speechlet that will ultimately perform the action. The speechlets may be configured to recognize a common universe of actions and/or entities rather than have each speechlet or subject matter domain have its own set of recognizable actions and entities.
    Type: Grant
    Filed: October 28, 2019
    Date of Patent: July 18, 2023
    Assignee: Amazon Technologies, Inc.
    Inventors: Bradford Lynch, Adam D. Baran, Kevindra Pal Singh, Udai Sen Mody
  • Patent number: 11699445
    Abstract: A system and method for improving T-matrix training for speaker recognition, comprising receiving an audio input, divisible into a plurality of audio frames including at least an audio sample of a human speaker; generating for each audio frame a feature vector; generating for a first plurality of feature vectors centered statistics of at least a zero order and a first order; generating a first i-vector, the first i-vector representing the human speaker; and generating an optimized T-matrix training sequence computation, based on at least the first i-vector.
    Type: Grant
    Filed: March 15, 2021
    Date of Patent: July 11, 2023
    Assignee: ILLUMA LABS INC.
    Inventor: Milind Borkar
  • Patent number: 11694675
    Abstract: Provided is an apparatus that includes a voice recognition section that executes a voice recognition process on a user speech and a learning processing section that executes a process of updating a degree of confidence on the basis of an interaction made between a user and the information processing apparatus after the user speech. The degree of confidence is an evaluation value indicating the reliability of a voice recognition result of the user speech. The voice recognition section generates data on degrees of confidence in recognition of the user speech in which data plural user speech candidates based on the voice recognition result of the user speech are associated with the degrees of confidence which are evaluation values each indicating reliability of the corresponding user speech candidate.
    Type: Grant
    Filed: November 29, 2018
    Date of Patent: July 4, 2023
    Assignee: SONY CORPORATION
    Inventors: Hidenori Aoki, Fujio Arai, Yusuke Kudo, Gen Hamada, Naoyuki Sato
  • Patent number: 11694687
    Abstract: A computer-implemented method according to one embodiment includes receiving, utilizing a processor, textual data associated with a conversation between a first participant and a second participant; receiving, utilizing the processor, an objective of the first participant for the conversation between the first participant and the second participant, where the objective is separate from the conversation; determining, utilizing the processor, a dialog act to be entered by the first participant that meets the objective, utilizing a model, including scoring a plurality of proposed dialog acts based on an amount that each proposed dialog act will change a probability of the objective being achieved during the conversation, and determining the dialog act to be entered, based on the scoring; and returning, utilizing the processor, the dialog act to the first participant.
    Type: Grant
    Filed: May 17, 2021
    Date of Patent: July 4, 2023
    Assignee: International Business Machines Corporation
    Inventors: Rama Kalyani T. Akkiraju, Mansurul Bhuiyan, Pritam S. Gundecha, Jalal U. Mahmud, Shereen Oraby, Vibha S. Sinha, Sabina Tomkins, Anbang Xu
  • Patent number: 11694699
    Abstract: There is provided mechanisms for frame loss concealment. A method is performed by a receiving entity. The method comprises adding, in association with constructing a substitution frame for a lost frame, a noise component to the substitution frame. The noise component has a frequency characteristic corresponding to a low-resolution spectral representation of a signal in a previously received frame.
    Type: Grant
    Filed: July 21, 2021
    Date of Patent: July 4, 2023
    Assignee: Telefonaktiebolaget LM Ericsson (publ)
    Inventor: Stefan Bruhn
  • Patent number: 11694697
    Abstract: A system and method are presented for the correction of packet loss in audio in automatic speech recognition (ASR) systems. Packet loss correction, as presented herein, occurs at the recognition stage without modifying any of the acoustic models generated during training. The behavior of the ASR engine in the absence of packet loss is thus not altered. To accomplish this, the actual input signal may be rectified, the recognition scores may be normalized to account for signal errors, and a best-estimate method using information from previous frames and acoustic models may be used to replace the noisy signal.
    Type: Grant
    Filed: June 29, 2020
    Date of Patent: July 4, 2023
    Inventors: Srinath Cheluvaraja, Ananth Nagaraja Iyer, Aravind Ganapathiraju, Felix Immanuel Wyss
  • Patent number: 11670309
    Abstract: Methods, apparatus and articles of manufacture for research data gathering are disclosed. An example apparatus disclosed herein is to detect whether the apparatus is powered by an internal power source or an external power source. The example apparatus is also to, in response to detecting the apparatus is powered by the internal power source, perform first processing on a received audio signal to determine audio data to store in storage of the apparatus. The example apparatus is further to, in response to detecting the apparatus is powered by the external power source, perform second processing on the stored audio data to recover the code, the second processing different from the first processing.
    Type: Grant
    Filed: November 23, 2020
    Date of Patent: June 6, 2023
    Assignee: THE NIELSEN COMPANY (US), LLC
    Inventors: Alan R. Neuhauser, Jack C. Crystal
  • Patent number: 11663408
    Abstract: Implementations of the disclosure are directed to OCR error correction systems and methods. In some implementations, a method comprises: obtaining, at a computing device, optical character recognition (OCR) text extracted from a document image, the text comprising a token; searching, at the computing device, based on a token bigram determined from the token and a mapping between words in a corpus and a corpus bigram set comprised of unique bigrams from the beginning or ending of the words in the corpus, the corpus for a best word to replace the token; and replacing, at the computing device, the token with the best word.
    Type: Grant
    Filed: December 17, 2020
    Date of Patent: May 30, 2023
    Assignee: FIRST AMERICAN FINANCIAL CORPORATION
    Inventor: Jeffrey Norton
  • Patent number: 11657227
    Abstract: Machine learning model training corpus debiasing includes identifying an attribute of input text selected from the training corpus, the attribute including word(s) of the input text, and the attribute corresponding to an attribute class encompassing different possible class values, recognizing bias in the input text with respect to the attribute class, and generating output text corresponding to the attribute and imparting diversity with respect to the attribute class and relative to the input text, where generating the output text uses an optimization function based on loss objectives to minimize loss in the generated output text as compared to the input text.
    Type: Grant
    Filed: January 13, 2021
    Date of Patent: May 23, 2023
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Shikhar Kwatra, Nishtha Madaan, Sushain Pandit, Kuntal Dey
  • Patent number: 11645473
    Abstract: Systems, computer-implemented methods, and computer program products that can facilitate predicting a source of a subsequent spoken dialogue are provided. According to an embodiment, a system can comprise a memory that stores computer executable components and a processor that executes the computer executable components stored in the memory. The computer executable components can comprise a speech receiving component that can receive a spoken dialogue from a first entity. The computer executable components can further comprise a speech processing component that can employ a network that can concurrently process a transition type and a dialogue act of the spoken dialogue to predict a source of a subsequent spoken dialogue.
    Type: Grant
    Filed: December 23, 2020
    Date of Patent: May 9, 2023
    Assignees: INTERNATIONAL BUSINESS MACHINES CORPORATION, THE REGENTS OF THE UNIVERSITY OF MICHIGAN
    Inventors: Lazaros Polymenakos, Dimitrios B. Dimitriadis, Zakaria Aldeneh, Emily Mower Provost
  • Patent number: 11645460
    Abstract: A first text corpus comprising punctuated and capitalized text is received. The words in the first text corpus are then annotated with a set of labels indicating a punctuation and a capitalization of each word. At an initial training stage, a machine learning model is trained on a first training set using the annotated words from the first text corpus and the labels. A second text corpus is received representing conversational speech. The words in the second text corpus are then annotated with the set of labels. In a re-training stage, the machine learning model is re-trained on a second training set comprising the annotated words from the second text corpus, and the labels. At an inference stage, the trained machine learning model is applied to a target set of words representing conversational speech to predict a punctuation and capitalization of each word in the target set.
    Type: Grant
    Filed: December 28, 2020
    Date of Patent: May 9, 2023
    Inventors: Avraham Faizakof, Arnon Mazza, Lev Haikin, Eyal Orbach
  • Patent number: 11636872
    Abstract: In various embodiments, a quality inference application estimates perceived audio quality. The quality inference application computes a set of feature values for a set of audio features based on an audio clip. The quality inference application then uses a trained multitask learning model to generate predicted labels based on the set of feature values. The predicted labels specify metric values for metrics that are relevant to audio quality. Subsequently, the quality inference application computes an audio quality score for the audio clip based on the predicted labels.
    Type: Grant
    Filed: June 18, 2020
    Date of Patent: April 25, 2023
    Assignee: NETFLIX, INC.
    Inventors: Chih-Wei Wu, Phillip A. Williams, William Francis Wolcott, IV
  • Patent number: 11636269
    Abstract: A computerized method is provided for automatically determining answers to a plurality of questions. The method includes automatically discovering a plurality of questions by processing historical data related to prior customer interactions. The automatically discovering includes applying a linguistic analytical model on the data related to historical customer interactions to detect the plurality of questions, vectoring the plurality of questions to generate mathematical representations of the questions, and grouping the plurality of questions into one or more clusters in accordance with similarities of the questions as measured based on their mathematical representations. The method also includes identifying the questions that do not have an existing answer. The method further includes determining at least one probable answer to each of the representative questions using a content mining technique that mines pertinent data from one or more identified content sources.
    Type: Grant
    Filed: October 15, 2020
    Date of Patent: April 25, 2023
    Assignee: FMR LLC
    Inventors: Ankush Chopra, Shruti Agrawal
  • Patent number: 11631400
    Abstract: An electronic apparatus configured to acquire information on a plurality of candidate texts corresponding to input speech of a user through a general speech recognition module, determine text corresponding to the input speech from among the plurality of candidate texts using a trained personal language model, and output the text as a result of speech recognition of the input speech.
    Type: Grant
    Filed: February 10, 2020
    Date of Patent: April 18, 2023
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Beomseok Lee, Sangha Kim, Yoonjin Yoon
  • Patent number: 11631419
    Abstract: A recording device records a video and an imaging time, and a voice. Based on the voice, a sound parameter calculator calculates a sound parameter for specifying magnitude of the voice in a monitoring area at the imaging time for each of pixels and for each of certain times. A sound parameter storage unit stores the sound parameter. A sound parameter display controller superimposes a voice heat map on a captured image of the monitoring area and displays the superimposed image on a monitor. At this time, the sound parameter display controller displays the voice heat map based on a cumulative time value of magnitude of the voice, according to designation of a time range.
    Type: Grant
    Filed: February 12, 2021
    Date of Patent: April 18, 2023
    Assignee: PANASONIC INTELLECTUAL PROPERTY MANAGEMENT CO., LTD.
    Inventors: Ryota Fujii, Hiroyuki Matsumoto, Hiroaki Hayashi, Kazunori Hayashi
  • Patent number: 11620985
    Abstract: A pattern recognition apparatus includes: a model storage part that stores a model(s) generated by using transfer path information indicating a difference of transfer paths of a signal(s) for training, additional to the signal(s) for training, and a pattern recognition part that inputs an input signal and transfer path information indicating a difference of transfer paths of the input signal, and performs pattern recognition of the input signal by using the model(s).
    Type: Grant
    Filed: May 15, 2018
    Date of Patent: April 4, 2023
    Assignee: NEC CORPORATION
    Inventors: Tatsuya Komatsu, Reishi Kondo
  • Patent number: 11610580
    Abstract: A framework for reducing the number of textual items reviewed to determine the source of or reason for an anomaly in a time series that is used to track metrics in textual data is provided. According the framework, textual items in a time window corresponding to the anomaly are ranked according to the cross-entropy as determined by applying a language model to the relevant textual items and ranking textual items that most likely triggered an anomaly in time series data based on the cross-entropy value. In an aspect, a predetermined number of textual items having the highest cross-entropy are provided or all textual items having cross-entropy value higher than predetermine threshold are provided.
    Type: Grant
    Filed: March 5, 2020
    Date of Patent: March 21, 2023
    Assignee: Verint Americas Inc.
    Inventor: Cynthia Freeman
  • Patent number: 11599720
    Abstract: A method may include receiving an electronic message from a sender. The method may further include parsing the electronic message into a set of sections, the set of sections including structured sections and an unstructured section. The method may further include detecting etiquette errors in the structured sections of the electronic message, wherein the etiquette errors include at least one of a missing word, a redundant word, an incorrect usage of a word, a style error, an emotional punctuation error, or a punctuation error. The method may further include generating an etiquette score based on the etiquette errors.
    Type: Grant
    Filed: July 28, 2020
    Date of Patent: March 7, 2023
    Assignee: SHL (India) Private Limited
    Inventors: Varun Aggarwal, Rohit Takhar, Abhishek Unnam
  • Patent number: 11599721
    Abstract: A natural language processing system that trains task models for particular natural language tasks programmatically generates additional utterances for inclusion in the training set, based on the existing utterances in the training set and the existing state of a task model as generated from the original (non-augmented) training set. More specifically, the training augmentation module 220 identifies specific textual units of utterances and generates variants of the utterances based on those identified units. The identification is based on determined importances of the textual units to the output of the task model, as well as on task rules that correspond to the natural language task for which the task model is being generated. The generation of the additional utterances improves the quality of the task model without the expense of manual labeling of utterances for training set inclusion.
    Type: Grant
    Filed: August 25, 2020
    Date of Patent: March 7, 2023
    Assignee: Salesforce, Inc.
    Inventors: Shiva Kumar Pentyala, Mridul Gupta, Ankit Chadha, Indira Iyer, Richard Socher
  • Patent number: 11600284
    Abstract: A voice morphing apparatus having adjustable parameters is described. The disclosed system and method include a voice morphing apparatus that morphs input audio to mask a speaker's identity. Parameter adjustment uses evaluation of an objective function that is based on the input audio and output of the voice morphing apparatus. The voice morphing apparatus includes objectives that are based adversarially on speaker identification and positively on audio fidelity. Thus, the voice morphing apparatus is adjusted to reduce identifiability of speakers while maintaining fidelity of the morphed audio. The voice morphing apparatus may be used as part of an automatic speech recognition system.
    Type: Grant
    Filed: January 11, 2020
    Date of Patent: March 7, 2023
    Assignee: SOUNDHOUND, INC.
    Inventor: Steve Pearson