Patents Examined by Jonathan Kim
  • Patent number: 10170102
    Abstract: Embodiments of the present invention provide an approach for estimating the accuracy of a transcription of a voice recording. Specifically, in a typical embodiment, each word of a transcription of a voice recording is checked against a customer-specific dictionary and/or a common language dictionary. The number of words not found in either dictionary is determined. An accuracy number for the transcription is calculated from the number of said words not found and the total number of words in the transcription.
    Type: Grant
    Filed: April 24, 2018
    Date of Patent: January 1, 2019
    Assignee: International Business Machines Corporation
    Inventors: James E. Bostick, John M. Ganci, Jr., John P. Kaemmerer, Craig M. Trim
  • Patent number: 10157180
    Abstract: One embodiment of the present invention provides a system for facilitating information in multiple languages based on an optical code. During operation, the system scans an optical code accompanying a text phrase and receives a location of a conversion server embedded in the optical code. The system then determines one or more target languages for obtaining the text phrase in the target languages and sends a query message to the conversion server based on the retrieved location. The query message comprises a list of the one or more target languages.
    Type: Grant
    Filed: January 12, 2016
    Date of Patent: December 18, 2018
    Assignee: Alibaba Group Holding Limited
    Inventor: Yifeng Zhu
  • Patent number: 10127214
    Abstract: Methods are presented for generating a natural language model. The method may comprise: ingesting training data representative of documents to be analyzed by the natural language model; generating a hierarchical data structure comprising at least two topical nodes into which the training data is to be subdivided by the natural language model; selecting a plurality of documents among the training data to be annotated; generating an annotation prompt for each document configured to elicit an annotation about said document indicating which node among the at least two topical nodes said document is to be classified into; receiving the annotation based on the annotation prompt; and generating the natural language model using an adaptive machine learning process configured to determine patterns among the annotations for how the documents in the training data are to be subdivided according to the at least two topical nodes of the hierarchical data structure.
    Type: Grant
    Filed: December 9, 2015
    Date of Patent: November 13, 2018
    Assignee: Sansa AI Inc.
    Inventors: Robert J. Munro, Schuyler D. Erle, Christopher Walker, Sarah K. Luger, Jason Brenier, Gary C. King, Paul A. Tepper, Ross Mechanic, Andrew Gilchrist-Scott, Jessica D. Long, James B. Robinson, Brendan D. Callahan, Michelle Casbon, Ujjwal Sarin, Aneesh Nair, Veena Basavaraj, Tripti Saxena, Edgar Nunez, Martha G. Hinrichs, Haley Most, Tyler J. Schnoebelen
  • Patent number: 10056096
    Abstract: Provided herein is an electronic device and method of voice recognition, the method including analyzing an audio signal of a first frame when the audio signal is input and extracting a first feature value; determining a similarity between the first feature value extracted from the audio signal of the first frame and a first feature value extracted from an audio signal of a previous frame; analyzing the audio signal of the first frame and extracting a second feature value when the similarity is below a predetermined threshold value; and comparing the extracted first feature value and the second feature value and at least one feature value corresponding to a pre-defined voice signal and determining whether or not the audio signal of the first frame is a voice signal, and thus the electronic device may detect only a voice section from the audio signal while improving the processing speed.
    Type: Grant
    Filed: July 22, 2016
    Date of Patent: August 21, 2018
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventor: Jong-uk Yoo
  • Patent number: 10043539
    Abstract: A method for speech processing includes determining an unvoicing parameter for a first frame of a speech signal and determining a smoothed unvoicing parameter for the first frame by weighting the unvoicing parameter of the first frame and a smoothed unvoicing parameter of a second frame. The unvoicing parameter reflects a speech characteristic of the first frame. The smoothed unvoicing parameter of the second frame is weighted less heavily when the smoothed unvoicing parameter of the second frame is greater than the unvoicing parameter of the first frame. The method further includes computing a difference, by a processor, between the unvoicing parameter of the first frame and the smoothed unvoicing parameter of the first frame, and determining a classification of the first frame according to the computed difference. The classification includes unvoiced speech or voiced speech. The first frame is processed in accordance with the classification of the first frame.
    Type: Grant
    Filed: December 27, 2016
    Date of Patent: August 7, 2018
    Assignee: Huawei Technologies Co., Ltd.
    Inventor: Yang Gao
  • Patent number: 10031722
    Abstract: Techniques for creating groups of devices for controlling these groups with voice commands are described herein. For instance, an environment may include an array of secondary devices (or “smart appliances”, or simply “devices”) that are configured to perform an array of operations. Users may request to create different groups of these devices, such that the users may control entire groups at a single time with individual voice commands.
    Type: Grant
    Filed: June 26, 2015
    Date of Patent: July 24, 2018
    Assignee: Amazon Technologies, Inc.
    Inventors: Rohan Mutagi, He Lu, Willy Lew Yuk Vong, Michael Dale Whiteley, Fred Torok, Shikher Sitoke, David Ross Bronaugh, Bo Li
  • Patent number: 10002606
    Abstract: Embodiments of the present invention provide an approach for estimating the accuracy of a transcription of a voice recording. Specifically, in a typical embodiment, each word of a transcription of a voice recording is checked against a customer-specific dictionary and/or a common language dictionary. The number of words not found in either dictionary is determined. An accuracy number for the transcription is calculated from the number of said words not found and the total number of words in the transcription.
    Type: Grant
    Filed: November 16, 2017
    Date of Patent: June 19, 2018
    Assignee: International Business Machines Corporation
    Inventors: James E. Bostick, John M. Ganci, John P. Kaemmerer, Craig M. Trim
  • Patent number: 9984686
    Abstract: Techniques for defining a set of predefined device capabilities generally offered by available voice-controllable devices are described herein. Thereafter, as a particular user introduces new secondary devices into his environment and registers these devices, the techniques may identify the capabilities of the new device and map these capabilities to one or more of the predefined device capabilities of the set.
    Type: Grant
    Filed: June 26, 2015
    Date of Patent: May 29, 2018
    Assignee: Amazon Technologies, Inc.
    Inventors: Rohan Mutagi, Willy Lew Yuk Vong, Michael Dale Whiteley, He Lu
  • Patent number: 9984683
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for automatic speech recognition using multi-dimensional models. In some implementations, audio data that describes an utterance is received. A transcription for the utterance is determined using an acoustic model that includes a neural network having first memory blocks for time information and second memory blocks for frequency information. The transcription for the utterance is provided as output of an automated speech recognizer.
    Type: Grant
    Filed: July 22, 2016
    Date of Patent: May 29, 2018
    Assignee: Google LLC
    Inventors: Bo Li, Tara N. Sainath
  • Patent number: 9953647
    Abstract: A method and apparatus for speech recognition are provided. The method and the apparatus calculate signal to noise ratios (SNRs) of speech signals from a user received at speech recognition apparatuses. The method and the apparatus recognize a reference speech signal having a maximum SNR among the SNRs.
    Type: Grant
    Filed: January 12, 2016
    Date of Patent: April 24, 2018
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Minyoung Mun, YoungSang Choi
  • Patent number: 9946712
    Abstract: A computer-implemented technique includes receiving, at a computing device including one or more processors, a user input (i) identifying a portion of a media stream being output from the computing device and (ii) indicating a request to translate the portion of the media stream from a source language to a target language. The technique includes transmitting, from the computing device, the portion of the media stream to a translation server via a network in response to receiving the user input. The technique includes receiving, at the computing device, a translated portion of the media stream from the translation server via the network, the translated portion of the media stream having been translated from the source language to the target language by the translation server. The technique also includes outputting, at the computing device, the translated portion of the media stream.
    Type: Grant
    Filed: June 13, 2013
    Date of Patent: April 17, 2018
    Assignee: GOOGLE LLC
    Inventor: Hong Shen
  • Patent number: 9934793
    Abstract: Disclosed are a method for determining whether a person is drunk after consuming alcohol capable of analyzing alcohol consumption in a time domain by analyzing a voice, and a recording medium and a terminal for carrying out same.
    Type: Grant
    Filed: January 24, 2014
    Date of Patent: April 3, 2018
    Assignee: FOUNDATION OF SOONGSIL UNIVERSITY-INDUSTRY COOPERATION
    Inventors: Myung Jin Bae, Sang Gil Lee, Geum Ran Baek
  • Patent number: 9916844
    Abstract: Disclosed are a method for determining whether a person is drunk after consuming alcohol on the basis of a difference among a plurality of formant energies, which are generated by applying linear predictive coding according to a plurality of linear prediction orders, and a recording medium and a terminal for carrying out the method.
    Type: Grant
    Filed: January 28, 2014
    Date of Patent: March 13, 2018
    Assignee: FOUNDATION OF SOONGSIL UNIVERSITY-INDUSTRY COOPERATION
    Inventors: Myung Jin Bae, Sang Gil Lee, Geum Ran Baek
  • Patent number: 9905243
    Abstract: A system of this invention is directed to a speech processing system that efficiently performs noise suppression processing for a plurality of noise sources spreading in a lateral direction with respect to a speaker of interest.
    Type: Grant
    Filed: January 16, 2014
    Date of Patent: February 27, 2018
    Assignee: NEC CORPORATION
    Inventors: Masanori Tsujikawa, Ken Hanazawa, Akihiko Sugiyama
  • Patent number: 9906883
    Abstract: An audio encoding apparatus and method that encodes hybrid contents including an object sound, a background sound, and metadata, and an audio decoding apparatus and method that decodes the encoded hybrid contents are provided. The audio encoding apparatus may include a mixing unit to generate an intermediate channel signal by mixing a background sound and an object sound, a matrix information encoding unit to encode matrix information used for the mixing, an audio encoding unit to encode the intermediate channel signal, and a metadata encoding unit to encode metadata including control information of the object sound.
    Type: Grant
    Filed: September 4, 2014
    Date of Patent: February 27, 2018
    Assignee: Electronics and Telecommunications Research Institute
    Inventors: Seung Kwon Beack, Tae Jin Lee, Jong Mo Sung, Kyeong Ok Kang, Jeong Il Seo, Dae Young Jang, Yong Ju Lee, Jin Woong Kim
  • Patent number: 9899039
    Abstract: Disclosed is a method for determining alcohol consumption capable of analyzing alcohol consumption in a time domain by analyzing a formant slope of a voice signal, and a recording medium and a terminal for carrying out same. A terminal for determining whether a person is drunk comprises: a voice input unit for generating a voice frame by receiving a voice signal; a voiced/unvoiced sound analysis unit for determining whether a received voice frame corresponds to a voiced sound; a formant frequency extraction unit for extracting a plurality of formant frequencies of the voice frame corresponding to the voiced sound; and an alcohol consumption determining unit for calculating a formant slope between the plurality of formant frequencies, and determining the state of alcohol consumption depending on the formant slope, thereby determining whether a person is drunk by analyzing the formant slope of an inputted voice.
    Type: Grant
    Filed: January 24, 2014
    Date of Patent: February 20, 2018
    Assignee: FOUNDATION OF SOONGSIL UNIVERSITY-INDUSTRY COOPERATION
    Inventors: Myung Jin Bae, Sang Gil Lee, Geum Ran Baek
  • Patent number: 9892725
    Abstract: Embodiments of the present invention provide an approach for estimating the accuracy of a transcription of a voice recording. Specifically, in a typical embodiment, each word of a transcription of a voice recording is checked against a customer-specific dictionary and/or a common language dictionary. The number of words not found in either dictionary is determined. An accuracy number for the transcription is calculated from the number of said words not found and the total number of words in the transcription.
    Type: Grant
    Filed: January 5, 2017
    Date of Patent: February 13, 2018
    Assignee: International Business Machines Corporation
    Inventors: James E. Bostick, John M. Ganci, Jr., John P. Kaemmerer, Craig M. Trim
  • Patent number: 9870357
    Abstract: A method of presenting translated content items is disclosed. It is detected that a content item has been captured by a device of a user. It is identified that the content item is a candidate content item for translation. The candidate content item is translated; and the translated candidate content item is presented via a user interface of a wearable display of the device.
    Type: Grant
    Filed: October 28, 2013
    Date of Patent: January 16, 2018
    Assignee: Microsoft Technology Licensing, LLC
    Inventor: Tomer Cohen
  • Patent number: 9836450
    Abstract: Systems, methods, and apparatuses are presented for a trained language model to be stored in an efficient manner such that the trained language model may be utilized in virtually any computing device to conduct natural language processing. Unlike other natural language processing engines that may be computationally intensive to the point of being capable of running only on high performance machines, the organization of the natural language models according to the present disclosures allows for natural language processing to be performed even on smaller devices, such as mobile devices.
    Type: Grant
    Filed: December 9, 2015
    Date of Patent: December 5, 2017
    Assignee: Sansa AI Inc.
    Inventors: Schuyler D. Erle, Robert J. Munro, Brendan D. Callahan, Gary C. King, Jason Brenier, James B. Robinson
  • Patent number: 9812138
    Abstract: A robust digital fingerprint of a file ensures that one able to produce the robust digital fingerprint has possession of the file. A client obtains information that is unpredictable to the client and uses that information to modify the file and generate a robust digital fingerprint from the modified file. A server, with access to the same unpredictable information, verifies the generated robust digital fingerprint. An algorithm for generating the robust digital fingerprint has a property that different representations of the same content will produce matching digital fingerprints.
    Type: Grant
    Filed: September 3, 2014
    Date of Patent: November 7, 2017
    Assignee: Amazon Technologies, Inc.
    Inventor: Thibault Candebat
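
The transcription-accuracy abstracts listed above (patents 10170102, 10002606, and 9892725) describe checking each transcribed word against a customer-specific dictionary and a common language dictionary, counting the words found in neither, and deriving an accuracy number from that count and the total word count. The abstracts do not give the formula, so the following is only a rough sketch that assumes accuracy = 1 - (unknown words / total words). The function name, the tokenizing rule, and the sample dictionaries are all hypothetical and are not taken from the patents.

```python
import re
from typing import List, Optional, Set, Tuple


def estimate_accuracy(
    transcript: str,
    customer_dictionary: Set[str],
    common_dictionary: Set[str],
) -> Tuple[Optional[float], List[str]]:
    """Return (accuracy estimate, words found in neither dictionary).

    Assumption: accuracy = 1 - unknown / total. The abstracts only say
    the number is calculated from these two counts.
    """
    # Hypothetical tokenizer: lowercase runs of letters and apostrophes.
    words = re.findall(r"[a-z']+", transcript.lower())
    if not words:
        # No words to check, so no meaningful estimate can be made.
        return None, []

    # A word counts as unknown only if it is missing from both dictionaries.
    unknown = [
        w for w in words
        if w not in customer_dictionary and w not in common_dictionary
    ]
    accuracy = 1.0 - len(unknown) / len(words)
    return accuracy, unknown


# Illustrative data only.
common = {"please", "reset", "my", "router"}
customer = {"acme"}
score, missing = estimate_accuracy("Please reset my Acme router nwo", customer, common)
print(score, missing)
```

In this sketch a product name such as "acme" is accepted because it appears in the customer-specific dictionary, while a garbled token such as "nwo" lowers the estimate.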