Patents Examined by Jonathan Kim

Automatic accuracy estimation for audio transcriptions

Patent number: 10170102

Abstract: Embodiments of the present invention provide an approach for estimating the accuracy of a transcription of a voice recording. Specifically, in a typical embodiment, each word of a transcription of a voice recording is checked against a customer-specific dictionary and/or a common language dictionary. The number of words not found in either dictionary is determined. An accuracy number for the transcription is calculated from the number of said words not found and the total number of words in the transcription.

Type: Grant

Filed: April 24, 2018

Date of Patent: January 1, 2019

Assignee: International Business Machines Corporation

Inventors: James E. Bostick, John M. Ganci, Jr., John P. Kaemmerer, Craig M. Trim
Displaying information in multiple languages based on optical code reading

Patent number: 10157180

Abstract: One embodiment of the present invention provides a system for facilitating information in multiple languages based on an optical code. During operation, the system scans an optical code accompanying a text phrase and receives a location of a conversion server embedded in the optical code. The system then determines one or more target languages for obtaining the text phrase in the target languages and sends a query message to the conversion server based on the retrieved location. The query message comprises a list of the one or more target languages.

Type: Grant

Filed: January 12, 2016

Date of Patent: December 18, 2018

Assignee: Alibaba Group Holding Limited

Inventor: Yifeng Zhu
Methods for generating natural language processing systems

Patent number: 10127214

Abstract: Methods are presented for generating a natural language model. The method may comprise: ingesting training data representative of documents to be analyzed by the natural language model, generating a hierarchical data structure comprising at least two topical nodes within which the training data is to be subdivided into by the natural language model, selecting a plurality of documents among the training data to be annotated, generating an annotation prompt for each document configured to elicit an annotation about said document indicating which node among the at least two topical nodes said document is to be classified into, receiving the annotation based on the annotation prompt; and generating the natural language model using an adaptive machine learning process configured to determine patterns among the annotations for how the documents in the training data are to be subdivided according to the at least two topical nodes of the hierarchical data structure.

Type: Grant

Filed: December 9, 2015

Date of Patent: November 13, 2018

Assignee: Sansa Al Inc.

Inventors: Robert J. Munro, Schuyler D. Erle, Christopher Walker, Sarah K. Luger, Jason Brenier, Gary C. King, Paul A. Tepper, Ross Mechanic, Andrew Gilchrist-Scott, Jessica D. Long, James B. Robinson, Brendan D. Callahan, Michelle Casbon, Ujjwal Sarin, Aneesh Nair, Veena Basavaraj, Tripti Saxena, Edgar Nunez, Martha G. Hinrichs, Haley Most, Tyler J. Schnoebelen
Electronic device and method capable of voice recognition

Patent number: 10056096

Abstract: Provided herein is an electronic device and method of voice recognition, the method including analyzing an audio signal of a first frame when the audio signal is input and extracting a first feature value; determining a similarity between the first feature value extracted from the audio signal of the first frame and a first feature value extracted from an audio signal of a previous frame; analyzing the audio signal of the first frame and extracting a second feature value when the similarity is below a predetermined threshold value; and comparing the extracted first feature value and the second feature value and at least one feature value corresponding to a pre-defined voice signal and determining whether or not the audio signal of the first frame is a voice signal, and thus the electronic device may detect only a voice section from the audio signal while improving the processing speed.

Type: Grant

Filed: July 22, 2016

Date of Patent: August 21, 2018

Assignee: SAMSUNG ELECTRONICS CO., LTD.

Inventor: Jong-uk Yoo
Unvoiced/voiced decision for speech processing

Patent number: 10043539

Abstract: A method for speech processing includes determining an unvoicing parameter for a first frame of a speech signal and determining a smoothed unvoicing parameter for the first frame by weighting the unvoicing parameter of the first frame and a smoothed unvoicing parameter of a second frame. The unvoicing parameter reflects a speech characteristic of the first frame. The smoothed unvoicing parameter of the second frame is weighted less heavily when the smoothed unvoicing parameter of the second frame is greater than the unvoicing parameter of the first frame. The method further includes computing a difference, by a processor, between the unvoicing parameter of the first frame and the smoothed unvoicing parameter of the first frame, and determining a classification of the first frame according to the computed difference. The classification includes unvoiced speech or voiced speech. The first frame is processed in accordance with the classification of the first frame.

Type: Grant

Filed: December 27, 2016

Date of Patent: August 7, 2018

Assignee: Huawei Technologies Co., Ltd.

Inventor: Yang Gao
Grouping devices for voice control

Patent number: 10031722

Abstract: Techniques for creating groups of devices for controlling these groups with voice commands are described herein. For instance, an environment may include an array of secondary devices (or “smart appliances”, or simply “devices”) that are configured to perform an array of operations. Users may request to create different groups of these devices, such that the users may control entire groups at a single time with individual voice commands.

Type: Grant

Filed: June 26, 2015

Date of Patent: July 24, 2018

Assignee: Amazon Technologies, Inc.

Inventors: Rohan Mutagi, He Lu, Willy Lew Yuk Vong, Michael Dale Whiteley, Fred Torok, Shikher Sitoke, David Ross Bronaugh, Bo Li
Automatic accuracy estimation for audio transcriptions

Patent number: 10002606

Abstract: Embodiments of the present invention provide an approach for estimating the accuracy of a transcription of a voice recording. Specifically, in a typical embodiment, each word of a transcription of a voice recording is checked against a customer-specific dictionary and/or a common language dictionary. The number of words not found in either dictionary is determined. An accuracy number for the transcription is calculated from the number of said words not found and the total number of words in the transcription.

Type: Grant

Filed: November 16, 2017

Date of Patent: June 19, 2018

Assignee: International Business Machines Corporation

Inventors: James E. Bostick, John M. Ganci, John P. Kaemmerer, Craig M. Trim
Mapping device capabilities to a predefined set

Patent number: 9984686

Abstract: Techniques for defining a set of predefined device capabilities generally offered by available voice-controllable devices are described herein. Thereafter, as a particular user introduces new secondary devices into his environment and registers these devices, the techniques may identify the capabilities of the new device and map these capabilities to one or more of the predefined device capabilities of the set.

Type: Grant

Filed: June 26, 2015

Date of Patent: May 29, 2018

Assignee: Amazon Technologies, Inc.

Inventors: Rohan Mutagi, Willy Lew Yuk Vong, Michael Dale Whiteley, He Lu
Automatic speech recognition using multi-dimensional models

Patent number: 9984683

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for automatic speech recognition using multi-dimensional models. In some implementations, audio data that describes an utterance is received. A transcription for the utterance is determined using an acoustic model that includes a neural network having first memory blocks for time information and second memory blocks for frequency information. The transcription for the utterance is provided as output of an automated speech recognizer.

Type: Grant

Filed: July 22, 2016

Date of Patent: May 29, 2018

Assignee: Google LLC

Inventors: Bo Li, Tara N. Sainath
Method and apparatus for speech recognition

Patent number: 9953647

Abstract: A method and apparatus for speech recognition are provided. The method and the apparatus calculate signal to noise ratios (SNRs) of speech signals from a user received at speech recognition apparatuses. The method and the apparatus recognize a reference speech signal having a maximum SNR among the SNRs.

Type: Grant

Filed: January 12, 2016

Date of Patent: April 24, 2018

Assignee: Samsung Electronics Co., Ltd.

Inventors: Minyoung Mun, YoungSang Choi
Techniques for user identification of and translation of media

Patent number: 9946712

Abstract: A computer-implemented technique includes receiving, at a computing device including one or more processors, a user input (i) identifying a portion of a media stream being output from the computing device and (ii) indicating a request to translate the portion of the media stream from a source language to a target language. The technique includes transmitting, from the computing device, the portion of the media stream to a translation server via a network in response to receiving the user input. The technique includes receiving, at the computing device, a translated portion of the media stream from the translation server via the network, the translated portion of the media stream having been translated from the source language to the target language by the translation server. The technique also includes outputting, at the computing device, the translated portion of the media stream.

Type: Grant

Filed: June 13, 2013

Date of Patent: April 17, 2018

Assignee: GOOGLE LLC

Inventor: Hong Shen
Method for determining alcohol consumption, and recording medium and terminal for carrying out same

Patent number: 9934793

Abstract: Disclosed are a method for determining whether a person is drunk after consuming alcohol capable of analyzing alcohol consumption in a time domain by analyzing a voice, and a recording medium and a terminal for carrying out same.

Type: Grant

Filed: January 24, 2014

Date of Patent: April 3, 2018

Assignee: FOUNDATION OF SOONGSIL UNIVERSITY-INDUSTRY COOPERATION

Inventors: Myung Jin Bae, Sang Gil Lee, Geum Ran Baek
Method for determining alcohol consumption, and recording medium and terminal for carrying out same

Patent number: 9916844

Abstract: Disclosed are a method for determining whether a person is drunk after consuming alcohol on the basis of a difference among a plurality of formant energy energies, which are generated by applying linear predictive coding according to a plurality of linear prediction orders, and a recording medium and a terminal for carrying out the method.

Type: Grant

Filed: January 28, 2014

Date of Patent: March 13, 2018

Assignee: FOUNDATION OF SOONGSIL UNIVERSITY-INDUSTRY COOPERATION

Inventors: Myung Jin Bae, Sang Gil Lee, Geum Ran Baek
Audio encoding apparatus and method, audio decoding apparatus and method, and audio reproducing apparatus

Patent number: 9906883

Abstract: An audio encoding apparatus and method that encodes hybrid contents including an object sound, a background sound, and metadata, and an audio decoding apparatus and method that decodes the encoded hybrid contents are provided. The audio encoding apparatus may include a mixing unit to generate an intermediate channel signal by mixing a background sound and an object sound, a matrix information encoding unit to encode matrix information used for the mixing, an audio encoding unit to encode the intermediate channel signal, and a metadata encoding unit to encode metadata including control information of the object sound.

Type: Grant

Filed: September 4, 2014

Date of Patent: February 27, 2018

Assignee: Electronics and Telecommunications Research Institute

Inventors: Seung Kwon Beack, Tae Jin Lee, Jong Mo Sung, Kyeong Ok Kang, Jeong Il Seo, Dae Young Jang, Yong Ju Lee, Jin Woong Kim
Speech processing system, speech processing method, speech processing program, vehicle including speech processing system on board, and microphone placing method

Patent number: 9905243

Abstract: A system of this invention is directed to a speech processing system that efficiently performs noise suppression processing for a plurality of noise sources spreading in a lateral direction with respect to a speaker of interest.

Type: Grant

Filed: January 16, 2014

Date of Patent: February 27, 2018

Assignee: NEC CORPORATION

Inventors: Masanori Tsujikawa, Ken Hanazawa, Akihiko Sugiyama
Method for determining alcohol consumption, and recording medium and terminal for carrying out same

Patent number: 9899039

Abstract: Disclosed is a method for determining alcohol consumption capable of analyzing alcohol consumption in a time domain by analyzing a formant slope of a voice signal, and a recording medium and a terminal for carrying out same. An terminal for determining whether a person is drunk comprises: a voice input unit for generating a voice frame by receiving a voice signal; a voiced/unvoiced sound analysis unit for determining whether a received voiced frame corresponds to a voiced sound; a formant frequency extraction unit for extracting a plurality of formant frequencies of the voice frame corresponding to the voiced sound; and an alcohol consumption determining unit for calculating a formant slope between the plurality of formant frequencies, and determining the state of alcohol consumption depending on the formant slope, thereby determining whether a person is drunk by analyzing the formant slope of an inputted voice.

Type: Grant

Filed: January 24, 2014

Date of Patent: February 20, 2018

Assignee: FOUNDATION OF SOONGSIL UNIVERSITY-INDUSTRY COOPERATION

Inventors: Myung Jin Bae, Sang Gil Lee, Geum Ran Baek
Automatic accuracy estimation for audio transcriptions

Patent number: 9892725

Abstract: Embodiments of the present invention provide an approach for estimating the accuracy of a transcription of a voice recording. Specifically, in a typical embodiment, each word of a transcription of a voice recording is checked against a customer-specific dictionary and/or a common language dictionary. The number of words not found in either dictionary is determined. An accuracy number for the transcription is calculated from the number of said words not found and the total number of words in the transcription.

Type: Grant

Filed: January 5, 2017

Date of Patent: February 13, 2018

Assignee: International Business Machines Corporation

Inventors: James E. Bostick, John M. Ganci, Jr., John P. Kaemmerer, Craig M. Trim
Techniques for translating text via wearable computing device

Patent number: 9870357

Abstract: A method of presenting translated content items is disclosed. It is detected that a content item has been captured by a device of a user. It is identified that the content item is a candidate content item for translation. The candidate content item is translated; and the translated candidate content item is presented via a user interface of a wearable display of the device.

Type: Grant

Filed: October 28, 2013

Date of Patent: January 16, 2018

Assignee: Microsoft Technology Licensing, LLC

Inventor: Tomer Cohen
Methods and systems for providing universal portability in machine learning

Patent number: 9836450

Abstract: Systems, methods, and apparatuses are presented for a trained language model to be stored in an efficient manner such that the trained language model may be utilized in virtually any computing device to conduct natural language processing. Unlike other natural language processing engines that may be computationally intensive to the point of being capable of running only on high performance machines, the organization of the natural language models according to the present disclosures allows for natural language processing to be performed even on smaller devices, such as mobile devices.

Type: Grant

Filed: December 9, 2015

Date of Patent: December 5, 2017

Assignee: Sansa AI Inc.

Inventors: Schuyler D. Erle, Robert J. Munro, Brendan D. Callahan, Gary C. King, Jason Brenier, James B. Robinson
Proving file ownership

Patent number: 9812138

Abstract: A robust digital fingerprint of a file ensures that one able to produce the robust digital fingerprint has possession of the file. A client obtains information that is unpredictable to the client and uses that information to modify the file and generate a robust digital fingerprint from the modified file. A server, with access to the same unpredictable information, verifies the generated robust digital fingerprint. An algorithm for generating the robust digital fingerprint has a property that different representations of the same content will produce matching digital fingerprints.

Type: Grant

Filed: September 3, 2014

Date of Patent: November 7, 2017

Assignee: Amazon Technologies, Inc.

Inventor: Thibault Candebat

1 2 3 next