Patents Examined by Kee Young Lee

Dual model speaker identification

Patent number: 9711148

Abstract: A processing system receives an audio signal encoding an utterance and determines that a first portion of the audio signal corresponds to a predefined phrase. The processing system accesses one or more text-dependent models associated with the predefined phrase and determines a first confidence based on the one or more text-dependent models associated with the predefined phrase, the first confidence corresponding to a first likelihood that a particular speaker spoke the utterance. The processing system determines a second confidence for a second portion of the audio signal using one or more text-independent models, the second confidence corresponding to a second likelihood that the particular speaker spoke the utterance. The processing system then determines that the particular speaker spoke the utterance based at least in part on the first confidence and the second confidence.

Type: Grant

Filed: July 18, 2013

Date of Patent: July 18, 2017

Assignee: Google Inc.

Inventors: Matthew Sharifi, Dominik Roblek
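The abstract of patent 9711148 says the speaker decision rests at least in part on both a text-dependent and a text-independent confidence, but it does not say how they are combined. The sketch below assumes a weighted average followed by a threshold; the function names, the weight, and the threshold are hypothetical and are not taken from the patent.

```python
def combine_confidences(text_dependent, text_independent, weight=0.5):
    """Blend two speaker confidences in [0, 1] with a weighted average.

    The weighted average is an assumed fusion rule, not the patent's method.
    """
    for value in (text_dependent, text_independent, weight):
        if not 0.0 <= value <= 1.0:
            raise ValueError("confidences and weight must lie in [0, 1]")
    return weight * text_dependent + (1.0 - weight) * text_independent


def is_particular_speaker(text_dependent, text_independent,
                          weight=0.5, threshold=0.7):
    """Accept the speaker when the blended confidence reaches the threshold.

    Both weight and threshold are illustrative values.
    """
    combined = combine_confidences(text_dependent, text_independent, weight)
    return combined >= threshold


if __name__ == "__main__":
    # Strong hotword match, moderate match on the rest of the utterance.
    print(is_particular_speaker(0.9, 0.8))
    # Strong hotword match, weak match on the rest of the utterance.
    print(is_particular_speaker(0.9, 0.2))
```

Using two scores lets a weak match on the free-form part of the utterance veto a strong match on the predefined phrase, which is one plausible reason for a dual-model design.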

Voiced sound interval classification device, voiced sound interval classification method and voiced sound interval classification program

Patent number: 9530435

Abstract: The voiced sound interval classification device comprises a vector calculation unit which calculates, from a power spectrum time series of voice signals, a multidimensional vector series as a vector series of a power spectrum having as many dimensions as the number of microphones, a difference calculation unit which calculates, with respect to each time of the multidimensional vector series, a vector of a difference between the time and the preceding time, a sound source direction estimation unit which estimates, as a sound source direction, a main component of the differential vector, and a voiced sound interval determination unit which determines whether each sound source direction is in a voiced sound interval or a voiceless sound interval by using a predetermined voiced sound index indicative of a likelihood of a voiced sound interval of the voice signal applied at each time.

Type: Grant

Filed: January 25, 2012

Date of Patent: December 27, 2016

Assignee: NEC CORPORATION

Inventor: Yoshifumi Onishi
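The first steps in the abstract of patent 9530435 (per-time vectors with one dimension per microphone, differences between consecutive times, and the main component of those differences) can be sketched roughly as follows. This is an illustration under assumptions: the input is a plain list of per-microphone power values, and the main component is estimated by power iteration on the second-moment matrix, a technique chosen here for brevity rather than one named in the patent. All function names are hypothetical.

```python
import math


def difference_vectors(series):
    """Return the difference between each vector and the one before it."""
    return [
        [cur - prev for cur, prev in zip(series[t], series[t - 1])]
        for t in range(1, len(series))
    ]


def main_component(vectors, iterations=50):
    """Estimate the dominant direction of the vectors as a unit vector.

    Uses power iteration on the sum of outer products (assumed technique).
    """
    dim = len(vectors[0])
    moment = [
        [sum(v[i] * v[j] for v in vectors) for j in range(dim)]
        for i in range(dim)
    ]
    direction = [1.0 / math.sqrt(dim)] * dim
    for _ in range(iterations):
        product = [
            sum(moment[i][j] * direction[j] for j in range(dim))
            for i in range(dim)
        ]
        norm = math.sqrt(sum(x * x for x in product))
        if norm == 0.0:
            # All vectors are zero, or the start vector is orthogonal to them.
            break
        direction = [x / norm for x in product]
    return direction


if __name__ == "__main__":
    # Two microphones; power grows in a fixed 3:4 ratio between them.
    power_series = [[0.0, 0.0], [3.0, 4.0], [9.0, 12.0], [12.0, 16.0]]
    print(main_component(difference_vectors(power_series)))
```

The later step in the abstract, deciding whether each direction is voiced by means of a voiced sound index, is not covered by this sketch.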

Reducing octave errors during pitch determination for noisy audio signals

Patent number: 9530434

Abstract: Octave errors may be reduced during pitch determination for noisy audio signals. Pitch may be tracked over time by determining amplitudes at harmonics for individual time windows of an input signal. Octave errors may be reduced in individual time windows by fitting amplitudes of corresponding harmonics across successive time windows to identify spurious harmonics caused by octave error. A given harmonic may be identified as either being associated with the same pitch as adjacent harmonics in the given time window or being spurious based on parameters of the fitting function.

Type: Grant

Filed: July 18, 2013

Date of Patent: December 27, 2016

Assignee: KnuEdge Incorporated

Inventors: Massimo Mascaro, David C. Bradley

Voice enhancement and/or speech features extraction on noisy audio signals using successively refined transforms

Patent number: 9484044

Abstract: Voice enhancement and/or speech features extraction may be performed on noisy audio signals using successively refined transforms. Downsampled versions of an input signal may be obtained, which include a first downsampled signal with a lower sampling rate than a second downsampled signal. Successive transforms may be performed on the input signal to obtain a corresponding sound model of the input signal. The successive transforms performed may include: (1) performing a first transform on the first downsampled signal to yield a first pitch estimate; (2) performing a second transform on the second downsampled signal to yield a second pitch estimate and a first harmonics estimate based on the first pitch estimate; and (3) performing a third transform on the input signal to yield a third pitch estimate and a second harmonics estimate based on the second pitch estimate and the first harmonics estimate.

Type: Grant

Filed: July 17, 2013

Date of Patent: November 1, 2016

Assignee: KnuEdge Incorporated

Inventors: Massimo Mascaro, David C. Bradley

Natural language translation techniques

Patent number: 9436681

Abstract: Techniques are described for translating natural language input to a machine-readable form that accurately represents the semantic meaning of the input intended by the user.

Type: Grant

Filed: July 16, 2013

Date of Patent: September 6, 2016

Assignee: Amazon Technologies, Inc.

Inventors: William Tunstall-Pedoe, Robert Peter Stacey, Thomas Ashton, Adam John Phillip Wood

Techniques for automatic generation of natural language text

Patent number: 9411804

Abstract: Techniques for use in connection with a system for automatically generating text. Techniques include accessing information specifying at least one referential expression for at least a first referent and at least one anaphoric expression for at least the first referent; accessing a template that includes human-language text and a first tag that serves as a placeholder for a first text portion including a reference to at least the first referent; automatically identifying, using at least one system rule and at least one processor, text to use for the first text portion at least in part by determining whether to use as the text for the first text portion the at least one referential expression or the at least one anaphoric expression; and automatically generating output text including the human-language text and the identified text for the first text portion.

Type: Grant

Filed: July 17, 2013

Date of Patent: August 9, 2016

Assignee: YSEOP SA

Inventors: Alain Kaeser, Emmanuel Vignon, Ludan Stoecklé
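The template mechanism in the abstract of patent 9411804 (a tag that stands in for a reference, filled with either a referential or an anaphoric expression according to a system rule) can be illustrated with a small sketch. The `{tag}` syntax, the `render` function, and the rule "use the referential expression on first mention and the anaphoric expression afterwards" are all assumptions made for this example; the abstract does not specify them.

```python
import re

# Hypothetical tag syntax: a referent name wrapped in braces, e.g. {company}.
TAG = re.compile(r"\{(\w+)\}")


def render(template, expressions):
    """Fill each tag with text for its referent.

    `expressions` maps a referent name to a pair
    (referential expression, anaphoric expression).
    Assumed rule: first mention is referential, later mentions are anaphoric.
    """
    mentioned = set()

    def choose(match):
        referent = match.group(1)
        referential, anaphoric = expressions[referent]
        if referent in mentioned:
            return anaphoric
        mentioned.add(referent)
        return referential

    # re.sub calls `choose` for each tag from left to right.
    return TAG.sub(choose, template)


if __name__ == "__main__":
    template = "{company} grew revenue. Analysts expect {company} to keep growing."
    print(render(template, {"company": ("Acme Corp", "it")}))
```

The sketch does not handle capitalization of an anaphoric expression at the start of a sentence, or agreement in number and gender, which a real generation system would need.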

Semantics-oriented analysis of log message content

Patent number: 9336203

Abstract: A log message is processed. The log message to be processed is received. One or more portions of the log message to be separately extracted are identified. A value is extracted from each identified portion. Extracting the value includes using an extraction rule. The extraction rule is associated with the identified portion.

Type: Grant

Filed: July 19, 2013

Date of Patent: May 10, 2016

Assignee: TIBCO Software Inc.

Inventor: Michael Perrone
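The flow in the abstract of patent 9336203 (identify portions of a log message, then extract a value from each portion with the rule associated with it) could look roughly like the sketch below. The portion names, the regular expressions, and the sample log format are hypothetical; the abstract does not say what form an extraction rule takes.

```python
import re

# Hypothetical rules: each named portion of the message has its own rule.
EXTRACTION_RULES = {
    "timestamp": re.compile(r"^(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2})"),
    "level": re.compile(r"\b(DEBUG|INFO|WARN|ERROR)\b"),
    "user": re.compile(r"user=(\w+)"),
}


def extract_values(message, rules=EXTRACTION_RULES):
    """Apply each portion's rule and collect the values that were found."""
    values = {}
    for portion, rule in rules.items():
        match = rule.search(message)
        if match:
            values[portion] = match.group(1)
    return values


if __name__ == "__main__":
    line = "2013-07-19 10:15:00 ERROR login failed user=alice"
    print(extract_values(line))
```

Keeping the rules in a mapping keyed by portion makes the association between a portion and its rule explicit, and portions that are absent from a message are simply omitted from the result.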

Speech processing apparatus, control method thereof, storage medium storing control program thereof, and vehicle, information processing apparatus, and information processing system including the speech processing apparatus

Patent number: 9299360

Abstract: A speech processing apparatus acquires pseudo speech from a mixture of sound including desired speech and noise. A first microphone inputs a first mixture sound, including desired speech and noise, and outputs a first mixture signal. A second microphone opens to the sound space and is disposed at a focus position of an interface, that is part of a boundary of the sound space and has one of a quadratic surface shape and a pseudo surface shape approximating a quadratic surface, inputs a second mixture sound including the desired speech reflected by the interface and the noise reflected by the interface at a ratio different from the first mixture sound, and outputs a second mixture signal. A noise suppression circuit suppresses an estimated noise signal based on the first mixture signal and the second mixture signal and outputs a pseudo speech signal.

Type: Grant

Filed: December 3, 2011

Date of Patent: March 29, 2016

Assignee: NEC CORPORATION

Inventors: Takayuki Arakawa, Akihiko Sugiyama

Apparatus and method for processing voice signal

Patent number: 9165561

Abstract: A voice signal processing method processes voice signals acquired by a microphone. A voice processing device acquires first voice signals according to a first sampling frequency, and samples second voice signals from the first voice signals according to a second sampling frequency. The second voice signals are encoded to obtain a basic voice package. A voiceprint data package of each voice signal frame of the first voice signals is obtained using a curve fitting method, and a pitch data package of each voice signal frame of the first voice signals is obtained according to pitch distribution of twelve central octave keys of a standard piano. The voiceprint data package and the pitch data package are embedded into the basic audio package to generate a final voice package of the first voice signals.

Type: Grant

Filed: January 13, 2014

Date of Patent: October 20, 2015

Assignee: HON HAI PRECISION INDUSTRY CO., LTD.

Inventor: Chun-Te Wu
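
The abstract of patent 9165561 refers to the pitch distribution of twelve central octave keys of a standard piano. As a rough illustration, the sketch below assumes those keys are C4 through B4 (keys 40 to 51 on an 88-key piano) tuned in equal temperament with A4 at 440 Hz, and maps a measured pitch to the nearest of them. The key range, the tuning, and the function names are assumptions; the abstract does not describe how the pitch data package is actually built.

```python
import math

KEY_NAMES = ["C4", "C#4", "D4", "D#4", "E4", "F4",
             "F#4", "G4", "G#4", "A4", "A#4", "B4"]


def key_frequency(key_number):
    """Equal-temperament frequency of a piano key; key 49 is A4 at 440 Hz."""
    return 440.0 * 2.0 ** ((key_number - 49) / 12.0)


# Assumed reading of "central octave": keys 40 (C4) through 51 (B4).
CENTRAL_OCTAVE = {
    name: key_frequency(40 + i) for i, name in enumerate(KEY_NAMES)
}


def nearest_key(pitch_hz):
    """Return the central-octave key closest to the pitch on a log scale."""
    if pitch_hz <= 0:
        raise ValueError("pitch must be positive")
    return min(
        CENTRAL_OCTAVE,
        key=lambda name: abs(math.log2(pitch_hz / CENTRAL_OCTAVE[name])),
    )


if __name__ == "__main__":
    for name, freq in CENTRAL_OCTAVE.items():
        print(f"{name}: {freq:.2f} Hz")
    print(nearest_key(262.0))
```

Distances are compared on a logarithmic scale because musical intervals are ratios of frequencies rather than differences.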