Patents Examined by Josiah Hernandez

Audio decoding method and apparatus for reconstructing high frequency components with less computation

Patent number: 7444289

Abstract: An audio decoding method and apparatus for reconstructing high frequency components with less computation are provided. The audio decoding apparatus includes a decoder, a channel similarity determination unit, a high frequency component generation unit, and an audio synthesizing unit. The audio decoding method generates high frequency components of frames while skipping every other frame for each channel signal; when right and left channel signals are similar to each other, generates high frequency components of the skipped frame for any one channel signal by using the generated high frequency components of the corresponding frame for the other channel signal; and when the right and left channel signals are not similar to each other, generates high frequency components of the skipped frames for each channel signal by using previous frames for the relevant channel signal.

Type: Grant

Filed: September 2, 2003

Date of Patent: October 28, 2008

Assignee: Samsung Electronics Co., Ltd.

Inventors: Yoonhark Oh, Mathew Manu
Semi-automatic construction method for knowledge base of encyclopedia question answering system

Patent number: 7428487

Abstract: In a semi-automatic construction method for knowledge base of an encyclopedia question answering system, concept-oriented systematic templates are designed and important fact information related to entries is automatically extracted from summary information and body of the encyclopedia to semi-automatically construct the knowledge base of the encyclopedia question answering system. The method includes the steps of: (a) designing structure of the knowledge base with a plurality of templates for each entry and a plurality attributes related to each of the templates; (b) extracting structured information including the entry, an attribute name and attribute values from summary information of the encyclopedia; (c) extracting unstructured information including an attribute name and attribute values of the entry from a body of the encyclopedia; and (d) storing the structured information and the unstructured information in corresponding template and attribute of the knowledge base according to the entry.

Type: Grant

Filed: April 21, 2004

Date of Patent: September 23, 2008

Assignee: Electronics and Telecommunications Research Institute

Inventors: Ji Hyun Wang, Eui Sok Chung, Myung Gil Jang
Defining atom units between phone and syllable for TTS systems

Patent number: 7418389

Abstract: A method for identifying common multiphone units to add to a unit inventory for a text-to-speech generator is disclosed. The common multiphone units are units that are larger than a phone, but smaller than a syllable. The method slices each syllable into a plurality of slices. These slices are then sorted and the frequency of each slice is determined. Those slices whose frequencies exceed a threshold are added to the unit inventory. The remaining slices are decomposed according to a predetermined set of rules to determine if they contain slices that should be added to the unit inventory.

Type: Grant

Filed: January 11, 2005

Date of Patent: August 26, 2008

Assignee: Microsoft Corporation

Inventors: Min Chu, Yong Zhao
Voice data input device and method

Patent number: 7418384

Abstract: A data input device for inputting numeric data by voice includes a range prediction part, a history holding part, a speech recognition part, a recognition result holding part, a comparison part, a presentation part, and a result storing part. The range prediction part estimates a range of a value expected to be input on the basis of meter-reading history data held in the history holding part. The speech recognition part recognizes speech representing a meter reading and stores the recognition result in the recognition result holding part. The comparison part determines whether or not the meter reading for this month represented by the data stored in the recognition result holding part is within the prediction range. If the meter reading for this month is within the prediction range, the presentation part presents the recognition result to a user, and the speech recognition result is stored in the result storing part.

Type: Grant

Filed: October 20, 2003

Date of Patent: August 26, 2008

Assignee: Canon Kabushiki Kaisha

Inventor: Yasuo Okutani
Systems and methods for managing and building directed dialogue portal applications

Patent number: 7395206

Abstract: A directed dialogue portal system management scheme is disclosed, providing a manager application layer over portal system managed applications. The manager application layer may include a global grammar (e.g., the grammar of each managed application) and each managed application may inherit behaviors of the manager application. Responses to prompts received by a first managed application that are unrecognized or are directed to a second managed application are sent to the manager application. The manager application sends the response to the second application. The scheme is transparent to the user and, from the user's perspective, a user is able to directly access any application or services in a directed dialogue portal system from other locations in the system.

Type: Grant

Filed: January 14, 2005

Date of Patent: July 1, 2008

Assignee: Unisys Corporation

Inventors: James S. Irwin, Alan Weiman, Owen Simon Dallaway
Method and system for the automatic generation of speech features for scoring high entropy speech

Patent number: 7392187

Abstract: A method and system for automatically generating a scoring model for scoring a speech sample are disclosed. One or more training speech samples are received in response to a prompt. One or more speech features are determined for each of the training speech samples. A scoring model is then generated based on the speech features. At least one of the training speech samples may be a high entropy speech sample. An evaluation speech sample is received and a score is assigned to the evaluation speech sample using the scoring model. The evaluation speech sample may be a high entropy speech sample.

Type: Grant

Filed: September 20, 2004

Date of Patent: June 24, 2008

Assignee: Educational Testing Service

Inventors: Isaac Bejar, Klaus Zechner
System and method for effectively implementing an optimized language model for speech recognition

Patent number: 7392186

Abstract: A system and method for effectively implementing an optimized language model for speech recognition includes initial language models each created by combining source models according to selectable interpolation coefficients that define proportional relationships for combining the source models. A rescoring module iteratively utilizes the initial language models to process input development data for calculating word-error rates that each correspond to a different one of the initial language models. An optimized language model is then selected from the initial language models by identifying an optimal word-error rate from among the foregoing word-error rates. The speech recognizer may then utilize the optimized language model for effectively performing various speech recognition procedures.

Type: Grant

Filed: March 30, 2004

Date of Patent: June 24, 2008

Assignees: Sony Corporation, Sony Electronics Inc.

Inventors: Lei Duan, Gustavo Abrego, Xavier Menendez-Pidal, Lex Olorenshaw
Method and system for unified speech and graphic user interfaces

Patent number: 7389235

Abstract: A method for unifying speech user interface and graphic user interface commands includes the steps of receiving (52) user entered text via a GUI, processing (54) the user-entered text, monitoring (60) the user-entered text, adding input context (62) to the user-entered text, and, updating (74, 76, and 78) a speech recognizer with the user-entered text and semantic information. Updating the speech recognizer can include the step of accepting new text information and input context to update a speech grammar (74) and recognition vocabulary of the speech recognizer. The method can include the step of updating the GUI (72) by updating GUI directives (68) and elements (70) to maintain the GUI unified with the speech grammar. The method can further include the step of forming a window (402) enabling the display of a speech interface command as a user constructs the speech interface command using the GUI (400).

Type: Grant

Filed: September 30, 2003

Date of Patent: June 17, 2008

Assignee: Motorola, Inc.

Inventor: Joseph L. Dvorak
Identifying language attributes through probabilistic analysis

Patent number: 7386438

Abstract: A system and method for identifying language attributes through probabilistic analysis is described. A set of language classes and a plurality of training documents are defined, Each language class identifies a language and a character set encoding. Occurrences of one or more document properties within each training document are evaluated. For each language class, a probability for the document properties set conditioned on the occurrence of the language class is calculated. Byte occurrences within each training document are evaluated. For each language class, a probability for the byte occurrences conditioned on the occurrence of the language class is calculated.

Type: Grant

Filed: August 4, 2003

Date of Patent: June 10, 2008

Assignee: Google Inc.

Inventors: Alexander Franz, Brian Milch, Eric Jackson, Jenny Zhou, Benjamin Diament
Process and system for semantically recognizing, correcting, and suggesting domain specific speech

Patent number: 7383172

Abstract: Semantic understanding of hypotheses returned by a speech engine could improve the quality of recognition and in cases of misrecognition speed the identification of errors and potential substitutions. Unfortunately, semantic recognition using natural language parsers is hard since semantic and syntactic rules for processing language are complex and computationally expensive. Additionally, semantic recognition should be performed in a knowledge domain—a domain of interest such as radiology, pathology, or tort law. This adds additional complexity to building semantic rules. The method described achieves semantic understanding by coupling a speech recognition engine to a semantic recognizer, which draws from a database of domain sentences derived from a document corpus, and a knowledge base created for these domain sentences. The method is able to identify in near real-time the best sentence hypotheses from the speech recognizer and its associated meanings, i.e.

Type: Grant

Filed: August 9, 2004

Date of Patent: June 3, 2008

Inventor: Patrick William Jamieson
Acoustic model creation method as well as acoustic model creation apparatus and speech recognition apparatus

Patent number: 7366669

Abstract: To provide an acoustic model which can absorb the fluctuation of a phonemic environment in an interval longer than a syllable, with the number of parameters of the acoustic model suppressed to be small, a phoneme-connected syllable HMM/syllable-connected HMM set is generated in such a way that a phoneme-connected syllable HMM set corresponding to individual syllables is generated by combining phoneme HMMs. A preliminary experiment is conducted using the phoneme-connected syllable HMM set and training speech data. Any misrecognized syllable and the preceding syllable of the misrecognized syllable are checked using results of a preliminary experiment syllable label data. The combination between a correct answer syllable for the misrecognized syllable and the preceding syllable of the misrecognized syllable is extracted as a syllable connection. A syllable-connected HMM corresponding to this syllable connection is added into the phoneme-connected syllable HMM set.

Type: Grant

Filed: March 8, 2004

Date of Patent: April 29, 2008

Assignee: Seiko Epson Corporation

Inventors: Masanobu Nishitani, Yasunaga Miyazawa, Hiroshi Matsumoto, Kazumasa Yamamoto
Method and device for recovering interrupted voice input

Patent number: 7366672

Abstract: A method for recovering interrupted voice input, particularly voice input into communication, audio and/or navigation systems installed in a vehicle detects, during an ongoing voice input process, an event that requires interruption of the voice input process, stores an incomplete sequence of the voice input, and interrupts the voice input process in response to detection of the event. The method can further detect that the event has finished and can recover the voice input process based on the stored sequence. A corresponding device is also described.

Type: Grant

Filed: May 13, 2005

Date of Patent: April 29, 2008

Assignee: Nokia Corporation

Inventor: Joerg Pietruszka
Relative delta computations for determining the meaning of language inputs

Patent number: 7366666

Abstract: A method for processing language input can include the step of determining at least two possible meanings for a language input. For each possible meaning, a probability that the possible meaning is a correct interpretation of the language input can be determined. At least one relative data computation can be computed based at least in part upon the probabilities. At least one irregularity within the language input can be detected based upon the relative delta computation. The irregularity can include mumble, ambiguous input, and/or compound input. At least one programmatic action can be performed responsive to the detection of the irregularity.

Type: Grant

Filed: October 1, 2003

Date of Patent: April 29, 2008

Assignee: International Business Machines Corporation

Inventors: Rajesh Balchandran, Linda M. Boyer
Method and apparatus for dynamic modification of command weights in a natural language understanding system

Patent number: 7349845

Abstract: A method and system for dynamically assigning weights to the subset of commands in a natural language dialog system based on prior context of the user's interaction with the system. The search space of the translation process may be reduced when some context information is available. A user presents input to the natural language understanding system. The system translates the user input into a formal command and calculates a weight value for a next set of formal commands based on the formal command. The command weights may then be dynamically boosted for the next set of formal commands before executing the formal command. The exemplary aspects of the present invention reduce the time needed to complete a task since the search space of the translation process may be reduced if context information is available and improve the accuracy of the process by using knowledge that users regularly use repeating patterns for repeating tasks.

Type: Grant

Filed: September 3, 2003

Date of Patent: March 25, 2008

Assignee: International Business Machines Corporation

Inventors: Daniel Mark Coffman, Jan Kleindienst, Ganesh N. Ramaswamy
Speech processing apparatus and mobile communication terminal

Patent number: 7330813

Abstract: A speech processing apparatus able to enhance formants more naturally, wherein a speech analyzing unit analyzes an input speech signal to find LPCs and converts the LPCs to LSPs, a speech decoding unit calculates a distance between adjacent orders of the LSPs by an LSP analytical processing unit and calculates LSP adjusting amounts of larger values for LSPs of adjacent orders closer in distance by an LSP adjusting amount calculating unit, an LSP adjusting unit adjusts the LSPs based on the LSP adjusting amounts such that the LSPs of adjacent orders closer in distance become closer, an LSP-LPC converting unit converts the adjusted LSPs to LPCs, and an LPC combining unit uses the LPCs and sound source parameters to obtain formant-enhanced speech.

Type: Grant

Filed: August 5, 2003

Date of Patent: February 12, 2008

Assignee: Fujitsu limited

Inventor: Mutsumi Saito
System and method for accented modification of a language model

Patent number: 7315811

Abstract: A system and method for a speech recognition technology that allows language models for a particular language to be customized through the addition of alternate pronunciations that are specific to the accent of the dictator, for a subset of the words in the language model. The system includes the steps of identifying the pronunciation differences that are best handled by modifying the pronunciations of the language model, identifying target words in the language model for pronunciation modification, and creating a accented speech file used to modify the language model.

Type: Grant

Filed: December 8, 2004

Date of Patent: January 1, 2008

Assignee: Dictaphone Corporation

Inventors: William F. Cote, Amy J. Uhrbach, Jill Carrier, Wensheng (Vincent) Han
Method and apparatus for smoothing fundamental frequency discontinuities across synthesized speech segments

Patent number: 7286986

Abstract: A method of smoothing fundamental frequency discontinuities at boundaries of concatenated speech segments includes determining, for each speech segment, a beginning fundamental frequency value and an ending fundamental frequency value. The method further includes adjusting the fundamental frequency contour of each of the speech segments according to a linear function calculated for each particular speech segment, and dependent on the beginning and ending fundamental frequency values of the corresponding speech segment. The method calculates the linear function for each speech segment according to a coupled spring model with three springs for each segment. A first spring constant, associated with the first spring and the second spring, is proportional to a duration of voicing in the associated speech segment. A second spring constant, associated with the third spring, models a non-linear restoring force that resists a change in slope of the segment fundamental frequency contour.

Type: Grant

Filed: August 1, 2003

Date of Patent: October 23, 2007

Assignee: Rhetorical Systems Limited

Inventor: David Talkin
System and method for speech generation from brain activity

Patent number: 7275035

Abstract: In a method of assisting a subject to generate speech, at least one first neural impulse is sensed from a first preselected location in the subject's brain. A first preselected sound is associated with the first neural impulse. The first preselected sound is generated in an audible format. In an apparatus for assisting the subject to generate speech, at least one sensor senses a neural impulse in the subject's brain and generates a signal representative thereof. An electronic speech generator generates a phoneme in response to the generation of the signal. An audio system generates audible sounds corresponding to the phoneme based upon the signal received from the speech generator.

Type: Grant

Filed: December 8, 2004

Date of Patent: September 25, 2007

Assignee: Neural Signals, Inc.

Inventor: Philip R. Kennedy