Patents Examined by David Hudspeth

Systems and methods for generating weighted finite-state automata representing grammars

Patent number: 7398197

Abstract: A context-free grammar can be represented by a weighted finite-state transducer. This representation can be used to efficiently compile that grammar into a weighted finite-state automaton that accepts the strings allowed by the grammar with the corresponding weights. The rules of a context-free grammar are input. A finite-state automaton is generated from the input rules. Strongly connected components of the finite-state automaton are identified. An automaton is generated for each strongly connected component. A topology that defines a number of states, and that uses active ones of the non-terminal symbols of the context-free grammar as the labels between those states, is defined. The topology is expanded by replacing a transition, and its beginning and end states, with the automaton that includes, as a state, the symbol used as the label on that transition. The topology can be fully expanded or dynamically expanded as required to recognize a particular input string.

Type: Grant

Filed: December 5, 2006

Date of Patent: July 8, 2008

Assignee: AT&T Corp.

Inventors: Mehryar Mohri, Mark-Jan Nederhof
Chinese romanization

Patent number: 7398199

Abstract: The invention disclose: Chinese spelling scheme, Chinese alphabetic writing and Phonetic Symbols scheme. The Chinese spelling scheme which is used English letters to mark the tones, it is reduced to 5 sound symbols of the monosyllabic words from 6 spelling letters of other schemes at most. Between the syllables is clearly demarcated. It can be used as marks in sound and tone for Chinese characters, and also can form an alphabetic writing independently which follows the law of international languages. It's highly integrated with word, sound, and code which can be displayed each other and converted each other. It can be translated directly with the Chinese-language sentence and foreign languages. The Phonetic Symbols do not go beyond the range of 26 letters and symbols of common used English keyboard, which can not only be used to mark in English and other languages, but also can be used as a phonetic symbol written language independently.

Type: Grant

Filed: March 23, 2004

Date of Patent: July 8, 2008

Inventor: Xue Sheng Gong
Modulating one or more parameters of an audio or video perceptual coding system in response to supplemental information

Patent number: 7395211

Abstract: A method of modifying the operation of the encoder function and/or the decoder function of a perceptual coding system in accordance with supplemental information, such as a watermark, so that the supplemental information may be detectable in the output of the decoder function. One or more parameters are modulated in the encoder function and/or the decoder function in response to the supplemental information.

Type: Grant

Filed: August 15, 2001

Date of Patent: July 1, 2008

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Matthew Aubrey Watson, Michael Mead Truman, Stephen Decker Vernon, Brett Graham Crockett
Progressive to lossless embedded audio coder (PLEAC) with multiple factorization reversible transform

Patent number: 7395210

Abstract: A system and method for lossless and/or progressive to lossless data coding (e.g., audio and/or image) is provided. The system and method employ a multiple factorization reversible transform component that provides quantized coefficients based, at least in part, upon a multiple factorization reversible transform. The multiple factorization reversible transform component can employ an N-point modulated lapped transform in accordance with one aspect of the present invention. The multiple factorization reversible transform component can comprise a modulation stage, a pre-FFT rotation stage, a complex FFT stage and a post-FFT rotation stage.

Type: Grant

Filed: November 21, 2002

Date of Patent: July 1, 2008

Assignee: Microsoft Corporation

Inventor: Jin Li
Method of and apparatus for improving research and/or communication interaction with animals such as dolphins and the like, and providing more facile communication with humans lacking speaking capability

Patent number: 7392192

Abstract: A novel method and microprocessor-controlled apparatus are provided for improving research and/or communication interaction between human and animals, such as dolphins and the like, or between normal humans and speech-impaired humans, by playing back audible pre-recorded human-language phrases corresponding to and descriptive of the meaning of sounds and/or gestures or responses produced by the animal or the speech-impaired human and in response to stimuli, conditions or environmental events to which the animal (or impaired human) is subjected, wherein the pre-recorded phrases represent what the human would say or describe if subjected to such stimuli, conditions or environmental events; such that during actual real-time interacting with the animals (or such impaired humans) one actually hears “from” the animal or speech-impaired human, spoken language phrases descriptive of their condition.

Type: Grant

Filed: October 25, 2002

Date of Patent: June 24, 2008

Inventor: Robert H. Rines
Method and device to distinguish between voice conversation and automated speech recognition

Patent number: 7392191

Abstract: A method and device for performing some preprocessing on voice transmissions depending upon the intended destination of the transmission. The device includes a receiving component configured to receive a voice signal from a source over a network. The device also includes a processing component configured to determine a destination address associated with the received signal, determine a signal processing algorithm from a plurality of signal processing algorithms based on the determined address, and process the voice signal according to the specified algorithm. The device further includes a delivery component configured to send the processed signal to the associated address.

Type: Grant

Filed: June 18, 2001

Date of Patent: June 24, 2008

Assignee: Intellisist, Inc.

Inventor: Gilad Odinak
Method and system for the automatic generation of speech features for scoring high entropy speech

Patent number: 7392187

Abstract: A method and system for automatically generating a scoring model for scoring a speech sample are disclosed. One or more training speech samples are received in response to a prompt. One or more speech features are determined for each of the training speech samples. A scoring model is then generated based on the speech features. At least one of the training speech samples may be a high entropy speech sample. An evaluation speech sample is received and a score is assigned to the evaluation speech sample using the scoring model. The evaluation speech sample may be a high entropy speech sample.

Type: Grant

Filed: September 20, 2004

Date of Patent: June 24, 2008

Assignee: Educational Testing Service

Inventors: Isaac Bejar, Klaus Zechner
System for speech recognition with multi-part recognition

Patent number: 7392189

Abstract: A speech recognition system for processing voice inputs from a user to select a list element from a list or group of list elements. Recognition procedures are carried out on the voice input of the user. One recognition procedure separates the voice input of a whole word into at least one sequence of speech subunits to produce a vocabulary of list elements. Another recognition procedure compares the voice input of the whole word with the vocabulary of list elements.

Type: Grant

Filed: February 21, 2003

Date of Patent: June 24, 2008

Assignee: Harman Becker Automotive Systems GmbH

Inventors: Marcus Hennecke, Walter Koch, Gerhard Nüβle, Richard Reng
Optimized windows and methods therefore for gradient-descent based window optimization for linear prediction analysis in the ITU-T G.723.1 speech coding standard

Patent number: 7389226

Abstract: Primary and alternate optimization procedures are used to improve the ITU-T G.723.1 speech coding standard (the “Standard”) by replacing the Hamming window of the Standard with an optimized window, with two windows, or with two windows and an additional performance of an autocorrelation method. When two windows replace the Hamming window, at least one of which is an optimized window, generally the first is used to determine optimized unquantized LP coefficients which are used to define an optimized perceptual weighting filter, and the second is used to determine optimized unquantized LP coefficients which are used to determine optimized synthesis coefficients. Optimized windows created using the primary and alternate optimization procedures and used in the Standard yield improvements in the objective and subjective quality of synthesized speech produced by the Standard. The improved Standard, methods, and window can all be implemented as computer readable software code.

Type: Grant

Filed: December 17, 2002

Date of Patent: June 17, 2008

Assignee: NTT Docomo, Inc.

Inventor: Wai C. Chu
Method and apparatus for testing a software program using mock translation input method editor

Patent number: 7389223

Abstract: A method, apparatus, and computer instructions for testing software programs running on a data processing system. Text is translated from the source language to the target language to form translated text in response to a user input, containing the text in a source language. The text is entered through a computer interface in the data processing system. The translated text is inserted into a user interface of the software program to be tested to form inserted, translated text. The software program is written using the target language. A determination is made as to whether the software program functions correctly using the inserted, translated text.

Type: Grant

Filed: September 18, 2003

Date of Patent: June 17, 2008

Assignee: International Business Machines Corporation

Inventors: Steven Edward Atkin, Joseph C. Ross, Minto Tsai, Keiichi Yamamoto
Task parallelization in a text-to-text system

Patent number: 7389222

Abstract: Parallelization of word alignment for a text-to-text operation. The training data is divided into multiple groups, and training is carried out of each group on separate processors. Different techniques can be carried out to increase the speed of the processing. The hookups can be done only once for all of multiple different iterations. Moreover, parallel operations can apply only to the counts, since this may be the most time-consuming part.

Type: Grant

Filed: April 26, 2006

Date of Patent: June 17, 2008

Assignee: Language Weaver, Inc.

Inventors: Greg Langmead, Kenji Yamada, Kevin Knight, Daniel Marcu
System and method for classification of voice signals

Patent number: 7389230

Abstract: A system and method for classifying a voice signal to one of a set of predefined categories, based upon a statistical analysis of features extracted from the voice signal. The system includes an acoustic processor and a classifier. The acoustic processor extracts features that are characteristic of the voice signal and generates feature vectors using the extracted spectral features. The classifier uses the feature vectors to compute the probability that the voice signal belongs to each of the predefined categories and classifies the voice signal to a predefined category that is associated with the highest probability.

Type: Grant

Filed: April 22, 2003

Date of Patent: June 17, 2008

Assignee: International Business Machines Corporation

Inventor: Israel Nelken
Method and system for unified speech and graphic user interfaces

Patent number: 7389235

Abstract: A method for unifying speech user interface and graphic user interface commands includes the steps of receiving (52) user entered text via a GUI, processing (54) the user-entered text, monitoring (60) the user-entered text, adding input context (62) to the user-entered text, and, updating (74, 76, and 78) a speech recognizer with the user-entered text and semantic information. Updating the speech recognizer can include the step of accepting new text information and input context to update a speech grammar (74) and recognition vocabulary of the speech recognizer. The method can include the step of updating the GUI (72) by updating GUI directives (68) and elements (70) to maintain the GUI unified with the speech grammar. The method can further include the step of forming a window (402) enabling the display of a speech interface command as a user constructs the speech interface command using the GUI (400).

Type: Grant

Filed: September 30, 2003

Date of Patent: June 17, 2008

Assignee: Motorola, Inc.

Inventor: Joseph L. Dvorak
Natural error handling in speech recognition

Patent number: 7386454

Abstract: A user interface, and associated techniques, that permit a fast and efficient way of correcting speech recognition errors, or of diminishing their impact. The user may correct mistakes in a natural way, essentially by repeating the information that was incorrectly recognized previously. Such a mechanism closely approximates what human-to-human dialogue would be in similar circumstances. Such a system fully takes advantage of all the information provided by the user, and on its own estimates the quality of the recognition in order to determine the correct sequence of words in the fewest number of steps.

Type: Grant

Filed: July 31, 2002

Date of Patent: June 10, 2008

Assignee: International Business Machines Corporation

Inventors: Ramesh A. Gopinath, Benoit Maison, Brian C. Wu
Identifying language attributes through probabilistic analysis

Patent number: 7386438

Abstract: A system and method for identifying language attributes through probabilistic analysis is described. A set of language classes and a plurality of training documents are defined, Each language class identifies a language and a character set encoding. Occurrences of one or more document properties within each training document are evaluated. For each language class, a probability for the document properties set conditioned on the occurrence of the language class is calculated. Byte occurrences within each training document are evaluated. For each language class, a probability for the byte occurrences conditioned on the occurrence of the language class is calculated.

Type: Grant

Filed: August 4, 2003

Date of Patent: June 10, 2008

Assignee: Google Inc.

Inventors: Alexander Franz, Brian Milch, Eric Jackson, Jenny Zhou, Benjamin Diament
Code, system and method for representing a natural-language text in a form suitable for text manipulation

Patent number: 7386442

Abstract: A computer method, system and code, for representing a natural-language document in a vector form suitable for text manipulation operations are disclosed. The method involves determining (a) for each of a plurality of terms selected from one of (i) non-generic words in the document, (ii) proximately arranged word groups in the document, and (iii) a combination of (i) and (ii), a selectivity value of the term related to the frequency of occurrence of that term in a library of texts in one field, relative to the frequency of occurrence of the same term in one or more other libraries of texts in one or more other fields, respectively. The document is represented as a vector of terms, where the coefficient assigned to each term includes a function of the selectivity value determined for that term.

Type: Grant

Filed: July 1, 2003

Date of Patent: June 10, 2008

Assignee: Word Data Corp.

Inventors: Peter J. Dehlinger, Shao Chin
Optimization of an objective measure for estimating mean opinion score of synthesized speech

Patent number: 7386451

Abstract: A method is provided for optimizing an objective measure used to estimate mean opinion score or naturalness of synthesized speech from a speech synthesizer. The method includes using an objective measure that has components derived directly from textual information used to form synthesized utterances. The objective measure has a high correlation with mean opinion score such that a relationship can be formed between the objective measure and corresponding mean opinion score. The objective measure is altered to provide a different function of textual information derived from the utterances so as to improve the relationship between the scores of the objective measure and subjective ratings of the synthesized utterances.

Type: Grant

Filed: September 11, 2003

Date of Patent: June 10, 2008

Assignee: Microsoft Corporation

Inventors: Min Chu, Hu Peng, Yong Zhao
Pitch adaptive equalization for improved audio

Patent number: 7383175

Abstract: A pitch adaptive circuit (200) includes an equalizer control circuit (206) that evaluates the pitch of the speech signals that are being processed and depending on the pitch information, the equalizer control circuit (206) selects an equalizer (208, 210) to shape the decoded speech signals. By selecting the best equalizer (208 or 210) to use based on the pitch information, improvements in audio quality are provided automatically without user intervention.

Type: Grant

Filed: March 25, 2003

Date of Patent: June 3, 2008

Assignee: Motorola, Inc.

Inventors: Patrick J. Doran, Stephen S. Shiao
Semantic stenography using short note input data

Patent number: 7383171

Abstract: A method and apparatus converts input data such as short notes into a global text realization to provide semantically-coherent grammatical text. In various exemplary embodiments, an individual inputs short notes into a computer system, the computer system associates local text realizations with the short notes. Subsequently, the user may select the appropriate local text realizations, which may be converted to semantic representations and to semantically coherent grammatical text or a global text realization.

Type: Grant

Filed: December 5, 2003

Date of Patent: June 3, 2008

Assignee: Xerox Corporation

Inventors: Marc Dymetman, Caroline Brun, Aurelien Max
System and method for analyzing automatic speech recognition performance data

Patent number: 7383170

Abstract: In a disclosed method for interpreting automatic speech recognition (ASR) performance data, a data processing system may receive user input that selects a log file to be processed. The log file may contain log records produced by an ASR system as a result of verbal interaction between an individual and the ASR system. In response to receiving the user input, the data processing system may automatically interpret data in the log records and generate interpretation results. The interpretation results may include a duration for a system prompt communicated to the individual by the ASR system, a user response to the system prompt, and a duration for the user response. The user response may include a textual representation of a verbal response from the individual, obtained through ASR. The interpretation results may also include an overall duration for the telephone call.

Type: Grant

Filed: October 10, 2003

Date of Patent: June 3, 2008

Assignee: AT&T Knowledge Ventures, L.P.

Inventors: Scott H. Mills, John M. Martin

prev 1 2 3 4 5 6 7 8 9 … next