Patents Examined by Talivaldis Ivars {hacek over (S)}mits

Statistically driven sentence realizing method and apparatus

Patent number: 7266491

Abstract: A method of, and system for, generating a sentence from a semantic representation maps the semantic representation to an unordered set of syntactic nodes. Simplified generation grammar rules and statistical goodness measure values from a corresponding analysis grammar are then used to create a tree structure to order the syntactic nodes. The sentence is then generated from the tree structure. The generation grammar is a simplified (context free) version of a corresponding full (context sensitive) analysis grammar. In the generation grammar, conditions on each rule are ignored except those directly related to the semantic representation. The statistical goodness measure values, which are calculated through an analysis training phase in which a corpus of example sentences is processed using the full analysis grammar, are used to guide the generation choice to prefer substructures most commonly found in a particular syntactic/semantic context during analysis.

Type: Grant

Filed: June 14, 2005

Date of Patent: September 4, 2007

Assignee: Microsoft Corporation

Inventors: Kevin W. Humphreys, David Neal Weise, Michael V. Calcagno
Enhanced go-back feature system and method for use in a voice portal

Patent number: 7260530

Abstract: A system, method and computer program product are provided for transitioning among states during use of a voice browser. Initially, a script is received at a voice browser from a web server utilizing a network. Next, the script is executed utilizing the voice browser. A plurality of states is then automatically tracked during the execution of the script utilizing the voice browser. Upon receiving a request from a user during the execution of the script to transition to a previous state, the voice browser automatically transitions to the previous state of the script.

Type: Grant

Filed: February 15, 2002

Date of Patent: August 21, 2007

Assignee: BeVocal, Inc.

Inventor: Laura A. Werner
Disambiguating results within a speech based IVR session

Patent number: 7260537

Abstract: Within an interactive voice response system, a method of automatically disambiguating results presented to a user can include determining the identity of a user within an interactive voice response session, receiving user inputs specifying selections in an interactive voice response menu hierarchy, and storing historical information specifying the user selections within a profile associated with the identity of the user. For at least one subsequent input from the user, identifying the historical information associated with the identity of the user and using the historical information to reduce a number of possible selections in the interactive voice response menu hierarchy which are presented to the user.

Type: Grant

Filed: March 25, 2003

Date of Patent: August 21, 2007

Assignee: International Business Machines Corporation

Inventors: Thomas E. Creamer, Brent L. Davis, Peeyush Jaiswal, Victor S. Moore
Method and apparatus for equalizing a speech signal generated within a pressurized air delivery system

Patent number: 7254535

Abstract: A method for equalizing a speech signal generated within a pressurized air delivery system, the method including the steps of: generating an inhalation noise model (1152) based on inhalation noise; receiving an input signal (802) that includes a speech signal; and equalizing the speech signal (1156) based on the noise model.

Type: Grant

Filed: June 30, 2004

Date of Patent: August 7, 2007

Assignee: Motorola, Inc.

Inventors: William M. Kushner, Sara M. Harton, Mark A. Jasiuk
Computer-aided reading system and method with cross-language reading wizard

Patent number: 7254527

Abstract: A computer-aided reading system offers assistance to a user who is reading in a non-native language, as the user needs help, without requiring the user to divert attention away from the text. In one implementation, the reading system is implemented as a reading wizard for a browser program. The reading wizard is exposed via a graphical user interface (UI) that allows the user to select a word, phrase, sentence, or other grouping of words in the non-native text. The reading wizard automatically determines whether the selected one word comprises part of a phrase; allows the user to choose whether to view a translation of a single word or a translation of a phrase that includes the single word in response to selection by the user of the single word. The multiple translations are presented in a pop-up window, in the form of a scrollable box and is scrollable, located near the selected text to minimize distraction of the user.

Type: Grant

Filed: October 15, 2004

Date of Patent: August 7, 2007

Assignee: Microsoft Corporation

Inventor: Endong Xun
System and method of generating dictionary entries

Patent number: 7254530

Abstract: A system for automatically generating a dictionary from full text articles extracts <term, definition> pairs from full text articles and stores the <term, definition> pairs as dictionary entries. The system includes a computer readable corpus having a plurality of documents therein. A pattern processing module (120) and a grammar processing module (125) are provided for extracting <term, definition> pairs from the corpus and storing the <term, definition> pairs in a dictionary database (145). A routing processing module selectively routes sentences in the corpus to at least one of the pattern processing module or grammar processing module. In one embodiment, the routing module is incorporated into the pattern processing module which then selectively routes a portion of the sentences to the grammar processing module.

Type: Grant

Filed: September 26, 2002

Date of Patent: August 7, 2007

Assignee: The Trustees of Columbia University in the City of New York

Inventors: Judith L. Klavans, Smaranda Muresan
Simultaneous plural-voice text-to-speech synthesizer

Patent number: 7249021

Abstract: A multiple-voice instructing unit (17) instructs pitch deforming ratio and mixing ratio to a multiple-voice synthesis unit (16). The multiple voice synthesis unit (16) generates a standard voice signal by means of waveform superimposition based on voice element data read from a voice element database (15) and prosodic information from a voice element selecting unit (14), expands/contracts the time base of the above standard voice signal based on the prosodic information and instruction information from the multiple-voice instructing unit (17) to change a voice pitch, and mixes the standard voice signal with an expansion/contraction voice signal for outputting via an output terminal (18). Accordingly, a concurrent vocalization by multiple speakers based on the same text can be implemented without the need of time-division, parallel text analyzing and prosody generating and of adding pitch converting as post-processing.

Type: Grant

Filed: December 27, 2001

Date of Patent: July 24, 2007

Assignee: Sharp Kabushiki Kaisha

Inventors: Tomokazu Morio, Osamu Kimura
Statistical method and apparatus for learning translation relationships among phrases

Patent number: 7249012

Abstract: The present invention learns phrase translation relationships by receiving a parallel aligned corpus with phrases to be learned identified in a source language. Candidate phrases in a target language are generated and an inside score is calculated based on word association scores for words inside the source language phrase and candidate phrase. An outside score is calculated based on word association scores for words outside the source language phrase and candidate phrase. The inside and outside scores are combined to obtain a joint score.

Type: Grant

Filed: November 20, 2002

Date of Patent: July 24, 2007

Assignee: Microsoft Corporation

Inventor: Robert C. Moore
Audio segmentation with energy-weighted bandwidth bias

Patent number: 7243062

Abstract: A method (200) and apparatus (100) for segmenting a sequence of audio samples into homogeneous segments (550 and 555) are disclosed. The method (200) forms a sequence of frames (701 to 704) along the sequence of audio samples, and extracts, for each frame, a data feature. The data features form a sequence of data features. Transition points in the sequence of data features are thin detected by applying the Bayesian Information Criterion to the sequence of data features. The transition points define the homogeneous segments (550 and 555). Preferably the data feature is single-dimensional and a leptokurtic distribution is used as an event model in the Bayesian Information Criterion.

Type: Grant

Filed: October 25, 2002

Date of Patent: July 10, 2007

Assignee: Canon Kabushiki Kaisha

Inventor: Timothy John Wark
Acronym extraction system and method of identifying acronyms and extracting corresponding expansions from text

Patent number: 7236923

Abstract: An acronym expansion system of the present invention receives electronic documents and extracts acronyms and their corresponding expansions. A part-of-speech tagger decomposes text into string tokens or words and tags them with their part-of-speech, while an acronym identifier determines whether a word is a potential acronym based on various conditions. An expansion identifier retrieves lists of words preceding and following a potential acronym to search for the expansion. The resulting word lists are examined sequentially to identify and retrieve an expansion for the potential acronym. An expansion extractor receives the potential acronym and a processed word list to retrieve the expansion of the potential acronym from that list. The extractor may utilize information from prior search iterations, and verifies an extracted expansion against a set of rules to remove spurious expansions.

Type: Grant

Filed: August 7, 2002

Date of Patent: June 26, 2007

Assignee: ITT Manufacturing Enterprises, Inc.

Inventor: Kalyan M Gupta
Speech recognition system using normalized voiced segment spectrogram analysis

Patent number: 7233899

Abstract: Computer comparison of one or more dictionary entries with a sound record of a human utterance to determine whether and where each dictionary entry is contained within the sound record. The record is segmented, and for each vocalized segment a spectrogram is obtained, and for other segments symbolic and numeric data are obtained. The spectrogram of a vocalized segment is then processed using a method selected from a group consisting of a triple time transform, a triple frequency transform, a linear-piecewise-linear transform, and combinations thereof, to decrease noise and to eliminate variations in pronunciation. Each entry in the dictionary is then compared with every sequence of segments of substantially the same length in the sound record. The comparison takes into account the formant profiles within each vocalized segment and symbolic and numeric data for other segments are obtained in the record and in the dictionary entries.

Type: Grant

Filed: March 7, 2002

Date of Patent: June 19, 2007

Inventors: Vitaliy S. Fain, Samuel V. Fain
Computer-aided reading system and method with cross-language reading wizard

Patent number: 7228268

Abstract: A computer-aided reading system offers assistance to a user who is reading in a non-native language, as the user needs help, without requiring the user to divert attention away from the text. In one implementation, the reading system is implemented as a reading wizard for a browser program. The reading wizard is exposed via a graphical user interface (UI) that allows the user to select a word, phrase, sentence, or other grouping of words in the non-native text. The reading wizard automatically determines whether the selected one word comprises part of a phrase; presents one or more translations of at least the selected word in a native language or, if the selected word comprises part of a phrase, presents at least one translation of the phrase in a native language. The multiple translations are presented in a pop-up window, in the form of a scrollable box and is scrollable, located near the selected text to minimize distraction of the user.

Type: Grant

Filed: October 15, 2004

Date of Patent: June 5, 2007

Assignee: Microsoft Corporation

Inventor: Endong Xun
Computer-aided reading system and method with cross-language reading wizard

Patent number: 7228269

Abstract: A computer-aided reading system offers assistance to a user who is reading in a non-native language, as the user needs help, without requiring the user to divert attention away from the text. In one implementation, the reading system is implemented as a reading wizard for a browser program. The reading wizard is exposed via a graphical user interface (UI) that allows the user to select a word, phrase, sentence, or other grouping of words in the non-native text, and view multiple translations of the selected text in the user's own native language. The multiple translations are presented in a pop-up window, in the form of a scrollable box and is scrollable, located near the selected text to minimize distraction.

Type: Grant

Filed: October 15, 2004

Date of Patent: June 5, 2007

Assignee: Microsoft Corporation

Inventor: Endong Xun
Voice control method

Patent number: 7228273

Abstract: A voice control method that allows vocal characteristics of a character to diversely be set in a computer game where characters are capable of voice output is provided. The voice control method comprises, converting a voice that is externally input or provided in advance, based upon attribute information on the character; and an output step for outputting the converted voice as voice of the character. According to this method, the voice produced by a character that appears in a computer game can be set in accordance with the character's characteristics and various voices for each character set by each player can be created.

Type: Grant

Filed: November 12, 2002

Date of Patent: June 5, 2007

Assignee: Sega Corporation

Inventor: Yutaka Okunoki
Generating with Lexical Functional Grammars

Patent number: 7225121

Abstract: A process for generating with unification based grammars such as Lexical Functional Grammars which uses construction and analysis of generation guides to determine internal facts and eliminate incomplete edges prior to constructing a generation chart. The generation guide can then be used in the construction of the generation chart to efficiently generate with unification-based grammars such as Lexical Functional Grammars. The generation guide is an instance of a grammar that has been specialized to the input and only contains those parts of the grammar that are relevant to the input. When the generation guide is analyzed to determine internal facts a smaller generation chart is produced.

Type: Grant

Filed: September 27, 2002

Date of Patent: May 29, 2007

Assignee: Palo Alto Research Center Incorporated

Inventors: John T. Maxwell, III, Hadar Shemtov
Audio signal encoding method combining codes having different frame lengths and data rates

Patent number: 7222068

Abstract: A system for transmitting audio signals over a telecommunications link generates the signals as two or more alternative feeds, for example at different data rates. The two feeds are encoded using coding methods having a frame structure with different frame lengths. To facilitate switching between the two, the input signal is notionally divided into temporal portions and each is coded by taking it, plus enough of the next (or preceding) portion to make up a whole number of frames, and encoding it, whereby the encoded portions overlap—at least for one of the feeds. The overlap is lost upon decoding by discarding duplicate material.

Type: Grant

Filed: November 19, 2001

Date of Patent: May 22, 2007

Assignee: British Telecommunications public limited company

Inventors: Anthony R Leaning, Richard J Whiting
Systems and methods for dynamic re-configurable speech recognition

Patent number: 7209880

Abstract: Speech recognition models are dynamically re-configurable based on user information, application information, background information such as background noise and transducer information such as transducer response characteristics to provide users with alternate input modes to keyboard text entry. Word recognition lattices are generated for each data field of an application and dynamically concatenated into a single word recognition lattice. A language model is applied to the concatenated word recognition lattice to determine the relationships between the word recognition lattices and repeated until the generated word recognition lattices are acceptable or differ from a predetermined value only by a threshold amount. These techniques of dynamic re-configurable speech recognition provide for deployment of speech recognition on small devices such as mobile phones and personal digital assistants as well environments such as office, home or vehicle while maintaining the accuracy of the speech recognition.

Type: Grant

Filed: March 6, 2002

Date of Patent: April 24, 2007

Assignee: AT&T Corp.

Inventors: Bojana Gajic, Shrikanth Sambasivan Narayanan, Sarangarajan Parthasarathy, Richard Cameron Rose, Aaron Edward Rosenberg
Efficient excitation quantization in noise feedback coding with general noise shaping

Patent number: 7206740

Abstract: In a Noise Feedback Coding (NFC) system operable in a ZERO-STATE condition and a ZERO-INPUT condition, the NFC system including at least one filter having a filter memory, a method of updating the filter memory. The method comprises: (a) producing a ZERO-STATE contribution to the filter memory when the NFC system is in the ZERO-STATE condition; (b) producing a ZERO-INPUT contribution to the filter memory when the NFC system is in the ZERO-INPUT condition; and (c) updating the filter memory as a function of both the ZERO-STATE contribution and the ZERO-INPUT contribution.

Type: Grant

Filed: August 12, 2002

Date of Patent: April 17, 2007

Assignee: Broadcom Corporation

Inventors: Jes Thyssen, Juin-Hwey Chen
Prosody generating device, prosody generating method, and program

Patent number: 7200558

Abstract: A prosody generation apparatus capable of suppressing distortion that occurs when generating prosodic patterns and therefore generating a natural prosody is provided. A prosody changing point extraction unit in this apparatus extracts a prosody changing point located at the beginning and the ending of a sentence, the beginning and the ending of a breath group, an accent position and the like. A selection rule and a transformation rule of a prosodic pattern including the prosody changing point is generated by means of a statistical or learning technique and the thus generate rules are stored in a representative prosodic pattern selection rule table and a transformation rule table beforehand. A pattern selection unit selects a representative prosodic pattern from the representative prosodic pattern selection rule table according to the selection rule.

Type: Grant

Filed: March 8, 2002

Date of Patent: April 3, 2007

Assignee: Matsushita Electric Industrial Co., Ltd.

Inventors: Yumiko Kato, Takahiro Kamai
System for handling frequently asked questions in a natural language dialog service

Patent number: 7197460

Abstract: A voice-enabled help desk service is disclosed. The service comprises an automatic speech recognition module for recognizing speech from a user, a spoken language understanding module for understanding the output from the automatic speech recognition module, a dialog management module for generating a response to speech from the user, a natural voices text-to-speech synthesis module for synthesizing speech to generate the response to the user, and a frequently asked questions module. The frequently asked questions module handles frequently asked questions from the user by changing voices and providing predetermined prompts to answer the frequently asked question.

Type: Grant

Filed: December 19, 2002

Date of Patent: March 27, 2007

Assignee: AT&T Corp.

Inventors: Narendra K. Gupta, Mazin G Rahim, Giuseppe Riccardi

prev 1 2 3 4 5 6 7 next