Patents Examined by Matthew J. Sked

Conceptual world representation natural language understanding system and method

Patent number: 7917354

Abstract: A Natural Language Understanding system is provided for indexing of free text documents. The system according to the invention utilizes typographical and functional segmentation of text to identify those portions of free text that carry meaning. The system then uses words and multi-word terms and phrases identified in the free to text to identify concepts in the free text. The system uses a lexicon of terms linked to a formal ontology that is independent of a specific language to extract concepts from the free text based on the words and multi-word terms in the free text. The formal ontology contains both language independent domain knowledge concepts and language dependent linguistic concepts that govern the relationships between concepts and contain the rules about how language works. The system according to the current invention may preferably be used to index medical documents and assign codes from independent coding systems, such as, SNOMED, ICD-9 and ICD-10.

Type: Grant

Filed: February 13, 2009

Date of Patent: March 29, 2011

Assignee: Nuance Communications, Inc.

Inventors: Werner Ceusters, Mick O'Donnell, Frank Montyne, Frederik Coppens, Maarten Van Mol
Word detection

Patent number: 7917355

Abstract: Methods, systems, and apparatus, including computer program products, in which data from web documents are partitioned into a training corpus and a development corpus are provided. First word probabilities for words are determined for the training corpus, and second word probabilities for the words are determined for the development corpus. Uncertainty values based on the word probabilities for the training corpus and the development corpus are compared, and new words are identified based on the comparison.

Type: Grant

Filed: August 23, 2007

Date of Patent: March 29, 2011

Assignee: Google Inc.

Inventors: Jun Wu, Tang Xi Liu, Feng Hong, Yonggang Wang, Bo Yang, Lei Zhang
Spoken language identification system and methods for training and operating same

Patent number: 7917361

Abstract: A method for training a spoken language identification system to identify an unknown language as one of a plurality of known candidate languages includes the process of creating a sound inventory comprising a plurality of sound tokens, the collective plurality of sound tokens provided from a subset of the known candidate languages. The method further includes providing a plurality of training samples, each training sample composed within one of the known candidate languages. Further included is the process of generating one or more training vectors from each training database, wherein each training vector is defined as a function of said plurality of sound tokens provided from said subset of the known candidate languages. The method further includes associating each training vector with the candidate language of the corresponding training sample.

Type: Grant

Filed: September 19, 2005

Date of Patent: March 29, 2011

Assignee: Agency for Science, Technology and Research

Inventors: Haizhou Li, Bin Ma, George M. White
Context based word prediction

Patent number: 7912700

Abstract: Context-based word prediction is provided. A software application utilizes words contained in an application document to provide context-based word prediction in the same or a related document. The software application creates an application defined data source and populates the data source with words occurring in a document. When the same or a related document is being edited via an input method, for example, typing, speech recognition, electronic handwriting, etc., a prediction engine presents candidate words from the application defined data source that match current text input, and the user may choose from the presented candidate words for automatic population into the document being edited. Information from the application defined data source may be transferred between computing devices, for example, between a mobile computing device and a desktop (non-mobile) computing device.

Type: Grant

Filed: February 8, 2007

Date of Patent: March 22, 2011

Assignee: Microsoft Corporation

Inventors: Jason Bower, Kenji Furuuchi, Simon Liu, Kenichi Morimoto, Daryn Robbins, Chet Laughlin, Peter Davis
Apparatus and method for integrated phrase-based and free-form speech-to-speech translation

Patent number: 7912727

Abstract: An apparatus and method that integrates both phrase-based and free-form speech-to-speech translation approaches using probability models. The starting step of the method is to receive vocal communication in a source language. Then store the received vocal communication. Then decipher the content of the vocal communication. Then locate in a multilingual dictionary module the corresponding translation of the deciphered vocal communication provided a preset sentence exists in a speech recognition module for the vocal communication. Then translate the vocal communication into the target language provided there is no corresponding translation located in the multilingual dictionary module. Then synthesize the translated target language when there is no corresponding translation for the vocal communication in the multilingual dictionary module. Then store the sound of the translated target language. Then play the sound of the translated target language.

Type: Grant

Filed: May 29, 2008

Date of Patent: March 22, 2011

Assignee: International Business Machines Corporation

Inventors: Yuqing Gao, Liang Gu, Hong-Kwang Kuo
Scalable encoding device, scalable decoding device, and method thereof

Patent number: 7904292

Abstract: A scalable encoding device for realizing scalable encoding by CELP encoding of a stereo sound signal and improving the encoding efficiency. In this device, an adder and a multiplier obtain an average of a first channel signal CH1 and a second channel signal CH2 as a monaural signal M. A CELP encoder for a monaural signal subjects the monaural signal M to CELP encoding, outputs the obtained encoded parameter to outside, and outputs a synthesized monaural signal M? synthesized by using the encoded parameter to a first channel signal encoder. By using the synthesized monaural signal M? and the second channel signal CH2, the first channel signal encoder subjects the first channel signal CH1 to CELP encoding to minimize the sum of the encoding distortion of the first channel signal CH1 and the encoding distortion of the second channel signal CH2.

Type: Grant

Filed: September 28, 2005

Date of Patent: March 8, 2011

Assignee: Panasonic Corporation

Inventors: Michiyo Goto, Koji Yoshida, Hiroyuki Ehara, Masahiro Oshikiri
Device and method of modeling acoustic characteristics with HMM and collating the same with a voice characteristic vector sequence

Patent number: 7895040

Abstract: According to an embodiment, voice recognition apparatus includes units of: acoustic processing, voice interval detecting, dictionary, collating, search target selecting, storing and determining, and voice recognition method includes processes of: selecting a search range on basis of a beam search, setting and storing a standard frame, storing an output probability of a certain transition path, determining whether or not the output probability of a certain path is stored. Number of times of calculation of the output probability is reduced by selecting the search range on basis of the beam search, calculating the output probability of the certain transition path only once in an interval from when the standard frame is set to when the standard frame is renewed, and storing and using thus calculated value as an approximate value of the output probability in subsequent frames.

Type: Grant

Filed: March 30, 2007

Date of Patent: February 22, 2011

Assignee: Kabushiki Kaisha Toshiba

Inventors: Masaru Sakai, Shinichi Tanaka
Efficient presentation of correction options in a speech interface based upon user selection probability

Patent number: 7885816

Abstract: A method, a system, and an apparatus for efficiently presenting correction options. The present invention is capable of analyzing user voice commands and sorting multiple input requests based on user selection probability to determine whether a confirmation step should be presented and, if so, the manner in which the confirmation step should be presented. In particular, the method requests an information input from the user and then assigns a confidence level to the information input. If the confidence level is LOW, then the system performs an immediate confirmation step. If the confidence level assigned is MEDIUM or HIGH, then the information is placed into a data set that is confirmed in a batch confirmation step. The batch confirmation step presents the captured information to the user for confirmation. If any of the information is incorrect, then the method sorts the information in ascending order by confidence level and creates a menu of items that may be changed. The user then makes the change.

Type: Grant

Filed: December 8, 2003

Date of Patent: February 8, 2011

Assignee: International Business Machines Corporation

Inventors: Brent L. Davis, J. Scott Gee, James R. Lewis, Vanessa V. Michelini, Melanie D. Polkosky
Method and apparatus for enrollment and verification of speaker authentication

Patent number: 7877254

Abstract: The present invention provides a method and apparatus for enrollment and verification of speaker authentication. The method for enrollment of speaker authentication, comprising: extracting an acoustic feature vector sequence from an enrollment utterance of a speaker; and generating a speaker template using the acoustic feature vector sequence; wherein said step of extracting an acoustic feature vector sequence comprises: generating a filter-bank for the enrollment utterance of the speaker for filtering locations and energies of formants in the spectrum of the enrollment utterance based on the enrollment utterance; filtering the spectrum of the enrollment utterance by the generated filter-bank; and generating the acoustic feature vector sequence from the filtered enrollment utterance.

Type: Grant

Filed: March 28, 2007

Date of Patent: January 25, 2011

Assignee: Kabushiki Kaisha Toshiba

Inventors: Jian Luan, Pei Ding, Lei He, Jie Hao
Representing n-gram language models for compact storage and fast retrieval

Patent number: 7877258

Abstract: Systems, methods, and apparatuses, including computer program products, are provided for representing language models. In some implementations, a computer-implemented method is provided. The method includes generating a compact language model including receiving a collection of n-grams from the corpus, each n-gram of the collection having a corresponding first probability of occurring in the corpus and generating a trie representing the collection of n-grams. The method also includes using the language model to identify a second probability of a particular string of words occurring.

Type: Grant

Filed: March 29, 2007

Date of Patent: January 25, 2011

Assignee: Google Inc.

Inventors: Ciprian Chelba, Thorsten Brants
Audio encoder, audio decoder and audio processor having a dynamically variable warping characteristic

Patent number: 7873511

Abstract: An audio encoder, an audio decoder or an audio processor includes a filter for generating a filtered audio signal, the filter having a variable warping characteristic, the characteristic being controllable in response to a time-varying control signal, the control signal indicating a small or no warping characteristic or a comparatively high warping characteristic. Furthermore, a controller is connected for providing the time-varying control signal, which depends on the audio signal. The filtered audio signal can be introduced to an encoding processor having different encoding algorithms, one of which is a coding algorithm adapted to a specific signal pattern. Alternatively, the filter is a post-filter receiving a decoded audio signal.

Type: Grant

Filed: June 30, 2006

Date of Patent: January 18, 2011

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventors: Juergen Herre, Bernhard Grill, Markus Multrus, Stefan Bayer, Ulrich Kraemer, Jens Hirschfeld, Stefan Wabnik, Gerald Schuller
Motor vehicle with a speech interface

Patent number: 7873517

Abstract: A motor vehicle has a speech interface for an acoustic input of commands for operating the motor vehicle or a module of the motor vehicle. The speech interface includes a speech recognition database in which a substantial portion of commands or command components, which can be input, are stored in a version according to a pronunciation in a first language and in a version according to a pronunciation in at least a second language, and a speech recognition engine for automatically comparing an acoustic command to commands and/or command components, which are stored in the speech recognition database, in a version according to the pronunciation in the first language and to commands and/or command components, which are stored in the speech recognition database, in a version according to the pronunciation in the second language.

Type: Grant

Filed: November 9, 2006

Date of Patent: January 18, 2011

Assignee: Volkswagen of America, Inc.

Inventors: Ramon Prieto, M. Kashif Imam, Carsten Bergmann, Wai Yin Cheung, Carly Williams
Transient noise removal system using wavelets

Patent number: 7869994

Abstract: A transient noise removal system removes or dampens undesired transients from speech. When the transient noise removal system receives a speech frame, the system performs a wavelet transform analysis. The speech frame may be represented by one or more wavelet coefficients across one or more wavelet levels. For a given wavelet level, the transient noise-removal system may determine a wavelet threshold. The transient noise removal system may compare the threshold corresponding to a wavelet level to the wavelet coefficients within that level. The transient noise removal system may attenuate each wavelet coefficient based on a comparison to a threshold.

Type: Grant

Filed: January 30, 2007

Date of Patent: January 11, 2011

Assignee: QNX Software Systems Co.

Inventors: Rajeev Nongpiur, Shreyas A. Paranjpe, Phillip A. Hetherington
Group foreign language teaching system and method

Patent number: 7869988

Abstract: A method and system for teaching a foreign language to a user who has knowledge of a base language is disclosed. The method and system may include delivering a video presentation simultaneously to a plurality of users. The method and system may also include simultaneously delivering a plurality of mixed known language-foreign language audio and/or text streams to the plurality of users, each of the plurality of mixed known language-foreign language audio and/or text streams corresponding to the video presentation.

Type: Grant

Filed: November 3, 2006

Date of Patent: January 11, 2011

Assignee: K12 Inc.

Inventors: Michael C. Wood, Jonathan Ram Dariyanani
Avoiding repeated misunderstandings in spoken dialog system

Patent number: 7865364

Abstract: A method for improving speech recognition accuracy includes utilizing skiplists or lists of values that cannot occur because of improbability or impossibility. A table or list is stored in a dialog manager module. The table includes a plurality of information items and a corresponding list of improbable values for each of the plurality of information items. A plurality of recognized ordered interpretations is received from an automatic speech recognition (ASR) engine. Each of the plurality of recognized ordered interpretations each includes a number of information items. A value of one or more of the received information items for a first recognized ordered interpretation is compared to a table to determine if the value of the one of the received information items matches any of the list of improbable values for the corresponding information item.

Type: Grant

Filed: May 5, 2006

Date of Patent: January 4, 2011

Assignee: Nuance Communications, Inc.

Inventor: Marc Helbing
Disfluency detection for a speech-to-speech translation system using phrase-level machine translation with weighted finite state transducers

Patent number: 7860719

Abstract: A computer-implemented method for creating a disfluency translation lattice includes providing a plurality of weighted finite state transducers including a translation model, a language model, and a phrase segmentation model as input, performing a cascaded composition of the weighted finite state transducers to create a disfluency translation lattice, and storing the disfluency translation lattice to a computer-readable media.

Type: Grant

Filed: August 19, 2006

Date of Patent: December 28, 2010

Assignee: International Business Machines Corporation

Inventors: Sameer Raj Maskey, Yuqing Gao, Bowen Zhou
Reranking QA answers using language modeling

Patent number: 7856350

Abstract: In a QA (Question/Answer) system, candidate answers in response to a question received are ranked by probabilities estimated by a language model. The language model is created based on an ordered centroid created from the question and information learned from an information source such as the Internet.

Type: Grant

Filed: August 11, 2006

Date of Patent: December 21, 2010

Assignee: Microsoft Corporation

Inventors: Ming Zhou, Yi Chen
Scalable encoding apparatus, scalable decoding apparatus, scalable encoding method, scalable decoding method, communication terminal apparatus, and base station apparatus

Patent number: 7848925

Abstract: A scalable encoding apparatus, a scalable decoding apparatus and the like are disclosed which can achieve a band scalable LSP encoding that exhibits both a high quantization efficiency and a high performance. In these apparatuses, a narrow band-to-wide band converter receives and converts a quantized narrow band LSP to a wide band, and then outputs the quantized narrow band LSP as converted (i.e., a converted wide band LSP parameter) to an LSP-to-LPC converter. The LSP-to-LPC converter converts the quantized narrow band LSP as converted to a linear prediction coefficient and then outputs it to a pre-emphasizer. The pre-emphasizer calculates and outputs the pre-emphasized linear prediction coefficient to an LPC-to-LSP converter. The LPC-to-LSP converter converts the pre-emphasized linear prediction coefficient to a pre-emphasized quantized narrow band LSP as wide band converted, and then outputs it to a prediction quantizer.

Type: Grant

Filed: September 15, 2005

Date of Patent: December 7, 2010

Assignee: Panasonic Corporation

Inventor: Hiroyuki Ehara
Method and apparatus for low bit rate encoding and decoding

Patent number: 7835907

Abstract: An apparatus and method of low bit rate encoding and reproducing. The method includes transforming input audio signals in a time domain into spectral signals in a frequency domain, extracting important-spectrum components from the spectral signals in the frequency domain, and quantizing the important-spectrum components, extracting residual-spectrum components other than the important-spectrum components from the spectral signals in the frequency domain, and calculating and quantizing a noise level of the residual-spectrum components, and encoding the quantized important-spectrum components and the quantized noise level losslessly, and outputting encoded bitstreams.

Type: Grant

Filed: December 21, 2005

Date of Patent: November 16, 2010

Assignee: Samsung Electronics Co., Ltd.

Inventors: Junghoe Kim, Eunmi Oh, Boris Kudryashov, Konstantin Osipov
Encoding method, apparatus and device and decoding method

Patent number: 7835906

Abstract: The present invention relates to encoding technology. The encoding method includes selecting a second encoding mode for encoding an input frame signal according to an analysis on signal characteristic of the input frame signal; obtaining coding demand values for a preset first encoding mode and the second encoding mode which are used to encode the input frame signal; determining, from the above encoding modes based on the coding demand values, an encoding mode for encoding the input frame signal; and multiplexing information of the determined encoding mode and encoded data which are encoded according to the determined encoding mode. Hence, the compatibility and the prioritization in terms of the encoding modes can be achieved.

Type: Grant

Filed: May 28, 2010

Date of Patent: November 16, 2010

Assignee: Huawei Technologies Co., Ltd.

Inventors: Lei Miao, Fengyan Qi, Qing Zhang

prev 1 2 3 4 5 6 … next