Patents Examined by Talivaldis Ivars Smit

Load balancing based upon speech processing specific factors

Patent number: 7953603

Abstract: A machine readable storage can include a set of instructions for load balancing. The storage can include a plug-in receptor of a load balancer. The plug-in receptor can be compliant with a known industry standard and can be is associated with a two or more load balancing algorithms. The load balancer can utilize selected ones of the load balancing algorithms to determine which of two or more voice servers are to handle incoming speech processing requests. Selected ones of the load balancing algorithms can include a speech utilization algorithm. The speech utilization algorithm can calculate a speech utilization score for at least one of the voice servers based upon speech processing specific factors.

Type: Grant

Filed: December 21, 2005

Date of Patent: May 31, 2011

Assignee: International Business Machines Corporation

Inventors: Mario E. De Armas, Matthew W. Hartley, Joseph I. Herman, Wendi L. Nusbickel, Geetika Tandon
Dual-transform coding of audio signals

Patent number: 7953595

Abstract: Methods, devices, and systems for coding and decoding audio are disclosed. At least two transforms are applied on an audio signal, each with different transform periods for better resolutions at both low and high frequencies. The transform coefficients are selected and combined such that the data rate remains similar as a single transform. The transform coefficients may be coded with a fast lattice vector quantizer. The quantizer has a high rate quantizer and a low rate quantizer. The high rate quantizer includes a scheme to truncate the lattice. The low rate quantizer includes a table based searching method. The low rate quantizer may also include a table based indexing scheme. The high rate quantizer may further include Huffman coding for the quantization indices of transform coefficients to improve the quantizing/coding efficiency.

Type: Grant

Filed: October 18, 2006

Date of Patent: May 31, 2011

Assignee: Polycom, Inc.

Inventors: Minjie Xie, Peter Chu
Method and apparatus for audio encoding and decoding using wideband psychoacoustic modeling and bandwidth extension

Patent number: 7953605

Abstract: A novel bandwidth extension technique allows information to be encoded and decoded using a fractal self similarity model or an accurate spectral replacement model, or both. Also a multi-band temporal amplitude coding technique, useful as an enhancement to any coding/decoding technique, helps with accurate reconstruction of the temporal envelope and employs a utility filterbank. A perceptual coder using a comodulation masking release model, operating typically with more conventional perceptual coders, makes the perceptual model more accurate and hence increases the efficiency of the overall perceptual coder.

Type: Grant

Filed: October 6, 2006

Date of Patent: May 31, 2011

Inventors: Deepen Sinha, Anibal J. S. Ferreira, Erumbi Vallabhan Harinarayanan
Handheld electronic device and method employing logical proximity of characters in spell checking

Patent number: 7949516

Abstract: An improved handheld electronic device and associated method employing an improved spell checking routine enable proposed spelling corrections having a close logical proximity to an active input to be output at a position of preference for easy selection by the user. By way of example, a base character and the various accented forms thereof can be said to have a logical proximity to one another that is closer than their logical proximity to any character having a different base character, whether additionally having a diacritical element or not.

Type: Grant

Filed: August 31, 2007

Date of Patent: May 24, 2011

Assignee: Research In Motion Limited

Inventors: Vadim Fux, Michael Elizarov, Sergey V. Kolomiets
System and method for recognizing speech securely using a secure multi-party computation protocol

Patent number: 7937270

Abstract: A system and method recognizes speech securely using a secure multi-party computation protocol. The system includes a client and a server. The client is configured to provide securely speech in a form of an observation sequence of symbols, and the server is configured to provide securely a multiple trained hidden Markov models (HMMs), each trained HMM including a multiple states, a state transition probability distribution and an initial state distribution, and each state including a subset of the observation symbols and an observation symbol probability distribution. The observation symbol probability distributions are modeled by mixtures of Gaussian distributions. Also included are means for determining securely, for each HMM, a likelihood the observation sequence is produced by the states of the HMM, and means for determining a particular symbol with a maximum likelihood of a particular subset of the symbols corresponding to the speech.

Type: Grant

Filed: January 16, 2007

Date of Patent: May 3, 2011

Assignee: Mitsubishi Electric Research Laboratories, Inc.

Inventors: Paris Smaragdis, Madhusudana Shashanka
Natural language processing of disfluent sentences

Patent number: 7930168

Abstract: An advanced model that includes new processes is provided for use as a component of an effective disfluency identifier. The disfluency identifier tags edited words in transcribed speech. A speech recognition unit in combination with a part-of-speech tagger, a disfluency identifier, and a parser form a natural language system that helps machines properly interpret spoken utterances.

Type: Grant

Filed: October 4, 2005

Date of Patent: April 19, 2011

Assignee: Robert Bosch GmbH

Inventors: Fuliang Weng, Qi Zhang
Automatic identification of dialog timing problems for an interactive speech dialog application using speech log data indicative of cases of barge-in and timing problems

Patent number: 7930183

Abstract: A method of analyzing dialog between a user and an interactive application having dialog turns is provided. The method includes accessing information indicative of a plurality of dialog turns between the application and at least one user and identifying instances where the application determined a response was received before an associated prompt had completed. The accessed information includes information related to operation of the application with a first grammar to recognize the response. The method includes identifying whether the response was received in a particular limited time period from when the associated prompt began. If the response was received in the limited time period, the method determines whether the response included one or more terms from the associated prompt by performing recognition on the response using a second grammar having more information related to grammar of a language than the first grammar.

Type: Grant

Filed: March 29, 2006

Date of Patent: April 19, 2011

Assignee: Microsoft Corporation

Inventors: Julian J. Odell, Stephen F. Potter
Apparatus and method of encoding and decoding audio signals using hierarchical block switching and linear prediction coding

Patent number: 7930177

Abstract: In one embodiment, the method includes receiving an audio data frame having at least one channel. The channel is subdivided into a plurality of blocks, and at least two of the blocks are capable of different lengths. The embodiment further includes obtaining information from the audio signal indicating the subdivision of the channel into the blocks, and decoding the channel based on the obtained information. In one embodiment, an optimum prediction order is obtained for each block in the channel, where a prediction order indicates a number of prediction coefficients. The optimum prediction order indicates a minimum one of a global prediction order and a local prediction order. The global prediction order is determined based on a maximum permitted prediction order, and the local prediction order is determined based on a length of the block.

Type: Grant

Filed: September 24, 2008

Date of Patent: April 19, 2011

Assignee: LG Electronics Inc.

Inventor: Tilman Liebchen
System and method for distributing multilingual documents

Patent number: 7925495

Abstract: A method and apparatus is disclosed for generating and distributing multilingual documents. The multilingual documents are comprised of primary information consisting of human-readable text and secondary information consisting of machine-readable data such that a translation of the text is accomplished by converting the human-readable text into a second language through the use of the decoded machine-readable data. The machine-readable data is comprised of a code that describes a set of editing operations that can be applied to the human-readable text to convert it into at least a second language. In a preferred embodiment, the machine-readable data is embedded in the image using an unobtrusive code on the document such as Xerox DATAGLYPH codes.

Type: Grant

Filed: February 11, 2009

Date of Patent: April 12, 2011

Assignee: Xerox Corporation

Inventors: David L. Hecht, Glen W. Petrie, Ronald M. Kaplan, Colin Luckman
Waveform interpolation speech coding apparatus and method for reducing complexity thereof

Patent number: 7899667

Abstract: A waveform interpolation speech coding apparatus and method for reducing complexity thereof are disclosed. The waveform interpolation speech coding apparatus includes: a waveform interpolation encoding unit for receiving a speech signal, calculating parameters for a waveform interpolation from the received speech signal, and quantizing the calculating parameters; and a realignment parameter calculating unit for restoring a characteristic waveform (CW) using the quantized parameter, calculating a realignment parameter that maximizes a cross-correlation among consecutive CWs for the restored CW.

Type: Grant

Filed: December 19, 2006

Date of Patent: March 1, 2011

Assignee: Electronics and Telecommunications Research Institute

Inventors: Kyung-Jin Byun, Ik-Soo Eo, Hee-Bum Jung, Nak-Woong Eum
Joint training of feature extraction and acoustic model parameters for speech recognition

Patent number: 7885812

Abstract: Parameters for a feature extractor and acoustic model of a speech recognition module are trained. An objective function is utilized to determine values for the feature extractor parameters and the acoustic model parameters.

Type: Grant

Filed: November 15, 2006

Date of Patent: February 8, 2011

Assignee: Microsoft Corporation

Inventors: Alejandro Acero, James G. Droppo, Milind V. Mahajan
Speech quality assessment method and system

Patent number: 7856355

Abstract: In one embodiment, distortion in a received speech signal is estimated using at least one model trained based on subjective quality assessment data. A speech quality assessment for the received speech signal is then determined based on the estimated distortion.

Type: Grant

Filed: July 5, 2005

Date of Patent: December 21, 2010

Assignee: Alcatel-Lucent USA Inc.

Inventor: Doh-Suk Kim
Method for reducing decoder complexity in waveform interpolation speech decoding by converting dimension of vector

Patent number: 7848923

Abstract: Provided is a method for converting a dimension of a vector. The vector dimension conversion method for vector quantization includes the steps of: extracting a specific parameter having a pitch period from an input speech signal and then generating a vector of a dimension that varies according to the pitch period; dividing an entire frequency domain of the generated vector of the variable dimension into at least two frequency domains; and converting the vector of the variable dimension into vectors of mutually different fixed dimensions according to the divided frequency domains. Thereby, not only an error due to the vector dimension conversion is suppressed but codebook memory required for the vector quantization is effectively reduced.

Type: Grant

Filed: April 24, 2006

Date of Patent: December 7, 2010

Assignee: Electronics and Telecommunications Research Institute

Inventors: Kyung Jin Byun, Ik Soo Eo, Hee Bum Jung
Content-based audio playback emphasis

Patent number: 7844464

Abstract: Techniques are disclosed for facilitating the process of proofreading draft transcripts of spoken audio streams. In general, proofreading of a draft transcript is facilitated by playing back the corresponding spoken audio stream with an emphasis on those regions in the audio stream that are highly relevant or likely to have been transcribed incorrectly. Regions may be emphasized by, for example, playing them back more slowly than regions that are of low relevance and likely to have been transcribed correctly. Emphasizing those regions of the audio stream that are most important to transcribe correctly and those regions that are most likely to have been transcribed incorrectly increases the likelihood that the proofreader will accurately correct any errors in those regions, thereby improving the overall accuracy of the transcript.

Type: Grant

Filed: July 22, 2005

Date of Patent: November 30, 2010

Assignee: Multimodal Technologies, Inc.

Inventors: Kjell Schubert, Juergen Fritsch, Michael Finke, Detlef Koll
Simplifying query terms with transliteration

Patent number: 7835903

Abstract: Methods, systems, and apparatus, including computer program products, operable to perform operations including receiving from a user a search query; and receiving an indication of a user preference to apply transliteration in simplifying the query terms of the search query. Alternatively, the operations include receiving from a user a search query of query terms; applying transliteration in simplifying the query terms; and using the simplified query terms to identify synonyms to use in augmenting the search query. Alternatively, the operations include receiving from a user a search query; identifying the user interface language as a small language or not a small language; simplifying each query term to a simplified form; and if the user interface language is a small language, for each original query term that has a simplified form different from the original term, using the original query term as-is and not providing any synonyms for the query term.

Type: Grant

Filed: April 19, 2006

Date of Patent: November 16, 2010

Assignee: Google Inc.

Inventor: Ruchira S. Datta
Method and apparatus for normalizing voice feature vector by backward cumulative histogram

Patent number: 7835909

Abstract: A method and apparatus for normalizing a histogram utilizing a backward cumulative histogram which can cumulate a probability distribution function in an order from a greatest to smallest value so as to estimate a noise robust histogram. A method of normalizing a speech feature vector includes: extracting the speech feature vector from a speech signal; calculating a probability distribution function using the extracted speech feature vector; calculating a backward cumulative distribution function by cumulating the probability distribution function in an order from a largest to smallest value; and normalizing a histogram using the backward cumulative distribution function.

Type: Grant

Filed: December 12, 2006

Date of Patent: November 16, 2010

Assignee: Samsung Electronics Co., Ltd.

Inventors: So-Young Jeong, Gil Jin Jang, Kwang Cheol Oh
Efficient capitalization through user modeling

Patent number: 7827025

Abstract: A method of automatically capitalizing text utilizes a capitalization model. The capitalization model is trained from data that is taken from documents associated with a particular user. In particular, documents that are authored by the user such as e-mails, are used to train the model.

Type: Grant

Filed: April 6, 2004

Date of Patent: November 2, 2010

Assignee: Microsoft Corporation

Inventors: Peter K. L. Mau, Dong Yu
Computer application environment and communication system employing automatic identification of human conversational behavior

Patent number: 7822607

Abstract: The disclosed technology is a computer application that establishes communication between a conversation-finder module and a computer application environment. The conversation-finder module determines a conversational floor based on three or more floor determination inputs. The conversational floor associates at least two of the three or more floor determination inputs as being on the conversational floor. The conversation-finder module and the computer application environment can be responsive to each other and can adapt to each other. That is (either or both), the computer application environment can adapt to the conversational floor(s) determined by the conversation-finder module; and the conversation-finder module can determine the conversational floors responsive to a floor determination input and/or control input from the computer application environment.

Type: Grant

Filed: April 10, 2006

Date of Patent: October 26, 2010

Assignee: Palo Alto Research Center Incorporated

Inventors: Paul M Aoki, Margaret H Szymanski, James D Thornton, Allison G Woodruff, Nicolas B Ducheneaut, Robert J Moore
Method of measuring degree of enhancement to voice signal

Patent number: 7818168

Abstract: A method of measuring the degree of enhancement made to a voice signal by receiving the voice signal, identifying formant regions in the voice signal, computing stationarity for each identified formant region, enhancing the voice signal, identifying formant regions in the enhanced voice signal that correspond to those identified in the received voice signal, computing stationarity for each formant region identified in the enhanced voice signal, comparing corresponding stationarity results for the received and enhanced voice signals, and calculating at least one user-definable statistic of the comparison results as the degree of enhancement made to the received voice signal.

Type: Grant

Filed: December 1, 2006

Date of Patent: October 19, 2010

Assignee: The United States of America as represented by the Director, National Security Agency

Inventor: Adolf Cusmariu
Graph-based ranking algorithms for text processing

Patent number: 7809548

Abstract: The present invention provides a method of processing at least one natural language text using a graph. The method includes determining a plurality of text units based upon the natural language text, associating the plurality of text units with a plurality of graph nodes, and determining at least one connecting relation between at least two of the plurality of text units. The method also includes associating the at least one connecting relation with at least one graph edge connecting at least two of the plurality of graph nodes and determining a plurality of rankings associated with the plurality of graph nodes based upon the at least one graph edge. The method can also include a graphical visualization of at least one important text unit in a natural language text or collection of texts. Methods for word sense disambiguation, keyword extraction, and sentence extraction are also provided.

Type: Grant

Filed: March 9, 2005

Date of Patent: October 5, 2010

Assignee: University of North Texas

Inventors: Rada Mihalcea, Paul Tarau

prev … 10 11 12 13 14 15 16 17 18 … next