Patents Examined by Donald L. Storm

Accounting for non-monotonicity of quality as a function of quantization in quality and rate control for digital audio

Patent number: 7295971

Abstract: An audio encoder regulates quality and bitrate with a control strategy. The strategy includes several features. First, an encoder regulates quantization using quality, minimum bit count, and maximum bit count parameters. Second, an encoder regulates quantization using a noise measure that indicates reliability of a complexity measure. Third, an encoder normalizes a control parameter value according to block size for a variable-size block. Fourth, an encoder uses a bit-count control loop de-linked from a quality control loop. Fifth, an encoder addresses non-monotonicity of quality measurement as a function of quantization level when selecting a quantization level. Sixth, an encoder uses particular interpolation rules to find a quantization level in a quality or bit-count control loop. Seventh, an encoder filters a control parameter value to smooth quality. Eighth, an encoder corrects model bias by adjusting a control parameter value in view of current buffer fullness.

Type: Grant

Filed: November 14, 2006

Date of Patent: November 13, 2007

Assignee: Microsoft Corporation

Inventors: Wei-Ge Chen, Naveen Thumpudi, Ming-Chieh Lee
Digital recording and playback system with voice recognition capability for concurrent text generation

Patent number: 7295969

Abstract: A recording and playback system is provided. The system includes an audio capturing device configured to receive an analog input and an encoder coupled to the audio capturing device configured to generate a digital signal based on the analog input. The system further includes a recognition engine coupled to the audio capturing device and configured to generate text data based on the analog input, wherein the encoder and the recognition engine simultaneously generate the digital signal and the text data such that the digital signal and the text data can be provided in a synchronized manner.

Type: Grant

Filed: March 8, 2004

Date of Patent: November 13, 2007

Assignees: Sony Corporation, Sony Electronics, Inc.

Inventor: Takashi Nakatsuyama
Quality control quantization loop and bitrate control quantization loop for quality and rate control for digital audio

Patent number: 7295973

Abstract: An audio encoder regulates quality and bitrate with a control strategy. The strategy includes several features. First, an encoder regulates quantization using quality, minimum bit count, and maximum bit count parameters. Second, an encoder regulates quantization using a noise measure that indicates reliability of a complexity measure. Third, an encoder normalizes a control parameter value according to block size for a variable-size block. Fourth, an encoder uses a bit-count control loop de-linked from a quality control loop. Fifth, an encoder addresses non-monotonicity of quality measurement as a function of quantization level when selecting a quantization level. Sixth, an encoder uses particular interpolation rules to find a quantization level in a quality or bit-count control loop. Seventh, an encoder filters a control parameter value to smooth quality. Eighth, an encoder corrects model bias by adjusting a control parameter value in view of current buffer fullness.

Type: Grant

Filed: February 24, 2005

Date of Patent: November 13, 2007

Assignee: Microsoft Corporation

Inventors: Wei-Ge Chen, Ming-Chieh Lee, Naveen Thumpudi
Extended finite state grammar for speech recognition systems

Patent number: 7289950

Abstract: An extended finite state grammar structure is generated from a finite state grammar. The extended finite state grammar structure includes word subgraphs representing a set of pre-defined word strings for words in the finite state grammar, and a set of all possible word strings for the words. The extended finite state grammar structure can be used to transform audio input into one or more of the word strings.

Type: Grant

Filed: September 21, 2004

Date of Patent: October 30, 2007

Assignee: Apple Inc.

Inventors: Jerome R. Bellegarda, Kim E. A. Silverman
High-quality speech synthesis device and method by classification and prediction processing of synthesized sound

Patent number: 7283961

Abstract: There is disclosed a speech processing device in which prediction taps for finding prediction values of the speech of high sound quality are extracted from the synthesized sound obtained on affording linear prediction coefficients and residual signals, generated from a preset code, to a speech synthesis filter, speech of high sound quality being higher in sound quality than the synthesized sound, and in which the prediction taps are used along with preset tap coefficients to perform preset predictive calculations to find the prediction values of the speech of high sound quality. The speech of high sound quality is higher in sound quality than the synthesized sound.

Type: Grant

Filed: August 3, 2001

Date of Patent: October 16, 2007

Assignee: Sony Corporation

Inventors: Tetsujiro Kondo, Tsutomu Watanabe, Masaaki Hattori, Hiroto Kimura, Yasuhiro Fujimori
Correcting model bias during quality and rate control for digital audio

Patent number: 7283952

Abstract: An audio encoder regulates quality and bitrate with a control strategy. The strategy includes several features. First, an encoder regulates quantization using quality, minimum bit count, and maximum bit count parameters. Second, an encoder regulates quantization using a noise measure that indicates reliability of a complexity measure. Third, an encoder normalizes a control parameter value according to block size for a variable-size block. Fourth, an encoder uses a bit-count control loop de-linked from a quality control loop. Fifth, an encoder addresses non-monotonicity of quality measurement as a function of quantization level when selecting a quantization level. Sixth, an encoder uses particular interpolation rules to find a quantization level in a quality or bit-count control loop. Seventh, an encoder filters a control parameter value to smooth quality. Eighth, an encoder corrects model bias by adjusting a control parameter value in view of current buffer fullness.

Type: Grant

Filed: February 24, 2005

Date of Patent: October 16, 2007

Assignee: Microsoft Corporation

Inventors: Wei-Ge Chen, Naveen Thumpudi, Ming-Chieh Lee
Measuring and using reliability of complexity estimates during quality and rate control for digital audio

Patent number: 7277848

Abstract: An audio encoder regulates quality and bitrate with a control strategy. The strategy includes several features. First, an encoder regulates quantization using quality, minimum bit count, and maximum bit count parameters. Second, an encoder regulates quantization using a noise measure that indicates reliability of a complexity measure. Third, an encoder normalizes a control parameter value according to block size for a variable-size block. Fourth, an encoder uses a bit-count control loop de-linked from a quality control loop. Fifth, an encoder addresses non-monotonicity of quality measurement as a function of quantization level when selecting a quantization level. Sixth, an encoder uses particular interpolation rules to find a quantization level In a quality or bit-count control loop. Seventh, an encoder filters a control parameter value to smooth quality. Eighth, an encoder corrects model bias by adjusting a control parameter value in view of current buffer fullness.

Type: Grant

Filed: February 24, 2005

Date of Patent: October 2, 2007

Assignee: Microsoft Corporation

Inventors: Wei-Ge Chen, Naveen Thumpudi, Ming-Chieh Lee
Method and apparatus for multimodal communication with user control of delivery modality

Patent number: 7272564

Abstract: A method and apparatus for providing multimodal communication outputs information, such as retrieved content, in a first modality. An output modality change command is generated, such as via a multimodal user input interface, or other suitable mechanism. A multimodal communication apparatus and method then reprovides the previously output information as reprovided information in a different output modality in response to receiving the output modality change command. Accordingly, a user or unit may have content delivered in one modality and redelivered in a different preferred modality or modalities. Accordingly, a user or device may request output modalities dynamically so that content can be delivered using a different user preference after the content has already been provided in a first modality.

Type: Grant

Filed: March 22, 2002

Date of Patent: September 18, 2007

Assignee: Motorola, Inc.

Inventors: W. Garland Phillips, Dwight Randall Smith
Training machine learning by sequential conditional generalized iterative scaling

Patent number: 7266492

Abstract: A system and method facilitating training machine learning systems utilizing sequential conditional generalized iterative scaling is provided. The invention includes an expected value update component that modifies an expected value based, at least in part, upon a feature function of an input vector and an output value, a sum of lambda variable and a normalization variable. The invention further includes an error calculator that calculates an error based, at least in part, upon the expected value and an observed value. The invention also includes a parameter update component that modifies a trainable parameter based, at least in part, upon the error. A variable update component that updates at least one of the sum of lambda variable and the normalization variable based, at least in part, upon the error is also provided.

Type: Grant

Filed: August 16, 2006

Date of Patent: September 4, 2007

Assignee: Microsoft Corporation

Inventor: Joshua Theodore Goodman
Accounting for non-monotonicity of quality as a function of quantization in quality and rate control for digital audio

Patent number: 7263482

Abstract: An audio encoder regulates quality and bitrate with a control strategy. The strategy includes several features. For example, an encoder selects a quantization level within a range of quantization levels, where the selecting accounts for non-monotonicity of quality measure as a function of quantization level within the range. The encoder then quantizes audio information by the quantization level. Or, an encoder determines first and second quality measures associated with a first and second quantization levels, respectively, then determines a third quantization level within a quantization level range based upon location of a target quality on a trajectory of quality measure as a function of quantization level. The first and second quantization levels define endpoints of the quantization level range, and the first and second quality measures define endpoints of the trajectory. The function relates logarithm of quality measure in proportion to inverse logarithm of quantization level.

Type: Grant

Filed: February 24, 2005

Date of Patent: August 28, 2007

Assignee: Microsoft Corporation

Inventors: Wei-Ge Chen, Naveen Thumpudi, Ming-Chieh Lee
Generating a task-adapted acoustic model from one or more different corpora

Patent number: 7263487

Abstract: The present invention generates a task-dependent acoustic model from a supervised task-independent corpus and further adapted it with an unsupervised task dependent corpus. The task-independent corpus includes task-independent training data which has an acoustic representation of words and a sequence of transcribed words corresponding to the acoustic representation. A relevance measure is defined for each of the words in the task-independent data. The relevance measure is used to weight the data associated with each of the words in the task-independent training data. The task-dependent acoustic model is then trained based on the weighted data for the words in the task-independent training data.

Type: Grant

Filed: September 29, 2005

Date of Patent: August 28, 2007

Assignee: Microsoft Corporation

Inventor: Mei Yuh Hwang
Filtering of control parameters in quality and rate control for digital audio

Patent number: 7260525

Abstract: An audio encoder regulates quality and bitrate with a control strategy. The strategy includes several features. First, an encoder regulates quantization using quality, minimum bit count, and maximum bit count parameters. Second, an encoder regulates quantization using a noise measure that indicates reliability of a complexity measure. Third, an encoder normalizes a control parameter value according to block size for a variable-size block. Fourth, an encoder uses a bit-count control loop de-linked from a quality control loop. Fifth, an encoder addresses non-monotonicity of quality measurement as a function of quantization level when selecting a quantization level. Sixth, an encoder uses particular interpolation rules to find a quantization level in a quality or bit-count control loop. Seventh, an encoder filters a control parameter value to smooth quality. Eighth, an encoder corrects model bias by adjusting a control parameter value in view of current buffer fullness.

Type: Grant

Filed: February 24, 2005

Date of Patent: August 21, 2007

Assignee: Microsoft Corporation

Inventors: Wei-Ge Chen, Ming-Chieh Lee, Naveen Thumpudi
Database searching and retrieval using phoneme and word lattice

Patent number: 7257533

Abstract: A data structure is provided for annotating data files within a database. The annotation data comprises a phoneme and word lattice which allows the quick and efficient searching of data files within the database in response to a user's input query. The structure of the annotation data is such that it allows the input query to be made by voice and can be used for annotating various kinds of data files, such as audio data files, video data files, multimedia data files etc. The annotation data may be generated from the data files themselves or may be input by the user either from a voiced input or from a typed input.

Type: Grant

Filed: September 22, 2005

Date of Patent: August 14, 2007

Assignee: Canon Kabushiki Kaisha

Inventors: Jason Peter Andrew Charlesworth, Philip Neil Garner, Jebu Jacob Rajan
Method and device for analyzing a wave signal and method and apparatus for pitch detection

Patent number: 7251596

Abstract: The present invention provides a unique wave-trigon transformation (WTT) method for performing transformation process over a wave signal. The present invention also provides a pitch detecting method and apparatus for detecting pitch based on the WTT process as well as a sentence detecting method and apparatus for detecting a sentence in a sound signal based on the WTT process. The pitch detecting method and apparatus can effectively detect pitch in a sound signal. In the WTT process, an inputted wave signal (such as a sound signal) is transformed into a series of trigons, and an energy-width spectrum is formed using these trigons. For a sound signal containing voice, the distribution of trigons transformed from the sound signal has a certain pattern. By analyzing the pattern, whether a pitch is contained in the sound signal can be determined. In particular, existence of a pitch can be determined by determining and evaluating the periodicity of trigons in a candidate chained peak in the energy-width spectrum.

Type: Grant

Filed: December 23, 2002

Date of Patent: July 31, 2007

Assignee: Canon Kabushiki Kaisha

Inventors: Lianshan Zhu, Tao Yu
Quantization matrices using normalized-block pattern of digital audio

Patent number: 7249016

Abstract: Quantization matrices facilitate digital audio encoding and decoding. An audio encoder generates and compresses quantization matrices; an audio decoder decompresses and applies the quantization matrices. The invention includes several techniques and tools, which can be used in combination or separately. For example, the audio encoder can generate quantization matrices from critical band patterns for blocks of audio data. The encoder can compute the quantization matrices directly from the critical band patterns, which can be computed from the same audio data that is being compressed. The audio encoder/decoder can use different modes for generating/applying quantization matrices depending on the coding channel mode of multi-channel audio data. The audio encoder/decoder can use different compression/decompression modes for the quantization matrices, including a parametric compression/decompression mode.

Type: Grant

Filed: February 17, 2005

Date of Patent: July 24, 2007

Assignee: Microsoft Corporation

Inventors: Wei-Ge Chen, Naveen Thumpudi, Ming-Chieh Lee
Controlling the listening horizon of an automatic speech recognition system for use in handsfree conversational dialogue

Patent number: 7240011

Abstract: Conversational dialog with a computer or other processor-based device without requiring push-to-talk functionality. In one embodiment, a computer-implemented method first determines that a user desires to engage in a dialog. Based thereon the method turns on a speech recognition functionality for a period of time referred to as a listening horizon. Upon the listening horizon expiring, the method turns off the speech recognition functionality.

Type: Grant

Filed: October 24, 2005

Date of Patent: July 3, 2007

Assignee: Microsoft Corporation

Inventor: Eric J. Horvitz
Synonyms mechanism for natural language systems

Patent number: 7231343

Abstract: Roughly described, a natural language interface to a back-end application incorporates synonyms automatically added to user input to enhance the natural language interpretation. Synonyms can be learned from user input and written into a synonyms database. Their selection can be based on tokens identified in user input. Natural language interpretation can be performed by agents arranged in a network, which parse the user input in a distributed manner. In an embodiment, a particular agent of the natural language interpreter receives a first message that includes the user input, returns a message claiming at least a portion of the user input, and subsequently receives a second message delegating actuation of at least that portion to the particular agent.

Type: Grant

Filed: December 20, 2002

Date of Patent: June 12, 2007

Assignee: iAnywhere Solutions, Inc.

Inventors: Nicholas K. Treadgold, Babak Hodjat
Method, system, and computer program product in a logically partitioned data processing system for displaying messages in a management console's native language

Patent number: 7231342

Abstract: A method, system, and computer program product within a logically partitioned data processing system that includes multiple partitions and a management console are described for displaying messages in a language specified by the console. A language is specified by the management console. The specification is transmitted to each one of the partitions. A message is generated within one of the partitions. The partition then utilizes the specification to select a translation of the message into the language specified by the management console. The translation is then transmitted from the partition to the management console for display by the management console.

Type: Grant

Filed: January 9, 2003

Date of Patent: June 12, 2007

Assignee: International Business Machines Corporation

Inventors: Mark Steven Edwards, Ya-Huey Juan, Truc Duy Nguyen
Emphasis of short-duration transient speech features

Patent number: 7219065

Abstract: A sound processor including a microphone (1), a pre-amplifier (2), a bank of N parallel filters (3), means for detecting short-duration transitions in the envelope signal of each filter channel, and means for applying gain to the outputs of these filter channels in which the gain is related to a function of the second-order derivative of the slow-varying envelope signal in each filter channel, to assist in perception of low-intensity short-duration speech features in said signal.

Type: Grant

Filed: October 25, 2000

Date of Patent: May 15, 2007

Inventors: Andrew E. Vandali, Graeme M. Clark
Method and system for preventing error amplification in natural language dialogues

Patent number: 7194409

Abstract: A method and system for allowing a user to interface to an interactive voice response system via natural language commands. The system plays a prompt that initiates user interaction. In certain embodiments, the system detects initial user speech, wherein the initial user speech begins during the prompt or during a silence after the prompt. Then, the system determines whether the user speech restarts (second user speech) within a predetermined time period, wherein the predetermined time period is dependent upon whether the initial user speech began during the prompt or during the silence. If the user speech does restart, then the system uses the second user speech for recognition purposes. If the user speech does not restart, then the system uses the initial user speech for recognition purposes.

Type: Grant

Filed: November 30, 2001

Date of Patent: March 20, 2007

Inventors: Bruce Balentine, Rex Stringham, Ralph Melaragno, Justin Munroe

prev 1 2 3 4 5 6 … next