Patents Examined by Donald L. Storm
  • Patent number: 7295971
    Abstract: An audio encoder regulates quality and bitrate with a control strategy. The strategy includes several features. First, an encoder regulates quantization using quality, minimum bit count, and maximum bit count parameters. Second, an encoder regulates quantization using a noise measure that indicates reliability of a complexity measure. Third, an encoder normalizes a control parameter value according to block size for a variable-size block. Fourth, an encoder uses a bit-count control loop de-linked from a quality control loop. Fifth, an encoder addresses non-monotonicity of quality measurement as a function of quantization level when selecting a quantization level. Sixth, an encoder uses particular interpolation rules to find a quantization level in a quality or bit-count control loop. Seventh, an encoder filters a control parameter value to smooth quality. Eighth, an encoder corrects model bias by adjusting a control parameter value in view of current buffer fullness.
    Type: Grant
    Filed: November 14, 2006
    Date of Patent: November 13, 2007
    Assignee: Microsoft Corporation
    Inventors: Wei-Ge Chen, Naveen Thumpudi, Ming-Chieh Lee
  • Patent number: 7295969
    Abstract: A recording and playback system is provided. The system includes an audio capturing device configured to receive an analog input and an encoder coupled to the audio capturing device configured to generate a digital signal based on the analog input. The system further includes a recognition engine coupled to the audio capturing device and configured to generate text data based on the analog input, wherein the encoder and the recognition engine simultaneously generate the digital signal and the text data such that the digital signal and the text data can be provided in a synchronized manner.
    Type: Grant
    Filed: March 8, 2004
    Date of Patent: November 13, 2007
    Assignees: Sony Corporation, Sony Electronics, Inc.
    Inventor: Takashi Nakatsuyama
  • Patent number: 7295973
    Abstract: An audio encoder regulates quality and bitrate with a control strategy. The strategy includes several features. First, an encoder regulates quantization using quality, minimum bit count, and maximum bit count parameters. Second, an encoder regulates quantization using a noise measure that indicates reliability of a complexity measure. Third, an encoder normalizes a control parameter value according to block size for a variable-size block. Fourth, an encoder uses a bit-count control loop de-linked from a quality control loop. Fifth, an encoder addresses non-monotonicity of quality measurement as a function of quantization level when selecting a quantization level. Sixth, an encoder uses particular interpolation rules to find a quantization level in a quality or bit-count control loop. Seventh, an encoder filters a control parameter value to smooth quality. Eighth, an encoder corrects model bias by adjusting a control parameter value in view of current buffer fullness.
    Type: Grant
    Filed: February 24, 2005
    Date of Patent: November 13, 2007
    Assignee: Microsoft Corporation
    Inventors: Wei-Ge Chen, Ming-Chieh Lee, Naveen Thumpudi
  • Patent number: 7289950
    Abstract: An extended finite state grammar structure is generated from a finite state grammar. The extended finite state grammar structure includes word subgraphs representing a set of pre-defined word strings for words in the finite state grammar, and a set of all possible word strings for the words. The extended finite state grammar structure can be used to transform audio input into one or more of the word strings.
    Type: Grant
    Filed: September 21, 2004
    Date of Patent: October 30, 2007
    Assignee: Apple Inc.
    Inventors: Jerome R. Bellegarda, Kim E. A. Silverman
  • Patent number: 7283961
    Abstract: There is disclosed a speech processing device in which prediction taps for finding prediction values of the speech of high sound quality are extracted from the synthesized sound obtained on affording linear prediction coefficients and residual signals, generated from a preset code, to a speech synthesis filter, speech of high sound quality being higher in sound quality than the synthesized sound, and in which the prediction taps are used along with preset tap coefficients to perform preset predictive calculations to find the prediction values of the speech of high sound quality. The speech of high sound quality is higher in sound quality than the synthesized sound.
    Type: Grant
    Filed: August 3, 2001
    Date of Patent: October 16, 2007
    Assignee: Sony Corporation
    Inventors: Tetsujiro Kondo, Tsutomu Watanabe, Masaaki Hattori, Hiroto Kimura, Yasuhiro Fujimori
  • Patent number: 7283952
    Abstract: An audio encoder regulates quality and bitrate with a control strategy. The strategy includes several features. First, an encoder regulates quantization using quality, minimum bit count, and maximum bit count parameters. Second, an encoder regulates quantization using a noise measure that indicates reliability of a complexity measure. Third, an encoder normalizes a control parameter value according to block size for a variable-size block. Fourth, an encoder uses a bit-count control loop de-linked from a quality control loop. Fifth, an encoder addresses non-monotonicity of quality measurement as a function of quantization level when selecting a quantization level. Sixth, an encoder uses particular interpolation rules to find a quantization level in a quality or bit-count control loop. Seventh, an encoder filters a control parameter value to smooth quality. Eighth, an encoder corrects model bias by adjusting a control parameter value in view of current buffer fullness.
    Type: Grant
    Filed: February 24, 2005
    Date of Patent: October 16, 2007
    Assignee: Microsoft Corporation
    Inventors: Wei-Ge Chen, Naveen Thumpudi, Ming-Chieh Lee
  • Patent number: 7277848
    Abstract: An audio encoder regulates quality and bitrate with a control strategy. The strategy includes several features. First, an encoder regulates quantization using quality, minimum bit count, and maximum bit count parameters. Second, an encoder regulates quantization using a noise measure that indicates reliability of a complexity measure. Third, an encoder normalizes a control parameter value according to block size for a variable-size block. Fourth, an encoder uses a bit-count control loop de-linked from a quality control loop. Fifth, an encoder addresses non-monotonicity of quality measurement as a function of quantization level when selecting a quantization level. Sixth, an encoder uses particular interpolation rules to find a quantization level In a quality or bit-count control loop. Seventh, an encoder filters a control parameter value to smooth quality. Eighth, an encoder corrects model bias by adjusting a control parameter value in view of current buffer fullness.
    Type: Grant
    Filed: February 24, 2005
    Date of Patent: October 2, 2007
    Assignee: Microsoft Corporation
    Inventors: Wei-Ge Chen, Naveen Thumpudi, Ming-Chieh Lee
  • Patent number: 7272564
    Abstract: A method and apparatus for providing multimodal communication outputs information, such as retrieved content, in a first modality. An output modality change command is generated, such as via a multimodal user input interface, or other suitable mechanism. A multimodal communication apparatus and method then reprovides the previously output information as reprovided information in a different output modality in response to receiving the output modality change command. Accordingly, a user or unit may have content delivered in one modality and redelivered in a different preferred modality or modalities. Accordingly, a user or device may request output modalities dynamically so that content can be delivered using a different user preference after the content has already been provided in a first modality.
    Type: Grant
    Filed: March 22, 2002
    Date of Patent: September 18, 2007
    Assignee: Motorola, Inc.
    Inventors: W. Garland Phillips, Dwight Randall Smith
  • Patent number: 7266492
    Abstract: A system and method facilitating training machine learning systems utilizing sequential conditional generalized iterative scaling is provided. The invention includes an expected value update component that modifies an expected value based, at least in part, upon a feature function of an input vector and an output value, a sum of lambda variable and a normalization variable. The invention further includes an error calculator that calculates an error based, at least in part, upon the expected value and an observed value. The invention also includes a parameter update component that modifies a trainable parameter based, at least in part, upon the error. A variable update component that updates at least one of the sum of lambda variable and the normalization variable based, at least in part, upon the error is also provided.
    Type: Grant
    Filed: August 16, 2006
    Date of Patent: September 4, 2007
    Assignee: Microsoft Corporation
    Inventor: Joshua Theodore Goodman
  • Patent number: 7263482
    Abstract: An audio encoder regulates quality and bitrate with a control strategy. The strategy includes several features. For example, an encoder selects a quantization level within a range of quantization levels, where the selecting accounts for non-monotonicity of quality measure as a function of quantization level within the range. The encoder then quantizes audio information by the quantization level. Or, an encoder determines first and second quality measures associated with a first and second quantization levels, respectively, then determines a third quantization level within a quantization level range based upon location of a target quality on a trajectory of quality measure as a function of quantization level. The first and second quantization levels define endpoints of the quantization level range, and the first and second quality measures define endpoints of the trajectory. The function relates logarithm of quality measure in proportion to inverse logarithm of quantization level.
    Type: Grant
    Filed: February 24, 2005
    Date of Patent: August 28, 2007
    Assignee: Microsoft Corporation
    Inventors: Wei-Ge Chen, Naveen Thumpudi, Ming-Chieh Lee
  • Patent number: 7263487
    Abstract: The present invention generates a task-dependent acoustic model from a supervised task-independent corpus and further adapted it with an unsupervised task dependent corpus. The task-independent corpus includes task-independent training data which has an acoustic representation of words and a sequence of transcribed words corresponding to the acoustic representation. A relevance measure is defined for each of the words in the task-independent data. The relevance measure is used to weight the data associated with each of the words in the task-independent training data. The task-dependent acoustic model is then trained based on the weighted data for the words in the task-independent training data.
    Type: Grant
    Filed: September 29, 2005
    Date of Patent: August 28, 2007
    Assignee: Microsoft Corporation
    Inventor: Mei Yuh Hwang
  • Patent number: 7260525
    Abstract: An audio encoder regulates quality and bitrate with a control strategy. The strategy includes several features. First, an encoder regulates quantization using quality, minimum bit count, and maximum bit count parameters. Second, an encoder regulates quantization using a noise measure that indicates reliability of a complexity measure. Third, an encoder normalizes a control parameter value according to block size for a variable-size block. Fourth, an encoder uses a bit-count control loop de-linked from a quality control loop. Fifth, an encoder addresses non-monotonicity of quality measurement as a function of quantization level when selecting a quantization level. Sixth, an encoder uses particular interpolation rules to find a quantization level in a quality or bit-count control loop. Seventh, an encoder filters a control parameter value to smooth quality. Eighth, an encoder corrects model bias by adjusting a control parameter value in view of current buffer fullness.
    Type: Grant
    Filed: February 24, 2005
    Date of Patent: August 21, 2007
    Assignee: Microsoft Corporation
    Inventors: Wei-Ge Chen, Ming-Chieh Lee, Naveen Thumpudi
  • Patent number: 7257533
    Abstract: A data structure is provided for annotating data files within a database. The annotation data comprises a phoneme and word lattice which allows the quick and efficient searching of data files within the database in response to a user's input query. The structure of the annotation data is such that it allows the input query to be made by voice and can be used for annotating various kinds of data files, such as audio data files, video data files, multimedia data files etc. The annotation data may be generated from the data files themselves or may be input by the user either from a voiced input or from a typed input.
    Type: Grant
    Filed: September 22, 2005
    Date of Patent: August 14, 2007
    Assignee: Canon Kabushiki Kaisha
    Inventors: Jason Peter Andrew Charlesworth, Philip Neil Garner, Jebu Jacob Rajan
  • Patent number: 7251596
    Abstract: The present invention provides a unique wave-trigon transformation (WTT) method for performing transformation process over a wave signal. The present invention also provides a pitch detecting method and apparatus for detecting pitch based on the WTT process as well as a sentence detecting method and apparatus for detecting a sentence in a sound signal based on the WTT process. The pitch detecting method and apparatus can effectively detect pitch in a sound signal. In the WTT process, an inputted wave signal (such as a sound signal) is transformed into a series of trigons, and an energy-width spectrum is formed using these trigons. For a sound signal containing voice, the distribution of trigons transformed from the sound signal has a certain pattern. By analyzing the pattern, whether a pitch is contained in the sound signal can be determined. In particular, existence of a pitch can be determined by determining and evaluating the periodicity of trigons in a candidate chained peak in the energy-width spectrum.
    Type: Grant
    Filed: December 23, 2002
    Date of Patent: July 31, 2007
    Assignee: Canon Kabushiki Kaisha
    Inventors: Lianshan Zhu, Tao Yu
  • Patent number: 7249016
    Abstract: Quantization matrices facilitate digital audio encoding and decoding. An audio encoder generates and compresses quantization matrices; an audio decoder decompresses and applies the quantization matrices. The invention includes several techniques and tools, which can be used in combination or separately. For example, the audio encoder can generate quantization matrices from critical band patterns for blocks of audio data. The encoder can compute the quantization matrices directly from the critical band patterns, which can be computed from the same audio data that is being compressed. The audio encoder/decoder can use different modes for generating/applying quantization matrices depending on the coding channel mode of multi-channel audio data. The audio encoder/decoder can use different compression/decompression modes for the quantization matrices, including a parametric compression/decompression mode.
    Type: Grant
    Filed: February 17, 2005
    Date of Patent: July 24, 2007
    Assignee: Microsoft Corporation
    Inventors: Wei-Ge Chen, Naveen Thumpudi, Ming-Chieh Lee
  • Patent number: 7240011
    Abstract: Conversational dialog with a computer or other processor-based device without requiring push-to-talk functionality. In one embodiment, a computer-implemented method first determines that a user desires to engage in a dialog. Based thereon the method turns on a speech recognition functionality for a period of time referred to as a listening horizon. Upon the listening horizon expiring, the method turns off the speech recognition functionality.
    Type: Grant
    Filed: October 24, 2005
    Date of Patent: July 3, 2007
    Assignee: Microsoft Corporation
    Inventor: Eric J. Horvitz
  • Patent number: 7231343
    Abstract: Roughly described, a natural language interface to a back-end application incorporates synonyms automatically added to user input to enhance the natural language interpretation. Synonyms can be learned from user input and written into a synonyms database. Their selection can be based on tokens identified in user input. Natural language interpretation can be performed by agents arranged in a network, which parse the user input in a distributed manner. In an embodiment, a particular agent of the natural language interpreter receives a first message that includes the user input, returns a message claiming at least a portion of the user input, and subsequently receives a second message delegating actuation of at least that portion to the particular agent.
    Type: Grant
    Filed: December 20, 2002
    Date of Patent: June 12, 2007
    Assignee: iAnywhere Solutions, Inc.
    Inventors: Nicholas K. Treadgold, Babak Hodjat
  • Patent number: 7231342
    Abstract: A method, system, and computer program product within a logically partitioned data processing system that includes multiple partitions and a management console are described for displaying messages in a language specified by the console. A language is specified by the management console. The specification is transmitted to each one of the partitions. A message is generated within one of the partitions. The partition then utilizes the specification to select a translation of the message into the language specified by the management console. The translation is then transmitted from the partition to the management console for display by the management console.
    Type: Grant
    Filed: January 9, 2003
    Date of Patent: June 12, 2007
    Assignee: International Business Machines Corporation
    Inventors: Mark Steven Edwards, Ya-Huey Juan, Truc Duy Nguyen
  • Patent number: 7219065
    Abstract: A sound processor including a microphone (1), a pre-amplifier (2), a bank of N parallel filters (3), means for detecting short-duration transitions in the envelope signal of each filter channel, and means for applying gain to the outputs of these filter channels in which the gain is related to a function of the second-order derivative of the slow-varying envelope signal in each filter channel, to assist in perception of low-intensity short-duration speech features in said signal.
    Type: Grant
    Filed: October 25, 2000
    Date of Patent: May 15, 2007
    Inventors: Andrew E. Vandali, Graeme M. Clark
  • Patent number: 7194409
    Abstract: A method and system for allowing a user to interface to an interactive voice response system via natural language commands. The system plays a prompt that initiates user interaction. In certain embodiments, the system detects initial user speech, wherein the initial user speech begins during the prompt or during a silence after the prompt. Then, the system determines whether the user speech restarts (second user speech) within a predetermined time period, wherein the predetermined time period is dependent upon whether the initial user speech began during the prompt or during the silence. If the user speech does restart, then the system uses the second user speech for recognition purposes. If the user speech does not restart, then the system uses the initial user speech for recognition purposes.
    Type: Grant
    Filed: November 30, 2001
    Date of Patent: March 20, 2007
    Inventors: Bruce Balentine, Rex Stringham, Ralph Melaragno, Justin Munroe