Patents Examined by Donald L. Storm
-
Patent number: 7295971Abstract: An audio encoder regulates quality and bitrate with a control strategy. The strategy includes several features. First, an encoder regulates quantization using quality, minimum bit count, and maximum bit count parameters. Second, an encoder regulates quantization using a noise measure that indicates reliability of a complexity measure. Third, an encoder normalizes a control parameter value according to block size for a variable-size block. Fourth, an encoder uses a bit-count control loop de-linked from a quality control loop. Fifth, an encoder addresses non-monotonicity of quality measurement as a function of quantization level when selecting a quantization level. Sixth, an encoder uses particular interpolation rules to find a quantization level in a quality or bit-count control loop. Seventh, an encoder filters a control parameter value to smooth quality. Eighth, an encoder corrects model bias by adjusting a control parameter value in view of current buffer fullness.Type: GrantFiled: November 14, 2006Date of Patent: November 13, 2007Assignee: Microsoft CorporationInventors: Wei-Ge Chen, Naveen Thumpudi, Ming-Chieh Lee
-
Patent number: 7295969Abstract: A recording and playback system is provided. The system includes an audio capturing device configured to receive an analog input and an encoder coupled to the audio capturing device configured to generate a digital signal based on the analog input. The system further includes a recognition engine coupled to the audio capturing device and configured to generate text data based on the analog input, wherein the encoder and the recognition engine simultaneously generate the digital signal and the text data such that the digital signal and the text data can be provided in a synchronized manner.Type: GrantFiled: March 8, 2004Date of Patent: November 13, 2007Assignees: Sony Corporation, Sony Electronics, Inc.Inventor: Takashi Nakatsuyama
-
Patent number: 7295973Abstract: An audio encoder regulates quality and bitrate with a control strategy. The strategy includes several features. First, an encoder regulates quantization using quality, minimum bit count, and maximum bit count parameters. Second, an encoder regulates quantization using a noise measure that indicates reliability of a complexity measure. Third, an encoder normalizes a control parameter value according to block size for a variable-size block. Fourth, an encoder uses a bit-count control loop de-linked from a quality control loop. Fifth, an encoder addresses non-monotonicity of quality measurement as a function of quantization level when selecting a quantization level. Sixth, an encoder uses particular interpolation rules to find a quantization level in a quality or bit-count control loop. Seventh, an encoder filters a control parameter value to smooth quality. Eighth, an encoder corrects model bias by adjusting a control parameter value in view of current buffer fullness.Type: GrantFiled: February 24, 2005Date of Patent: November 13, 2007Assignee: Microsoft CorporationInventors: Wei-Ge Chen, Ming-Chieh Lee, Naveen Thumpudi
-
Patent number: 7289950Abstract: An extended finite state grammar structure is generated from a finite state grammar. The extended finite state grammar structure includes word subgraphs representing a set of pre-defined word strings for words in the finite state grammar, and a set of all possible word strings for the words. The extended finite state grammar structure can be used to transform audio input into one or more of the word strings.Type: GrantFiled: September 21, 2004Date of Patent: October 30, 2007Assignee: Apple Inc.Inventors: Jerome R. Bellegarda, Kim E. A. Silverman
-
Patent number: 7283961Abstract: There is disclosed a speech processing device in which prediction taps for finding prediction values of the speech of high sound quality are extracted from the synthesized sound obtained on affording linear prediction coefficients and residual signals, generated from a preset code, to a speech synthesis filter, speech of high sound quality being higher in sound quality than the synthesized sound, and in which the prediction taps are used along with preset tap coefficients to perform preset predictive calculations to find the prediction values of the speech of high sound quality. The speech of high sound quality is higher in sound quality than the synthesized sound.Type: GrantFiled: August 3, 2001Date of Patent: October 16, 2007Assignee: Sony CorporationInventors: Tetsujiro Kondo, Tsutomu Watanabe, Masaaki Hattori, Hiroto Kimura, Yasuhiro Fujimori
-
Patent number: 7283952Abstract: An audio encoder regulates quality and bitrate with a control strategy. The strategy includes several features. First, an encoder regulates quantization using quality, minimum bit count, and maximum bit count parameters. Second, an encoder regulates quantization using a noise measure that indicates reliability of a complexity measure. Third, an encoder normalizes a control parameter value according to block size for a variable-size block. Fourth, an encoder uses a bit-count control loop de-linked from a quality control loop. Fifth, an encoder addresses non-monotonicity of quality measurement as a function of quantization level when selecting a quantization level. Sixth, an encoder uses particular interpolation rules to find a quantization level in a quality or bit-count control loop. Seventh, an encoder filters a control parameter value to smooth quality. Eighth, an encoder corrects model bias by adjusting a control parameter value in view of current buffer fullness.Type: GrantFiled: February 24, 2005Date of Patent: October 16, 2007Assignee: Microsoft CorporationInventors: Wei-Ge Chen, Naveen Thumpudi, Ming-Chieh Lee
-
Patent number: 7277848Abstract: An audio encoder regulates quality and bitrate with a control strategy. The strategy includes several features. First, an encoder regulates quantization using quality, minimum bit count, and maximum bit count parameters. Second, an encoder regulates quantization using a noise measure that indicates reliability of a complexity measure. Third, an encoder normalizes a control parameter value according to block size for a variable-size block. Fourth, an encoder uses a bit-count control loop de-linked from a quality control loop. Fifth, an encoder addresses non-monotonicity of quality measurement as a function of quantization level when selecting a quantization level. Sixth, an encoder uses particular interpolation rules to find a quantization level In a quality or bit-count control loop. Seventh, an encoder filters a control parameter value to smooth quality. Eighth, an encoder corrects model bias by adjusting a control parameter value in view of current buffer fullness.Type: GrantFiled: February 24, 2005Date of Patent: October 2, 2007Assignee: Microsoft CorporationInventors: Wei-Ge Chen, Naveen Thumpudi, Ming-Chieh Lee
-
Patent number: 7272564Abstract: A method and apparatus for providing multimodal communication outputs information, such as retrieved content, in a first modality. An output modality change command is generated, such as via a multimodal user input interface, or other suitable mechanism. A multimodal communication apparatus and method then reprovides the previously output information as reprovided information in a different output modality in response to receiving the output modality change command. Accordingly, a user or unit may have content delivered in one modality and redelivered in a different preferred modality or modalities. Accordingly, a user or device may request output modalities dynamically so that content can be delivered using a different user preference after the content has already been provided in a first modality.Type: GrantFiled: March 22, 2002Date of Patent: September 18, 2007Assignee: Motorola, Inc.Inventors: W. Garland Phillips, Dwight Randall Smith
-
Patent number: 7266492Abstract: A system and method facilitating training machine learning systems utilizing sequential conditional generalized iterative scaling is provided. The invention includes an expected value update component that modifies an expected value based, at least in part, upon a feature function of an input vector and an output value, a sum of lambda variable and a normalization variable. The invention further includes an error calculator that calculates an error based, at least in part, upon the expected value and an observed value. The invention also includes a parameter update component that modifies a trainable parameter based, at least in part, upon the error. A variable update component that updates at least one of the sum of lambda variable and the normalization variable based, at least in part, upon the error is also provided.Type: GrantFiled: August 16, 2006Date of Patent: September 4, 2007Assignee: Microsoft CorporationInventor: Joshua Theodore Goodman
-
Patent number: 7263482Abstract: An audio encoder regulates quality and bitrate with a control strategy. The strategy includes several features. For example, an encoder selects a quantization level within a range of quantization levels, where the selecting accounts for non-monotonicity of quality measure as a function of quantization level within the range. The encoder then quantizes audio information by the quantization level. Or, an encoder determines first and second quality measures associated with a first and second quantization levels, respectively, then determines a third quantization level within a quantization level range based upon location of a target quality on a trajectory of quality measure as a function of quantization level. The first and second quantization levels define endpoints of the quantization level range, and the first and second quality measures define endpoints of the trajectory. The function relates logarithm of quality measure in proportion to inverse logarithm of quantization level.Type: GrantFiled: February 24, 2005Date of Patent: August 28, 2007Assignee: Microsoft CorporationInventors: Wei-Ge Chen, Naveen Thumpudi, Ming-Chieh Lee
-
Patent number: 7263487Abstract: The present invention generates a task-dependent acoustic model from a supervised task-independent corpus and further adapted it with an unsupervised task dependent corpus. The task-independent corpus includes task-independent training data which has an acoustic representation of words and a sequence of transcribed words corresponding to the acoustic representation. A relevance measure is defined for each of the words in the task-independent data. The relevance measure is used to weight the data associated with each of the words in the task-independent training data. The task-dependent acoustic model is then trained based on the weighted data for the words in the task-independent training data.Type: GrantFiled: September 29, 2005Date of Patent: August 28, 2007Assignee: Microsoft CorporationInventor: Mei Yuh Hwang
-
Patent number: 7260525Abstract: An audio encoder regulates quality and bitrate with a control strategy. The strategy includes several features. First, an encoder regulates quantization using quality, minimum bit count, and maximum bit count parameters. Second, an encoder regulates quantization using a noise measure that indicates reliability of a complexity measure. Third, an encoder normalizes a control parameter value according to block size for a variable-size block. Fourth, an encoder uses a bit-count control loop de-linked from a quality control loop. Fifth, an encoder addresses non-monotonicity of quality measurement as a function of quantization level when selecting a quantization level. Sixth, an encoder uses particular interpolation rules to find a quantization level in a quality or bit-count control loop. Seventh, an encoder filters a control parameter value to smooth quality. Eighth, an encoder corrects model bias by adjusting a control parameter value in view of current buffer fullness.Type: GrantFiled: February 24, 2005Date of Patent: August 21, 2007Assignee: Microsoft CorporationInventors: Wei-Ge Chen, Ming-Chieh Lee, Naveen Thumpudi
-
Patent number: 7257533Abstract: A data structure is provided for annotating data files within a database. The annotation data comprises a phoneme and word lattice which allows the quick and efficient searching of data files within the database in response to a user's input query. The structure of the annotation data is such that it allows the input query to be made by voice and can be used for annotating various kinds of data files, such as audio data files, video data files, multimedia data files etc. The annotation data may be generated from the data files themselves or may be input by the user either from a voiced input or from a typed input.Type: GrantFiled: September 22, 2005Date of Patent: August 14, 2007Assignee: Canon Kabushiki KaishaInventors: Jason Peter Andrew Charlesworth, Philip Neil Garner, Jebu Jacob Rajan
-
Patent number: 7251596Abstract: The present invention provides a unique wave-trigon transformation (WTT) method for performing transformation process over a wave signal. The present invention also provides a pitch detecting method and apparatus for detecting pitch based on the WTT process as well as a sentence detecting method and apparatus for detecting a sentence in a sound signal based on the WTT process. The pitch detecting method and apparatus can effectively detect pitch in a sound signal. In the WTT process, an inputted wave signal (such as a sound signal) is transformed into a series of trigons, and an energy-width spectrum is formed using these trigons. For a sound signal containing voice, the distribution of trigons transformed from the sound signal has a certain pattern. By analyzing the pattern, whether a pitch is contained in the sound signal can be determined. In particular, existence of a pitch can be determined by determining and evaluating the periodicity of trigons in a candidate chained peak in the energy-width spectrum.Type: GrantFiled: December 23, 2002Date of Patent: July 31, 2007Assignee: Canon Kabushiki KaishaInventors: Lianshan Zhu, Tao Yu
-
Patent number: 7249016Abstract: Quantization matrices facilitate digital audio encoding and decoding. An audio encoder generates and compresses quantization matrices; an audio decoder decompresses and applies the quantization matrices. The invention includes several techniques and tools, which can be used in combination or separately. For example, the audio encoder can generate quantization matrices from critical band patterns for blocks of audio data. The encoder can compute the quantization matrices directly from the critical band patterns, which can be computed from the same audio data that is being compressed. The audio encoder/decoder can use different modes for generating/applying quantization matrices depending on the coding channel mode of multi-channel audio data. The audio encoder/decoder can use different compression/decompression modes for the quantization matrices, including a parametric compression/decompression mode.Type: GrantFiled: February 17, 2005Date of Patent: July 24, 2007Assignee: Microsoft CorporationInventors: Wei-Ge Chen, Naveen Thumpudi, Ming-Chieh Lee
-
Patent number: 7240011Abstract: Conversational dialog with a computer or other processor-based device without requiring push-to-talk functionality. In one embodiment, a computer-implemented method first determines that a user desires to engage in a dialog. Based thereon the method turns on a speech recognition functionality for a period of time referred to as a listening horizon. Upon the listening horizon expiring, the method turns off the speech recognition functionality.Type: GrantFiled: October 24, 2005Date of Patent: July 3, 2007Assignee: Microsoft CorporationInventor: Eric J. Horvitz
-
Patent number: 7231343Abstract: Roughly described, a natural language interface to a back-end application incorporates synonyms automatically added to user input to enhance the natural language interpretation. Synonyms can be learned from user input and written into a synonyms database. Their selection can be based on tokens identified in user input. Natural language interpretation can be performed by agents arranged in a network, which parse the user input in a distributed manner. In an embodiment, a particular agent of the natural language interpreter receives a first message that includes the user input, returns a message claiming at least a portion of the user input, and subsequently receives a second message delegating actuation of at least that portion to the particular agent.Type: GrantFiled: December 20, 2002Date of Patent: June 12, 2007Assignee: iAnywhere Solutions, Inc.Inventors: Nicholas K. Treadgold, Babak Hodjat
-
Patent number: 7231342Abstract: A method, system, and computer program product within a logically partitioned data processing system that includes multiple partitions and a management console are described for displaying messages in a language specified by the console. A language is specified by the management console. The specification is transmitted to each one of the partitions. A message is generated within one of the partitions. The partition then utilizes the specification to select a translation of the message into the language specified by the management console. The translation is then transmitted from the partition to the management console for display by the management console.Type: GrantFiled: January 9, 2003Date of Patent: June 12, 2007Assignee: International Business Machines CorporationInventors: Mark Steven Edwards, Ya-Huey Juan, Truc Duy Nguyen
-
Patent number: 7219065Abstract: A sound processor including a microphone (1), a pre-amplifier (2), a bank of N parallel filters (3), means for detecting short-duration transitions in the envelope signal of each filter channel, and means for applying gain to the outputs of these filter channels in which the gain is related to a function of the second-order derivative of the slow-varying envelope signal in each filter channel, to assist in perception of low-intensity short-duration speech features in said signal.Type: GrantFiled: October 25, 2000Date of Patent: May 15, 2007Inventors: Andrew E. Vandali, Graeme M. Clark
-
Patent number: 7194409Abstract: A method and system for allowing a user to interface to an interactive voice response system via natural language commands. The system plays a prompt that initiates user interaction. In certain embodiments, the system detects initial user speech, wherein the initial user speech begins during the prompt or during a silence after the prompt. Then, the system determines whether the user speech restarts (second user speech) within a predetermined time period, wherein the predetermined time period is dependent upon whether the initial user speech began during the prompt or during the silence. If the user speech does restart, then the system uses the second user speech for recognition purposes. If the user speech does not restart, then the system uses the initial user speech for recognition purposes.Type: GrantFiled: November 30, 2001Date of Patent: March 20, 2007Inventors: Bruce Balentine, Rex Stringham, Ralph Melaragno, Justin Munroe