Patents by Inventor Kazuhito Koishida

Kazuhito Koishida has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Sub-band voice codec with multi-stage codebooks and redundant coding

Publication number: 20080040105

Abstract: Techniques and tools related to coding and decoding of audio information are described. For example, redundant coded information for decoding a current frame includes signal history information associated with only a portion of a previous frame. As another example, redundant coded information for decoding a coded unit includes parameters for a codebook stage to be used in decoding the current coded unit only if the previous coded unit is not available. As yet another example, coded audio units each include a field indicating whether the coded unit includes main encoded information representing a segment of an audio signal, and whether the coded unit includes redundant coded information for use in decoding main encoded information.

Type: Application

Filed: October 9, 2007

Publication date: February 14, 2008

Applicant: Microsoft Corporation

Inventors: Tian Wang, Kazuhito Koishida, Hosam Khalil, Xiaoqin Sun, Wei-Ge Chen
Sub-band voice codec with multi-stage codebooks and redundant coding

Publication number: 20080040121

Abstract: Techniques and tools related to coding and decoding of audio information are described. For example, redundant coded information for decoding a current frame includes signal history information associated with only a portion of a previous frame. As another example, redundant coded information for decoding a coded unit includes parameters for a codebook stage to be used in decoding the current coded unit only if the previous coded unit is not available. As yet another example, coded audio units each include a field indicating whether the coded unit includes main encoded information representing a segment of an audio signal, and whether the coded unit includes redundant coded information for use in decoding main encoded information.

Type: Application

Filed: October 9, 2007

Publication date: February 14, 2008

Applicant: Microsoft Corporation

Inventors: Tian Wang, Kazuhito Koishida, Hosam Khalil, Xiaoqin Sun, Wei-Ge Chen
LPC-harmonic vocoder with superframe structure

Patent number: 7315815

Abstract: An enhanced low-bit rate parametric voice coder that groups a number of frames from an underlying frame-based vocoder, such as MELP, into a superframe structure. Parameters are extracted from the group of underlying frames and quantized into the superframe which allows the bit rate of the underlying coding to be reduced without increasing the distortion. The speech data coded in the superframe structure can then be directly synthesized to speech or may be transcoded to a format so that an underlying frame-based vocoder performs the synthesis. The superframe structure includes additional error detection and correction data to reduce the distortion caused by the communication of bit errors.

Type: Grant

Filed: September 22, 1999

Date of Patent: January 1, 2008

Assignee: Microsoft Corporation

Inventors: Allen Gersho, Vladimir Cuperman, Tian Wang, Kazuhito Koishida
LPC-harmonic vocoder with superframe structure

Patent number: 7286982

Abstract: An enhanced low-bit rate parametric voice coder that groups a number of frames from an underlying frame-based vocoder, such as MELP, into a superframe structure. Parameters are extracted from the group of underlying frames and quantized into the superframe which allows the bit rate of the underlying coding to be reduced without increasing the distortion. The speech data coded in the superframe structure can then be directly synthesized to speech or may be transcoded to a format so that an underlying frame-based vocoder performs the synthesis. The superframe structure includes additional error detection and correction data to reduce the distortion caused by the communication of bit errors.

Type: Grant

Filed: July 20, 2004

Date of Patent: October 23, 2007

Assignee: Microsoft Corporation

Inventors: Allen Gersho, Vladimir Cuperman, Tian Wang, Kazuhito Koishida
Sub-band voice codec with multi-stage codebooks and redundant coding

Patent number: 7280960

Abstract: Techniques and tools related to coding and decoding of audio information are described. For example, redundant coded information for decoding a current frame includes signal history information associated with only a portion of a previous frame. As another example, redundant coded information for decoding a coded unit includes parameters for a codebook stage to be used in decoding the current coded unit only if the previous coded unit is not available. As yet another example, coded audio units each include a field indicating whether the coded unit includes main encoded information representing a segment of an audio signal, and whether the coded unit includes redundant coded information for use in decoding main encoded information.

Type: Grant

Filed: August 4, 2005

Date of Patent: October 9, 2007

Assignee: Microsoft Corporation

Inventors: Tian Wang, Kazuhito Koishida, Hosam A. Khalil, Xiaoqin Sun, Wei-Ge Chen
Shape and scale parameters for extended-band frequency coding

Publication number: 20070174063

Abstract: An audio encoder performs frequency extension coding that comprises determining one or more shape parameters using a displacement vector that corresponds to a displacement of an even number (e.g., an even number of sub-bands between a sub-band in a baseband frequency range and a sub-band in an extended-band frequency range). The shape parameters can be determined on a per-audio-block basis. Restricting a displacement to an even number (in frequency extension coding or in other signal modulation schemes) can improve the quality of reconstructed audio. An audio encoder also can perform frequency extension coding that comprises determining one or more scale parameters at one or more audio blocks, and determining one or more anchor points for interpolating the one or more scale parameters.

Type: Application

Filed: January 20, 2006

Publication date: July 26, 2007

Applicant: Microsoft Corporation

Inventors: Sanjeev Mehrotra, Wei-Ge Chen, Kazuhito Koishida, Chao He
Sub-band voice codec with multi-stage codebooks and redundant coding

Patent number: 7177804

Abstract: Techniques and tools related to coding and decoding of audio information are described. For example, redundant coded information for decoding a current frame includes signal history information associated with only a portion of a previous frame. As another example, redundant coded information for decoding a coded unit includes parameters for a codebook stage to be used in decoding the current coded unit only if the previous coded unit is not available. As yet another example, coded audio units each include a field indicating whether the coded unit includes main encoded information representing a segment of an audio signal, and whether the coded unit includes redundant coded information for use in decoding main encoded information.

Type: Grant

Filed: May 31, 2005

Date of Patent: February 13, 2007

Assignee: Microsoft Corporation

Inventors: Tian Wang, Kazuhito Koishida, Hosam A. Khalil, Xiaoqin Sun, Wei-Ge Chen
Modification of codewords in dictionary used for efficient coding of digital media spectral data

Publication number: 20070016414

Abstract: Coding of spectral data by representing certain portions of the spectral data as a scaled version of a code-vector, where the code-vector is chosen from either a fixed predetermined codebook or a codebook taken from a baseband. Various optional features are described for modifying the code-vectors in the codebook according to some rules which allow the code-vector to better represent the data they are modeling. The code-vector modification comprises a linear or non-linear transform of one or more code-vectors, such as, by exponentiation, negation, reversing, or combining elements from plural code-vectors.

Type: Application

Filed: July 15, 2005

Publication date: January 18, 2007

Applicant: Microsoft Corporation

Inventors: Sanjeev Mehrotra, Wei-Ge Chen, Kazuhito Koishida
Robust decoder

Publication number: 20060271373

Abstract: Techniques and tools related to delayed or lost coded audio information are described. For example, a concealment technique for one or more missing frames is selected based on one or more factors that include a classification of each of one or more available frames near the one or more missing frames. As another example, information from a concealment signal is used to produce substitute information that is relied on in decoding a subsequent frame. As yet another example, a data structure having nodes corresponding to received packet delays is used to determine a desired decoder packet delay value.

Type: Application

Filed: May 31, 2005

Publication date: November 30, 2006

Applicant: Microsoft Corporation

Inventors: Hosam Khalil, Tian Wang, Kazuhito Koishida, Xiaoqin Sun, Wei-Ge Chen
SUB-BAND VOICE CODEC WITH MULTI-STAGE CODEBOOKS AND REDUNDANT CODING

Publication number: 20060271355

Abstract: Techniques and tools related to coding and decoding of audio information are described. For example, redundant coded information for decoding a current frame includes signal history information associated with only a portion of a previous frame. As another example, redundant coded information for decoding a coded unit includes parameters for a codebook stage to be used in decoding the current coded unit only if the previous coded unit is not available. As yet another example, coded audio units each include a field indicating whether the coded unit includes main encoded information representing a segment of an audio signal, and whether the coded unit includes redundant coded information for use in decoding main encoded information.

Type: Application

Filed: May 31, 2005

Publication date: November 30, 2006

Applicant: Microsoft Corporation

Inventors: Tian Wang, Kazuhito Koishida, Hosam Khalil, Xiaoqin Sun, Wei-Ge Chen
Robust decoder

Publication number: 20060271359

Abstract: Techniques and tools related to delayed or lost coded audio information are described. For example, a concealment technique for one or more missing frames is selected based on one or more factors that include a classification of each of one or more available frames near the one or more missing frames. As another example, information from a concealment signal is used to produce substitute information that is relied on in decoding a subsequent frame. As yet another example, a data structure having nodes corresponding to received packet delays is used to determine a desired decoder packet delay value.

Type: Application

Filed: August 4, 2005

Publication date: November 30, 2006

Applicant: Microsoft Corporation

Inventors: Hosam Khalil, Tian Wang, Kazuhito Koishida, Xiaoqin Sun, Wei-Ge Chen
Audio codec post-filter

Publication number: 20060271354

Abstract: Techniques and tools are described for processing reconstructed audio signals. For example, a reconstructed audio signal is filtered in the time domain using filter coefficients that are calculated, at least in part, in the frequency domain. As another example, producing a set of filter coefficients for filtering a reconstructed audio signal includes clipping one or more peaks of a set of coefficient values. As yet another example, for a sub-band codec, in a frequency region near an intersection between two sub-bands, a reconstructed composite signal is enhanced.

Type: Application

Filed: May 31, 2005

Publication date: November 30, 2006

Applicant: Microsoft Corporation

Inventors: Xiaoqin Sun, Tian Wang, Hosam Khalil, Kazuhito Koishida, Wei-Ge Chen
Sub-band voice codec with multi-stage codebooks and redundant coding

Publication number: 20060271357

Abstract: Techniques and tools related to coding and decoding of audio information are described. For example, redundant coded information for decoding a current frame includes signal history information associated with only a portion of a previous frame. As another example, redundant coded information for decoding a coded unit includes parameters for a codebook stage to be used in decoding the current coded unit only if the previous coded unit is not available. As yet another example, coded audio units each include a field indicating whether the coded unit includes main encoded information representing a segment of an audio signal, and whether the coded unit includes redundant coded information for use in decoding main encoded information.

Type: Application

Filed: August 4, 2005

Publication date: November 30, 2006

Applicant: Microsoft Corporation

Inventors: Tian Wang, Kazuhito Koishida, Hosam Khalil, Xiaoqin Sun, Wei-Ge Chen
Gain constrained noise suppression

Publication number: 20050278172

Abstract: A gain-constrained noise suppression for speech more precisely estimates noise, including during speech, to reduce musical noise artifacts introduced from noise suppression. The noise suppression operates by applying a spectral gain G(m, k) to each short-time spectrum value S(m, k) of a speech signal, where m is the frame number and k is the spectrum index. The spectrum values are grouped into frequency bins, and a noise characteristic estimated for each bin classified as a “noise bin.” An energy parameter is smoothed in both the time domain and the frequency domain to improve noise estimation per bin. The gain factors G(m, k) are calculated based on the current signal spectrum and the noise estimation, then smoothed before being applied to the signal spectral values S(m, k).

Type: Application

Filed: June 15, 2004

Publication date: December 15, 2005

Applicant: Microsoft Corporation

Inventors: Kazuhito Koishida, Feng Zhuge, Hosam Khalil, Tian Wang, Wei-ge Chen
Robust real-time speech codec

Publication number: 20050228651

Abstract: Various strategies for rate/quality control and loss resiliency in an audio codec are described. The various strategies can be used in combination or independently. For example, a real-time speech codec uses intra frame coding/decoding, adaptive multi-mode forward error correction [“FEC”], and rate/quality control techniques. Intra frames help a decoder recover quickly from packet losses, while compression efficiency is still emphasized with predicted frames. Various strategies for inserting intra frames and signaling intra/predicted frames are described. With the adaptive multi-mode FEC, an encoder adaptively selects between multiple modes to efficiently and quickly provide a level of FEC that takes into account the bandwidth currently available for FEC. The FEC information itself may be predictively encoded and decoded relative to primary encoded information. Various rate/quality and FEC control strategies allow additional adaptation to available bandwidth and network conditions.

Type: Application

Filed: March 31, 2004

Publication date: October 13, 2005

Inventors: Tian Wang, Hosam Khalil, Kazuhito Koishida, Wei-Ge Chen, Mu Han
LPC-harmonic vocoder with superframe structure

Publication number: 20050075869

Abstract: An enhanced_low-bit rate parametric voice coder that groups a number of frames from an underlying frame-based vocoder, such as MELP, into a superframe structure. Parameters are extracted from the group of underlying frames and quantized into the superframe which allows the bit rate of the underlying coding to be reduced without increasing the distortion. The speech data coded in the superframe structure can then be directly synthesized to speech or may be transcoded to a format so that an underlying frame-based vocoder performs the synthesis. The superframe structure includes additional error detection and correction data to reduce the distortion caused by the communication of bit errors.

Type: Application

Filed: July 20, 2004

Publication date: April 7, 2005

Applicant: Microsoft Corporation

Inventors: Allen Gersho, Vladimir Cuperman, Tian Wang, Kazuhito Koishida
Method for coding speech and music signals

Patent number: 6658383

Abstract: The present invention provides a transform coding method efficient for music signals that is suitable for use in a hybrid codec, whereby a common Linear Predictive (LP) synthesis filter is employed for both speech and music signals. The LP synthesis filter switches between a speech excitation generator and a transform excitation generator, in accordance with the coding of a speech or music signal, respectively. For coding speech signals, the conventional CELP technique may be used, while a novel asymmetrical overlap-add transform technique is applied for coding music signals. In performing the common LP synthesis filtering, interpolation of the LP coefficients is conducted for signals in overlap-add operation regions. The invention enables smooth transitions when the decoder switches between speech and music decoding modes.

Type: Grant

Filed: June 26, 2001

Date of Patent: December 2, 2003

Assignee: Microsoft Corporation

Inventors: Kazuhito Koishida, Vladimir Cuperman, Amir H. Majidimehr, Allen Gersho
Rate control strategies for speech and music coding

Patent number: 6647366

Abstract: A method and a system are provided for controlling the coding rates of a multimode coding system with respect to a sequence of input audio signal frames. The method eliminates or minimizes the overflow and underflow of a bit-stream buffer maintained by the coding system for temporarily recording bit-stream data prior to transmission or storage.

Type: Grant

Filed: December 28, 2001

Date of Patent: November 11, 2003

Assignee: Microsoft Corporation

Inventors: Tian Wang, Kazuhito Koishida, Vladimir Cuperman
Rate control strategies for speech and music coding

Publication number: 20030125932

Abstract: A method and a system are provided for controlling the coding rates of a multimode coding system with respect to a sequence of input audio signal frames. The method eliminates or minimizes the overflow and underflow of a bit-stream buffer maintained by the coding system for temporarily recording bit-stream data prior to transmission or storage.

Type: Application

Filed: December 28, 2001

Publication date: July 3, 2003

Applicant: Microsoft Corporation

Inventors: Tian Wang, Kazuhito Koishida, Vladimir Cuperman
Method for coding speech and music signals

Publication number: 20030004711

Abstract: The present invention provides a transform coding method efficient for music signals that is suitable for use in a hybrid codec, whereby a common Linear Predictive (LP) synthesis filter is employed for both speech and music signals. The LP synthesis filter switches between a speech excitation generator and a transform excitation generator, in accordance with the coding of a speech or music signal, respectively. For coding speech signals, the conventional CELP technique may be used, while a novel asymmetrical overlap-add transform technique is applied for coding music signals. In performing the common LP synthesis filtering, interpolation of the LP coefficients is conducted for signals in overlap-add operation regions. The invention enables smooth transitions when the decoder switches between speech and music decoding modes.

Type: Application

Filed: June 26, 2001

Publication date: January 2, 2003

Applicant: Microsoft Corporation

Inventors: Kazuhito Koishida, Vladimir Cuperman, Amir H. Majidimehr, Allen Gersho

prev 1 2 3 4