Patents by Inventor Dinei A. Florencio

Dinei A. Florencio has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Microphone array signal enhancement

Publication number: 20060104455

Abstract: A system and method facilitating signal enhancement utilizing an adaptive filter is provided. The invention includes an adaptive filter that filters an input based upon a plurality of adaptive coefficients, the adaptive filter modifying at least one of the adaptive coefficients based on a feedback output. The invention further includes a feedback component that provides the feedback output based, at least in part, upon a non-linear function of the acoustic reverberation reduced output. The invention further provides a noise statistics component that stores noise statistics associated with a noise portion of an input signal and a signal+noise statistics component that stores signal+noise statistics associated with a signal and noise portion of the input signal.

Type: Application

Filed: December 29, 2005

Publication date: May 18, 2006

Applicant: Microsoft Corporation

Inventors: Henrique Malvar, Dinei Florencio, Bradford Gillespie

System and process for time delay estimation in the presence of correlated noise and reverberation

Patent number: 7039200

Abstract: A system and process for estimating the time delay of arrival (TDOA) between a pair of audio sensors of a microphone array is presented. Generally, a generalized cross-correlation (GCC) technique is employed. However, this technique is improved to include provisions for both reducing the influence (including interference) from correlated ambient noise and reverberation noise in the sensor signals prior to computing the TDOA estimate. Two unique correlated ambient noise reduction procedures are also proposed. One involves the application of Wiener filtering, and the other a combination of Wiener filtering with a Gnn subtraction technique. In addition, two unique reverberation noise reduction procedures are proposed. Both involve applying a weighting factor to the signals prior to computing the TDOA which combines the effects of a traditional maximum likelihood (TML) weighting function and a phase transformation (PHAT) weighting function.

Type: Grant

Filed: March 31, 2003

Date of Patent: May 2, 2006

Assignee: Microsoft Corporation

Inventors: Yong Rui, Dinei A. Florencio
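
To illustrate the generalized cross-correlation step named in the abstract above, here is a minimal Python sketch that applies only the PHAT weighting. It is not the patented method: the Wiener filtering, Gnn subtraction, and combined TML/PHAT weighting are omitted. The function names `dft` and `gcc_phat_delay`, the `eps` guard, and the use of a circular correlation are assumptions made for the example.

```python
import cmath


def dft(x, inverse=False):
    """Naive O(n^2) discrete Fourier transform (adequate for short demo signals)."""
    n = len(x)
    sign = 1 if inverse else -1
    out = []
    for k in range(n):
        acc = 0j
        for t in range(n):
            acc += x[t] * cmath.exp(sign * 2j * cmath.pi * k * t / n)
        out.append(acc / n if inverse else acc)
    return out


def gcc_phat_delay(x1, x2, eps=1e-12):
    """Estimate the lag (in samples) by which x2 trails x1, using PHAT weighting only.

    Assumes both signals have the same length and treats the correlation
    as circular, which is a simplification for illustration.
    """
    n = len(x1)
    X1, X2 = dft(x1), dft(x2)
    # Cross-power spectrum of the two sensor signals.
    cross = [b * a.conjugate() for a, b in zip(X1, X2)]
    # PHAT weighting: keep only the phase by normalising each bin to unit magnitude.
    weighted = [c / (abs(c) + eps) for c in cross]
    corr = [v.real for v in dft(weighted, inverse=True)]
    peak = max(range(n), key=lambda i: corr[i])
    # Indices above n/2 correspond to negative lags in a circular correlation.
    return peak if peak <= n // 2 else peak - n
```

If `x2` is a copy of `x1` circularly delayed by five samples, `gcc_phat_delay(x1, x2)` is expected to return 5; a copy that leads by three samples gives -3.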

Tarp filter

Publication number: 20060078210

Abstract: Systems and methods for performing adaptive filtering are disclosed. The present invention generates probabilities that can be used in an encoder, such as an arithmetic encoder and generates those probabilities in a computationally efficient manner. Probabilities of previously encoded coefficients are employed, effectively, in generating probabilities of the coefficients without regard to directional information. Thus, a large amount of information is adaptively and efficiently used in generating the probabilities. For the coefficients, the probability is computed based at least partly on at least one probability of a previously computed probability of a neighboring coefficient. Then, the coefficients are encoded using those computed probabilities.

Type: Application

Filed: November 28, 2005

Publication date: April 13, 2006

Applicant: Microsoft Corporation

Inventors: Patrice Simard, Henrique Malvar, Dinei Florencio, David Steinkraus

System and process for robust sound source localization

Patent number: 6999593

Abstract: A system and process for finding the location of a sound source using direct approaches having weighting factors that mitigate the effect of both correlated and reverberation noise is presented. When more than two microphones are used, the traditional time-delay-of-arrival (TDOA) based sound source localization (SSL) approach involves two steps. The first step computes TDOA for each microphone pair, and the second step combines these estimates. This two-step process discards relevant information in the first step, thus degrading the SSL accuracy and robustness. In the present invention, direct, one-step, approaches are employed. Namely, a one-step TDOA SSL approach and a steered beam (SB) SSL approach are employed. Each of these approaches provides an accuracy and robustness not available with the traditional two-step approaches.

Type: Grant

Filed: May 28, 2003

Date of Patent: February 14, 2006

Assignee: Microsoft Corporation

Inventors: Yong Rui, Dinei A. Florencio

System and process for time delay estimation in the presence of correlated noise and reverberation

Publication number: 20050249038

Abstract: A system and process for estimating the time delay of arrival (TDOA) between a pair of audio sensors of a microphone array is presented. Generally, a generalized cross-correlation (GCC) technique is employed. However, this technique is improved to include provisions for both reducing the influence (including interference) from correlated ambient noise and reverberation noise in the sensor signals prior to computing the TDOA estimate. Two unique correlated ambient noise reduction procedures are also proposed. One involves the application of Wiener filtering, and the other a combination of Wiener filtering with a Gnn subtraction technique. In addition, two unique reverberation noise reduction procedures are proposed. Both involve applying a weighting factor to the signals prior to computing the TDOA which combines the effects of a traditional maximum likelihood (TML) weighting function and a phase transformation (PHAT) weighting function.

Type: Application

Filed: July 14, 2005

Publication date: November 10, 2005

Applicant: Microsoft Corporation

Inventors: Yong Rui, Dinei Florencio

System and method for real-time jitter control and packet-loss concealment in an audio signal

Publication number: 20050058145

Abstract: An “adaptive audio playback controller” operates by decoding and reading received packets of an audio signal into a signal buffer. Samples of the decoded audio signal are then played out of the signal buffer according to the needs of a player device. Jitter control and packet loss concealment are accomplished by continuously analyzing buffer content in real-time, and determining whether to provide unmodified playback from the buffer contents, whether to compress buffer content, stretch buffer content, or whether to provide for packet loss concealment for overly delayed or lost packets as a function of buffer content. Further, the adaptive audio playback controller also determines where to stretch or compress particular frames or signal segments in the signal buffer, and how much to stretch or compress such segments in order to optimize perceived playback quality.

Type: Application

Filed: September 15, 2003

Publication date: March 17, 2005

Applicant: Microsoft Corporation

Inventors: Dinei Florencio, Philip Chou, Li-Wei He

System and method for providing high-quality stretching and compression of a digital audio signal

Publication number: 20050055204

Abstract: An adaptive “temporal audio scaler” is provided for automatically stretching and compressing frames of audio signals received across a packet-based network. Prior to stretching or compressing segments of a current frame, the temporal audio scaler first computes a pitch period for each frame for sizing signal templates used for matching operations in stretching and compressing segments. Further, the temporal audio scaler also determines the type or types of segments comprising each frame. These segment types include “voiced” segments, “unvoiced” segments, and “mixed” segments which include both voiced and unvoiced portions. The stretching or compression methods applied to segments of each frame are then dependent upon the type of segments comprising each frame. Further, the amount of stretching and compression applied to particular segments is automatically variable for minimizing signal artifacts while still ensuring that an overall target stretching or compression ratio is maintained for each frame.

Type: Application

Filed: September 10, 2003

Publication date: March 10, 2005

Applicant: Microsoft Corporation

Inventors: Dinei Florencio, Philip Chou, Li-Wei He

System and method for real-time detection and preservation of speech onset in a signal

Publication number: 20050055201

Abstract: A “speech onset detector” provides a variable length frame buffer in combination with either variable transmission rate or temporal speech compression for buffered signal frames. The variable length buffer buffers frames that are not clearly identified as either speech or non-speech frames during an initial analysis. Buffering of signal frames continues until a current frame is identified as either speech or non-speech. If the current frame is identified as non-speech, buffered frames are encoded as non-speech frames. However, if the current frame is identified as a speech frame, buffered frames are searched for the actual onset point of the speech. Once that onset point is identified, the signal is either transmitted in a burst, or a time-scale modification of the buffered signal is applied for compressing buffered frames beginning with the frame in which the onset point is detected. The compressed frames are then encoded as one or more speech frames.

Type: Application

Filed: September 10, 2003

Publication date: March 10, 2005

Applicant: Microsoft Corporation, Corporation in the State of Washington

Inventors: Dinei Florencio, Philip Chou

System and process for robust sound source localization

Publication number: 20040240680

Abstract: A system and process for finding the location of a sound source using direct approaches having weighting factors that mitigate the effect of both correlated and reverberation noise is presented. When more than two microphones are used, the traditional time-delay-of-arrival (TDOA) based sound source localization (SSL) approach involves two steps. The first step computes TDOA for each microphone pair, and the second step combines these estimates. This two-step process discards relevant information in the first step, thus degrading the SSL accuracy and robustness. In the present invention, direct, one-step, approaches are employed. Namely, a one-step TDOA SSL approach and a steered beam (SB) SSL approach are employed. Each of these approaches provides an accuracy and robustness not available with the traditional two-step approaches.

Type: Application

Filed: May 28, 2003

Publication date: December 2, 2004

Inventors: Yong Rui, Dinei A. Florencio

Spread spectrum modulation system and method for embedding digital information into digital or analog media

Publication number: 20040240524

Abstract: A system and method for embedding information into digital media and later detecting the embedded information using a unique spread spectrum modulation technique. In general, the present invention removes interference caused by an original signal from the detection process thereby eliminating a major source of detection error. The interference caused by the original signal is removed by using the encoder knowledge about the original signal and modulating the energy of the embedded mark to compensate for the original signal interference. The present invention also includes a novel redundant bit representation technique that causes a resulting average over a large sample to tend to zero, thereby reducing the vulnerability of the present invention to malicious collusion attacks.

Type: Application

Filed: June 10, 2004

Publication date: December 2, 2004

Applicant: Microsoft Corporation

Inventors: Henrique S. Malvar, Dinei A. Florencio
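
The interference-compensation idea in the abstract above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the patented method: the names `project`, `embed_bit`, and `detect_bit`, the parameters `alpha` (mark strength) and `lam` (fraction of host interference removed), and the one-bit-per-block layout are all hypothetical. The redundant bit representation for collusion resistance is not shown.

```python
def project(signal, chips):
    """Normalised correlation of a signal with the spreading sequence."""
    return sum(s * c for s, c in zip(signal, chips)) / sum(c * c for c in chips)


def embed_bit(host, chips, bit, alpha=1.0, lam=1.0):
    """Embed one bit into a block of host samples.

    The encoder knows the host signal, so it measures how strongly the host
    already correlates with the chips and adjusts the mark energy to cancel
    that interference (fully when lam == 1.0).
    """
    b = 1.0 if bit else -1.0
    interference = project(host, chips)
    gain = alpha * b - lam * interference
    return [h + gain * c for h, c in zip(host, chips)]


def detect_bit(marked, chips):
    """Recover the bit from the sign of the correlation with the chips."""
    return project(marked, chips) >= 0.0
```

With `lam=1.0` the detector's correlation equals `alpha` times the bit sign regardless of the host signal, which is the sense in which the host no longer contributes detection error in this sketch. Smaller values of `lam` trade residual interference for less distortion of the host.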

Noise reduction systems and methods for voice applications

Publication number: 20040213419

Abstract: Various embodiments reduce noise within a particular environment, while isolating and capturing speech in a manner that allows operation within an otherwise noisy environment. In one embodiment, an array of one or more microphones is used to selectively eliminate noise emanating from known, generally fixed locations, and pass signals from a pre-specified region or regions with reduced distortion.

Type: Application

Filed: April 25, 2003

Publication date: October 28, 2004

Applicant: MICROSOFT CORPORATION

Inventors: Ankur Varma, Dinei Florencio

System and process for time delay estimation in the presence of correlated noise and reverberation

Publication number: 20040190730

Abstract: A system and process for estimating the time delay of arrival (TDOA) between a pair of audio sensors of a microphone array is presented. Generally, a generalized cross-correlation (GCC) technique is employed. However, this technique is improved to include provisions for both reducing the influence (including interference) from correlated ambient noise and reverberation noise in the sensor signals prior to computing the TDOA estimate. Two unique correlated ambient noise reduction procedures are also proposed. One involves the application of Wiener filtering, and the other a combination of Wiener filtering with a Gnn subtraction technique. In addition, two unique reverberation noise reduction procedures are proposed. Both involve applying a weighting factor to the signals prior to computing the TDOA which combines the effects of a traditional maximum likelihood (TML) weighting function and a phase transformation (PHAT) weighting function.

Type: Application

Filed: March 31, 2003

Publication date: September 30, 2004

Inventors: Yong Rui, Dinei A. Florencio

Spread spectrum modulation system and method for embedding digital information into digital or analog media

Patent number: 6778587

Abstract: A system and method for embedding information into digital media and later detecting the embedded information using a unique spread spectrum modulation technique. In general, the present invention removes interference caused by an original signal from the detection process thereby eliminating a major source of detection error. The interference caused by the original signal is removed by using the encoder knowledge about the original signal and modulating the energy of the embedded mark to compensate for the original signal interference. The present invention also includes a novel redundant bit representation technique that causes a resulting average over a large sample to tend to zero, thereby reducing the vulnerability of the present invention to malicious collusion attacks.

Type: Grant

Filed: September 1, 2000

Date of Patent: August 17, 2004

Assignee: Microsoft Corporation

Inventors: Henrique S. Malvar, Dinei A. Florencio

Method and apparatus for converting the bitrate of an encoded bitstream without full re-encoding

Patent number: 6775325

Abstract: An apparatus and method for changing the bitrate of an encoded bitstream in a compressed domain, i.e., a domain where the input image is still represented by transform coefficients (e.g., DCT coefficients or wavelet coefficients), is disclosed. The bitrate of the encoded bitstream is changed by applying a different quantization scale directly to the transform coefficients of the encoded bitstream. The requantization error or distortion introduced by the requantization process is addressed by propagating the requantization error to the following frame.

Type: Grant

Filed: April 6, 1999

Date of Patent: August 10, 2004

Assignee: Sarnoff Corporation

Inventor: Dinei A. Florencio
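
The requantization-with-error-propagation idea in the abstract above can be sketched as follows. This is a simplified illustration, not the patented apparatus: it assumes uniform scalar quantizers, frames that each carry residual coefficients relative to the previous frame, and no motion compensation, so the error of a coefficient is simply added to the co-located coefficient of the next frame. The function name `requantize_frames` and its parameters are hypothetical.

```python
def requantize_frames(frames, old_step, new_step):
    """Requantize transform coefficients frame by frame in the compressed domain.

    frames   -- list of frames, each a list of integer quantization levels
    old_step -- quantizer step size used by the original encoder
    new_step -- coarser (or finer) step size for the output bitstream
    """
    carried = [0.0] * len(frames[0])  # requantization error from the previous frame
    out = []
    for levels in frames:
        new_levels = []
        next_carried = []
        for level, err in zip(levels, carried):
            # Dequantize with the old step and add the error carried forward.
            value = level * old_step + err
            q = round(value / new_step)
            new_levels.append(q)
            # Whatever the new quantizer could not represent goes to the next frame.
            next_carried.append(value - q * new_step)
        out.append(new_levels)
        carried = next_carried
    return out
```

Under these assumptions the accumulated reconstruction of each coefficient differs from the original by at most half of `new_step`, because the only uncorrected error is the one carried out of the final frame. Without propagation, the per-frame errors could add up over a long prediction chain.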

Packing variable-length code bits at fixed positions

Patent number: 6449394

Abstract: A conventional variable-length codebook is converted into a modified codebook in which each symbol is represented by the same number of bits, but at least one symbol can be represented by at least two different code words. Such symbols therefore may be said to have redundant bits, which may then be used to encode other symbols, thereby achieving a degree of data compression. By using fixed-length code words, the positions of code words in the resulting encoded data stream are known a priori. As a result, the encoded data can be decoded in parallel. The present invention provides both the data compression advantages of variable-length encoding schemes and the ability to perform parallel decoding processing of fixed-length encoding schemes. The present invention can be embodied in either lossless or lossy implementations.

Type: Grant

Filed: July 9, 1999

Date of Patent: September 10, 2002

Assignees: Sarnoff Corporation, Motorola Inc.

Inventor: Dinei A. Florencio

Method and apparatus for imbedding a watermark into a bitstream representation of a digital image sequence

Patent number: 6208745

Abstract: A method and apparatus that inserts watermark information directly into an encoded video bitstream. The method identifies specific blocks or macroblocks in an encoded video bitstream and inserts the watermark information directly into the bitstream such that these selected blocks are replaced with a block containing watermark information or augmented with watermark information.

Type: Grant

Filed: December 30, 1997

Date of Patent: March 27, 2001

Assignee: Sarnoff Corporation

Inventors: Dinei A. Florencio, Michael A. Isnardi
