Patents by Inventor Ivan Tashev
Ivan Tashev has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20110274291Abstract: A novel adaptive beamforming technique with enhanced noise suppression capability. The technique incorporates the sound-source presence probability into an adaptive blocking matrix. In one embodiment the sound-source presence probability is estimated based on the instantaneous direction of arrival of the input signals and voice activity detection. The technique guarantees robustness to steering vector errors without imposing ad hoc constraints on the adaptive filter coefficients. It can provide good suppression performance for both directional interference signals as well as isotropic ambient noise.Type: ApplicationFiled: July 21, 2011Publication date: November 10, 2011Applicant: Microsoft CorporationInventors: Ivan Tashev, Alejandro Acero, Byung-Jun Yoon
-
Publication number: 20110238414Abstract: A method for managing an interaction of a calling party to a communication partner is provided. The method includes automatically determining if the communication partner expects DTMF input. The method also includes translating speech input to one or more DTMF tones and communicating the one or more DTMF tones to the communication partner, if the communication partner expects DTMF input.Type: ApplicationFiled: March 29, 2010Publication date: September 29, 2011Applicant: MICROSOFT CORPORATIONInventors: Yun-Cheng Ju, Stefanie Tomko, Wei-Ting Frank Liu, Ivan Tashev
-
Patent number: 8005238Abstract: A novel adaptive beamforming technique with enhanced noise suppression capability. The technique incorporates the sound-source presence probability into an adaptive blocking matrix. In one embodiment the sound-source presence probability is estimated based on the instantaneous direction of arrival of the input signals and voice activity detection. The technique guarantees robustness to steering vector errors without imposing ad hoc constraints on the adaptive filter coefficients. It can provide good suppression performance for both directional interference signals as well as isotropic ambient noise.Type: GrantFiled: March 22, 2007Date of Patent: August 23, 2011Assignee: Microsoft CorporationInventors: Ivan Tashev, Alejandro Acero, Byung-Jun Yoon
-
Patent number: 8005237Abstract: A novel beamforming post-processor technique with enhanced noise suppression capability. The present beam forming post-processor technique is a non-linear post-processing technique for sensor arrays (e.g., microphone arrays) which improves the directivity and signal separation capabilities. The technique works in so-called instantaneous direction of arrival space, estimates the probability for sound coming from a given incident angle or look-up direction and applies a time-varying, gain based, spatio-temporal filter for suppressing sounds coming from directions other than the sound source direction resulting in minimal artifacts and musical noise.Type: GrantFiled: May 17, 2007Date of Patent: August 23, 2011Assignee: Microsoft Corp.Inventors: Ivan Tashev, Alejandro Acero
-
Publication number: 20110178798Abstract: A device for suppressing ambient sounds from speech received by a microphone array is provided. One embodiment of the device comprises a microphone array, a processor, an analog-to-digital converter, and memory comprising instructions stored therein that are executable by the processor.Type: ApplicationFiled: January 20, 2010Publication date: July 21, 2011Applicant: MICROSOFT CORPORATIONInventors: Jason Flaks, Ivan Tashev, Duncan McKay, Xudong Ni, Robert Heitkamp, Wei Guo, John Tardif, Leo Shing, Michael Baseflug
-
Patent number: 7852369Abstract: An omni-directional camera (a 360 degree camera) is proposed with an integrated microphone array. The primary application for such a camera is videoconferencing and meeting recording, and the device is designed to be placed on a meeting room table. The microphone array is in a planar configuration, and the microphones are located as close to the desktop as possible to eliminate sound reflections from the table. The camera is connected to the microphone array base with a thin cylindrical rod, which is acoustically invisible to the microphone array for the frequency range [50-4000] Hz. This provides a direct path from the person talking to all of the microphones in the array, and can therefore be used for sound source localization (determining the location of the talker) and beam-forming (improving the sound quality of the talker by filtering only sound from a particular direction). The camera array is elevated from the table to provide a near frontal viewpoint of the meeting participants.Type: GrantFiled: June 27, 2002Date of Patent: December 14, 2010Assignee: Microsoft Corp.Inventors: Ross Cutler, Ivan Tashev, Yong Rui, Anoop Gupta
-
Patent number: 7844059Abstract: A system and process for dereverberation of multi-channel audio streams is presented which uses reverberation suppression techniques. In general, the present system and process builds a frequency dependent model of the reverberation decay and uses spectral subtraction-based reverberation reduction to achieve the aforementioned suppression. This dereverberation system and process can be used to improve automatic speech recognition (ASR) results with minimal CPU overhead.Type: GrantFiled: June 24, 2005Date of Patent: November 30, 2010Assignee: Microsoft CorporationInventors: Ivan Tashev, Daniel Allred
-
Patent number: 7752040Abstract: An “Interference Canceller” provides a computationally efficient real-time technique for removing stationary-tone interference from signals. Typical sources of stationary tone contamination of signals include noise from power wiring (i.e., 50/60 Hz or 400 Hz and their harmonics), frame or line frequencies from electronic devices, and noise from computer fans, hard disk drives, etc. In general, the Interference Canceller adaptively builds and updates a model of stationary tone interference in consecutive frames of an input signal. This adaptively updated model is then used to extrapolate and subtract noise from subsequent frames of the input signal to generate a “clean” output signal. This output signal exhibits significant attenuation of stationary tone interference without eliminating important portions of the underlying signal or distorting the underlying signal with artifacts such as musical noise or nonlinear distortions.Type: GrantFiled: March 28, 2007Date of Patent: July 6, 2010Assignee: Microsoft CorporationInventors: Henrique S. Malvar, Ivan Tashev
-
Patent number: 7634533Abstract: Systems and methods are disclosed that facilitate real-time information exchange in a multimedia conferencing environment. Data Client(s) facilitate data collaboration between users and are maintained separately from audio/video (AV) Clients that provide real-time communication functionality. Data Clients can be remotely located with respect to one another and with respect to a server. A remote user Stand-in Device can be provided that comprises a display to present a remote user to local users, a digital automatic pan/tilt/zoom camera to capture imagery in, for example, a conference room and provide real-time information to an AV Client in a remote office, and a microphone array that can similarly provide real-time audio information from the conference room to an AV Client in the remote office. The invention further facilitates file transfer and presentation broadcast between Data Clients in a single location or in a plurality of disparate locations.Type: GrantFiled: April 30, 2004Date of Patent: December 15, 2009Assignee: Microsoft CorporationInventors: Eric Rudolph, Yong Rui, Henrique S Malvar, Li-Wei He, Michael F Cohen, Ivan Tashev
-
Patent number: 7626889Abstract: A “Sensor Array Post-Filter” provides an adaptive post-filter that accurately models and suppresses both diffuse and directional noise sources as well as interfering speech sources. The post-filter is applied to an output signal produced by a beamformer used to process signals produced by a sensor array. As a result, the Sensor Array Post-Filter operates to improve the signal-to-noise ratio (SNR) of beamformer output signals by providing adaptive post-filtering of the output signals. The post-filter is generated based on a generative statistical model for modeling signal and noise sources at distinct regions in a signal field that considers prior distributions trained to model an instantaneous direction of arrival for signals captured by sensors in the array.Type: GrantFiled: April 6, 2007Date of Patent: December 1, 2009Assignee: Microsoft CorporationInventors: Michael L. Seltzer, Ivan Tashev
-
Patent number: 7515721Abstract: A self-descriptive microphone array includes a microphone array memory, such as, for example a ROM, EEPROM, or other conventional memory, which contains a microphone array device description. This device description includes parametric information which defines operational characteristics and configuration of the microphone array. In further embodiments, the microphone array uses any of a variety of conventional wired or wireless computer interfaces, including serial, IEEE 1394, USB, Bluetooth™, etc., to connect to a computing device. Once connected, the microphone array provides its device description to the computing device. Sound processing software residing within the computing device is then automatically configured for optimally interacting with one or more analog or digital audio signals provided by the microphone array. In another embodiment, the microphone array performs integrated self calibration for automatically updating the device description.Type: GrantFiled: February 9, 2004Date of Patent: April 7, 2009Assignee: Microsoft CorporationInventors: Ivan Tashev, Henrique S. Malvar, Ross Cutler, Daniel J. Sisolak, John E. Unverferth, III
-
Publication number: 20090043497Abstract: The presentation of location information to a user that is distracted by traveling can result in the user quickly forgetting, or never even comprehending, key parts of the location information, such as the street number. Identification can be made of intersections and points of interest near the user's destination, which can then be provided instead of, or in addition to, the address, thereby increasing user comprehension and retention, especially when distracted. Map data can be parsed into addresses, intersections and points of interest databases. These databases can be accessed to identify proximate intersections and points of interest, which can then be filtered and subsequently ranked to identify one intersection, one point of interest, or both, that can be presented to the user to aid the user in comprehending and retaining the location information even when distracted.Type: ApplicationFiled: August 10, 2007Publication date: February 12, 2009Applicant: Microsoft CorporationInventors: Ivan Tashev, Michael Lewis Seltzer, Yun-Cheng Ju, Alex Acero
-
Patent number: 7487056Abstract: Precision and reliability of localization estimates derived from conventional localization systems are improved through a system and method for post-processing of initial localization data, even in environments which may include noise, reflections, or other interference. Such localization systems include conventional sound source localization (SSL) systems based on microphone array inputs, radio source location systems based on directional antenna array inputs, etc. In general, this post-processing system and method applies statistical real-time clustering to initial localization estimates, and then uses this real-time clustering in a multi-stage process to generate new localization estimates having improved precision and reliability relative to the initial localization estimates.Type: GrantFiled: August 22, 2005Date of Patent: February 3, 2009Assignee: Microsoft CorporationInventor: Ivan Tashev
-
Publication number: 20080288219Abstract: A novel beamforming post-processor technique with enhanced noise suppression capability. The present beam forming post-processor technique is a non-linear post-processing technique for sensor arrays (e.g., microphone arrays) which improves the directivity and signal separation capabilities. The technique works in so-called instantaneous direction of arrival space, estimates the probability for sound coming from a given incident angle or look-up direction and applies a time-varying, gain based, spatio-temporal filter for suppressing sounds coming from directions other than the sound source direction resulting in minimal artifacts and musical noise.Type: ApplicationFiled: May 17, 2007Publication date: November 20, 2008Applicant: Microsoft CorporationInventors: Ivan Tashev, Alejandro Acero
-
Publication number: 20080247274Abstract: A “Sensor Array Post-Filter” provides an adaptive post-filter that accurately models and suppresses both diffuse and directional noise sources as well as interfering speech sources. The post-filter is applied to an output signal produced by a beamformer used to process signals produced by a sensor array. As a result, the Sensor Array Post-Filter operates to improve the signal-to-noise ratio (SNR) of beamformer output signals by providing adaptive post-filtering of the output signals. The post-filter is generated based on a generative statistical model for modeling signal and noise sources at distinct regions in a signal field that considers prior distributions trained to model an instantaneous direction of arrival for signals captured by sensors in the array.Type: ApplicationFiled: April 6, 2007Publication date: October 9, 2008Applicant: MICROSOFT CORPORATIONInventors: Michael L. Seltzer, Ivan Tashev
-
Publication number: 20080243497Abstract: An “Interference Canceller” provides a computationally efficient real-time technique for removing stationary-tone interference from signals. Typical sources of stationary tone contamination of signals include noise from power wiring (i.e., 50/60 Hz or 400 Hz and their harmonics), frame or line frequencies from electronic devices, and noise from computer fans, hard disk drives, etc. In general, the Interference Canceller adaptively builds and updates a model of stationary tone interference in consecutive frames of an input signal. This adaptively updated model is then used to extrapolate and subtract noise from subsequent frames of the input signal to generate a “clean” output signal. This output signal exhibits significant attenuation of stationary tone interference without eliminating important portions of the underlying signal or distorting the underlying signal with artifacts such as musical noise or nonlinear distortions.Type: ApplicationFiled: March 28, 2007Publication date: October 2, 2008Applicant: MICROSOFT CORPORATIONInventors: Ivan Tashev, Henrique S. Malvar
-
Publication number: 20080232607Abstract: A novel adaptive beamforming technique with enhanced noise suppression capability. The technique incorporates the sound-source presence probability into an adaptive blocking matrix. In one embodiment the sound-source presence probability is estimated based on the instantaneous direction of arrival of the input signals and voice activity detection. The technique guarantees robustness to steering vector errors without imposing ad hoc constraints on the adaptive filter coefficients. It can provide good suppression performance for both directional interference signals as well as isotropic ambient noise.Type: ApplicationFiled: March 22, 2007Publication date: September 25, 2008Applicant: Microsoft CorporationInventors: Ivan Tashev, Alejandro Acero, Byung-Jun Yoon
-
Patent number: 7428309Abstract: An analog preamplifier measurement system for a microphone array builds on conventional microphone arrays by providing an integral “self-calibration system.” This self-calibration system automatically injects an excitation pulse of a known magnitude and phase to all preamplifier inputs within the microphone array. The resulting analog waveform from each preamplifier output is then measured. A frequency analysis, such as, for example, a Fourier or Fast Fourier Transform (FFT), or other conventional frequency analysis, of each of the resulting waveforms is then performed. The results of this frequency analysis are then used to automatically compute frequency-domain compensation gains (e.g., magnitude and phase gains) for each preamplifier for matching or balancing the responses of all of the preamplifiers with each other.Type: GrantFiled: February 4, 2004Date of Patent: September 23, 2008Assignee: Microsoft CorporationInventors: Henrique Malvar, Ivan Tashev
-
Patent number: 7415117Abstract: The ability to combine multiple audio signals captured from the microphones in a microphone array is frequently used in beamforming systems. Typically, beamforming involves processing the output audio signals of the microphone array in such a way as to make the microphone array act as a highly directional microphone. In other words, beamforming provides a “listening beam” which points to a particular sound source while often filtering out other sounds. A “generic beamformer,” as described herein automatically designs a set of beams (i.e., beamforming) that cover a desired angular space range within a prescribed search area. Beam design is a function of microphone geometry and operational characteristics, and also of noise models of the environment around the microphone array. One advantage of the generic beamformer is that it is applicable to any microphone array geometry and microphone type.Type: GrantFiled: March 2, 2004Date of Patent: August 19, 2008Assignee: Microsoft CorporationInventors: Ivan Tashev, Henrique Malvar
-
Patent number: 7394907Abstract: A system and process for sound source localization (SSL) utilizing beamsteering is presented. The present invention provides for improved beamsteering with less drain on system resources while providing accurate, real time results. To accomplish this, the present SSL system and process rejects as much as possible extraneous audio frames and analyzes only those frames exhibiting a well defined sound source. In addition, the number of beams is reduced as much as possible to save on processing time, but a full scan of the working volume is still made with the beams. And finally, interpolation is used to increase the precision of the technique.Type: GrantFiled: June 16, 2003Date of Patent: July 1, 2008Assignee: Microsoft CorporationInventor: Ivan Tashev