Patents by Inventor Jimeng Zheng
Jimeng Zheng has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11967316Abstract: Embodiments of this application disclose method and apparatus for positioning a target audio signal by an audio interaction device, and an audio interaction device The method includes: obtaining audio signals in a plurality of directions in a space, and performing echo cancellation on the audio signal, the audio signal including a target-audio direct signal; obtaining weights of a plurality of time-frequency points in the audio signals, a weight of each time-frequency point indicating, at the time-frequency point, a relative proportion of the target-audio direct signal in the audio signals; weighting time-frequency components of the audio signal at the plurality of time-frequency points separately for each of the plurality of directions by using the weights of the plurality of time-frequency points, to obtain a weighted audio signal energy distribution; and obtaining a sound source azimuth corresponding to the target-audio direct signal in the audio signals accordingly.Type: GrantFiled: February 23, 2021Date of Patent: April 23, 2024Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITEDInventors: Jimeng Zheng, Ian Ernan Liu, Yi Gao, Weiwei Li
-
System and method for generating spatial audio with uniform reverberation in real-time communication
Patent number: 11950088Abstract: A computer-implemented method for generating spatial audio with uniform reverberation in a real-time communication session is performed by a real-time communication software application running on an electronic communication device. The method includes removing the reverberation of recorded speech signals from far-end participants by the dereverberation approach, rendering the direct sound parts by filtering the output signals by head-related transfer functions of desired directions, generating reverberant sound parts by convolving the output signals from with uniform room impulse responses or an artificial reverberator, combining direct and reverberant sound components to generate spatialized speech signals. When speakers and listeners are located in two virtual conference rooms, the reverberation of the two rooms are coupled. The reverberant sound parts are then generated by convolving the output signals and coupled RIRs from the two rooms.Type: GrantFiled: July 7, 2022Date of Patent: April 2, 2024Assignee: Agora Lab, Inc.Inventors: Song Li, Jianyuan Feng, Bo Wu, Jimeng Zheng -
Patent number: 11908456Abstract: Embodiments of this application discloses an azimuth estimation method performed at a computing device, the method including: obtaining, in real time, multi-channel sampling signals and buffering the multi-channel sampling signals; performing wakeup word detection on one or more sampling signals of the multi-channel sampling signals, and determining a wakeup word detection score for each channel of the one or more sampling signals; performing a spatial spectrum estimation on the buffered multi-channel sampling signals to obtain a spatial spectrum estimation result, when the wakeup word detection scores of the one or more sampling signals indicates that a wakeup word exists in the one or more sampling signals; and determining an azimuth of a target voice associated with the multi-channel sampling signals according to the spatial spectrum estimation result and a highest wakeup word detection score, thereby improving the accuracy of the azimuth estimation in a voice interaction process.Type: GrantFiled: August 28, 2020Date of Patent: February 20, 2024Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITEDInventors: Jimeng Zheng, Yi Gao, Meng Yu, Ian Ernan Liu
-
SYSTEM AND METHOD FOR GENERATING SPATIAL AUDIO WITH UNIFORM REVERBERATION IN REAL-TIME COMMUNICATION
Publication number: 20240015466Abstract: A computer-implemented method for generating spatial audio with uniform reverberation in a real-time communication session is performed by a real-time communication software application running on an electronic communication device. The method includes removing the reverberation of recorded speech signals from far-end participants by the dereverberation approach, rendering the direct sound parts by filtering the output signals by head-related transfer functions of desired directions, generating reverberant sound parts by convolving the output signals from with uniform room impulse responses or an artificial reverberator, combining direct and reverberant sound components to generate spatialized speech signals. When speakers and listeners are located in two virtual conference rooms, the reverberation of the two rooms are coupled. The reverberant sound parts are then generated by convolving the output signals and coupled RIRs from the two rooms.Type: ApplicationFiled: July 7, 2022Publication date: January 11, 2024Inventors: Song Li, Jianyuan Feng, Bo Wu, Jimeng Zheng -
Patent number: 11856376Abstract: This application discloses a sound acquisition component array, including: two first sound acquisition components, two second sound acquisition components, and two third sound acquisition components. The two second sound acquisition components are located at a first side of a line connecting the two first sound acquisition components, and the two third sound acquisition components are located at a second side of the connecting line that is opposite to the first side of the connecting line; the two second sound acquisition components are symmetrical about a perpendicular bisector of the connecting line, and the two third sound acquisition components are symmetrical about the perpendicular bisector; and a distance between the two first sound acquisition components, a distance between the two second sound acquisition components, and a distance between the two third sound acquisition components are respectively different from one another along a direction defined by the connecting line.Type: GrantFiled: May 12, 2021Date of Patent: December 26, 2023Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITEDInventors: Jimeng Zheng, Yi Gao, Xuan Ji, Weiwei Li, Meng Yu, Kai Xia, Jun Feng, Zhu Chen, Hongyang Chen, Wenbin Yang, Yu Wang, Yong Liu
-
Publication number: 20230386492Abstract: A computer-implemented method for suppressing noise from audio signal uses both statistical noise estimation and neural network noise estimation to achieve more desirable noise reduction. The method is performed by a noise suppression computer software application running on an electronic device. The noise suppression computer software application first transforms the speech signal in time domain into frequency domain before determining a statistical noise estimate and a neural network noise estimate. The noise suppression computer software application merges the two noise estimates to derive a final noise estimate, and determines and refines a noise suppression filter. The filter is applied to the speech signal in frequency domain to obtain an enhanced signal. The enhanced signal is transformed back into time domain.Type: ApplicationFiled: May 24, 2022Publication date: November 30, 2023Inventors: Jimeng Zheng, Bo Wu, Xiaohan Zhao, Liangliang Wang, Ruofei Chen
-
Publication number: 20230013740Abstract: This application discloses a multi-sound area-based speech detection method and related apparatus, and a storage medium, which is applied to the field of artificial intelligence. The method includes: obtaining sound area information corresponding to each sound area in N sound areas; using the sound area as a target detection sound area, and generating a control signal corresponding to the target detection sound area according to sound area information corresponding to the target detection sound area; processing a speech input signal corresponding to the target detection sound area by using the control signal corresponding to the target detection sound area, to obtain a speech output signal corresponding to the target detection sound area; and generating a speech detection result of the target detection sound area according to the speech output signal corresponding to the target detection sound area.Type: ApplicationFiled: September 13, 2022Publication date: January 19, 2023Inventors: Jimeng ZHENG, Lianwu CHEN, Weiwei Li, Zhiyi Duan, Meng YU, Dan Su, Kaiyu Jiang
-
Publication number: 20210266664Abstract: This application discloses a sound acquisition component array, including: two first sound acquisition components, two second sound acquisition components, and two third sound acquisition components. The two second sound acquisition components are located at a first side of a line connecting the two first sound acquisition components, and the two third sound acquisition components are located at a second side of the connecting line that is opposite to the first side of the connecting line; the two second sound acquisition components are symmetrical about a perpendicular bisector of the connecting line, and the two third sound acquisition components are symmetrical about the perpendicular bisector; and a distance between the two first sound acquisition components, a distance between the two second sound acquisition components, and a distance between the two third sound acquisition components are respectively different from one another along a direction defined by the connecting line.Type: ApplicationFiled: May 12, 2021Publication date: August 26, 2021Inventors: Jimeng Zheng, Yi Gao, Xuan Ji, Weiwei Li, Meng Yu, Kai Xia, Jun Feng, Zhu Chen, Hongyang Chen, Wenbin Yang, Yu Wang, Yong Liu
-
Publication number: 20210174792Abstract: Embodiments of this application disclose method and apparatus for positioning a target audio signal by an audio interaction device, and an audio interaction device The method includes: obtaining audio signals in a plurality of directions in a space, and performing echo cancellation on the audio signal, the audio signal including a target-audio direct signal; obtaining weights of a plurality of time-frequency points in the audio signals, a weight of each time-frequency point indicating, at the time-frequency point, a relative proportion of the target-audio direct signal in the audio signals; weighting time-frequency components of the audio signal at the plurality of time-frequency points separately for each of the plurality of directions by using the weights of the plurality of time-frequency points, to obtain a weighted audio signal energy distribution; and obtaining a sound source azimuth corresponding to the target-audio direct signal in the audio signals accordingly.Type: ApplicationFiled: February 23, 2021Publication date: June 10, 2021Inventors: Jimeng ZHENG, Ian Ernan LIU, Yi GAO, Weiwei LI
-
Publication number: 20200395005Abstract: Embodiments of this application discloses an azimuth estimation method performed at a computing device, the method including: obtaining, in real time, multi-channel sampling signals and buffering the multi-channel sampling signals; performing wakeup word detection on one or more sampling signals of the multi-channel sampling signals, and determining a wakeup word detection score for each channel of the one or more sampling signals; performing a spatial spectrum estimation on the buffered multi-channel sampling signals to obtain a spatial spectrum estimation result, when the wakeup word detection scores of the one or more sampling signals indicates that a wakeup word exists in the one or more sampling signals; and determining an azimuth of a target voice associated with the multi-channel sampling signals according to the spatial spectrum estimation result and a highest wakeup word detection score, thereby improving the accuracy of the azimuth estimation in a voice interaction process.Type: ApplicationFiled: August 28, 2020Publication date: December 17, 2020Inventors: Jimeng Zheng, Yi Gao, Meng Yu, Ian Eman Liu
-
Patent number: 10856080Abstract: Provided are, among other things, systems, methods and techniques for reducing echo in an audio signal. One representative embodiment involves obtaining an input signal, an estimate of a system-characterizing function, and a reference signal, each at a corresponding sample rate and each divided into a plurality of sub-bands; separately processing such sub-bands, where for a given sub-band the estimate of the system-characterizing function and the reference signal are processed to generate an echo-estimation signal and then the echo-estimation signal is subtracted from the input signal to provide an echo-corrected signal for such given sub-band; and combining the echo-corrected signal from each of different ones of the plurality of the sub-bands to provide a final output signal, with the echo-estimation signal generated using a processing sample rate that is lower than the sample rate for the input signal.Type: GrantFiled: October 16, 2018Date of Patent: December 1, 2020Assignee: Guoguang Electric Company LimitedInventors: Jimeng Zheng, Yuli You
-
Patent number: 10325583Abstract: Provided are, among other things, systems, methods and techniques for audio-signal processing. One representative embodiment includes HT sub-band analysis/decomposition modules, e.g., one for each audio channel and one for an echo reference signal. Each HT sub-band analysis/decomposition module includes a Hilbert Transformation module and an analysis/decomposition filter bank and provides sub-band outputs. Echo-cancellation modules, e.g., one for each audio channel, perform echo-cancellation processing on such sub-bands. Beamforming modules, e.g., one for each sub-band, then perform beamforming, e.g., across all audio channels. Finally, a resynthesis stage combines the different sub-band outputs in order to provide a system output signal.Type: GrantFiled: October 4, 2017Date of Patent: June 18, 2019Assignee: Guoguang Electric Company LimitedInventors: Jimeng Zheng, Yuli You
-
Publication number: 20190103088Abstract: Provided are, among other things, systems, methods and techniques for audio-signal processing. One representative embodiment includes HT sub-band analysis/decomposition modules, e.g., one for each audio channel and one for an echo reference signal. Each HT sub-band analysis/decomposition module includes a Hilbert Transformation module and an analysis/decomposition filter bank and provides sub-band outputs. Echo-cancellation modules, e.g., one for each audio channel, perform echo-cancellation processing on such sub-bands. Beamforming modules, e.g., one for each sub-band, then perform beamforming, e.g., across all audio channels. Finally, a resynthesis stage combines the different sub-band outputs in order to provide a system output signal.Type: ApplicationFiled: October 4, 2017Publication date: April 4, 2019Inventors: Jimeng Zheng, Yuli You
-
Publication number: 20190082259Abstract: Provided are, among other things, systems, methods and techniques for reducing echo in an audio signal. One representative embodiment involves obtaining an input signal, an estimate of a system-characterizing function, and a reference signal, each at a corresponding sample rate and each divided into a plurality of sub-bands; separately processing such sub-bands, where for a given sub-band the estimate of the system-characterizing function and the reference signal are processed to generate an echo-estimation signal and then the echo-estimation signal is subtracted from the input signal to provide an echo-corrected signal for such given sub-band; and combining the echo-corrected signal from each of different ones of the plurality of the sub-bands to provide a final output signal, with the echo-estimation signal generated using a processing sample rate that is lower than the sample rate for the input signal.Type: ApplicationFiled: October 16, 2018Publication date: March 14, 2019Inventors: Jimeng Zheng, Yuli You
-
Patent number: 10154343Abstract: Provided are, among other things, systems, methods and techniques for reducing echo in an audio signal. One representative embodiment involves obtaining an input signal, an estimate of a system-characterizing function, and a reference signal, each at a corresponding sample rate and each divided into a plurality of sub-bands; separately processing such sub-bands, where for a given sub-band the estimate of the system-characterizing function and the reference signal are processed to generate an echo-estimation signal and then the echo-estimation signal is subtracted from the input signal to provide an echo-corrected signal for such given sub-band; and combining the echo-corrected signal from each of different ones of the plurality of the sub-bands to provide a final output signal, with the echo-estimation signal generated using a processing sample rate that is lower than the sample rate for the input signal.Type: GrantFiled: September 14, 2017Date of Patent: December 11, 2018Assignee: Guoguang Electric Company LimitedInventors: Jimeng Zheng, Yuli You
-
Patent number: 10073607Abstract: A method of processing audio may include receiving, by a computing device, a plurality of real-time audio signals outputted by a plurality of microphones communicatively coupled to the computing device. The computing device may output to a display a graphical user interface (GUI) that presents audio information associated with the received audio signals. The one or more received audio signals may be processed based on a user input associated with the audio information presented via the GUI to generate one or more processed audio signals. The one or more processed audio signals may be output to, for example, one or more output devices such as speakers, headsets, and the like.Type: GrantFiled: July 1, 2015Date of Patent: September 11, 2018Assignee: QUALCOMM IncorporatedInventors: Lae-Hoon Kim, Erik Visser, Raghuveer Peri, Phuong Lam Ton, Jeremy Patrick Toman, Troy Schultz, Jimeng Zheng
-
Patent number: 10051364Abstract: A method of processing audio may include receiving, by a computing device, a plurality of real-time audio signals outputted by a plurality of microphones communicatively coupled to the computing device. The computing device may output to a display a graphical user interface (GUI) that presents audio information associated with the received audio signals. The one or more received audio signals may be processed based on a user input associated with the audio information presented via the GUI to generate one or more processed audio signals. The one or more processed audio signals may be output to, for example, one or more output devices such as speakers, headsets, and the like.Type: GrantFiled: July 1, 2015Date of Patent: August 14, 2018Assignee: QUALCOMM IncorporatedInventors: Lae-Hoon Kim, Erik Visser, Raghuveer Peri, Phuong Lam Ton, Jeremy Patrick Toman, Troy Schultz, Jimeng Zheng
-
Patent number: 9258661Abstract: A method includes receiving, at a processor, a first data frame at a first time from a first microphone. The method also includes receiving a second data frame at the first time from a second microphone. The method further includes calculating a power ratio of the first microphone and the second microphone based on the first data frame and the second data frame in response to determining that the first data frame and the second data frame are noise data frames.Type: GrantFiled: December 23, 2013Date of Patent: February 9, 2016Assignee: Qualcomm IncorporatedInventors: Jimeng Zheng, Ian Ernan Liu, Dinesh Ramakrishnan, Deepak Kumar Challa
-
Publication number: 20160004405Abstract: A method of processing audio may include receiving, by a computing device, a plurality of real-time audio signals outputted by a plurality of microphones communicatively coupled to the computing device. The computing device may output to a display a graphical user interface (GUI) that presents audio information associated with the received audio signals. The one or more received audio signals may be processed based on a user input associated with the audio information presented via the GUI to generate one or more processed audio signals. The one or more processed audio signals may be output to, for example, one or more output devices such as speakers, headsets, and the like.Type: ApplicationFiled: July 1, 2015Publication date: January 7, 2016Inventors: Lae-Hoon Kim, Erik Visser, Raghuveer Peri, Phuong Lam Ton, Jeremy Patrick Toman, Troy Schultz, Jimeng Zheng
-
Publication number: 20160004499Abstract: A method of processing audio may include receiving, by a computing device, a plurality of real-time audio signals outputted by a plurality of microphones communicatively coupled to the computing device. The computing device may output to a display a graphical user interface (GUI) that presents audio information associated with the received audio signals. The one or more received audio signals may be processed based on a user input associated with the audio information presented via the GUI to generate one or more processed audio signals. The one or more processed audio signals may be output to, for example, one or more output devices such as speakers, headsets, and the like.Type: ApplicationFiled: July 1, 2015Publication date: January 7, 2016Inventors: Lae-Hoon Kim, Erik Visser, Raghuveer Peri, Phuong Lam Ton, Jeremy Patrick Toman, Troy Schultz, Jimeng Zheng