Patents by Inventor Xingtao Zhang

Xingtao Zhang has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

INTER-CHANNEL PHASE DIFFERENCE PARAMETER ENCODING METHOD AND APPARATUS

Publication number: 20200126571

Abstract: This application discloses an IPD parameter encoding method, including: obtaining a reference parameter used to determine an IPD parameter encoding scheme of a current frame of a multi-channel signal; determining the IPD parameter encoding scheme of the current frame based on the reference parameter, where the determined IPD parameter encoding scheme of the current frame is one of at least two preset IPD parameter encoding schemes; and processing an IPD parameter of the current frame based on the determined IPD parameter encoding scheme of the current frame. The technical solutions provided in this application can improve encoding quality of the multi-channel signal.

Type: Application

Filed: December 20, 2019

Publication date: April 23, 2020

Inventors: Xingtao ZHANG, Haiting LI, Zexin LIU, Lei MIAO
METHOD AND DEVICE FOR SOUND SOURCE LOCALIZATION

Publication number: 20190342688

Abstract: A method and an apparatus for locating a sound source are provided. The method includes: obtaining M channels of audio signals of a preset format by microphone arrays located in different planes (S100); preprocessing the M channels of audio signals of the preset format, and projecting them onto the same plane, so as to obtain N channels of audio signals, where M?N (S200); performing a time-frequency transformation on each of the N channels of audio signals, so as to obtain frequency domain signals of the N channels of audio signals (S300); further calculating a covariance matrix of the frequency domain signals and performing a smoothing process (S400); performing an eigenvalue decomposition of the smoothed covariance matrix (S500); estimating the sound source direction according to an eigenvector corresponding to the maximum eigenvalue, so as to obtain a sound source orientation parameter (S600).

Type: Application

Filed: July 18, 2019

Publication date: November 7, 2019

Applicants: NANJING TWIRLING TECHNOLOGY CO., LTD., BEIJING TWIRLING IN TIME CO., LTD.

Inventors: Xuejing SUN, Xingtao ZHANG, Chen ZHANG
Method and apparatus for determining inter-channel time difference parameter

Patent number: 10388288

Abstract: A method and an apparatus for determining an inter-channel time difference parameter are provided, so that precision of a determined ITD parameter can adapt to channel quality. The method includes: determining a target search complexity from plurality of search complexities, where the plurality of search complexities are in a one-to-one correspondence with plurality of channel quality values; and performing search processing on a signal on a first sound channel and a signal on a second sound channel according to the target search complexity so as to determine a first inter-channel time difference ITD parameter corresponding to the first sound channel and the second sound channel.

Type: Grant

Filed: September 6, 2017

Date of Patent: August 20, 2019

Assignee: Huawei Technologies Co., Ltd.

Inventors: Xingtao Zhang, Lei Miao
Frame loss compensation processing method and apparatus

Patent number: 10354659

Abstract: A frame loss compensation processing method and apparatus is presented, where the method includes, when a ith frame is a lost frame, estimating a spectrum frequency parameter, a pitch period, and a gain of the ith frame according to at least one of an inter-frame relationship between first N frames of the ith frame or an intra-frame relationship between first N frames of the ith frame. A parameter of the ith frame is determined using the signal correlation between the first N frames, the signal energy stability between the first N frames, intra-frame signal correlation of each frame, and intra-frame signal energy stability of each frame.

Type: Grant

Filed: March 29, 2017

Date of Patent: July 16, 2019

Assignee: HUAWEI TECHNOLOGIES CO., LTD.

Inventors: Zexin Liu, Xingtao Zhang, Bin Wang, Lei Miao
Speech/Audio Bitstream Decoding Method and Apparatus

Publication number: 20190214025

Abstract: A speech/audio bitstream decoding method includes acquiring a speech/audio decoding parameter of a current speech/audio frame, where the foregoing current speech/audio frame is a redundant decoded frame or a speech/audio frame previous to the foregoing current speech/audio frame is a redundant decoded frame, performing post processing on the acquired speech/audio decoding parameter according to speech/audio parameters of X speech/audio frames, where the foregoing X speech/audio frames include M speech/audio frames previous to the foregoing current speech/audio frame and/or N speech/audio frames next to the foregoing current speech/audio frame, and recovering a speech/audio signal using the post-processed speech/audio decoding parameter of the foregoing current speech/audio frame. The technical solutions of the speech/audio bitstream decoding method help improve quality of an output speech/audio signal.

Type: Application

Filed: March 19, 2019

Publication date: July 11, 2019

Inventors: Xingtao Zhang, Zexin Liu, Lei Miao
Method for Encoding Multi-Channel Signal and Encoder

Publication number: 20190189134

Abstract: A method for encoding a multi-channel signal and an encoder, where the encoding method includes obtaining a multi-channel signal of a current frame, determining an initial inter-channel time difference (ITD) value of the current frame, controlling, based on characteristic information of the multi-channel signal, a quantity of target frames that are allowed to appear continuously, where the characteristic information includes at least one of a signal-to-noise ratio of the multi-channel signal or a peak feature of cross correlation coefficients of the multi-channel signal, and an ITD value of a previous frame of the target frame is reused as an ITD value of the target frame, determining an ITD value of the current frame based on the initial ITD value and the quantity of target frames allowed to appear continuously, and encoding the multi-channel signal based on the ITD value of the current frame.

Type: Application

Filed: February 11, 2019

Publication date: June 20, 2019

Inventors: Haiting Li, Zexin Liu, Xingtao Zhang, Lei Miao
Multi-Channel Signal Encoding Method and Encoder

Publication number: 20190172474

Abstract: A multi-channel signal encoding method and an encoder, where the encoding method includes obtaining a multi-channel signal of a current frame, determining an initial multi-channel parameter of the current frame, determining a difference parameter based on the initial multi-channel parameter of the current frame and multi-channel parameters of previous K frames of the current frame, where the difference parameter represents a difference between the initial multi-channel parameter of the current frame and the multi-channel parameters of the previous K frames, and K is an integer greater than or equal to one, determining a multi-channel parameter of the current frame based on the difference parameter and a characteristic parameter of the current frame, and encoding the multi-channel signal based on the multi-channel parameter of the current frame. Hence, the method and the encoder ensure better accuracy of inter-channel information of a multi-channel signal.

Type: Application

Filed: February 11, 2019

Publication date: June 6, 2019

Inventors: Zexin Liu, Xingtao Zhang, Haiting Li, Lei Miao
Speech/audio bitstream decoding method and apparatus

Patent number: 10269357

Abstract: The present invention disclose a speech/audio bitstream decoding method including: acquiring a speech/audio decoding parameter of a current speech/audio frame, where the foregoing current speech/audio frame is a redundant decoded frame or a speech/audio frame previous to the foregoing current speech/audio frame is a redundant decoded frame; performing post processing on the acquired speech/audio decoding parameter according to speech/audio parameters of X speech/audio frames, where the foregoing X speech/audio frames include M speech/audio frames previous to the foregoing current speech/audio frame and/or N speech/audio frames next to the foregoing current speech/audio frame; and recovering a speech/audio signal by using the post-processed speech/audio decoding parameter of the foregoing current speech/audio frame. The technical solutions of the present invention help improve quality of an output speech/audio signal.

Type: Grant

Filed: September 2, 2016

Date of Patent: April 23, 2019

Assignee: HUAWEI TECHNOLOGIES CO., LTD.

Inventors: Xingtao Zhang, Zexin Liu, Lei Miao
Inter-Channel Phase Difference Parameter Extraction Method and Apparatus

Publication number: 20190096411

Abstract: An inter-channel phase difference (IPD) parameter extraction method and apparatus, where the extraction method includes obtaining a parameter obtaining an information extraction manner for a current frame of a multi-channel signal, obtaining an IPD parameter extraction manner for the current frame based on the parameter obtaining the information extraction manner, where the obtained IPD parameter extraction manner is one of at least two preset IPD parameter extraction manners, and obtaining an IPD parameter of the current frame based on the obtained IPD parameter extraction manner for the current frame.

Type: Application

Filed: November 27, 2018

Publication date: March 28, 2019

Inventors: Xingtao Zhang, Haiting Li, Zexin Liu, Lei Miao
Method and apparatus for determining inter-channel time difference parameter

Patent number: 10210873

Abstract: A method for determining an inter-channel time difference (ITD) parameter includes determining a reference parameter according to a time-domain signal on a first sound channel and a time-domain signal on a second sound channel, where the reference parameter corresponds to a sequence of obtaining the time-domain signal on the first sound channel and the time-domain signal on the second sound channel, determining a search range according to the reference parameter and a limiting value (Tmax), where the Tmax is determined according to a sampling rate of the time-domain signal on the first sound channel, and performing search processing within the search range based on a frequency-domain signal on the first sound channel and a frequency-domain signal on the second sound channel to determine a first ITD parameter corresponding to the first sound channel and the second sound channel.

Type: Grant

Filed: September 7, 2017

Date of Patent: February 19, 2019

Assignee: Huawei Technologies Co., Ltd.

Inventors: Xingtao Zhang, Lei Miao
Method and apparatus for decoding speech/audio bitstream

Patent number: 10121484

Abstract: A method and an apparatus for decoding a speech/audio bitstream are disclosed, where the method for decoding a speech/audio bitstream includes determining whether a current frame is a normal decoding frame or a redundancy decoding frame, obtaining a decoded parameter of the current frame by means of parsing when the current frame is a normal decoding frame or a redundancy decoding frame, performing post-processing on the decoded parameter of the current frame to obtain a post-processed decoded parameter of the current frame, and using the post-processed decoded parameter of the current frame to reconstruct a speech/audio signal.

Type: Grant

Filed: June 28, 2017

Date of Patent: November 6, 2018

Assignee: HUAWEI TECHNOLOGIES CO., LTD.

Inventors: Zexin Liu, Xingtao Zhang, Lei Miao
Method and apparatus for encoding stereo phase parameter

Patent number: 10008211

Abstract: Present disclosure discloses a method and an apparatus for encoding a stereo phase parameter, which relate to the field of information technologies and can improve an effect of stereo audio phase information. The method includes: first, acquiring a global stereo phase parameter of a current frame; then, determining a value of the global stereo phase parameter of the current frame, and adjusting the value of the global stereo phase parameter of the current frame according to a determining result of the value of the global stereo phase parameter of the current frame; and finally, encoding an adjusted value of the global stereo phase parameter of the current frame. The embodiments of the present disclosure are applicable to recovering stereo phase information.

Type: Grant

Filed: May 13, 2016

Date of Patent: June 26, 2018

Assignee: Huawei Technologies Co., Ltd.

Inventors: Xingtao Zhang, Lei Miao, Wenhai Wu
Inter-channel level difference processing method and apparatus

Patent number: 10002615

Abstract: An inter-channel level difference (ICLD) processing method and apparatus are disclosed. A stereo audio signal is received, and the stereo audio signal is parsed frame by frame, to obtain an ICLD of each sub-band of each subframe of each frame of the stereo audio signal; a sum of absolute values of the ICLDs of each subframe of any frame of the stereo audio signal is calculated; and when an absolute value of a difference between the sums of the absolute values of the ICLDs of each two subframes of the any frame is less than a preset threshold, a weighted ICLD value of each sub-band of the any frame is calculated in a first weighting manner; or otherwise, a weighted ICLD value of each sub-band of the any frame is calculated in a second weighting manner.

Type: Grant

Filed: November 4, 2015

Date of Patent: June 19, 2018

Assignee: HUAWEI TECHNOLOGIES CO., LTD.

Inventors: Xingtao Zhang, Lei Miao
Method and Apparatus for Determining Inter-Channel Time Difference Parameter

Publication number: 20170372710

Abstract: A method for determining an inter-channel time difference (ITD) parameter includes determining a reference parameter according to a time-domain signal on a first sound channel and a time-domain signal on a second sound channel, where the reference parameter corresponds to a sequence of obtaining the time-domain signal on the first sound channel and the time-domain signal on the second sound channel, determining a search range according to the reference parameter and a limiting value (Tmax), where the Tmax is determined according to a sampling rate of the time-domain signal on the first sound channel, and performing search processing within the search range based on a frequency-domain signal on the first sound channel and a frequency-domain signal on the second sound channel to determine a first ITD parameter corresponding to the first sound channel and the second sound channel.

Type: Application

Filed: September 7, 2017

Publication date: December 28, 2017

Inventors: Xingtao Zhang, Lei Miao
Method and Apparatus for Determining Inter-Channel Time Difference Parameter

Publication number: 20170365265

Abstract: A method and an apparatus for determining an inter-channel time difference parameter are provided, so that precision of a determined ITD parameter can adapt to channel quality. The method includes: determining a target search complexity from plurality of search complexities, where the plurality of search complexities are in a one-to-one correspondence with plurality of channel quality values; and performing search processing on a signal on a first sound channel and a signal on a second sound channel according to the target search complexity so as to determine a first inter-channel time difference ITD parameter corresponding to the first sound channel and the second sound channel.

Type: Application

Filed: September 6, 2017

Publication date: December 21, 2017

Inventors: Xingtao Zhang, Lei Miao
Method and Apparatus for Decoding Speech/Audio Bitstream

Publication number: 20170301361

Abstract: A method and an apparatus for decoding a speech/audio bitstream are disclosed, where the method for decoding a speech/audio bitstream includes determining whether a current frame is a normal decoding frame or a redundancy decoding frame, obtaining a decoded parameter of the current frame by means of parsing when the current frame is a normal decoding frame or a redundancy decoding frame, performing post-processing on the decoded parameter of the current frame to obtain a post-processed decoded parameter of the current frame, and using the post-processed decoded parameter of the current frame to reconstruct a speech/audio signal.

Type: Application

Filed: June 28, 2017

Publication date: October 19, 2017

Applicant: HUAWEI TECHNOLOGIES CO.,LTD.

Inventors: Zexin Liu, Xingtao Zhang, Lei Miao
Frame Loss Compensation Processing Method and Apparatus

Publication number: 20170287493

Abstract: A frame loss compensation processing method and apparatus is presented, where the method includes, when a ith frame is a lost frame, estimating a spectrum frequency parameter, a pitch period, and a gain of the ith frame according to at least one of an inter-frame relationship between first N frames of the ith frame or an intra-frame relationship between first N frames of the ith frame. A parameter of the ith frame is determined using the signal correlation between the first N frames, the signal energy stability between the first N frames, intra-frame signal correlation of each frame, and intra-frame signal energy stability of each frame.

Type: Application

Filed: March 29, 2017

Publication date: October 5, 2017

Inventors: Zexin Liu, Xingtao Zhang, Bin Wang, Lei Miao
Method and apparatus for decoding speech/audio bitstream

Patent number: 9734836

Abstract: A method and an apparatus for decoding a speech/audio bitstream are disclosed, where the method for decoding a speech/audio bitstream includes determining whether a current frame is a normal decoding frame or a redundancy decoding frame, obtaining a decoded parameter of the current frame by means of parsing when the current frame is a normal decoding frame or a redundancy decoding frame, performing post-processing on the decoded parameter of the current frame to obtain a post-processed decoded parameter of the current frame, and using the post-processed decoded parameter of the current frame to reconstruct a speech/audio signal.

Type: Grant

Filed: June 29, 2016

Date of Patent: August 15, 2017

Assignee: Huawei Technologies Co., Ltd.

Inventors: Zexin Liu, Xingtao Zhang, Lei Miao
SPEECH/AUDIO BITSTREAM DECODING METHOD AND APPARATUS

Publication number: 20160372122

Abstract: The present invention disclose a speech/audio bitstream decoding method including: acquiring a speech/audio decoding parameter of a current speech/audio frame, where the foregoing current speech/audio frame is a redundant decoded frame or a speech/audio frame previous to the foregoing current speech/audio frame is a redundant decoded frame; performing post processing on the acquired speech/audio decoding parameter according to speech/audio parameters of X speech/audio frames, where the foregoing X speech/audio frames include M speech/audio frames previous to the foregoing current speech/audio frame and/or N speech/audio frames next to the foregoing current speech/audio frame; and recovering a speech/audio signal by using the post-processed speech/audio decoding parameter of the foregoing current speech/audio frame. The technical solutions of the present invention help improve quality of an output speech/audio signal.

Type: Application

Filed: September 2, 2016

Publication date: December 22, 2016

Applicant: HUAWEI TECHNOLOGIES CO.,LTD.

Inventors: Xingtao Zhang, Zexin Liu, Lei Miao
Method and Apparatus for Decoding Speech/Audio Bitstream

Publication number: 20160343382

Abstract: A method and an apparatus for decoding a speech/audio bitstream are disclosed, where the method for decoding a speech/audio bitstream includes determining whether a current frame is a normal decoding frame or a redundancy decoding frame, obtaining a decoded parameter of the current frame by means of parsing when the current frame is a normal decoding frame or a redundancy decoding frame, performing post-processing on the decoded parameter of the current frame to obtain a post-processed decoded parameter of the current frame, and using the post-processed decoded parameter of the current frame to reconstruct a speech/audio signal.

Type: Application

Filed: June 29, 2016

Publication date: November 24, 2016

Inventors: Zexin Liu, Xingtao Zhang, Lei Miao

prev 1 2 3 next