Patents by Inventor Xingtao Zhang
Xingtao Zhang has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20200126571Abstract: This application discloses an IPD parameter encoding method, including: obtaining a reference parameter used to determine an IPD parameter encoding scheme of a current frame of a multi-channel signal; determining the IPD parameter encoding scheme of the current frame based on the reference parameter, where the determined IPD parameter encoding scheme of the current frame is one of at least two preset IPD parameter encoding schemes; and processing an IPD parameter of the current frame based on the determined IPD parameter encoding scheme of the current frame. The technical solutions provided in this application can improve encoding quality of the multi-channel signal.Type: ApplicationFiled: December 20, 2019Publication date: April 23, 2020Inventors: Xingtao ZHANG, Haiting LI, Zexin LIU, Lei MIAO
-
Publication number: 20190342688Abstract: A method and an apparatus for locating a sound source are provided. The method includes: obtaining M channels of audio signals of a preset format by microphone arrays located in different planes (S100); preprocessing the M channels of audio signals of the preset format, and projecting them onto the same plane, so as to obtain N channels of audio signals, where M?N (S200); performing a time-frequency transformation on each of the N channels of audio signals, so as to obtain frequency domain signals of the N channels of audio signals (S300); further calculating a covariance matrix of the frequency domain signals and performing a smoothing process (S400); performing an eigenvalue decomposition of the smoothed covariance matrix (S500); estimating the sound source direction according to an eigenvector corresponding to the maximum eigenvalue, so as to obtain a sound source orientation parameter (S600).Type: ApplicationFiled: July 18, 2019Publication date: November 7, 2019Applicants: NANJING TWIRLING TECHNOLOGY CO., LTD., BEIJING TWIRLING IN TIME CO., LTD.Inventors: Xuejing SUN, Xingtao ZHANG, Chen ZHANG
-
Patent number: 10388288Abstract: A method and an apparatus for determining an inter-channel time difference parameter are provided, so that precision of a determined ITD parameter can adapt to channel quality. The method includes: determining a target search complexity from plurality of search complexities, where the plurality of search complexities are in a one-to-one correspondence with plurality of channel quality values; and performing search processing on a signal on a first sound channel and a signal on a second sound channel according to the target search complexity so as to determine a first inter-channel time difference ITD parameter corresponding to the first sound channel and the second sound channel.Type: GrantFiled: September 6, 2017Date of Patent: August 20, 2019Assignee: Huawei Technologies Co., Ltd.Inventors: Xingtao Zhang, Lei Miao
-
Patent number: 10354659Abstract: A frame loss compensation processing method and apparatus is presented, where the method includes, when a ith frame is a lost frame, estimating a spectrum frequency parameter, a pitch period, and a gain of the ith frame according to at least one of an inter-frame relationship between first N frames of the ith frame or an intra-frame relationship between first N frames of the ith frame. A parameter of the ith frame is determined using the signal correlation between the first N frames, the signal energy stability between the first N frames, intra-frame signal correlation of each frame, and intra-frame signal energy stability of each frame.Type: GrantFiled: March 29, 2017Date of Patent: July 16, 2019Assignee: HUAWEI TECHNOLOGIES CO., LTD.Inventors: Zexin Liu, Xingtao Zhang, Bin Wang, Lei Miao
-
Publication number: 20190214025Abstract: A speech/audio bitstream decoding method includes acquiring a speech/audio decoding parameter of a current speech/audio frame, where the foregoing current speech/audio frame is a redundant decoded frame or a speech/audio frame previous to the foregoing current speech/audio frame is a redundant decoded frame, performing post processing on the acquired speech/audio decoding parameter according to speech/audio parameters of X speech/audio frames, where the foregoing X speech/audio frames include M speech/audio frames previous to the foregoing current speech/audio frame and/or N speech/audio frames next to the foregoing current speech/audio frame, and recovering a speech/audio signal using the post-processed speech/audio decoding parameter of the foregoing current speech/audio frame. The technical solutions of the speech/audio bitstream decoding method help improve quality of an output speech/audio signal.Type: ApplicationFiled: March 19, 2019Publication date: July 11, 2019Inventors: Xingtao Zhang, Zexin Liu, Lei Miao
-
Publication number: 20190189134Abstract: A method for encoding a multi-channel signal and an encoder, where the encoding method includes obtaining a multi-channel signal of a current frame, determining an initial inter-channel time difference (ITD) value of the current frame, controlling, based on characteristic information of the multi-channel signal, a quantity of target frames that are allowed to appear continuously, where the characteristic information includes at least one of a signal-to-noise ratio of the multi-channel signal or a peak feature of cross correlation coefficients of the multi-channel signal, and an ITD value of a previous frame of the target frame is reused as an ITD value of the target frame, determining an ITD value of the current frame based on the initial ITD value and the quantity of target frames allowed to appear continuously, and encoding the multi-channel signal based on the ITD value of the current frame.Type: ApplicationFiled: February 11, 2019Publication date: June 20, 2019Inventors: Haiting Li, Zexin Liu, Xingtao Zhang, Lei Miao
-
Publication number: 20190172474Abstract: A multi-channel signal encoding method and an encoder, where the encoding method includes obtaining a multi-channel signal of a current frame, determining an initial multi-channel parameter of the current frame, determining a difference parameter based on the initial multi-channel parameter of the current frame and multi-channel parameters of previous K frames of the current frame, where the difference parameter represents a difference between the initial multi-channel parameter of the current frame and the multi-channel parameters of the previous K frames, and K is an integer greater than or equal to one, determining a multi-channel parameter of the current frame based on the difference parameter and a characteristic parameter of the current frame, and encoding the multi-channel signal based on the multi-channel parameter of the current frame. Hence, the method and the encoder ensure better accuracy of inter-channel information of a multi-channel signal.Type: ApplicationFiled: February 11, 2019Publication date: June 6, 2019Inventors: Zexin Liu, Xingtao Zhang, Haiting Li, Lei Miao
-
Patent number: 10269357Abstract: The present invention disclose a speech/audio bitstream decoding method including: acquiring a speech/audio decoding parameter of a current speech/audio frame, where the foregoing current speech/audio frame is a redundant decoded frame or a speech/audio frame previous to the foregoing current speech/audio frame is a redundant decoded frame; performing post processing on the acquired speech/audio decoding parameter according to speech/audio parameters of X speech/audio frames, where the foregoing X speech/audio frames include M speech/audio frames previous to the foregoing current speech/audio frame and/or N speech/audio frames next to the foregoing current speech/audio frame; and recovering a speech/audio signal by using the post-processed speech/audio decoding parameter of the foregoing current speech/audio frame. The technical solutions of the present invention help improve quality of an output speech/audio signal.Type: GrantFiled: September 2, 2016Date of Patent: April 23, 2019Assignee: HUAWEI TECHNOLOGIES CO., LTD.Inventors: Xingtao Zhang, Zexin Liu, Lei Miao
-
Publication number: 20190096411Abstract: An inter-channel phase difference (IPD) parameter extraction method and apparatus, where the extraction method includes obtaining a parameter obtaining an information extraction manner for a current frame of a multi-channel signal, obtaining an IPD parameter extraction manner for the current frame based on the parameter obtaining the information extraction manner, where the obtained IPD parameter extraction manner is one of at least two preset IPD parameter extraction manners, and obtaining an IPD parameter of the current frame based on the obtained IPD parameter extraction manner for the current frame.Type: ApplicationFiled: November 27, 2018Publication date: March 28, 2019Inventors: Xingtao Zhang, Haiting Li, Zexin Liu, Lei Miao
-
Patent number: 10210873Abstract: A method for determining an inter-channel time difference (ITD) parameter includes determining a reference parameter according to a time-domain signal on a first sound channel and a time-domain signal on a second sound channel, where the reference parameter corresponds to a sequence of obtaining the time-domain signal on the first sound channel and the time-domain signal on the second sound channel, determining a search range according to the reference parameter and a limiting value (Tmax), where the Tmax is determined according to a sampling rate of the time-domain signal on the first sound channel, and performing search processing within the search range based on a frequency-domain signal on the first sound channel and a frequency-domain signal on the second sound channel to determine a first ITD parameter corresponding to the first sound channel and the second sound channel.Type: GrantFiled: September 7, 2017Date of Patent: February 19, 2019Assignee: Huawei Technologies Co., Ltd.Inventors: Xingtao Zhang, Lei Miao
-
Patent number: 10121484Abstract: A method and an apparatus for decoding a speech/audio bitstream are disclosed, where the method for decoding a speech/audio bitstream includes determining whether a current frame is a normal decoding frame or a redundancy decoding frame, obtaining a decoded parameter of the current frame by means of parsing when the current frame is a normal decoding frame or a redundancy decoding frame, performing post-processing on the decoded parameter of the current frame to obtain a post-processed decoded parameter of the current frame, and using the post-processed decoded parameter of the current frame to reconstruct a speech/audio signal.Type: GrantFiled: June 28, 2017Date of Patent: November 6, 2018Assignee: HUAWEI TECHNOLOGIES CO., LTD.Inventors: Zexin Liu, Xingtao Zhang, Lei Miao
-
Patent number: 10008211Abstract: Present disclosure discloses a method and an apparatus for encoding a stereo phase parameter, which relate to the field of information technologies and can improve an effect of stereo audio phase information. The method includes: first, acquiring a global stereo phase parameter of a current frame; then, determining a value of the global stereo phase parameter of the current frame, and adjusting the value of the global stereo phase parameter of the current frame according to a determining result of the value of the global stereo phase parameter of the current frame; and finally, encoding an adjusted value of the global stereo phase parameter of the current frame. The embodiments of the present disclosure are applicable to recovering stereo phase information.Type: GrantFiled: May 13, 2016Date of Patent: June 26, 2018Assignee: Huawei Technologies Co., Ltd.Inventors: Xingtao Zhang, Lei Miao, Wenhai Wu
-
Patent number: 10002615Abstract: An inter-channel level difference (ICLD) processing method and apparatus are disclosed. A stereo audio signal is received, and the stereo audio signal is parsed frame by frame, to obtain an ICLD of each sub-band of each subframe of each frame of the stereo audio signal; a sum of absolute values of the ICLDs of each subframe of any frame of the stereo audio signal is calculated; and when an absolute value of a difference between the sums of the absolute values of the ICLDs of each two subframes of the any frame is less than a preset threshold, a weighted ICLD value of each sub-band of the any frame is calculated in a first weighting manner; or otherwise, a weighted ICLD value of each sub-band of the any frame is calculated in a second weighting manner.Type: GrantFiled: November 4, 2015Date of Patent: June 19, 2018Assignee: HUAWEI TECHNOLOGIES CO., LTD.Inventors: Xingtao Zhang, Lei Miao
-
Publication number: 20170372710Abstract: A method for determining an inter-channel time difference (ITD) parameter includes determining a reference parameter according to a time-domain signal on a first sound channel and a time-domain signal on a second sound channel, where the reference parameter corresponds to a sequence of obtaining the time-domain signal on the first sound channel and the time-domain signal on the second sound channel, determining a search range according to the reference parameter and a limiting value (Tmax), where the Tmax is determined according to a sampling rate of the time-domain signal on the first sound channel, and performing search processing within the search range based on a frequency-domain signal on the first sound channel and a frequency-domain signal on the second sound channel to determine a first ITD parameter corresponding to the first sound channel and the second sound channel.Type: ApplicationFiled: September 7, 2017Publication date: December 28, 2017Inventors: Xingtao Zhang, Lei Miao
-
Publication number: 20170365265Abstract: A method and an apparatus for determining an inter-channel time difference parameter are provided, so that precision of a determined ITD parameter can adapt to channel quality. The method includes: determining a target search complexity from plurality of search complexities, where the plurality of search complexities are in a one-to-one correspondence with plurality of channel quality values; and performing search processing on a signal on a first sound channel and a signal on a second sound channel according to the target search complexity so as to determine a first inter-channel time difference ITD parameter corresponding to the first sound channel and the second sound channel.Type: ApplicationFiled: September 6, 2017Publication date: December 21, 2017Inventors: Xingtao Zhang, Lei Miao
-
Publication number: 20170301361Abstract: A method and an apparatus for decoding a speech/audio bitstream are disclosed, where the method for decoding a speech/audio bitstream includes determining whether a current frame is a normal decoding frame or a redundancy decoding frame, obtaining a decoded parameter of the current frame by means of parsing when the current frame is a normal decoding frame or a redundancy decoding frame, performing post-processing on the decoded parameter of the current frame to obtain a post-processed decoded parameter of the current frame, and using the post-processed decoded parameter of the current frame to reconstruct a speech/audio signal.Type: ApplicationFiled: June 28, 2017Publication date: October 19, 2017Applicant: HUAWEI TECHNOLOGIES CO.,LTD.Inventors: Zexin Liu, Xingtao Zhang, Lei Miao
-
Publication number: 20170287493Abstract: A frame loss compensation processing method and apparatus is presented, where the method includes, when a ith frame is a lost frame, estimating a spectrum frequency parameter, a pitch period, and a gain of the ith frame according to at least one of an inter-frame relationship between first N frames of the ith frame or an intra-frame relationship between first N frames of the ith frame. A parameter of the ith frame is determined using the signal correlation between the first N frames, the signal energy stability between the first N frames, intra-frame signal correlation of each frame, and intra-frame signal energy stability of each frame.Type: ApplicationFiled: March 29, 2017Publication date: October 5, 2017Inventors: Zexin Liu, Xingtao Zhang, Bin Wang, Lei Miao
-
Patent number: 9734836Abstract: A method and an apparatus for decoding a speech/audio bitstream are disclosed, where the method for decoding a speech/audio bitstream includes determining whether a current frame is a normal decoding frame or a redundancy decoding frame, obtaining a decoded parameter of the current frame by means of parsing when the current frame is a normal decoding frame or a redundancy decoding frame, performing post-processing on the decoded parameter of the current frame to obtain a post-processed decoded parameter of the current frame, and using the post-processed decoded parameter of the current frame to reconstruct a speech/audio signal.Type: GrantFiled: June 29, 2016Date of Patent: August 15, 2017Assignee: Huawei Technologies Co., Ltd.Inventors: Zexin Liu, Xingtao Zhang, Lei Miao
-
Publication number: 20160372122Abstract: The present invention disclose a speech/audio bitstream decoding method including: acquiring a speech/audio decoding parameter of a current speech/audio frame, where the foregoing current speech/audio frame is a redundant decoded frame or a speech/audio frame previous to the foregoing current speech/audio frame is a redundant decoded frame; performing post processing on the acquired speech/audio decoding parameter according to speech/audio parameters of X speech/audio frames, where the foregoing X speech/audio frames include M speech/audio frames previous to the foregoing current speech/audio frame and/or N speech/audio frames next to the foregoing current speech/audio frame; and recovering a speech/audio signal by using the post-processed speech/audio decoding parameter of the foregoing current speech/audio frame. The technical solutions of the present invention help improve quality of an output speech/audio signal.Type: ApplicationFiled: September 2, 2016Publication date: December 22, 2016Applicant: HUAWEI TECHNOLOGIES CO.,LTD.Inventors: Xingtao Zhang, Zexin Liu, Lei Miao
-
Publication number: 20160343382Abstract: A method and an apparatus for decoding a speech/audio bitstream are disclosed, where the method for decoding a speech/audio bitstream includes determining whether a current frame is a normal decoding frame or a redundancy decoding frame, obtaining a decoded parameter of the current frame by means of parsing when the current frame is a normal decoding frame or a redundancy decoding frame, performing post-processing on the decoded parameter of the current frame to obtain a post-processed decoded parameter of the current frame, and using the post-processed decoded parameter of the current frame to reconstruct a speech/audio signal.Type: ApplicationFiled: June 29, 2016Publication date: November 24, 2016Inventors: Zexin Liu, Xingtao Zhang, Lei Miao