Patents by Inventor Xingtao Zhang

Xingtao Zhang has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20200126571
    Abstract: This application discloses an IPD parameter encoding method, including: obtaining a reference parameter used to determine an IPD parameter encoding scheme of a current frame of a multi-channel signal; determining the IPD parameter encoding scheme of the current frame based on the reference parameter, where the determined IPD parameter encoding scheme of the current frame is one of at least two preset IPD parameter encoding schemes; and processing an IPD parameter of the current frame based on the determined IPD parameter encoding scheme of the current frame. The technical solutions provided in this application can improve encoding quality of the multi-channel signal.
    Type: Application
    Filed: December 20, 2019
    Publication date: April 23, 2020
    Inventors: Xingtao ZHANG, Haiting LI, Zexin LIU, Lei MIAO
  • Publication number: 20190342688
    Abstract: A method and an apparatus for locating a sound source are provided. The method includes: obtaining M channels of audio signals of a preset format by microphone arrays located in different planes (S100); preprocessing the M channels of audio signals of the preset format, and projecting them onto the same plane, so as to obtain N channels of audio signals, where M ≥ N (S200); performing a time-frequency transformation on each of the N channels of audio signals, so as to obtain frequency domain signals of the N channels of audio signals (S300); further calculating a covariance matrix of the frequency domain signals and performing a smoothing process (S400); performing an eigenvalue decomposition of the smoothed covariance matrix (S500); estimating the sound source direction according to an eigenvector corresponding to the maximum eigenvalue, so as to obtain a sound source orientation parameter (S600).
    Type: Application
    Filed: July 18, 2019
    Publication date: November 7, 2019
    Applicants: NANJING TWIRLING TECHNOLOGY CO., LTD., BEIJING TWIRLING IN TIME CO., LTD.
    Inventors: Xuejing SUN, Xingtao ZHANG, Chen ZHANG
  • Patent number: 10388288
    Abstract: A method and an apparatus for determining an inter-channel time difference (ITD) parameter are provided, so that precision of a determined ITD parameter can adapt to channel quality. The method includes: determining a target search complexity from a plurality of search complexities, where the plurality of search complexities are in a one-to-one correspondence with a plurality of channel quality values; and performing search processing on a signal on a first sound channel and a signal on a second sound channel according to the target search complexity so as to determine a first ITD parameter corresponding to the first sound channel and the second sound channel.
    Type: Grant
    Filed: September 6, 2017
    Date of Patent: August 20, 2019
    Assignee: Huawei Technologies Co., Ltd.
    Inventors: Xingtao Zhang, Lei Miao
  • Patent number: 10354659
    Abstract: A frame loss compensation processing method and apparatus is presented, where the method includes, when an ith frame is a lost frame, estimating a spectrum frequency parameter, a pitch period, and a gain of the ith frame according to at least one of an inter-frame relationship between first N frames of the ith frame or an intra-frame relationship between first N frames of the ith frame. A parameter of the ith frame is determined using the signal correlation between the first N frames, the signal energy stability between the first N frames, intra-frame signal correlation of each frame, and intra-frame signal energy stability of each frame.
    Type: Grant
    Filed: March 29, 2017
    Date of Patent: July 16, 2019
    Assignee: HUAWEI TECHNOLOGIES CO., LTD.
    Inventors: Zexin Liu, Xingtao Zhang, Bin Wang, Lei Miao
  • Publication number: 20190214025
    Abstract: A speech/audio bitstream decoding method includes acquiring a speech/audio decoding parameter of a current speech/audio frame, where the foregoing current speech/audio frame is a redundant decoded frame or a speech/audio frame previous to the foregoing current speech/audio frame is a redundant decoded frame, performing post processing on the acquired speech/audio decoding parameter according to speech/audio parameters of X speech/audio frames, where the foregoing X speech/audio frames include M speech/audio frames previous to the foregoing current speech/audio frame and/or N speech/audio frames next to the foregoing current speech/audio frame, and recovering a speech/audio signal using the post-processed speech/audio decoding parameter of the foregoing current speech/audio frame. The technical solutions of the speech/audio bitstream decoding method help improve quality of an output speech/audio signal.
    Type: Application
    Filed: March 19, 2019
    Publication date: July 11, 2019
    Inventors: Xingtao Zhang, Zexin Liu, Lei Miao
  • Publication number: 20190189134
    Abstract: A method for encoding a multi-channel signal and an encoder, where the encoding method includes obtaining a multi-channel signal of a current frame, determining an initial inter-channel time difference (ITD) value of the current frame, controlling, based on characteristic information of the multi-channel signal, a quantity of target frames that are allowed to appear continuously, where the characteristic information includes at least one of a signal-to-noise ratio of the multi-channel signal or a peak feature of cross correlation coefficients of the multi-channel signal, and an ITD value of a previous frame of the target frame is reused as an ITD value of the target frame, determining an ITD value of the current frame based on the initial ITD value and the quantity of target frames allowed to appear continuously, and encoding the multi-channel signal based on the ITD value of the current frame.
    Type: Application
    Filed: February 11, 2019
    Publication date: June 20, 2019
    Inventors: Haiting Li, Zexin Liu, Xingtao Zhang, Lei Miao
  • Publication number: 20190172474
    Abstract: A multi-channel signal encoding method and an encoder, where the encoding method includes obtaining a multi-channel signal of a current frame, determining an initial multi-channel parameter of the current frame, determining a difference parameter based on the initial multi-channel parameter of the current frame and multi-channel parameters of previous K frames of the current frame, where the difference parameter represents a difference between the initial multi-channel parameter of the current frame and the multi-channel parameters of the previous K frames, and K is an integer greater than or equal to one, determining a multi-channel parameter of the current frame based on the difference parameter and a characteristic parameter of the current frame, and encoding the multi-channel signal based on the multi-channel parameter of the current frame. Hence, the method and the encoder ensure better accuracy of inter-channel information of a multi-channel signal.
    Type: Application
    Filed: February 11, 2019
    Publication date: June 6, 2019
    Inventors: Zexin Liu, Xingtao Zhang, Haiting Li, Lei Miao
  • Patent number: 10269357
    Abstract: The present invention discloses a speech/audio bitstream decoding method including: acquiring a speech/audio decoding parameter of a current speech/audio frame, where the foregoing current speech/audio frame is a redundant decoded frame or a speech/audio frame previous to the foregoing current speech/audio frame is a redundant decoded frame; performing post processing on the acquired speech/audio decoding parameter according to speech/audio parameters of X speech/audio frames, where the foregoing X speech/audio frames include M speech/audio frames previous to the foregoing current speech/audio frame and/or N speech/audio frames next to the foregoing current speech/audio frame; and recovering a speech/audio signal by using the post-processed speech/audio decoding parameter of the foregoing current speech/audio frame. The technical solutions of the present invention help improve quality of an output speech/audio signal.
    Type: Grant
    Filed: September 2, 2016
    Date of Patent: April 23, 2019
    Assignee: HUAWEI TECHNOLOGIES CO., LTD.
    Inventors: Xingtao Zhang, Zexin Liu, Lei Miao
  • Publication number: 20190096411
    Abstract: An inter-channel phase difference (IPD) parameter extraction method and apparatus, where the extraction method includes obtaining a parameter obtaining an information extraction manner for a current frame of a multi-channel signal, obtaining an IPD parameter extraction manner for the current frame based on the parameter obtaining the information extraction manner, where the obtained IPD parameter extraction manner is one of at least two preset IPD parameter extraction manners, and obtaining an IPD parameter of the current frame based on the obtained IPD parameter extraction manner for the current frame.
    Type: Application
    Filed: November 27, 2018
    Publication date: March 28, 2019
    Inventors: Xingtao Zhang, Haiting Li, Zexin Liu, Lei Miao
  • Patent number: 10210873
    Abstract: A method for determining an inter-channel time difference (ITD) parameter includes determining a reference parameter according to a time-domain signal on a first sound channel and a time-domain signal on a second sound channel, where the reference parameter corresponds to a sequence of obtaining the time-domain signal on the first sound channel and the time-domain signal on the second sound channel, determining a search range according to the reference parameter and a limiting value (Tmax), where the Tmax is determined according to a sampling rate of the time-domain signal on the first sound channel, and performing search processing within the search range based on a frequency-domain signal on the first sound channel and a frequency-domain signal on the second sound channel to determine a first ITD parameter corresponding to the first sound channel and the second sound channel.
    Type: Grant
    Filed: September 7, 2017
    Date of Patent: February 19, 2019
    Assignee: Huawei Technologies Co., Ltd.
    Inventors: Xingtao Zhang, Lei Miao
  • Patent number: 10121484
    Abstract: A method and an apparatus for decoding a speech/audio bitstream are disclosed, where the method for decoding a speech/audio bitstream includes determining whether a current frame is a normal decoding frame or a redundancy decoding frame, obtaining a decoded parameter of the current frame by means of parsing when the current frame is a normal decoding frame or a redundancy decoding frame, performing post-processing on the decoded parameter of the current frame to obtain a post-processed decoded parameter of the current frame, and using the post-processed decoded parameter of the current frame to reconstruct a speech/audio signal.
    Type: Grant
    Filed: June 28, 2017
    Date of Patent: November 6, 2018
    Assignee: HUAWEI TECHNOLOGIES CO., LTD.
    Inventors: Zexin Liu, Xingtao Zhang, Lei Miao
  • Patent number: 10008211
    Abstract: The present disclosure discloses a method and an apparatus for encoding a stereo phase parameter, which relate to the field of information technologies and can improve an effect of stereo audio phase information. The method includes: first, acquiring a global stereo phase parameter of a current frame; then, determining a value of the global stereo phase parameter of the current frame, and adjusting the value of the global stereo phase parameter of the current frame according to a determining result of the value of the global stereo phase parameter of the current frame; and finally, encoding an adjusted value of the global stereo phase parameter of the current frame. The embodiments of the present disclosure are applicable to recovering stereo phase information.
    Type: Grant
    Filed: May 13, 2016
    Date of Patent: June 26, 2018
    Assignee: Huawei Technologies Co., Ltd.
    Inventors: Xingtao Zhang, Lei Miao, Wenhai Wu
  • Patent number: 10002615
    Abstract: An inter-channel level difference (ICLD) processing method and apparatus are disclosed. A stereo audio signal is received, and the stereo audio signal is parsed frame by frame, to obtain an ICLD of each sub-band of each subframe of each frame of the stereo audio signal; a sum of absolute values of the ICLDs of each subframe of any frame of the stereo audio signal is calculated; and when an absolute value of a difference between the sums of the absolute values of the ICLDs of each two subframes of the any frame is less than a preset threshold, a weighted ICLD value of each sub-band of the any frame is calculated in a first weighting manner; or otherwise, a weighted ICLD value of each sub-band of the any frame is calculated in a second weighting manner.
    Type: Grant
    Filed: November 4, 2015
    Date of Patent: June 19, 2018
    Assignee: HUAWEI TECHNOLOGIES CO., LTD.
    Inventors: Xingtao Zhang, Lei Miao
  • Publication number: 20170372710
    Abstract: A method for determining an inter-channel time difference (ITD) parameter includes determining a reference parameter according to a time-domain signal on a first sound channel and a time-domain signal on a second sound channel, where the reference parameter corresponds to a sequence of obtaining the time-domain signal on the first sound channel and the time-domain signal on the second sound channel, determining a search range according to the reference parameter and a limiting value (Tmax), where the Tmax is determined according to a sampling rate of the time-domain signal on the first sound channel, and performing search processing within the search range based on a frequency-domain signal on the first sound channel and a frequency-domain signal on the second sound channel to determine a first ITD parameter corresponding to the first sound channel and the second sound channel.
    Type: Application
    Filed: September 7, 2017
    Publication date: December 28, 2017
    Inventors: Xingtao Zhang, Lei Miao
  • Publication number: 20170365265
    Abstract: A method and an apparatus for determining an inter-channel time difference (ITD) parameter are provided, so that precision of a determined ITD parameter can adapt to channel quality. The method includes: determining a target search complexity from a plurality of search complexities, where the plurality of search complexities are in a one-to-one correspondence with a plurality of channel quality values; and performing search processing on a signal on a first sound channel and a signal on a second sound channel according to the target search complexity so as to determine a first ITD parameter corresponding to the first sound channel and the second sound channel.
    Type: Application
    Filed: September 6, 2017
    Publication date: December 21, 2017
    Inventors: Xingtao Zhang, Lei Miao
  • Publication number: 20170301361
    Abstract: A method and an apparatus for decoding a speech/audio bitstream are disclosed, where the method for decoding a speech/audio bitstream includes determining whether a current frame is a normal decoding frame or a redundancy decoding frame, obtaining a decoded parameter of the current frame by means of parsing when the current frame is a normal decoding frame or a redundancy decoding frame, performing post-processing on the decoded parameter of the current frame to obtain a post-processed decoded parameter of the current frame, and using the post-processed decoded parameter of the current frame to reconstruct a speech/audio signal.
    Type: Application
    Filed: June 28, 2017
    Publication date: October 19, 2017
    Applicant: HUAWEI TECHNOLOGIES CO., LTD.
    Inventors: Zexin Liu, Xingtao Zhang, Lei Miao
  • Publication number: 20170287493
    Abstract: A frame loss compensation processing method and apparatus is presented, where the method includes, when an ith frame is a lost frame, estimating a spectrum frequency parameter, a pitch period, and a gain of the ith frame according to at least one of an inter-frame relationship between first N frames of the ith frame or an intra-frame relationship between first N frames of the ith frame. A parameter of the ith frame is determined using the signal correlation between the first N frames, the signal energy stability between the first N frames, intra-frame signal correlation of each frame, and intra-frame signal energy stability of each frame.
    Type: Application
    Filed: March 29, 2017
    Publication date: October 5, 2017
    Inventors: Zexin Liu, Xingtao Zhang, Bin Wang, Lei Miao
  • Patent number: 9734836
    Abstract: A method and an apparatus for decoding a speech/audio bitstream are disclosed, where the method for decoding a speech/audio bitstream includes determining whether a current frame is a normal decoding frame or a redundancy decoding frame, obtaining a decoded parameter of the current frame by means of parsing when the current frame is a normal decoding frame or a redundancy decoding frame, performing post-processing on the decoded parameter of the current frame to obtain a post-processed decoded parameter of the current frame, and using the post-processed decoded parameter of the current frame to reconstruct a speech/audio signal.
    Type: Grant
    Filed: June 29, 2016
    Date of Patent: August 15, 2017
    Assignee: Huawei Technologies Co., Ltd.
    Inventors: Zexin Liu, Xingtao Zhang, Lei Miao
  • Publication number: 20160372122
    Abstract: The present invention discloses a speech/audio bitstream decoding method including: acquiring a speech/audio decoding parameter of a current speech/audio frame, where the foregoing current speech/audio frame is a redundant decoded frame or a speech/audio frame previous to the foregoing current speech/audio frame is a redundant decoded frame; performing post processing on the acquired speech/audio decoding parameter according to speech/audio parameters of X speech/audio frames, where the foregoing X speech/audio frames include M speech/audio frames previous to the foregoing current speech/audio frame and/or N speech/audio frames next to the foregoing current speech/audio frame; and recovering a speech/audio signal by using the post-processed speech/audio decoding parameter of the foregoing current speech/audio frame. The technical solutions of the present invention help improve quality of an output speech/audio signal.
    Type: Application
    Filed: September 2, 2016
    Publication date: December 22, 2016
    Applicant: HUAWEI TECHNOLOGIES CO., LTD.
    Inventors: Xingtao Zhang, Zexin Liu, Lei Miao
  • Publication number: 20160343382
    Abstract: A method and an apparatus for decoding a speech/audio bitstream are disclosed, where the method for decoding a speech/audio bitstream includes determining whether a current frame is a normal decoding frame or a redundancy decoding frame, obtaining a decoded parameter of the current frame by means of parsing when the current frame is a normal decoding frame or a redundancy decoding frame, performing post-processing on the decoded parameter of the current frame to obtain a post-processed decoded parameter of the current frame, and using the post-processed decoded parameter of the current frame to reconstruct a speech/audio signal.
    Type: Application
    Filed: June 29, 2016
    Publication date: November 24, 2016
    Inventors: Zexin Liu, Xingtao Zhang, Lei Miao
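
Several of the abstracts listed above (for example, patent 10210873) describe searching for an inter-channel time difference (ITD) within a range bounded by a limiting value Tmax that depends on the sampling rate. The sketch below is only an illustration of that general idea, not the patented method: it swaps in a plain time-domain cross-correlation search, whereas the abstract describes searching on frequency-domain signals. The function names `max_lag_samples` and `estimate_itd`, the `max_delay_s` parameter, and the sign convention are all assumptions introduced here.

```python
def max_lag_samples(sample_rate_hz, max_delay_s):
    """Hypothetical helper: turn a maximum physical delay in seconds into a
    lag limit in samples, so that the limit scales with the sampling rate."""
    return round(sample_rate_hz * max_delay_s)


def estimate_itd(left, right, t_max):
    """Return the lag in samples, within [-t_max, t_max], that maximizes the
    cross-correlation of two equal-length channels.

    Assumed sign convention: a positive lag means `right` is a delayed
    copy of `left`.
    """
    if len(left) != len(right):
        raise ValueError("channels must have the same length")
    n = len(left)
    best_lag, best_score = 0, float("-inf")
    for lag in range(-t_max, t_max + 1):
        # Correlate left[i] with right[i + lag], skipping out-of-range samples.
        score = 0.0
        for i in range(n):
            j = i + lag
            if 0 <= j < n:
                score += left[i] * right[j]
        # Strict comparison keeps the first lag found when scores tie.
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag


# Usage: `right` is `left` delayed by three samples.
left = [0, 0, 1, 2, 1, 0, 0, 0, 0, 0, 0, 0]
right = [0, 0, 0, 0, 0, 1, 2, 1, 0, 0, 0, 0]
itd = estimate_itd(left, right, t_max=5)
```

Restricting the loop to `[-t_max, t_max]` keeps the cost proportional to the search range rather than to the frame length, which is the practical reason for bounding the search. A production codec would likely work on windowed frames and frequency-domain correlation instead of this brute-force loop.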