Patents by Inventor Xuejing Sun

Xuejing Sun has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 10332540
    Abstract: Example embodiments disclosed herein relate to filter coefficient updating in time domain filtering. A method of processing an audio signal is disclosed. The method includes obtaining a predetermined number of target gains for a first portion of the audio signal by analyzing the first portion of the audio signal. Each of the target gains is corresponding to a linear subband of the audio signal. The method also includes determining a filter coefficients for time domain filtering the first portion of the audio signal so as to approximate a frequency response given by the target gains. The filter coefficients are determined by iteratively selecting at least one target gain from the target gains and updating the filter coefficient based on the selected at least one target gain. Corresponding system and computer program product for processing an audio signal are also disclosed.
    Type: Grant
    Filed: September 15, 2016
    Date of Patent: June 25, 2019
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Dong Shi, Xuejing Sun
  • Patent number: 10334384
    Abstract: A method for processing audio data, the method comprising: receiving audio data corresponding to a plurality of instances of audio, including at least one of: (a) audio data from multiple endpoints, recorded separately or (b) audio data from a single endpoint corresponding to multiple talkers and including spatial information for each of the multiple talkers; rendering the audio data in a virtual acoustic space such that each of the instances of audio has a respective different virtual position in the virtual acoustic space; and scheduling the instances of audio to be played back with a playback overlap between at least two of the instances of audio, wherein the scheduling is performed, at least in part, according to a set of perceptually-motivated rules.
    Type: Grant
    Filed: February 3, 2016
    Date of Patent: June 25, 2019
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Xuejing Sun, Richard J. Cartwright, Michael P. Hollier, Michael Eckert
  • Patent number: 10311891
    Abstract: A method, an apparatus, and logic to post-process raw gains determined by input processing to generate post-processed gains, comprising using one or both of delta gain smoothing and decision-directed gain smoothing. The delta gain smoothing comprises applying a smoothing filter to the raw gain with a smoothing factor that depends on the gain delta: the absolute value of the difference between the raw gain for the current frame and the post-processed gain for a previous frame. The decision-directed gain smoothing comprises converting the raw gain to a signal-to-noise ratio, applying a smoothing filter with a smoothing factor to the signal-to-noise ratio to calculate a smoothed signal-to-noise ratio, and converting the smoothed signal-to-noise ratio to determine the second smoothed gain, with smoothing factor possibly dependent on the gain delta.
    Type: Grant
    Filed: February 15, 2017
    Date of Patent: June 4, 2019
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Xuejing Sun, Glenn N. Dickins
  • Patent number: 10224046
    Abstract: A method, an apparatus, logic (e.g., executable instructions encoded in a non-transitory computer-readable medium to carry out a method), and a non-transitory computer-readable medium configured with such instructions. The method is to generate and spatially render spatial comfort noise at a receiving endpoint of a conference system, such that the comfort noise has target spectral characteristics typical of comfort noise, and at least one spatial property that at least substantially matches at least one target spatial property. On version includes receiving one or more or more audio signals from other endpoints, combining the received audio signals with the spatial comfort noise signals, and rendering the combination of the received audio signals and the spatial comfort noise signals to a set of output signals for loudspeakers, such that the spatial comfort noise signals are continually in the output signal sin addition to output from the received audio signals.
    Type: Grant
    Filed: March 4, 2014
    Date of Patent: March 5, 2019
    Assignees: Dolby Laboratories Licensing Corporation, Dolby International AB
    Inventors: Glenn N. Dickins, Xuejing Sun, Yen-Liang Shue, Heiko Purnhagen
  • Patent number: 10224040
    Abstract: The present application relates to packet loss concealment apparatus and method, and audio processing system. According to an embodiment, the packet loss concealment apparatus is provided for concealing packet losses in a stream of audio packets, each audio packet comprising at least one audio frame in transmission format comprising at least one monaural component and at least one spatial component. The packet loss concealment apparatus may comprises a first concealment unit for creating the at least one monaural component for a lost frame in a lost packet and a second concealment unit for creating the at least one spatial component for the lost frame. According to the embodiment, spatial artifacts such as incorrect angle and diffuseness may be avoided as far as possible in PLC for multi-channel spatial or sound field encoded audio signals.
    Type: Grant
    Filed: July 2, 2014
    Date of Patent: March 5, 2019
    Assignees: Dolby Laboratories Licensing Corporation, Dolby International AB
    Inventors: Shen Huang, Xuejing Sun, Heiko Purnhagen
  • Patent number: 10200804
    Abstract: Embodiments of the present invention relate to video content assisted audio object extraction. A method of audio object extraction from channel-based audio content is disclosed. The method comprises extracting at least one video object from video content associated with the channel-based audio content, and determining information about the at least one video object. The method further comprises extracting from the channel-based audio content an audio object to be rendered as an upmixed audio signal based on the determined information. Corresponding system and computer program product are also disclosed.
    Type: Grant
    Filed: February 24, 2016
    Date of Patent: February 5, 2019
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Lianwu Chen, Xuejing Sun, Lie Lu
  • Patent number: 10182207
    Abstract: The disclosure relates to handling nuisance in teleconference system. An endpoint device (400) for use in a teleconference includes an acquiring unit (401), a judging unit (402), a controller (403) and a processing unit (404). The acquiring unit acquires a media stream for presentation in the teleconference, and receives information from another device. The information includes a first estimation on whether the media stream is a nuisance to the teleconference. As the nuisance to a teleconference, audio or video signals are perceived by users as actually not relevant to the conference session or causing unpleasant feeling or confusion. The judging unit decides whether the media stream is the nuisance at least based on the information. The controller controls the processing of the media stream to degrade or suppress the presentation of the media stream in case that the media stream is decided as the nuisance. The processing unit processes the media stream under the control of the controller.
    Type: Grant
    Filed: February 16, 2016
    Date of Patent: January 15, 2019
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Taoran Lu, Hariharan Ganapathy-Kathirvelu, Peng Yin, Glenn N. Dickins, Xuejing Sun
  • Patent number: 10142049
    Abstract: A method of determining a near optimal forward error correction scheme for the transmission of audio data over a lossy packet switched network having preallocated estimated bandwidth, delay and packet losses, between at least a first and second communications devices, the method including the steps of: determining a first coding rate for the audio data; determining a peak redundancy coding rate for redundant versions of the audio data; determining an average redundancy coding rate over a period of time for redundant versions of the audio data; determining an objective function which maximizes a bitrate-perceptual audio quality mapping of the transmitted audio data including a playout function formulation; and optimizing the objective function to produce a forward error correction scheme providing a high bitrate perceptual audio quality.
    Type: Grant
    Filed: October 7, 2016
    Date of Patent: November 27, 2018
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Xuejing Sun, Dong Shi
  • Publication number: 20180336902
    Abstract: Various disclosed implementations involve processing and/or playback of a recording of a conference involving a plurality of conference participants. Some implementations disclosed herein involve analyzing conversational dynamics of the conference recording. Some examples may involve searching the conference recording to determine instances of segment classifications. The segment classifications may be based, at least in part, on conversational dynamics data. Some implementations may involve segmenting the conference recording into a plurality of segments, each of the segments corresponding with a time interval and at least one of the segment classifications. Some implementations allow a listener to scan through a conference recording quickly according to segments, words, topics and/or talkers of interest.
    Type: Application
    Filed: February 3, 2016
    Publication date: November 22, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Richard J. CARTWRIGHT, Kai LI, Xuejing SUN
  • Patent number: 10103999
    Abstract: Some implementations involve controlling a jitter buffer size during a teleconference according to a jitter buffer size estimation algorithm based, at least in part, on a cumulative distribution function (CDF). The CDF may be based, at least in part, on a network jitter parameter. The CDF may be initialized according to a parametric model. At least one parameter of the parametric model may be based, at least in part, on legacy network jitter information.
    Type: Grant
    Filed: April 8, 2015
    Date of Patent: October 16, 2018
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: JiaQuan Huo, Xuejing Sun, Kai Li
  • Publication number: 20180287910
    Abstract: In a packet switched voice delivery application which utilizes a jitter buffer for the delivery of sequential packet data, a method of determining a measure of the output jitter of taking packets out of the buffer, the method including the step of: (a) forming a pull jitter measure comprising the differential fetch times between sequential pull packets dived by an expected time interval between packets.
    Type: Application
    Filed: September 27, 2016
    Publication date: October 4, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Xuejing Sun, JiaQuan Huo, Paul Holmberg
  • Publication number: 20180279063
    Abstract: A method for processing audio data, the method comprising: receiving audio data corresponding to a plurality of instances of audio, including at least one of: (a) audio data from multiple endpoints, recorded separately or (b) audio data from a single endpoint corresponding to multiple talkers and including spatial information for each of the multiple talkers; rendering the audio data in a virtual acoustic space such that each of the instances of audio has a respective different virtual position in the virtual acoustic space; and scheduling the instances of audio to be played back with a playback overlap between at least two of the instances of audio, wherein the scheduling is performed, at least in part, according to a set of perceptually-motivated rules.
    Type: Application
    Filed: February 3, 2016
    Publication date: September 27, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Xuejing SUN, Richard J. CARTWRIGHT, Michael P. HOLLIER, Michael ECKERT
  • Publication number: 20180254053
    Abstract: Example embodiments disclosed herein relate to filter coefficient updating in time domain filtering. A method of processing an audio signal is disclosed. The method includes obtaining a predetermined number of target gains for a first portion of the audio signal by analyzing the first portion of the audio signal. Each of the target gains is corresponding to a linear subband of the audio signal. The method also includes determining a filter coefficients for time domain filtering the first portion of the audio signal so as to approximate a frequency response given by the target gains. The filter coefficients are determined by iteratively selecting at least one target gain from the target gains and updating the filter coefficient based on the selected at least one target gain. Corresponding system and computer program product for processing an audio signal are also disclosed.
    Type: Application
    Filed: September 15, 2016
    Publication date: September 6, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Dong SHI, Xuejing SUN
  • Publication number: 20180191912
    Abstract: Various disclosed implementations involve processing and/or playback of a recording of a conference involving a plurality of conference participants. Some implementations disclosed herein involve receiving audio data corresponding to a recording of at least one conference involving a plurality of conference participants. In some examples, only a portion of the received audio data will be selected as playback audio data. The selection process may involve a topic selection process, a talkspurt filtering process and/or an acoustic feature selection process. Some examples involve receiving an indication of a target playback time duration. Selecting the portion of audio data may involve making a time duration of the playback audio data within a threshold time difference of the target playback time duration.
    Type: Application
    Filed: February 3, 2016
    Publication date: July 5, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Richard J. CARTWRIGHT, Xuejing SUN
  • Publication number: 20180190266
    Abstract: Various disclosed implementations involve processing and/or playback of a recording of a conference involving a plurality of conference participants. Some implementations disclosed herein involve receiving speech recognition results data, including a plurality of speech recognition lattices and a word recognition confidence score for each of a plurality of hypothesized words of the speech recognition lattices, for a conference recording. A primary word candidate and alternative word hypotheses may be determined for hypothesized words in the speech recognition lattices. A term frequency metric may be calculated for sorting the primary word candidates and the alternative word hypotheses. Hypothesized words may be rescored according to an alternative hypothesis list.
    Type: Application
    Filed: February 3, 2016
    Publication date: July 5, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Xuejing SUN, Richard J. CARTWRIGHT
  • Patent number: 10015443
    Abstract: Example embodiments disclosed herein relate to spatial congruency adjustment. A method for adjusting spatial congruency in a video conference is disclosed. The method in unwarping a visual scene captured by a video endpoint device into at least one rectilinear scene, the video endpoint device being configured to capture the visual scene in an omnidirectional manner, detecting spatial congruency between the at least one rectilinear scene and an auditory scene captured by an audio endpoint device that is positioned in relation to the video endpoint device. The spatial congruency being a degree of alignment between the auditory scene and the at least one rectilinear scene and in response to the detected spatial congruency being below the threshold, adjusting the spatial congruency. Corresponding system and computer program products are also disclosed.
    Type: Grant
    Filed: November 18, 2015
    Date of Patent: July 3, 2018
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Xuejing Sun, Michael Eckert
  • Patent number: 10014005
    Abstract: Embodiments are described for harmonicity estimation, audio classification, pitch determination and noise estimation. Measuring harmonicity of an audio signal includes calculation a log amplitude spectrum of audio signal. A first spectrum is derived by calculating each component of the first spectrum as a sum of components of the log amplitude spectrum on frequencies. In linear frequency scale, the frequencies are odd multiples of the component's frequency of the first spectrum. A second spectrum is derived by calculating each component of the second spectrum as a sum of components of the log amplitude spectrum on frequencies. In linear frequency scale, the frequencies are even multiples of the component's frequency of the second spectrum. A difference spectrum is derived subtracting the first spectrum from the second spectrum. A measure of harmonicity is generated as a monotonically increasing function of the maximum component of the difference spectrum within predetermined frequency range.
    Type: Grant
    Filed: March 21, 2013
    Date of Patent: July 3, 2018
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Xuejing Sun, Zhiwei Shuang, Shen Huang
  • Publication number: 20180167515
    Abstract: Example embodiments disclosed herein relate to audio signal processing based on remote user control. A method of processing an audio signal in an audio sender device is disclosed. The method includes receiving, at a current device, a control parameter from a remote device, the control parameter being generated based on a user input of the remote device and specifying a user preference for an audio signal to be transmitted to the remote device. The method also includes processing the audio signal based on the received control parameter and transmitting the processed audio signal to the remote device. Corresponding computer program product of processing an audio signal and corresponding device are also disclosed. Corresponding method in an audio receiver device and computer program product of processing an audio signal as well as corresponding device are also disclosed.
    Type: Application
    Filed: May 26, 2016
    Publication date: June 14, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Dong SHI, Xuejing SUN, Zhiwei SHUANG
  • Publication number: 20180139537
    Abstract: Example embodiments disclosed herein relate to separated audio analysis and processing. A system for processing an audio signal is disclosed. The system includes an audio analysis module configured to analyze an input audio signal to determine a processing parameter for the input audio signal, the input audio signal being represented in time domain. The system also includes an audio processing module configured to process the input audio signal in parallel with the audio analysis module. The audio processing module includes a time domain filter configured to filter the input audio signal to obtain an output audio signal in the time domain, and a filter controller configured to control a filter coefficient of the time domain filter based on the processing parameter determined by the audio analysis module. Corresponding method and computer program product of processing an audio signal are also disclosed.
    Type: Application
    Filed: May 26, 2016
    Publication date: May 17, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Dong SHI, Xuejing SUN
  • Publication number: 20180109874
    Abstract: Example embodiments disclosed herein relate to user experience oriented audio signal processing. There is provided a method for user experience oriented audio signal processing. The method includes obtaining a first audio signal from an audio sensor of an electronic device; computing, based on the first audio signal, a compensation factor for an acoustic path from the electronic device to a listener and applying the compensation factor to a second audio signal outputted from the electronic device. Corresponding system and computer program products are disclosed.
    Type: Application
    Filed: December 14, 2017
    Publication date: April 19, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Guilin MA, Xiguang ZHENG, Chen ZHANG, Xuejing SUN