Patents by Inventor Xuejing Sun
Xuejing Sun has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 10332540Abstract: Example embodiments disclosed herein relate to filter coefficient updating in time domain filtering. A method of processing an audio signal is disclosed. The method includes obtaining a predetermined number of target gains for a first portion of the audio signal by analyzing the first portion of the audio signal. Each of the target gains is corresponding to a linear subband of the audio signal. The method also includes determining a filter coefficients for time domain filtering the first portion of the audio signal so as to approximate a frequency response given by the target gains. The filter coefficients are determined by iteratively selecting at least one target gain from the target gains and updating the filter coefficient based on the selected at least one target gain. Corresponding system and computer program product for processing an audio signal are also disclosed.Type: GrantFiled: September 15, 2016Date of Patent: June 25, 2019Assignee: Dolby Laboratories Licensing CorporationInventors: Dong Shi, Xuejing Sun
-
Patent number: 10334384Abstract: A method for processing audio data, the method comprising: receiving audio data corresponding to a plurality of instances of audio, including at least one of: (a) audio data from multiple endpoints, recorded separately or (b) audio data from a single endpoint corresponding to multiple talkers and including spatial information for each of the multiple talkers; rendering the audio data in a virtual acoustic space such that each of the instances of audio has a respective different virtual position in the virtual acoustic space; and scheduling the instances of audio to be played back with a playback overlap between at least two of the instances of audio, wherein the scheduling is performed, at least in part, according to a set of perceptually-motivated rules.Type: GrantFiled: February 3, 2016Date of Patent: June 25, 2019Assignee: Dolby Laboratories Licensing CorporationInventors: Xuejing Sun, Richard J. Cartwright, Michael P. Hollier, Michael Eckert
-
Patent number: 10311891Abstract: A method, an apparatus, and logic to post-process raw gains determined by input processing to generate post-processed gains, comprising using one or both of delta gain smoothing and decision-directed gain smoothing. The delta gain smoothing comprises applying a smoothing filter to the raw gain with a smoothing factor that depends on the gain delta: the absolute value of the difference between the raw gain for the current frame and the post-processed gain for a previous frame. The decision-directed gain smoothing comprises converting the raw gain to a signal-to-noise ratio, applying a smoothing filter with a smoothing factor to the signal-to-noise ratio to calculate a smoothed signal-to-noise ratio, and converting the smoothed signal-to-noise ratio to determine the second smoothed gain, with smoothing factor possibly dependent on the gain delta.Type: GrantFiled: February 15, 2017Date of Patent: June 4, 2019Assignee: Dolby Laboratories Licensing CorporationInventors: Xuejing Sun, Glenn N. Dickins
-
Patent number: 10224046Abstract: A method, an apparatus, logic (e.g., executable instructions encoded in a non-transitory computer-readable medium to carry out a method), and a non-transitory computer-readable medium configured with such instructions. The method is to generate and spatially render spatial comfort noise at a receiving endpoint of a conference system, such that the comfort noise has target spectral characteristics typical of comfort noise, and at least one spatial property that at least substantially matches at least one target spatial property. On version includes receiving one or more or more audio signals from other endpoints, combining the received audio signals with the spatial comfort noise signals, and rendering the combination of the received audio signals and the spatial comfort noise signals to a set of output signals for loudspeakers, such that the spatial comfort noise signals are continually in the output signal sin addition to output from the received audio signals.Type: GrantFiled: March 4, 2014Date of Patent: March 5, 2019Assignees: Dolby Laboratories Licensing Corporation, Dolby International ABInventors: Glenn N. Dickins, Xuejing Sun, Yen-Liang Shue, Heiko Purnhagen
-
Patent number: 10224040Abstract: The present application relates to packet loss concealment apparatus and method, and audio processing system. According to an embodiment, the packet loss concealment apparatus is provided for concealing packet losses in a stream of audio packets, each audio packet comprising at least one audio frame in transmission format comprising at least one monaural component and at least one spatial component. The packet loss concealment apparatus may comprises a first concealment unit for creating the at least one monaural component for a lost frame in a lost packet and a second concealment unit for creating the at least one spatial component for the lost frame. According to the embodiment, spatial artifacts such as incorrect angle and diffuseness may be avoided as far as possible in PLC for multi-channel spatial or sound field encoded audio signals.Type: GrantFiled: July 2, 2014Date of Patent: March 5, 2019Assignees: Dolby Laboratories Licensing Corporation, Dolby International ABInventors: Shen Huang, Xuejing Sun, Heiko Purnhagen
-
Patent number: 10200804Abstract: Embodiments of the present invention relate to video content assisted audio object extraction. A method of audio object extraction from channel-based audio content is disclosed. The method comprises extracting at least one video object from video content associated with the channel-based audio content, and determining information about the at least one video object. The method further comprises extracting from the channel-based audio content an audio object to be rendered as an upmixed audio signal based on the determined information. Corresponding system and computer program product are also disclosed.Type: GrantFiled: February 24, 2016Date of Patent: February 5, 2019Assignee: Dolby Laboratories Licensing CorporationInventors: Lianwu Chen, Xuejing Sun, Lie Lu
-
Patent number: 10182207Abstract: The disclosure relates to handling nuisance in teleconference system. An endpoint device (400) for use in a teleconference includes an acquiring unit (401), a judging unit (402), a controller (403) and a processing unit (404). The acquiring unit acquires a media stream for presentation in the teleconference, and receives information from another device. The information includes a first estimation on whether the media stream is a nuisance to the teleconference. As the nuisance to a teleconference, audio or video signals are perceived by users as actually not relevant to the conference session or causing unpleasant feeling or confusion. The judging unit decides whether the media stream is the nuisance at least based on the information. The controller controls the processing of the media stream to degrade or suppress the presentation of the media stream in case that the media stream is decided as the nuisance. The processing unit processes the media stream under the control of the controller.Type: GrantFiled: February 16, 2016Date of Patent: January 15, 2019Assignee: Dolby Laboratories Licensing CorporationInventors: Taoran Lu, Hariharan Ganapathy-Kathirvelu, Peng Yin, Glenn N. Dickins, Xuejing Sun
-
Patent number: 10142049Abstract: A method of determining a near optimal forward error correction scheme for the transmission of audio data over a lossy packet switched network having preallocated estimated bandwidth, delay and packet losses, between at least a first and second communications devices, the method including the steps of: determining a first coding rate for the audio data; determining a peak redundancy coding rate for redundant versions of the audio data; determining an average redundancy coding rate over a period of time for redundant versions of the audio data; determining an objective function which maximizes a bitrate-perceptual audio quality mapping of the transmitted audio data including a playout function formulation; and optimizing the objective function to produce a forward error correction scheme providing a high bitrate perceptual audio quality.Type: GrantFiled: October 7, 2016Date of Patent: November 27, 2018Assignee: Dolby Laboratories Licensing CorporationInventors: Xuejing Sun, Dong Shi
-
Publication number: 20180336902Abstract: Various disclosed implementations involve processing and/or playback of a recording of a conference involving a plurality of conference participants. Some implementations disclosed herein involve analyzing conversational dynamics of the conference recording. Some examples may involve searching the conference recording to determine instances of segment classifications. The segment classifications may be based, at least in part, on conversational dynamics data. Some implementations may involve segmenting the conference recording into a plurality of segments, each of the segments corresponding with a time interval and at least one of the segment classifications. Some implementations allow a listener to scan through a conference recording quickly according to segments, words, topics and/or talkers of interest.Type: ApplicationFiled: February 3, 2016Publication date: November 22, 2018Applicant: DOLBY LABORATORIES LICENSING CORPORATIONInventors: Richard J. CARTWRIGHT, Kai LI, Xuejing SUN
-
Patent number: 10103999Abstract: Some implementations involve controlling a jitter buffer size during a teleconference according to a jitter buffer size estimation algorithm based, at least in part, on a cumulative distribution function (CDF). The CDF may be based, at least in part, on a network jitter parameter. The CDF may be initialized according to a parametric model. At least one parameter of the parametric model may be based, at least in part, on legacy network jitter information.Type: GrantFiled: April 8, 2015Date of Patent: October 16, 2018Assignee: Dolby Laboratories Licensing CorporationInventors: JiaQuan Huo, Xuejing Sun, Kai Li
-
Publication number: 20180287910Abstract: In a packet switched voice delivery application which utilizes a jitter buffer for the delivery of sequential packet data, a method of determining a measure of the output jitter of taking packets out of the buffer, the method including the step of: (a) forming a pull jitter measure comprising the differential fetch times between sequential pull packets dived by an expected time interval between packets.Type: ApplicationFiled: September 27, 2016Publication date: October 4, 2018Applicant: Dolby Laboratories Licensing CorporationInventors: Xuejing Sun, JiaQuan Huo, Paul Holmberg
-
Publication number: 20180279063Abstract: A method for processing audio data, the method comprising: receiving audio data corresponding to a plurality of instances of audio, including at least one of: (a) audio data from multiple endpoints, recorded separately or (b) audio data from a single endpoint corresponding to multiple talkers and including spatial information for each of the multiple talkers; rendering the audio data in a virtual acoustic space such that each of the instances of audio has a respective different virtual position in the virtual acoustic space; and scheduling the instances of audio to be played back with a playback overlap between at least two of the instances of audio, wherein the scheduling is performed, at least in part, according to a set of perceptually-motivated rules.Type: ApplicationFiled: February 3, 2016Publication date: September 27, 2018Applicant: DOLBY LABORATORIES LICENSING CORPORATIONInventors: Xuejing SUN, Richard J. CARTWRIGHT, Michael P. HOLLIER, Michael ECKERT
-
Publication number: 20180254053Abstract: Example embodiments disclosed herein relate to filter coefficient updating in time domain filtering. A method of processing an audio signal is disclosed. The method includes obtaining a predetermined number of target gains for a first portion of the audio signal by analyzing the first portion of the audio signal. Each of the target gains is corresponding to a linear subband of the audio signal. The method also includes determining a filter coefficients for time domain filtering the first portion of the audio signal so as to approximate a frequency response given by the target gains. The filter coefficients are determined by iteratively selecting at least one target gain from the target gains and updating the filter coefficient based on the selected at least one target gain. Corresponding system and computer program product for processing an audio signal are also disclosed.Type: ApplicationFiled: September 15, 2016Publication date: September 6, 2018Applicant: DOLBY LABORATORIES LICENSING CORPORATIONInventors: Dong SHI, Xuejing SUN
-
Publication number: 20180191912Abstract: Various disclosed implementations involve processing and/or playback of a recording of a conference involving a plurality of conference participants. Some implementations disclosed herein involve receiving audio data corresponding to a recording of at least one conference involving a plurality of conference participants. In some examples, only a portion of the received audio data will be selected as playback audio data. The selection process may involve a topic selection process, a talkspurt filtering process and/or an acoustic feature selection process. Some examples involve receiving an indication of a target playback time duration. Selecting the portion of audio data may involve making a time duration of the playback audio data within a threshold time difference of the target playback time duration.Type: ApplicationFiled: February 3, 2016Publication date: July 5, 2018Applicant: DOLBY LABORATORIES LICENSING CORPORATIONInventors: Richard J. CARTWRIGHT, Xuejing SUN
-
Publication number: 20180190266Abstract: Various disclosed implementations involve processing and/or playback of a recording of a conference involving a plurality of conference participants. Some implementations disclosed herein involve receiving speech recognition results data, including a plurality of speech recognition lattices and a word recognition confidence score for each of a plurality of hypothesized words of the speech recognition lattices, for a conference recording. A primary word candidate and alternative word hypotheses may be determined for hypothesized words in the speech recognition lattices. A term frequency metric may be calculated for sorting the primary word candidates and the alternative word hypotheses. Hypothesized words may be rescored according to an alternative hypothesis list.Type: ApplicationFiled: February 3, 2016Publication date: July 5, 2018Applicant: DOLBY LABORATORIES LICENSING CORPORATIONInventors: Xuejing SUN, Richard J. CARTWRIGHT
-
Patent number: 10015443Abstract: Example embodiments disclosed herein relate to spatial congruency adjustment. A method for adjusting spatial congruency in a video conference is disclosed. The method in unwarping a visual scene captured by a video endpoint device into at least one rectilinear scene, the video endpoint device being configured to capture the visual scene in an omnidirectional manner, detecting spatial congruency between the at least one rectilinear scene and an auditory scene captured by an audio endpoint device that is positioned in relation to the video endpoint device. The spatial congruency being a degree of alignment between the auditory scene and the at least one rectilinear scene and in response to the detected spatial congruency being below the threshold, adjusting the spatial congruency. Corresponding system and computer program products are also disclosed.Type: GrantFiled: November 18, 2015Date of Patent: July 3, 2018Assignee: Dolby Laboratories Licensing CorporationInventors: Xuejing Sun, Michael Eckert
-
Patent number: 10014005Abstract: Embodiments are described for harmonicity estimation, audio classification, pitch determination and noise estimation. Measuring harmonicity of an audio signal includes calculation a log amplitude spectrum of audio signal. A first spectrum is derived by calculating each component of the first spectrum as a sum of components of the log amplitude spectrum on frequencies. In linear frequency scale, the frequencies are odd multiples of the component's frequency of the first spectrum. A second spectrum is derived by calculating each component of the second spectrum as a sum of components of the log amplitude spectrum on frequencies. In linear frequency scale, the frequencies are even multiples of the component's frequency of the second spectrum. A difference spectrum is derived subtracting the first spectrum from the second spectrum. A measure of harmonicity is generated as a monotonically increasing function of the maximum component of the difference spectrum within predetermined frequency range.Type: GrantFiled: March 21, 2013Date of Patent: July 3, 2018Assignee: Dolby Laboratories Licensing CorporationInventors: Xuejing Sun, Zhiwei Shuang, Shen Huang
-
Publication number: 20180167515Abstract: Example embodiments disclosed herein relate to audio signal processing based on remote user control. A method of processing an audio signal in an audio sender device is disclosed. The method includes receiving, at a current device, a control parameter from a remote device, the control parameter being generated based on a user input of the remote device and specifying a user preference for an audio signal to be transmitted to the remote device. The method also includes processing the audio signal based on the received control parameter and transmitting the processed audio signal to the remote device. Corresponding computer program product of processing an audio signal and corresponding device are also disclosed. Corresponding method in an audio receiver device and computer program product of processing an audio signal as well as corresponding device are also disclosed.Type: ApplicationFiled: May 26, 2016Publication date: June 14, 2018Applicant: DOLBY LABORATORIES LICENSING CORPORATIONInventors: Dong SHI, Xuejing SUN, Zhiwei SHUANG
-
Publication number: 20180139537Abstract: Example embodiments disclosed herein relate to separated audio analysis and processing. A system for processing an audio signal is disclosed. The system includes an audio analysis module configured to analyze an input audio signal to determine a processing parameter for the input audio signal, the input audio signal being represented in time domain. The system also includes an audio processing module configured to process the input audio signal in parallel with the audio analysis module. The audio processing module includes a time domain filter configured to filter the input audio signal to obtain an output audio signal in the time domain, and a filter controller configured to control a filter coefficient of the time domain filter based on the processing parameter determined by the audio analysis module. Corresponding method and computer program product of processing an audio signal are also disclosed.Type: ApplicationFiled: May 26, 2016Publication date: May 17, 2018Applicant: Dolby Laboratories Licensing CorporationInventors: Dong SHI, Xuejing SUN
-
Publication number: 20180109874Abstract: Example embodiments disclosed herein relate to user experience oriented audio signal processing. There is provided a method for user experience oriented audio signal processing. The method includes obtaining a first audio signal from an audio sensor of an electronic device; computing, based on the first audio signal, a compensation factor for an acoustic path from the electronic device to a listener and applying the compensation factor to a second audio signal outputted from the electronic device. Corresponding system and computer program products are disclosed.Type: ApplicationFiled: December 14, 2017Publication date: April 19, 2018Applicant: Dolby Laboratories Licensing CorporationInventors: Guilin MA, Xiguang ZHENG, Chen ZHANG, Xuejing SUN