Patents by Inventor Lianwu CHEN

Lianwu CHEN has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

SPEECH SIGNAL PROCESSING MODEL TRAINING METHOD, ELECTRONIC DEVICE AND STORAGE MEDIUM

Publication number: 20200051549

Abstract: Embodiments of the present invention provide a speech signal processing model training method, an electronic device and a storage medium. The embodiments of the present invention determines a target training loss function based on a training loss function of each of one or more speech signal processing tasks; inputs a task input feature of each speech signal processing task into a starting multi-task neural network, and updates model parameters of a shared layer and each of one or more task layers of the starting multi-task neural network corresponding to the one or more speech signal processing tasks by minimizing the target training loss function as a training objective, until the starting multi-task neural network converges, to obtain a speech signal processing model.

Type: Application

Filed: October 17, 2019

Publication date: February 13, 2020

Applicant: Tencent Technology (Shenzhen) Company Limited

Inventors: Lianwu Chen, Meng Yu, Min Luo, Dan Su
Spatial error metrics of audio content

Patent number: 10492014

Abstract: Audio objects that are present in input audio content in one or more frames are determined. Output clusters that are present in output audio content in the one or more frames are also determined. Here, the audio objects in the input audio content are converted to the output clusters in the output audio content. One or more spatial error metrics are computed based at least in part on positional metadata of the audio objects and positional metadata of the output clusters.

Type: Grant

Filed: January 5, 2015

Date of Patent: November 26, 2019

Assignees: Dolby Laboratories Licensing Corporation, Dolby International AB

Inventors: Dirk Jeroen Breebaart, Lianwu Chen, Lie Lu, Antonio Mateos Sole, Nicolas R. Tsingos
Upmixing of audio signals

Patent number: 10362426

Abstract: Example embodiments disclosed herein relates to upmixing of audio signals. A method of upmixing an audio signal is described. The method includes decomposing the audio signal into a diffuse signal and a direct signal, generating an audio bed at least in part based on the diffuse signal, the audio bed including a height channel, extracting an audio object from the direct signal, estimating metadata of the audio object, the metadata including height information of the audio object; and rendering the audio bed and the audio object as an upmixed audio signal, wherein the audio bed is rendered to a predefined position and the audio object is rendered according to the metadata. Corresponding system and computer program product are described as well.

Type: Grant

Filed: February 9, 2016

Date of Patent: July 23, 2019

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Jun Wang, Lie Lu, Lianwu Chen, Mingqing Hu
AUDIO OBJECT CLUSTERING BASED ON RENDERER-AWARE PERCEPTUAL DIFFERENCE

Publication number: 20190182612

Abstract: Example embodiments disclosed herein relate to audio object clustering based on renderer-aware perceptual difference. A method of processing audio objects is provided. The method includes obtaining renderer-related information indicating a configuration of a renderer. The method also includes determining, based on the obtained renderer-related information, a rendering difference between a first audio object and a second audio object among the audio objects with respect to the renderer. The method further includes clustering the audio objects at least in part based on the rendering difference. Corresponding system, device, and computer program product are also disclosed.

Type: Application

Filed: July 13, 2017

Publication date: June 13, 2019

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Lianwu CHEN, Lie LU, Dirk Jeroen BREEBAART
Audio object clustering with single channel quality preservation

Patent number: 10278000

Abstract: Example embodiments disclosed herein relate to audio object clustering with single channel quality preservation. A method of clustering audio objects is disclosed. The method includes determining cluster positions based on object positions of the audio objects and a reference speaker layout, the reference speaker layout indicating speakers located at different speaker positions. The method also includes determining object-to-cluster gains based on the determined cluster positions, the object positions and the reference speaker layout, an object-to-cluster gain defining a proportion of the respective audio object that is assigned to a cluster associated with one of the determined cluster positions. The method further includes clustering the audio objects based on the object-to-cluster gains and the cluster positions for generating cluster signals. Corresponding system, computer program product and device for clustering audio objects are also disclosed.

Type: Grant

Filed: December 12, 2016

Date of Patent: April 30, 2019

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Dirk Jeroen Breebaart, Lianwu Chen, Lie Lu
Processing object-based audio signals

Patent number: 10277997

Abstract: Example embodiments disclosed herein relate to audio signal processing. The audio signal has multiple audio objects. A method of processing an audio signal is disclosed. The method includes obtaining an object position for each of the audio objects; and determining cluster positions for grouping the audio objects into clusters based on the object positions, a plurality of object-to-cluster gains, and a set of metrics. The metrics indicate a quality of the cluster positions and a quality of the object-to-cluster gains, each of the cluster positions is a centroid of a respective one of the clusters, and one of the object-to-cluster gains defines a ratio of the respective audio object in one of the clusters. The method also includes determining the object-to-cluster gains based on the object positions, the cluster positions and the set of metrics; and generating a cluster signal based on the determined cluster positions and object-to-cluster gains.

Type: Grant

Filed: August 4, 2016

Date of Patent: April 30, 2019

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Lianwu Chen, Lie Lu, Dirk Jeroen Breebaart
Projection-based audio object extraction from audio content

Patent number: 10275685

Abstract: A method is disclosed for audio object extraction from an audio content which includes identifying a first set of projection spaces including a first subset for a first channel and a second subset for a second channel of the plurality of channels. The method may further include determining a first set of correlations between the first and second channels, each of the first set of correlations corresponding to one of the first subset of projection spaces and one of the second subset of projection spaces. Still further, the method may include extracting an audio object from an audio signal of the first channel at least in part based on a first correlation among the first set of correlations and the projection space from the first subset corresponding to the first correlation, the first correlation being greater than a first predefined threshold. Corresponding system and computer program products are also disclosed.

Type: Grant

Filed: December 18, 2015

Date of Patent: April 30, 2019

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Mingqing Hu, Lie Lu, Lianwu Chen
UPMIXING OF AUDIO SIGNALS

Publication number: 20190052991

Abstract: Example embodiments disclosed herein relates to upmixing of audio signals. A method of upmixing an audio signal is described. The method includes decomposing the audio signal into a diffuse signal and a direct signal, generating an audio bed at least in part based on the diffuse signal, the audio bed including a height channel, extracting an audio object from the direct signal, estimating metadata of the audio object, the metadata including height information of the audio object; and rendering the audio bed and the audio object as an upmixed audio signal, wherein the audio bed is rendered to a predefined position and the audio object is rendered according to the metadata. Corresponding system and computer program product are described as well.

Type: Application

Filed: February 9, 2016

Publication date: February 14, 2019

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Jun WANG, Lie LU, Lianwu CHEN, Mingqing HU
Video content assisted audio object extraction

Patent number: 10200804

Abstract: Embodiments of the present invention relate to video content assisted audio object extraction. A method of audio object extraction from channel-based audio content is disclosed. The method comprises extracting at least one video object from video content associated with the channel-based audio content, and determining information about the at least one video object. The method further comprises extracting from the channel-based audio content an audio object to be rendered as an upmixed audio signal based on the determined information. Corresponding system and computer program product are also disclosed.

Type: Grant

Filed: February 24, 2016

Date of Patent: February 5, 2019

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Lianwu Chen, Xuejing Sun, Lie Lu
UPMIXING OF AUDIO SIGNALS

Publication number: 20180262856

Abstract: Example embodiments disclosed herein relates to upmixing of audio signals. A method of upmixing an audio signal is described. The method includes decomposing the audio signal into a diffuse signal and a direct signal, generating an audio bed at least in part based on the diffuse signal, the audio bed including a height channel, extracting an audio object from the direct signal, estimating metadata of the audio object, the metadata including height information of the audio object; and rendering the audio bed and the audio object as an upmixed audio signal, wherein the audio bed is rendered to a predefined position and the audio object is rendered according to the metadata. Corresponding system and computer program product are described as well.

Type: Application

Filed: February 9, 2016

Publication date: September 13, 2018

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Jun WANG, Lie LU, Lianwu CHEN, Mingqing HU
Processing Object-Based Audio Signals

Publication number: 20180227691

Abstract: Example embodiments disclosed herein relate to audio signal processing. The audio signal has multiple audio objects. A method of processing an audio signal is disclosed. The method includes obtaining an object position for each of the audio objects; and determining cluster positions for grouping the audio objects into clusters based on the object positions, a plurality of object-to-cluster gains, and a set of metrics. The metrics indicate a quality of the cluster positions and a quality of the object-to-cluster gains, each of the cluster positions is a centroid of a respective one of the clusters, and one of the object-to-cluster gains defines a ratio of the respective audio object in one of the clusters. The method also includes determining the object-to-cluster gains based on the object positions, the cluster positions and the set of metrics; and generating a cluster signal based on the determined cluster positions and object-to-cluster gains.

Type: Application

Filed: August 4, 2016

Publication date: August 9, 2018

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Lianwu Chen, Lie Lu, Dirk Jeroen Breebaart
AUDIO OBJECT EXTRACTION WITH SUB-BAND OBJECT PROBABILITY ESTIMATION

Publication number: 20180103333

Abstract: Embodiments of the example embodiment relate to audio object extraction. A method for audio object extraction from audio content is disclosed. The method comprises determining a sub-band object probability for a sub-band of the audio signal in a frame of the audio content, the sub-band object probability indicating a probability of the sub-band of the audio signal containing an audio object. The method further comprises splitting the sub-band of the audio signal into an audio object portion and a residual audio portion based on the determined sub-band object probability. Corresponding system and computer program product are also disclosed.

Type: Application

Filed: October 16, 2017

Publication date: April 12, 2018

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Lianwu CHEN, Lie LU
Video Content Assisted Audio Object Extraction

Publication number: 20180054689

Abstract: Embodiments of the present invention relate to video content assisted audio object extraction. A method of audio object extraction from channel-based audio content is disclosed. The method comprises extracting at least one video object from video content associated with the channel-based audio content, and determining information about the at least one video object. The method further comprises extracting from the channel-based audio content an audio object to be rendered as an upmixed audio signal based on the determined information. Corresponding system and computer program product are also disclosed.

Type: Application

Filed: February 24, 2016

Publication date: February 22, 2018

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Lianwu CHEN, Xuejing SUN, Lie LU
Projection-Based Audio Object Extraction from Audio Content

Publication number: 20170344852

Abstract: A method is disclosed for audio object extraction from an audio content which includes identifying a first set of projection spaces including a first subset for a first channel and a second subset for a second channel of the plurality of channels. The method may further include determining a first set of correlations between the first and second channels, each of the first set of correlations corresponding to one of the first subset of projection spaces and one of the second subset of projection spaces. Still further, the method may include extracting an audio object from an audio signal of the first channel at least in part based on a first correlation among the first set of correlations and the projection space from the first subset corresponding to the first correlation, the first correlation being greater than a first predefined threshold. Corresponding system and computer program products are also disclosed.

Type: Application

Filed: December 18, 2015

Publication date: November 30, 2017

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Mingqing HU, Lie LU, Lianwu CHEN
Audio object clustering by utilizing temporal variations of audio objects

Patent number: 9830922

Abstract: Embodiments of the present invention relate to audio object clustering by utilizing temporal variation of audio objects. There is provided a method of estimating temporal variation of an audio object for use in audio object clustering. The method comprises obtaining at least one segment of an audio track associated with the audio object, the at least one segment containing the audio object; estimating variation of the audio object over a time duration of the at least one segment based on at least one property of the audio object and adjusting, at least partially based on the estimated variation of the audio object, a contribution of the audio object to the determination of a centroid in the audio object clustering. Corresponding system and computer program product are disclosed.

Type: Grant

Filed: February 23, 2015

Date of Patent: November 28, 2017

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Lianwu Chen, Lie Lu, Dirk Jeroen Breebaart
METADATA-PRESERVED AUDIO OBJECT CLUSTERING

Publication number: 20170339506

Abstract: Example embodiments disclosed herein relate to audio object clustering. A method for metadata-preserved audio object clustering is disclosed. The method comprises classifying a plurality of audio objects into a number of categories based on information to be preserved in metadata associated with the plurality of audio objects. The method further comprises assigning a predetermined number of clusters to the categories and allocating an audio object in each of the categories to at least one of the clusters according to the assigning. Corresponding system and computer program product are also disclosed.

Type: Application

Filed: December 10, 2015

Publication date: November 23, 2017

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Lianwu CHEN, Lie LU, Nicolas R. TSINGOS
Audio object extraction with sub-band object probability estimation

Patent number: 9820077

Abstract: Embodiments of the example embodiment relate to audio object extraction. A method for audio object extraction from audio content is disclosed. The method comprises determining a sub-band object probability for a sub-band of the audio signal in a frame of the audio content, the sub-band object probability indicating a probability of the sub-band of the audio signal containing an audio object. The method further comprises splitting the sub-band of the audio signal into an audio object portion and a residual audio portion based on the determined sub-band object probability. Corresponding system and computer program product are also disclosed.

Type: Grant

Filed: July 23, 2015

Date of Patent: November 14, 2017

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Lianwu Chen, Lie Lu
Object clustering for rendering object-based audio content based on perceptual criteria

Patent number: 9805725

Abstract: Embodiments are directed a method of rendering object-based audio comprising determining an initial spatial position of objects having object audio data and associated metadata, determining a perceptual importance of the objects, and grouping the audio objects into a number of clusters based on the determined perceptual importance of the objects, such that a spatial error caused by moving an object from an initial spatial position to a second spatial position in a cluster is minimized for objects with a relatively high perceptual importance. The perceptual importance is based at least in part by a partial loudness of an object and content semantics of the object.

Type: Grant

Filed: November 25, 2013

Date of Patent: October 31, 2017

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Brett G. Crockett, Alan J. Seefeldt, Nicolas R. Tsingos, Rhonda Wilson, Dirk Jeroen Breebaart, Lie Lu, Lianwu Chen
AUDIO OBJECT EXTRACTION WITH SUB-BAND OBJECT PROBABILITY ESTIMATION

Publication number: 20170215019

Abstract: Embodiments of the example embodiment relate to audio object extraction. A method for audio object extraction from audio content is disclosed. The method comprises determining a sub-band object probability for a sub-band of the audio signal in a frame of the audio content, the sub-band object probability indicating a probability of the sub-band of the audio signal containing an audio object. The method further comprises splitting the sub-band of the audio signal into an audio object portion and a residual audio portion based on the determined sub-band object probability. Corresponding system and computer program product are also disclosed.

Type: Application

Filed: July 23, 2015

Publication date: July 27, 2017

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Lianwu CHEN, Lie LU
Audio Object Clustering with Single Channel Quality Preservation

Publication number: 20170171687

Abstract: Example embodiments disclosed herein relate to audio object clustering with single channel quality preservation. A method of clustering audio objects is disclosed. The method includes determining cluster positions based on object positions of the audio objects and a reference speaker layout, the reference speaker layout indicating speakers located at different speaker positions. The method also includes determining object-to-cluster gains based on the determined cluster positions, the object positions and the reference speaker layout, an object-to-cluster gain defining a proportion of the respective audio object that is assigned to a cluster associated with one of the determined cluster positions. The method further includes clustering the audio objects based on the object-to-cluster gains and the cluster positions for generating cluster signals. Corresponding system, computer program product and device for clustering audio objects are also disclosed.

Type: Application

Filed: December 12, 2016

Publication date: June 15, 2017

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Dirk Jeroen Breebaart, Lianwu Chen, Lie Lu

prev 1 2 3 next