Patents by Inventor Lianwu CHEN

Lianwu CHEN has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20200051549
    Abstract: Embodiments of the present invention provide a speech signal processing model training method, an electronic device and a storage medium. The embodiments of the present invention determines a target training loss function based on a training loss function of each of one or more speech signal processing tasks; inputs a task input feature of each speech signal processing task into a starting multi-task neural network, and updates model parameters of a shared layer and each of one or more task layers of the starting multi-task neural network corresponding to the one or more speech signal processing tasks by minimizing the target training loss function as a training objective, until the starting multi-task neural network converges, to obtain a speech signal processing model.
    Type: Application
    Filed: October 17, 2019
    Publication date: February 13, 2020
    Applicant: Tencent Technology (Shenzhen) Company Limited
    Inventors: Lianwu Chen, Meng Yu, Min Luo, Dan Su
  • Patent number: 10492014
    Abstract: Audio objects that are present in input audio content in one or more frames are determined. Output clusters that are present in output audio content in the one or more frames are also determined. Here, the audio objects in the input audio content are converted to the output clusters in the output audio content. One or more spatial error metrics are computed based at least in part on positional metadata of the audio objects and positional metadata of the output clusters.
    Type: Grant
    Filed: January 5, 2015
    Date of Patent: November 26, 2019
    Assignees: Dolby Laboratories Licensing Corporation, Dolby International AB
    Inventors: Dirk Jeroen Breebaart, Lianwu Chen, Lie Lu, Antonio Mateos Sole, Nicolas R. Tsingos
  • Patent number: 10362426
    Abstract: Example embodiments disclosed herein relates to upmixing of audio signals. A method of upmixing an audio signal is described. The method includes decomposing the audio signal into a diffuse signal and a direct signal, generating an audio bed at least in part based on the diffuse signal, the audio bed including a height channel, extracting an audio object from the direct signal, estimating metadata of the audio object, the metadata including height information of the audio object; and rendering the audio bed and the audio object as an upmixed audio signal, wherein the audio bed is rendered to a predefined position and the audio object is rendered according to the metadata. Corresponding system and computer program product are described as well.
    Type: Grant
    Filed: February 9, 2016
    Date of Patent: July 23, 2019
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Jun Wang, Lie Lu, Lianwu Chen, Mingqing Hu
  • Publication number: 20190182612
    Abstract: Example embodiments disclosed herein relate to audio object clustering based on renderer-aware perceptual difference. A method of processing audio objects is provided. The method includes obtaining renderer-related information indicating a configuration of a renderer. The method also includes determining, based on the obtained renderer-related information, a rendering difference between a first audio object and a second audio object among the audio objects with respect to the renderer. The method further includes clustering the audio objects at least in part based on the rendering difference. Corresponding system, device, and computer program product are also disclosed.
    Type: Application
    Filed: July 13, 2017
    Publication date: June 13, 2019
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Lianwu CHEN, Lie LU, Dirk Jeroen BREEBAART
  • Patent number: 10278000
    Abstract: Example embodiments disclosed herein relate to audio object clustering with single channel quality preservation. A method of clustering audio objects is disclosed. The method includes determining cluster positions based on object positions of the audio objects and a reference speaker layout, the reference speaker layout indicating speakers located at different speaker positions. The method also includes determining object-to-cluster gains based on the determined cluster positions, the object positions and the reference speaker layout, an object-to-cluster gain defining a proportion of the respective audio object that is assigned to a cluster associated with one of the determined cluster positions. The method further includes clustering the audio objects based on the object-to-cluster gains and the cluster positions for generating cluster signals. Corresponding system, computer program product and device for clustering audio objects are also disclosed.
    Type: Grant
    Filed: December 12, 2016
    Date of Patent: April 30, 2019
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Dirk Jeroen Breebaart, Lianwu Chen, Lie Lu
  • Patent number: 10277997
    Abstract: Example embodiments disclosed herein relate to audio signal processing. The audio signal has multiple audio objects. A method of processing an audio signal is disclosed. The method includes obtaining an object position for each of the audio objects; and determining cluster positions for grouping the audio objects into clusters based on the object positions, a plurality of object-to-cluster gains, and a set of metrics. The metrics indicate a quality of the cluster positions and a quality of the object-to-cluster gains, each of the cluster positions is a centroid of a respective one of the clusters, and one of the object-to-cluster gains defines a ratio of the respective audio object in one of the clusters. The method also includes determining the object-to-cluster gains based on the object positions, the cluster positions and the set of metrics; and generating a cluster signal based on the determined cluster positions and object-to-cluster gains.
    Type: Grant
    Filed: August 4, 2016
    Date of Patent: April 30, 2019
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Lianwu Chen, Lie Lu, Dirk Jeroen Breebaart
  • Patent number: 10275685
    Abstract: A method is disclosed for audio object extraction from an audio content which includes identifying a first set of projection spaces including a first subset for a first channel and a second subset for a second channel of the plurality of channels. The method may further include determining a first set of correlations between the first and second channels, each of the first set of correlations corresponding to one of the first subset of projection spaces and one of the second subset of projection spaces. Still further, the method may include extracting an audio object from an audio signal of the first channel at least in part based on a first correlation among the first set of correlations and the projection space from the first subset corresponding to the first correlation, the first correlation being greater than a first predefined threshold. Corresponding system and computer program products are also disclosed.
    Type: Grant
    Filed: December 18, 2015
    Date of Patent: April 30, 2019
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Mingqing Hu, Lie Lu, Lianwu Chen
  • Publication number: 20190052991
    Abstract: Example embodiments disclosed herein relates to upmixing of audio signals. A method of upmixing an audio signal is described. The method includes decomposing the audio signal into a diffuse signal and a direct signal, generating an audio bed at least in part based on the diffuse signal, the audio bed including a height channel, extracting an audio object from the direct signal, estimating metadata of the audio object, the metadata including height information of the audio object; and rendering the audio bed and the audio object as an upmixed audio signal, wherein the audio bed is rendered to a predefined position and the audio object is rendered according to the metadata. Corresponding system and computer program product are described as well.
    Type: Application
    Filed: February 9, 2016
    Publication date: February 14, 2019
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Jun WANG, Lie LU, Lianwu CHEN, Mingqing HU
  • Patent number: 10200804
    Abstract: Embodiments of the present invention relate to video content assisted audio object extraction. A method of audio object extraction from channel-based audio content is disclosed. The method comprises extracting at least one video object from video content associated with the channel-based audio content, and determining information about the at least one video object. The method further comprises extracting from the channel-based audio content an audio object to be rendered as an upmixed audio signal based on the determined information. Corresponding system and computer program product are also disclosed.
    Type: Grant
    Filed: February 24, 2016
    Date of Patent: February 5, 2019
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Lianwu Chen, Xuejing Sun, Lie Lu
  • Publication number: 20180262856
    Abstract: Example embodiments disclosed herein relates to upmixing of audio signals. A method of upmixing an audio signal is described. The method includes decomposing the audio signal into a diffuse signal and a direct signal, generating an audio bed at least in part based on the diffuse signal, the audio bed including a height channel, extracting an audio object from the direct signal, estimating metadata of the audio object, the metadata including height information of the audio object; and rendering the audio bed and the audio object as an upmixed audio signal, wherein the audio bed is rendered to a predefined position and the audio object is rendered according to the metadata. Corresponding system and computer program product are described as well.
    Type: Application
    Filed: February 9, 2016
    Publication date: September 13, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Jun WANG, Lie LU, Lianwu CHEN, Mingqing HU
  • Publication number: 20180227691
    Abstract: Example embodiments disclosed herein relate to audio signal processing. The audio signal has multiple audio objects. A method of processing an audio signal is disclosed. The method includes obtaining an object position for each of the audio objects; and determining cluster positions for grouping the audio objects into clusters based on the object positions, a plurality of object-to-cluster gains, and a set of metrics. The metrics indicate a quality of the cluster positions and a quality of the object-to-cluster gains, each of the cluster positions is a centroid of a respective one of the clusters, and one of the object-to-cluster gains defines a ratio of the respective audio object in one of the clusters. The method also includes determining the object-to-cluster gains based on the object positions, the cluster positions and the set of metrics; and generating a cluster signal based on the determined cluster positions and object-to-cluster gains.
    Type: Application
    Filed: August 4, 2016
    Publication date: August 9, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Lianwu Chen, Lie Lu, Dirk Jeroen Breebaart
  • Publication number: 20180103333
    Abstract: Embodiments of the example embodiment relate to audio object extraction. A method for audio object extraction from audio content is disclosed. The method comprises determining a sub-band object probability for a sub-band of the audio signal in a frame of the audio content, the sub-band object probability indicating a probability of the sub-band of the audio signal containing an audio object. The method further comprises splitting the sub-band of the audio signal into an audio object portion and a residual audio portion based on the determined sub-band object probability. Corresponding system and computer program product are also disclosed.
    Type: Application
    Filed: October 16, 2017
    Publication date: April 12, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Lianwu CHEN, Lie LU
  • Publication number: 20180054689
    Abstract: Embodiments of the present invention relate to video content assisted audio object extraction. A method of audio object extraction from channel-based audio content is disclosed. The method comprises extracting at least one video object from video content associated with the channel-based audio content, and determining information about the at least one video object. The method further comprises extracting from the channel-based audio content an audio object to be rendered as an upmixed audio signal based on the determined information. Corresponding system and computer program product are also disclosed.
    Type: Application
    Filed: February 24, 2016
    Publication date: February 22, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Lianwu CHEN, Xuejing SUN, Lie LU
  • Publication number: 20170344852
    Abstract: A method is disclosed for audio object extraction from an audio content which includes identifying a first set of projection spaces including a first subset for a first channel and a second subset for a second channel of the plurality of channels. The method may further include determining a first set of correlations between the first and second channels, each of the first set of correlations corresponding to one of the first subset of projection spaces and one of the second subset of projection spaces. Still further, the method may include extracting an audio object from an audio signal of the first channel at least in part based on a first correlation among the first set of correlations and the projection space from the first subset corresponding to the first correlation, the first correlation being greater than a first predefined threshold. Corresponding system and computer program products are also disclosed.
    Type: Application
    Filed: December 18, 2015
    Publication date: November 30, 2017
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Mingqing HU, Lie LU, Lianwu CHEN
  • Patent number: 9830922
    Abstract: Embodiments of the present invention relate to audio object clustering by utilizing temporal variation of audio objects. There is provided a method of estimating temporal variation of an audio object for use in audio object clustering. The method comprises obtaining at least one segment of an audio track associated with the audio object, the at least one segment containing the audio object; estimating variation of the audio object over a time duration of the at least one segment based on at least one property of the audio object and adjusting, at least partially based on the estimated variation of the audio object, a contribution of the audio object to the determination of a centroid in the audio object clustering. Corresponding system and computer program product are disclosed.
    Type: Grant
    Filed: February 23, 2015
    Date of Patent: November 28, 2017
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Lianwu Chen, Lie Lu, Dirk Jeroen Breebaart
  • Publication number: 20170339506
    Abstract: Example embodiments disclosed herein relate to audio object clustering. A method for metadata-preserved audio object clustering is disclosed. The method comprises classifying a plurality of audio objects into a number of categories based on information to be preserved in metadata associated with the plurality of audio objects. The method further comprises assigning a predetermined number of clusters to the categories and allocating an audio object in each of the categories to at least one of the clusters according to the assigning. Corresponding system and computer program product are also disclosed.
    Type: Application
    Filed: December 10, 2015
    Publication date: November 23, 2017
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Lianwu CHEN, Lie LU, Nicolas R. TSINGOS
  • Patent number: 9820077
    Abstract: Embodiments of the example embodiment relate to audio object extraction. A method for audio object extraction from audio content is disclosed. The method comprises determining a sub-band object probability for a sub-band of the audio signal in a frame of the audio content, the sub-band object probability indicating a probability of the sub-band of the audio signal containing an audio object. The method further comprises splitting the sub-band of the audio signal into an audio object portion and a residual audio portion based on the determined sub-band object probability. Corresponding system and computer program product are also disclosed.
    Type: Grant
    Filed: July 23, 2015
    Date of Patent: November 14, 2017
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Lianwu Chen, Lie Lu
  • Patent number: 9805725
    Abstract: Embodiments are directed a method of rendering object-based audio comprising determining an initial spatial position of objects having object audio data and associated metadata, determining a perceptual importance of the objects, and grouping the audio objects into a number of clusters based on the determined perceptual importance of the objects, such that a spatial error caused by moving an object from an initial spatial position to a second spatial position in a cluster is minimized for objects with a relatively high perceptual importance. The perceptual importance is based at least in part by a partial loudness of an object and content semantics of the object.
    Type: Grant
    Filed: November 25, 2013
    Date of Patent: October 31, 2017
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Brett G. Crockett, Alan J. Seefeldt, Nicolas R. Tsingos, Rhonda Wilson, Dirk Jeroen Breebaart, Lie Lu, Lianwu Chen
  • Publication number: 20170215019
    Abstract: Embodiments of the example embodiment relate to audio object extraction. A method for audio object extraction from audio content is disclosed. The method comprises determining a sub-band object probability for a sub-band of the audio signal in a frame of the audio content, the sub-band object probability indicating a probability of the sub-band of the audio signal containing an audio object. The method further comprises splitting the sub-band of the audio signal into an audio object portion and a residual audio portion based on the determined sub-band object probability. Corresponding system and computer program product are also disclosed.
    Type: Application
    Filed: July 23, 2015
    Publication date: July 27, 2017
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Lianwu CHEN, Lie LU
  • Publication number: 20170171687
    Abstract: Example embodiments disclosed herein relate to audio object clustering with single channel quality preservation. A method of clustering audio objects is disclosed. The method includes determining cluster positions based on object positions of the audio objects and a reference speaker layout, the reference speaker layout indicating speakers located at different speaker positions. The method also includes determining object-to-cluster gains based on the determined cluster positions, the object positions and the reference speaker layout, an object-to-cluster gain defining a proportion of the respective audio object that is assigned to a cluster associated with one of the determined cluster positions. The method further includes clustering the audio objects based on the object-to-cluster gains and the cluster positions for generating cluster signals. Corresponding system, computer program product and device for clustering audio objects are also disclosed.
    Type: Application
    Filed: December 12, 2016
    Publication date: June 15, 2017
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Dirk Jeroen Breebaart, Lianwu Chen, Lie Lu