Patents by Inventor Lianwu CHEN

Lianwu CHEN has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11937064
    Abstract: Example embodiments disclosed herein relate to audio object clustering. A method for metadata-preserved audio object clustering is disclosed. The method comprises classifying an audio object into at least one category based on rendering mode information metadata. The method further comprises assigning a predetermined number of clusters to the categories and rendering the audio object based on the rendering mode. Corresponding system and computer program product are also disclosed.
    Type: Grant
    Filed: May 5, 2022
    Date of Patent: March 19, 2024
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Lianwu Chen, Lie Lu, Nicolas R. Tsingos
  • Patent number: 11929091
    Abstract: An apparatus and method of blind detection of binauralized audio. If the input content is detected as binaural, a second binauralization may be avoided. In this manner, the user experience avoids audio artifacts introduced by multiple binauralizations.
    Type: Grant
    Filed: March 1, 2022
    Date of Patent: March 12, 2024
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Chunmao Zhang, Lianwu Chen, Ziyu Yang, Joshua Brandon Lando, David Matthew Fischer, Lie Lu
  • Patent number: 11930347
    Abstract: A method of processing audio content including a plurality of audio elements comprises: clustering the plurality of audio elements into a plurality of clusters of audio elements; and for a cluster among the plurality of clusters: for each audio element in the cluster, determining a measure of energy that the audio element contributes to the cluster; for at least one audio element in the cluster, determining a compensation gain based at least in part on the measures of energy for the audio elements in the cluster; and applying the compensation gain to the at least one audio element in the cluster.
    Type: Grant
    Filed: February 12, 2020
    Date of Patent: March 12, 2024
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Lianwu Chen, Lie Lu
  • Patent number: 11908483
    Abstract: This application relates to a method, performed at a computing device, of extracting an inter-channel feature from a multi-channel, multi-sound-source mixed audio signal.
    Type: Grant
    Filed: August 12, 2021
    Date of Patent: February 20, 2024
    Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
    Inventors: Rongzhi Gu, Shixiong Zhang, Lianwu Chen, Yong Xu, Meng Yu, Dan Su, Dong Yu
  • Publication number: 20230013740
    Abstract: This application discloses a multi-sound area-based speech detection method and related apparatus, and a storage medium, which is applied to the field of artificial intelligence. The method includes: obtaining sound area information corresponding to each sound area in N sound areas; using the sound area as a target detection sound area, and generating a control signal corresponding to the target detection sound area according to sound area information corresponding to the target detection sound area; processing a speech input signal corresponding to the target detection sound area by using the control signal corresponding to the target detection sound area, to obtain a speech output signal corresponding to the target detection sound area; and generating a speech detection result of the target detection sound area according to the speech output signal corresponding to the target detection sound area.
    Type: Application
    Filed: September 13, 2022
    Publication date: January 19, 2023
    Inventors: Jimeng Zheng, Lianwu Chen, Weiwei Li, Zhiyi Duan, Meng Yu, Dan Su, Kaiyu Jiang
  • Publication number: 20220366933
    Abstract: An apparatus and method of blind detection of binauralized audio. If the input content is detected as binaural, a second binauralization may be avoided. In this manner, the user experience avoids audio artifacts introduced by multiple binauralizations.
    Type: Application
    Filed: March 1, 2022
    Publication date: November 17, 2022
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Chunmao ZHANG, Lianwu CHEN, Ziyu YANG, Joshua Brandon LANDO, David Matthew FISCHER, Lie LU
  • Patent number: 11450337
    Abstract: A multi-person speech separation method is provided for a terminal. The method includes extracting a hybrid speech feature from a hybrid speech signal requiring separation, N human voices being mixed in the hybrid speech signal, N being a positive integer greater than or equal to 2; extracting a masking coefficient of the hybrid speech feature by using a generative adversarial network (GAN) model, to obtain a masking matrix corresponding to the N human voices, wherein the GAN model comprises a generative network model and an adversarial network model; and performing a speech separation on the masking matrix corresponding to the N human voices and the hybrid speech signal by using the GAN model, and outputting N separated speech signals corresponding to the N human voices.
    Type: Grant
    Filed: September 17, 2020
    Date of Patent: September 20, 2022
    Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
    Inventors: Lianwu Chen, Meng Yu, Yanmin Qian, Dan Su, Dong Yu
  • Patent number: 11430428
    Abstract: The present disclosure describes a method, apparatus, and storage medium for performing speech recognition. The method includes acquiring, by an apparatus, first to-be-processed speech information. The apparatus includes a memory storing instructions and a processor in communication with the memory. The method includes acquiring, by the apparatus, a first pause duration according to the first to-be-processed speech information; and in response to the first pause duration being greater than or equal to a first threshold, performing, by the apparatus, speech recognition on the first to-be-processed speech information to obtain a first result of sentence segmentation of speech, the first result of sentence segmentation of speech being text information, the first threshold being determined according to speech information corresponding to a previous moment.
    Type: Grant
    Filed: September 10, 2020
    Date of Patent: August 30, 2022
    Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
    Inventors: Lianwu Chen, Jingliang Bai, Min Luo
  • Publication number: 20220272474
    Abstract: Example embodiments disclosed herein relate to audio object clustering. A method for metadata-preserved audio object clustering is disclosed. The method comprises classifying an audio object into at least one category based on rendering mode information metadata. The method further comprises assigning a predetermined number of clusters to the categories and rendering the audio object based on the rendering mode. Corresponding system and computer program product are also disclosed.
    Type: Application
    Filed: May 5, 2022
    Publication date: August 25, 2022
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Lianwu CHEN, Lie LU, Nicolas R. TSINGOS
  • Patent number: 11363398
    Abstract: Example embodiments disclosed herein relate to audio object clustering. A method for metadata-preserved audio object clustering is disclosed. The method comprises classifying a plurality of audio objects into a number of categories based on information to be preserved in metadata associated with the plurality of audio objects. The method further comprises assigning a predetermined number of clusters to the categories and allocating an audio object in each of the categories to at least one of the clusters according to the assigning. Corresponding system and computer program product are also disclosed.
    Type: Grant
    Filed: December 10, 2015
    Date of Patent: June 14, 2022
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Lianwu Chen, Lie Lu, Nicolas R. Tsingos
  • Publication number: 20220159395
    Abstract: A method of processing audio content including a plurality of audio elements comprises: clustering the plurality of audio elements into a plurality of clusters of audio elements; and for a cluster among the plurality of clusters: for each audio element in the cluster, determining a measure of energy that the audio element contributes to the cluster; for at least one audio element in the cluster, determining a compensation gain based at least in part on the measures of energy for the audio elements in the cluster; and applying the compensation gain to the at least one audio element in the cluster.
    Type: Application
    Filed: February 12, 2020
    Publication date: May 19, 2022
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Lianwu Chen, Lie Lu
  • Patent number: 11264050
    Abstract: An apparatus and method of blind detection of binauralized audio. If the input content is detected as binaural, a second binauralization may be avoided. In this manner, the user experience avoids audio artifacts introduced by multiple binauralizations.
    Type: Grant
    Filed: April 24, 2019
    Date of Patent: March 1, 2022
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Chunmao Zhang, Lianwu Chen, Ziyu Yang, Joshua Brandon Lando, David Matthew Fischer, Lie Lu
  • Publication number: 20210375294
    Abstract: This application relates to a method, performed at a computing device, of extracting an inter-channel feature from a multi-channel, multi-sound-source mixed audio signal.
    Type: Application
    Filed: August 12, 2021
    Publication date: December 2, 2021
    Inventors: Rongzhi Gu, Shixiong Zhang, Lianwu Chen, Yong Xu, Meng Yu, Dan Su, Dong Yu
  • Patent number: 11158304
    Abstract: Embodiments of the present invention provide a speech signal processing model training method, an electronic device, and a storage medium. The embodiments determine a target training loss function based on a training loss function of each of one or more speech signal processing tasks; input a task input feature of each speech signal processing task into a starting multi-task neural network; and update model parameters of a shared layer and each of one or more task layers of the starting multi-task neural network corresponding to the one or more speech signal processing tasks, with minimization of the target training loss function as the training objective, until the starting multi-task neural network converges, to obtain a speech signal processing model.
    Type: Grant
    Filed: October 17, 2019
    Date of Patent: October 26, 2021
    Assignee: Tencent Technology (Shenzhen) Company Limited
    Inventors: Lianwu Chen, Meng Yu, Min Luo, Dan Su
  • Publication number: 20210056984
    Abstract: An apparatus and method of blind detection of binauralized audio. If the input content is detected as binaural, a second binauralization may be avoided. In this manner, the user experience avoids audio artifacts introduced by multiple binauralizations.
    Type: Application
    Filed: April 24, 2019
    Publication date: February 25, 2021
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Chunmao ZHANG, Lianwu CHEN, Ziyu YANG, Joshua Brandon LANDO, David Matthew FISCHER, Lie LU
  • Publication number: 20210005216
    Abstract: A multi-person speech separation method is provided for a terminal. The method includes extracting a hybrid speech feature from a hybrid speech signal requiring separation, N human voices being mixed in the hybrid speech signal, N being a positive integer greater than or equal to 2; extracting a masking coefficient of the hybrid speech feature by using a generative adversarial network (GAN) model, to obtain a masking matrix corresponding to the N human voices, wherein the GAN model comprises a generative network model and an adversarial network model; and performing a speech separation on the masking matrix corresponding to the N human voices and the hybrid speech signal by using the GAN model, and outputting N separated speech signals corresponding to the N human voices.
    Type: Application
    Filed: September 17, 2020
    Publication date: January 7, 2021
    Inventors: Lianwu CHEN, Meng YU, Yanmin QIAN, Dan SU, Dong YU
  • Publication number: 20200410985
    Abstract: The present disclosure describes a method, apparatus, and storage medium for performing speech recognition. The method includes acquiring, by an apparatus, first to-be-processed speech information. The apparatus includes a memory storing instructions and a processor in communication with the memory. The method includes acquiring, by the apparatus, a first pause duration according to the first to-be-processed speech information; and in response to the first pause duration being greater than or equal to a first threshold, performing, by the apparatus, speech recognition on the first to-be-processed speech information to obtain a first result of sentence segmentation of speech, the first result of sentence segmentation of speech being text information, the first threshold being determined according to speech information corresponding to a previous moment.
    Type: Application
    Filed: September 10, 2020
    Publication date: December 31, 2020
    Applicant: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
    Inventors: Lianwu CHEN, Jingliang BAI, Min LUO
  • Publication number: 20200357389
    Abstract: A data processing method based on simultaneous interpretation, applied to a server in a simultaneous interpretation system, including: obtaining audio transmitted by a simultaneous interpretation device; processing the audio by using a simultaneous interpretation model to obtain an initial text; transmitting the initial text to a user terminal; receiving a modified text fed back by the user terminal, the modified text being obtained after the user terminal modifies the initial text; and updating the simultaneous interpretation model according to the initial text and the modified text.
    Type: Application
    Filed: July 28, 2020
    Publication date: November 12, 2020
    Inventors: Jingliang BAI, Caisheng OUYANG, Haikang LIU, Lianwu CHEN, Qi CHEN, Yulu ZHANG, Min LUO, Dan SU
  • Patent number: 10779106
    Abstract: Example embodiments disclosed herein relate to audio object clustering based on renderer-aware perceptual difference. A method of processing audio objects is provided. The method includes obtaining renderer-related information indicating a configuration of a renderer. The method also includes determining, based on the obtained renderer-related information, a rendering difference between a first audio object and a second audio object among the audio objects with respect to the renderer. The method further includes clustering the audio objects at least in part based on the rendering difference. Corresponding system, device, and computer program product are also disclosed.
    Type: Grant
    Filed: July 13, 2017
    Date of Patent: September 15, 2020
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Lianwu Chen, Lie Lu, Dirk Jeroen Breebaart
  • Patent number: 10638246
    Abstract: Example embodiments relate to audio object extraction. A method for audio object extraction from audio content is disclosed. The method comprises determining a sub-band object probability for a sub-band of an audio signal in a frame of the audio content, the sub-band object probability indicating a probability of the sub-band of the audio signal containing an audio object. The method further comprises splitting the sub-band of the audio signal into an audio object portion and a residual audio portion based on the determined sub-band object probability. Corresponding system and computer program product are also disclosed.
    Type: Grant
    Filed: October 16, 2017
    Date of Patent: April 28, 2020
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Lianwu Chen, Lie Lu
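
Illustrative sketch (not taken from any patent text): the abstract of patent 11930347 (application 20220159395) describes measuring the energy each audio element contributes to a cluster and deriving a compensation gain from those measures. The Python below is one possible reading of that idea, under loudly stated assumptions: elements are plain lists of samples already assigned to one cluster, the "measure of energy" is the sum of squared samples, and a single energy-preserving gain is applied to every element. The abstract specifies none of these details, and the function names `energy` and `compensate_cluster` are hypothetical.

```python
import math


def energy(signal):
    """Sum of squared samples (assumed energy measure)."""
    return sum(s * s for s in signal)


def compensate_cluster(elements):
    """Scale the elements of one cluster so the mixed cluster keeps their total energy.

    elements: list of equal-length sample lists already assigned to one cluster.
    Returns (gain, compensated_elements).

    ASSUMPTION: one shared gain, chosen so that the energy of the summed
    (mixed) cluster signal equals the sum of the per-element energies.
    This rule is illustrative only; the abstract does not define it.
    """
    if not elements:
        return 1.0, []

    # Energy each element contributes on its own.
    contributed = [energy(e) for e in elements]

    # Energy of the cluster signal obtained by summing the elements.
    mix = [sum(samples) for samples in zip(*elements)]
    mix_energy = energy(mix)

    # A silent mix cannot be rescaled; leave the elements unchanged.
    if mix_energy == 0.0:
        return 1.0, [list(e) for e in elements]

    gain = math.sqrt(sum(contributed) / mix_energy)
    return gain, [[gain * s for s in e] for e in elements]
```

With two identical elements, the summed signal carries twice the combined energy of the parts, so this sketch returns a gain below 1; with uncorrelated elements the gain stays close to 1.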