Patents by Inventor Lianwu CHEN
Lianwu CHEN has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11937064
Abstract: Example embodiments disclosed herein relate to audio object clustering. A method for metadata-preserved audio object clustering is disclosed. The method comprises classifying an audio object into at least a category based rendering mode information metadata. The method further comprises assigning a predetermined number of clusters to the categories and rendering the audio object based on the rendering mode. Corresponding system and computer program product are also disclosed.
Type: Grant
Filed: May 5, 2022
Date of Patent: March 19, 2024
Assignee: Dolby Laboratories Licensing Corporation
Inventors: Lianwu Chen, Lie Lu, Nicolas R. Tsingos
-
Patent number: 11929091
Abstract: An apparatus and method of blind detection of binauralized audio. If the input content is detected as binaural, a second binauralization may be avoided. In this manner, the user experience avoids audio artifacts introduced by multiple binauralizations.
Type: Grant
Filed: March 1, 2022
Date of Patent: March 12, 2024
Assignee: Dolby Laboratories Licensing Corporation
Inventors: Chunmao Zhang, Lianwu Chen, Ziyu Yang, Joshua Brandon Lando, David Matthew Fischer, Lie Lu
-
Patent number: 11930347
Abstract: A method of processing audio content including a plurality of audio elements comprises: clustering the plurality of audio elements into a plurality of clusters of audio elements; and for a cluster among the plurality of clusters: for each audio element in the cluster, determining a measure of energy that the audio element contributes to the cluster; for at least one audio element in the cluster, determining a compensation gain based at least in part on the measures of energy for the audio elements in the cluster; and applying the compensation gain to the at least one audio element in the cluster.
Type: Grant
Filed: February 12, 2020
Date of Patent: March 12, 2024
Assignee: Dolby Laboratories Licensing Corporation
Inventors: Lianwu Chen, Lie Lu
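The abstract for patent 11930347 names three steps (measure each element's energy, derive a compensation gain, apply it) but gives no formula. As a rough illustration only, the sketch below assumes an energy-preserving rule, g = sqrt(sum of element energies / energy of the cluster downmix); that rule, and the names `energy`, `compensation_gain`, and `apply_gain`, are hypothetical and are not taken from the patent.

```python
import math


def energy(signal):
    """Return the sum of squared samples of one signal."""
    return sum(x * x for x in signal)


def compensation_gain(elements):
    """Return a gain that restores the downmix to the summed element energy.

    ASSUMPTION: g = sqrt(sum_i E_i / E_mix). The abstract does not state
    how the gain is derived from the energy measures; this is one guess.
    """
    # Downmix the cluster by summing the elements sample by sample.
    mix = [sum(samples) for samples in zip(*elements)]
    target = sum(energy(e) for e in elements)
    mixed = energy(mix)
    if mixed == 0.0:
        # Fully cancelled or silent cluster: no finite gain can help.
        return 1.0
    return math.sqrt(target / mixed)


def apply_gain(signal, gain):
    """Scale every sample of one element by the gain."""
    return [gain * x for x in signal]


# Two partly cancelling elements assigned to the same cluster.
cluster = [[1.0, 0.0], [-0.5, 0.0]]
gain = compensation_gain(cluster)
compensated = [apply_gain(e, gain) for e in cluster]
```

Here the uncompensated downmix loses energy because the two elements partly cancel; scaling both by the same gain brings the downmix energy back to the sum of the individual energies.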
-
Inter-channel feature extraction method, audio separation method and apparatus, and computing device
Patent number: 11908483
Abstract: This application relates to a method of extracting an inter channel feature from a multi-channel multi-sound source mixed audio signal performed at a computing device.
Type: Grant
Filed: August 12, 2021
Date of Patent: February 20, 2024
Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
Inventors: Rongzhi Gu, Shixiong Zhang, Lianwu Chen, Yong Xu, Meng Yu, Dan Su, Dong Yu
-
Publication number: 20230013740
Abstract: This application discloses a multi-sound area-based speech detection method and related apparatus, and a storage medium, which is applied to the field of artificial intelligence. The method includes: obtaining sound area information corresponding to each sound area in N sound areas; using the sound area as a target detection sound area, and generating a control signal corresponding to the target detection sound area according to sound area information corresponding to the target detection sound area; processing a speech input signal corresponding to the target detection sound area by using the control signal corresponding to the target detection sound area, to obtain a speech output signal corresponding to the target detection sound area; and generating a speech detection result of the target detection sound area according to the speech output signal corresponding to the target detection sound area.
Type: Application
Filed: September 13, 2022
Publication date: January 19, 2023
Inventors: Jimeng ZHENG, Lianwu CHEN, Weiwei Li, Zhiyi Duan, Meng YU, Dan Su, Kaiyu Jiang
-
Publication number: 20220366933
Abstract: An apparatus and method of blind detection of binauralized audio. If the input content is detected as binaural, a second binauralization may be avoided. In this manner, the user experience avoids audio artifacts introduced by multiple binauralizations.
Type: Application
Filed: March 1, 2022
Publication date: November 17, 2022
Applicant: Dolby Laboratories Licensing Corporation
Inventors: Chunmao ZHANG, Lianwu CHEN, Ziyu YANG, Joshua Brandon LANDO, David Matthew FISCHER, Lie LU
-
Patent number: 11450337
Abstract: A multi-person speech separation method is provided for a terminal. The method includes extracting a hybrid speech feature from a hybrid speech signal requiring separation, N human voices being mixed in the hybrid speech signal, N being a positive integer greater than or equal to 2; extracting a masking coefficient of the hybrid speech feature by using a generative adversarial network (GAN) model, to obtain a masking matrix corresponding to the N human voices, wherein the GAN model comprises a generative network model and an adversarial network model; and performing a speech separation on the masking matrix corresponding to the N human voices and the hybrid speech signal by using the GAN model, and outputting N separated speech signals corresponding to the N human voices.
Type: Grant
Filed: September 17, 2020
Date of Patent: September 20, 2022
Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
Inventors: Lianwu Chen, Meng Yu, Yanmin Qian, Dan Su, Dong Yu
-
Patent number: 11430428
Abstract: The present disclosure describes a method, apparatus, and storage medium for performing speech recognition. The method includes acquiring, by an apparatus, first to-be-processed speech information. The apparatus includes a memory storing instructions and a processor in communication with the memory. The method includes acquiring, by the apparatus, a first pause duration according to the first to-be-processed speech information; and in response to the first pause duration being greater than or equal to a first threshold, performing, by the apparatus, speech recognition on the first to-be-processed speech information to obtain a first result of sentence segmentation of speech, the first result of sentence segmentation of speech being text information, the first threshold being determined according to speech information corresponding to a previous moment.
Type: Grant
Filed: September 10, 2020
Date of Patent: August 30, 2022
Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
Inventors: Lianwu Chen, Jingliang Bai, Min Luo
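The abstract for patent 11430428 describes cutting a sentence when a pause reaches a threshold that itself depends on earlier speech. The sketch below illustrates only that control flow. The adaptation rule (a shorter threshold after a long segment), the default values, and the name `segment_boundaries` are all assumptions made for illustration; the abstract does not say how the threshold is derived, and no speech recognizer is invoked here.

```python
def segment_boundaries(chunks, base=0.5, floor=0.2, long_speech=10.0):
    """Return indices of chunks after which a sentence boundary is emitted.

    chunks: list of (speech_seconds, trailing_pause_seconds) pairs.

    ASSUMPTION: the next threshold drops to `floor` when the segment that
    just closed ran longer than `long_speech` seconds, else it is `base`.
    """
    boundaries = []
    threshold = base
    running = 0.0  # speech accumulated since the last boundary
    for index, (speech, pause) in enumerate(chunks):
        running += speech
        if pause >= threshold:
            # A real system would hand the buffered audio to a recognizer
            # at this point to obtain the segmented text.
            boundaries.append(index)
            # Derive the next threshold from the segment just closed.
            threshold = floor if running > long_speech else base
            running = 0.0
    return boundaries


chunks = [(2.0, 0.3), (9.0, 0.6), (1.0, 0.3), (1.0, 0.1)]
cuts = segment_boundaries(chunks)
```

With these inputs the 0.3 s pause after the first chunk is ignored, while an identical 0.3 s pause after the third chunk does trigger a cut, because the preceding long segment lowered the threshold.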
-
Publication number: 20220272474
Abstract: Example embodiments disclosed herein relate to audio object clustering. A method for metadata-preserved audio object clustering is disclosed. The method comprises classifying an audio object into at least a category based rendering mode information metadata. The method further comprises assigning a predetermined number of clusters to the categories and rendering the audio object based on the rendering mode. Corresponding system and computer program product are also disclosed.
Type: Application
Filed: May 5, 2022
Publication date: August 25, 2022
Applicant: Dolby Laboratories Licensing Corporation
Inventors: Lianwu CHEN, Lie LU, Nicolas R. TSINGOS
-
Patent number: 11363398
Abstract: Example embodiments disclosed herein relate to audio object clustering. A method for metadata-preserved audio object clustering is disclosed. The method comprises classifying a plurality of audio objects into a number of categories based on information to be preserved in metadata associated with the plurality of audio objects. The method further comprises assigning a predetermined number of clusters to the categories and allocating an audio object in each of the categories to at least one of the clusters according to the assigning. Corresponding system and computer program product are also disclosed.
Type: Grant
Filed: December 10, 2015
Date of Patent: June 14, 2022
Assignee: Dolby Laboratories Licensing Corporation
Inventors: Lianwu Chen, Lie Lu, Nicolas R. Tsingos
-
Publication number: 20220159395
Abstract: A method of processing audio content including a plurality of audio elements comprises: clustering the plurality of audio elements into a plurality of clusters of audio elements; and for a cluster among the plurality of clusters: for each audio element in the cluster, determining a measure of energy that the audio element contributes to the cluster; for at least one audio element in the cluster, determining a compensation gain based at least in part on the measures of energy for the audio elements in the cluster; and applying the compensation gain to the at least one audio element in the cluster.
Type: Application
Filed: February 12, 2020
Publication date: May 19, 2022
Applicant: Dolby Laboratories Licensing Corporation
Inventors: Lianwu Chen, Lie Lu
-
Patent number: 11264050
Abstract: An apparatus and method of blind detection of binauralized audio. If the input content is detected as binaural, a second binauralization may be avoided. In this manner, the user experience avoids audio artifacts introduced by multiple binauralizations.
Type: Grant
Filed: April 24, 2019
Date of Patent: March 1, 2022
Assignee: Dolby Laboratories Licensing Corporation
Inventors: Chunmao Zhang, Lianwu Chen, Ziyu Yang, Joshua Brandon Lando, David Matthew Fischer, Lie Lu
-
INTER-CHANNEL FEATURE EXTRACTION METHOD, AUDIO SEPARATION METHOD AND APPARATUS, AND COMPUTING DEVICE
Publication number: 20210375294
Abstract: This application relates to a method of extracting an inter channel feature from a multi-channel multi-sound source mixed audio signal performed at a computing device.
Type: Application
Filed: August 12, 2021
Publication date: December 2, 2021
Inventors: Rongzhi Gu, Shixiong Zhang, Lianwu Chen, Yong Xu, Meng Yu, Dan Su, Dong Yu
-
Patent number: 11158304
Abstract: Embodiments of the present invention provide a speech signal processing model training method, an electronic device and a storage medium. The embodiments of the present invention determines a target training loss function based on a training loss function of each of one or more speech signal processing tasks; inputs a task input feature of each speech signal processing task into a starting multi-task neural network, and updates model parameters of a shared layer and each of one or more task layers of the starting multi-task neural network corresponding to the one or more speech signal processing tasks by minimizing the target training loss function as a training objective, until the starting multi-task neural network converges, to obtain a speech signal processing model.
Type: Grant
Filed: October 17, 2019
Date of Patent: October 26, 2021
Assignee: Tencent Technology (Shenzhen) Company Limited
Inventors: Lianwu Chen, Meng Yu, Min Luo, Dan Su
-
Publication number: 20210056984
Abstract: An apparatus and method of blind detection of binauralized audio. If the input content is detected as binaural, a second binauralization may be avoided. In this manner, the user experience avoids audio artifacts introduced by multiple binauralizations.
Type: Application
Filed: April 24, 2019
Publication date: February 25, 2021
Applicant: Dolby Laboratories Licensing Corporation
Inventors: Chunmao ZHANG, Lianwu CHEN, Ziyu YANG, Joshua Brandon LANDO, David Matthew FISCHER, Lie LU
-
Publication number: 20210005216
Abstract: A multi-person speech separation method is provided for a terminal. The method includes extracting a hybrid speech feature from a hybrid speech signal requiring separation, N human voices being mixed in the hybrid speech signal, N being a positive integer greater than or equal to 2; extracting a masking coefficient of the hybrid speech feature by using a generative adversarial network (GAN) model, to obtain a masking matrix corresponding to the N human voices, wherein the GAN model comprises a generative network model and an adversarial network model; and performing a speech separation on the masking matrix corresponding to the N human voices and the hybrid speech signal by using the GAN model, and outputting N separated speech signals corresponding to the N human voices.
Type: Application
Filed: September 17, 2020
Publication date: January 7, 2021
Inventors: Lianwu CHEN, Meng YU, Yanmin QIAN, Dan SU, Dong YU
-
Publication number: 20200410985
Abstract: The present disclosure describes a method, apparatus, and storage medium for performing speech recognition. The method includes acquiring, by an apparatus, first to-be-processed speech information. The apparatus includes a memory storing instructions and a processor in communication with the memory. The method includes acquiring, by the apparatus, a first pause duration according to the first to-be-processed speech information; and in response to the first pause duration being greater than or equal to a first threshold, performing, by the apparatus, speech recognition on the first to-be-processed speech information to obtain a first result of sentence segmentation of speech, the first result of sentence segmentation of speech being text information, the first threshold being determined according to speech information corresponding to a previous moment.
Type: Application
Filed: September 10, 2020
Publication date: December 31, 2020
Applicant: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
Inventors: Lianwu CHEN, Jingliang BAI, Min LUO
-
Publication number: 20200357389
Abstract: A data processing method based on simultaneous interpretation, applied to a server in a simultaneous interpretation system, including: obtaining audio transmitted by a simultaneous interpretation device; processing the audio by using a simultaneous interpretation model to obtain an initial text; transmitting the initial text to a user terminal; receiving a modified text fed back by the user terminal, the modified text being obtained after the user terminal modifies the initial text; and updating the simultaneous interpretation model according to the initial text and the modified text.
Type: Application
Filed: July 28, 2020
Publication date: November 12, 2020
Inventors: Jingliang BAI, Caisheng OUYANG, Haikang LIU, Lianwu CHEN, Qi CHEN, Yulu ZHANG, Min LUO, Dan SU
-
Patent number: 10779106
Abstract: Example embodiments disclosed herein relate to audio object clustering based on renderer-aware perceptual difference. A method of processing audio objects is provided. The method includes obtaining renderer-related information indicating a configuration of a renderer. The method also includes determining, based on the obtained renderer-related information, a rendering difference between a first audio object and a second audio object among the audio objects with respect to the renderer. The method further includes clustering the audio objects at least in part based on the rendering difference. Corresponding system, device, and computer program product are also disclosed.
Type: Grant
Filed: July 13, 2017
Date of Patent: September 15, 2020
Assignee: Dolby Laboratories Licensing Corporation
Inventors: Lianwu Chen, Lie Lu, Dirk Jeroen Breebaart
-
Patent number: 10638246
Abstract: Embodiments of the example embodiment relate to audio object extraction. A method for audio object extraction from audio content is disclosed. The method comprises determining a sub-band object probability for a sub-band of the audio signal in a frame of the audio content, the sub-band object probability indicating a probability of the sub-band of the audio signal containing an audio object. The method further comprises splitting the sub-band of the audio signal into an audio object portion and a residual audio portion based on the determined sub-band object probability. Corresponding system and computer program product are also disclosed.
Type: Grant
Filed: October 16, 2017
Date of Patent: April 28, 2020
Assignee: Dolby Laboratories Licensing Corporation
Inventors: Lianwu Chen, Lie Lu
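The abstract for patent 10638246 describes splitting a sub-band into an object portion and a residual portion according to a sub-band object probability. One simple reading, shown below purely as an assumption, is a linear soft split in which the object portion is the sub-band scaled by the probability p and the residual is the sub-band scaled by 1 - p. The function name `split_sub_band` and the linear rule are hypothetical; the abstract does not specify how the split is computed or how the probability is estimated.

```python
def split_sub_band(sub_band, object_probability):
    """Split sub-band samples into (object_portion, residual_portion).

    ASSUMPTION: a linear soft split, object = p * x and residual = (1 - p) * x,
    so the two portions always sum back to the original sub-band.
    """
    if not 0.0 <= object_probability <= 1.0:
        raise ValueError("object_probability must lie in [0, 1]")
    object_portion = [object_probability * x for x in sub_band]
    residual_portion = [(1.0 - object_probability) * x for x in sub_band]
    return object_portion, residual_portion


sub_band = [1.0, -2.0, 0.5]
object_part, residual_part = split_sub_band(sub_band, 0.75)
```

Under this reading, a probability of 1 routes the whole sub-band to the object portion, a probability of 0 routes it all to the residual, and nothing is lost in between.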