Patents by Inventor Lianwu CHEN

Lianwu CHEN has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Metadata-preserved audio object clustering

Patent number: 11937064

Abstract: Example embodiments disclosed herein relate to audio object clustering. A method for metadata-preserved audio object clustering is disclosed. The method comprises classifying an audio object into at least a category based on rendering mode information metadata. The method further comprises assigning a predetermined number of clusters to the categories and rendering the audio object based on the rendering mode. Corresponding system and computer program product are also disclosed.

Type: Grant

Filed: May 5, 2022

Date of Patent: March 19, 2024

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Lianwu Chen, Lie Lu, Nicolas R. Tsingos

Blind detection of binauralized stereo content

Patent number: 11929091

Abstract: An apparatus and method of blind detection of binauralized audio. If the input content is detected as binaural, a second binauralization may be avoided. In this manner, the user experience avoids audio artifacts introduced by multiple binauralizations.

Type: Grant

Filed: March 1, 2022

Date of Patent: March 12, 2024

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Chunmao Zhang, Lianwu Chen, Ziyu Yang, Joshua Brandon Lando, David Matthew Fischer, Lie Lu

Adaptive loudness normalization for audio object clustering

Patent number: 11930347

Abstract: A method of processing audio content including a plurality of audio elements comprises: clustering the plurality of audio elements into a plurality of clusters of audio elements; and for a cluster among the plurality of clusters: for each audio element in the cluster, determining a measure of energy that the audio element contributes to the cluster; for at least one audio element in the cluster, determining a compensation gain based at least in part on the measures of energy for the audio elements in the cluster; and applying the compensation gain to the at least one audio element in the cluster.

Type: Grant

Filed: February 12, 2020

Date of Patent: March 12, 2024

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Lianwu Chen, Lie Lu
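
The abstract above names three steps per cluster: measure the energy each audio element contributes, derive a compensation gain from those measures, and apply it. The sketch below is one possible reading, not the patented method. The specific gain rule (match the energy of the summed cluster signal to the sum of the individual contributed energies) and the names `contributed_energy`, `compensation_gain`, and `apply_gain` are assumptions made for illustration.

```python
import math

def contributed_energy(samples, gain):
    """Energy an element contributes to the cluster after its clustering gain."""
    return sum((gain * s) ** 2 for s in samples)

def compensation_gain(elements):
    """elements: list of (samples, gain) pairs of equal length in one cluster.

    Assumed rule (not taken from the abstract): scale the cluster so that the
    energy of the summed signal equals the sum of the contributed energies.
    """
    energies = [contributed_energy(samples, gain) for samples, gain in elements]
    length = len(elements[0][0])
    mix = [sum(gain * samples[i] for samples, gain in elements) for i in range(length)]
    mix_energy = sum(x * x for x in mix)
    if mix_energy == 0.0:
        return 1.0  # silent cluster: nothing to compensate
    return math.sqrt(sum(energies) / mix_energy)

def apply_gain(samples, gain):
    """Return a scaled copy of the samples."""
    return [gain * s for s in samples]

# Two fully correlated elements: their sum is louder than the energy sum suggests.
tone = [1.0, -1.0, 1.0, -1.0]
cluster = [(tone, 1.0), (tone, 1.0)]
g = compensation_gain(cluster)
compensated = [(apply_gain(samples, g), gain) for samples, gain in cluster]
```

For two identical signals the gain comes out below 1, which pulls the summed cluster back to the combined energy of its members.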

Inter-channel feature extraction method, audio separation method and apparatus, and computing device

Patent number: 11908483

Abstract: This application relates to a method of extracting an inter channel feature from a multi-channel multi-sound source mixed audio signal performed at a computing device.

Type: Grant

Filed: August 12, 2021

Date of Patent: February 20, 2024

Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventors: Rongzhi Gu, Shixiong Zhang, Lianwu Chen, Yong Xu, Meng Yu, Dan Su, Dong Yu

MULTI-REGISTER-BASED SPEECH DETECTION METHOD AND RELATED APPARATUS, AND STORAGE MEDIUM

Publication number: 20230013740

Abstract: This application discloses a multi-sound area-based speech detection method and related apparatus, and a storage medium, which is applied to the field of artificial intelligence. The method includes: obtaining sound area information corresponding to each sound area in N sound areas; using the sound area as a target detection sound area, and generating a control signal corresponding to the target detection sound area according to sound area information corresponding to the target detection sound area; processing a speech input signal corresponding to the target detection sound area by using the control signal corresponding to the target detection sound area, to obtain a speech output signal corresponding to the target detection sound area; and generating a speech detection result of the target detection sound area according to the speech output signal corresponding to the target detection sound area.

Type: Application

Filed: September 13, 2022

Publication date: January 19, 2023

Inventors: Jimeng ZHENG, Lianwu CHEN, Weiwei Li, Zhiyi Duan, Meng YU, Dan Su, Kaiyu Jiang

BLIND DETECTION OF BINAURALIZED STEREO CONTENT

Publication number: 20220366933

Abstract: An apparatus and method of blind detection of binauralized audio. If the input content is detected as binaural, a second binauralization may be avoided. In this manner, the user experience avoids audio artifacts introduced by multiple binauralizations.

Type: Application

Filed: March 1, 2022

Publication date: November 17, 2022

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Chunmao ZHANG, Lianwu CHEN, Ziyu YANG, Joshua Brandon LANDO, David Matthew FISCHER, Lie LU

Multi-person speech separation method and apparatus using a generative adversarial network model

Patent number: 11450337

Abstract: A multi-person speech separation method is provided for a terminal. The method includes extracting a hybrid speech feature from a hybrid speech signal requiring separation, N human voices being mixed in the hybrid speech signal, N being a positive integer greater than or equal to 2; extracting a masking coefficient of the hybrid speech feature by using a generative adversarial network (GAN) model, to obtain a masking matrix corresponding to the N human voices, wherein the GAN model comprises a generative network model and an adversarial network model; and performing a speech separation on the masking matrix corresponding to the N human voices and the hybrid speech signal by using the GAN model, and outputting N separated speech signals corresponding to the N human voices.

Type: Grant

Filed: September 17, 2020

Date of Patent: September 20, 2022

Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventors: Lianwu Chen, Meng Yu, Yanmin Qian, Dan Su, Dong Yu

Method, apparatus, and storage medium for segmenting sentences for speech recognition

Patent number: 11430428

Abstract: The present disclosure describes a method, apparatus, and storage medium for performing speech recognition. The method includes acquiring, by an apparatus, first to-be-processed speech information. The apparatus includes a memory storing instructions and a processor in communication with the memory. The method includes acquiring, by the apparatus, a first pause duration according to the first to-be-processed speech information; and in response to the first pause duration being greater than or equal to a first threshold, performing, by the apparatus, speech recognition on the first to-be-processed speech information to obtain a first result of sentence segmentation of speech, the first result of sentence segmentation of speech being text information, the first threshold being determined according to speech information corresponding to a previous moment.

Type: Grant

Filed: September 10, 2020

Date of Patent: August 30, 2022

Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventors: Lianwu Chen, Jingliang Bai, Min Luo
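
The abstract above describes cutting a sentence once the pause duration reaches a threshold, with the threshold derived from earlier speech. The sketch below illustrates that control flow only. The per-frame speech flags, the 100 ms frame length, and the rule in `next_threshold_ms` (a shorter threshold after a long sentence) are hypothetical choices, since the abstract does not say how the threshold is computed. Speech recognition itself is left out; it would run on each returned segment.

```python
def next_threshold_ms(prev_segment_ms, base_ms=800, short_ms=300, long_segment_ms=5000):
    """Hypothetical rule: after a long sentence, accept a shorter pause as a boundary."""
    return short_ms if prev_segment_ms >= long_segment_ms else base_ms

def segment_sentences(frames, frame_ms=100, initial_threshold_ms=800):
    """frames: list of bools, True where a frame contains speech.

    Returns (start, end) frame indices, end exclusive, one pair per sentence.
    """
    segments = []
    threshold_ms = initial_threshold_ms
    start = None          # first speech frame of the current sentence
    last_speech = None    # most recent speech frame
    pause_frames = 0
    for i, is_speech in enumerate(frames):
        if is_speech:
            if start is None:
                start = i
            last_speech = i
            pause_frames = 0
        elif start is not None:
            pause_frames += 1
            if pause_frames * frame_ms >= threshold_ms:
                segments.append((start, last_speech + 1))
                # The next threshold depends on the sentence just closed.
                threshold_ms = next_threshold_ms((last_speech + 1 - start) * frame_ms)
                start = None
                pause_frames = 0
    if start is not None:
        segments.append((start, last_speech + 1))
    return segments

frames = [True] * 50 + [False] * 8 + [True] * 10 + [False] * 3 + [True] * 2
boundaries = segment_sentences(frames)
```

In this run the first sentence lasts 5 s, so the second boundary is accepted after only 300 ms of silence, while a boundary after a short sentence would again need 800 ms.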

METADATA-PRESERVED AUDIO OBJECT CLUSTERING

Publication number: 20220272474

Abstract: Example embodiments disclosed herein relate to audio object clustering. A method for metadata-preserved audio object clustering is disclosed. The method comprises classifying an audio object into at least a category based on rendering mode information metadata. The method further comprises assigning a predetermined number of clusters to the categories and rendering the audio object based on the rendering mode. Corresponding system and computer program product are also disclosed.

Type: Application

Filed: May 5, 2022

Publication date: August 25, 2022

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Lianwu CHEN, Lie LU, Nicolas R. TSINGOS

Metadata-preserved audio object clustering

Patent number: 11363398

Abstract: Example embodiments disclosed herein relate to audio object clustering. A method for metadata-preserved audio object clustering is disclosed. The method comprises classifying a plurality of audio objects into a number of categories based on information to be preserved in metadata associated with the plurality of audio objects. The method further comprises assigning a predetermined number of clusters to the categories and allocating an audio object in each of the categories to at least one of the clusters according to the assigning. Corresponding system and computer program product are also disclosed.

Type: Grant

Filed: December 10, 2015

Date of Patent: June 14, 2022

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Lianwu Chen, Lie Lu, Nicolas R. Tsingos
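
The abstract above outlines three steps: classify objects into categories by the metadata to be preserved, assign a fixed cluster budget across the categories, and allocate the objects in each category to that category's clusters. The sketch below is a rough illustration under stated assumptions. The `zone` metadata key, the one-dimensional position `x`, the budget rule in `assign_budget`, and the contiguous-chunk allocation are all hypothetical and are not taken from the patent.

```python
from collections import defaultdict

def assign_budget(category_sizes, total_clusters):
    """Give each category one cluster, then hand out the rest to the
    category with the most objects per cluster (hypothetical rule)."""
    if total_clusters < len(category_sizes):
        raise ValueError("need at least one cluster per category")
    budget = {category: 1 for category in category_sizes}
    for _ in range(total_clusters - len(budget)):
        crowded = max(budget, key=lambda c: category_sizes[c] / budget[c])
        budget[crowded] += 1
    return budget

def cluster_objects(objects, total_clusters, key="zone"):
    """Cluster objects so that no cluster mixes values of the preserved metadata."""
    by_category = defaultdict(list)
    for obj in objects:
        by_category[obj[key]].append(obj)
    budget = assign_budget({c: len(m) for c, m in by_category.items()}, total_clusters)
    clusters = []
    for category, members in by_category.items():
        members = sorted(members, key=lambda o: o["x"])  # neighbours end up together
        k = min(budget[category], len(members))
        for j in range(k):
            chunk = members[j * len(members) // k:(j + 1) * len(members) // k]
            clusters.append({key: category, "names": [o["name"] for o in chunk]})
    return clusters

objects = [
    {"name": "a", "zone": "front", "x": 0.1},
    {"name": "b", "zone": "front", "x": 0.9},
    {"name": "c", "zone": "front", "x": 0.2},
    {"name": "d", "zone": "front", "x": 0.8},
    {"name": "e", "zone": "all", "x": 0.5},
]
clusters = cluster_objects(objects, total_clusters=3)
```

Because clustering happens inside each category, every resulting cluster carries a single value of the preserved metadata, which is the property the title refers to.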

ADAPTIVE LOUDNESS NORMALIZATION FOR AUDIO OBJECT CLUSTERING

Publication number: 20220159395

Abstract: A method of processing audio content including a plurality of audio elements comprises: clustering the plurality of audio elements into a plurality of clusters of audio elements; and for a cluster among the plurality of clusters: for each audio element in the cluster, determining a measure of energy that the audio element contributes to the cluster; for at least one audio element in the cluster, determining a compensation gain based at least in part on the measures of energy for the audio elements in the cluster; and applying the compensation gain to the at least one audio element in the cluster.

Type: Application

Filed: February 12, 2020

Publication date: May 19, 2022

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Lianwu Chen, Lie Lu

Blind detection of binauralized stereo content

Patent number: 11264050

Abstract: An apparatus and method of blind detection of binauralized audio. If the input content is detected as binaural, a second binauralization may be avoided. In this manner, the user experience avoids audio artifacts introduced by multiple binauralizations.

Type: Grant

Filed: April 24, 2019

Date of Patent: March 1, 2022

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Chunmao Zhang, Lianwu Chen, Ziyu Yang, Joshua Brandon Lando, David Matthew Fischer, Lie Lu

INTER-CHANNEL FEATURE EXTRACTION METHOD, AUDIO SEPARATION METHOD AND APPARATUS, AND COMPUTING DEVICE

Publication number: 20210375294

Abstract: This application relates to a method of extracting an inter channel feature from a multi-channel multi-sound source mixed audio signal performed at a computing device.

Type: Application

Filed: August 12, 2021

Publication date: December 2, 2021

Inventors: Rongzhi Gu, Shixiong Zhang, Lianwu Chen, Yong Xu, Meng Yu, Dan Su, Dong Yu

Training method of speech signal processing model with shared layer, electronic device and storage medium

Patent number: 11158304

Abstract: Embodiments of the present invention provide a speech signal processing model training method, an electronic device and a storage medium. The embodiments of the present invention determine a target training loss function based on a training loss function of each of one or more speech signal processing tasks; input a task input feature of each speech signal processing task into a starting multi-task neural network; and update model parameters of a shared layer and each of one or more task layers of the starting multi-task neural network corresponding to the one or more speech signal processing tasks by minimizing the target training loss function as a training objective, until the starting multi-task neural network converges, to obtain a speech signal processing model.

Type: Grant

Filed: October 17, 2019

Date of Patent: October 26, 2021

Assignee: Tencent Technology (Shenzhen) Company Limited

Inventors: Lianwu Chen, Meng Yu, Min Luo, Dan Su
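
The abstract above describes combining per-task losses into one target loss and minimizing it to update both a shared layer and the task layers. The toy sketch below shows that idea with scalars only: one shared weight `w` and one head weight per task, trained by plain gradient descent. The weighted-sum form of the target loss, the squared-error task losses, the learning rate, and the fixed step count (used in place of a convergence check) are assumptions made for illustration.

```python
def task_losses(w, heads, x, targets):
    """Squared error of each task head applied to the shared feature w * x."""
    shared = w * x
    return [(a * shared - y) ** 2 for a, y in zip(heads, targets)]

def target_loss(w, heads, x, targets, weights):
    """Assumed combination: weighted sum of the per-task losses."""
    return sum(k * loss for k, loss in zip(weights, task_losses(w, heads, x, targets)))

def train(w, heads, x, targets, weights, lr=0.05, steps=500):
    """Gradient descent on the target loss; updates shared and task parameters together."""
    heads = list(heads)
    for _ in range(steps):
        shared = w * x
        errors = [a * shared - y for a, y in zip(heads, targets)]
        # The shared weight receives gradient from every task.
        grad_w = sum(2 * k * e * a * x for k, e, a in zip(weights, errors, heads))
        # Each head receives gradient only from its own task.
        grad_heads = [2 * k * e * shared for k, e in zip(weights, errors)]
        w -= lr * grad_w
        heads = [a - lr * g for a, g in zip(heads, grad_heads)]
    return w, heads

x, targets, weights = 1.0, [2.0, -1.0], [1.0, 1.0]
initial = target_loss(1.0, [0.5, 0.5], x, targets, weights)
w, heads = train(1.0, [0.5, 0.5], x, targets, weights)
final = target_loss(w, heads, x, targets, weights)
```

The point of the shared parameter is visible in `grad_w`: it sums contributions from all tasks, so a single update step moves the shared layer toward a representation that serves every task.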

Blind Detection of Binauralized Stereo Content

Publication number: 20210056984

Abstract: An apparatus and method of blind detection of binauralized audio. If the input content is detected as binaural, a second binauralization may be avoided. In this manner, the user experience avoids audio artifacts introduced by multiple binauralizations.

Type: Application

Filed: April 24, 2019

Publication date: February 25, 2021

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Chunmao ZHANG, Lianwu CHEN, Ziyu YANG, Joshua Brandon LANDO, David Matthew FISCHER, Lie LU

MULTI-PERSON SPEECH SEPARATION METHOD AND APPARATUS

Publication number: 20210005216

Abstract: A multi-person speech separation method is provided for a terminal. The method includes extracting a hybrid speech feature from a hybrid speech signal requiring separation, N human voices being mixed in the hybrid speech signal, N being a positive integer greater than or equal to 2; extracting a masking coefficient of the hybrid speech feature by using a generative adversarial network (GAN) model, to obtain a masking matrix corresponding to the N human voices, wherein the GAN model comprises a generative network model and an adversarial network model; and performing a speech separation on the masking matrix corresponding to the N human voices and the hybrid speech signal by using the GAN model, and outputting N separated speech signals corresponding to the N human voices.

Type: Application

Filed: September 17, 2020

Publication date: January 7, 2021

Inventors: Lianwu CHEN, Meng YU, Yanmin QIAN, Dan SU, Dong YU

METHOD, APPARATUS, AND STORAGE MEDIUM FOR SEGMENTING SENTENCES FOR SPEECH RECOGNITION

Publication number: 20200410985

Abstract: The present disclosure describes a method, apparatus, and storage medium for performing speech recognition. The method includes acquiring, by an apparatus, first to-be-processed speech information. The apparatus includes a memory storing instructions and a processor in communication with the memory. The method includes acquiring, by the apparatus, a first pause duration according to the first to-be-processed speech information; and in response to the first pause duration being greater than or equal to a first threshold, performing, by the apparatus, speech recognition on the first to-be-processed speech information to obtain a first result of sentence segmentation of speech, the first result of sentence segmentation of speech being text information, the first threshold being determined according to speech information corresponding to a previous moment.

Type: Application

Filed: September 10, 2020

Publication date: December 31, 2020

Applicant: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventors: Lianwu CHEN, Jingliang BAI, Min LUO

DATA PROCESSING METHOD BASED ON SIMULTANEOUS INTERPRETATION, COMPUTER DEVICE, AND STORAGE MEDIUM

Publication number: 20200357389

Abstract: A data processing method based on simultaneous interpretation, applied to a server in a simultaneous interpretation system, including: obtaining audio transmitted by a simultaneous interpretation device; processing the audio by using a simultaneous interpretation model to obtain an initial text; transmitting the initial text to a user terminal; receiving a modified text fed back by the user terminal, the modified text being obtained after the user terminal modifies the initial text; and updating the simultaneous interpretation model according to the initial text and the modified text.

Type: Application

Filed: July 28, 2020

Publication date: November 12, 2020

Inventors: Jingliang BAI, Caisheng OUYANG, Haikang LIU, Lianwu CHEN, Qi CHEN, Yulu ZHANG, Min LUO, Dan SU

Audio object clustering based on renderer-aware perceptual difference

Patent number: 10779106

Abstract: Example embodiments disclosed herein relate to audio object clustering based on renderer-aware perceptual difference. A method of processing audio objects is provided. The method includes obtaining renderer-related information indicating a configuration of a renderer. The method also includes determining, based on the obtained renderer-related information, a rendering difference between a first audio object and a second audio object among the audio objects with respect to the renderer. The method further includes clustering the audio objects at least in part based on the rendering difference. Corresponding system, device, and computer program product are also disclosed.

Type: Grant

Filed: July 13, 2017

Date of Patent: September 15, 2020

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Lianwu Chen, Lie Lu, Dirk Jeroen Breebaart

Audio object extraction with sub-band object probability estimation

Patent number: 10638246

Abstract: Example embodiments relate to audio object extraction. A method for audio object extraction from audio content is disclosed. The method comprises determining a sub-band object probability for a sub-band of the audio signal in a frame of the audio content, the sub-band object probability indicating a probability of the sub-band of the audio signal containing an audio object. The method further comprises splitting the sub-band of the audio signal into an audio object portion and a residual audio portion based on the determined sub-band object probability. Corresponding system and computer program product are also disclosed.

Type: Grant

Filed: October 16, 2017

Date of Patent: April 28, 2020

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Lianwu Chen, Lie Lu
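
The abstract above describes splitting each sub-band into an audio object portion and a residual portion according to a sub-band object probability. The sketch below shows only the splitting step and takes the probability as given. The linear split, in which the object portion is the probability times the signal, is an assumption chosen so that the two portions add back up to the input; the abstract does not specify the split rule, and `split_sub_band` is a hypothetical name.

```python
def split_sub_band(sub_band, object_probability):
    """Split one sub-band's samples into (object portion, residual portion).

    Assumed linear rule: the object portion is probability * signal and the
    residual is whatever remains, so the two portions sum to the input.
    """
    if not 0.0 <= object_probability <= 1.0:
        raise ValueError("object_probability must lie in [0, 1]")
    object_part = [object_probability * s for s in sub_band]
    residual_part = [s - o for s, o in zip(sub_band, object_part)]
    return object_part, residual_part

sub_band = [0.5, -1.0, 0.25]
object_part, residual_part = split_sub_band(sub_band, 0.75)
```

A probability of 1 sends the whole sub-band to the object portion and a probability of 0 leaves it entirely in the residual.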
