Patents by Inventor Shilun LIN

Shilun LIN has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Transforming text data into acoustic feature

Patent number: 12361923

Abstract: A concealed text feature corresponding to a text data block of a plurality of text data blocks included in the text data and at least one concealed text feature corresponding to at least one text data block subsequent to the text data block are generated. A coarse fusion is performed on (i) the concealed text feature corresponding to the text data block and (ii) the at least one concealed text feature corresponding to the at least one text data block subsequent to the text data block to obtain at least one coarse fusion text feature. A fine fusion is performed on the at least one coarse fusion text feature to obtain a fine fusion text feature corresponding to the text data block. A length corresponding to the fine fusion text feature is regulated. The fine fusion text feature with the regulated length is transformed into the acoustic feature.

Type: Grant

Filed: November 29, 2022

Date of Patent: July 15, 2025

Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventor: Shilun Lin
Audio synthesis method and apparatus, computer readable medium, and electronic device

Patent number: 12106746

Abstract: This application discloses a method, an apparatus, a computer readable medium, and an electronic device for audio synthesis. The method includes: acquiring mixed language text information comprising text characters corresponding to at least two language types; performing text coding processing on the mixed language text information based on the at least two language types, to obtain an intermediate semantic coding feature of the mixed language text information; acquiring a target tone feature corresponding to a target tone subject, and performing decoding processing on the intermediate semantic coding feature based on the target tone feature to obtain an acoustic feature; and performing acoustic coding processing on the acoustic feature to obtain an audio corresponding to the mixed language text information.

Type: Grant

Filed: March 24, 2022

Date of Patent: October 1, 2024

Assignee: Tencent Technology (Shenzhen) Company Limited

Inventor: Shilun Lin
TRANSFORMING TEXT DATA INTO ACOUSTIC FEATURE

Publication number: 20230087916

Abstract: A concealed text feature corresponding to a text data block of a plurality of text data blocks included in the text data and at least one concealed text feature corresponding to at least one text data block subsequent to the text data block are generated. A coarse fusion is performed on (i) the concealed text feature corresponding to the text data block and (ii) the at least one concealed text feature corresponding to the at least one text data block subsequent to the text data block to obtain at least one coarse fusion text feature. A fine fusion is performed on the at least one coarse fusion text feature to obtain a fine fusion text feature corresponding to the text data block. A length corresponding to the fine fusion text feature is regulated. The fine fusion text feature with the regulated length is transformed into the acoustic feature.

Type: Application

Filed: November 29, 2022

Publication date: March 23, 2023

Applicant: Tencent Technology (Shenzhen) Company Limited

Inventor: Shilun LIN
AUDIO PROCESSING METHOD AND APPARATUS, VOCODER, ELECTRONIC DEVICE, COMPUTER-READABLE STORAGE MEDIUM, AND COMPUTER PROGRAM PRODUCT

Publication number: 20230035504

Abstract: Embodiments of this application provide an audio processing method and apparatus, a vocoder, an electronic device, and a computer-readable storage medium. The audio processing method includes performing speech feature conversion on a text to obtain at least one acoustic feature frame; extracting a conditional feature corresponding to each acoustic feature frame from each acoustic feature frame of the at least one acoustic feature frame by a frame rate network; performing frequency division and time-domain down-sampling on the current frame of each acoustic feature frame to obtain n subframes corresponding to the current frame; synchronously predicting sample values corresponding to the current m adjacent sampling points on the n subframes to obtain m×n sub-prediction values; obtaining an audio prediction signal corresponding to the current frame; and performing audio synthesis on the audio prediction signal corresponding to each acoustic feature frame to obtain a target audio corresponding to the text.

Type: Application

Filed: October 13, 2022

Publication date: February 2, 2023

Inventors: Shilun LIN, Xinhui LI, Li LU
Speech recognition method, apparatus, and device, and storage medium

Patent number: 11450312

Abstract: A speech recognition method includes: obtaining speech information; and determining beginning and ending positions of a candidate speech segment in the speech information by using a weighted finite state transducer (WFST) network. The candidate speech segment is identified as corresponding to a preset keyword. The method also includes clipping the candidate speech segment from the speech information according to the beginning and ending positions of the candidate speech segment; detecting whether the candidate speech segment includes a preset keyword by using a machine learning model; and determining, upon determining that the candidate speech segment comprises the preset keyword, that the speech information comprises the preset keyword.

Type: Grant

Filed: June 12, 2020

Date of Patent: September 20, 2022

Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventors: Shilun Lin, Xilin Zhang, Wenhua Ma, Bo Liu, Xinhui Li, Li Lu, Xiucai Jiang
AUDIO SYNTHESIS METHOD AND APPARATUS, COMPUTER READABLE MEDIUM, AND ELECTRONIC DEVICE

Publication number: 20220215827

Abstract: This application discloses a method, an apparatus, a computer readable medium, and an electronic device for audio synthesis. The method includes: acquiring mixed language text information comprising text characters corresponding to at least two language types; performing text coding processing on the mixed language text information based on the at least two language types, to obtain an intermediate semantic coding feature of the mixed language text information; acquiring a target tone feature corresponding to a target tone subject, and performing decoding processing on the intermediate semantic coding feature based on the target tone feature to obtain an acoustic feature; and performing acoustic coding processing on the acoustic feature to obtain an audio corresponding to the mixed language text information.

Type: Application

Filed: March 24, 2022

Publication date: July 7, 2022

Applicant: Tencent Technology (Shenzhen) Company Limited

Inventor: Shilun LIN
SPEECH RECOGNITION METHOD, APPARATUS, AND DEVICE, AND STORAGE MEDIUM

Publication number: 20200312309

Abstract: A speech recognition method includes: obtaining speech information; and determining beginning and ending positions of a candidate speech segment in the speech information by using a weighted finite state transducer (WFST) network. The candidate speech segment is identified as corresponding to a preset keyword. The method also includes clipping the candidate speech segment from the speech information according to the beginning and ending positions of the candidate speech segment; detecting whether the candidate speech segment includes a preset keyword by using a machine learning model; and determining, upon determining that the candidate speech segment comprises the preset keyword, that the speech information comprises the preset keyword.

Type: Application

Filed: June 12, 2020

Publication date: October 1, 2020

Inventors: Shilun LIN, Xilin ZHANG, Wenhua MA, Bo LIU, Xinhui LI, Li LU, Xiucai JIANG

Transforming text data into acoustic feature

Audio synthesis method and apparatus, computer readable medium, and electronic device

TRANSFORMING TEXT DATA INTO ACOUSTIC FEATURE

AUDIO PROCESSING METHOD AND APPARATUS, VOCODER, ELECTRONIC DEVICE, COMPUTER-READABLE STORAGE MEDIUM, AND COMPUTER PROGRAM PRODUCT

Speech recognition method, apparatus, and device, and storage medium

AUDIO SYNTHESIS METHOD AND APPARATUS, COMPUTER READABLE MEDIUM, AND ELECTRONIC DEVICE

SPEECH RECOGNITION METHOD, APPARATUS, AND DEVICE, AND STORAGE MEDIUM