Patents by Inventor Shilun LIN

Shilun LIN has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 12361923
    Abstract: A concealed text feature corresponding to a text data block of a plurality of text data blocks included in the text data and at least one concealed text feature corresponding to at least one text data block subsequent to the text data block are generated. A coarse fusion is performed on (i) the concealed text feature corresponding to the text data block and (ii) the at least one concealed text feature corresponding to the at least one text data block subsequent to the text data block to obtain at least one coarse fusion text feature. A fine fusion is performed on the at least one coarse fusion text feature to obtain a fine fusion text feature corresponding to the text data block. A length corresponding to the fine fusion text feature is regulated. The fine fusion text feature with the regulated length is transformed into the acoustic feature.
    Type: Grant
    Filed: November 29, 2022
    Date of Patent: July 15, 2025
    Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
    Inventor: Shilun Lin
  • Patent number: 12106746
    Abstract: This application discloses a method, an apparatus, a computer readable medium, and an electronic device for audio synthesis. The method includes: acquiring mixed language text information comprising text characters corresponding to at least two language types; performing text coding processing on the mixed language text information based on the at least two language types, to obtain an intermediate semantic coding feature of the mixed language text information; acquiring a target tone feature corresponding to a target tone subject, and performing decoding processing on the intermediate semantic coding feature based on the target tone feature to obtain an acoustic feature; and performing acoustic coding processing on the acoustic feature to obtain an audio corresponding to the mixed language text information.
    Type: Grant
    Filed: March 24, 2022
    Date of Patent: October 1, 2024
    Assignee: Tencent Technology (Shenzhen) Company Limited
    Inventor: Shilun Lin
  • Publication number: 20230087916
    Abstract: A concealed text feature corresponding to a text data block of a plurality of text data blocks included in the text data and at least one concealed text feature corresponding to at least one text data block subsequent to the text data block are generated. A coarse fusion is performed on (i) the concealed text feature corresponding to the text data block and (ii) the at least one concealed text feature corresponding to the at least one text data block subsequent to the text data block to obtain at least one coarse fusion text feature. A fine fusion is performed on the at least one coarse fusion text feature to obtain a fine fusion text feature corresponding to the text data block. A length corresponding to the fine fusion text feature is regulated. The fine fusion text feature with the regulated length is transformed into the acoustic feature.
    Type: Application
    Filed: November 29, 2022
    Publication date: March 23, 2023
    Applicant: Tencent Technology (Shenzhen) Company Limited
    Inventor: Shilun LIN
  • Publication number: 20230035504
    Abstract: Embodiments of this application provide an audio processing method and apparatus, a vocoder, an electronic device, and a computer-readable storage medium. The audio processing method includes performing speech feature conversion on a text to obtain at least one acoustic feature frame; extracting a conditional feature corresponding to each acoustic feature frame from each acoustic feature frame of the at least one acoustic feature frame by a frame rate network; performing frequency division and time-domain down-sampling on the current frame of each acoustic feature frame to obtain n subframes corresponding to the current frame; synchronously predicting sample values corresponding to the current m adjacent sampling points on the n subframes to obtain m×n sub-prediction values; obtaining an audio prediction signal corresponding to the current frame; and performing audio synthesis on the audio prediction signal corresponding to each acoustic feature frame to obtain a target audio corresponding to the text.
    Type: Application
    Filed: October 13, 2022
    Publication date: February 2, 2023
    Inventors: Shilun LIN, Xinhui LI, Li LU
  • Patent number: 11450312
    Abstract: A speech recognition method includes: obtaining speech information; and determining beginning and ending positions of a candidate speech segment in the speech information by using a weighted finite state transducer (WFST) network. The candidate speech segment is identified as corresponding to a preset keyword. The method also includes clipping the candidate speech segment from the speech information according to the beginning and ending positions of the candidate speech segment; detecting whether the candidate speech segment includes a preset keyword by using a machine learning model; and determining, upon determining that the candidate speech segment comprises the preset keyword, that the speech information comprises the preset keyword.
    Type: Grant
    Filed: June 12, 2020
    Date of Patent: September 20, 2022
    Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
    Inventors: Shilun Lin, Xilin Zhang, Wenhua Ma, Bo Liu, Xinhui Li, Li Lu, Xiucai Jiang
  • Publication number: 20220215827
    Abstract: This application discloses a method, an apparatus, a computer readable medium, and an electronic device for audio synthesis. The method includes: acquiring mixed language text information comprising text characters corresponding to at least two language types; performing text coding processing on the mixed language text information based on the at least two language types, to obtain an intermediate semantic coding feature of the mixed language text information; acquiring a target tone feature corresponding to a target tone subject, and performing decoding processing on the intermediate semantic coding feature based on the target tone feature to obtain an acoustic feature; and performing acoustic coding processing on the acoustic feature to obtain an audio corresponding to the mixed language text information.
    Type: Application
    Filed: March 24, 2022
    Publication date: July 7, 2022
    Applicant: Tencent Technology (Shenzhen) Company Limited
    Inventor: Shilun LIN
  • Publication number: 20200312309
    Abstract: A speech recognition method includes: obtaining speech information; and determining beginning and ending positions of a candidate speech segment in the speech information by using a weighted finite state transducer (WFST) network. The candidate speech segment is identified as corresponding to a preset keyword. The method also includes clipping the candidate speech segment from the speech information according to the beginning and ending positions of the candidate speech segment; detecting whether the candidate speech segment includes a preset keyword by using a machine learning model; and determining, upon determining that the candidate speech segment comprises the preset keyword, that the speech information comprises the preset keyword.
    Type: Application
    Filed: June 12, 2020
    Publication date: October 1, 2020
    Inventors: Shilun LIN, Xilin ZHANG, Wenhua MA, Bo LIU, Xinhui LI, Li LU, Xiucai JIANG
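
The speech recognition abstract listed above (patent 11450312 and publication 20200312309) describes a two-stage flow: locate a candidate segment, clip it out of the speech information, then verify it with a machine learning model. The following is only a rough sketch of that control flow, not the patented implementation. The names `locate_candidate`, `contains_keyword`, `locate`, and `verify` are hypothetical, and a simple amplitude threshold stands in for the WFST network, which the abstract does not specify in detail.

```python
from typing import Callable, List, Optional, Tuple

Segment = Tuple[int, int]  # (begin, end) sample indices, end exclusive


def locate_candidate(samples: List[float], threshold: float) -> Optional[Segment]:
    """Hypothetical stand-in for the WFST stage.

    Returns the span from the first to the last sample whose magnitude
    exceeds `threshold`, or None if no sample does. A real system would
    decode with a weighted finite state transducer instead.
    """
    active = [i for i, s in enumerate(samples) if abs(s) > threshold]
    if not active:
        return None
    return active[0], active[-1] + 1


def contains_keyword(
    samples: List[float],
    locate: Callable[[List[float]], Optional[Segment]],
    verify: Callable[[List[float]], bool],
) -> bool:
    """Two-stage check: locate a candidate, clip it, then verify it."""
    segment = locate(samples)
    if segment is None:
        # Stage 1 found no candidate, so stage 2 is never run.
        return False
    begin, end = segment
    clipped = samples[begin:end]  # clip by beginning and ending positions
    # Stage 2: `verify` is an assumed placeholder for the machine
    # learning model that confirms or rejects the preset keyword.
    return verify(clipped)
```

In this sketch the second stage only ever sees the clipped segment, which mirrors the ordering in the abstract: the speech information is reported as containing the keyword only when the verifier accepts the candidate.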