Patents by Inventor Ao Yao

Ao Yao has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Speech conversion method and apparatus, storage medium, and electronic device

Patent number: 12223973

Abstract: Embodiments of the present application provide a speech conversion method and apparatus, a storage medium, and an electronic device. The method includes: acquiring a source speech to be converted and a target speech sample of a target speaker; recognizing a style category of the target speech sample, and extracting a target audio feature from the target speech sample according to the style category; extracting a source audio feature from the source speech; acquiring a first style feature of the target speech sample and determining a second style feature of the target speech sample according to the first style feature; fusing and mapping the source audio feature, the target audio feature, and the second style feature to obtain a joint encoding feature; and decoding the joint encoding feature to obtain a target speech feature, and converting the source speech based on the target speech feature to obtain a target speech.

Type: Grant

Filed: August 9, 2024

Date of Patent: February 11, 2025

Assignee: NANJING SILICON INTELLIGENCE TECHNOLOGY CO., LTD.

Inventors: Huapeng Sima, Ao Yao, Yiping Tang

Synthetic audio output method and apparatus, storage medium, and electronic device

Patent number: 12051400

Abstract: This application provides a synthetic audio output method and apparatus, a storage medium, and an electronic device. The method includes: inputting input text and a specified target identity identifier into an audio output model; extracting an identity feature sequence of a target identity by an identity recognition model; extracting a phoneme feature sequence corresponding to the input text by an encoding layer of a speech synthesis model; superimposing and inputting the identity feature sequence of the target identity and the phoneme feature sequence into a variable adapter of the speech synthesis model; after duration prediction and alignment, energy prediction, and pitch prediction are performed on the phoneme feature sequence by the variable adapter, outputting a target Mel-frequency spectrum feature corresponding to the input text through a decoding layer of the speech synthesis model; and inputting the target Mel-frequency spectrum feature into a vocoder to output synthetic audio.

Type: Grant

Filed: February 7, 2024

Date of Patent: July 30, 2024

Assignee: NANJING SILICON INTELLIGENCE TECHNOLOGY CO., LTD.

Inventors: Huapeng Sima, Haie Wu, Ao Yao, Da Jiang, Yiping Tang

INTENT RECOGNITION METHOD BASED ON DEEP LEARNING NETWORK

Publication number: 20210043197

Abstract: The present invention relates to the field of intelligent recognition, and discloses an intent recognition method based on a deep learning network, addressing the technical problem of low intent recognition accuracy.

Type: Application

Filed: March 26, 2020

Publication date: February 11, 2021

Applicant: NANJING SILICON INTELLIGENCE TECHNOLOGY CO., LTD.

Inventors: Huapeng SIMA, Ao YAO

Intent recognition method based on deep learning network

Patent number: 10916242

Abstract: The present invention relates to the field of intelligent recognition, and discloses an intent recognition method based on a deep learning network, addressing the technical problem of low intent recognition accuracy.

Type: Grant

Filed: March 26, 2020

Date of Patent: February 9, 2021

Assignee: NANJING SILICON INTELLIGENCE TECHNOLOGY CO., LTD.

Inventors: Huapeng Sima, Ao Yao
