Patents by Inventor Fan Ping Meng

Fan Ping Meng has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Speech translation method and apparatus utilizing prosodic information

Patent number: 9342509

Abstract: A method and apparatus for speech translation. The method includes: receiving a source speech; extracting non-text information in the source speech; translating the source speech into a target speech; and adjusting the translated target speech according to the extracted non-text information so that the target speech preserves the non-text information in the source speech. The apparatus includes: a receiving module for receiving source speech; an extracting module for extracting non-text information in the source speech; a translation module for translating the source speech into a target speech; and an adjusting module for adjusting the translated target speech according to the extracted non-text information so that the target speech preserves the non-text information in the source speech.

Type: Grant

Filed: October 30, 2009

Date of Patent: May 17, 2016

Assignee: Nuance Communications, Inc.

Inventors: Fan Ping Meng, Yong Qin, Zhi Wei Shuang, Shi Lei Zhang
Audio archive generation and presentation

Patent number: 9210263

Abstract: A method, information processing system, and computer program storage product for automatically generating auditory archives in a customer service environment are disclosed. A communication link with an end user is established. An information form is retrieved. The information form includes at least a category choice information set and at least one audio recoding information set. The end user is prompted to answer a set of questions based on information in the information form. A data set associated with each answer to each question in the set of questions given by the end user is stored. The data is stored under a set of fields corresponding to a question. Each data set stored under the set of fields for each question in the set of questions are combined with each other. An audio archive file is generated including the data sets that have been combined.

Type: Grant

Filed: April 9, 2015

Date of Patent: December 8, 2015

Assignee: International Business Machines Corporation

Inventors: Fan Ping Meng, Yong Qin, Qin Shi, Zhi Wei Shuang
AUDIO ARCHIVE GENERATION AND PRESENTATION

Publication number: 20150215458

Abstract: A method, information processing system, and computer program storage product for automatically generating auditory archives in a customer service environment are disclosed. A communication link with an end user is established. An information form is retrieved. The information form includes at least a category choice information set and at least one audio recoding information set. The end user is prompted to answer a set of questions based on information in the information form. A data set associated with each answer to each question in the set of questions given by the end user is stored. The data is stored under a set of fields corresponding to a question. Each data set stored under the set of fields for each question in the set of questions are combined with each other. An audio archive file is generated including the data sets that have been combined.

Type: Application

Filed: April 9, 2015

Publication date: July 30, 2015

Inventors: Fan Ping MENG, Yong QIN, Qin SHI, Zhi Wei SHUANG
Audio archive generation and presentation

Patent number: 9025736

Abstract: A method, information processing system, and computer program storage product for automatically generating auditory archives in a customer service environment are disclosed. A communication link with an end user is established. An information form is retrieved. The information form includes at least a category choice information set and at least one audio recoding information set. The end user is prompted to answer a set of questions based on information in the information form. A data set associated with each answer to each question in the set of questions given by the end user is stored. The data is stored under a set of fields corresponding to a question. Each data set stored under the set of fields for each question in the set of questions are combined with each other. An audio archive file is generated including the data sets that have been combined.

Type: Grant

Filed: February 4, 2008

Date of Patent: May 5, 2015

Assignee: International Business Machines Corporation

Inventors: Fan Ping Meng, Yong Qin, Qin Shi, Zhi Wei Shuang
Method and system for speech synthesis using dynamically updated acoustic unit sets

Patent number: 8321223

Abstract: A method for performing speech synthesis on textual content at a client. The method includes the steps of: performing speech synthesis on the textual content based on a current acoustical unit set Scurrent in a corpus at the client; analyzing the textual content and generating a list of target units with corresponding context features, selecting multiple acoustical unit candidates for each target unit according to the context features based on an acoustical unit set Stotal that is more plentiful than the current acoustical unit set Scurrent in the corpus at the client, and determining acoustical units suitable for speech synthesis for the textual content according to the multiple unit candidates; and updating the current acoustical unit set Scurrent in the corpus at the client based on the determined acoustical units.

Type: Grant

Filed: May 27, 2009

Date of Patent: November 27, 2012

Assignee: International Business Machines Corporation

Inventors: Fan Ping Meng, Yong Qin, Qin Shi, Zhiwei Shuang
Method and apparatus for speech analysis and synthesis

Patent number: 8280739

Abstract: The present invention provides a speech analysis method comprising steps of obtaining a speech signal and a corresponding DEGG/EGG signal; regarding the speech signal as the output of a vocal tract filter in a source-filter model taking the DEGG/EGG signal as the input; and estimating the features of the vocal tract filter from the speech signal as the output and the DEGG/EGG signal as the input, wherein the features of the vocal tract filter are expressed by the state vectors of the vocal tract filter at selected time points, and the step of estimating is performed using Kalman filtering.

Type: Grant

Filed: April 3, 2008

Date of Patent: October 2, 2012

Assignee: Nuance Communications, Inc.

Inventors: Dan Ning Jiang, Fan Ping Meng, Yong Qin, Zhi Wei Shuang
Voice conversion method and system

Patent number: 8234110

Abstract: A method, system and computer program product for voice conversion. The method includes performing speech analysis on the speech of a source speaker to achieve speech information; performing spectral conversion based on said speech information, to at least achieve a first spectrum similar to the speech of a target speaker; performing unit selection on the speech of said target speaker at least using said first spectrum as a target; replacing at least part of said first spectrum with the spectrum of the selected target speaker's speech unit; and performing speech reconstruction at least based on the replaced spectrum.

Type: Grant

Filed: September 29, 2008

Date of Patent: July 31, 2012

Assignee: Nuance Communications, Inc.

Inventors: Fan Ping Meng, Yong Qin, Qin Shi, Zhi Wei Shuang
SPEECH TRANSLATION METHOD AND APPARATUS

Publication number: 20100114556

Abstract: A method and apparatus for speech translation. The method includes: receiving a source speech; extracting non-text information in the source speech; translating the source speech into a target speech; and adjusting the translated target speech according to the extracted non-text information so that the target speech preserves the non-text information in the source speech. The apparatus includes: a receiving module for receiving source speech; an extracting module for extracting non-text information in the source speech; a translation module for translating the source speech into a target speech; and an adjusting module for adjusting the translated target speech according to the extracted non-text information so that the target speech preserves the non-text information in the source speech.

Type: Application

Filed: October 30, 2009

Publication date: May 6, 2010

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Fan Ping Meng, Yong Qin, Zhi Wei Shuang, Shi Lei Zhang
METHOD AND SYSTEM FOR SPEECH SYNTHESIS

Publication number: 20090299746

Abstract: A method for performing speech synthesis to a textual content at a client. The method includes the steps of: performing speech synthesis to the textual content based on a current acoustical unit set Scurrent in a corpus at the client; analyzing the textual content and generating a list of target units with corresponding context features, selecting multiple acoustical unit candidates for each target unit according to the context features based on an acoustical unit set Stotal that is more plentiful than the current acoustical unit set Scurrent in the corpus at the client, and determining acoustical units suitable for speech synthesis for the textual content according to the multiple unit candidates; and updating the current acoustical unit set Scurrent in the corpus at the client based on the determined acoustical units.

Type: Application

Filed: May 27, 2009

Publication date: December 3, 2009

Inventors: Fan Ping Meng, Yong Qin, Qin Shi, Zhiwei Shuang
VOICE CONVERSION METHOD AND SYSTEM

Publication number: 20090089063

Abstract: A method, system and computer program product for voice conversion. The method includes performing speech analysis on the speech of a source speaker to achieve speech information; performing spectral conversion based on said speech information, to at least achieve a first spectrum similar to the speech of a target speaker; performing unit selection on the speech of said target speaker at least using said first spectrum as a target; replacing at least part of said first spectrum with the spectrum of the selected target speaker's speech unit; and performing speech reconstruction at least based on the replaced spectrum.

Type: Application

Filed: September 29, 2008

Publication date: April 2, 2009

Inventors: Fan Ping Meng, Yong Qin, Qin Shi, Zhi Wei Shuang
METHOD AND APPARATUS FOR SPEECH ANALYSIS AND SYNTHESIS

Publication number: 20080288258

Abstract: The present invention provides a speech analysis method comprising steps of obtaining a speech signal and a corresponding DEGG/EGG signal; regarding the speech signal as the output of a vocal tract filter in a source-filter model taking the DEGG/EGG signal as the input; and estimating the features of the vocal tract filter from the speech signal as the output and the DEGG/EGG signal as the input, wherein the features of the vocal tract filter are expressed by the state vectors of the vocal tract filter at selected time points, and the step of estimating is performed using Kalman filtering.

Type: Application

Filed: April 3, 2008

Publication date: November 20, 2008

Applicant: International Business Machines Corporation

Inventors: Dan Ning Jiang, Fan Ping Meng, Yong Qin, Zhi Wei Shuang
AUDIO ARCHIVE GENERATION AND PRESENTATION

Publication number: 20080187109

Abstract: A method, information processing system, and computer program storage product for automatically generating auditory archives in a customer service environment are disclosed. A communication link with an end user is established. An information form is retrieved. The information form includes at least a category choice information set and at least one audio recoding information set. The end user is prompted to answer a set of questions based on information in the information form. A data set associated with each answer to each question in the set of questions given by the end user is stored. The data is stored under a set of fields corresponding to a question. Each data set stored under the set of fields for each question in the set of questions are combined with each other. An audio archive file is generated including the data sets that have been combined.

Type: Application

Filed: February 4, 2008

Publication date: August 7, 2008

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: FAN PING MENG, Yong Qin, Qin Shi, Zhi Wei Shuang