Patents by Inventor Fan Ping Meng
Fan Ping Meng has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 9342509
Abstract: A method and apparatus for speech translation. The method includes: receiving a source speech; extracting non-text information in the source speech; translating the source speech into a target speech; and adjusting the translated target speech according to the extracted non-text information so that the target speech preserves the non-text information in the source speech. The apparatus includes: a receiving module for receiving source speech; an extracting module for extracting non-text information in the source speech; a translation module for translating the source speech into a target speech; and an adjusting module for adjusting the translated target speech according to the extracted non-text information so that the target speech preserves the non-text information in the source speech.
Type: Grant
Filed: October 30, 2009
Date of Patent: May 17, 2016
Assignee: Nuance Communications, Inc.
Inventors: Fan Ping Meng, Yong Qin, Zhi Wei Shuang, Shi Lei Zhang
-
Patent number: 9210263
Abstract: A method, information processing system, and computer program storage product for automatically generating auditory archives in a customer service environment are disclosed. A communication link with an end user is established. An information form is retrieved. The information form includes at least a category choice information set and at least one audio recoding information set. The end user is prompted to answer a set of questions based on information in the information form. A data set associated with each answer to each question in the set of questions given by the end user is stored. The data is stored under a set of fields corresponding to a question. Each data set stored under the set of fields for each question in the set of questions are combined with each other. An audio archive file is generated including the data sets that have been combined.
Type: Grant
Filed: April 9, 2015
Date of Patent: December 8, 2015
Assignee: International Business Machines Corporation
Inventors: Fan Ping Meng, Yong Qin, Qin Shi, Zhi Wei Shuang
-
Publication number: 20150215458
Abstract: A method, information processing system, and computer program storage product for automatically generating auditory archives in a customer service environment are disclosed. A communication link with an end user is established. An information form is retrieved. The information form includes at least a category choice information set and at least one audio recoding information set. The end user is prompted to answer a set of questions based on information in the information form. A data set associated with each answer to each question in the set of questions given by the end user is stored. The data is stored under a set of fields corresponding to a question. Each data set stored under the set of fields for each question in the set of questions are combined with each other. An audio archive file is generated including the data sets that have been combined.
Type: Application
Filed: April 9, 2015
Publication date: July 30, 2015
Inventors: Fan Ping Meng, Yong Qin, Qin Shi, Zhi Wei Shuang
-
Patent number: 9025736
Abstract: A method, information processing system, and computer program storage product for automatically generating auditory archives in a customer service environment are disclosed. A communication link with an end user is established. An information form is retrieved. The information form includes at least a category choice information set and at least one audio recoding information set. The end user is prompted to answer a set of questions based on information in the information form. A data set associated with each answer to each question in the set of questions given by the end user is stored. The data is stored under a set of fields corresponding to a question. Each data set stored under the set of fields for each question in the set of questions are combined with each other. An audio archive file is generated including the data sets that have been combined.
Type: Grant
Filed: February 4, 2008
Date of Patent: May 5, 2015
Assignee: International Business Machines Corporation
Inventors: Fan Ping Meng, Yong Qin, Qin Shi, Zhi Wei Shuang
-
Patent number: 8321223
Abstract: A method for performing speech synthesis on textual content at a client. The method includes the steps of: performing speech synthesis on the textual content based on a current acoustical unit set S_current in a corpus at the client; analyzing the textual content and generating a list of target units with corresponding context features, selecting multiple acoustical unit candidates for each target unit according to the context features based on an acoustical unit set S_total that is more plentiful than the current acoustical unit set S_current in the corpus at the client, and determining acoustical units suitable for speech synthesis for the textual content according to the multiple unit candidates; and updating the current acoustical unit set S_current in the corpus at the client based on the determined acoustical units.
Type: Grant
Filed: May 27, 2009
Date of Patent: November 27, 2012
Assignee: International Business Machines Corporation
Inventors: Fan Ping Meng, Yong Qin, Qin Shi, Zhiwei Shuang
-
Patent number: 8280739
Abstract: The present invention provides a speech analysis method comprising steps of obtaining a speech signal and a corresponding DEGG/EGG signal; regarding the speech signal as the output of a vocal tract filter in a source-filter model taking the DEGG/EGG signal as the input; and estimating the features of the vocal tract filter from the speech signal as the output and the DEGG/EGG signal as the input, wherein the features of the vocal tract filter are expressed by the state vectors of the vocal tract filter at selected time points, and the step of estimating is performed using Kalman filtering.
Type: Grant
Filed: April 3, 2008
Date of Patent: October 2, 2012
Assignee: Nuance Communications, Inc.
Inventors: Dan Ning Jiang, Fan Ping Meng, Yong Qin, Zhi Wei Shuang
-
Patent number: 8234110
Abstract: A method, system and computer program product for voice conversion. The method includes performing speech analysis on the speech of a source speaker to achieve speech information; performing spectral conversion based on said speech information, to at least achieve a first spectrum similar to the speech of a target speaker; performing unit selection on the speech of said target speaker at least using said first spectrum as a target; replacing at least part of said first spectrum with the spectrum of the selected target speaker's speech unit; and performing speech reconstruction at least based on the replaced spectrum.
Type: Grant
Filed: September 29, 2008
Date of Patent: July 31, 2012
Assignee: Nuance Communications, Inc.
Inventors: Fan Ping Meng, Yong Qin, Qin Shi, Zhi Wei Shuang
-
Publication number: 20100114556
Abstract: A method and apparatus for speech translation. The method includes: receiving a source speech; extracting non-text information in the source speech; translating the source speech into a target speech; and adjusting the translated target speech according to the extracted non-text information so that the target speech preserves the non-text information in the source speech. The apparatus includes: a receiving module for receiving source speech; an extracting module for extracting non-text information in the source speech; a translation module for translating the source speech into a target speech; and an adjusting module for adjusting the translated target speech according to the extracted non-text information so that the target speech preserves the non-text information in the source speech.
Type: Application
Filed: October 30, 2009
Publication date: May 6, 2010
Applicant: International Business Machines Corporation
Inventors: Fan Ping Meng, Yong Qin, Zhi Wei Shuang, Shi Lei Zhang
-
Publication number: 20090299746
Abstract: A method for performing speech synthesis to a textual content at a client. The method includes the steps of: performing speech synthesis to the textual content based on a current acoustical unit set S_current in a corpus at the client; analyzing the textual content and generating a list of target units with corresponding context features, selecting multiple acoustical unit candidates for each target unit according to the context features based on an acoustical unit set S_total that is more plentiful than the current acoustical unit set S_current in the corpus at the client, and determining acoustical units suitable for speech synthesis for the textual content according to the multiple unit candidates; and updating the current acoustical unit set S_current in the corpus at the client based on the determined acoustical units.
Type: Application
Filed: May 27, 2009
Publication date: December 3, 2009
Inventors: Fan Ping Meng, Yong Qin, Qin Shi, Zhiwei Shuang
-
Publication number: 20090089063
Abstract: A method, system and computer program product for voice conversion. The method includes performing speech analysis on the speech of a source speaker to achieve speech information; performing spectral conversion based on said speech information, to at least achieve a first spectrum similar to the speech of a target speaker; performing unit selection on the speech of said target speaker at least using said first spectrum as a target; replacing at least part of said first spectrum with the spectrum of the selected target speaker's speech unit; and performing speech reconstruction at least based on the replaced spectrum.
Type: Application
Filed: September 29, 2008
Publication date: April 2, 2009
Inventors: Fan Ping Meng, Yong Qin, Qin Shi, Zhi Wei Shuang
-
Publication number: 20080288258
Abstract: The present invention provides a speech analysis method comprising steps of obtaining a speech signal and a corresponding DEGG/EGG signal; regarding the speech signal as the output of a vocal tract filter in a source-filter model taking the DEGG/EGG signal as the input; and estimating the features of the vocal tract filter from the speech signal as the output and the DEGG/EGG signal as the input, wherein the features of the vocal tract filter are expressed by the state vectors of the vocal tract filter at selected time points, and the step of estimating is performed using Kalman filtering.
Type: Application
Filed: April 3, 2008
Publication date: November 20, 2008
Applicant: International Business Machines Corporation
Inventors: Dan Ning Jiang, Fan Ping Meng, Yong Qin, Zhi Wei Shuang
-
Publication number: 20080187109
Abstract: A method, information processing system, and computer program storage product for automatically generating auditory archives in a customer service environment are disclosed. A communication link with an end user is established. An information form is retrieved. The information form includes at least a category choice information set and at least one audio recoding information set. The end user is prompted to answer a set of questions based on information in the information form. A data set associated with each answer to each question in the set of questions given by the end user is stored. The data is stored under a set of fields corresponding to a question. Each data set stored under the set of fields for each question in the set of questions are combined with each other. An audio archive file is generated including the data sets that have been combined.
Type: Application
Filed: February 4, 2008
Publication date: August 7, 2008
Applicant: International Business Machines Corporation
Inventors: Fan Ping Meng, Yong Qin, Qin Shi, Zhi Wei Shuang