Patents by Inventor Chih-Chung Kuo

Chih-Chung Kuo has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

System and method for providing mobile information server and portable device therein

Patent number: 7756950

Abstract: A system and method for providing mobile information, a server and a portable device therein are provided. The server comprises an intelligent download manager and the portable device comprises a file browse manager. The intelligent download manager determines a downloaded file update rule and a file browse rule according to any combination of a document attribute, a browse record, and a document preference. The files to be downloaded to the portable device can be determined automatically according to the downloaded file update rule. The file browse manager provides an intelligent browse mode related to the browse sequence of the downloaded files according to the file browse rule. Therefore, information really interesting to the user can be stored in the limited space by the present invention and the user can access the information quickly and efficiently.

Type: Grant

Filed: June 15, 2006

Date of Patent: July 13, 2010

Assignee: Industrial Technology Research Institute

Inventors: Hsu-Chih Wu, Chih-Chung Kuo, Chieh-Chih Chang, Miao-Ru Hsu

Automatic speech segmentation and verification using segment confidence measures

Patent number: 7472066

Abstract: An automatic speech segmentation and verification system and method is disclosed, which has a known text script and a recorded speech corpus corresponding to the known text script. A speech unit segmentor segments the recorded speech corpus into N test speech unit segments referring to the phonetic information of the known text script. Then, a segmental verifier is applied to obtain a confidence measure of syllable segmentation for verifying the correctness of the cutting points of test speech unit segments. A phonetic verifier obtains a confidence measure of syllable verification by using verification models for verifying whether the recorded speech corpus is correctly recorded. Finally, a speech unit inspector integrates the confidence measure of syllable segmentation and the confidence measure of syllable verification to determine whether the test speech unit segment is accepted or not.

Type: Grant

Filed: February 23, 2004

Date of Patent: December 30, 2008

Assignee: Industrial Technology Research Institute

Inventors: Chih-Chung Kuo, Chi-Shiang Kuo, Jau-Hung Chen
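
The final integration step described in the abstract above can be sketched as follows. This is only an illustration under stated assumptions, not the patented method: the abstract does not say how the two confidence measures are combined, so the weighted average, the `weight` and `threshold` parameters, and the function name `accept_segment` are all hypothetical.

```python
def accept_segment(segmentation_conf, verification_conf, weight=0.5, threshold=0.6):
    """Decide whether a test speech unit segment is accepted.

    Both confidence measures are assumed to lie in [0, 1]. The linear
    combination and the threshold are hypothetical choices for illustration.
    """
    combined = weight * segmentation_conf + (1.0 - weight) * verification_conf
    return combined >= threshold


# A well-cut, correctly spoken segment passes; a misread one does not.
print(accept_segment(0.9, 0.8))  # True
print(accept_segment(0.9, 0.2))  # False
```

A single threshold on a weighted sum is the simplest possible integration rule; a real inspector could just as well require each measure to clear its own threshold.
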

SPEECH SYNTHESIZER GENERATING SYSTEM AND METHOD THEREOF

Publication number: 20080319752

Abstract: A speech synthesizer generating system and a method thereof are provided. A speech synthesizer generator in the speech synthesizer generating system automatically generates a speech synthesizer conforming to a speech output specification input by a user. In addition, a recording script is automatically generated by a recording script generator in the speech synthesizer generating system according to the speech output specification, and a customized or expanded speech material is recorded according to the recording script. After the speech material is uploaded to the speech synthesizer generating system, the speech synthesizer generator automatically generates a speech synthesizer conforming to the speech output specification. The speech synthesizer then synthesizes and outputs a speech output at a user end.

Type: Application

Filed: October 21, 2007

Publication date: December 25, 2008

Applicant: INDUSTRIAL TECHNOLOGY RESEARCH INSTITUTE

Inventors: Chih-Chung Kuo, Min-Hsin Shen

Method for generating text script of high efficiency

Patent number: 7447625

Abstract: This proposal presents performance indices and search criteria for text script generation in the design of corpus-based TTS systems. Based on our criteria, a new search method is presented to solve the text selection problem more systematically and efficiently, unlike previous research, which concentrated either on covering rate or on hit rate. By controlling a weighting factor, the covering rate of unit types can be increased to improve the robustness of the TTS system. Finally, the scalable and controllable design of the multi-stage search can produce various kinds of text scripts suited to the requirements of various kinds of corpus-based TTS systems.

Type: Grant

Filed: March 10, 2003

Date of Patent: November 4, 2008

Assignee: Industrial Technology Research Institute

Inventors: Chih-Chung Kuo, Jing-Yi Huang

METHOD AND SYSTEM FOR EXECUTING CORRELATIVE SERVICES

Publication number: 20080133515

Abstract: A method and a system for executing correlative services are provided. In the method and the system, an event type corresponding to an input message is determined through semantic analysis. After collecting the necessary execution information of the event type according to the input message, a user database, or by inquiring the user or another system, the system automatically executes various correlative services of the event type. Therefore, the system can help users to execute correlative services more correctly and more efficiently.

Type: Application

Filed: January 23, 2007

Publication date: June 5, 2008

Applicant: INDUSTRIAL TECHNOLOGY RESEARCH INSTITUTE

Inventors: Shih-Chieh Chien, Chih-Chung Kuo, Jui-Hsin Hung

Method And Apparatus Of Generating Text Script For A Corpus-Based Text-To Speech System

Publication number: 20080091431

Abstract: A method of text script generation for a corpus-based text-to-speech system includes searching in a source corpus having L sentences, selecting N sentences with a best integrated efficiency as N best cases, and setting iteration k to be 1; for each case n of the N best cases, selecting M_{k+1} best sentences with the best integrated efficiency from the unselected sentences in the source corpus; keeping N best cases out of the total unselected sentences for the next iteration, and increasing iteration k by 1; and, if a termination criterion is reached, setting the best case in the N traced cases as the text script, otherwise returning to the (k+1)th iteration of searching in the unselected sentences for the (k+1)th sentence; wherein the best integrated efficiency depends on a function combining the covering rate of the synthesis unit type, the hit rate of the synthesis unit type, and the text script size.

Type: Application

Filed: December 14, 2007

Publication date: April 17, 2008

Inventors: Chih-Chung Kuo, Jing-Yi Huang
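
The N-best search in the abstract above can be sketched as a small beam search. This is a rough illustration under assumptions, not the claimed method: the abstract does not define the efficiency function, so `efficiency` below (a weighted sum of covering rate and hit rate divided by script size), the whitespace tokenizer `units`, and the fixed `m_best` and `max_sentences` parameters are all hypothetical.

```python
from collections import Counter


def units(sentence):
    """Hypothetical synthesis units: whitespace-separated tokens."""
    return sentence.split()


def efficiency(script, type_freq, weight=0.5):
    """Assumed integrated efficiency of a candidate script.

    Combines the covering rate (share of unit types covered), the hit rate
    (share of unit occurrences in the source corpus whose type is covered),
    and the script size (number of units), which acts as a cost.
    """
    covered, size = set(), 0
    for sentence in script:
        tokens = units(sentence)
        covered.update(tokens)
        size += len(tokens)
    if size == 0:
        return 0.0
    covering_rate = len(covered) / len(type_freq)
    hit_rate = sum(type_freq[t] for t in covered) / sum(type_freq.values())
    return (weight * covering_rate + (1.0 - weight) * hit_rate) / size


def select_script(corpus, n_best=2, m_best=2, max_sentences=3, weight=0.5):
    """N-best (beam) search for a script of at most max_sentences sentences."""
    if not corpus:
        return []
    type_freq = Counter(u for s in corpus for u in units(s))

    def score(case):
        return efficiency(case, type_freq, weight)

    # Iteration 1: keep the N best single-sentence cases.
    beam = sorted(((s,) for s in corpus), key=score, reverse=True)[:n_best]
    for _ in range(max_sentences - 1):
        candidates = {}
        for case in beam:
            unselected = [s for s in corpus if s not in case]
            # Extend each case with its M best unselected sentences.
            extended = sorted((case + (s,) for s in unselected), key=score, reverse=True)
            for new_case in extended[:m_best]:
                candidates[frozenset(new_case)] = new_case  # drop reordered duplicates
        if not candidates:  # termination: source corpus exhausted
            break
        beam = sorted(candidates.values(), key=score, reverse=True)[:n_best]
    return list(max(beam, key=score))


script = select_script(["a b", "c d", "a a"], max_sentences=2, weight=1.0)
print(sorted(script))  # ['a b', 'c d']
```

With `weight=1.0` only the covering rate counts, so the two sentences that together cover all four unit types win over the redundant "a a". Lowering `weight` shifts the preference toward frequent unit types.
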

Method of speech segment selection for concatenative synthesis based on prosody-aligned distance measure

Patent number: 7315813

Abstract: A method of speech segment selection for concatenative synthesis based on prosody-aligned distance measure is disclosed. This method is based on comparison of speech segments segmented from a speech corpus, wherein speech segments are fully prosody-aligned to each other before distortion is measured. With prosody alignment embedded in the selection process, distortion resulting from possible prosody modification in synthesis can be taken into account objectively in the selection phase. To this end, automatic segmentation, pitch marking, and the PSOLA method work together for prosody alignment. Two distortion measures, MFCC and PSQM, are used for comparing two prosody-aligned segments of speech because of human perceptual considerations.

Type: Grant

Filed: July 29, 2002

Date of Patent: January 1, 2008

Assignee: Industrial Technology Research Institute

Inventors: Chih-Chung Kuo, Chi-Shiang Kuo

METHOD FOR SPEECH QUALITY DEGRADATION ESTIMATION AND METHOD FOR DEGRADATION MEASURES CALCULATION AND APPARATUSES THEREOF

Publication number: 20070233469

Abstract: A method for speech quality degradation estimation, a method for degradation measures calculation, and the apparatuses thereof are provided. The first method estimates the speech quality of a speech signal that is modified by a pitch-synchronous prosody modification method, and comprises the following steps. First, extract at least one source pitchmark from the speech signal, and then map the source pitchmark(s) to at least one target pitchmark. Finally, calculate at least one degradation measure based on the mapping between the source and the target pitchmarks. The degradation measures include several weighted pitch-related functions and duration-related functions, where the weighting functions can be calculated based on the speech signal or the pitchmark mapping mentioned above.

Type: Application

Filed: June 29, 2006

Publication date: October 4, 2007

Applicant: INDUSTRIAL TECHNOLOGY RESEARCH INSTITUTE

Inventors: Shi-Han Chen, Chih-Chung Kuo, Shun-Ju Chen

SYSTEM AND METHOD FOR PROVIDING MOBILE INFORMATION SERVER AND PORTABLE DEVICE THEREIN

Publication number: 20070180058

Abstract: A system and method for providing mobile information, a server and a portable device therein are provided. The server comprises an intelligent download manager and the portable device comprises a file browse manager. The intelligent download manager determines a downloaded file update rule and a file browse rule according to any combination of a document attribute, a browse record, and a document preference. The files to be downloaded to the portable device can be determined automatically according to the downloaded file update rule. The file browse manager provides an intelligent browse mode related to the browse sequence of the downloaded files according to the file browse rule. Therefore, information really interesting to the user can be stored in the limited space by the present invention and the user can access the information quickly and efficiently.

Type: Application

Filed: June 15, 2006

Publication date: August 2, 2007

Inventors: Hsu-Chih Wu, Chih-Chung Kuo, Chieh-Chih Chang, Miao-Ru Hsu

Pronunciation assessment method and system based on distinctive feature analysis

Publication number: 20060136225

Abstract: A method and system for pronunciation assessment based on distinctive feature analysis is provided. It evaluates a user's pronunciation by one or more distinctive feature (DF) assessors. It may further construct a phone assessor from DF assessors to evaluate a user's phone pronunciation, and even construct a continuous speech pronunciation assessor from phone assessors to get the final pronunciation score for a word or a sentence. Each DF assessor further includes a feature extractor and a distinctive feature classifier, and can be realized differently, depending on the characteristics of the distinctive feature. A score mapper may be included to standardize the output of each DF assessor. Each speech phone can be described as a “bundle” of DFs. The invention is a novel and qualitative solution based on the DFs of speech sounds for pronunciation assessment.

Type: Application

Filed: June 21, 2005

Publication date: June 22, 2006

Inventors: Chih-Chung Kuo, Che-Yao Yang, Ke-Shiu Chen, Miao-Ru Hsu

Automatic speech segmentation and verification method and system

Publication number: 20050060151

Abstract: An automatic speech segmentation and verification system and method is disclosed, which has a known text script and a recorded speech corpus corresponding to the known text script. A speech unit segmentor segments the recorded speech corpus into N test speech unit segments referring to the phonetic information of the known text script. Then, a segmental verifier is applied to obtain a confidence measure of syllable segmentation for verifying the correctness of the cutting points of test speech unit segments. A phonetic verifier obtains a confidence measure of syllable verification by using verification models for verifying whether the recorded speech corpus is correctly recorded. Finally, a speech unit inspector integrates the confidence measure of syllable segmentation and the confidence measure of syllable verification to determine whether the test speech unit segment is accepted or not.

Type: Application

Filed: February 23, 2004

Publication date: March 17, 2005

Applicant: Industrial Technology Research Institute

Inventors: Chih-Chung Kuo, Chi-Shiang Kuo, Jau-Hung Chen

Method for generating text script of high efficiency

Publication number: 20040054536

Abstract: This proposal presents performance indices and search criteria for text script generation in the design of corpus-based TTS systems. Based on our criteria, a new search method is presented to solve the text selection problem more systematically and efficiently, unlike previous research, which concentrated either on covering rate or on hit rate. By controlling a weighting factor, the covering rate of unit types can be increased to improve the robustness of the TTS system. Finally, the scalable and controllable design of the multi-stage search can produce various kinds of text scripts suited to the requirements of various kinds of corpus-based TTS systems.

Type: Application

Filed: March 10, 2003

Publication date: March 18, 2004

Inventors: Chih-Chung Kuo, Jing-Yi Huang

Method of speech segment selection for concatenative synthesis based on prosody-aligned distance measure

Publication number: 20030195743

Abstract: A method of speech segment selection for concatenative synthesis based on prosody-aligned distance measure is disclosed. This method is based on comparison of speech segments segmented from a speech corpus, wherein speech segments are fully prosody-aligned to each other before distortion is measured. With prosody alignment embedded in the selection process, distortion resulting from possible prosody modification in synthesis can be taken into account objectively in the selection phase. To this end, automatic segmentation, pitch marking, and the PSOLA method work together for prosody alignment. Two distortion measures, MFCC and PSQM, are used for comparing two prosody-aligned segments of speech because of human perceptual considerations.

Type: Application

Filed: July 29, 2002

Publication date: October 16, 2003

Applicant: Industrial Technology Research Institute

Inventors: Chih-Chung Kuo, Chi-Shiang Kuo

Pitch shift method with conserved timbre

Patent number: 5872727

Abstract: An improved method for shifting the pitches of a tone is disclosed. It comprises: (a) subjecting a digitized original waveform to a whitening process using an all-zero filter (AZF) to obtain a whitened waveform; (b) resampling the whitened waveform at a desired scaling ratio to obtain a scaled and whitened waveform; and (c) subjecting the scaled and whitened waveform to a coloring process using an all-pole filter (APF) to obtain a synthesized waveform. In a preferred embodiment, the all-zero filter performs the transformation function of: ##EQU1## and the all-pole filter performs the transformation function of: ##EQU2## wherein the a_i's and b_i's are linear predictive coefficients. The whitened waveforms can be compressed and stored as wavetables, which can be subsequently retrieved and decompressed before resampling.

Type: Grant

Filed: November 19, 1996

Date of Patent: February 16, 1999

Assignee: Industrial Technology Research Institute

Inventor: Chih-Chung Kuo
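
Steps (a) to (c) of the abstract above can be sketched in a few lines. This is an illustration under assumptions, not the patented implementation: the two transfer functions are not reproduced in this listing, so the sketch uses the textbook linear-prediction forms (an inverse filter 1 - sum a_i z^-i for whitening and its reciprocal for coloring), sets the b_i equal to the a_i, resamples by linear interpolation, and omits the wavetable compression. All function names are hypothetical.

```python
import math


def lpc(x, order):
    """Linear predictive coefficients via autocorrelation and Levinson-Durbin."""
    r = [sum(x[n] * x[n - k] for n in range(k, len(x))) for k in range(order + 1)]
    a = [0.0] * (order + 1)  # a[0] is unused
    err = r[0]
    for i in range(1, order + 1):
        if err <= 0.0:
            break
        k = (r[i] - sum(a[j] * r[i - j] for j in range(1, i))) / err
        new_a = a[:]
        new_a[i] = k
        for j in range(1, i):
            new_a[j] = a[j] - k * a[i - j]
        a = new_a
        err *= 1.0 - k * k
    return a[1:]


def whiten(x, a):
    """(a) All-zero filter: e[n] = x[n] - sum_i a_i * x[n - i]."""
    return [x[n] - sum(a[i] * x[n - 1 - i] for i in range(len(a)) if n - 1 - i >= 0)
            for n in range(len(x))]


def resample(x, ratio):
    """(b) Read the waveform `ratio` times faster using linear interpolation."""
    out, n = [], 0
    while n * ratio <= len(x) - 1:
        pos = n * ratio
        i = int(pos)
        frac = pos - i
        nxt = x[min(i + 1, len(x) - 1)]
        out.append((1.0 - frac) * x[i] + frac * nxt)
        n += 1
    return out


def color(e, b):
    """(c) All-pole filter: y[n] = e[n] + sum_i b_i * y[n - i]."""
    y = []
    for n in range(len(e)):
        y.append(e[n] + sum(b[i] * y[n - 1 - i] for i in range(len(b)) if n - 1 - i >= 0))
    return y


def pitch_shift(x, ratio, order=8):
    """Whiten, resample, then re-color with the original spectral envelope."""
    a = lpc(x, order)
    return color(resample(whiten(x, a), ratio), a)


# Hypothetical test tone: two partials plus a weak chirp.
tone = [math.sin(0.3 * n) + 0.5 * math.sin(0.7 * n) + 0.1 * math.sin(0.01 * n * n)
        for n in range(400)]
shifted = pitch_shift(tone, 2.0, order=4)  # one octave up, half as many samples
```

Because the coloring filter is the exact inverse of the whitening filter here, a ratio of 1.0 returns the input unchanged up to rounding. For other ratios only the whitened residual is stretched or compressed, which is the sense in which the spectral envelope, and hence the timbre, is conserved.
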
