Patents by Inventor Chih-Chung Kuo

Chih-Chung Kuo has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 7756950
    Abstract: A system and method for providing mobile information, a server and a portable device therein are provided. The server comprises an intelligent download manager and the portable device comprises a file browse manager. The intelligent download manager determines a downloaded file update rule and a file browse rule according to any combination of a document attribute, a browse record, and a document preference. The files to be downloaded to the portable device can be determined automatically according to the downloaded file update rule. The file browse manager provides an intelligent browse mode related to the browse sequence of the downloaded files according to the file browse rule. Therefore, information really interesting to the user can be stored in the limited space by the present invention and the user can access the information quickly and efficiently.
    Type: Grant
    Filed: June 15, 2006
    Date of Patent: July 13, 2010
    Assignee: Industrial Technology Research Institute
    Inventors: Hsu-Chih Wu, Chih-Chung Kuo, Chieh-Chih Chang, Miao-Ru Hsu
  • Patent number: 7472066
    Abstract: An automatic speech segmentation and verification system and method is disclosed, which has a known text script and a recorded speech corpus corresponding to the known text script. A speech unit segmentor segments the recorded speech corpus into N test speech unit segments referring to the phonetic information of the known text script. Then, a segmental verifier is applied to obtain a confidence measure of syllable segmentation for verifying the correctness of the cutting points of test speech unit segments. A phonetic verifier obtains a confidence measure of syllable verification by using verification models for verifying whether the recorded speech corpus is correctly recorded. Finally, a speech unit inspector integrates the confidence measure of syllable segmentation and the confidence measure of syllable verification to determine whether the test speech unit segment is accepted or not.
    Type: Grant
    Filed: February 23, 2004
    Date of Patent: December 30, 2008
    Assignee: Industrial Technology Research Institute
    Inventors: Chih-Chung Kuo, Chi-Shiang Kuo, Jau-Hung Chen
  • Publication number: 20080319752
    Abstract: A speech synthesizer generating system and a method thereof are provided. A speech synthesizer generator in the speech synthesizer generating system automatically generates a speech synthesizer conforming to a speech output specification input by a user. In addition, a recording script is automatically generated by a recording script generator in the speech synthesizer generating system according to the speech output specification, and a customized or expanded speech material is recorded according to the recording script. After the speech material is uploaded to the speech synthesizer generating system, the speech synthesizer generator automatically generates a speech synthesizer conforming to the speech output specification. The speech synthesizer then synthesizes and outputs a speech output at a user end.
    Type: Application
    Filed: October 21, 2007
    Publication date: December 25, 2008
    Applicant: INDUSTRIAL TECHNOLOGY RESEARCH INSTITUTE
    Inventors: Chih-Chung Kuo, Min-Hsin Shen
  • Patent number: 7447625
    Abstract: This proposal presents performance indices and search criteria for the text script generation in the design of corpus-based TTS systems. Based on our criteria a new search method is presented to solve the text selection problem more systematically and efficiently, unlike previous researches either concentrated on covering rate or on hit rate. By control a weighting factor, the covering rate of unit types can be increased to improve the robustness of the TTS system. Finally, the scalable and controllable design of the multi-stage search can produce various kinds of text scripts ideally suitable for the requirement of various kinds of corpus-based TTS systems.
    Type: Grant
    Filed: March 10, 2003
    Date of Patent: November 4, 2008
    Assignee: Industrial Technology Research Institute
    Inventors: Chih-Chung Kuo, Jing-Yi Huang
  • Publication number: 20080133515
    Abstract: A method and a system for executing correlative services are provided. In the method and the system, an event type corresponding to an input message is determined through semantic analysis. After collecting the necessary execution information of the event type according to the input message, a user database, or by inquiring the user or another system, the system automatically executes various correlative services of the event type. Therefore, the system can help users to execute correlative services more correctly and more efficiently.
    Type: Application
    Filed: January 23, 2007
    Publication date: June 5, 2008
    Applicant: INDUSTRIAL TECHNOLOGY RESEARCH INSTITUTE
    Inventors: Shih-Chieh Chien, Chih-Chung Kuo, Jui-Hsin Hung
  • Publication number: 20080091431
    Abstract: A method of text script generation for a corpus-based text-to-speech system includes searching in a source corpus having L sentences, selecting N sentences with a best integrated efficiency as N best cases, and setting iteration k to be 1; for each case n of the N best cases, selecting Mk+1 best sentences with the best integrated efficiency from the unselected sentences in the source corpus; keeping N best cases out of the total unselected sentences for next iteration, and increasing iteration k by 1; and if a termination criterion being reached, setting the best case in the N traced cases as the text script, otherwise, returning to the (k+1)th iteration of searching in the unselected sentences for (k+1)th sentence; wherein the best integrated efficiency depends on a function of combining the covering rate of the synthesis unit type, the hit rate of the synthesis unit type, and the text script size.
    Type: Application
    Filed: December 14, 2007
    Publication date: April 17, 2008
    Inventors: Chih-Chung Kuo, Jing-Yi Huang
  • Patent number: 7315813
    Abstract: A method of speech segment selection for concatenative synthesis based on prosody-aligned distance measure is disclosed. This method is based on comparison of speech segments segmented from a speech corpus, wherein speech segments are fully prosody-aligned to each other before distortion measure. With prosody alignment embedded in selection process, distortion resulting from possible prosody modification in synthesis could be taken into account objectively in selection phase. In order to carry out the purpose of the present invention, automatic segmentation, pitch marking and PSOLA method work together for prosody alignment. Two distortion measures, MFCC and PSQM are used for comparing two prosody-aligned segments of speech because of human perceptual consideration.
    Type: Grant
    Filed: July 29, 2002
    Date of Patent: January 1, 2008
    Assignee: Industrial Technology Research Institute
    Inventors: Chih-Chung Kuo, Chi-Shiang Kuo
  • Publication number: 20070233469
    Abstract: A method for speech quality degradation estimation, a method for degradation measures calculation, and the apparatuses thereof are provided. The first method above estimates the speech quality of a speech signal that is modified by a pitch-synchronous prosody modification method, which comprises the following steps. First, extract at least one source pitchmark from the speech signal, and then maps the source pitchmark(s) to at least one target pitchmark(s). Finally, calculate at least one degradation measure based on the mapping between the source and the target pitchmarks. The degradation measures include several weighted pitch-related functions and duration-related functions, where the weighting functions can be calculated based on the speech signal or the pitchmark(s) mapping mentioned above.
    Type: Application
    Filed: June 29, 2006
    Publication date: October 4, 2007
    Applicant: INDUSTRIAL TECHNOLOGY RESEARCH INSTITUTE
    Inventors: Shi-Han Chen, Chih-Chung Kuo, Shun-Ju Chen
  • Publication number: 20070180058
    Abstract: A system and method for providing mobile information, a server and a portable device therein are provided. The server comprises an intelligent download manager and the portable device comprises a file browse manager. The intelligent download manager determines a downloaded file update rule and a file browse rule according to any combination of a document attribute, a browse record, and a document preference. The files to be downloaded to the portable device can be determined automatically according to the downloaded file update rule. The file browse manager provides an intelligent browse mode related to the browse sequence of the downloaded files according to the file browse rule. Therefore, information really interesting to the user can be stored in the limited space by the present invention and the user can access the information quickly and efficiently.
    Type: Application
    Filed: June 15, 2006
    Publication date: August 2, 2007
    Inventors: Hsu-Chih Wu, Chih-Chung Kuo, Chieh-Chih Chang, Miao-Ru Hsu
  • Publication number: 20060136225
    Abstract: A method and system for pronunciation assessment based on distinctive feature analysis is provided. It evaluates a user's pronunciation by one or more distinctive feature (DF) assessor. It may further construct a phone assessor with DF assessors to evaluate a user's phone pronunciation, and even construct a continuous speech pronunciation assessor with phone assessor to get the final pronunciation score for a word or a sentence. Each DF assessor further includes a feature extractor and a distinctive feature classifier, and can be realized differently. This is based on the different characteristic of the distinctive feature. A score mapper may be included to standardize the output for each DF assessor. Each speech phone can be described as a “bundle” of DFs. The invention is a novel and qualitative solution based on the DF of speech sounds for pronunciation assessment.
    Type: Application
    Filed: June 21, 2005
    Publication date: June 22, 2006
    Inventors: Chih-Chung Kuo, Che-Yao Yang, Ke-Shiu Chen, Miao-Ru Hsu
  • Publication number: 20050060151
    Abstract: An automatic speech segmentation and verification system and method is disclosed, which has a known text script and a recorded speech corpus corresponding to the known text script. A speech unit segmentor segments the recorded speech corpus into N test speech unit segments referring to the phonetic information of the known text script. Then, a segmental verifier is applied to obtain a confidence measure of syllable segmentation for verifying the correctness of the cutting points of test speech unit segments. A phonetic verifier obtains a confidence measure of syllable verification by using verification models for verifying whether the recorded speech corpus is correctly recorded. Finally, a speech unit inspector integrates the confidence measure of syllable segmentation and the confidence measure of syllable verification to determine whether the test speech unit segment is accepted or not.
    Type: Application
    Filed: February 23, 2004
    Publication date: March 17, 2005
    Applicant: Industrial Technology Research Institute
    Inventors: Chih-Chung Kuo, Chi-Shiang Kuo, Jau-Hung Chen
  • Publication number: 20040054536
    Abstract: This proposal presents performance indices and search criteria for the text script generation in the design of corpus-based TTS systems. Based on our criteria a new search method is presented to solve the text selection problem more systematically and efficiently, unlike previous researches either concentrated on covering rate or on hit rate. By control a weighting factor, the covering rate of unit types can be increased to improve the robustness of the TTS system. Finally, the scalable and controllable design of the multi-stage search can produce various kinds of text scripts ideally suitable for the requirement of various kinds of corpus-based TTS systems.
    Type: Application
    Filed: March 10, 2003
    Publication date: March 18, 2004
    Inventors: Chih-Chung Kuo, Jing-Yi Huang
  • Publication number: 20030195743
    Abstract: A method of speech segment selection for concatenative synthesis based on prosody-aligned distance measure is disclosed. This method is based on comparison of speech segments segmented from a speech corpus, wherein speech segments are fully prosody-aligned to each other before distortion measure. With prosody alignment embedded in selection process, distortion resulting from possible prosody modification in synthesis could be taken into account objectively in selection phase. In order to carry out the purpose of the present invention, automatic segmentation, pitch marking and PSOLA method work together for prosody alignment. Two distortion measures, MFCC and PSQM are used for comparing two prosody-aligned segments of speech because of human perceptual consideration.
    Type: Application
    Filed: July 29, 2002
    Publication date: October 16, 2003
    Applicant: Industrial Technology Research Institute
    Inventors: Chih-Chung Kuo, Chi-Shiang Kuo
  • Patent number: 5872727
    Abstract: An improved method for shifting the pitches of a tone is disclosed. It comprises: (a) subjecting a digitized original waveform to a whitening process using an all-zero filter (AZF) to obtain a whitened waveform; (b) resampling the whitened waveform at a desired scaling ratio to obtain a scaled and whitened waveform; (c) subjecting the scaled and whitened waveform to a coloring process using an all-pole filter (APF) to obtain a synthesized waveform. In a preferred embodiment, the all-zero filter performs the transformation function of: ##EQU1## and the all-pole filter performs the transformation function of: ##EQU2## wherein the a.sub.i 's and b.sub.i 's are linear predictive coefficients. The whitened waveforms can be compressed and stored as wavetables, which can be subsequently retrieved and decompressed before resampling.
    Type: Grant
    Filed: November 19, 1996
    Date of Patent: February 16, 1999
    Assignee: Industrial Technology Research Institute
    Inventor: Chih-Chung Kuo