Architecture Of Speech Synthesizers (epo) Patents (Class 704/E13.006)
  • Publication number: 20110282668
    Abstract: A method of and system for speech synthesis. First and second text inputs are received in a text-to-speech system, and processed into respective first and second speech outputs corresponding to stored speech respectively from first and second speakers using a processor of the system. The second speech output of the second speaker is adapted to sound like the first speech output of the first speaker.
    Type: Application
    Filed: May 14, 2010
    Publication date: November 17, 2011
    Applicant: GENERAL MOTORS LLC
    Inventors: Jeffrey M. Stefan, Gaurav Talwar, Rathinavelu Chengalvarayan
  • Publication number: 20100217601
    Abstract: A speech processing system includes a plurality of signal analyzers that extract salient signal attributes of an input voice signal. A difference module computes the differences in the salient signal attributes. One or more control modules control a plurality of speech generators using an output signal from the difference module in a speech-locked loop (SLL), the speech generators use the output signal to generate a voice signal.
    Type: Application
    Filed: February 9, 2010
    Publication date: August 26, 2010
    Inventors: Keng Hoong Wee, Lorenzo Turicchia, Rahul Sarpeshkar
  • Publication number: 20090254349
    Abstract: A speech synthesizer can execute speech content editing at high speed and generate speech content easily. The speech synthesizer includes a small speech element DB (101), a small speech element selection unit (102), a small speech element concatenation unit (103), a prosody modification unit (104), a large speech element DB (105), a correspondence DB (106) that associates the small speech element DB (101) with the large speech element DB (105), a speech element candidate obtainment unit (107), a large speech element selection unit (108), and a large speech element concatenation unit (109). By editing synthetic speech using the small speech element DB (101) and performing quality enhancement on an editing result using the large speech element DB (105), speech content can be generated easily on a mobile terminal.
    Type: Application
    Filed: May 11, 2007
    Publication date: October 8, 2009
    Inventors: Yoshifumi Hirose, Yumiko Kato, Takahiro Kamai
  • Publication number: 20080312929
    Abstract: The present invention discloses a text-to-speech system that provides output variability. The system can include a finite state grammar, a variability engine and a text-to-speech engine. The finite state grammar can contain a phrase role consisting of one or more phrase elements. The phrase rule can deterministically generate a variable text phrase based upon at least one random number. The phrase rule can include a definition for each of the phrase elements. Each definition can be associated with at least one defined text string. The variability engine can construct a random text phrase responsive to receiving an action command, wherein said finite state grammar is used to create the text phrase. The variability engine can also rely on user-specified weights to adjust the output probabilities. The speech-to-text engine can convert the text phrase generated by the variability engine into speech output.
    Type: Application
    Filed: June 12, 2007
    Publication date: December 18, 2008
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: OSCAR J. BLASS, PARITOSH D. PATEL, HARVEY M. RUBACK, ROBERTO VILA