Patents by Inventor Ellen Eide

Ellen Eide has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20080059168
    Abstract: Methods and arrangements for representing the speech waveform in terms of a set of abstract, linguistic distinctions in order to derive a set of discriminative features for use in a speech recognizer. By combining the distinctive feature representation with an original waveform representation, it is possible to achieve a reduction in word error rate of 33% on an automatic speech recognition task.
    Type: Application
    Filed: October 31, 2007
    Publication date: March 6, 2008
    Applicant: International Business Machines Corporation
    Inventor: Ellen Eide
  • Publication number: 20070185715
    Abstract: A method for generating a frequency warping function comprising preparing the training speech of a source and a target speaker; performing frame alignment on the training speech of the speakers; selecting aligned frames from the frame-aligned training speech of the speakers; extracting corresponding sets of formant parameters from the selected aligned frames; and generating a frequency warping function based on the corresponding sets of formant parameters. The step of selecting aligned frames preferably selects a pair of aligned frames in the middle of the same or similar frame-aligned phonemes with the same or similar contexts in the speech of the source speaker and target speaker. The step of generating a frequency warping function preferably uses the various pairs of corresponding formant parameters in the corresponding sets of formant parameters as key positions in a piecewise linear frequency warping function to generate the frequency warping function.
    Type: Application
    Filed: January 17, 2007
    Publication date: August 9, 2007
    Applicant: International Business Machines Corporation
    Inventors: Shuang Wei, Raimo Bakis, Ellen Eide, Liqin Shen
  • Publication number: 20070055526
    Abstract: Disclosed is a method, a system and a computer program product for text-to-speech synthesis. The computer program product comprises a computer useable medium including a computer readable program, where the computer readable program when executed on the computer causes the computer to operate in accordance with a text-to-speech synthesis function by operations that include, responsive to at least one phrase represented as recorded human speech to be employed in synthesizing speech, labeling the phrase according to a symbolic categorization of prosodic phenomena; and constructing a data structure that includes word/prosody-categories and word/prosody-category sequences for the phrase, and that further includes information pertaining to a phone sequence associated with the constituent word or word sequence for the phrase.
    Type: Application
    Filed: August 25, 2005
    Publication date: March 8, 2007
    Inventors: Ellen Eide, Raul Fernandez, John Pitrelli, Mahesh Viswanathan
  • Publication number: 20060229872
    Abstract: A technique for producing speech output in a text-to-speech system is provided. A message is created for communication to a user in a natural language generator of the text-to-speech system. The message is annotated in the natural language generator with a synthetic speech output style. The message is conveyed to the user through a speech synthesis system in communication with the natural language generator, wherein the message is conveyed in accordance with the synthetic speech output style.
    Type: Application
    Filed: March 29, 2005
    Publication date: October 12, 2006
    Applicant: International Business Machines Corporation
    Inventors: Ellen Eide, Wael Hamza
  • Publication number: 20060229873
    Abstract: A technique for producing speech output in an automatic dialog system is provided. Communication is received from a user at the automatic dialog system. A context of the communication from the user is detected in a context detector of the automatic dialog system. A message is provided to the user from a text-to-speech system of the automatic dialog system in communication with the context detector, wherein the message is provided in accordance with the detected context of the communication.
    Type: Application
    Filed: March 29, 2005
    Publication date: October 12, 2006
    Applicant: International Business Machines Corporation
    Inventors: Ellen Eide, Wael Hamza, Michael Picheny
  • Publication number: 20060229876
    Abstract: A method, apparatus and a computer program product to generate an audible speech word that corresponds to text. The method includes providing a text word and, in response to the text word, processing pre-recorded speech segments that are derived from a plurality of speakers to selectively concatenate together speech segments based on at least one cost function to form audio data for generating an audible speech word that corresponds to the text word. A data structure is also provided for use in a concatenative text-to-speech system that includes a plurality of speech segments derived from a plurality of speakers, where each speech segment includes an associated attribute vector each of which is comprised of at least one attribute vector element that identifies the speaker from which the speech segment was derived.
    Type: Application
    Filed: April 7, 2005
    Publication date: October 12, 2006
    Inventors: Andrew Aaron, Ellen Eide, Wael Hamza, Michael Picheny, Charles Rutherfoord, Zhi Shuang, Maria Smith
  • Publication number: 20060031062
    Abstract: A driving directions system loads into memory a limited subset of prerecorded, spoken utterances of geographic names from a mass media storage. The subset of spoken utterances may be limited, for example, to the geographic names within a predetermined radius (e.g., a few miles) of the driver's present location. The present location of the driver may be manually entered into the driving directions system by the driver, or automatically determined using a global positioning system (“GPS”) receiver. As the vehicle moves from its present location, the driving directions system loads into memory new names from the mass media storage and overwrites, if necessary, those which are now geographically out of range. Based on the current location of the driving, the driving directions system can audibly output geographic names from the run-time memory.
    Type: Application
    Filed: August 6, 2004
    Publication date: February 9, 2006
    Applicant: International Business Machines Corporation
    Inventors: Raimo Bakis, Ellen Eide, Wael Hamza
  • Publication number: 20050273338
    Abstract: Examples of paralinguistic events (e.g., breaths, coughs, sighs, etc.) are recorded. A text-to-speech (“TTS”) engine may insert the examples into a stream of synthetic speech using, for example, markup. The synthetic speech may include a combination of normal text and paralinguistic text.
    Type: Application
    Filed: June 4, 2004
    Publication date: December 8, 2005
    Inventors: Andrew Aaron, Raimo Bakis, Ellen Eide, Wael Hamza
  • Publication number: 20050234724
    Abstract: Disclosed is a system and method for improving the intelligibility of speech output by a speech synthesizer by determining if uncommon words exist in the text, and if it is determined that an uncommon word exists in the text, pausing the output of the synthesized speech of the uncommon word to offset the uncommon word from its surrounding speech.
    Type: Application
    Filed: April 15, 2004
    Publication date: October 20, 2005
    Inventors: Andrew Aaron, Ellen Eide
  • Publication number: 20050096909
    Abstract: Systems and methods are provided for expressive text-to-speech which include identifying text to convert to speech, selecting a speech style sheet from a set of available speech style sheets, the speech style sheet defining desired speech characteristics, marking the text to associate the text with the selected speech style sheet, and converting the text to speech having the desired speech characteristics by applying a low level markup associated with the speech style sheet.
    Type: Application
    Filed: October 29, 2003
    Publication date: May 5, 2005
    Inventors: Raimo Bakis, Andrew Aaron, Ellen Eide, Thiruvilwamalai Raman
  • Publication number: 20050071163
    Abstract: Systems and methods for speech synthesis and, in particular, text-to-speech systems and methods for converting a text input to a synthetic waveform by processing prosodic and phonetic content of a spoken example of the text input to accurately mimic the input speech style and pronunciation. Systems and methods provide an interface to a TTS system to allow a user to input a text string and a spoken utterance of the text string, extract prosodic parameters from the spoken input, and process the prosodic parameters to derive corresponding markup for the text input to enable a more natural sounding synthesized speech.
    Type: Application
    Filed: September 26, 2003
    Publication date: March 31, 2005
    Applicant: International Business Machines Corporation
    Inventors: Andy Aaron, Raimo Bakis, Ellen Eide, Wael Hamza