Grapheme To Phoneme, Detection Of Language (epo) Patents (Class 704/E13.012)

SYSTEM AND METHOD FOR PERFORMING SPEECH SYNTHESIS WITH A CACHE OF PHONEME SEQUENCES

Publication number: 20120010877

Abstract: Disclosed are systems, methods, and computer readable media for performing speech synthesis. The method embodiment comprises applying a first part of a speech synthesizer to a text corpus to obtain a plurality of phoneme sequences, the first part of the speech synthesizer only identifying possible phoneme sequences, for each of the obtained plurality of phoneme sequences, identifying joins that would be calculated to synthesize each of the plurality of respective phoneme sequences, and adding the identified joins to a cache for use in speech synthesis.

Type: Application

Filed: July 13, 2011

Publication date: January 12, 2012

Applicant: AT&T Intellectual Property II, L.P.

Inventor: Alistair D. Conkie
Searching Spoken Media According to Phonemes Derived From Expanded Concepts Expressed As Text

Publication number: 20110040774

Abstract: According to one embodiment, searching media includes receiving a search query comprising search terms. At least one search term is expanded to yield a set of conceptually equivalent terms. The set of conceptually equivalent terms is converted to a set of search phonemes. Files that record phonemes are searched according to the set of search phonemes. A file that includes a phoneme that matches at least one search phoneme is selected and output to a client.

Type: Application

Filed: August 14, 2009

Publication date: February 17, 2011

Applicant: Raytheon Company

Inventors: Bruce E. Peoples, Michael R. Johnson, Kristopher D. Barr
Systems and methods for selecting from multiple phonectic transcriptions for text-to-speech synthesis

Patent number: 7869999

Abstract: A system and method for generating synthetic speech, which operates in a computer implemented Text-To-Speech system. The system comprises at least a speaker database that has been previously created from user recordings, a Front-End system to receive an input text and a Text-To-Speech engine. The Front-End system generates multiple phonetic transcriptions for each word of the input text, and the TTS engine uses a cost function to select which phonetic transcription is the more appropriate for searching the speech segments within the speaker database to be concatenated and synthesized.

Type: Grant

Filed: August 10, 2005

Date of Patent: January 11, 2011

Assignee: Nuance Communications, Inc.

Inventors: Christel Amato, Hubert Crepy, Stephane Revelin, Claire Waast-Richard
METHOD AND APPARATUS PERTAINING TO THE PROCESSING OF SAMPLED AUDIO CONTENT USING A FAST SPEECH RECOGNITION SEARCH PROCESS

Publication number: 20080162128

Abstract: One provides (101) a plurality of frames of sampled audio content and then processes (102) that plurality of frames using a speech recognition search process that comprises, at least in part, determining whether to search each subword boundary contained within each frame on a frame-by-frame basis. These teachings will also readily accommodate determining whether to search each word boundary contained within each frame on a frame-by-frame basis.

Type: Application

Filed: December 29, 2006

Publication date: July 3, 2008

Applicant: MOTOROLA, INC.

Inventor: Yan Ming Cheng

SYSTEM AND METHOD FOR PERFORMING SPEECH SYNTHESIS WITH A CACHE OF PHONEME SEQUENCES

Searching Spoken Media According to Phonemes Derived From Expanded Concepts Expressed As Text

Systems and methods for selecting from multiple phonectic transcriptions for text-to-speech synthesis

METHOD AND APPARATUS PERTAINING TO THE PROCESSING OF SAMPLED AUDIO CONTENT USING A FAST SPEECH RECOGNITION SEARCH PROCESS