Speech Synthesis; Text To Speech Systems (epo) Patents (Class 704/E13.001)

E Subclasses

Methods for producing synthetic speech; speech synthesizers (epo) (Class 704/E13.002)

Concept-to-speech synthesizers; generation of natural phrases not from text but from machine-based concepts (EPO) (Class 704/E13.003)
Sound editing, manipulating voice of the synthesizer (EPO) (Class 704/E13.004)

Details of speech synthesis systems, e.g., synthesizer architecture, memory management, etc. (epo) (Class 704/E13.005)

Elementary speech units used in speech synthesizers; concatenation rules (epo) (Class 704/E13.009)

Concatenation (EPO) (Class 704/E13.01)

Text analysis, generation of parameters for speech synthesis out of text, e.g., grapheme to phoneme translation, prosody generation, stress, or intonation determination, etc. (epo) (Class 704/E13.011)

SPEECH SYNTHESIZING APPARATUS, METHOD, AND PROGRAM

Publication number: 20100076768

Abstract: Disclosed is a speech synthesizing apparatus including a segment selection unit that selects a segment suited to a target segment environment from candidate segments, includes a prosody change amount calculation unit that calculates prosody change amount of each candidate segment based on prosody information of candidate segments and the target segment environment, a selection criterion calculation unit that calculates a selection criterion based on the prosody change amount, a candidate selection unit that narrows down selection candidates based on the prosody change amount and the selection criterion, and an optimum segment search unit than searches for an optimum segment from among the narrowed-down candidate segments.

Type: Application

Filed: February 15, 2008

Publication date: March 25, 2010

Applicant: NEC CORPORATION

Inventors: Masanori Kato, Reishi Kondo, Yasuyuki Mitsui
Speech Enhancement Employing a Perceptual Model

Publication number: 20100076769

Abstract: Speech enhancement based on a psycho-acoustic model is disclosed that is capable of preserving the fidelity of speech while sufficiently suppressing noise including the processing artifact known as “musical noise”.

Type: Application

Filed: March 14, 2008

Publication date: March 25, 2010

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventor: Rongshan Yu
Coarticulation Method for Audio-Visual Text-to-Speech Synthesis

Publication number: 20100076762

Abstract: A method for generating animated sequences of talking heads in text-to-speech applications wherein a processor samples a plurality of frames comprising image samples. The processor reads first data comprising one or more parameters associated with noise-producing orifice images of sequences of at least three concatenated phonemes which correspond to an input stimulus. The processor reads, based on the first data. second data comprising images of a noise-producing entity. The processor generates an animated sequence of the noise-producing entity.

Type: Application

Filed: November 30, 2009

Publication date: March 25, 2010

Applicant: AT&T Corp.

Inventors: Eric Cosatto, Hans Peter Graf, Juergen Schroeter
HOME APPLIANCE SYSTEM

Publication number: 20100066554

Abstract: A home appliance system includes a home appliance outputting product information as a sound and a mobile terminal confirming the product information based on the sound. The mobile terminal can receive the sound, convert the sound into the product information and output the product information to an external user and a repairman.

Type: Application

Filed: September 1, 2009

Publication date: March 18, 2010

Inventors: Phal Jin LEE, Hoi Jin JEONG, Jong Hye HAN, Young Soo KIM, In Haeng CHO, Si Moon JEON
METHOD AND APPARATUS FOR IMPROVING TRANSACTION SUCCESS RATES FOR VOICE REMINDER APPLICATIONS IN E-COMMERCE

Publication number: 20100070282

Abstract: Methods and apparatuses are disclosed for improving transaction success rates for voice reminder applications in e-commerce. In one embodiment of the invention, the voice reminder applications in e-commerce utilizes a network-based text-to-speech (TTS) alert system, which can generate a purchase reminder associated with a recipient's potential purchase. The network-based text-to-speech (TTS) alert system can also deliver the purchase reminder to a recipient's voicemail and leave a transaction identifier number and a centralized or a recipient-specific call-back phone number to the recipient's voicemail. A recipient can utilize the transaction identifier number, the centralized or the recipient-specific call-back phone number, and optionally a recipient-specific password to make a phone call to retrieve the purchase reminder previously delivered to the recipient's voicemail by the network-based text-to-speech (TTS) alert system.

Type: Application

Filed: September 18, 2009

Publication date: March 18, 2010

Inventors: Samuel Cho, Oon-Gil Paik
MULTIMEDIA DATA TRANSFER FOR A PERSONAL COMMUNICATION DEVICE

Publication number: 20100068991

Abstract: In a portable multimedia device, data is passed between a sender and receiver unit by way of voice channel only. Multimedia data is vocalized and then forwarded to a receiver unit by way of the voice channel without the use of a backend server. Once received at the receiver unit, the vocalized data can be converted to an audio signal that can then output by way of an audio output device (such as a speaker, earphone, etc.).

Type: Application

Filed: November 23, 2009

Publication date: March 18, 2010

Applicant: APPLE INC.

Inventor: Anthony M. Fadell
Hands-Free and Non-Visually Occluding Object Information Interaction System

Publication number: 20100063821

Abstract: Technologies are described herein for providing a hands-free and non-visually occluding interaction with object information. In one method, a visual capture of a portion of an object is received through a hands-free and non-visually occluding visual capture device. An audio capture is also received from a user through a hands-free and non-visually occluding audio capture device. The audio capture may include a request for information about a portion of the object in the visual capture. The information is retrieved and is transmitted to the user for playback through a hands-free and non-visually occluding audio output device.

Type: Application

Filed: September 9, 2008

Publication date: March 11, 2010

Inventors: Joseph C. Marsh, Eric M. Smith
Speech synthesis with dynamic constraints

Publication number: 20100057467

Abstract: A method is disclosed for providing speech parameters to be used for synthesis of a speech utterance. In at least one embodiment, the method includes receiving an input time series of first speech parameter vectors, preparing at least one input time series of second speech parameter vectors consisting of dynamic speech parameters, extracting from the input time series of first and second speech parameter vectors partial time series of first speech parameter vectors and corresponding partial time series of second speech parameter vectors, converting the corresponding partial time series of first and second speech parameter vectors into partial time series of third speech parameter vectors, wherein the conversion is done independently for each set of partial time series and can be started as soon as the vectors of the input time series of the first speech parameter vectors have been received.

Type: Application

Filed: June 25, 2009

Publication date: March 4, 2010

Inventor: Johan Wouters
Information processing system, portable information terminal and its control method, information providing device and its control method, and program

Publication number: 20100057571

Abstract: A portable information terminal includes: a position detecting section that detects a position of the portable information terminal; a selecting section that selects a guidance target object for which voice guidance is to be provided, and detects a guidance target direction that is a direction in which the guidance target object exists with respect to a reference direction, on the basis of a direction in which the guidance target object exists with respect to the position of the portable information terminal; a voice synthesis section that generates a synthetic voice so that a guidance voice for the guidance target object selected by the selecting section is heard from the guidance target direction; and a voice output section that outputs the synthetic voice generated by the voice synthesis section.

Type: Application

Filed: April 29, 2008

Publication date: March 4, 2010

Applicant: Sony Corporation

Inventors: Kazuyuki Yamamoto, Toshio Mamiya, Hidetoshi Kabasawa, Katsuhiko Yamada, Takashi Yamada, Hideaki Kumagai
SYSTEM AND METHOD FOR VARIABLE TEXT-TO-SPEECH WITH MINIMIZED DISTRACTION TO OPERATOR OF AN AUTOMOTIVE VEHICLE

Publication number: 20100057464

Abstract: A text-to-speech (TTS) system implemented in an automotive vehicle is dynamically tuned to increase intelligibility over a wide variety of vehicle operating states and environmental conditions by tuning characteristics of the synthesized voice in response to measured operating states. To decrease distractions to an operator of the vehicle, an embodiment of the invention prevents updates to the synthesized voice character from taking effect while a message phrase is being played. Instead, voice characteristics are updated only during natural phrase breaks. In another embodiment of the invention, a damping filter is applied to calculated changes in voice characteristics to prevent excessively rapid changes from being applied, reducing the likelihood of distracting the vehicle operator. In another embodiment of the invention, both phrase-break detectors and damping filters are employed.

Type: Application

Filed: August 29, 2008

Publication date: March 4, 2010

Inventors: DAVID MICHAEL KIRSCH, Ritchie Winson Huang
PHONETIC NATURAL LANGUAGE TRANSLATION SYSTEM

Publication number: 20100049497

Abstract: The phonetic natural language translation system receives audio output from an electro acoustic device connected as a component in an audio system presented in a theater or auditorium, to identify any speech signal contained within the audio output. The speech signals are broken down into recognizable phonemes. The sequentially generated phonemes are then regrouped to form recognizable words in one of the 6,700 languages spoken around the world. Sentences are then formed using the grammatical rules of the recognized language so that each sentence translated into each of the audience's preferred language without any external translators. The preferred language of each audience is identified during ticket booking; an algorithm stores the audience seat number along with preferred language. The translated audio signals are distributed to an each seat's armrest such that each viewer listens and understands the foreign language audible program or speech in their own preferred language.

Type: Application

Filed: September 19, 2009

Publication date: February 25, 2010

Inventor: Johnson ("Johnson") Manuel-Devadoss ("Smith")
METHOD AND SYSTEM FOR PROVIDING META DATA FOR A WORK

Publication number: 20100036666

Abstract: A method for providing meta data for a work includes designating a file for uploading data associated therewith to a telematics unit operatively connected to a vehicle; and using meta data associated with the designed file, obtaining phonetic meta data for the designed file from an on-line service. The method further includes creating a phonetic meta data file associated with the designed file and including the obtained phonetic meta data, and transferring the phonetic metal data file to the telematics unit. Also disclosed herein is a system for providing the same.

Type: Application

Filed: August 8, 2008

Publication date: February 11, 2010

Applicant: GM GLOBAL TECHNOLOGY OPERATIONS, INC.

Inventors: Nathan D. Ampunan, Timothy J. Grost, Kevin W. Owens
AUTOMATIC SPEECH RECOGNITION SYSTEM AND METHOD FOR AIRCRAFT

Publication number: 20100030400

Abstract: A system and method which implement automatic speech recognition (ASR) and text-to-speech (TTS) programs to permit pilots, co-pilots, and other persons to more quickly and easily perform control and monitoring tasks on aircraft. The system may be used to automatically change the frequency of an aircraft radio when a pilot or co-pilot is instructed to do so by ATC.

Type: Application

Filed: July 13, 2006

Publication date: February 4, 2010

Applicant: GARMIN INTERNATIONAL, INC.

Inventors: Joseph L. Komer, Joseph E. Gepner, Charles Gregory Sherwood
TRANSPARENT VOICE REGISTRATION AND VERIFICATION METHOD AND SYSTEM

Publication number: 20100027767

Abstract: Transparent voice registration of a party is provided in order to provide voice verification for communications with a service center. Verbal communication spoken by a party during interaction between the party and an agent of the service center is captured. A voice model associated with the captured communication is created and stored in order to provide voice verification during a subsequent call to the service center. When a requester contacts the service center, a comparison of the voice of the requester and a voice model of the person that the requester claims to be is performed, in order to verify the identity of the requester. Additionally, a voice model associated with a party is automatically updated after a subsequent communication between the party and the service center.

Type: Application

Filed: July 30, 2008

Publication date: February 4, 2010

Applicant: AT&T Intellectual Property I, L.P.

Inventor: Mazin GILBERT
SEMANTIC NETWORK METHODS TO DISAMBIGUATE NATURAL LANGUAGE MEANING

Publication number: 20100030723

Abstract: A computer implemented data processor system automatically disambiguates a contextual meaning of natural language symbols to enable precise meanings to be stored for later retrieval from a natural language database, so that natural language database design is automatic, to enable flexible and efficient natural language interfaces to computers, household appliances and hand-held devices.

Type: Application

Filed: October 10, 2009

Publication date: February 4, 2010

Inventor: Lawrence Au
ASL Glove with 3-Axis Accelerometers

Publication number: 20100023314

Abstract: A sign language recognition apparatus and method is provided for translating hand gestures into speech or written text. The apparatus includes a number of 3-axis accelerometers on fingers and back of the palm to measure dynamic and static gestures, an analog multiplexer and a programmable micro controller to detect hand postures of American Sign Language and send them to a host via serial communication. The sensors are connected to a microprocessor to search a library of gestures and generate output signals that can then be used to produce a synthesized voice or written text. The apparatus includes sensors such as accelerometers on the fingers and thumb and two accelerometers on the back of the hand to detect motion and orientation of the hand. Sensors are also provided on the back of the hand or wrist to detect forearm rotation, an angle sensor to detect flexing of the elbow, two sensors on the upper arm to detect arm elevation and rotation, and a sensor on the upper arm to detect arm twist.

Type: Application

Filed: August 8, 2007

Publication date: January 28, 2010

Inventor: Jose Hernandez-Rebollar
FACILITATING TEXT-TO-SPEECH CONVERSION OF A USERNAME OR A NETWORK ADDRESS CONTAINING A USERNAME

Publication number: 20100010816

Abstract: To facilitate text-to-speech conversion of a username, a first or last name of a user associated with the username may be retrieved, and a pronunciation of the username may be determined based at least in part on whether the name forms at least part of the username. To facilitate text-to-speech conversion of a domain name having a top level domain and at least one other level domain, a pronunciation for the top level domain may be determined based at least in part upon whether the top level domain is one of a predetermined set of top level domains. Each other level domain may be searched for one or more recognized words therewithin, and a pronunciation of the other level domain may be determined based at least in part on an outcome of the search. The username and domain name may form part of a network address such as an email address, URL or URI.

Type: Application

Filed: July 11, 2008

Publication date: January 14, 2010

Inventors: Matthew Bells, Jennifer Elizabeth Lhotak, Michael Angelo Nanni
VOICE DIRECTED SYSTEM AND METHOD CONFIGURED FOR ASSURED MESSAGING TO MULTIPLE RECIPIENTS

Publication number: 20100004933

Abstract: A communications system transmits messages via a wireless network to multiple users nearly simultaneously in real-time. Each user has a terminal that receives a message and plays the message for the user. The terminal may also wait for the user to verbally acknowledge the arrival of the message before continuing with its normally executing application. The sender of the message may track, for each intended recipient, the delivery of the message, the accessing of the message by the user, and the acknowledgement by the user that the message was understood.

Type: Application

Filed: September 17, 2009

Publication date: January 7, 2010

Inventors: Lawrence R. Sweeney, James D. Maloy, Claudine Astorri, Linda Boyle
SPEECH SEPARATING APPARATUS, SPEECH SYNTHESIZING APPARATUS, AND VOICE QUALITY CONVERSION APPARATUS

Publication number: 20100004934

Abstract: A speech separating apparatus includes: a PARCOR calculating unit (102) that extracts vocal tract information from an input speech signal; a filter smoothing unit (103) that smoothes, in a first time constant, the vocal tract information extracted by the PARCOR calculating unit (102); an inverse filtering unit (104) that calculates a filter coefficient of a filter having a frequency amplitude response characteristic inverse to the vocal tract information smoothed by the filter smoothing unit (103), so as to filter the input speech signal using the filter having the calculated filter coefficient; and a voicing source modeling unit (105) that cuts out, from the input speech signal filtered by the inverse filtering unit (104), a waveform included in a second time constant shorter than the first time constant, so as to calculate, for each waveform that is taken, voicing source information from the each waveform.

Type: Application

Filed: August 6, 2008

Publication date: January 7, 2010

Inventors: Yoshifumi Hirose, Takahiro Kamai
PSAP CAPABILITIES DEFINING SYSTEM AND METHOD FOR HANDLING EMERGENCY TEXT MESSAGING

Publication number: 20100002846

Abstract: A system and method for routing an emergency data message to a PSAP may include receiving an emergency data message and cell code identifier indicative of a location of a wireless communications device of a user. A PSAP local to the user may be selected, and the emergency data message is sent to the selected PSAP. The location may be a centralized network location associated with an emergency network address, where the emergency network address is an easy address for users to remember.

Type: Application

Filed: October 24, 2008

Publication date: January 7, 2010

Inventors: Amar N. Ray, Carl M. Coppage, Lynn T. Greene, Robert J. Morrill
SYSTEM AND METHOD FOR EXTRACTION OF META DATA FROM A DIGITAL MEDIA STORAGE DEVICE FOR MEDIA SELECTION IN A VEHICLE

Publication number: 20090326949

Abstract: A method is provided for extracting meta data from a digital media storage device in a vehicle over a communication link between a control module of the vehicle and the digital media storage device. The method includes establishing a communication link between control module of the vehicle and the digital media storage device, identifying a media file on the digital media storage device, and retrieving meta data from a media file, the meta data including a plurality of entries, wherein at least one of the plurality of entries includes text data. The method further includes identifying the text data in an entry of the media file and storing the plurality of entries in a memory.

Type: Application

Filed: April 3, 2007

Publication date: December 31, 2009

Inventors: Brian L. Douthitt, Karl W. Schripsema, Michae; J. Sims
METHODS AND SYSTEMS FOR SIGHT IMPAIRED WIRELESS CAPABILITY

Publication number: 20090313021

Abstract: A method for sending data to a sight impaired user, the method comprising, receiving data from a data resource, determining whether the data is compatible with a Symbian API, transcoding the data into a first format compatible with the Symbian API, determining whether the data is compatible with a TALKS filter, transcoding the data into a second format compatible with the TALKS filter, determining whether the data is usable by a sight impaired user, transcoding the data into a third format usable by a sight impaired user responsive to determining that the data is not usable by a sight impaired user, converting a data type definition associated with the data into a format compatible with a user profile, sending the received data to a user mobile device, wherein the mobile device is operative to convert the data into an audible output.

Type: Application

Filed: June 13, 2008

Publication date: December 17, 2009

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Franco Carminati, Francesco Levantini, Giuseppe Vendramin
TEXT-TO-SPEECH USER INTERFACE CONTROL

Publication number: 20090313020

Abstract: A system and method includes a detecting computer readable text associated with a device, detecting a starting point for a text-to-speech conversion of text, beginning the text-to-speech conversion upon detection of movement of a pointing device in a direction of text flow, and controlling a rate of the text-to-speech conversion based on a rate of movement of the pointing device in relation to the text to be converted.

Type: Application

Filed: June 12, 2008

Publication date: December 17, 2009

Applicant: Nokia Corporation

Inventor: Rami Arto Koivunen
Speech Generation User Interface

Publication number: 20090313024

Abstract: A system and a method for speech generation which assist the speech of those with a disability or a medical condition such as cerebral palsy, motor neurone disease or a dysarthia following a stroke. The system has a user interface having a multiplicity of states each of which correspond to a sound and a selector for making a selection of a state or a combination of states. The system also has a processor for processing the selected state or combination of states and an audio output for outputting the sound or combination of sounds. The sounds associated with the states can be phonemes or phonics and the user interface is typically a manually operable device such as a mouse, trackball, joystick or other device that allows a user to distinguish between states by manipulating the interface to a number of positions.

Type: Application

Filed: February 1, 2007

Publication date: December 17, 2009

Applicant: The University of Dundee

Inventors: Rolf Black, Annula Waller, Eric Abel, Iain Murray, Graham Pullin
SYSTEM AND METHOD OF GRANTING IDENTIFICATION CODES TO ELECTRONIC TEACHING MATERIAL CONTENTS' SENTENCE STRUCTURES, SYSTEM AND METHOD OF SEARCHING DATA OF ELECTRONIC TEACHING MATERIAL CONTENTS, SYSTEM AND METHOD OF MANAGING POINTS OF USE AND SERVICE OF ELECTRONIC TEACHING MATERIAL CONTENTS

Publication number: 20090306968

Abstract: Disclosed is a system, which grants identification code to sentence structures of electronic teaching material contents, includes the following units. The identification code production unit distinguishes each syllable of electronic teaching material content's selected sentence structure according to type of language, and produces peculiar identification code using the first phoneme or syllable of each syllable. The identification code grant unit grants identification code to metadata of file which stores electronic teaching material contents of above.

Type: Application

Filed: August 12, 2009

Publication date: December 10, 2009

Applicants: EGC & C CO., LTD.

Inventor: Yonghwa Kim
SYSTEMS AND METHODS FOR REDUCING SPEECH INTELLIGIBILITY WHILE PRESERVING ENVIRONMENTAL SOUNDS

Publication number: 20090306988

Abstract: An audio privacy system reduces the intelligibility of speech in an audio signal while preserving prosodic information, such as pitch, relative energy and intonation so that a listener has the ability to recognize environmental sounds but not the speech itself. An audio signal is processed to separate non-vocalic information, such as pitch and relative energy of speech, from vocalic regions, after which syllables are identified within the vocalic regions. Representations of the vocalic regions are computed to produce a vocal tract transfer function and an excitation. The vocal tract transfer function for each syllable is then replaced with the vocal tract transfer function from another prerecorded vocalic sound. In one aspect, the identity of the replacement vocalic sound is independent of the identity of the syllable being replaced.

Type: Application

Filed: June 6, 2008

Publication date: December 10, 2009

Applicant: FUJI XEROX CO., LTD

Inventors: Francine Chen, John Adcock
SYSTEM AND METHOD FOR SYNTHETICALLY GENERATED SPEECH DESCRIBING MEDIA CONTENT

Publication number: 20090306985

Abstract: Disclosed herein are systems, methods, and computer readable-media for providing an automatic synthetically generated voice describing media content, the method comprising receiving one or more pieces of metadata for a primary media content, selecting at least one piece of metadata for output, and outputting the at least one piece of metadata as synthetically generated speech with the primary media content. Other aspects of the invention involve alternative output, output speech simultaneously with the primary media content, output speech during gaps in the primary media content, translate metadata in foreign language, tailor voice, accent, and language to match the metadata and/or primary media content. A user may control output via a user interface or output may be customized based on preferences in a user profile.

Type: Application

Filed: June 6, 2008

Publication date: December 10, 2009

Applicant: AT&T Labs

Inventors: Linda ROBERTS, Hong Thi Nguyen, Horst J. Schroeter
TECHNIQUES TO MANAGE VEHICLE COMMUNICATIONS

Publication number: 20090298474

Abstract: Techniques to manage vehicle communications are described. A mobile computing device may include a communication module operative to establish a first communication channel with a message server, and a second communication channel with a vehicle system, a message application module communicatively coupled to the communication module, the message application module operative to receive a user message over the first communication channel, and send an event message over the second communication channel, and a vehicle message application module communicatively coupled to the message application module, the vehicle message application module operative to receive a read control directive over the second communication channel, convert text information from the user message to mail audio information, and send the mail audio information over the second communication channel. Other embodiments are described and claimed.

Type: Application

Filed: May 30, 2008

Publication date: December 3, 2009

Applicant: Palm, Inc.

Inventor: Moses George
METHODS, APPARATUSES, AND COMPUTER PROGRAM PRODUCTS FOR PROVIDING AN AUDIBLE INTERFACE TO PUBLISH/SUBSCRIBE SERVICES

Publication number: 20090299957

Abstract: An apparatus may include a processor configured to receive content. The received content may at least partially comprise audio content. The processor may be further configured to generate an audible content posting from the received content. The processor may be additionally configured to store the generated audible content posting in a database comprising a publish/subscribe service. In some embodiments, the processor may be further configured to provide the audible content posting to remote device users via an audible interface to the publish/subscribe service.

Type: Application

Filed: June 2, 2008

Publication date: December 3, 2009

Applicant: Nokia Corporation

Inventor: Jonathan Ledlie
Messaging Gaming Patrons

Publication number: 20090299833

Abstract: The subject matter of this specification can be embodied in, among other things, a method that includes monitoring game play information transmitted from a gaming device to a server. The monitored information specifies a gaming patron's interactions with the gaming device. The method also includes recording, in a data store of the server, player information comprising the game play information for the gaming patron and a communication device identifier that specifies a communication device associated with the gaming patron. The method includes generating at the server an electronic message that includes one or more offers personalized for the gaming patron based on the recorded player information and transmitting, using a messaging interface, the electronic message to the communication device specified by the communication device identifier.

Type: Application

Filed: April 1, 2009

Publication date: December 3, 2009

Inventors: Eugene Estep, Fredrick C. Combs
SYSTEM AND METHOD OF PROVIDING A SPOKEN DIALOG INTERFACE TO A WEBSITE

Publication number: 20090292529

Abstract: Disclosed is a system and method for training a spoken dialog service component from website data. Spoken dialog service components typically include an automatic speech recognition module, a language understanding module, a dialog management module, a language generation module and a text-to-speech module. The method includes converting data from a structured database associated with a website to a structured text data set and a structured task knowledge base, extracting linguistic items from the structured database, and training a spoken dialog service component using at least one of the structured text data, the structured task knowledge base, or the linguistic items. The system includes modules configured to implement the method.

Type: Application

Filed: July 31, 2009

Publication date: November 26, 2009

Applicant: AT&T Corp.

Inventors: Srinivas Bangalore, Junlan Feng, Mazin G. Rahim
SIGNAL PROCESSING METHOD, PROCESSING APPARTUS AND VOICE DECODER

Publication number: 20090292542

Abstract: The present invention discloses a signal processing method adapted to process a synthesized signal in packet loss concealment. The method includes the following steps: receiving a good frame following a lost frame, obtaining an energy ratio of energy of a signal in the signal of the good frame signal to energy of a synthesized signal corresponding to the same time of the good frame; and adjusting the synthesized signal in accordance with the energy ratio. The present invention also discloses a signal processing apparatus and a voice decoder.

Type: Application

Filed: August 11, 2009

Publication date: November 26, 2009

Applicant: HUAWEI TECHNOLOGIES CO., LTD.

Inventors: Wuzhou ZHAN, Dongqi WANG, Yongfeng TU, Jing WANG, Qing ZHANG, Lei MIAO, Jianfeng XU, Chen HU, Yi YANG, Zhengzhong DU, Fengyan QI
Describing Elements in a Virtual World Based on Changes Since a Previous Encounter

Publication number: 20090287490

Abstract: Embodiments of the invention may be used to enhance the presentation of a virtual environment for certain users, e.g., a visually impaired user. Because users may visit, and revisit, locations within the virtual environment, the state of elements in the virtual environment may change. Accordingly, audible descriptions of an object, person or environment, may be adjusted to prevent redundant or unnecessary descriptions. For example, when the user encounters a given element a second time, rather than describe each characteristic of the element, only changes to the characteristics of the element are described.

Type: Application

Filed: May 14, 2008

Publication date: November 19, 2009

Inventors: Brian John Cragun, Zachary Adam Garbow, Christopher A. Peterson
Multilingual Administration Of Enterprise Data With User Selected Target Language Translation

Publication number: 20090271175

Abstract: Methods, systems, and computer program products are provided multilingual administration of enterprise data. Embodiments include retrieving enterprise data; extracting text from the enterprise data for rendering from a digital media file, the extracted text being in a source language; prompting a user to select a target language; receiving from the user a selection of a target language; translating the extracted text in the source language to translated text in the target language; converting the translated text to synthesized speech in the target language; and recording the synthesized speech in the target language in a digital media file.

Type: Application

Filed: April 24, 2008

Publication date: October 29, 2009

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: William K. Bodin, David Jaramillo, Ann Marie Maynard
TONE DETECTION FOR SIGNALS SENT THROUGH A VOCODER

Publication number: 20090265173

Abstract: A tone detector and associated method for use with EVRC-B and GSM vocoders to enable reliable detection of system connect tones over a wireless communication system. The tone detection method examines a number of sequential data frames of the signal received from the vocoder and determines that the tone is present if the spectral energy at frequencies around the tone is much higher than that at neighboring frequencies and if the calculated center frequency of the data frames is at or near the frequency of the tone.

Type: Application

Filed: April 18, 2008

Publication date: October 22, 2009

Applicant: GENERAL MOTORS CORPORATION

Inventors: Sethu K. Madhavan, Jijun Yin, Qin Jiang, Darrel James Van Buer
PLAYBACK OF MULTIMEDIA DURING MULTI-WAY COMMUNICATIONS

Publication number: 20090265022

Abstract: Multimedia playback technique embodiments are presented which facilitate the playback of an arbitrary media recording during a multi-party communication over a real-time multi-way communication system via a user's communication device. The recorded media can be interjected into a multi-party communication on a real time basis. This is generally accomplished by the media recording being inserted into a media stream being processed by the user's communication device as part of the multi-party communication. This can be done by either replacing a portion of the media stream with the media recording or mixing the media recording with a portion of the media stream. Once inserted, the media recording is transmitted as part of the media stream to a least one other party to the communication.

Type: Application

Filed: June 24, 2008

Publication date: October 22, 2009

Applicant: Microsoft Corporation

Inventors: Darko Kirovski, Ydo Wexler, Christopher A. Meek
SYSTEM AND METHOD FOR ANSWERING A COMMUNICATION NOTIFICATION

Publication number: 20090259472

Abstract: Disclosed herein are systems, methods, and computer readable-media for answering a communication notification. The method for answering a communication notification comprises receiving a notification of communication from a user, converting information related to the notification to speech, outputting the information as speech to the user, and receiving from the user an instruction to accept or ignore the incoming communication associated with the notification. In one embodiment, information related to the notification comprises one or more of a telephone number, an area code, a geographic origin of the request, caller id, a voice message, address book information, a text message, an email, a subject line, an importance level, a photograph, a video clip, metadata, an IP address, or a domain name. Another embodiment involves notification assigned an importance level and repeat attempts at notification if it is of high importance.

Type: Application

Filed: April 14, 2008

Publication date: October 15, 2009

Applicant: AT& T Labs

Inventor: Horst Schroeter
Intelligent Text-to-Speech Conversion

Publication number: 20090254345

Abstract: Techniques for improved text-to-speech processing are disclosed. The improved text-to-speech processing can convert text from an electronic document into an audio output that includes speech associated with the text as well as audio contextual cues. One aspect provides audio contextual cues to the listener when outputting speech (spoken text) pertaining to a document. The audio contextual cues can be based on an analysis of a document prior to a text-to-speech conversion. Another aspect can produce an audio summary for a file. The audio summary for a document can thereafter be presented to a user so that the user can hear a summary of the document without having to process the document to produce its spoken text via text-to-speech conversion.

Type: Application

Filed: April 5, 2008

Publication date: October 8, 2009

Inventors: Christopher Brian Fleizach, Reginald Dean Hudson
Vocal Alert Unit Having Automatic Situation Awareness

Publication number: 20090248398

Abstract: A system and method for instructing dynamic nodes in a dynamically changing mobile network how to maneuver. A receiver receives situation data indicative of a respective situation of each dynamic node in space and a situation unit coupled to the receiver determines the respective situation of each dynamic node. An analysis unit coupled to the situation unit analyzes the respective situation of each dynamic node in combination with specified criteria to generate respective situation awareness date for each dynamic node. A dynamic selector unit coupled to the analysis unit determines from the respective situation awareness data appropriate action to be performed by each node; and a communication unit coupled to the dynamic selector unit conveys to the respective dynamic node command data to permit rendering of a personalized command for informing the respective node of appropriate action to be carried out thereby.

Type: Application

Filed: September 13, 2006

Publication date: October 1, 2009

Applicant: Elta Systems Ltd

Inventors: Moshe Aviran, Alexander Zussman
INTERACTIVE MARKETING SYSTEM

Publication number: 20090240567

Abstract: The present invention is a system which enables a marketing team to initiate and sustain directed and interactive communication with thousands or millions of existing or prospective customers. In the preferred embodiment, a marketer accesses a database and selects a group of qualified prospects, using lifestyle dimensions and or demographic information, from a stable group of prospective or existing customers conducting financial transactions online. Once a prospect list is selected, the marketer designs a series of questions, typically using branch and skip logic, and the system deploys the question sequence to the target list in the form of a response-redeemable savings coupon. When prospects are next performing their financial transactions online, they are presented with a lifestyle-relevant coupon which is immediately redeemable by responding to the question/communication, therewith lowering the respondent's bill instantaneously upon response, as in the case of online bill payment.

Type: Application

Filed: February 23, 2009

Publication date: September 24, 2009

Applicant: Micronotes, LLC

Inventors: Devon Kinkead, Merritt W. Mayher, Charla Jones, Venkat Rangamani
METHOD AND SYSTEM FOR EVENT REMINDER USING AN EARPIECE

Publication number: 20090238386

Abstract: A method for administering an audio message to a user of an earpiece can include receiving event information from a paired communication device, updating a personal event calendar by ordering event information to generate a first event list, generating a modified event list by grouping events in the first event list according to acceptance criteria based on event priority of event types, and generating an audio token for collective events in the modified event list for audible delivery to the ear canal. Events can be ordered by event name, event location, event data, event importance, event invitees, or event category. Other embodiments are disclosed.

Type: Application

Filed: December 23, 2008

Publication date: September 24, 2009

Applicant: Personics Holding, Inc

Inventors: John Usher, Steven Goldstein
VOICE SYNTHESIS DEVICE

Publication number: 20090234652

Abstract: The voice synthesis device includes: an emotion input unit (202) which obtains an utterance mode of a voice waveform for which voice synthesis is to be performed; a prosody generation unit (205) which generate a prosody which is used when a language-processed text is uttered in the obtained utterance mode; a characteristic tone selection unit (203) which selects a characteristic tone based on the utterance mode, the characteristic tone is observed when the text is uttered in the obtained utterance mode: a characteristic tone temporal position estimation unit (604) which (i) judges whether or not each of phonemes included in a phonologic sequence of the text is to be uttered with the characteristic tone, based on the phonologic sequence, the characteristic tone, and the prosody, and (ii) decide a phoneme which is an utterance position where the text is uttered with the characteristic tone: and an element selection unit (606) and an element connection unit (209) which generates the voice waveform based on the p

Type: Application

Filed: May 2, 2006

Publication date: September 17, 2009

Inventors: Yumiko Kato, Takahiro Kamai
COMMUNICATION DEVICE AND METHOD OF PROCESSING TEXT MESSAGE IN THE COMMUNICATION DEVICE

Publication number: 20090228278

Abstract: The application discloses a communication device and method of processing a text message in the communication device. An aspect of the present application is a method of processing text message in a communication device, the method including receiving a text message from an external sender, receiving a request to transform the text message into voice data, transforming the received text message into voice data according to the request, and transmitting the voice data to an external sound reproduction device through a wireless communication module.

Type: Application

Filed: March 9, 2009

Publication date: September 10, 2009

Inventors: Ji Young Huh, Sun Ryang Kim, Woong Chang Kim
SENTENCE READING ALOUD APPARATUS, CONTROL METHOD FOR CONTROLLING THE SAME, AND CONTROL PROGRAM FOR CONTROLLING THE SAME

Publication number: 20090222269

Abstract: An apparatus for voice synthesis includes: a word database for storing words and voices; a syllable database for storing syllables and voices; a processor for executing a process including: extracting a word from a document, generating a voice signal based on the extracted voice when the extracted word is included in the word database synthesizing a voice signal based on the extracted voice associated with the one or more syllables corresponding to the extracted word when the extracted word is not found in the word database; a speaker for producing a voice based on either of the generated and the synthesized voice signal; and a display for selectively displaying the extracted word when the voice based on the synthesized voice signal is produced by the speaker.

Type: Application

Filed: May 11, 2009

Publication date: September 3, 2009

Inventor: Shinichiro MORI
IMAGE PROCESSING APPARATUS, IMAGE PROCESSING METHOD AND RECORDING MEDIUM

Publication number: 20090216536

Abstract: An image processing apparatus comprises an image data input portion that inputs image data and a text data input portion that inputs text data. The text data inputted by the text data input portion is converted into voice data by a voice data converter, and this obtained voice data and the image data inputted by the image data input portion are connected to each other by a connector, and then a file including the voice data and the image data connected to each other is created.

Type: Application

Filed: February 18, 2009

Publication date: August 27, 2009

Applicant: KONICA MINOLTA BUSINESS TECHNOLOGIES, INC.

Inventors: Kenji Matsuhara, Hiroaki Kubo, Nobuhiro Mishima, Kazuo Inui
COMMUNICATION SYSTEM FOR BUILDING SPEECH DATABASE FOR SPEECH SYNTHESIS, RELAY DEVICE THEREFOR, AND RELAY METHOD THEREFOR

Publication number: 20090210221

Abstract: A relay device 20 duplicates speech data received from a communication terminal that is engaged in voice communication with another communication terminal. The duplicated speech data is transmitted to and is stored at a media processing device 40. Media processing device 40 builds a database for speech synthesis based on the stored speech data.

Type: Application

Filed: February 19, 2009

Publication date: August 20, 2009

Inventors: Shin-ichi Isobe, Takuji Sakaguchi, Motoshi Tamura, Masami Yabusaki
INTERACTIVE DOLL OR STUFFED ANIMAL

Publication number: 20090209170

Abstract: A toy has a body in the form of a doll or a living being, an audio output to reproduce audio effects, a memory to provide audio data, a controller to control the reproduction of different audio effects based on the reading of the audio data, and a sensor for generating detection signals correlated with the proximity of an object to the toy. The controller is designed in such a way that a selection of the reproduced audio effects depends on an evaluation of the detection signals and of signals generated by a provided interface device that is connected through devices in the play accessories being fed to that controller, and/or being made available for access.

Type: Application

Filed: February 6, 2009

Publication date: August 20, 2009

Inventor: Wolfgang RICHTER
METHOD AND APPARATUS FOR CONTROLLING PLAY OF AN AUDIO SIGNAL

Publication number: 20090204404

Abstract: Apparatus and methods conforming to the present invention comprise a method of controlling playback of an audio signal through analysis of a corresponding close caption signal in conjunction with analysis of the corresponding audio signal. Objection text or other specified text in the close caption signal is identified through comparison with user identified objectionable text. Upon identification of the objectionable text, the audio signal is analyzed to identify the audio portion corresponding to the objectionable text. Upon identification of the audio portion, the audio signal may be controlled to mute the audible objectionable text.

Type: Application

Filed: April 20, 2009

Publication date: August 13, 2009

Applicant: ClearPlay Inc.

Inventors: Matthew T. Jarman, William S. Meisel
SPEECH GENERATING MEANS FOR USE WITH SIGNAL SENSORS

Publication number: 20090204403

Abstract: An apparatus includes receiving circuitry for receiving a signal; and a speech module for converting the signal into speech.

Type: Application

Filed: February 11, 2009

Publication date: August 13, 2009

Applicant: OMEGA ENGINEERING, INC.

Inventors: Milton B. Hollander, Shahin Baghai, Feng Liu
SYSTEM AND METHOD FOR EMAIL NOTIFICATION

Publication number: 20090204680

Abstract: Email subscribers are notified of the receipt of new email messages when they are not at their computers via voice or page. An email notification server polls the email server corresponding to the subscriber's email account for the presence of new email messages. New email messages are obtained. Header information is extracted. If new email notification is by voicemail, the extracted header information is converted from text to voice. A voicemail message containing the extracted header information is saved on the voicemail system corresponding to the subscriber for whom the email message was intended. The email notification server can also send a page to notify the subscriber of the presence of new email.

Type: Application

Filed: April 15, 2009

Publication date: August 13, 2009

Applicant: AT&T Intellectual Property I, L.P.

Inventor: Mark Kirkpatrick

prev 1 2 3 4 5 6 7 8 next