Speech Synthesis; Text To Speech Systems (epo) Patents (Class 704/E13.001)
  • Publication number: 20100076768
    Abstract: Disclosed is a speech synthesizing apparatus including a segment selection unit that selects a segment suited to a target segment environment from candidate segments, includes a prosody change amount calculation unit that calculates prosody change amount of each candidate segment based on prosody information of candidate segments and the target segment environment, a selection criterion calculation unit that calculates a selection criterion based on the prosody change amount, a candidate selection unit that narrows down selection candidates based on the prosody change amount and the selection criterion, and an optimum segment search unit than searches for an optimum segment from among the narrowed-down candidate segments.
    Type: Application
    Filed: February 15, 2008
    Publication date: March 25, 2010
    Applicant: NEC CORPORATION
    Inventors: Masanori Kato, Reishi Kondo, Yasuyuki Mitsui
  • Publication number: 20100076769
    Abstract: Speech enhancement based on a psycho-acoustic model is disclosed that is capable of preserving the fidelity of speech while sufficiently suppressing noise including the processing artifact known as “musical noise”.
    Type: Application
    Filed: March 14, 2008
    Publication date: March 25, 2010
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventor: Rongshan Yu
  • Publication number: 20100076762
    Abstract: A method for generating animated sequences of talking heads in text-to-speech applications wherein a processor samples a plurality of frames comprising image samples. The processor reads first data comprising one or more parameters associated with noise-producing orifice images of sequences of at least three concatenated phonemes which correspond to an input stimulus. The processor reads, based on the first data. second data comprising images of a noise-producing entity. The processor generates an animated sequence of the noise-producing entity.
    Type: Application
    Filed: November 30, 2009
    Publication date: March 25, 2010
    Applicant: AT&T Corp.
    Inventors: Eric Cosatto, Hans Peter Graf, Juergen Schroeter
  • Publication number: 20100066554
    Abstract: A home appliance system includes a home appliance outputting product information as a sound and a mobile terminal confirming the product information based on the sound. The mobile terminal can receive the sound, convert the sound into the product information and output the product information to an external user and a repairman.
    Type: Application
    Filed: September 1, 2009
    Publication date: March 18, 2010
    Inventors: Phal Jin LEE, Hoi Jin JEONG, Jong Hye HAN, Young Soo KIM, In Haeng CHO, Si Moon JEON
  • Publication number: 20100070282
    Abstract: Methods and apparatuses are disclosed for improving transaction success rates for voice reminder applications in e-commerce. In one embodiment of the invention, the voice reminder applications in e-commerce utilizes a network-based text-to-speech (TTS) alert system, which can generate a purchase reminder associated with a recipient's potential purchase. The network-based text-to-speech (TTS) alert system can also deliver the purchase reminder to a recipient's voicemail and leave a transaction identifier number and a centralized or a recipient-specific call-back phone number to the recipient's voicemail. A recipient can utilize the transaction identifier number, the centralized or the recipient-specific call-back phone number, and optionally a recipient-specific password to make a phone call to retrieve the purchase reminder previously delivered to the recipient's voicemail by the network-based text-to-speech (TTS) alert system.
    Type: Application
    Filed: September 18, 2009
    Publication date: March 18, 2010
    Inventors: Samuel Cho, Oon-Gil Paik
  • Publication number: 20100068991
    Abstract: In a portable multimedia device, data is passed between a sender and receiver unit by way of voice channel only. Multimedia data is vocalized and then forwarded to a receiver unit by way of the voice channel without the use of a backend server. Once received at the receiver unit, the vocalized data can be converted to an audio signal that can then output by way of an audio output device (such as a speaker, earphone, etc.).
    Type: Application
    Filed: November 23, 2009
    Publication date: March 18, 2010
    Applicant: APPLE INC.
    Inventor: Anthony M. Fadell
  • Publication number: 20100063821
    Abstract: Technologies are described herein for providing a hands-free and non-visually occluding interaction with object information. In one method, a visual capture of a portion of an object is received through a hands-free and non-visually occluding visual capture device. An audio capture is also received from a user through a hands-free and non-visually occluding audio capture device. The audio capture may include a request for information about a portion of the object in the visual capture. The information is retrieved and is transmitted to the user for playback through a hands-free and non-visually occluding audio output device.
    Type: Application
    Filed: September 9, 2008
    Publication date: March 11, 2010
    Inventors: Joseph C. Marsh, Eric M. Smith
  • Publication number: 20100057467
    Abstract: A method is disclosed for providing speech parameters to be used for synthesis of a speech utterance. In at least one embodiment, the method includes receiving an input time series of first speech parameter vectors, preparing at least one input time series of second speech parameter vectors consisting of dynamic speech parameters, extracting from the input time series of first and second speech parameter vectors partial time series of first speech parameter vectors and corresponding partial time series of second speech parameter vectors, converting the corresponding partial time series of first and second speech parameter vectors into partial time series of third speech parameter vectors, wherein the conversion is done independently for each set of partial time series and can be started as soon as the vectors of the input time series of the first speech parameter vectors have been received.
    Type: Application
    Filed: June 25, 2009
    Publication date: March 4, 2010
    Inventor: Johan Wouters
  • Publication number: 20100057571
    Abstract: A portable information terminal includes: a position detecting section that detects a position of the portable information terminal; a selecting section that selects a guidance target object for which voice guidance is to be provided, and detects a guidance target direction that is a direction in which the guidance target object exists with respect to a reference direction, on the basis of a direction in which the guidance target object exists with respect to the position of the portable information terminal; a voice synthesis section that generates a synthetic voice so that a guidance voice for the guidance target object selected by the selecting section is heard from the guidance target direction; and a voice output section that outputs the synthetic voice generated by the voice synthesis section.
    Type: Application
    Filed: April 29, 2008
    Publication date: March 4, 2010
    Applicant: Sony Corporation
    Inventors: Kazuyuki Yamamoto, Toshio Mamiya, Hidetoshi Kabasawa, Katsuhiko Yamada, Takashi Yamada, Hideaki Kumagai
  • Publication number: 20100057464
    Abstract: A text-to-speech (TTS) system implemented in an automotive vehicle is dynamically tuned to increase intelligibility over a wide variety of vehicle operating states and environmental conditions by tuning characteristics of the synthesized voice in response to measured operating states. To decrease distractions to an operator of the vehicle, an embodiment of the invention prevents updates to the synthesized voice character from taking effect while a message phrase is being played. Instead, voice characteristics are updated only during natural phrase breaks. In another embodiment of the invention, a damping filter is applied to calculated changes in voice characteristics to prevent excessively rapid changes from being applied, reducing the likelihood of distracting the vehicle operator. In another embodiment of the invention, both phrase-break detectors and damping filters are employed.
    Type: Application
    Filed: August 29, 2008
    Publication date: March 4, 2010
    Inventors: DAVID MICHAEL KIRSCH, Ritchie Winson Huang
  • Publication number: 20100049497
    Abstract: The phonetic natural language translation system receives audio output from an electro acoustic device connected as a component in an audio system presented in a theater or auditorium, to identify any speech signal contained within the audio output. The speech signals are broken down into recognizable phonemes. The sequentially generated phonemes are then regrouped to form recognizable words in one of the 6,700 languages spoken around the world. Sentences are then formed using the grammatical rules of the recognized language so that each sentence translated into each of the audience's preferred language without any external translators. The preferred language of each audience is identified during ticket booking; an algorithm stores the audience seat number along with preferred language. The translated audio signals are distributed to an each seat's armrest such that each viewer listens and understands the foreign language audible program or speech in their own preferred language.
    Type: Application
    Filed: September 19, 2009
    Publication date: February 25, 2010
    Inventor: Johnson ("Johnson") Manuel-Devadoss ("Smith")
  • Publication number: 20100036666
    Abstract: A method for providing meta data for a work includes designating a file for uploading data associated therewith to a telematics unit operatively connected to a vehicle; and using meta data associated with the designed file, obtaining phonetic meta data for the designed file from an on-line service. The method further includes creating a phonetic meta data file associated with the designed file and including the obtained phonetic meta data, and transferring the phonetic metal data file to the telematics unit. Also disclosed herein is a system for providing the same.
    Type: Application
    Filed: August 8, 2008
    Publication date: February 11, 2010
    Applicant: GM GLOBAL TECHNOLOGY OPERATIONS, INC.
    Inventors: Nathan D. Ampunan, Timothy J. Grost, Kevin W. Owens
  • Publication number: 20100030400
    Abstract: A system and method which implement automatic speech recognition (ASR) and text-to-speech (TTS) programs to permit pilots, co-pilots, and other persons to more quickly and easily perform control and monitoring tasks on aircraft. The system may be used to automatically change the frequency of an aircraft radio when a pilot or co-pilot is instructed to do so by ATC.
    Type: Application
    Filed: July 13, 2006
    Publication date: February 4, 2010
    Applicant: GARMIN INTERNATIONAL, INC.
    Inventors: Joseph L. Komer, Joseph E. Gepner, Charles Gregory Sherwood
  • Publication number: 20100027767
    Abstract: Transparent voice registration of a party is provided in order to provide voice verification for communications with a service center. Verbal communication spoken by a party during interaction between the party and an agent of the service center is captured. A voice model associated with the captured communication is created and stored in order to provide voice verification during a subsequent call to the service center. When a requester contacts the service center, a comparison of the voice of the requester and a voice model of the person that the requester claims to be is performed, in order to verify the identity of the requester. Additionally, a voice model associated with a party is automatically updated after a subsequent communication between the party and the service center.
    Type: Application
    Filed: July 30, 2008
    Publication date: February 4, 2010
    Applicant: AT&T Intellectual Property I, L.P.
    Inventor: Mazin GILBERT
  • Publication number: 20100030723
    Abstract: A computer implemented data processor system automatically disambiguates a contextual meaning of natural language symbols to enable precise meanings to be stored for later retrieval from a natural language database, so that natural language database design is automatic, to enable flexible and efficient natural language interfaces to computers, household appliances and hand-held devices.
    Type: Application
    Filed: October 10, 2009
    Publication date: February 4, 2010
    Inventor: Lawrence Au
  • Publication number: 20100023314
    Abstract: A sign language recognition apparatus and method is provided for translating hand gestures into speech or written text. The apparatus includes a number of 3-axis accelerometers on fingers and back of the palm to measure dynamic and static gestures, an analog multiplexer and a programmable micro controller to detect hand postures of American Sign Language and send them to a host via serial communication. The sensors are connected to a microprocessor to search a library of gestures and generate output signals that can then be used to produce a synthesized voice or written text. The apparatus includes sensors such as accelerometers on the fingers and thumb and two accelerometers on the back of the hand to detect motion and orientation of the hand. Sensors are also provided on the back of the hand or wrist to detect forearm rotation, an angle sensor to detect flexing of the elbow, two sensors on the upper arm to detect arm elevation and rotation, and a sensor on the upper arm to detect arm twist.
    Type: Application
    Filed: August 8, 2007
    Publication date: January 28, 2010
    Inventor: Jose Hernandez-Rebollar
  • Publication number: 20100010816
    Abstract: To facilitate text-to-speech conversion of a username, a first or last name of a user associated with the username may be retrieved, and a pronunciation of the username may be determined based at least in part on whether the name forms at least part of the username. To facilitate text-to-speech conversion of a domain name having a top level domain and at least one other level domain, a pronunciation for the top level domain may be determined based at least in part upon whether the top level domain is one of a predetermined set of top level domains. Each other level domain may be searched for one or more recognized words therewithin, and a pronunciation of the other level domain may be determined based at least in part on an outcome of the search. The username and domain name may form part of a network address such as an email address, URL or URI.
    Type: Application
    Filed: July 11, 2008
    Publication date: January 14, 2010
    Inventors: Matthew Bells, Jennifer Elizabeth Lhotak, Michael Angelo Nanni
  • Publication number: 20100004933
    Abstract: A communications system transmits messages via a wireless network to multiple users nearly simultaneously in real-time. Each user has a terminal that receives a message and plays the message for the user. The terminal may also wait for the user to verbally acknowledge the arrival of the message before continuing with its normally executing application. The sender of the message may track, for each intended recipient, the delivery of the message, the accessing of the message by the user, and the acknowledgement by the user that the message was understood.
    Type: Application
    Filed: September 17, 2009
    Publication date: January 7, 2010
    Inventors: Lawrence R. Sweeney, James D. Maloy, Claudine Astorri, Linda Boyle
  • Publication number: 20100004934
    Abstract: A speech separating apparatus includes: a PARCOR calculating unit (102) that extracts vocal tract information from an input speech signal; a filter smoothing unit (103) that smoothes, in a first time constant, the vocal tract information extracted by the PARCOR calculating unit (102); an inverse filtering unit (104) that calculates a filter coefficient of a filter having a frequency amplitude response characteristic inverse to the vocal tract information smoothed by the filter smoothing unit (103), so as to filter the input speech signal using the filter having the calculated filter coefficient; and a voicing source modeling unit (105) that cuts out, from the input speech signal filtered by the inverse filtering unit (104), a waveform included in a second time constant shorter than the first time constant, so as to calculate, for each waveform that is taken, voicing source information from the each waveform.
    Type: Application
    Filed: August 6, 2008
    Publication date: January 7, 2010
    Inventors: Yoshifumi Hirose, Takahiro Kamai
  • Publication number: 20100002846
    Abstract: A system and method for routing an emergency data message to a PSAP may include receiving an emergency data message and cell code identifier indicative of a location of a wireless communications device of a user. A PSAP local to the user may be selected, and the emergency data message is sent to the selected PSAP. The location may be a centralized network location associated with an emergency network address, where the emergency network address is an easy address for users to remember.
    Type: Application
    Filed: October 24, 2008
    Publication date: January 7, 2010
    Inventors: Amar N. Ray, Carl M. Coppage, Lynn T. Greene, Robert J. Morrill
  • Publication number: 20090326949
    Abstract: A method is provided for extracting meta data from a digital media storage device in a vehicle over a communication link between a control module of the vehicle and the digital media storage device. The method includes establishing a communication link between control module of the vehicle and the digital media storage device, identifying a media file on the digital media storage device, and retrieving meta data from a media file, the meta data including a plurality of entries, wherein at least one of the plurality of entries includes text data. The method further includes identifying the text data in an entry of the media file and storing the plurality of entries in a memory.
    Type: Application
    Filed: April 3, 2007
    Publication date: December 31, 2009
    Inventors: Brian L. Douthitt, Karl W. Schripsema, Michae; J. Sims
  • Publication number: 20090313021
    Abstract: A method for sending data to a sight impaired user, the method comprising, receiving data from a data resource, determining whether the data is compatible with a Symbian API, transcoding the data into a first format compatible with the Symbian API, determining whether the data is compatible with a TALKS filter, transcoding the data into a second format compatible with the TALKS filter, determining whether the data is usable by a sight impaired user, transcoding the data into a third format usable by a sight impaired user responsive to determining that the data is not usable by a sight impaired user, converting a data type definition associated with the data into a format compatible with a user profile, sending the received data to a user mobile device, wherein the mobile device is operative to convert the data into an audible output.
    Type: Application
    Filed: June 13, 2008
    Publication date: December 17, 2009
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Franco Carminati, Francesco Levantini, Giuseppe Vendramin
  • Publication number: 20090313020
    Abstract: A system and method includes a detecting computer readable text associated with a device, detecting a starting point for a text-to-speech conversion of text, beginning the text-to-speech conversion upon detection of movement of a pointing device in a direction of text flow, and controlling a rate of the text-to-speech conversion based on a rate of movement of the pointing device in relation to the text to be converted.
    Type: Application
    Filed: June 12, 2008
    Publication date: December 17, 2009
    Applicant: Nokia Corporation
    Inventor: Rami Arto Koivunen
  • Publication number: 20090313024
    Abstract: A system and a method for speech generation which assist the speech of those with a disability or a medical condition such as cerebral palsy, motor neurone disease or a dysarthia following a stroke. The system has a user interface having a multiplicity of states each of which correspond to a sound and a selector for making a selection of a state or a combination of states. The system also has a processor for processing the selected state or combination of states and an audio output for outputting the sound or combination of sounds. The sounds associated with the states can be phonemes or phonics and the user interface is typically a manually operable device such as a mouse, trackball, joystick or other device that allows a user to distinguish between states by manipulating the interface to a number of positions.
    Type: Application
    Filed: February 1, 2007
    Publication date: December 17, 2009
    Applicant: The University of Dundee
    Inventors: Rolf Black, Annula Waller, Eric Abel, Iain Murray, Graham Pullin
  • Publication number: 20090306968
    Abstract: Disclosed is a system, which grants identification code to sentence structures of electronic teaching material contents, includes the following units. The identification code production unit distinguishes each syllable of electronic teaching material content's selected sentence structure according to type of language, and produces peculiar identification code using the first phoneme or syllable of each syllable. The identification code grant unit grants identification code to metadata of file which stores electronic teaching material contents of above.
    Type: Application
    Filed: August 12, 2009
    Publication date: December 10, 2009
    Applicants: EGC & C CO., LTD.
    Inventor: Yonghwa Kim
  • Publication number: 20090306988
    Abstract: An audio privacy system reduces the intelligibility of speech in an audio signal while preserving prosodic information, such as pitch, relative energy and intonation so that a listener has the ability to recognize environmental sounds but not the speech itself. An audio signal is processed to separate non-vocalic information, such as pitch and relative energy of speech, from vocalic regions, after which syllables are identified within the vocalic regions. Representations of the vocalic regions are computed to produce a vocal tract transfer function and an excitation. The vocal tract transfer function for each syllable is then replaced with the vocal tract transfer function from another prerecorded vocalic sound. In one aspect, the identity of the replacement vocalic sound is independent of the identity of the syllable being replaced.
    Type: Application
    Filed: June 6, 2008
    Publication date: December 10, 2009
    Applicant: FUJI XEROX CO., LTD
    Inventors: Francine Chen, John Adcock
  • Publication number: 20090306985
    Abstract: Disclosed herein are systems, methods, and computer readable-media for providing an automatic synthetically generated voice describing media content, the method comprising receiving one or more pieces of metadata for a primary media content, selecting at least one piece of metadata for output, and outputting the at least one piece of metadata as synthetically generated speech with the primary media content. Other aspects of the invention involve alternative output, output speech simultaneously with the primary media content, output speech during gaps in the primary media content, translate metadata in foreign language, tailor voice, accent, and language to match the metadata and/or primary media content. A user may control output via a user interface or output may be customized based on preferences in a user profile.
    Type: Application
    Filed: June 6, 2008
    Publication date: December 10, 2009
    Applicant: AT&T Labs
    Inventors: Linda ROBERTS, Hong Thi Nguyen, Horst J. Schroeter
  • Publication number: 20090298474
    Abstract: Techniques to manage vehicle communications are described. A mobile computing device may include a communication module operative to establish a first communication channel with a message server, and a second communication channel with a vehicle system, a message application module communicatively coupled to the communication module, the message application module operative to receive a user message over the first communication channel, and send an event message over the second communication channel, and a vehicle message application module communicatively coupled to the message application module, the vehicle message application module operative to receive a read control directive over the second communication channel, convert text information from the user message to mail audio information, and send the mail audio information over the second communication channel. Other embodiments are described and claimed.
    Type: Application
    Filed: May 30, 2008
    Publication date: December 3, 2009
    Applicant: Palm, Inc.
    Inventor: Moses George
  • Publication number: 20090299957
    Abstract: An apparatus may include a processor configured to receive content. The received content may at least partially comprise audio content. The processor may be further configured to generate an audible content posting from the received content. The processor may be additionally configured to store the generated audible content posting in a database comprising a publish/subscribe service. In some embodiments, the processor may be further configured to provide the audible content posting to remote device users via an audible interface to the publish/subscribe service.
    Type: Application
    Filed: June 2, 2008
    Publication date: December 3, 2009
    Applicant: Nokia Corporation
    Inventor: Jonathan Ledlie
  • Publication number: 20090299833
    Abstract: The subject matter of this specification can be embodied in, among other things, a method that includes monitoring game play information transmitted from a gaming device to a server. The monitored information specifies a gaming patron's interactions with the gaming device. The method also includes recording, in a data store of the server, player information comprising the game play information for the gaming patron and a communication device identifier that specifies a communication device associated with the gaming patron. The method includes generating at the server an electronic message that includes one or more offers personalized for the gaming patron based on the recorded player information and transmitting, using a messaging interface, the electronic message to the communication device specified by the communication device identifier.
    Type: Application
    Filed: April 1, 2009
    Publication date: December 3, 2009
    Inventors: Eugene Estep, Fredrick C. Combs
  • Publication number: 20090292529
    Abstract: Disclosed is a system and method for training a spoken dialog service component from website data. Spoken dialog service components typically include an automatic speech recognition module, a language understanding module, a dialog management module, a language generation module and a text-to-speech module. The method includes converting data from a structured database associated with a website to a structured text data set and a structured task knowledge base, extracting linguistic items from the structured database, and training a spoken dialog service component using at least one of the structured text data, the structured task knowledge base, or the linguistic items. The system includes modules configured to implement the method.
    Type: Application
    Filed: July 31, 2009
    Publication date: November 26, 2009
    Applicant: AT&T Corp.
    Inventors: Srinivas Bangalore, Junlan Feng, Mazin G. Rahim
  • Publication number: 20090292542
    Abstract: The present invention discloses a signal processing method adapted to process a synthesized signal in packet loss concealment. The method includes the following steps: receiving a good frame following a lost frame, obtaining an energy ratio of energy of a signal in the signal of the good frame signal to energy of a synthesized signal corresponding to the same time of the good frame; and adjusting the synthesized signal in accordance with the energy ratio. The present invention also discloses a signal processing apparatus and a voice decoder.
    Type: Application
    Filed: August 11, 2009
    Publication date: November 26, 2009
    Applicant: HUAWEI TECHNOLOGIES CO., LTD.
    Inventors: Wuzhou ZHAN, Dongqi WANG, Yongfeng TU, Jing WANG, Qing ZHANG, Lei MIAO, Jianfeng XU, Chen HU, Yi YANG, Zhengzhong DU, Fengyan QI
  • Publication number: 20090287490
    Abstract: Embodiments of the invention may be used to enhance the presentation of a virtual environment for certain users, e.g., a visually impaired user. Because users may visit, and revisit, locations within the virtual environment, the state of elements in the virtual environment may change. Accordingly, audible descriptions of an object, person or environment, may be adjusted to prevent redundant or unnecessary descriptions. For example, when the user encounters a given element a second time, rather than describe each characteristic of the element, only changes to the characteristics of the element are described.
    Type: Application
    Filed: May 14, 2008
    Publication date: November 19, 2009
    Inventors: Brian John Cragun, Zachary Adam Garbow, Christopher A. Peterson
  • Publication number: 20090271175
    Abstract: Methods, systems, and computer program products are provided multilingual administration of enterprise data. Embodiments include retrieving enterprise data; extracting text from the enterprise data for rendering from a digital media file, the extracted text being in a source language; prompting a user to select a target language; receiving from the user a selection of a target language; translating the extracted text in the source language to translated text in the target language; converting the translated text to synthesized speech in the target language; and recording the synthesized speech in the target language in a digital media file.
    Type: Application
    Filed: April 24, 2008
    Publication date: October 29, 2009
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: William K. Bodin, David Jaramillo, Ann Marie Maynard
  • Publication number: 20090265173
    Abstract: A tone detector and associated method for use with EVRC-B and GSM vocoders to enable reliable detection of system connect tones over a wireless communication system. The tone detection method examines a number of sequential data frames of the signal received from the vocoder and determines that the tone is present if the spectral energy at frequencies around the tone is much higher than that at neighboring frequencies and if the calculated center frequency of the data frames is at or near the frequency of the tone.
    Type: Application
    Filed: April 18, 2008
    Publication date: October 22, 2009
    Applicant: GENERAL MOTORS CORPORATION
    Inventors: Sethu K. Madhavan, Jijun Yin, Qin Jiang, Darrel James Van Buer
  • Publication number: 20090265022
    Abstract: Multimedia playback technique embodiments are presented which facilitate the playback of an arbitrary media recording during a multi-party communication over a real-time multi-way communication system via a user's communication device. The recorded media can be interjected into a multi-party communication on a real time basis. This is generally accomplished by the media recording being inserted into a media stream being processed by the user's communication device as part of the multi-party communication. This can be done by either replacing a portion of the media stream with the media recording or mixing the media recording with a portion of the media stream. Once inserted, the media recording is transmitted as part of the media stream to a least one other party to the communication.
    Type: Application
    Filed: June 24, 2008
    Publication date: October 22, 2009
    Applicant: Microsoft Corporation
    Inventors: Darko Kirovski, Ydo Wexler, Christopher A. Meek
  • Publication number: 20090259472
    Abstract: Disclosed herein are systems, methods, and computer readable-media for answering a communication notification. The method for answering a communication notification comprises receiving a notification of communication from a user, converting information related to the notification to speech, outputting the information as speech to the user, and receiving from the user an instruction to accept or ignore the incoming communication associated with the notification. In one embodiment, information related to the notification comprises one or more of a telephone number, an area code, a geographic origin of the request, caller id, a voice message, address book information, a text message, an email, a subject line, an importance level, a photograph, a video clip, metadata, an IP address, or a domain name. Another embodiment involves notification assigned an importance level and repeat attempts at notification if it is of high importance.
    Type: Application
    Filed: April 14, 2008
    Publication date: October 15, 2009
    Applicant: AT& T Labs
    Inventor: Horst Schroeter
  • Publication number: 20090254345
    Abstract: Techniques for improved text-to-speech processing are disclosed. The improved text-to-speech processing can convert text from an electronic document into an audio output that includes speech associated with the text as well as audio contextual cues. One aspect provides audio contextual cues to the listener when outputting speech (spoken text) pertaining to a document. The audio contextual cues can be based on an analysis of a document prior to a text-to-speech conversion. Another aspect can produce an audio summary for a file. The audio summary for a document can thereafter be presented to a user so that the user can hear a summary of the document without having to process the document to produce its spoken text via text-to-speech conversion.
    Type: Application
    Filed: April 5, 2008
    Publication date: October 8, 2009
    Inventors: Christopher Brian Fleizach, Reginald Dean Hudson
  • Publication number: 20090248398
    Abstract: A system and method for instructing dynamic nodes in a dynamically changing mobile network how to maneuver. A receiver receives situation data indicative of a respective situation of each dynamic node in space and a situation unit coupled to the receiver determines the respective situation of each dynamic node. An analysis unit coupled to the situation unit analyzes the respective situation of each dynamic node in combination with specified criteria to generate respective situation awareness date for each dynamic node. A dynamic selector unit coupled to the analysis unit determines from the respective situation awareness data appropriate action to be performed by each node; and a communication unit coupled to the dynamic selector unit conveys to the respective dynamic node command data to permit rendering of a personalized command for informing the respective node of appropriate action to be carried out thereby.
    Type: Application
    Filed: September 13, 2006
    Publication date: October 1, 2009
    Applicant: Elta Systems Ltd
    Inventors: Moshe Aviran, Alexander Zussman
  • Publication number: 20090240567
    Abstract: The present invention is a system which enables a marketing team to initiate and sustain directed and interactive communication with thousands or millions of existing or prospective customers. In the preferred embodiment, a marketer accesses a database and selects a group of qualified prospects, using lifestyle dimensions and or demographic information, from a stable group of prospective or existing customers conducting financial transactions online. Once a prospect list is selected, the marketer designs a series of questions, typically using branch and skip logic, and the system deploys the question sequence to the target list in the form of a response-redeemable savings coupon. When prospects are next performing their financial transactions online, they are presented with a lifestyle-relevant coupon which is immediately redeemable by responding to the question/communication, therewith lowering the respondent's bill instantaneously upon response, as in the case of online bill payment.
    Type: Application
    Filed: February 23, 2009
    Publication date: September 24, 2009
    Applicant: Micronotes, LLC
    Inventors: Devon Kinkead, Merritt W. Mayher, Charla Jones, Venkat Rangamani
  • Publication number: 20090238386
    Abstract: A method for administering an audio message to a user of an earpiece can include receiving event information from a paired communication device, updating a personal event calendar by ordering event information to generate a first event list, generating a modified event list by grouping events in the first event list according to acceptance criteria based on event priority of event types, and generating an audio token for collective events in the modified event list for audible delivery to the ear canal. Events can be ordered by event name, event location, event data, event importance, event invitees, or event category. Other embodiments are disclosed.
    Type: Application
    Filed: December 23, 2008
    Publication date: September 24, 2009
    Applicant: Personics Holding, Inc
    Inventors: John Usher, Steven Goldstein
  • Publication number: 20090234652
    Abstract: The voice synthesis device includes: an emotion input unit (202) which obtains an utterance mode of a voice waveform for which voice synthesis is to be performed; a prosody generation unit (205) which generate a prosody which is used when a language-processed text is uttered in the obtained utterance mode; a characteristic tone selection unit (203) which selects a characteristic tone based on the utterance mode, the characteristic tone is observed when the text is uttered in the obtained utterance mode: a characteristic tone temporal position estimation unit (604) which (i) judges whether or not each of phonemes included in a phonologic sequence of the text is to be uttered with the characteristic tone, based on the phonologic sequence, the characteristic tone, and the prosody, and (ii) decide a phoneme which is an utterance position where the text is uttered with the characteristic tone: and an element selection unit (606) and an element connection unit (209) which generates the voice waveform based on the p
    Type: Application
    Filed: May 2, 2006
    Publication date: September 17, 2009
    Inventors: Yumiko Kato, Takahiro Kamai
  • Publication number: 20090228278
    Abstract: The application discloses a communication device and method of processing a text message in the communication device. An aspect of the present application is a method of processing text message in a communication device, the method including receiving a text message from an external sender, receiving a request to transform the text message into voice data, transforming the received text message into voice data according to the request, and transmitting the voice data to an external sound reproduction device through a wireless communication module.
    Type: Application
    Filed: March 9, 2009
    Publication date: September 10, 2009
    Inventors: Ji Young Huh, Sun Ryang Kim, Woong Chang Kim
  • Publication number: 20090222269
    Abstract: An apparatus for voice synthesis includes: a word database for storing words and voices; a syllable database for storing syllables and voices; a processor for executing a process including: extracting a word from a document, generating a voice signal based on the extracted voice when the extracted word is included in the word database synthesizing a voice signal based on the extracted voice associated with the one or more syllables corresponding to the extracted word when the extracted word is not found in the word database; a speaker for producing a voice based on either of the generated and the synthesized voice signal; and a display for selectively displaying the extracted word when the voice based on the synthesized voice signal is produced by the speaker.
    Type: Application
    Filed: May 11, 2009
    Publication date: September 3, 2009
    Inventor: Shinichiro MORI
  • Publication number: 20090216536
    Abstract: An image processing apparatus comprises an image data input portion that inputs image data and a text data input portion that inputs text data. The text data inputted by the text data input portion is converted into voice data by a voice data converter, and this obtained voice data and the image data inputted by the image data input portion are connected to each other by a connector, and then a file including the voice data and the image data connected to each other is created.
    Type: Application
    Filed: February 18, 2009
    Publication date: August 27, 2009
    Applicant: KONICA MINOLTA BUSINESS TECHNOLOGIES, INC.
    Inventors: Kenji Matsuhara, Hiroaki Kubo, Nobuhiro Mishima, Kazuo Inui
  • Publication number: 20090210221
    Abstract: A relay device 20 duplicates speech data received from a communication terminal that is engaged in voice communication with another communication terminal. The duplicated speech data is transmitted to and is stored at a media processing device 40. Media processing device 40 builds a database for speech synthesis based on the stored speech data.
    Type: Application
    Filed: February 19, 2009
    Publication date: August 20, 2009
    Inventors: Shin-ichi Isobe, Takuji Sakaguchi, Motoshi Tamura, Masami Yabusaki
  • Publication number: 20090209170
    Abstract: A toy has a body in the form of a doll or a living being, an audio output to reproduce audio effects, a memory to provide audio data, a controller to control the reproduction of different audio effects based on the reading of the audio data, and a sensor for generating detection signals correlated with the proximity of an object to the toy. The controller is designed in such a way that a selection of the reproduced audio effects depends on an evaluation of the detection signals and of signals generated by a provided interface device that is connected through devices in the play accessories being fed to that controller, and/or being made available for access.
    Type: Application
    Filed: February 6, 2009
    Publication date: August 20, 2009
    Inventor: Wolfgang RICHTER
  • Publication number: 20090204404
    Abstract: Apparatus and methods conforming to the present invention comprise a method of controlling playback of an audio signal through analysis of a corresponding close caption signal in conjunction with analysis of the corresponding audio signal. Objection text or other specified text in the close caption signal is identified through comparison with user identified objectionable text. Upon identification of the objectionable text, the audio signal is analyzed to identify the audio portion corresponding to the objectionable text. Upon identification of the audio portion, the audio signal may be controlled to mute the audible objectionable text.
    Type: Application
    Filed: April 20, 2009
    Publication date: August 13, 2009
    Applicant: ClearPlay Inc.
    Inventors: Matthew T. Jarman, William S. Meisel
  • Publication number: 20090204403
    Abstract: An apparatus includes receiving circuitry for receiving a signal; and a speech module for converting the signal into speech.
    Type: Application
    Filed: February 11, 2009
    Publication date: August 13, 2009
    Applicant: OMEGA ENGINEERING, INC.
    Inventors: Milton B. Hollander, Shahin Baghai, Feng Liu
  • Publication number: 20090204680
    Abstract: Email subscribers are notified of the receipt of new email messages when they are not at their computers via voice or page. An email notification server polls the email server corresponding to the subscriber's email account for the presence of new email messages. New email messages are obtained. Header information is extracted. If new email notification is by voicemail, the extracted header information is converted from text to voice. A voicemail message containing the extracted header information is saved on the voicemail system corresponding to the subscriber for whom the email message was intended. The email notification server can also send a page to notify the subscriber of the presence of new email.
    Type: Application
    Filed: April 15, 2009
    Publication date: August 13, 2009
    Applicant: AT&T Intellectual Property I, L.P.
    Inventor: Mark Kirkpatrick