Patents Examined by Eunice Ng
  • Patent number: 7280958
    Abstract: The invention concerns a method (500) and system (100) for suppressing receiver audio regeneration. The method (500) includes the steps of receiving a communication signal (502), at a Radio Frequency (RF) unit (102), demodulating the communication signal to an audio signal (504), monitoring a volume level of the audio signal (506), and shifting the pitch of the audio signal when the volume level reaches a predetermined threshold (508), and playing the pitch-shifted audio signal out of a speaker to produce a pitch-shifted acoustic signal (510). The method can shift the pitch of the audio signal to produce a pitch-shifted acoustic signal with signal properties suppressing regeneration of the acoustic signal onto the audio signal at the RF unit. The amount of pitch-shifting can be a function of the volume level.
    Type: Grant
    Filed: September 30, 2005
    Date of Patent: October 9, 2007
    Assignee: Motorola, Inc.
    Inventors: Peter M. Pavlov, Jason D. McIntosh, Graeme P. Johnson
  • Patent number: 7280966
    Abstract: A method for responding to an electronic mail message with a limited input device such as a phone includes audibly rendering the question and a set of proposed answers typically provided in the electronic mail message by the sender of the electronic mail message. A language model indicative of the proposed answers is provided to a speech recognizer. The response from the user is obtained and converted to a textual response using the speech recognizer and language model. A second electronic e-mail message is then sent back to the sender. The second electronic mail message includes the textual response.
    Type: Grant
    Filed: May 11, 2006
    Date of Patent: October 9, 2007
    Assignee: Microsoft Corporation
    Inventors: Yun-cheng Ju, Peter K. L. Mau
  • Patent number: 7269553
    Abstract: Methods and systems for filtering synthesized or reconstructed speech are implemented. A filter based on a set of linear predictive coding (LPC) coefficients is constructed by transforming the LPC coefficients to the pseudo-cepstrum, a domain existing between LPC domain and the line spectral frequency (LSF) domain. The resulting filter can emphasize spectral frequencies associated with various formants, or spectral peaks, of an inverse transfer function relating to the LPC coefficients, and can de-emphasize spectral frequencies associated with various spectral minima, or spectral valleys, of the inverse transfer function relating to the LPC coefficients.
    Type: Grant
    Filed: October 14, 2003
    Date of Patent: September 11, 2007
    Assignee: AT&T Corp.
    Inventors: Hong-Goo Kang, Kim Hong Kook
  • Patent number: 7243062
    Abstract: A method (200) and apparatus (100) for segmenting a sequence of audio samples into homogeneous segments (550 and 555) are disclosed. The method (200) forms a sequence of frames (701 to 704) along the sequence of audio samples, and extracts, for each frame, a data feature. The data features form a sequence of data features. Transition points in the sequence of data features are thin detected by applying the Bayesian Information Criterion to the sequence of data features. The transition points define the homogeneous segments (550 and 555). Preferably the data feature is single-dimensional and a leptokurtic distribution is used as an event model in the Bayesian Information Criterion.
    Type: Grant
    Filed: October 25, 2002
    Date of Patent: July 10, 2007
    Assignee: Canon Kabushiki Kaisha
    Inventor: Timothy John Wark
  • Patent number: 7228273
    Abstract: A voice control method that allows vocal characteristics of a character to diversely be set in a computer game where characters are capable of voice output is provided. The voice control method comprises, converting a voice that is externally input or provided in advance, based upon attribute information on the character; and an output step for outputting the converted voice as voice of the character. According to this method, the voice produced by a character that appears in a computer game can be set in accordance with the character's characteristics and various voices for each character set by each player can be created.
    Type: Grant
    Filed: November 12, 2002
    Date of Patent: June 5, 2007
    Assignee: Sega Corporation
    Inventor: Yutaka Okunoki
  • Patent number: 7203639
    Abstract: Acoustic signals are analyzed by two-dimensional (2-D) processing of the one-dimensional (1-D) speech signal in the time-frequency plane. The short-space 2-D Fourier transform of a frequency-related representation (e.g., spectrogram) of the signal is obtained. The 2-D transformation maps harmonically-related signal components to a concentrated entity in the new 2-D plane (compressed frequency-related representation). The series of operations to produce the compressed frequency-related representation is referred to as the “grating compression transform” (GCT), consistent with sine-wave grating patterns in the frequency-related representation reduced to smeared impulses. The GCT provides for speech pitch estimation. The operations may, for example, determine pitch estimates of voiced speech or provide noise filtering or speaker separation in a multiple speaker acoustic signal.
    Type: Grant
    Filed: September 13, 2002
    Date of Patent: April 10, 2007
    Assignee: Massachusetts Institute of Technology
    Inventor: Thomas F. Quatieri, Jr.
  • Patent number: 7203643
    Abstract: A system and method for transmitting speech activity in a distributed voice recognition system. The distributed voice recognition system includes a local VR engine in a subscriber unit and a server VR engine on a server. The local VR engine comprises an advanced feature extraction (AFE) module that extracts features from a speech signal, and a voice activity detection (VAD) module that detects voice activity within a speech signal. The combined results from the VAD module and feature extraction module are provided in an efficient manner to a remote device, such as a server, in the form of advanced front end features, thereby enabling the server to process speech segments free of silence regions. Various aspects of efficient speech segment transmission are disclosed.
    Type: Grant
    Filed: May 28, 2002
    Date of Patent: April 10, 2007
    Assignee: Qualcomm Incorporated
    Inventor: Harinath Garudadri
  • Patent number: 7200558
    Abstract: A prosody generation apparatus capable of suppressing distortion that occurs when generating prosodic patterns and therefore generating a natural prosody is provided. A prosody changing point extraction unit in this apparatus extracts a prosody changing point located at the beginning and the ending of a sentence, the beginning and the ending of a breath group, an accent position and the like. A selection rule and a transformation rule of a prosodic pattern including the prosody changing point is generated by means of a statistical or learning technique and the thus generate rules are stored in a representative prosodic pattern selection rule table and a transformation rule table beforehand. A pattern selection unit selects a representative prosodic pattern from the representative prosodic pattern selection rule table according to the selection rule.
    Type: Grant
    Filed: March 8, 2002
    Date of Patent: April 3, 2007
    Assignee: Matsushita Electric Industrial Co., Ltd.
    Inventors: Yumiko Kato, Takahiro Kamai
  • Patent number: 7197460
    Abstract: A voice-enabled help desk service is disclosed. The service comprises an automatic speech recognition module for recognizing speech from a user, a spoken language understanding module for understanding the output from the automatic speech recognition module, a dialog management module for generating a response to speech from the user, a natural voices text-to-speech synthesis module for synthesizing speech to generate the response to the user, and a frequently asked questions module. The frequently asked questions module handles frequently asked questions from the user by changing voices and providing predetermined prompts to answer the frequently asked question.
    Type: Grant
    Filed: December 19, 2002
    Date of Patent: March 27, 2007
    Assignee: AT&T Corp.
    Inventors: Narendra K. Gupta, Mazin G Rahim, Giuseppe Riccardi
  • Patent number: 7191129
    Abstract: A system and method for mining data from stored telephone conversations is provided. Users request advanced data processing on the recorded data, either on the live data stream or the data in storage. Processes search the recorded data for keywords and phrases that the user provides the PTR. User can also request more sophisticated analysis of the recorded data for deeper contextual meaning of the conversations. Context information may include identifying the users, the locations and times referred to by the users during the conference, etc. Additional searches related to the obtained information are performed and the extracted information is compared to similar information obtained from previous meetings. Voice inflections and any emotional stress present in the voices of the users can also be detected and added to the collected information. Search terms can also be highlighted in the results.
    Type: Grant
    Filed: October 23, 2002
    Date of Patent: March 13, 2007
    Assignee: International Business Machines Corporation
    Inventors: Michael Wayne Brown, Joseph Herbert McIntyre, Victor S. Moore, Michael A. Paolini, Scott Lee Winters
  • Patent number: 7191117
    Abstract: A method for generating subtitles for audiovisual material received and analyses a text file containing dialogue spoken in audiovisual material and provides a signal representative of the text. The text information and audio signal are aligned in time using time alignment speech recognition and the text and timing information are then output to a subtitle file. Colors can be assigned to different speakers or groups of speakers. Subtitles are derived by receiving and analyzing a text file containing dialogue spoken by considering each word in turn and the next information signal, assigning a score to each subtitle in a plurality of different possible subtitle formatting options which lead to that word. The steps are then repeated until all the words in the text information signal have been used and the subtitle formatting option which gives the best overall score is then derived.
    Type: Grant
    Filed: June 11, 2001
    Date of Patent: March 13, 2007
    Assignee: British Broadcasting Corporation
    Inventors: David Graham Kirby, Christopher Edward Poole, Adam Wiewiorka, William Oscar Lahr
  • Patent number: 7184956
    Abstract: The invention relates to a method and a transcription system (T) for transcribing dictations, in which a dictation file (5) is converted into a text file (8), and subsequently the text file (8) is compared with the dictation file (5). To increase the speed for the subsequent correction, provision is made that during transcription of the dictation file (5) a confidence value is generated for a transcribed text passage of the text file (8), and a comparison of the text file (8) with the dictation file (5) takes place only in respect of those text passages for which the confidence value of the text passage is below a confidence limit, i.e. a text passage recognized as possibly defective is present.
    Type: Grant
    Filed: October 28, 2002
    Date of Patent: February 27, 2007
    Assignee: Koninklijke Philips Electronics N.V.
    Inventor: Kwaku Frimpong-Ansah
  • Patent number: 7181388
    Abstract: The invention relates to pre-processing of a pronunciation dictionary for compression in a data processing device, the pronunciation dictionary comprising at least one entry, the entry comprising a sequence of character units and a sequence of phoneme units. According to one aspect of the invention the sequence of character units and the sequence of phoneme units are aligned using a statistical algorithm. The aligned sequence of character units and aligned sequence of phoneme units are interleaved by inserting each phoneme unit at a predetermined location relative to the corresponding character unit.
    Type: Grant
    Filed: November 11, 2002
    Date of Patent: February 20, 2007
    Assignee: Nokia Corporation
    Inventor: Jilei Tian
  • Patent number: 7181386
    Abstract: A context-free grammar can be represented by a weighted finite-state transducer. This representation can be used to efficiently compile that grammar into a weighted finite-state automaton that accepts the strings allowed by the grammar with the corresponding weights. The rules of a context-free grammar are input. A finite-state automaton is generated from the input rules. Strongly connected components of the finite-state automaton are identified. An automaton is generated for each strongly connected component. A topology that defines a number of states, and that uses active ones of the non-terminal symbols of the context-free grammar as the labels between those states, is defined. The topology is expanded by replacing a transition, and its beginning and end states, with the automaton that includes, as a state, the symbol used as the label on that transition. The topology can be fully expanded or dynamically expanded as required to recognize a particular input string.
    Type: Grant
    Filed: July 18, 2002
    Date of Patent: February 20, 2007
    Assignee: AT&T Corp.
    Inventors: Mehryar Mohri, Mark-Jan Nederhof
  • Patent number: 7177815
    Abstract: A method of presenting a multi-modal help dialog move to a user in a multi-modal dialog system is disclosed. The method comprises presenting an audio portion of the multi-modal help dialog move that explains available ways of user inquiry and presenting a corresponding graphical action performed on a user interface associated with the audio portion. The multi-modal help dialog move is context-sensitive and uses current display information and dialog contextual information to present a multi-modal help move that is currently related to the user. A user request or a problematic dialog detection module may trigger the multi-modal help move.
    Type: Grant
    Filed: December 19, 2002
    Date of Patent: February 13, 2007
    Assignee: AT&T Corp.
    Inventors: Patrick Ehlen, Helen Hastie, Michael Johnston
  • Patent number: 7177811
    Abstract: A method is provided for customizing a multi-media message created by a sender for a recipient, in which the multi-media message includes an animated entity audibly presenting speech converted from text by the sender. At least one image is received from the sender. Each of the at least one image is associated with a tag. The sender is presented with options to insert the tag associated with one of the at least one image into the sender text.
    Type: Grant
    Filed: March 6, 2006
    Date of Patent: February 13, 2007
    Assignee: AT&T Corp.
    Inventors: Joern Ostermann, Barbara Buda, Mehmet Reha Civanlar, Eric Cosatto, Hans Peter Graf, Thomas M. Isaacson, Yann Andre LeCun
  • Patent number: 7177816
    Abstract: A method of presenting a multi-modal help dialog move to a user in a multi-modal dialog system is disclosed. The method comprises presenting an audio portion of the multi-modal help dialog move that explains available ways of user inquiry and presenting a corresponding graphical action performed on a user interface associated with the audio portion. The multi-modal help dialog move is context-sensitive and uses current display information and dialog contextual information to present a multi-modal help move that is currently related to the user. A user request or a problematic dialog detection module may trigger the multi-modal help move.
    Type: Grant
    Filed: December 19, 2002
    Date of Patent: February 13, 2007
    Assignee: AT&T Corp.
    Inventors: Patrick Ehlen, Helen Hastie, Michael Johnston
  • Patent number: 7171351
    Abstract: A method, computer readable medium and system are provided which retrieve hint sentences from a sentence database in response to a query. An input component receives the query having terms. A search engine expands the query by including synonyms of the terms to obtain expanded terms. The search engine then combines the expanded terms to form dependency triples from the expanded terms. From the formed dependency triples, dependency triples which are not found in a dependency triples database are discarded to obtain remaining dependency triples from the expanded terms. The search engine then searches the sentence database using the remaining dependency triples as search parameters.
    Type: Grant
    Filed: September 19, 2002
    Date of Patent: January 30, 2007
    Assignee: Microsoft Corporation
    Inventor: Ming Zhou
  • Patent number: 7167832
    Abstract: A spoken dialog system and method having a dialog management module are disclosed. The dialog management module includes a plurality of dialog motivators for handling various operations during a spoken dialog. The dialog motivators comprise an error-handling, disambiguation, assumption, confirmation, missing information, and continuation. The spoken dialog system uses the assumption dialog motivator in either a-priori or a-posteriori modes. A-priori assumption is based on predefined requirements for the call flow and a-posteriori assumption can work with the confirmation dialog motivator to assume the content of received user input and confirm received user input.
    Type: Grant
    Filed: October 11, 2002
    Date of Patent: January 23, 2007
    Assignee: AT&T Corp.
    Inventors: Alicia Abella, Allen Louis Gorin
  • Patent number: 7165024
    Abstract: A method automatically determines groups of words or phrases that are descriptive names of a small set of documents, as well as infers concepts in the small set of documents that are more general and more specific than the descriptive names, without any prior knowledge of the hierarchy or the concepts, in a language independent manner. The descriptive names and the concepts may not even be explicitly contained in the documents. The primary application of the invention is for searching of the World Wide Web, but the invention is not limited solely to use with the World Wide Web and may be applied to any set of documents. Classes of features are identified in order to promote understanding of a set of documents. Preferably, there are three classes of features. “Self” features or terms describe the cluster as a whole. “Parent” features or terms describe more general concepts. “Child” features or terms describe specializations of the cluster.
    Type: Grant
    Filed: July 31, 2002
    Date of Patent: January 16, 2007
    Assignee: NEC Laboratories America, Inc.
    Inventors: Eric J. Glover, Stephen R. Lawrence, David M. Pennock