Patents Examined by Eunice Ng
-
Patent number: 7280958
Abstract: The invention concerns a method (500) and system (100) for suppressing receiver audio regeneration. The method (500) includes the steps of receiving a communication signal (502), at a Radio Frequency (RF) unit (102), demodulating the communication signal to an audio signal (504), monitoring a volume level of the audio signal (506), and shifting the pitch of the audio signal when the volume level reaches a predetermined threshold (508), and playing the pitch-shifted audio signal out of a speaker to produce a pitch-shifted acoustic signal (510). The method can shift the pitch of the audio signal to produce a pitch-shifted acoustic signal with signal properties suppressing regeneration of the acoustic signal onto the audio signal at the RF unit. The amount of pitch-shifting can be a function of the volume level.
Type: Grant
Filed: September 30, 2005
Date of Patent: October 9, 2007
Assignee: Motorola, Inc.
Inventors: Peter M. Pavlov, Jason D. McIntosh, Graeme P. Johnson
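The threshold rule in this abstract (no shift below a volume threshold, then a shift that grows with volume) can be sketched as follows. This is a minimal illustration only: the function name, the decibel and cent units, and every numeric default are assumptions made for the example, not values taken from the patent, and the actual pitch-shifting signal processing is not shown.

```python
def pitch_shift_cents(volume_db: float,
                      threshold_db: float = -20.0,
                      base_cents: float = 10.0,
                      cents_per_db: float = 2.0,
                      max_cents: float = 40.0) -> float:
    """Return how far to shift the pitch (in cents) for a given volume level.

    All parameters are hypothetical tuning values chosen for illustration.
    """
    if volume_db < threshold_db:
        # Below the threshold the audio is left untouched.
        return 0.0
    # At or above the threshold the shift is a function of the volume level,
    # capped so the shift never exceeds max_cents.
    shift = base_cents + (volume_db - threshold_db) * cents_per_db
    return min(max_cents, shift)
```

For example, with these assumed defaults a level of -30 dB yields no shift, while any level far above the threshold is capped at 40 cents.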
-
Patent number: 7280966
Abstract: A method for responding to an electronic mail message with a limited input device such as a phone includes audibly rendering the question and a set of proposed answers typically provided in the electronic mail message by the sender of the electronic mail message. A language model indicative of the proposed answers is provided to a speech recognizer. The response from the user is obtained and converted to a textual response using the speech recognizer and language model. A second electronic e-mail message is then sent back to the sender. The second electronic mail message includes the textual response.
Type: Grant
Filed: May 11, 2006
Date of Patent: October 9, 2007
Assignee: Microsoft Corporation
Inventors: Yun-cheng Ju, Peter K. L. Mau
-
Patent number: 7269553
Abstract: Methods and systems for filtering synthesized or reconstructed speech are implemented. A filter based on a set of linear predictive coding (LPC) coefficients is constructed by transforming the LPC coefficients to the pseudo-cepstrum, a domain existing between LPC domain and the line spectral frequency (LSF) domain. The resulting filter can emphasize spectral frequencies associated with various formants, or spectral peaks, of an inverse transfer function relating to the LPC coefficients, and can de-emphasize spectral frequencies associated with various spectral minima, or spectral valleys, of the inverse transfer function relating to the LPC coefficients.
Type: Grant
Filed: October 14, 2003
Date of Patent: September 11, 2007
Assignee: AT&T Corp.
Inventors: Hong-Goo Kang, Kim Hong Kook
-
Patent number: 7243062
Abstract: A method (200) and apparatus (100) for segmenting a sequence of audio samples into homogeneous segments (550 and 555) are disclosed. The method (200) forms a sequence of frames (701 to 704) along the sequence of audio samples, and extracts, for each frame, a data feature. The data features form a sequence of data features. Transition points in the sequence of data features are then detected by applying the Bayesian Information Criterion to the sequence of data features. The transition points define the homogeneous segments (550 and 555). Preferably the data feature is single-dimensional and a leptokurtic distribution is used as an event model in the Bayesian Information Criterion.
Type: Grant
Filed: October 25, 2002
Date of Patent: July 10, 2007
Assignee: Canon Kabushiki Kaisha
Inventor: Timothy John Wark
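A rough sketch of BIC-based transition detection on a one-dimensional feature sequence is given below. It assumes the Laplace distribution as the leptokurtic event model, which is one possible choice and not necessarily the one used in the patent. The function names, the `min_len` guard, and the penalty of two extra parameters for the two-segment model are likewise assumptions for illustration.

```python
import math
from statistics import median


def laplace_log_likelihood(xs):
    """Maximised log-likelihood of xs under a Laplace distribution."""
    mu = median(xs)                                  # MLE of the location
    b = sum(abs(x - mu) for x in xs) / len(xs)       # MLE of the scale
    b = max(b, 1e-9)                                 # guard against a zero scale
    return -len(xs) * (math.log(2 * b) + 1)


def delta_bic(xs, split):
    """BIC gain of modelling xs as two segments split at `split` versus one."""
    two_segments = (laplace_log_likelihood(xs[:split])
                    + laplace_log_likelihood(xs[split:]))
    one_segment = laplace_log_likelihood(xs)
    extra_params = 2                                 # one more location and scale
    return two_segments - one_segment - 0.5 * extra_params * math.log(len(xs))


def best_transition(xs, min_len=3):
    """Return the split index with the largest positive BIC gain, or None."""
    candidates = range(min_len, len(xs) - min_len + 1)
    best = max(candidates, key=lambda s: delta_bic(xs, s), default=None)
    if best is None or delta_bic(xs, best) <= 0:
        return None
    return best
```

A positive gain means the two-segment model explains the features better even after the complexity penalty, so the split is treated as a transition point. Finding several transition points would require applying the search repeatedly, which is not shown here.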
-
Patent number: 7228273
Abstract: A voice control method that allows vocal characteristics of a character to diversely be set in a computer game where characters are capable of voice output is provided. The voice control method comprises, converting a voice that is externally input or provided in advance, based upon attribute information on the character; and an output step for outputting the converted voice as voice of the character. According to this method, the voice produced by a character that appears in a computer game can be set in accordance with the character's characteristics and various voices for each character set by each player can be created.
Type: Grant
Filed: November 12, 2002
Date of Patent: June 5, 2007
Assignee: Sega Corporation
Inventor: Yutaka Okunoki
-
Patent number: 7203639
Abstract: Acoustic signals are analyzed by two-dimensional (2-D) processing of the one-dimensional (1-D) speech signal in the time-frequency plane. The short-space 2-D Fourier transform of a frequency-related representation (e.g., spectrogram) of the signal is obtained. The 2-D transformation maps harmonically-related signal components to a concentrated entity in the new 2-D plane (compressed frequency-related representation). The series of operations to produce the compressed frequency-related representation is referred to as the “grating compression transform” (GCT), consistent with sine-wave grating patterns in the frequency-related representation reduced to smeared impulses. The GCT provides for speech pitch estimation. The operations may, for example, determine pitch estimates of voiced speech or provide noise filtering or speaker separation in a multiple speaker acoustic signal.
Type: Grant
Filed: September 13, 2002
Date of Patent: April 10, 2007
Assignee: Massachusetts Institute of Technology
Inventor: Thomas F. Quatieri, Jr.
-
Patent number: 7203643
Abstract: A system and method for transmitting speech activity in a distributed voice recognition system. The distributed voice recognition system includes a local VR engine in a subscriber unit and a server VR engine on a server. The local VR engine comprises an advanced feature extraction (AFE) module that extracts features from a speech signal, and a voice activity detection (VAD) module that detects voice activity within a speech signal. The combined results from the VAD module and feature extraction module are provided in an efficient manner to a remote device, such as a server, in the form of advanced front end features, thereby enabling the server to process speech segments free of silence regions. Various aspects of efficient speech segment transmission are disclosed.
Type: Grant
Filed: May 28, 2002
Date of Patent: April 10, 2007
Assignee: Qualcomm Incorporated
Inventor: Harinath Garudadri
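The idea of combining a voice activity decision with per-frame features, so that only non-silent frames reach the server, might look roughly like this. The energy-based detector, the threshold value, and the use of mean energy as a stand-in "feature" are all assumptions for the sake of a runnable example; the abstract does not say how the VAD or AFE modules work internally.

```python
def frame_energy(frame):
    """Mean squared amplitude of one frame of samples."""
    return sum(s * s for s in frame) / len(frame)


def features_for_voiced_frames(frames, energy_threshold=0.01):
    """Return (frame_index, feature) pairs for frames judged to contain voice.

    The energy threshold is a hypothetical stand-in for a real VAD decision,
    and the feature is simply the frame energy.
    """
    selected = []
    for index, frame in enumerate(frames):
        energy = frame_energy(frame)
        if energy > energy_threshold:      # VAD decision: keep voiced frames only
            selected.append((index, energy))
    return selected
```

Only the returned pairs would be transmitted, so the receiving side never sees the silent frames.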
-
Patent number: 7200558
Abstract: A prosody generation apparatus capable of suppressing distortion that occurs when generating prosodic patterns and therefore generating a natural prosody is provided. A prosody changing point extraction unit in this apparatus extracts a prosody changing point located at the beginning and the ending of a sentence, the beginning and the ending of a breath group, an accent position and the like. A selection rule and a transformation rule of a prosodic pattern including the prosody changing point is generated by means of a statistical or learning technique and the thus generated rules are stored in a representative prosodic pattern selection rule table and a transformation rule table beforehand. A pattern selection unit selects a representative prosodic pattern from the representative prosodic pattern selection rule table according to the selection rule.
Type: Grant
Filed: March 8, 2002
Date of Patent: April 3, 2007
Assignee: Matsushita Electric Industrial Co., Ltd.
Inventors: Yumiko Kato, Takahiro Kamai
-
Patent number: 7197460
Abstract: A voice-enabled help desk service is disclosed. The service comprises an automatic speech recognition module for recognizing speech from a user, a spoken language understanding module for understanding the output from the automatic speech recognition module, a dialog management module for generating a response to speech from the user, a natural voices text-to-speech synthesis module for synthesizing speech to generate the response to the user, and a frequently asked questions module. The frequently asked questions module handles frequently asked questions from the user by changing voices and providing predetermined prompts to answer the frequently asked question.
Type: Grant
Filed: December 19, 2002
Date of Patent: March 27, 2007
Assignee: AT&T Corp.
Inventors: Narendra K. Gupta, Mazin G Rahim, Giuseppe Riccardi
-
Patent number: 7191129
Abstract: A system and method for mining data from stored telephone conversations is provided. Users request advanced data processing on the recorded data, either on the live data stream or the data in storage. Processes search the recorded data for keywords and phrases that the user provides the PTR. User can also request more sophisticated analysis of the recorded data for deeper contextual meaning of the conversations. Context information may include identifying the users, the locations and times referred to by the users during the conference, etc. Additional searches related to the obtained information are performed and the extracted information is compared to similar information obtained from previous meetings. Voice inflections and any emotional stress present in the voices of the users can also be detected and added to the collected information. Search terms can also be highlighted in the results.
Type: Grant
Filed: October 23, 2002
Date of Patent: March 13, 2007
Assignee: International Business Machines Corporation
Inventors: Michael Wayne Brown, Joseph Herbert McIntyre, Victor S. Moore, Michael A. Paolini, Scott Lee Winters
-
Patent number: 7191117
Abstract: A method for generating subtitles for audiovisual material receives and analyses a text file containing dialogue spoken in audiovisual material and provides a signal representative of the text. The text information and audio signal are aligned in time using time alignment speech recognition and the text and timing information are then output to a subtitle file. Colors can be assigned to different speakers or groups of speakers. Subtitles are derived by receiving and analyzing a text file containing dialogue spoken by considering each word in turn and the next information signal, assigning a score to each subtitle in a plurality of different possible subtitle formatting options which lead to that word. The steps are then repeated until all the words in the text information signal have been used and the subtitle formatting option which gives the best overall score is then derived.
Type: Grant
Filed: June 11, 2001
Date of Patent: March 13, 2007
Assignee: British Broadcasting Corporation
Inventors: David Graham Kirby, Christopher Edward Poole, Adam Wiewiorka, William Oscar Lahr
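The word-by-word scoring of formatting options described here resembles a dynamic program over break positions. The sketch below is one such program under stated assumptions: each subtitle is a single line limited to `max_chars` characters, and the score is a penalty equal to the squared unused space on each line, so the best overall score is the lowest total. Neither the scoring rule nor the function name comes from the patent.

```python
def best_subtitle_breaks(words, max_chars=20):
    """Split words into subtitle lines with the lowest total penalty."""
    n = len(words)
    inf = float("inf")
    best = [inf] * (n + 1)    # best[i]: lowest penalty covering the first i words
    prev = [0] * (n + 1)      # prev[i]: start index of the last line in that option
    best[0] = 0.0
    for end in range(1, n + 1):
        # Consider every formatting option whose last line ends at this word.
        for start in range(end - 1, -1, -1):
            length = len(" ".join(words[start:end]))
            if length > max_chars:
                break         # longer lines ending here cannot fit either
            cost = best[start] + (max_chars - length) ** 2
            if cost < best[end]:
                best[end] = cost
                prev[end] = start
    if best[n] == inf:
        raise ValueError("a word is longer than max_chars")
    # Walk back through the recorded choices to recover the lines.
    lines = []
    end = n
    while end > 0:
        start = prev[end]
        lines.append(" ".join(words[start:end]))
        end = start
    return lines[::-1]
```

A production scorer would presumably also weigh timing, speaker changes, and punctuation; those are left out to keep the example short.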
-
Patent number: 7184956
Abstract: The invention relates to a method and a transcription system (T) for transcribing dictations, in which a dictation file (5) is converted into a text file (8), and subsequently the text file (8) is compared with the dictation file (5). To increase the speed for the subsequent correction, provision is made that during transcription of the dictation file (5) a confidence value is generated for a transcribed text passage of the text file (8), and a comparison of the text file (8) with the dictation file (5) takes place only in respect of those text passages for which the confidence value of the text passage is below a confidence limit, i.e. a text passage recognized as possibly defective is present.
Type: Grant
Filed: October 28, 2002
Date of Patent: February 27, 2007
Assignee: Koninklijke Philips Electronics N.V.
Inventor: Kwaku Frimpong-Ansah
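Selecting only the low-confidence passages for comparison against the dictation can be illustrated with a short filter. The `Passage` type, its field names, and the default limit of 0.8 are hypothetical and exist only to make the example self-contained.

```python
from dataclasses import dataclass


@dataclass
class Passage:
    text: str
    confidence: float   # hypothetical recogniser confidence in [0, 1]


def passages_to_review(passages, confidence_limit=0.8):
    """Return only the passages whose confidence is below the limit."""
    return [p for p in passages if p.confidence < confidence_limit]
```

Passages at or above the limit are skipped, which is where the claimed speed-up in correction would come from.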
-
Patent number: 7181388
Abstract: The invention relates to pre-processing of a pronunciation dictionary for compression in a data processing device, the pronunciation dictionary comprising at least one entry, the entry comprising a sequence of character units and a sequence of phoneme units. According to one aspect of the invention the sequence of character units and the sequence of phoneme units are aligned using a statistical algorithm. The aligned sequence of character units and aligned sequence of phoneme units are interleaved by inserting each phoneme unit at a predetermined location relative to the corresponding character unit.
Type: Grant
Filed: November 11, 2002
Date of Patent: February 20, 2007
Assignee: Nokia Corporation
Inventor: Jilei Tian
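The interleaving step can be sketched as below, assuming the statistical alignment has already produced two sequences of equal length and that the "predetermined location" is immediately after the corresponding character. Both assumptions, and the function name, are illustrative only.

```python
def interleave(chars, phonemes):
    """Place each phoneme unit directly after its aligned character unit."""
    if len(chars) != len(phonemes):
        raise ValueError("sequences must be aligned to the same length")
    interleaved = []
    for char, phoneme in zip(chars, phonemes):
        interleaved.append(char)
        interleaved.append(phoneme)
    return interleaved
```

For the entry "cat" aligned with the phonemes k, ae, t, this yields the single sequence c, k, a, ae, t, t, which a later compression stage could then process.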
-
Patent number: 7181386
Abstract: A context-free grammar can be represented by a weighted finite-state transducer. This representation can be used to efficiently compile that grammar into a weighted finite-state automaton that accepts the strings allowed by the grammar with the corresponding weights. The rules of a context-free grammar are input. A finite-state automaton is generated from the input rules. Strongly connected components of the finite-state automaton are identified. An automaton is generated for each strongly connected component. A topology that defines a number of states, and that uses active ones of the non-terminal symbols of the context-free grammar as the labels between those states, is defined. The topology is expanded by replacing a transition, and its beginning and end states, with the automaton that includes, as a state, the symbol used as the label on that transition. The topology can be fully expanded or dynamically expanded as required to recognize a particular input string.
Type: Grant
Filed: July 18, 2002
Date of Patent: February 20, 2007
Assignee: AT&T Corp.
Inventors: Mehryar Mohri, Mark-Jan Nederhof
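Of the steps listed, identifying strongly connected components is a standard graph operation. The sketch below uses Tarjan's algorithm on a plain adjacency mapping; it illustrates that one step only and is not presented as the patent's own procedure. The graph encoding (a dict from state name to successor names) is an assumption.

```python
def strongly_connected_components(graph):
    """Return the strongly connected components of a directed graph.

    `graph` maps each node to an iterable of its successors.
    """
    index = {}          # discovery order of each node
    lowlink = {}        # lowest discovery index reachable from each node
    stack = []
    on_stack = set()
    components = []
    counter = 0

    def visit(v):
        nonlocal counter
        index[v] = lowlink[v] = counter
        counter += 1
        stack.append(v)
        on_stack.add(v)
        for w in graph.get(v, ()):
            if w not in index:
                visit(w)
                lowlink[v] = min(lowlink[v], lowlink[w])
            elif w in on_stack:
                lowlink[v] = min(lowlink[v], index[w])
        if lowlink[v] == index[v]:
            # v is the root of a component: pop its members off the stack.
            component = set()
            while True:
                w = stack.pop()
                on_stack.discard(w)
                component.add(w)
                if w == v:
                    break
            components.append(component)

    for node in graph:
        if node not in index:
            visit(node)
    return components
```

The recursion is fine for small graphs; a large automaton would call for an iterative version to avoid Python's recursion limit.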
-
Patent number: 7177815
Abstract: A method of presenting a multi-modal help dialog move to a user in a multi-modal dialog system is disclosed. The method comprises presenting an audio portion of the multi-modal help dialog move that explains available ways of user inquiry and presenting a corresponding graphical action performed on a user interface associated with the audio portion. The multi-modal help dialog move is context-sensitive and uses current display information and dialog contextual information to present a multi-modal help move that is currently related to the user. A user request or a problematic dialog detection module may trigger the multi-modal help move.
Type: Grant
Filed: December 19, 2002
Date of Patent: February 13, 2007
Assignee: AT&T Corp.
Inventors: Patrick Ehlen, Helen Hastie, Michael Johnston
-
Patent number: 7177811
Abstract: A method is provided for customizing a multi-media message created by a sender for a recipient, in which the multi-media message includes an animated entity audibly presenting speech converted from text by the sender. At least one image is received from the sender. Each of the at least one image is associated with a tag. The sender is presented with options to insert the tag associated with one of the at least one image into the sender text.
Type: Grant
Filed: March 6, 2006
Date of Patent: February 13, 2007
Assignee: AT&T Corp.
Inventors: Joern Ostermann, Barbara Buda, Mehmet Reha Civanlar, Eric Cosatto, Hans Peter Graf, Thomas M. Isaacson, Yann Andre LeCun
-
Patent number: 7177816
Abstract: A method of presenting a multi-modal help dialog move to a user in a multi-modal dialog system is disclosed. The method comprises presenting an audio portion of the multi-modal help dialog move that explains available ways of user inquiry and presenting a corresponding graphical action performed on a user interface associated with the audio portion. The multi-modal help dialog move is context-sensitive and uses current display information and dialog contextual information to present a multi-modal help move that is currently related to the user. A user request or a problematic dialog detection module may trigger the multi-modal help move.
Type: Grant
Filed: December 19, 2002
Date of Patent: February 13, 2007
Assignee: AT&T Corp.
Inventors: Patrick Ehlen, Helen Hastie, Michael Johnston
-
Patent number: 7171351
Abstract: A method, computer readable medium and system are provided which retrieve hint sentences from a sentence database in response to a query. An input component receives the query having terms. A search engine expands the query by including synonyms of the terms to obtain expanded terms. The search engine then combines the expanded terms to form dependency triples from the expanded terms. From the formed dependency triples, dependency triples which are not found in a dependency triples database are discarded to obtain remaining dependency triples from the expanded terms. The search engine then searches the sentence database using the remaining dependency triples as search parameters.
Type: Grant
Filed: September 19, 2002
Date of Patent: January 30, 2007
Assignee: Microsoft Corporation
Inventor: Ming Zhou
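The expand, combine, and discard steps can be sketched in a few lines. The example assumes a dependency triple is a (head, relation, dependent) tuple, that synonyms come from a simple dictionary, and that the dependency triples database is an in-memory set; all three are simplifications chosen for illustration.

```python
from itertools import product


def expand(term, synonyms):
    """Return the term followed by any synonyms listed for it."""
    return [term] + synonyms.get(term, [])


def remaining_triples(head, relation, dependent, synonyms, known_triples):
    """Form triples from the expanded terms and keep only the known ones."""
    candidates = product(expand(head, synonyms),
                         [relation],
                         expand(dependent, synonyms))
    # Discard any triple that does not occur in the triples database.
    return [triple for triple in candidates if triple in known_triples]
```

The surviving triples would then serve as the search parameters against the sentence database, a step not shown here.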
-
Patent number: 7167832
Abstract: A spoken dialog system and method having a dialog management module are disclosed. The dialog management module includes a plurality of dialog motivators for handling various operations during a spoken dialog. The dialog motivators comprise an error-handling, disambiguation, assumption, confirmation, missing information, and continuation. The spoken dialog system uses the assumption dialog motivator in either a-priori or a-posteriori modes. A-priori assumption is based on predefined requirements for the call flow and a-posteriori assumption can work with the confirmation dialog motivator to assume the content of received user input and confirm received user input.
Type: Grant
Filed: October 11, 2002
Date of Patent: January 23, 2007
Assignee: AT&T Corp.
Inventors: Alicia Abella, Allen Louis Gorin
-
Patent number: 7165024
Abstract: A method automatically determines groups of words or phrases that are descriptive names of a small set of documents, as well as infers concepts in the small set of documents that are more general and more specific than the descriptive names, without any prior knowledge of the hierarchy or the concepts, in a language independent manner. The descriptive names and the concepts may not even be explicitly contained in the documents. The primary application of the invention is for searching of the World Wide Web, but the invention is not limited solely to use with the World Wide Web and may be applied to any set of documents. Classes of features are identified in order to promote understanding of a set of documents. Preferably, there are three classes of features. “Self” features or terms describe the cluster as a whole. “Parent” features or terms describe more general concepts. “Child” features or terms describe specializations of the cluster.
Type: Grant
Filed: July 31, 2002
Date of Patent: January 16, 2007
Assignee: NEC Laboratories America, Inc.
Inventors: Eric J. Glover, Stephen R. Lawrence, David M. Pennock