Patents Examined by Eunice Ng

Method and system for suppressing receiver audio regeneration

Patent number: 7280958

Abstract: The invention concerns a method (500) and system (100) for suppressing receiver audio regeneration. The method (500) includes the steps of receiving a communication signal (502), at a Radio Frequency (RF) unit (102), demodulating the communication signal to an audio signal (504), monitoring a volume level of the audio signal (506), and shifting the pitch of the audio signal when the volume level reaches a predetermined threshold (508), and playing the pitch-shifted audio signal out of a speaker to produce a pitch-shifted acoustic signal (510). The method can shift the pitch of the audio signal to produce a pitch-shifted acoustic signal with signal properties suppressing regeneration of the acoustic signal onto the audio signal at the RF unit. The amount of pitch-shifting can be a function of the volume level.

Type: Grant

Filed: September 30, 2005

Date of Patent: October 9, 2007

Assignee: Motorola, Inc.

Inventors: Peter M. Pavlov, Jason D. McIntosh, Graeme P. Johnson
Electronic mail replies with speech recognition

Patent number: 7280966

Abstract: A method for responding to an electronic mail message with a limited input device such as a phone includes audibly rendering the question and a set of proposed answers typically provided in the electronic mail message by the sender of the electronic mail message. A language model indicative of the proposed answers is provided to a speech recognizer. The response from the user is obtained and converted to a textual response using the speech recognizer and language model. A second electronic e-mail message is then sent back to the sender. The second electronic mail message includes the textual response.

Type: Grant

Filed: May 11, 2006

Date of Patent: October 9, 2007

Assignee: Microsoft Corporation

Inventors: Yun-cheng Ju, Peter K. L. Mau
Pseudo-cepstral adaptive short-term post-filters for speech coders

Patent number: 7269553

Abstract: Methods and systems for filtering synthesized or reconstructed speech are implemented. A filter based on a set of linear predictive coding (LPC) coefficients is constructed by transforming the LPC coefficients to the pseudo-cepstrum, a domain existing between LPC domain and the line spectral frequency (LSF) domain. The resulting filter can emphasize spectral frequencies associated with various formants, or spectral peaks, of an inverse transfer function relating to the LPC coefficients, and can de-emphasize spectral frequencies associated with various spectral minima, or spectral valleys, of the inverse transfer function relating to the LPC coefficients.

Type: Grant

Filed: October 14, 2003

Date of Patent: September 11, 2007

Assignee: AT&T Corp.

Inventors: Hong-Goo Kang, Kim Hong Kook
Audio segmentation with energy-weighted bandwidth bias

Patent number: 7243062

Abstract: A method (200) and apparatus (100) for segmenting a sequence of audio samples into homogeneous segments (550 and 555) are disclosed. The method (200) forms a sequence of frames (701 to 704) along the sequence of audio samples, and extracts, for each frame, a data feature. The data features form a sequence of data features. Transition points in the sequence of data features are thin detected by applying the Bayesian Information Criterion to the sequence of data features. The transition points define the homogeneous segments (550 and 555). Preferably the data feature is single-dimensional and a leptokurtic distribution is used as an event model in the Bayesian Information Criterion.

Type: Grant

Filed: October 25, 2002

Date of Patent: July 10, 2007

Assignee: Canon Kabushiki Kaisha

Inventor: Timothy John Wark
Voice control method

Patent number: 7228273

Abstract: A voice control method that allows vocal characteristics of a character to diversely be set in a computer game where characters are capable of voice output is provided. The voice control method comprises, converting a voice that is externally input or provided in advance, based upon attribute information on the character; and an output step for outputting the converted voice as voice of the character. According to this method, the voice produced by a character that appears in a computer game can be set in accordance with the character's characteristics and various voices for each character set by each player can be created.

Type: Grant

Filed: November 12, 2002

Date of Patent: June 5, 2007

Assignee: Sega Corporation

Inventor: Yutaka Okunoki
2-D processing of speech

Patent number: 7203639

Abstract: Acoustic signals are analyzed by two-dimensional (2-D) processing of the one-dimensional (1-D) speech signal in the time-frequency plane. The short-space 2-D Fourier transform of a frequency-related representation (e.g., spectrogram) of the signal is obtained. The 2-D transformation maps harmonically-related signal components to a concentrated entity in the new 2-D plane (compressed frequency-related representation). The series of operations to produce the compressed frequency-related representation is referred to as the “grating compression transform” (GCT), consistent with sine-wave grating patterns in the frequency-related representation reduced to smeared impulses. The GCT provides for speech pitch estimation. The operations may, for example, determine pitch estimates of voiced speech or provide noise filtering or speaker separation in a multiple speaker acoustic signal.

Type: Grant

Filed: September 13, 2002

Date of Patent: April 10, 2007

Assignee: Massachusetts Institute of Technology

Inventor: Thomas F. Quatieri, Jr.
Method and apparatus for transmitting speech activity in distributed voice recognition systems

Patent number: 7203643

Abstract: A system and method for transmitting speech activity in a distributed voice recognition system. The distributed voice recognition system includes a local VR engine in a subscriber unit and a server VR engine on a server. The local VR engine comprises an advanced feature extraction (AFE) module that extracts features from a speech signal, and a voice activity detection (VAD) module that detects voice activity within a speech signal. The combined results from the VAD module and feature extraction module are provided in an efficient manner to a remote device, such as a server, in the form of advanced front end features, thereby enabling the server to process speech segments free of silence regions. Various aspects of efficient speech segment transmission are disclosed.

Type: Grant

Filed: May 28, 2002

Date of Patent: April 10, 2007

Assignee: Qualcomm Incorporated

Inventor: Harinath Garudadri
Prosody generating device, prosody generating method, and program

Patent number: 7200558

Abstract: A prosody generation apparatus capable of suppressing distortion that occurs when generating prosodic patterns and therefore generating a natural prosody is provided. A prosody changing point extraction unit in this apparatus extracts a prosody changing point located at the beginning and the ending of a sentence, the beginning and the ending of a breath group, an accent position and the like. A selection rule and a transformation rule of a prosodic pattern including the prosody changing point is generated by means of a statistical or learning technique and the thus generate rules are stored in a representative prosodic pattern selection rule table and a transformation rule table beforehand. A pattern selection unit selects a representative prosodic pattern from the representative prosodic pattern selection rule table according to the selection rule.

Type: Grant

Filed: March 8, 2002

Date of Patent: April 3, 2007

Assignee: Matsushita Electric Industrial Co., Ltd.

Inventors: Yumiko Kato, Takahiro Kamai
System for handling frequently asked questions in a natural language dialog service

Patent number: 7197460

Abstract: A voice-enabled help desk service is disclosed. The service comprises an automatic speech recognition module for recognizing speech from a user, a spoken language understanding module for understanding the output from the automatic speech recognition module, a dialog management module for generating a response to speech from the user, a natural voices text-to-speech synthesis module for synthesizing speech to generate the response to the user, and a frequently asked questions module. The frequently asked questions module handles frequently asked questions from the user by changing voices and providing predetermined prompts to answer the frequently asked question.

Type: Grant

Filed: December 19, 2002

Date of Patent: March 27, 2007

Assignee: AT&T Corp.

Inventors: Narendra K. Gupta, Mazin G Rahim, Giuseppe Riccardi
System and method for data mining of contextual conversations

Patent number: 7191129

Abstract: A system and method for mining data from stored telephone conversations is provided. Users request advanced data processing on the recorded data, either on the live data stream or the data in storage. Processes search the recorded data for keywords and phrases that the user provides the PTR. User can also request more sophisticated analysis of the recorded data for deeper contextual meaning of the conversations. Context information may include identifying the users, the locations and times referred to by the users during the conference, etc. Additional searches related to the obtained information are performed and the extracted information is compared to similar information obtained from previous meetings. Voice inflections and any emotional stress present in the voices of the users can also be detected and added to the collected information. Search terms can also be highlighted in the results.

Type: Grant

Filed: October 23, 2002

Date of Patent: March 13, 2007

Assignee: International Business Machines Corporation

Inventors: Michael Wayne Brown, Joseph Herbert McIntyre, Victor S. Moore, Michael A. Paolini, Scott Lee Winters
Generation of subtitles or captions for moving pictures

Patent number: 7191117

Abstract: A method for generating subtitles for audiovisual material received and analyses a text file containing dialogue spoken in audiovisual material and provides a signal representative of the text. The text information and audio signal are aligned in time using time alignment speech recognition and the text and timing information are then output to a subtitle file. Colors can be assigned to different speakers or groups of speakers. Subtitles are derived by receiving and analyzing a text file containing dialogue spoken by considering each word in turn and the next information signal, assigning a score to each subtitle in a plurality of different possible subtitle formatting options which lead to that word. The steps are then repeated until all the words in the text information signal have been used and the subtitle formatting option which gives the best overall score is then derived.

Type: Grant

Filed: June 11, 2001

Date of Patent: March 13, 2007

Assignee: British Broadcasting Corporation

Inventors: David Graham Kirby, Christopher Edward Poole, Adam Wiewiorka, William Oscar Lahr
Method of and system for transcribing dictations in text files and for revising the text

Patent number: 7184956

Abstract: The invention relates to a method and a transcription system (T) for transcribing dictations, in which a dictation file (5) is converted into a text file (8), and subsequently the text file (8) is compared with the dictation file (5). To increase the speed for the subsequent correction, provision is made that during transcription of the dictation file (5) a confidence value is generated for a transcribed text passage of the text file (8), and a comparison of the text file (8) with the dictation file (5) takes place only in respect of those text passages for which the confidence value of the text passage is below a confidence limit, i.e. a text passage recognized as possibly defective is present.

Type: Grant

Filed: October 28, 2002

Date of Patent: February 27, 2007

Assignee: Koninklijke Philips Electronics N.V.

Inventor: Kwaku Frimpong-Ansah
Method for compressing dictionary data

Patent number: 7181388

Abstract: The invention relates to pre-processing of a pronunciation dictionary for compression in a data processing device, the pronunciation dictionary comprising at least one entry, the entry comprising a sequence of character units and a sequence of phoneme units. According to one aspect of the invention the sequence of character units and the sequence of phoneme units are aligned using a statistical algorithm. The aligned sequence of character units and aligned sequence of phoneme units are interleaved by inserting each phoneme unit at a predetermined location relative to the corresponding character unit.

Type: Grant

Filed: November 11, 2002

Date of Patent: February 20, 2007

Assignee: Nokia Corporation

Inventor: Jilei Tian
Systems and methods for generating weighted finite-state automata representing grammars

Patent number: 7181386

Abstract: A context-free grammar can be represented by a weighted finite-state transducer. This representation can be used to efficiently compile that grammar into a weighted finite-state automaton that accepts the strings allowed by the grammar with the corresponding weights. The rules of a context-free grammar are input. A finite-state automaton is generated from the input rules. Strongly connected components of the finite-state automaton are identified. An automaton is generated for each strongly connected component. A topology that defines a number of states, and that uses active ones of the non-terminal symbols of the context-free grammar as the labels between those states, is defined. The topology is expanded by replacing a transition, and its beginning and end states, with the automaton that includes, as a state, the symbol used as the label on that transition. The topology can be fully expanded or dynamically expanded as required to recognize a particular input string.

Type: Grant

Filed: July 18, 2002

Date of Patent: February 20, 2007

Assignee: AT&T Corp.

Inventors: Mehryar Mohri, Mark-Jan Nederhof
System and method of context-sensitive help for multi-modal dialog systems

Patent number: 7177815

Abstract: A method of presenting a multi-modal help dialog move to a user in a multi-modal dialog system is disclosed. The method comprises presenting an audio portion of the multi-modal help dialog move that explains available ways of user inquiry and presenting a corresponding graphical action performed on a user interface associated with the audio portion. The multi-modal help dialog move is context-sensitive and uses current display information and dialog contextual information to present a multi-modal help move that is currently related to the user. A user request or a problematic dialog detection module may trigger the multi-modal help move.

Type: Grant

Filed: December 19, 2002

Date of Patent: February 13, 2007

Assignee: AT&T Corp.

Inventors: Patrick Ehlen, Helen Hastie, Michael Johnston
Method for sending multi-media messages using customizable background images

Patent number: 7177811

Abstract: A method is provided for customizing a multi-media message created by a sender for a recipient, in which the multi-media message includes an animated entity audibly presenting speech converted from text by the sender. At least one image is received from the sender. Each of the at least one image is associated with a tag. The sender is presented with options to insert the tag associated with one of the at least one image into the sender text.

Type: Grant

Filed: March 6, 2006

Date of Patent: February 13, 2007

Assignee: AT&T Corp.

Inventors: Joern Ostermann, Barbara Buda, Mehmet Reha Civanlar, Eric Cosatto, Hans Peter Graf, Thomas M. Isaacson, Yann Andre LeCun
System and method of handling problematic input during context-sensitive help for multi-modal dialog systems

Patent number: 7177816

Abstract: A method of presenting a multi-modal help dialog move to a user in a multi-modal dialog system is disclosed. The method comprises presenting an audio portion of the multi-modal help dialog move that explains available ways of user inquiry and presenting a corresponding graphical action performed on a user interface associated with the audio portion. The multi-modal help dialog move is context-sensitive and uses current display information and dialog contextual information to present a multi-modal help move that is currently related to the user. A user request or a problematic dialog detection module may trigger the multi-modal help move.

Type: Grant

Filed: December 19, 2002

Date of Patent: February 13, 2007

Assignee: AT&T Corp.

Inventors: Patrick Ehlen, Helen Hastie, Michael Johnston
Method and system for retrieving hint sentences using expanded queries

Patent number: 7171351

Abstract: A method, computer readable medium and system are provided which retrieve hint sentences from a sentence database in response to a query. An input component receives the query having terms. A search engine expands the query by including synonyms of the terms to obtain expanded terms. The search engine then combines the expanded terms to form dependency triples from the expanded terms. From the formed dependency triples, dependency triples which are not found in a dependency triples database are discarded to obtain remaining dependency triples from the expanded terms. The search engine then searches the sentence database using the remaining dependency triples as search parameters.

Type: Grant

Filed: September 19, 2002

Date of Patent: January 30, 2007

Assignee: Microsoft Corporation

Inventor: Ming Zhou
Method for dialog management

Patent number: 7167832

Abstract: A spoken dialog system and method having a dialog management module are disclosed. The dialog management module includes a plurality of dialog motivators for handling various operations during a spoken dialog. The dialog motivators comprise an error-handling, disambiguation, assumption, confirmation, missing information, and continuation. The spoken dialog system uses the assumption dialog motivator in either a-priori or a-posteriori modes. A-priori assumption is based on predefined requirements for the call flow and a-posteriori assumption can work with the confirmation dialog motivator to assume the content of received user input and confirm received user input.

Type: Grant

Filed: October 11, 2002

Date of Patent: January 23, 2007

Assignee: AT&T Corp.

Inventors: Alicia Abella, Allen Louis Gorin
Inferring hierarchical descriptions of a set of documents

Patent number: 7165024

Abstract: A method automatically determines groups of words or phrases that are descriptive names of a small set of documents, as well as infers concepts in the small set of documents that are more general and more specific than the descriptive names, without any prior knowledge of the hierarchy or the concepts, in a language independent manner. The descriptive names and the concepts may not even be explicitly contained in the documents. The primary application of the invention is for searching of the World Wide Web, but the invention is not limited solely to use with the World Wide Web and may be applied to any set of documents. Classes of features are identified in order to promote understanding of a set of documents. Preferably, there are three classes of features. “Self” features or terms describe the cluster as a whole. “Parent” features or terms describe more general concepts. “Child” features or terms describe specializations of the cluster.

Type: Grant

Filed: July 31, 2002

Date of Patent: January 16, 2007

Assignee: NEC Laboratories America, Inc.

Inventors: Eric J. Glover, Stephen R. Lawrence, David M. Pennock

prev 1 2 3 next