Patents Examined by Michael Opsasnick

System and methods for using transcripts to train an automated directory assistance service

Patent number: 7447636

Abstract: An automated directory assistance (130) includes a training system (210). and a directory assistance system (220). The training system (210) trains and maintains the directory assistance system (220). The training system (210) includes a transcription module (310), a speech grammar estimation module (330), a listings statistics estimation module (340), and a required words determination module (350). The transcription module (310) obtains transcripts relating to directory service requests. The speech grammar estimation module (330) creates an n-gram grammar for multiple telephone numbers from the transcripts. The listings statistics estimation module (340) identifies words used to refer to each of the telephone numbers from the transcripts. The required words determination module (350) identifies at least one word that is required to request each of the telephone numbers from the transcripts.

Type: Grant

Filed: May 12, 2005

Date of Patent: November 4, 2008

Assignees: Verizon Corporate Services Group Inc., BBN Technologies Corp.

Inventors: Richard Mark Schwartz, Han Shu, John Makhoul, Long Nguyen
Structure skeletons for efficient voice navigation through generic hierarchical objects

Patent number: 7418382

Abstract: A system and method for providing fast and efficient conversation navigation via a hierarchical structure (structure skeleton) which fully describes functions and services supported by a dialog (conversational) system. In one aspect, a conversational system and method is provided to pre-load dialog menus and target addresses to their associated dialog managing procedures in order to handle multiple or complex modes, contexts or applications. For instance, a content server (web site) (106) can download a skeleton or tree structure (109) describing the content (page) (107) or service provided by the server (106) when the client (100) connects to the server (106). The skeleton is hidden (not spoken) to the user but the user can advance to a page of interest, or to a particular dialog service, by uttering a voice command which is recognized by the conversational system reacting appropriately (as per the user's command) using the information contained within the skeleton.

Type: Grant

Filed: October 1, 1999

Date of Patent: August 26, 2008

Assignee: International Business Machines Corporation

Inventor: Stephane H. Maes
Encoding and decoding method and apparatus using rising-transition detection and notification

Patent number: 7406410

Abstract: A decoding apparatus is provided. The decoding apparatus has a first decoding part for decoding a code word obtained by encoding an input signal using a Code-Excited Linear Prediction encoding method. A second decoding part decodes a code word obtained by encoding a signal with an encoding method other than the Code-Excited Linear Prediction encoding method. A rising-transition detection and notification part has a detection part that detects the existence of a rising-transition of amplitude of the input signal based on time variation of a gain of excitation vectors obtained by the first decoding part, and a notification part that notifies the second decoding part that the rising-transition of the amplitude exists.

Type: Grant

Filed: February 7, 2003

Date of Patent: July 29, 2008

Assignee: NTT DoCoMo, Inc.

Inventors: Kei Kikuiri, Nobuhiko Naka, Tomoyuki Ohya
Language input user interface

Patent number: 7403888

Abstract: A language input architecture receives input text (e.g., phonetic text of a character-based language) entered by a user from an input device (e.g., keyboard, voice recognition). The input text is converted to an output text (e.g., written language text of a character-based language). The language input architecture has a user interface that displays the output text and unconverted input text in line with one another. As the input text is converted, it is replaced in the UI with the converted output text. In addition to this in-line input feature, the UI enables in-place editing or error correction without requiring the user to switch modes from an entry mode to an edit mode. To assist with this in-place editing, the UI presents pop-up windows containing the phonetic text from which the output text was converted as well as first and second candidate lists that contain small and large sets of alternative candidates that might be used to replace the current output text.

Type: Grant

Filed: June 28, 2000

Date of Patent: July 22, 2008

Assignee: Microsoft Corporation

Inventors: Jian Wang, Gao Zhang, Jian Han, Zheng Chen, Xianoning Ling, Kai-Fu Lee
Code excited linear prediction speech decoder and method thereof

Patent number: 7398205

Abstract: An excitation vector generator includes an input vector providing system that is capable of providing an input vector having at least one pulse, each pulse having a predetermined position and a respective polarity. A fixed waveform storage system is capable of storing at least one fixed waveform. An arranging system is capable of arranging the at least one fixed waveform in accordance with the position and the polarity of the at least one pulse.

Type: Grant

Filed: June 2, 2006

Date of Patent: July 8, 2008

Assignee: Matsushita Electric Industrial Co., Ltd.

Inventors: Kazutoshi Yasunaga, Toshiyuki Morii, Hiroyuki Ehara
Prompt language translation for a telecommunications system

Patent number: 7398215

Abstract: A prompt translation application for use in a telecommunications messaging system provides an administrator with a plurality of messaging prompts in a base language for revising, translating, and editing. The administrator can nearly simultaneously revise both a visual component of a prompt and an audio component of the prompt, and save the revisions for use on any number of associated endpoints. In the preferred embodiments, revisions to the visual component are made by user input, such as keystrokes, and to the audio component by selection of audio segments to be played in a particular order.

Type: Grant

Filed: December 24, 2003

Date of Patent: July 8, 2008

Assignee: Inter-Tel, Inc.

Inventors: Ibrahim Mesbah, Eyor Alemayehu
Method and apparatus utilizing speech grammar rules written in a markup language

Patent number: 7389234

Abstract: The present invention provides a method and apparatus that utilize a context-free grammar written in a markup language format. The markup language format provides a hierarchical format in which grammar structures are delimited within and defined by a set of tags. The markup language format also provides grammar switch tags that indicate a transitions from the context-free grammar to a dictation grammar or a text buffer grammar. In addition, the markup language format provides for the designation of code to be executed when particular grammar structures are recognized from a speech signal.

Type: Grant

Filed: January 12, 2001

Date of Patent: June 17, 2008

Assignee: Microsoft Corporation

Inventors: Philipp H. Schmid, Ralph Lipe, Erik C. Ellerman, Robert L. Chambers
Method, apparatus, and program for certifying a voice profile when transmitting text messages for synthesized speech

Patent number: 7379872

Abstract: A mechanism is provided for authenticating and using a personal voice profile. The voice profile may be issued by a trusted third party, such as a certification authority. The personal voice profile may include information for generating a digest or digital signature for text messages. A speech synthesis system may speak the text message using the voice characteristics, such as prosodic characteristics, only if the voice profile is authenticated and the text message is valid and free of tampering.

Type: Grant

Filed: January 17, 2003

Date of Patent: May 27, 2008

Assignee: International Business Machines Corporation

Inventors: Rafael Graniello Cabezas, Jason Eric Moore, Elizabeth Silvia
Controlling speech recognition functionality in a computing device

Patent number: 7369997

Abstract: A system and method for use in computing systems that employ speech recognition capabilities is provided. Where recognized speech can be dictation and commands, one or more buttons may be used to change modes of said computing systems to accept spoken words as dictation, or to accept spoken words as commands, as well as activate a microphone used for the speech recognition. The change in mode may occur responsive to the manner in which a button is pressed, where the manner may include such depressions as taps, press and holds, thumbwheel slides, and other forms of button manipulation.

Type: Grant

Filed: August 1, 2001

Date of Patent: May 6, 2008

Assignee: Microsoft Corporation

Inventors: Robert Chambers, Charlton E. Lui
Speech recognition system, speech recognition method, speech synthesis system, speech synthesis method, and program product having increased accuracy

Patent number: 7369991

Abstract: The object of the present invention is to keep a high success rate in recognition with a low-volume of sound signal, without being affected by noise.

Type: Grant

Filed: March 4, 2003

Date of Patent: May 6, 2008

Assignee: NTT DoCoMo, Inc.

Inventors: Hiroyuki Manabe, Akira Hiraiwa, Toshiaki Sugimura
Selective enablement of speech recognition grammars

Patent number: 7366673

Abstract: A method for processing speech audio in a network connected client device can include selecting a speech grammar for use in a speech recognition system in the network connected client device; characterizing the selected speech grammar; and, based on the characterization, determining whether to process the speech grammar locally in the network connected client device, or remotely in a speech server in the network. Selecting can include establishing a communications session with a speech server; and, querying the speech server for a speech grammar over the established communications session. Selecting can further include registering the speech grammar in the recognition system.

Type: Grant

Filed: June 15, 2001

Date of Patent: April 29, 2008

Assignee: International Business Machines Corporation

Inventors: Harvey M. Ruback, Steven G. Woodward
Measuring a talking quality of a telephone link in a telecommunications network

Patent number: 7366663

Abstract: For measuring the influence of noise on the talking quality of a telephone link in a telecommunications network, a talker speech signal (s(t)) and a degraded speech signal (s?(t)) are fed to an objective measurement device for obtaining an output signal (q) representing an estimated value of the talking quality. The degraded signal includes a returned signal (r(t)) originating from the network during transmission of the talker speech signal over the telephone link. The objective measurement provided by the device is a modified PSQM-like measurement, which is modified to include modelling of masking effects resulting from noise present in the returned signal. Preferably, the modelling includes noise suppression performed on a difference signal (D(t,f)) in a loudness density domain using noise estimation.

Type: Grant

Filed: October 11, 2001

Date of Patent: April 29, 2008

Assignee: Koninklijke KPN N.V.

Inventors: John Gerard Beerends, Andries Pieter Hekstra, Symon Ronald Appel
Method and apparatus for performing observation probability calculations

Patent number: 7356466

Abstract: A method and apparatus for calculating an observation probability includes a first operation unit that subtracts a mean of a first plurality of parameters of an input voice signal from a second parameter of an input voice signal, and multiplies the subtraction result to obtain a first output. The first output is squared and accumulated N times in a second operation unit to obtain a second output. A third operation unit subtracts a given weighted value from the second output to obtain a third output, and a comparator stores the third output for a comparator stores the third output in order to extract L outputs therefrom, and stores the L extracted outputs based on an order of magnitude of the extracted L outputs.

Type: Grant

Filed: June 20, 2003

Date of Patent: April 8, 2008

Assignee: Samsung Electronics Co., Ltd.

Inventors: Byung-Ho Min, Tae-Su Kim, Hyun-Woo Park, Ho-Rang Jang, Keun-Cheol Hong, Sung-Jae Kim
System and method for remotely enforcing operational protocols

Patent number: 7356474

Abstract: A system and method for remotely enforcing operational protocols is provided. In a remote environment, such as that found with a police environment, voice recognition technology is used to determine the situation and invoke actions according to an appropriate protocol. Actions may be set to be mandatory or discretionary. A secure log is maintained of the actions undertaken. Actions include automatically retrieving data from a remote database, automatically communicating with another unit or headquarters, and automating devices used in the remote environment. Voice recognition technology also extracts data from the user's speech and builds variables used as parameters in performing the actions. Data is returned to the user in either audible or textual form and either played to the user on a speaker or displayed on a display device.

Type: Grant

Filed: September 19, 2002

Date of Patent: April 8, 2008

Assignee: International Business Machines Corporation

Inventor: David Bruce Kumhyr
Apparatus, method, and program for speech synthesis with capability of providing word meaning immediately upon request by a user

Patent number: 7353175

Abstract: A word meaning explanation request to a word in document data, which is output as speech, is input from a user instruction input unit. When the word meaning explanation request is input, a text analysis unit analyzes already output document data, which is output as speech immediately before the word meaning explanation request is input. A word meaning search unit searches for a word meaning comment corresponding to a word meaning explanation request objective word obtained based on the analysis result. The word meaning comment is output.

Type: Grant

Filed: March 4, 2003

Date of Patent: April 1, 2008

Assignee: Canon Kabushiki Kaisha

Inventor: Kazue Kaneko
Automatic call distributor with language based routing system and method

Patent number: 7349843

Abstract: A method and system for determining a language of a call handled by an automatic call distributor is disclosed. The method includes the steps of detecting the call, sampling an audio portion of the call, fitting a plurality of templates to the sampled portion of the call, and determining a language of the call based upon a best relative fit between one of the plurality of audio templates and the sampled portion of the call.

Type: Grant

Filed: January 18, 2000

Date of Patent: March 25, 2008

Assignee: Rockwell Electronic Commercial Corp.

Inventor: Jim Beck
Adaptive noise state update for a voice activity detector

Patent number: 7346502

Abstract: There is provided a method of updating a noise state of a voice activity detector (VAD) for indicating an active voice mode and an inactive voice mode. The method comprises receiving an input signal having a plurality of frames, determining an elapsed time since the last update of the noise state, updating the noise state of the VAD if the elapsed time exceeds a predetermined time, determining an average minimum energy based on two or more of the plurality of frames, determining a current minimum energy based on a current frame of the plurality of frames, updating the noise state of the VAD if the average minimum energy is less than the current minimum energy, and updating the noise state of the VAD if the average minimum energy is greater than the current minimum energy plus a first predetermined value.

Type: Grant

Filed: January 26, 2006

Date of Patent: March 18, 2008

Assignee: Mindspeed Technologies, Inc.

Inventors: Yang Gao, Eyal Shlomot, Adil Benyassine
Method and apparatus for coding a noise-suppressed audio signal

Patent number: 7343283

Abstract: An unfiltered frame portion (2) from a second frame (503) is blended together with a filtered frame portion (1) from a first frame (501) to produce a combined frame portion (507). The combined frame portion (507) is then buffered (110) along with the filtered frame (501) for LPC analysis.

Type: Grant

Filed: October 23, 2002

Date of Patent: March 11, 2008

Assignee: Motorola, Inc.

Inventors: James Ashley, Michael McLaughlin
Apparatus and method for phonetically screening predetermined character strings

Patent number: 7337117

Abstract: An apparatus for phonetically screening predetermined character strings. The apparatus includes a text-to-speech module, and a phonetic screening module in communication with the text-to-speech module. The phonetic screening module is for replacing a first character string with a second character string based on a phonetic enunciation by the text-to-speech module of the first character string.

Type: Grant

Filed: September 21, 2004

Date of Patent: February 26, 2008

Assignee: AT&T Delaware Intellectual Property, Inc.

Inventor: Anita Hogans Simpson
Wideband speech coding with modulated noise highband excitation system and method

Patent number: 7330814

Abstract: A speech encoder/decoder for wideband speech with a partitioning of wideband into lowband and highband, convenient coding of the lowband, and LP excited by noise plus some periodicity for the highband. The embedded lowband may be extracted for a lower bit rate decoder.

Type: Grant

Filed: May 15, 2001

Date of Patent: February 12, 2008

Assignee: Texas Instruments Incorporated

Inventor: Alan V. McCree

1 2 next