Patents Examined by Michael Opsasnick
  • Patent number: 7447636
    Abstract: An automated directory assistance (130) includes a training system (210). and a directory assistance system (220). The training system (210) trains and maintains the directory assistance system (220). The training system (210) includes a transcription module (310), a speech grammar estimation module (330), a listings statistics estimation module (340), and a required words determination module (350). The transcription module (310) obtains transcripts relating to directory service requests. The speech grammar estimation module (330) creates an n-gram grammar for multiple telephone numbers from the transcripts. The listings statistics estimation module (340) identifies words used to refer to each of the telephone numbers from the transcripts. The required words determination module (350) identifies at least one word that is required to request each of the telephone numbers from the transcripts.
    Type: Grant
    Filed: May 12, 2005
    Date of Patent: November 4, 2008
    Assignees: Verizon Corporate Services Group Inc., BBN Technologies Corp.
    Inventors: Richard Mark Schwartz, Han Shu, John Makhoul, Long Nguyen
  • Patent number: 7418382
    Abstract: A system and method for providing fast and efficient conversation navigation via a hierarchical structure (structure skeleton) which fully describes functions and services supported by a dialog (conversational) system. In one aspect, a conversational system and method is provided to pre-load dialog menus and target addresses to their associated dialog managing procedures in order to handle multiple or complex modes, contexts or applications. For instance, a content server (web site) (106) can download a skeleton or tree structure (109) describing the content (page) (107) or service provided by the server (106) when the client (100) connects to the server (106). The skeleton is hidden (not spoken) to the user but the user can advance to a page of interest, or to a particular dialog service, by uttering a voice command which is recognized by the conversational system reacting appropriately (as per the user's command) using the information contained within the skeleton.
    Type: Grant
    Filed: October 1, 1999
    Date of Patent: August 26, 2008
    Assignee: International Business Machines Corporation
    Inventor: Stephane H. Maes
  • Patent number: 7406410
    Abstract: A decoding apparatus is provided. The decoding apparatus has a first decoding part for decoding a code word obtained by encoding an input signal using a Code-Excited Linear Prediction encoding method. A second decoding part decodes a code word obtained by encoding a signal with an encoding method other than the Code-Excited Linear Prediction encoding method. A rising-transition detection and notification part has a detection part that detects the existence of a rising-transition of amplitude of the input signal based on time variation of a gain of excitation vectors obtained by the first decoding part, and a notification part that notifies the second decoding part that the rising-transition of the amplitude exists.
    Type: Grant
    Filed: February 7, 2003
    Date of Patent: July 29, 2008
    Assignee: NTT DoCoMo, Inc.
    Inventors: Kei Kikuiri, Nobuhiko Naka, Tomoyuki Ohya
  • Patent number: 7403888
    Abstract: A language input architecture receives input text (e.g., phonetic text of a character-based language) entered by a user from an input device (e.g., keyboard, voice recognition). The input text is converted to an output text (e.g., written language text of a character-based language). The language input architecture has a user interface that displays the output text and unconverted input text in line with one another. As the input text is converted, it is replaced in the UI with the converted output text. In addition to this in-line input feature, the UI enables in-place editing or error correction without requiring the user to switch modes from an entry mode to an edit mode. To assist with this in-place editing, the UI presents pop-up windows containing the phonetic text from which the output text was converted as well as first and second candidate lists that contain small and large sets of alternative candidates that might be used to replace the current output text.
    Type: Grant
    Filed: June 28, 2000
    Date of Patent: July 22, 2008
    Assignee: Microsoft Corporation
    Inventors: Jian Wang, Gao Zhang, Jian Han, Zheng Chen, Xianoning Ling, Kai-Fu Lee
  • Patent number: 7398215
    Abstract: A prompt translation application for use in a telecommunications messaging system provides an administrator with a plurality of messaging prompts in a base language for revising, translating, and editing. The administrator can nearly simultaneously revise both a visual component of a prompt and an audio component of the prompt, and save the revisions for use on any number of associated endpoints. In the preferred embodiments, revisions to the visual component are made by user input, such as keystrokes, and to the audio component by selection of audio segments to be played in a particular order.
    Type: Grant
    Filed: December 24, 2003
    Date of Patent: July 8, 2008
    Assignee: Inter-Tel, Inc.
    Inventors: Ibrahim Mesbah, Eyor Alemayehu
  • Patent number: 7398205
    Abstract: An excitation vector generator includes an input vector providing system that is capable of providing an input vector having at least one pulse, each pulse having a predetermined position and a respective polarity. A fixed waveform storage system is capable of storing at least one fixed waveform. An arranging system is capable of arranging the at least one fixed waveform in accordance with the position and the polarity of the at least one pulse.
    Type: Grant
    Filed: June 2, 2006
    Date of Patent: July 8, 2008
    Assignee: Matsushita Electric Industrial Co., Ltd.
    Inventors: Kazutoshi Yasunaga, Toshiyuki Morii, Hiroyuki Ehara
  • Patent number: 7389234
    Abstract: The present invention provides a method and apparatus that utilize a context-free grammar written in a markup language format. The markup language format provides a hierarchical format in which grammar structures are delimited within and defined by a set of tags. The markup language format also provides grammar switch tags that indicate a transitions from the context-free grammar to a dictation grammar or a text buffer grammar. In addition, the markup language format provides for the designation of code to be executed when particular grammar structures are recognized from a speech signal.
    Type: Grant
    Filed: January 12, 2001
    Date of Patent: June 17, 2008
    Assignee: Microsoft Corporation
    Inventors: Philipp H. Schmid, Ralph Lipe, Erik C. Ellerman, Robert L. Chambers
  • Patent number: 7379872
    Abstract: A mechanism is provided for authenticating and using a personal voice profile. The voice profile may be issued by a trusted third party, such as a certification authority. The personal voice profile may include information for generating a digest or digital signature for text messages. A speech synthesis system may speak the text message using the voice characteristics, such as prosodic characteristics, only if the voice profile is authenticated and the text message is valid and free of tampering.
    Type: Grant
    Filed: January 17, 2003
    Date of Patent: May 27, 2008
    Assignee: International Business Machines Corporation
    Inventors: Rafael Graniello Cabezas, Jason Eric Moore, Elizabeth Silvia
  • Patent number: 7369991
    Abstract: The object of the present invention is to keep a high success rate in recognition with a low-volume of sound signal, without being affected by noise.
    Type: Grant
    Filed: March 4, 2003
    Date of Patent: May 6, 2008
    Assignee: NTT DoCoMo, Inc.
    Inventors: Hiroyuki Manabe, Akira Hiraiwa, Toshiaki Sugimura
  • Patent number: 7369997
    Abstract: A system and method for use in computing systems that employ speech recognition capabilities is provided. Where recognized speech can be dictation and commands, one or more buttons may be used to change modes of said computing systems to accept spoken words as dictation, or to accept spoken words as commands, as well as activate a microphone used for the speech recognition. The change in mode may occur responsive to the manner in which a button is pressed, where the manner may include such depressions as taps, press and holds, thumbwheel slides, and other forms of button manipulation.
    Type: Grant
    Filed: August 1, 2001
    Date of Patent: May 6, 2008
    Assignee: Microsoft Corporation
    Inventors: Robert Chambers, Charlton E. Lui
  • Patent number: 7366673
    Abstract: A method for processing speech audio in a network connected client device can include selecting a speech grammar for use in a speech recognition system in the network connected client device; characterizing the selected speech grammar; and, based on the characterization, determining whether to process the speech grammar locally in the network connected client device, or remotely in a speech server in the network. Selecting can include establishing a communications session with a speech server; and, querying the speech server for a speech grammar over the established communications session. Selecting can further include registering the speech grammar in the recognition system.
    Type: Grant
    Filed: June 15, 2001
    Date of Patent: April 29, 2008
    Assignee: International Business Machines Corporation
    Inventors: Harvey M. Ruback, Steven G. Woodward
  • Patent number: 7366663
    Abstract: For measuring the influence of noise on the talking quality of a telephone link in a telecommunications network, a talker speech signal (s(t)) and a degraded speech signal (s?(t)) are fed to an objective measurement device for obtaining an output signal (q) representing an estimated value of the talking quality. The degraded signal includes a returned signal (r(t)) originating from the network during transmission of the talker speech signal over the telephone link. The objective measurement provided by the device is a modified PSQM-like measurement, which is modified to include modelling of masking effects resulting from noise present in the returned signal. Preferably, the modelling includes noise suppression performed on a difference signal (D(t,f)) in a loudness density domain using noise estimation.
    Type: Grant
    Filed: October 11, 2001
    Date of Patent: April 29, 2008
    Assignee: Koninklijke KPN N.V.
    Inventors: John Gerard Beerends, Andries Pieter Hekstra, Symon Ronald Appel
  • Patent number: 7356474
    Abstract: A system and method for remotely enforcing operational protocols is provided. In a remote environment, such as that found with a police environment, voice recognition technology is used to determine the situation and invoke actions according to an appropriate protocol. Actions may be set to be mandatory or discretionary. A secure log is maintained of the actions undertaken. Actions include automatically retrieving data from a remote database, automatically communicating with another unit or headquarters, and automating devices used in the remote environment. Voice recognition technology also extracts data from the user's speech and builds variables used as parameters in performing the actions. Data is returned to the user in either audible or textual form and either played to the user on a speaker or displayed on a display device.
    Type: Grant
    Filed: September 19, 2002
    Date of Patent: April 8, 2008
    Assignee: International Business Machines Corporation
    Inventor: David Bruce Kumhyr
  • Patent number: 7356466
    Abstract: A method and apparatus for calculating an observation probability includes a first operation unit that subtracts a mean of a first plurality of parameters of an input voice signal from a second parameter of an input voice signal, and multiplies the subtraction result to obtain a first output. The first output is squared and accumulated N times in a second operation unit to obtain a second output. A third operation unit subtracts a given weighted value from the second output to obtain a third output, and a comparator stores the third output for a comparator stores the third output in order to extract L outputs therefrom, and stores the L extracted outputs based on an order of magnitude of the extracted L outputs.
    Type: Grant
    Filed: June 20, 2003
    Date of Patent: April 8, 2008
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Byung-Ho Min, Tae-Su Kim, Hyun-Woo Park, Ho-Rang Jang, Keun-Cheol Hong, Sung-Jae Kim
  • Patent number: 7353175
    Abstract: A word meaning explanation request to a word in document data, which is output as speech, is input from a user instruction input unit. When the word meaning explanation request is input, a text analysis unit analyzes already output document data, which is output as speech immediately before the word meaning explanation request is input. A word meaning search unit searches for a word meaning comment corresponding to a word meaning explanation request objective word obtained based on the analysis result. The word meaning comment is output.
    Type: Grant
    Filed: March 4, 2003
    Date of Patent: April 1, 2008
    Assignee: Canon Kabushiki Kaisha
    Inventor: Kazue Kaneko
  • Patent number: 7349843
    Abstract: A method and system for determining a language of a call handled by an automatic call distributor is disclosed. The method includes the steps of detecting the call, sampling an audio portion of the call, fitting a plurality of templates to the sampled portion of the call, and determining a language of the call based upon a best relative fit between one of the plurality of audio templates and the sampled portion of the call.
    Type: Grant
    Filed: January 18, 2000
    Date of Patent: March 25, 2008
    Assignee: Rockwell Electronic Commercial Corp.
    Inventor: Jim Beck
  • Patent number: 7346502
    Abstract: There is provided a method of updating a noise state of a voice activity detector (VAD) for indicating an active voice mode and an inactive voice mode. The method comprises receiving an input signal having a plurality of frames, determining an elapsed time since the last update of the noise state, updating the noise state of the VAD if the elapsed time exceeds a predetermined time, determining an average minimum energy based on two or more of the plurality of frames, determining a current minimum energy based on a current frame of the plurality of frames, updating the noise state of the VAD if the average minimum energy is less than the current minimum energy, and updating the noise state of the VAD if the average minimum energy is greater than the current minimum energy plus a first predetermined value.
    Type: Grant
    Filed: January 26, 2006
    Date of Patent: March 18, 2008
    Assignee: Mindspeed Technologies, Inc.
    Inventors: Yang Gao, Eyal Shlomot, Adil Benyassine
  • Patent number: 7343283
    Abstract: An unfiltered frame portion (2) from a second frame (503) is blended together with a filtered frame portion (1) from a first frame (501) to produce a combined frame portion (507). The combined frame portion (507) is then buffered (110) along with the filtered frame (501) for LPC analysis.
    Type: Grant
    Filed: October 23, 2002
    Date of Patent: March 11, 2008
    Assignee: Motorola, Inc.
    Inventors: James Ashley, Michael McLaughlin
  • Patent number: 7337117
    Abstract: An apparatus for phonetically screening predetermined character strings. The apparatus includes a text-to-speech module, and a phonetic screening module in communication with the text-to-speech module. The phonetic screening module is for replacing a first character string with a second character string based on a phonetic enunciation by the text-to-speech module of the first character string.
    Type: Grant
    Filed: September 21, 2004
    Date of Patent: February 26, 2008
    Assignee: AT&T Delaware Intellectual Property, Inc.
    Inventor: Anita Hogans Simpson
  • Patent number: 7330814
    Abstract: A speech encoder/decoder for wideband speech with a partitioning of wideband into lowband and highband, convenient coding of the lowband, and LP excited by noise plus some periodicity for the highband. The embedded lowband may be extracted for a lower bit rate decoder.
    Type: Grant
    Filed: May 15, 2001
    Date of Patent: February 12, 2008
    Assignee: Texas Instruments Incorporated
    Inventor: Alan V. McCree