Patents Examined by Michael Opsasnick
-
Patent number: 7447636Abstract: An automated directory assistance (130) includes a training system (210). and a directory assistance system (220). The training system (210) trains and maintains the directory assistance system (220). The training system (210) includes a transcription module (310), a speech grammar estimation module (330), a listings statistics estimation module (340), and a required words determination module (350). The transcription module (310) obtains transcripts relating to directory service requests. The speech grammar estimation module (330) creates an n-gram grammar for multiple telephone numbers from the transcripts. The listings statistics estimation module (340) identifies words used to refer to each of the telephone numbers from the transcripts. The required words determination module (350) identifies at least one word that is required to request each of the telephone numbers from the transcripts.Type: GrantFiled: May 12, 2005Date of Patent: November 4, 2008Assignees: Verizon Corporate Services Group Inc., BBN Technologies Corp.Inventors: Richard Mark Schwartz, Han Shu, John Makhoul, Long Nguyen
-
Patent number: 7418382Abstract: A system and method for providing fast and efficient conversation navigation via a hierarchical structure (structure skeleton) which fully describes functions and services supported by a dialog (conversational) system. In one aspect, a conversational system and method is provided to pre-load dialog menus and target addresses to their associated dialog managing procedures in order to handle multiple or complex modes, contexts or applications. For instance, a content server (web site) (106) can download a skeleton or tree structure (109) describing the content (page) (107) or service provided by the server (106) when the client (100) connects to the server (106). The skeleton is hidden (not spoken) to the user but the user can advance to a page of interest, or to a particular dialog service, by uttering a voice command which is recognized by the conversational system reacting appropriately (as per the user's command) using the information contained within the skeleton.Type: GrantFiled: October 1, 1999Date of Patent: August 26, 2008Assignee: International Business Machines CorporationInventor: Stephane H. Maes
-
Patent number: 7406410Abstract: A decoding apparatus is provided. The decoding apparatus has a first decoding part for decoding a code word obtained by encoding an input signal using a Code-Excited Linear Prediction encoding method. A second decoding part decodes a code word obtained by encoding a signal with an encoding method other than the Code-Excited Linear Prediction encoding method. A rising-transition detection and notification part has a detection part that detects the existence of a rising-transition of amplitude of the input signal based on time variation of a gain of excitation vectors obtained by the first decoding part, and a notification part that notifies the second decoding part that the rising-transition of the amplitude exists.Type: GrantFiled: February 7, 2003Date of Patent: July 29, 2008Assignee: NTT DoCoMo, Inc.Inventors: Kei Kikuiri, Nobuhiko Naka, Tomoyuki Ohya
-
Patent number: 7403888Abstract: A language input architecture receives input text (e.g., phonetic text of a character-based language) entered by a user from an input device (e.g., keyboard, voice recognition). The input text is converted to an output text (e.g., written language text of a character-based language). The language input architecture has a user interface that displays the output text and unconverted input text in line with one another. As the input text is converted, it is replaced in the UI with the converted output text. In addition to this in-line input feature, the UI enables in-place editing or error correction without requiring the user to switch modes from an entry mode to an edit mode. To assist with this in-place editing, the UI presents pop-up windows containing the phonetic text from which the output text was converted as well as first and second candidate lists that contain small and large sets of alternative candidates that might be used to replace the current output text.Type: GrantFiled: June 28, 2000Date of Patent: July 22, 2008Assignee: Microsoft CorporationInventors: Jian Wang, Gao Zhang, Jian Han, Zheng Chen, Xianoning Ling, Kai-Fu Lee
-
Patent number: 7398215Abstract: A prompt translation application for use in a telecommunications messaging system provides an administrator with a plurality of messaging prompts in a base language for revising, translating, and editing. The administrator can nearly simultaneously revise both a visual component of a prompt and an audio component of the prompt, and save the revisions for use on any number of associated endpoints. In the preferred embodiments, revisions to the visual component are made by user input, such as keystrokes, and to the audio component by selection of audio segments to be played in a particular order.Type: GrantFiled: December 24, 2003Date of Patent: July 8, 2008Assignee: Inter-Tel, Inc.Inventors: Ibrahim Mesbah, Eyor Alemayehu
-
Patent number: 7398205Abstract: An excitation vector generator includes an input vector providing system that is capable of providing an input vector having at least one pulse, each pulse having a predetermined position and a respective polarity. A fixed waveform storage system is capable of storing at least one fixed waveform. An arranging system is capable of arranging the at least one fixed waveform in accordance with the position and the polarity of the at least one pulse.Type: GrantFiled: June 2, 2006Date of Patent: July 8, 2008Assignee: Matsushita Electric Industrial Co., Ltd.Inventors: Kazutoshi Yasunaga, Toshiyuki Morii, Hiroyuki Ehara
-
Patent number: 7389234Abstract: The present invention provides a method and apparatus that utilize a context-free grammar written in a markup language format. The markup language format provides a hierarchical format in which grammar structures are delimited within and defined by a set of tags. The markup language format also provides grammar switch tags that indicate a transitions from the context-free grammar to a dictation grammar or a text buffer grammar. In addition, the markup language format provides for the designation of code to be executed when particular grammar structures are recognized from a speech signal.Type: GrantFiled: January 12, 2001Date of Patent: June 17, 2008Assignee: Microsoft CorporationInventors: Philipp H. Schmid, Ralph Lipe, Erik C. Ellerman, Robert L. Chambers
-
Patent number: 7379872Abstract: A mechanism is provided for authenticating and using a personal voice profile. The voice profile may be issued by a trusted third party, such as a certification authority. The personal voice profile may include information for generating a digest or digital signature for text messages. A speech synthesis system may speak the text message using the voice characteristics, such as prosodic characteristics, only if the voice profile is authenticated and the text message is valid and free of tampering.Type: GrantFiled: January 17, 2003Date of Patent: May 27, 2008Assignee: International Business Machines CorporationInventors: Rafael Graniello Cabezas, Jason Eric Moore, Elizabeth Silvia
-
Patent number: 7369991Abstract: The object of the present invention is to keep a high success rate in recognition with a low-volume of sound signal, without being affected by noise.Type: GrantFiled: March 4, 2003Date of Patent: May 6, 2008Assignee: NTT DoCoMo, Inc.Inventors: Hiroyuki Manabe, Akira Hiraiwa, Toshiaki Sugimura
-
Patent number: 7369997Abstract: A system and method for use in computing systems that employ speech recognition capabilities is provided. Where recognized speech can be dictation and commands, one or more buttons may be used to change modes of said computing systems to accept spoken words as dictation, or to accept spoken words as commands, as well as activate a microphone used for the speech recognition. The change in mode may occur responsive to the manner in which a button is pressed, where the manner may include such depressions as taps, press and holds, thumbwheel slides, and other forms of button manipulation.Type: GrantFiled: August 1, 2001Date of Patent: May 6, 2008Assignee: Microsoft CorporationInventors: Robert Chambers, Charlton E. Lui
-
Patent number: 7366673Abstract: A method for processing speech audio in a network connected client device can include selecting a speech grammar for use in a speech recognition system in the network connected client device; characterizing the selected speech grammar; and, based on the characterization, determining whether to process the speech grammar locally in the network connected client device, or remotely in a speech server in the network. Selecting can include establishing a communications session with a speech server; and, querying the speech server for a speech grammar over the established communications session. Selecting can further include registering the speech grammar in the recognition system.Type: GrantFiled: June 15, 2001Date of Patent: April 29, 2008Assignee: International Business Machines CorporationInventors: Harvey M. Ruback, Steven G. Woodward
-
Patent number: 7366663Abstract: For measuring the influence of noise on the talking quality of a telephone link in a telecommunications network, a talker speech signal (s(t)) and a degraded speech signal (s?(t)) are fed to an objective measurement device for obtaining an output signal (q) representing an estimated value of the talking quality. The degraded signal includes a returned signal (r(t)) originating from the network during transmission of the talker speech signal over the telephone link. The objective measurement provided by the device is a modified PSQM-like measurement, which is modified to include modelling of masking effects resulting from noise present in the returned signal. Preferably, the modelling includes noise suppression performed on a difference signal (D(t,f)) in a loudness density domain using noise estimation.Type: GrantFiled: October 11, 2001Date of Patent: April 29, 2008Assignee: Koninklijke KPN N.V.Inventors: John Gerard Beerends, Andries Pieter Hekstra, Symon Ronald Appel
-
Patent number: 7356474Abstract: A system and method for remotely enforcing operational protocols is provided. In a remote environment, such as that found with a police environment, voice recognition technology is used to determine the situation and invoke actions according to an appropriate protocol. Actions may be set to be mandatory or discretionary. A secure log is maintained of the actions undertaken. Actions include automatically retrieving data from a remote database, automatically communicating with another unit or headquarters, and automating devices used in the remote environment. Voice recognition technology also extracts data from the user's speech and builds variables used as parameters in performing the actions. Data is returned to the user in either audible or textual form and either played to the user on a speaker or displayed on a display device.Type: GrantFiled: September 19, 2002Date of Patent: April 8, 2008Assignee: International Business Machines CorporationInventor: David Bruce Kumhyr
-
Patent number: 7356466Abstract: A method and apparatus for calculating an observation probability includes a first operation unit that subtracts a mean of a first plurality of parameters of an input voice signal from a second parameter of an input voice signal, and multiplies the subtraction result to obtain a first output. The first output is squared and accumulated N times in a second operation unit to obtain a second output. A third operation unit subtracts a given weighted value from the second output to obtain a third output, and a comparator stores the third output for a comparator stores the third output in order to extract L outputs therefrom, and stores the L extracted outputs based on an order of magnitude of the extracted L outputs.Type: GrantFiled: June 20, 2003Date of Patent: April 8, 2008Assignee: Samsung Electronics Co., Ltd.Inventors: Byung-Ho Min, Tae-Su Kim, Hyun-Woo Park, Ho-Rang Jang, Keun-Cheol Hong, Sung-Jae Kim
-
Patent number: 7353175Abstract: A word meaning explanation request to a word in document data, which is output as speech, is input from a user instruction input unit. When the word meaning explanation request is input, a text analysis unit analyzes already output document data, which is output as speech immediately before the word meaning explanation request is input. A word meaning search unit searches for a word meaning comment corresponding to a word meaning explanation request objective word obtained based on the analysis result. The word meaning comment is output.Type: GrantFiled: March 4, 2003Date of Patent: April 1, 2008Assignee: Canon Kabushiki KaishaInventor: Kazue Kaneko
-
Patent number: 7349843Abstract: A method and system for determining a language of a call handled by an automatic call distributor is disclosed. The method includes the steps of detecting the call, sampling an audio portion of the call, fitting a plurality of templates to the sampled portion of the call, and determining a language of the call based upon a best relative fit between one of the plurality of audio templates and the sampled portion of the call.Type: GrantFiled: January 18, 2000Date of Patent: March 25, 2008Assignee: Rockwell Electronic Commercial Corp.Inventor: Jim Beck
-
Patent number: 7346502Abstract: There is provided a method of updating a noise state of a voice activity detector (VAD) for indicating an active voice mode and an inactive voice mode. The method comprises receiving an input signal having a plurality of frames, determining an elapsed time since the last update of the noise state, updating the noise state of the VAD if the elapsed time exceeds a predetermined time, determining an average minimum energy based on two or more of the plurality of frames, determining a current minimum energy based on a current frame of the plurality of frames, updating the noise state of the VAD if the average minimum energy is less than the current minimum energy, and updating the noise state of the VAD if the average minimum energy is greater than the current minimum energy plus a first predetermined value.Type: GrantFiled: January 26, 2006Date of Patent: March 18, 2008Assignee: Mindspeed Technologies, Inc.Inventors: Yang Gao, Eyal Shlomot, Adil Benyassine
-
Patent number: 7343283Abstract: An unfiltered frame portion (2) from a second frame (503) is blended together with a filtered frame portion (1) from a first frame (501) to produce a combined frame portion (507). The combined frame portion (507) is then buffered (110) along with the filtered frame (501) for LPC analysis.Type: GrantFiled: October 23, 2002Date of Patent: March 11, 2008Assignee: Motorola, Inc.Inventors: James Ashley, Michael McLaughlin
-
Patent number: 7337117Abstract: An apparatus for phonetically screening predetermined character strings. The apparatus includes a text-to-speech module, and a phonetic screening module in communication with the text-to-speech module. The phonetic screening module is for replacing a first character string with a second character string based on a phonetic enunciation by the text-to-speech module of the first character string.Type: GrantFiled: September 21, 2004Date of Patent: February 26, 2008Assignee: AT&T Delaware Intellectual Property, Inc.Inventor: Anita Hogans Simpson
-
Patent number: 7330814Abstract: A speech encoder/decoder for wideband speech with a partitioning of wideband into lowband and highband, convenient coding of the lowband, and LP excited by noise plus some periodicity for the highband. The embedded lowband may be extracted for a lower bit rate decoder.Type: GrantFiled: May 15, 2001Date of Patent: February 12, 2008Assignee: Texas Instruments IncorporatedInventor: Alan V. McCree