Patents Examined by Vijay Chawan

Speech recognition system having multiple speech recognizers

Patent number: 7228275

Abstract: A speech recognition system recognizes an input speech signal by using a first speech recognizer and a second speech recognizer each coupled to a decision module. Each of the first and second speech recognizers outputs first and second recognized speech texts and first and second associated confidence scores, respectively, and the decision module selects either the first or the second speech text depending upon which of the first or second confidence score is higher. The decision module may also adjust the first and second confidence scores to generate first and second adjusted confidence scores, respectively, and select either the first or second speech text depending upon which of the first or second adjusted confidence scores is higher. The first and second confidence scores may be adjusted based upon the location of a speaker, the identity or accent of the speaker, the context of the speech, and the like.

Type: Grant

Filed: January 13, 2003

Date of Patent: June 5, 2007

Assignees: Toyota InfoTechnology Center Co., Ltd., iAnywhere Solutions, Inc.

Inventors: Norikazu Endo, John R. Brookes, Benjamin K. Reaves, Babak Hodjat, Masahiko Funaki
Telephone apparatus

Patent number: 7228271

Abstract: The telephone apparatus of the present invention comprises a first voice band expander for generating a voiced signal frequency component by shifting the frequency of the voice signal received, a second voice band expander for generating a voiceless signal frequency component by shifting the frequency of the voice signal received, and a voice composer for composing the voice signal received, the output of the first voice band expander, and the output of the second voice band expander, which is able to output clear voices in aural communication.

Type: Grant

Filed: December 23, 2002

Date of Patent: June 5, 2007

Assignee: Matsushita Electric Industrial Co., Ltd.

Inventors: Toshimichi Tokuda, Takashi Kimura
Finding database match for file based on file characteristics

Patent number: 7228280

Abstract: A signature array of digitally sampled audio is formed from segment(s) of the digitally sampled audio by counting the number of occurrences within the segment(s) in each of a plurality of value bands or slots, such as amplitude bands. The signature array undergoes a fuzzy comparison with signatures arrays in the database. If more than one potential match is found, a more precise comparison is made. In the case of compact discs (CDs), five second sample segments may taken from the beginning, middle and end of each track to detect, e.g., the amplitude of the digitally sampled audio on the CD. A CD signature array may be formed of approximately 2000 value bands or slots by accumulating the occurrence of signals within each slot for all of the sample segments of the CD.

Type: Grant

Filed: July 21, 2000

Date of Patent: June 5, 2007

Assignee: Gracenote, Inc.

Inventors: Steven D. Scherf, Paul E. Quinn
Recognition of identification patterns

Patent number: 7228274

Abstract: The invention relates to a method and to a device for identifying markers in data blocks. The method is characterized in that a number of data blocks (201a 201n) are received by a receiver (202) and one specific data block from the number of data blocks (201a 201n) is analyzed in order to determine whether the specific data block contains a marker (203) or not. Once a data block that contains a marker (203) is identified, a full-rate signaling block (206) that is transmitted at full rate and a half-rate signaling block (207) that is transmitted at half rate are searched for markers. A predetermined reference pattern is provided for correlation that is subsequently divided into sub-ranges that in turn are correlated with a predetermined reference pattern to identify a signaling frame in the full-rate signaling blocks (206) and/or the half-rate signaling blocks (207).

Type: Grant

Filed: December 13, 2001

Date of Patent: June 5, 2007

Assignee: Infineon Technologies AG

Inventors: Johann Steger, Michael Weber
System and method for providing information using spoken dialogue interface

Patent number: 7225128

Abstract: There are provided a system and method for providing information using a spoken dialogue interface. The system includes a speech recognizer for transforming voice signals into sentences; a sentence analyzer for analyzing the sentences by their structural elements; a dialogue manager for extracting information on speech acts or intentions from the structural elements, and generating information on system's speech acts or intentions for a response to the extracted information on speech acts or intentions; a sentence generator for generating sentences based on the information on the system's speech acts or intentions for the response; a speech synthesizer for synthesizing the generated sentences into voices; an information extractor for extracting information required for the response from the Internet in real time; and a user modeling means for analyzing and classifying users' tendencies.

Type: Grant

Filed: March 31, 2003

Date of Patent: May 29, 2007

Assignee: Samsung Electronics Co., Ltd.

Inventors: Jeong-su Kim, Yong-beom Lee, Jae-won Lee, Hye-jeong Lee, Chan-min Park, Hee-kyoung Seo
Automatic interpreting system including a system for recognizing errors

Patent number: 7219050

Abstract: An automatic interpreting system, having at least an inputting unit for inputting utterances, an interpreting unit for recognizing the input utterance and interpreting the input utterance into a specified language, and an outputting unit for outputting interpretation results, includes a conversation history managing unit for determining the level of interpretation reliability calculated based on the interpretation result, having the user input their level of interpretation understanding, and recording and storing this data as interpretation history information, and a conversation status determination processing unit for, if the interpretation of an utterance is not understandable to the receiving party, determining the conversation status by selecting from categories pre-determined based on the levels of interpretation reliability and interpretation understanding for the previous utterance that are stored in the conversation history managing unit, and a recommended action presenting part for presenting to the

Type: Grant

Filed: September 7, 2001

Date of Patent: May 15, 2007

Assignee: NEC Corporation

Inventors: Kai Ishikawa, Shinichi Ando, Akitoshi Okumura
Wirelessly delivered owner's manual

Patent number: 7219063

Abstract: This invention is directed to a method of delivering vehicle owner's manual or other vehicle-specific information to the vehicle operator from a remote data center and associated vehicle information database by utilizing a voice recognition system at the remote data center and delivering the information to the vehicle operator in audible speech. The vehicle operator speaks his request in the vehicle and the data center recognizes the request, perhaps asks more questions, leads the vehicle operator through a spoken menu, and then provides the answer vocally to the vehicle operator over the speaker(s) located in the vehicle. The invention includes methodology for obtaining vehicle diagnostic information and controlling certain vehicle functions automatically via an embedded telematics control unit. The invention further includes remote telephone access outside the vehicle.

Type: Grant

Filed: November 18, 2004

Date of Patent: May 15, 2007

Assignee: ATX Technologies, Inc.

Inventors: Thomas Barton Schalk, Steve Alan Millstein
Legged robot, legged robot behavior control method, and storage medium

Patent number: 7219064

Abstract: To provide a robot which autonomously forms and performs an action plan in response to external factors without direct command input from an operator. When reading a story printed in a book or other print media or recorded in recording media or when reading a story downloaded through a network, the robot does not simply read every single word as it is written. Instead, the robot uses external factors, such as a change of time, a change of season, or a change in a user's mood, and dynamically alters the story as long as the changed contents are substantially the same as the original contents. As a result, the robot can read aloud the story whose contents would differ every time the story is read.

Type: Grant

Filed: October 23, 2001

Date of Patent: May 15, 2007

Assignee: Sony Corporation

Inventors: Hideki Nakakita, Tomoaki Kasuga
Method for transmitting character message using voice recognition in portable terminal

Patent number: 7209877

Abstract: A method for transmitting a character message using voice recognition in a portable terminal, includes the steps of: a) displaying a guidance message associated with a voice/character conversion service when the message is created; b) connecting the portable terminal to a base station when the voice/character conversion service is selected; c) transmitting a voice message from a user to the base station; d) storing the voice message in the base station; e) allowing the base station to convert the voice message into a character message; f) transmitting the character message to the portable terminal; g) inputting a destination number; and h) transmitting the character message to a destination terminal corresponding to the inputted destination number.

Type: Grant

Filed: January 14, 2003

Date of Patent: April 24, 2007

Assignee: Samsung Electronics Co., Ltd.

Inventor: Kyu-Sung Han
Method and apparatus for accessing targeted, personalized voice/audio web content through wireless devices

Patent number: 7206745

Abstract: A wireless web system allows users to navigate web pages that include links to audio content where the pages are provided over a data connection and the audio content is provided over a voice connection. An audio content reference generator generates a reference to a portion of static audio content and that audio content reference is provided to the user's wireless web client as a link on a wireless web page, or other page retrieved by the wireless web device over the data connection. The audio content reference and a telephone number of an audio server form the link on the page, so that when a user selects that link, the wireless device establishes a voice connection to the audio server using the telephone number and then provides the audio server with the audio content reference so that the user hears the specifically referenced audio content over the voice channel.

Type: Grant

Filed: March 8, 2004

Date of Patent: April 17, 2007

Assignee: Yahoo! Inc.

Inventors: Ramesh R. Sarukkai, Anurag Mendhekar
Multimodal quality assessment

Patent number: 7197452

Abstract: The quality of audiovisual material is assessed by measuring the audio an video quality and computing from these a combined measure. Using a parameter indicative of the degree of motion represented by the video, the computation employs one of a plurality of algorithms selected in dependence on the value of the parameter.

Type: Grant

Filed: March 8, 2002

Date of Patent: March 27, 2007

Assignee: British Telecommunications public limited company

Inventor: David S Hands
Device setter, device setting system, and recorded medium where device setting program recorded

Patent number: 7181394

Abstract: A word-device setting information storage part stores multiple device setting information in association with a single voice and a voice recognition part recognizes a voice input through a voice input part so that the multiple device setting information associated with the recognized voice is read from the word-device setting information storage part to a device control part, which in turn sets an internal device state of a device setting apparatus while an external device control part sets a state of an external device in response to the read multiple device setting information.

Type: Grant

Filed: December 25, 2000

Date of Patent: February 20, 2007

Assignee: Matsushita Electric Industrial Co., Ltd.

Inventor: Noritaka Kusumoto
User interface for text to speech conversion

Patent number: 7174295

Abstract: An electronic device which includes a user interface having a display for displaying text and a speech synthesiser including a loudspeaker, arranged to convert an input, dependent upon a text, to an audio output representative of a person reading the text. The device may delay with respect to the audio output corresponding to a portion of text the display of the portion of text. The device may also or alternatively delay with respect to the audio output corresponding to a portion of text the highlighting of the portion of text within a displayed text.

Type: Grant

Filed: September 6, 2000

Date of Patent: February 6, 2007

Assignee: Nokia Corporation

Inventor: Mika Kivimaki
Noise suppression circuit for a wireless device

Patent number: 7174291

Abstract: An adaptive noise suppression system includes an input A/D converter, an analyzer, a filter, and a output D/A converter. The analyzer includes both feed-forward and feedback signal paths that allow it to compute a filtering coefficient, which is input to the filter. In these paths, feed-forward signal are processed by a signal to noise ratio estimator, a normalized coherence estimator, and a coherence mask. Also, feedback signals are processed by a auditory mask estimator. These two signal paths are coupled together via a noise suppression filter estimator. A method according to the present invention includes active signal processing to preserve speech-like signals and suppress incoherent noise signals. After a signal is processed in the feed-forward and feedback paths, the noise suppression filter estimator then outputs a filtering coefficient signal to the filter for filtering the noise out of the speech and noise digital signal.

Type: Grant

Filed: July 16, 2003

Date of Patent: February 6, 2007

Assignee: Research In Motion Limited

Inventors: Dean McArthur, Jim Reilly
Multimode speech coding apparatus and decoding apparatus

Patent number: 7167828

Abstract: Square sum calculator 603 calculates a square sum of evolution in smoothed quantized LSP parameter for each order. A first dynamic parameter is thereby obtained. Square sum calculator 605 calculates a square sum using a square value of each order. The square sum is a second dynamic parameter. Maximum value calculator 606 selects a maximum value from among square values for each order. The maximum value is a third dynamic parameter. The first to third dynamic parameters are output to mode determiner 607, which determines a speech mode by judging the parameters with respective thresholds to output mode information.

Type: Grant

Filed: January 10, 2001

Date of Patent: January 23, 2007

Assignee: Matsushita Electric Industrial Co., Ltd.

Inventor: Hiroyuki Ehara
Noise spectrum subtraction method and system

Patent number: 7155387

Abstract: A method for reducing noise in a voice signal, and a voice operated system utilizing the same are presented. A noise component in a compressed digital signal representative of the voice signal is determined, and subtracted from the compressed digital signal.

Type: Grant

Filed: January 8, 2001

Date of Patent: December 26, 2006

Assignee: Art - Advanced Recognition Technologies Ltd.

Inventor: Amir Globerson
Method, system and module for multi-modal data fusion

Patent number: 7152033

Abstract: A method for multi-modal data fusion (100), a multi-modal system data fusion system (10) and module (24) that in use operates by receiving segments (125) of multi-modal data associated respectively with a modalitiy. Initiating (130) a dynamically variable wait period after one of the segments is received is then performed. The dynamically variable wait period has a duration determined from data fusion timing statistics of the system (10). A waiting (140) for reception of any further segments during the dynamically variable wait period is then effected and thereafter a fusing (145) of the segments received provides fused data that is sent (160) to a dialog manager (25).

Type: Grant

Filed: November 12, 2002

Date of Patent: December 19, 2006

Assignee: Motorola, Inc.

Inventors: Anurag Kumar Gupta, Ying Catherine Cheng
Phonetically based speech recognition system and method

Patent number: 7146319

Abstract: A speech recognition method includes a step of receiving a phonetic sequence output by a phonetic recognizer. The method also includes a step of matching the phonetic sequence with one of a plurality of reference phoneme sequences stored in a reference list that matches closest thereto. At least one of the plurality of reference phoneme sequences stored in the reference list includes additional information with respect to a phonetic sequence that is capable of being output by the phonetic recognizer.

Type: Grant

Filed: March 31, 2003

Date of Patent: December 5, 2006

Assignee: Novauris Technologies Ltd.

Inventor: Melvyn J. Hunt
Time-scale modification of data-compressed audio information

Patent number: 7143047

Abstract: A data-compressed audio waveform is temporally modified without requiring complete decompression of the audio signal. Packets of compressed audio data are first unpacked, to remove scaling that was applied in the formation of the packets. The unpacked data is then temporally modified, using one of a number of different approaches. This modification takes place while the audio information remains in a data-compressed format. New packets are then assembled from the modified data, to produce a data-compressed output stream that can be subsequently processed in a conventional manner to reproduce the desired sound. The assembly of the new packets employs a technique for inferring an auditory model from the original packets, to requantize the data in the output packets.

Type: Grant

Filed: September 17, 2004

Date of Patent: November 28, 2006

Assignee: Vulcan Patents LLC

Inventors: Michele M. Covell, Malcolm Slaney, Arthur Rothstein
Spelling words using an arbitrary phonetic alphabet

Patent number: 7143037

Abstract: Words are spelled by receiving recognizable words from a user of an interactive voice response system. The first letter of each recognizable word is identified, and a spelling is determined based on the first letters of the recognizable words. Statistics for previous users of the interactive voice response system are determined, where the statistics indicate the number of times each of the recognizable words has been used to indicate a letter. The recognizable word that is most commonly used for each letter is identified. The user is prompted with at least two recognizable words that are most commonly used, where each recognizable word corresponds to a different letter. A selection of one of the recognizable words provided to the user is received.

Type: Grant

Filed: June 12, 2002

Date of Patent: November 28, 2006

Assignee: Cisco Technology, Inc.

Inventor: Kevin L. Chestnut

prev 1 2 3 4 5 6 7 8 … next