Patents Examined by Vijay Chawan

Method and apparatus to perform speech recognition over a voice channel

Patent number: 7139704

Abstract: A method and apparatus to perform speech recognition over a voice channel is described.

Type: Grant

Filed: November 30, 2001

Date of Patent: November 21, 2006

Assignee: Intel Corporation

Inventor: David L. Graumann
Wideband speech coding system and method

Patent number: 7136810

Abstract: A speech encoder/decoder for wideband speech with a partitioning of wideband into lowband and highband, convenient coding of the lowband, and LP excited by noise plus some periodicity for the highband. The embedded lowband may be extracted for a lower bit rate decoder.

Type: Grant

Filed: August 1, 2001

Date of Patent: November 14, 2006

Assignee: Texas Instruments Incorporated

Inventors: Erdal Paksoy, Alan V. McCree
Interactive multimedia book

Patent number: 7136819

Abstract: An interactive multimedia book provides hands-on multimedia instruction to the user in response to voiced commands. The book is implemented on a computer system and includes both text and audio/video clips. The interactive multimedia book is accessed by voiced commands and natural language queries as the primary user input. The displayed text is written in a markup language and contains hyperlinks which link the current topic with other related topics. The user may command the book to read the text and, as the text is read by the voice synthesizer, a word which is also a hyperlink will change its attributes upon being spoken. The user will be able to observe or hear this and simply utter the word which is the hyperlink to navigate to the linked topic.

Type: Grant

Filed: December 1, 2003

Date of Patent: November 14, 2006

Inventor: Charles Lamont Whitham
Systems and methods for providing users with information in audible form

Patent number: 7136804

Abstract: Methods for providing users with information in audible form are provided. One such method comprises determining content sources from which information is to be retrieved for responding to a request from a user, retrieving grammar fragments corresponding to the content sources determined, providing a menu format including information for providing the grammar fragments to the user in audible form, and aggregating the grammar fragments retrieved such that the grammar fragments can be provided to the user in audible form in conformance with the menu format. Systems and computer-readable media are also provided.

Type: Grant

Filed: October 30, 2002

Date of Patent: November 14, 2006

Assignee: Hewlett-Packard Development Company, L.P.

Inventors: Lionel Lavallee, Peter Petersen, Gregory Pavlik
Syntax-driven, operator assisted voice recognition system and methods

Patent number: 7136814

Abstract: Methods and apparatus are described for effecting a computer transaction using speech as a primary input. The speech is captured using a speech recognition program. A context associated with the captured speech is determined. Where the context has been determined, the computer transaction is built based on the context and at least a portion of the captured speech. A representation of the computer transaction is presented to a human operator for verification. The computer transaction is effected upon verification by the human operator.

Type: Grant

Filed: November 3, 2000

Date of Patent: November 14, 2006

Assignee: The Procter & Gamble Company

Inventor: Theodore Van Fossen McConnell
Dynamic semantic control of a speech recognition system

Patent number: 7127393

Abstract: A method and apparatus are provided for automatically recognizing words of spoken speech using a computer-based speech recognition system according to a dynamic semantic model. In an embodiment, the speech recognition system recognizes speech and generates one or more word strings, each of which is a hypothesis of the speech, and creates and stores a probability value or score for each of the word strings. The word strings are ordered by probability value. The speech recognition system also creates and stores, for each of the word strings, one or more keyword-value pairs that represent semantic elements and semantic values of the semantic elements for the speech that was spoken. One or more dynamic semantic rules are defined that specify how a probability value of a word string should be modified based on information about external conditions, facts, or the environment of the application in relation to the semantic values of that word string.

Type: Grant

Filed: February 10, 2003

Date of Patent: October 24, 2006

Assignee: Speech Works International, Inc.

Inventors: Michael S. Phillips, Etienne Barnard, Jean-Guy Dahan, Michael J. Metzger
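The rescoring idea in the abstract above (patent 7127393) can be illustrated with a minimal sketch. Everything named here is hypothetical and not taken from the patent: the `Hypothesis` class, the `caller_city_rule` rule, the multiplicative way of modifying scores, and the sample data are assumptions chosen only to show an n-best list being reordered by a rule that compares semantic values with external context.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class Hypothesis:
    """One word-string hypothesis with its score and keyword-value pairs."""
    words: str
    score: float
    semantics: Dict[str, str] = field(default_factory=dict)


# A rule maps (hypothesis, external context) to a multiplicative factor.
# Multiplying scores is an assumption; the abstract only says scores are modified.
Rule = Callable[[Hypothesis, Dict[str, str]], float]


def caller_city_rule(hyp: Hypothesis, context: Dict[str, str]) -> float:
    """Hypothetical rule: favor hypotheses whose city matches the caller's city."""
    if hyp.semantics.get("city") == context.get("caller_city"):
        return 2.0
    return 1.0


def rescore(hyps: List[Hypothesis], rules: List[Rule],
            context: Dict[str, str]) -> List[Hypothesis]:
    """Apply every rule to every hypothesis, then reorder by the new score."""
    rescored = []
    for hyp in hyps:
        factor = 1.0
        for rule in rules:
            factor *= rule(hyp, context)
        rescored.append(Hypothesis(hyp.words, hyp.score * factor, hyp.semantics))
    return sorted(rescored, key=lambda h: h.score, reverse=True)


n_best = [
    Hypothesis("fly to austin", 0.40, {"city": "Austin"}),
    Hypothesis("fly to boston", 0.35, {"city": "Boston"}),
]
ranked = rescore(n_best, [caller_city_rule], {"caller_city": "Boston"})
```

In this example the acoustically weaker hypothesis moves to the top of the list because its semantic value agrees with the external context.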
Frame erasure concealment technique for a bitstream-based feature extractor

Patent number: 7110947

Abstract: A frame erasure concealment technique for a bitstream-based feature extractor in a speech recognition system particularly suited for use in a wireless communication system operates to “delete” each frame in which an erasure is declared. The deletions thus reduce the length of the observation sequence, but have been found to provide for sufficient speech recognition based on both single word and “string” tests of the deletion technique.

Type: Grant

Filed: December 5, 2000

Date of Patent: September 19, 2006

Assignee: AT&T Corp.

Inventors: Richard Vandervoort Cox, Hong Kook Kim
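The deletion technique in the abstract above (patent 7110947) amounts to dropping flagged frames before recognition. The sketch below assumes the erasure flags are already available per frame; the function name `delete_erased_frames` and the toy feature vectors are hypothetical.

```python
from typing import List, Sequence


def delete_erased_frames(frames: Sequence[List[float]],
                         erased: Sequence[bool]) -> List[List[float]]:
    """Return the observation sequence with every erased frame removed."""
    if len(frames) != len(erased):
        raise ValueError("need exactly one erasure flag per frame")
    return [frame for frame, is_erased in zip(frames, erased) if not is_erased]


# Toy feature vectors; the second frame was declared erased.
features = [[0.1, 0.2], [0.0, 0.0], [0.3, 0.4], [0.5, 0.6]]
erasure_flags = [False, True, False, False]
observations = delete_erased_frames(features, erasure_flags)
```

The recognizer then sees a shorter observation sequence, which is the trade-off the abstract describes.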
Method and apparatus for aligning ambiguity in finite state transducers

Patent number: 7107205

Abstract: A method prepares a functional finite-state transducer (FST) with an epsilon or empty string on the input side for factorization into a bimachine. The method creates a left-deterministic input finite-state automaton (FSA) by extracting and left-determinizing the input side of the functional FST. Subsequently, the corresponding sub-paths in the FST are identified for each arc in the left-deterministic FST and aligned.

Type: Grant

Filed: December 18, 2000

Date of Patent: September 12, 2006

Assignee: Xerox Corporation

Inventor: Andre Kempe
Bitstream data reduction coding by applying prediction

Patent number: 7107212

Abstract: A data processing apparatus for data processing an audio signal includes an input terminal (1) for receiving the audio signal, a 1-bit A/D converter (4) for A/D converting the audio signal to form a bitstream signal, a prediction unit (10) for carrying out a prediction step on the bitstream signal to form a predicted bitstream signal, a signal combination unit (42) for combining the bitstream signal and the predicted bitstream signal to form a residual bitstream signal, and an output terminal (14) for supplying the residual bitstream signal.

Type: Grant

Filed: November 25, 2002

Date of Patent: September 12, 2006

Assignee: Koninklijke Philips Electronics N.V.

Inventors: Renatus J. Van Der Vleuten, Alphons A. M. L. Bruekers, Arnoldus W. J. Oomen
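The prediction-and-residual structure in the abstract above (patent 7107212) can be sketched as follows. Two points are assumptions and not stated in the abstract: the predictor simply repeats the previous bit, and the signal combination is an XOR. They are chosen only because they make the round trip easy to verify.

```python
from typing import List


def predict(history: List[int]) -> int:
    """Hypothetical predictor: guess that the next bit repeats the last one."""
    return history[-1] if history else 0


def encode_residual(bits: List[int]) -> List[int]:
    """Combine each bit with its prediction (XOR assumed) to form the residual."""
    residual, history = [], []
    for bit in bits:
        residual.append(bit ^ predict(history))
        history.append(bit)
    return residual


def decode_residual(residual: List[int]) -> List[int]:
    """Invert the encoder by running the same predictor on reconstructed bits."""
    history: List[int] = []
    for r in residual:
        history.append(r ^ predict(history))
    return history


bitstream = [1, 1, 1, 0, 0, 0, 0, 1]
residual = encode_residual(bitstream)
restored = decode_residual(residual)
```

When the predictor is right most of the time, the residual is mostly zeros, which is what makes it easier to compress than the original bitstream.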
Computer-aided writing system and method with cross-language writing wizard

Patent number: 7107204

Abstract: A computer-aided writing system offers assistance to a user writing in a non-native language, as the user needs help, without requiring the user to divert attention away from the entry task. The writing system provides a user interface (UI) that integrates writing assistance with in-line text entry. When the user is unsure of a word's spelling or whether the word is appropriate, the user may enter a corresponding native word directly in line with the ongoing sentence. An error tolerant spelling tool accepts the native word (even if it is misspelled or mistyped) and derives the most probable non-native word for the given context. The spelling tool consults a bilingual dictionary to determine possible non-native word translation candidates, a non-native language model (e.g.

Type: Grant

Filed: April 24, 2000

Date of Patent: September 12, 2006

Assignee: Microsoft Corporation

Inventors: Ting Liu, Ming Zhou, Jian Wang
Speech communication apparatus with gain control for clear communication

Patent number: 7107209

Abstract: A speech communication apparatus is used with a microphone fixed at a predetermined position in the vicinity of the mouth and prevents the transmission of uncomfortable noise, such as sneezing, coughing or throat-clearing noise, to a partner. The apparatus includes a speech communication microphone, a speaker and a communication unit for amplifying an output signal from the speech communication microphone. The communication unit has an amplifier for amplifying an input signal and outputting the amplified signal, and a controller for controlling the gain of the amplifier in response to an excessive input signal. When an excessive input signal is detected, the controller controls the gain of the amplifier such that the reproduced sound of the excessive input signal is reduced to a predetermined level only for a predetermined period of time.

Type: Grant

Filed: November 9, 2001

Date of Patent: September 12, 2006

Assignee: Honda Giken Kogyo Kabushiki Kaisha

Inventors: Hajime Tabata, Yukio Miyamaru
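The gain control behavior in the abstract above (patent 7107209) can be approximated by a small sample-by-sample loop. This is a sketch under stated assumptions: the threshold, the reduced gain, and the hold length are hypothetical values, and a fixed reduced gain stands in for the "predetermined level" the abstract mentions.

```python
from typing import List


def limit_excessive_input(samples: List[float],
                          threshold: float = 0.8,
                          reduced_gain: float = 0.25,
                          hold_samples: int = 3,
                          normal_gain: float = 1.0) -> List[float]:
    """Lower the gain for a fixed number of samples after an excessive input."""
    output = []
    hold = 0  # samples remaining in the reduced-gain period
    for x in samples:
        if abs(x) > threshold:
            hold = hold_samples  # (re)start the reduced-gain period
        gain = reduced_gain if hold > 0 else normal_gain
        output.append(x * gain)
        if hold > 0:
            hold -= 1
    return output


# A loud burst (0.9) triggers three samples of reduced gain, then gain recovers.
attenuated = limit_excessive_input([0.1, 0.9, 0.5, 0.5, 0.5, 0.5])
```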
Dual mode radio mobile terminal in which an analog or digital mode is determined by request of a voice function

Patent number: 7107208

Abstract: A method is disclosed for operating a voice function of a dual-mode mobile communication apparatus, including a speaker's voice recognition function and a voice output function of stored information, while the mobile communication apparatus is operating in an analog mode. The method comprises the steps of determining whether a voice function request signal is input, switching a vocoder into a digital mode for operating the voice function, and operating the voice function in digital mode.

Type: Grant

Filed: May 31, 2001

Date of Patent: September 12, 2006

Assignee: Samsung Electronics Co., Ltd.

Inventors: Hee-Sun Cho, Kyung-Ha Lee, Sung-Bok Park
Communication apparatus

Patent number: 7107219

Abstract: A terminal is designed so that a user who employs numerical keys, allocated for the entry of dots, can enter Braille dot combinations that are used for the input of characters. The input characters may be output as speech for feedback. Further, when a terminal accesses a server, Braille dot combinations can be entered in the above described manner, and speech can be fed back from the server. Furthermore, the server can provide a service for the user in accordance with a character string input at the terminal.

Type: Grant

Filed: October 30, 2001

Date of Patent: September 12, 2006

Assignee: International Business Machines Corporation

Inventor: Kazuo Nemoto
Microphone array signal enhancement using mixture models

Patent number: 7103541

Abstract: A system and method facilitating signal enhancement utilizing mixture models is provided. The invention includes a signal enhancement adaptive system having a speech model, a noise model and a plurality of adaptive filter parameters. The signal enhancement adaptive system employs probabilistic modeling to perform signal enhancement of a plurality of windowed frequency transformed input signals received, for example, from an array of microphones. The signal enhancement adaptive system incorporates information about the statistical structure of speech signals. The signal enhancement adaptive system can be embedded in an overall enhancement system which also includes components of signal windowing and frequency transformation.

Type: Grant

Filed: June 27, 2002

Date of Patent: September 5, 2006

Assignee: Microsoft Corporation

Inventors: Hagai Attias, Li Deng
Method of using speech recognition to initiate a wireless application protocol (WAP) session

Patent number: 7103550

Abstract: A method of initiating a data session between a wireless device and an information network over a wireless communication link includes establishing a voice session between the wireless device and a voice recognition application over the wireless communication link. A speech request is conveyed to the voice recognition application for information to be obtained from the information network. The voice recognition application converts the speech request into search criteria. A search of the information network is then initiated using the search criteria and the voice session is terminated. A data session is then established with the wireless device and the search results are pushed to the wireless device over the wireless communication link.

Type: Grant

Filed: June 29, 2001

Date of Patent: September 5, 2006

Assignee: Mitel Networks Corporation

Inventors: Warren Gallagher, Lisa Fast, Shawn Griffin
Method and apparatus for producing a waveform corresponding to a style of rendition using a packet stream

Patent number: 7099827

Abstract: A packet stream is generated by combining a plurality of packets corresponding to style-of-rendition identification information which are selected from among a number of packets usable for producing waveforms corresponding to various styles of rendition. Then, a waveform having characteristics of the style of rendition indicated by the style-of-rendition identification information is produced on the basis of the generated packet stream. The packet stream includes a plurality of packets and time information of the individual packets and controls the pitch, amplitude and shape of the waveform to be produced. By thus combining packets corresponding to the style-of-rendition identification information and producing a waveform on the basis of the packet stream, a waveform corresponding to a desired style of rendition can be provided in a simplified manner.

Type: Grant

Filed: September 22, 2000

Date of Patent: August 29, 2006

Assignee: Yamaha Corporation

Inventor: Motoichi Tamura
Device and method for analyzing and representing sound signals in the musical notation

Patent number: 7096186

Abstract: A sound signal is received which contains sound characteristics to be represented in musical notation. The characteristics, such as a volume level of the sound signal, are extracted from the received sound signal, and various parameters for use in subsequent analysis of the sound signal are set in accordance with the extracted characteristics. Also, a desired scale determining condition is set by a user. The pitch of the sound signal is determined using the thus-set parameters. The determined pitch is rounded to any one of the scale notes corresponding to the user-set scale determining condition. Also, a given unit note length is set as a predetermined criterion or reference for determining a note length, and a length of the scale note determined from the received sound signal is determined using the thus-set unit note length as a minimum determination unit, i.e., with an accuracy of the unit note length.

Type: Grant

Filed: August 10, 1999

Date of Patent: August 22, 2006

Assignee: Yamaha Corporation

Inventor: Tomoyuki Funaki
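The two rounding steps in the abstract above (patent 7096186), pitch to the nearest note of a chosen scale and duration to a multiple of a unit note length, can be sketched as below. The MIDI note numbering, the 440 Hz reference for A4, the C major scale, and the function names are assumptions for illustration and are not taken from the patent.

```python
import math
from typing import Set

A4_HZ = 440.0                       # assumed tuning reference
C_MAJOR = {0, 2, 4, 5, 7, 9, 11}    # pitch classes of the assumed scale


def freq_to_midi(freq_hz: float) -> float:
    """Convert a frequency to a fractional MIDI note number (A4 = 69)."""
    return 69 + 12 * math.log2(freq_hz / A4_HZ)


def round_to_scale(freq_hz: float, scale: Set[int] = C_MAJOR) -> int:
    """Round a detected pitch to the nearest note whose pitch class is in the scale."""
    midi = freq_to_midi(freq_hz)
    center = int(midi)
    candidates = [n for n in range(center - 12, center + 13) if n % 12 in scale]
    return min(candidates, key=lambda n: abs(n - midi))


def quantize_length(duration_s: float, unit_s: float) -> int:
    """Express a note length as a whole number of unit note lengths (at least 1)."""
    return max(1, round(duration_s / unit_s))


note = round_to_scale(455.0)          # between A4 and A#4; A#4 is not in C major
units = quantize_length(0.55, 0.25)   # 0.55 s measured against a 0.25 s unit
```

Rounding to plain semitones would map 455 Hz to A#4 (MIDI 70); constraining the result to the scale yields A4 (MIDI 69) instead.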
Noise suppression in beam-steered microphone array

Patent number: 7092882

Abstract: A system for suppressing unwanted signals in steerable microphone arrays. The lobes of a steerable microphone array are monitored, to identify lobes having large speech content and low noise content. One of the identified lobes is then used to deliver speech to a speech recognition system, as at a self-service kiosk.

Type: Grant

Filed: December 6, 2000

Date of Patent: August 15, 2006

Assignee: NCR Corporation

Inventors: Jon A. Arrowood, Michael S. Miller
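The lobe selection described in the abstract above (patent 7092882) can be sketched as a filter followed by a choice. The `Lobe` class, the normalized speech and noise measures, the thresholds, and the tie-breaking by speech minus noise are all hypothetical; the abstract only says that lobes with large speech content and low noise content are identified and one of them is used.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Lobe:
    angle_deg: float
    speech: float   # assumed normalized speech-content estimate, 0..1
    noise: float    # assumed normalized noise-content estimate, 0..1


def select_lobe(lobes: List[Lobe], min_speech: float = 0.5,
                max_noise: float = 0.3) -> Optional[Lobe]:
    """Keep lobes with high speech and low noise, then pick the best margin."""
    identified = [l for l in lobes
                  if l.speech >= min_speech and l.noise <= max_noise]
    if not identified:
        return None
    return max(identified, key=lambda l: l.speech - l.noise)


lobes = [
    Lobe(-30.0, speech=0.9, noise=0.6),   # loud but noisy, rejected
    Lobe(0.0, speech=0.8, noise=0.1),     # identified, best margin
    Lobe(30.0, speech=0.6, noise=0.2),    # identified
]
chosen = select_lobe(lobes)
```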
Enhancing the intelligibility of received speech in a noisy environment

Patent number: 7089181

Abstract: A device receives a signal that includes human-interpretable audio information. The device automatically adjusts the volume of the audio information at the receiving end. The volume control is determined by an automatic volume control gain, which is calculated as a function of the automatic gain control gain, a weighted dynamic range compression gain, and a weighted constant gain.

Type: Grant

Filed: January 30, 2002

Date of Patent: August 8, 2006

Assignee: Intel Corporation

Inventor: Adoram Erell
Method for task classification using morphemes

Patent number: 7085720

Abstract: The invention concerns a method of task classification using morphemes which operates on the task objective of a user. The morphemes may be generated by clustering selected ones of the salient sub-morphemes selected from training speech which are semantically and syntactically similar. The method may include detecting morphemes present in the user's input communication, and making task-type classification decisions based on the detected morphemes in the user's input communication. The morphemes may be verbal and/or non-verbal.

Type: Grant

Filed: October 18, 2000

Date of Patent: August 1, 2006

Assignee: AT&T Corp.

Inventors: Allen Louis Gorin, Dijana Petrovska-Delacretaz, Giuseppe Riccardi, Jeremy Huntley Wright
