Patents Examined by Daniel Nolan

Smart toys

Patent number: 6631351

Abstract: Talking toys perform simulated conversations with one another. The toys each include a forest of decision graphs. The forest of decision graphs is the same for each toy. Each of the decision graphs corresponds to a conversation and includes a number of nodes, each of which corresponds to a portion of the conversation. The nodes also include one or more contexts which connect the nodes to children nodes. As a result, the selection of the context directs the progression of conversation. The toys select a decision graph/conversation that includes all or most of the toys as participants. The conversation is then performed as the toys traverse the selected decision graph. The toys transfer messages back and forth via a wireless transmission and reception arrangement as they traverse the decision graph. The toys play the portions of the conversation through a speaker.

Type: Grant

Filed: September 14, 2000

Date of Patent: October 7, 2003

Assignee: Aidentity Matrix

Inventors: Surya Ramachandran, Anand Kancherlapalli, Michael C. Fu
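
The abstract above describes a shared forest of decision graphs, selection of a conversation that covers the toys present, and traversal driven by contexts. A minimal sketch of that idea follows. The data layout, the toy names, and the functions `select_conversation` and `traverse` are illustrative assumptions, not taken from the patent, and the wireless messaging between toys is omitted.

```python
# Hypothetical forest: every toy is assumed to hold an identical copy.
# Each node has a speaker, a line to play, and contexts mapping to child nodes.
FOREST = {
    "weather": {
        "participants": {"bear", "robot"},
        "start": "w1",
        "nodes": {
            "w1": {"speaker": "bear", "line": "Nice day, isn't it?",
                   "contexts": {"agree": "w2", "disagree": "w3"}},
            "w2": {"speaker": "robot", "line": "Sunny indeed.", "contexts": {}},
            "w3": {"speaker": "robot", "line": "It looks like rain.", "contexts": {}},
        },
    },
    "picnic": {
        "participants": {"bear", "robot", "doll"},
        "start": "p1",
        "nodes": {
            "p1": {"speaker": "doll", "line": "Who wants a picnic?",
                   "contexts": {"yes": "p2"}},
            "p2": {"speaker": "bear", "line": "I do!", "contexts": {}},
        },
    },
}


def select_conversation(forest, present_toys):
    """Pick the playable graph that involves the most of the toys present."""
    present = set(present_toys)
    # A graph is playable only if all of its participants are present.
    playable = [name for name, graph in forest.items()
                if graph["participants"] <= present]
    if not playable:
        return None
    return max(playable, key=lambda name: len(forest[name]["participants"]))


def traverse(graph, choose_context):
    """Walk the graph from its start node; the chosen context picks the child."""
    transcript = []
    node_id = graph["start"]
    while node_id is not None:
        node = graph["nodes"][node_id]
        transcript.append((node["speaker"], node["line"]))
        contexts = node["contexts"]
        # A node with no contexts ends the conversation.
        node_id = contexts[choose_context(sorted(contexts))] if contexts else None
    return transcript
```

For example, `traverse(FOREST["weather"], lambda names: names[0])` always follows the alphabetically first context; a real toy would presumably choose a context by some other rule, which the abstract does not specify.
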
Intelligent query engine for processing voice based queries

Patent number: 6615172

Abstract: An intelligent query system for processing voice-based queries is disclosed. This distributed client-server system, typically implemented on an intranet or over the Internet, accepts a user's queries at his/her computer, PDA or workstation using a speech input interface. After converting the user's query from speech to text, a 2-step algorithm employing a natural language engine, a database processor and a full-text SQL database is implemented to find a single answer that best matches the user's query. The system, as implemented, accepts environmental variables selected by the user and is scalable to provide answers to a variety and quantity of user-initiated queries.

Type: Grant

Filed: November 12, 1999

Date of Patent: September 2, 2003

Assignee: Phoenix Solutions, Inc.

Inventors: Ian M. Bennett, Bandi Ramesh Babu, Kishor Morkhandikar, Pallaki Gururaj
Real time audio transmission system supporting asynchronous input from a text-to-speech (TTS) engine

Patent number: 6615173

Abstract: A system for real time transmission of speech audio received from a text-to-speech (TTS) engine can include a TTS engine and a real time speech audio producer for receiving speech audio from the TTS engine over a network, and for producing formatted audio packets for transmission over the network according to the transmission interval. The transmission interval can be determined according to a packetization delay parameter.

Type: Grant

Filed: August 28, 2000

Date of Patent: September 2, 2003

Assignee: International Business Machines Corporation

Inventor: Joseph Celi, Jr.
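
The abstract above ties the transmission interval to a packetization delay parameter while the TTS engine delivers audio asynchronously. One way to picture this is a buffer that accepts chunks of any size and releases fixed-size packets, one per interval. The sketch below is an assumption-laden illustration: the class name `RealTimePacketizer`, the raw PCM byte format, and the sizing formula are not taken from the patent, and packet headers and network sending are left out.

```python
class RealTimePacketizer:
    """Buffer asynchronous audio chunks and emit fixed-size packets."""

    def __init__(self, sample_rate_hz, bytes_per_sample, packetization_delay_ms):
        # Bytes of audio that cover one transmission interval.
        self.packet_bytes = (sample_rate_hz * bytes_per_sample
                             * packetization_delay_ms) // 1000
        if self.packet_bytes <= 0:
            raise ValueError("packetization delay too small for this audio format")
        # One packet would be sent every interval_s seconds.
        self.interval_s = packetization_delay_ms / 1000
        self._buffer = bytearray()

    def feed(self, chunk):
        """Add a chunk from the TTS engine; return any packets now complete."""
        self._buffer.extend(chunk)
        packets = []
        while len(self._buffer) >= self.packet_bytes:
            packets.append(bytes(self._buffer[:self.packet_bytes]))
            del self._buffer[:self.packet_bytes]
        return packets

    def flush(self):
        """Return the remaining partial packet, if any, at end of speech."""
        tail = bytes(self._buffer)
        self._buffer.clear()
        return [tail] if tail else []
```

With 8 kHz, one byte per sample, and a 20 ms delay, each packet carries 160 bytes, so a sender would transmit one 160-byte payload every 20 ms regardless of how the TTS engine chunks its output.
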
HMM-based echo model for noise cancellation avoiding the problem of false triggers

Patent number: 6606595

Abstract: An automatic speech recognition system addresses the condition in which an incoming caller's speech is quiet and a resulting echo (of a loud playing prompt) can cause the residual (the portion of the echo remaining even after echo cancellation) to be of the magnitude of the incoming speech input. Such loud echoes can falsely trigger the speech recognition system and interfere with the recognition of valid input speech. An echo model has been proven to alleviate this fairly common problem and to be effective in eliminating such false triggering. Further, this enhanced recognition of valid speech is provided within an existing hidden Markov modeling framework.

Type: Grant

Filed: August 31, 2000

Date of Patent: August 12, 2003

Assignee: Lucent Technologies Inc.

Inventors: Rathinavelu Chengalvarayan, Richard Harry Ketchum, Anand Rangaswamy Setlur, David Lynn Thomson
System and method for the creation and automatic deployment of personalized, dynamic and interactive voice services, including deployment through digital sound files

Patent number: 6606596

Abstract: A system and method for the creation and automatic deployment of personalized, dynamic and interactive voice services, including information derived from on-line analytical processing (OLAP) systems and other data repositories is disclosed. In particular, the system and method include the ability to deploy voice services through a digital sound file. The system and method access personalized information and generate personalized markup documents from the personalized information. The personalized markup document is used to generate a sound file that is made available to a subscriber of the voice service, for example, through an e-mail or by posting to a web site.

Type: Grant

Filed: December 7, 1999

Date of Patent: August 12, 2003

Assignee: Microstrategy, Incorporated

Inventors: Michael Zirngibl, Anurag Patnaik, Bodo Maass, Hannes Eberle
Speech recognition method for activating a hyperlink of an internet page

Patent number: 6604076

Abstract: A speech recognition method is disclosed for activating a hyperlink of an Internet page. In particular the method comprises determining hypertexts of the hyperlinks in text information, determining corresponding first phoneme sequences of hypertexts, receiving a spoken command from a user, determining a second phoneme sequence corresponding to the spoken command, determining the hyperlink selected by the user using the first and second phoneme sequences, activating the selected hyperlink, where one quality value is determined for each hypertext when first phoneme sequences are determined and where an extra hypertext is determined when the quality value of a hypertext is below a threshold and is assigned to the hypertext of the Internet page or in lieu of the hypertext, and where a first phoneme sequence determined for the extra hypertext has a quality value that exceeds the threshold.

Type: Grant

Filed: November 9, 2000

Date of Patent: August 5, 2003

Assignee: Koninklijke Philips Electronics N.V.

Inventors: Martin Holley, Dieter Kubesch
Voice recognition apparatus

Patent number: 6604073

Abstract: Disclosed is a voice recognition apparatus which can prevent an erroneous manipulation due to erroneous voice recognition from being carried out even in a noisy environment. As long as a duration of utterance acquired based on the level of a voice signal uttered by an operator (user) approximately coincides with a duration of utterance acquired based on mouth image data acquired by capturing the mouth of the operator, the voice recognition apparatus outputs vocal-manipulation phrase data as the result of voice recognition.

Type: Grant

Filed: September 12, 2001

Date of Patent: August 5, 2003

Assignee: Pioneer Corporation

Inventor: Shoutarou Yoda
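
The abstract above accepts a recognition result only when the utterance duration measured from the voice signal level approximately matches the duration measured from mouth images. A minimal sketch of that check follows. The function names, the level threshold, the frame length, and the 0.2-second tolerance are assumptions chosen for illustration; the abstract does not give numeric values.

```python
def duration_from_levels(levels, threshold, frame_s):
    """Estimate utterance duration from per-frame signal levels.

    The duration spans the first through the last frame at or above threshold.
    """
    active = [i for i, level in enumerate(levels) if level >= threshold]
    if not active:
        return 0.0
    return (active[-1] - active[0] + 1) * frame_s


def accept_phrase(phrase, audio_duration_s, mouth_duration_s, tolerance_s=0.2):
    """Return the recognized phrase only if the two durations roughly coincide."""
    if abs(audio_duration_s - mouth_duration_s) <= tolerance_s:
        return phrase
    # Durations disagree: treat the recognition as noise-induced and reject it.
    return None
```

In a noisy car, for instance, background noise might stretch the audio-based duration well beyond the time the operator's mouth was moving, and `accept_phrase` would then return `None` instead of triggering a command.
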
System of encoding and decoding speech signals

Patent number: 6604070

Abstract: A speech compression system capable of encoding a speech signal into a bitstream for subsequent decoding to generate synthesized speech is disclosed. The speech compression system optimizes the bandwidth consumed by the bitstream by balancing the desired average bit rate with the perceptual quality of the reconstructed speech. The speech compression system comprises a full-rate codec, a half-rate codec, a quarter-rate codec and an eighth-rate codec. The codecs are selectively activated based on a rate selection. In addition, the full and half-rate codecs are selectively activated based on a type classification. Each codec is selectively activated to encode and decode the speech signals at different bit rates emphasizing different aspects of the speech signal to enhance overall quality of the synthesized speech.

Type: Grant

Filed: September 15, 2000

Date of Patent: August 5, 2003

Assignee: Conexant Systems, Inc.

Inventors: Yang Gao, Adil Benyassine, Jes Thyssen, Eyal Shlomot, Huan-yu Su
Evaluation method, apparatus, and recording medium using optimum template pattern determination method, apparatus and optimum template pattern

Patent number: 6598019

Abstract: To improve the precision in correction of an input sentence by using a template pattern for model sentence. A plurality of template patterns for the model sentence are provided beforehand. Each of the template patterns is regarded as a plurality of templates of words/phrases based on expertise of language teachers with scores assigned to the words according to their importance. The scores and subsequently the input sentence are read and analyzed in comparison with each of the template patterns and the total of scores of matching words is calculated. A template pattern having the highest total score is selected as an optimum template pattern and the input sentence is corrected using the optimum template pattern. This method improves the likelihood that a template pattern containing a larger number of important words is selected as the optimum template pattern.

Type: Grant

Filed: June 20, 2000

Date of Patent: July 22, 2003

Assignee: Sunflare Co., Ltd.

Inventors: Naoyuki Tokuda, Hiroyuki Sasai
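
The abstract above selects an optimum template pattern by summing the scores of the template words that match the input sentence and taking the highest total. A minimal sketch of that selection step follows. The template contents, the weights, and the names `total_score` and `best_template` are invented for illustration, and plain whitespace tokenization stands in for whatever analysis the patent actually performs.

```python
# Hypothetical template patterns: word -> importance score.
TEMPLATES = {
    "past": {"yesterday": 3, "went": 3, "school": 2},
    "future": {"tomorrow": 3, "will": 3, "go": 2, "school": 2},
}


def total_score(template, words):
    """Sum the scores of the template words that occur in the input."""
    return sum(score for word, score in template.items() if word in words)


def best_template(templates, sentence):
    """Return the name of the template pattern with the highest total score."""
    words = set(sentence.lower().split())
    return max(templates, key=lambda name: total_score(templates[name], words))
```

For the input "I will go to school tomorrow", the "future" template matches four weighted words while "past" matches only "school", so `best_template` returns `"future"`. Correcting the sentence against the chosen template is a separate step not shown here.
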
Voice-activated control for electrical device

Patent number: 6594630

Abstract: An apparatus for voice-activated control of an electrical device comprises a receiving arrangement for receiving audio data generated by a user. A voice recognition arrangement is provided for determining whether the received audio data is a command word for controlling the electrical device. The voice recognition arrangement includes a microprocessor for comparing the received audio data with voice recognition data previously stored in the voice recognition arrangement. The voice recognition arrangement generates at least one control signal based on the comparison when the comparison reaches a predetermined threshold value. A power control controls power delivered to the electrical device. The power control is responsive to at least one control signal generated by the voice recognition arrangement for operating the electrical device in response to the at least one audio command generated by the user.

Type: Grant

Filed: November 19, 1999

Date of Patent: July 15, 2003

Assignee: Voice Signal Technologies, Inc.

Inventors: Igor Zlokarnik, Daniel Lawrence Roth
Method and apparatus for coding successive pitch periods in speech signal

Patent number: 6584437

Abstract: A method and apparatus for coding successive pitch periods of a speech signal. Based on a priori knowledge of statistical properties of successive speech periods, a shaped lattice structure is designed to cover the most probable points in the pitch space. The codebook index search starts with finding an open-loop estimate in the pitch space considering all dimensions and refining the open-loop estimate in a closed-loop search separately in each dimension based on the shaped lattice structure. The closed-loop search for the first subframe is for obtaining an absolute pitch period or a delta pitch while the closed-loop search for each of the other subframes is for obtaining a delta pitch for the respective subframe.

Type: Grant

Filed: June 11, 2001

Date of Patent: June 24, 2003

Assignee: Nokia Mobile Phones Ltd.

Inventors: Ari Heikkinen, Vesa T. Ruoppila, Samuli Pietilä
Bitstream protocol for transmission of encoded voice signals

Patent number: 6581032

Abstract: A speech compression system capable of encoding a speech signal into a bitstream for subsequent decoding to generate synthesized speech is disclosed. The speech compression system optimizes the bandwidth consumed by the bitstream by balancing the desired average bit rate with the perceptual quality of the reconstructed speech. The speech compression system comprises a full-rate codec, a half-rate codec, a quarter-rate codec and an eighth-rate codec. The codecs are selectively activated based on a rate selection. In addition, the full and half-rate codecs are selectively activated based on a type classification. Each codec is selectively activated to encode and decode the speech signals at different bit rates emphasizing different aspects of the speech signal to enhance overall quality of the synthesized speech.

Type: Grant

Filed: September 15, 2000

Date of Patent: June 17, 2003

Assignee: Conexant Systems, Inc.

Inventors: Yang Gao, Adil Benyassine, Jes Thyssen, Eyal Shlomot, Huan-yu Su
Browser-based arrangement for developing voice enabled web applications using extensible markup language documents

Patent number: 6578000

Abstract: A unified web-based voice messaging system provides voice application control between a web browser and an application server via a hypertext transport protocol (HTTP) connection on an Internet Protocol (IP) network. The application server executes the voice-enabled web application by runtime execution of a first set of extensible markup language (XML) documents that define the voice-enabled web application to be executed. The application server generates an HTML form specifying selected application parameters from an XML document executable by the voice application. The HTML form is supplied to a browser, enabling a user of the browser to input or modify application parameters for the corresponding XML document into the form. The application server inserts the received input application parameters into the XML document, and stores the document.

Type: Grant

Filed: April 28, 2000

Date of Patent: June 10, 2003

Assignee: Cisco Technology, Inc.

Inventors: Lewis Dean Dodrill, Satish Joshi, Ryan Alan Danner, Susan Harrow Barban, Steven J. Martin
Codebook tables for encoding and decoding

Patent number: 6574593

Abstract: A speech compression system capable of encoding a speech signal into a bitstream for subsequent decoding to generate synthesized speech is disclosed. The speech compression system optimizes the bandwidth consumed by the bitstream by balancing the desired average bit rate with the perceptual quality of the reconstructed speech. The speech compression system comprises a full-rate codec, a half-rate codec, a quarter-rate codec and an eighth-rate codec. The codecs are selectively activated based on a rate selection. In addition, the full and half-rate codecs are selectively activated based on a type classification. Each codec is selectively activated to encode and decode the speech signals at different bit rates emphasizing different aspects of the speech signal to enhance overall quality of the synthesized speech.

Type: Grant

Filed: September 15, 2000

Date of Patent: June 3, 2003

Assignee: Conexant Systems, Inc.

Inventors: Yang Gao, Adil Benyassine, Huan-yu Su, Eyal Shlomot, Jes Thyssen
Context-dependent acoustic models for medium and large vocabulary speech recognition with eigenvoice training

Patent number: 6571208

Abstract: A reduced dimensionality eigenvoice analytical technique is used during training to develop context-dependent acoustic models for allophones. The eigenvoice technique is also used during run time upon the speech of a new speaker. The technique removes individual speaker idiosyncrasies, to produce more universally applicable and robust allophone models. In one embodiment the eigenvoice technique is used to identify the centroid of each speaker, which may then be “subtracted out” of the recognition equation. In another embodiment maximum likelihood estimation techniques are used to develop common decision tree frameworks that may be shared across all speakers when constructing the eigenvoice representation of speaker space.

Type: Grant

Filed: November 29, 1999

Date of Patent: May 27, 2003

Assignee: Matsushita Electric Industrial Co., Ltd.

Inventors: Roland Kuhn, Jean-Claude Junqua, Matteo Contolini
Portable information terminal, method of processing audio data, recording medium, and program

Patent number: 6567782

Abstract: An expansion processor has a buffer defining unit for defining one of two buffers as a present inverse quantization buffer and defining one of two buffers as a present restoration buffer, an inverse quantization processor for inversely quantizing a quantized value read for each sample from a DCT data buffer, an IDCT processor for effecting an IDCT process on the inversely quantized data to restore time-domain audio data from frequency-domain data, a low-pass filter processor for removing a high-frequency component from the restored audio data, and an audio data output unit for outputting successive restored samples of audio data to a DAC to output sound from a speaker.

Type: Grant

Filed: July 12, 2000

Date of Patent: May 20, 2003

Assignee: Sony Computer Entertainment Inc.

Inventor: Takayuki Wakimura
Digital filter design method and apparatus

Patent number: 6564184

Abstract: A digital filter design apparatus for noise suppression by spectral subtraction includes a first spectrum estimator for determining a high frequency resolution noisy speech power spectral density estimate from a noisy speech signal block. A second spectrum estimator determines a high frequency resolution background noise power spectral density estimate from a background noise signal block. Averaging units form a piece-wise constant noisy speech power spectral density estimate and a piece-wise constant background noise power spectral density estimate. These averaging units are controlled by devices for adapting the length of individual segments to the shape of the high frequency-resolution noisy speech power spectral density estimate and for using the same segmentation in both piecewise constant estimates.

Type: Grant

Filed: September 6, 2000

Date of Patent: May 13, 2003

Assignee: Telefonaktiebolaget LM Ericsson (publ)

Inventor: Anders Eriksson
Codebook structure for changeable pulse multimode speech coding

Patent number: 6556966

Abstract: A speech compression system with a special fixed codebook structure and a new search routine is proposed for speech coding. The system is capable of encoding a speech signal into a bitstream for subsequent decoding to generate synthesized speech. The codebook structure uses a plurality of subcodebooks. Each subcodebook is designed to fit a specific group of speech signals. A criterion value is calculated for each subcodebook to minimize an error signal in a minimization loop as part of the coding system. An external signal sets a maximum bitstream rate for delivering encoded speech into a communications system. The speech compression system comprises a full-rate codec, a half-rate codec, a quarter-rate codec and an eighth-rate codec. Each codec is selectively activated to encode and decode the speech signals at different bit rates to enhance overall quality of the synthesized speech at a limited average bit rate.

Type: Grant

Filed: September 15, 2000

Date of Patent: April 29, 2003

Assignee: Conexant Systems, Inc.

Inventor: Yang Gao
Low complexity speaker verification using simplified hidden markov models with universal cohort models and automatic score thresholding

Patent number: 6556969

Abstract: A low complexity speaker verification system that employs universal cohort models and automatic score thresholding. The universal cohort models are generated using a simplified cohort model generating scheme. In certain embodiments of the invention, a simplified hidden Markov modeling (HMM) scheme is used to generate the cohort models. In addition, the low complexity speaker verification system is trained by various users of the low complexity speaker verification system. The total number of users of the low complexity speaker verification system may be modified over time as required by the specific application, and the universal cohort models may be updated accordingly to accommodate the new users. The present invention employs a combination of universal cohort modeling and thresholding to ensure high performance.

Type: Grant

Filed: September 30, 1999

Date of Patent: April 29, 2003

Assignee: Conexant Systems, Inc.

Inventors: Khaled Assaleh, Ayman Asadi
Method and apparatus for generating multilingual transcription groups

Patent number: 6549883

Abstract: The invention relates to a method and apparatus for generating transcriptions suitable for use in a speech-processing device. The invention provides processing a vocabulary item to derive a characteristic from the vocabulary item that allows a pool of available languages to be divided into a first sub-group and a second sub-group. The vocabulary item manifests a higher probability of belonging to any one of the languages in the first sub-group than of belonging to any language in the second sub-group. The invention further provides processing the vocabulary item to generate a group of transcriptions, the group of transcriptions being characterized by the absence of at least one transcription belonging to a language in the second sub-group of languages established for the vocabulary item. The group of transcriptions is then released for use by a speech-processing device.

Type: Grant

Filed: November 2, 1999

Date of Patent: April 15, 2003

Assignee: Nortel Networks Limited

Inventors: Marc A. Fabiani, Michael G. Sabourin
