Patents Examined by Daniel Nolan

Signal encoding and decoding method with electronic watermarking

Patent number: 6539356

Abstract: An encoder which encodes a voice in accordance with LD-CELP (Low-Delay Code Excited Linear Prediction) of the ITU-T Recommendation G.728. When a vibration wave is encoded by vector quantization, the code is secretly combined with other data. The encoder stores dividing key data kidx by which 128 types of representative vector data (waveform codes) yj; j=0, 1, . . . , 127 are labeled with 0 or 1 in order from the uppermost bit. If the bit is “0”, the vectors are quantized by using only the waveform codes yj corresponding to the bit “0” of the dividing key data kidx as the selection objects. If the bit is “1”, the vectors are quantized by using only the waveform codes yj corresponding to the bit “1” of the dividing key data kidx as the selection objects. Thus, the outputted voice code is combined with another datum bit.

Type: Grant

Filed: September 8, 2000

Date of Patent: March 25, 2003

Assignee: Kowa Co., Ltd.

Inventors: Kineo Matsui, Munetoshi Iwakiri
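
The bit-embedding idea in the abstract of patent 6539356 can be sketched as follows. This is a toy illustration, not the LD-CELP encoder itself: the four-entry, two-dimensional codebook, the key string, and the function names embed_bit and extract_bit are all hypothetical. It only shows how restricting the nearest-vector search to codes labeled with the hidden bit embeds one bit per emitted code.

```python
# Toy sketch of key-partitioned vector quantization (all values hypothetical).
CODEBOOK = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (1.0, 1.0)]
KEY_BITS = "0110"  # stand-in for kidx: the label of each code, in index order


def embed_bit(vector, bit):
    """Quantize `vector` using only codes whose key label equals `bit`."""
    candidates = [j for j, label in enumerate(KEY_BITS) if label == bit]
    # Pick the permitted code with the smallest squared Euclidean distance.
    return min(
        candidates,
        key=lambda j: sum((v - c) ** 2 for v, c in zip(vector, CODEBOOK[j])),
    )


def extract_bit(code_index):
    """Recover the hidden bit from a code index using the same key."""
    return KEY_BITS[code_index]
```

A receiver holding the same key reads each hidden bit directly from the code index; without the key, the stream looks like an ordinary sequence of codes.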
Voice-interactive docking station for a portable computing device

Patent number: 6539358

Abstract: A voice-interactive docking station is provided for use with a portable computing device. The portable computing device includes at least one information management application and a corresponding database for storing the data associated with the information management application. The docking station generally includes a speech input device for receiving speech input, a speech recognizer for translating the speech input into voice command data, and an interface application for interacting with the applications residing on the portable computing device. In particular, the interface application, in response to voice command data, accesses the data associated with the information management application residing on the portable computing device. The docking station may further include a text-to-speech synthesizer for converting output data from the interface application into speech output data, and an audio system for generating audio output from the speech output data.

Type: Grant

Filed: May 24, 2000

Date of Patent: March 25, 2003

Assignee: Delphi Technologies, Inc.

Inventors: Bradley S. Coon, Ronald K. Reger
Method and circuit arrangement for speech level measurement in a speech signal processing system

Patent number: 6539350

Abstract: Speech level measurement is particularly significant for successful echo compensation in telecommunications systems, for noise suppression in a noisy environment, for example in military vehicles, or in speech recognition and in speech coding and decoding systems. A method is indicated which permits speech level measurement only if features of speech are recognized and interferences and speech pauses are filtered out for the measurement. To this end, speech and pause detectors and a mean value generator are utilized, the time behavior of which is largely adapted to the perception capability of the human ear. Briefly spoken vowels thus are well detected, while nasal sounds or consonants are suppressed in the case of falling levels. A speech level measuring device is indicated which provides very accurate results in a short adaptation period.

Type: Grant

Filed: November 18, 1999

Date of Patent: March 25, 2003

Assignee: Alcatel

Inventor: Michael Walker
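
The gating idea in the abstract of patent 6539350 (measure the level only while speech is detected) can be sketched as follows. This is a heavily simplified illustration: the fixed energy threshold stands in for the speech and pause detectors, a plain average stands in for the mean value generator, and the perceptual time behavior mentioned in the abstract is not modeled. The function name and threshold value are hypothetical.

```python
def speech_level(frames, speech_threshold=0.01):
    """Mean energy over frames classified as speech, or None if there are none.

    A frame counts as speech when its mean energy reaches `speech_threshold`
    (an assumed stand-in for the patent's speech and pause detectors).
    """
    levels = []
    for frame in frames:
        energy = sum(x * x for x in frame) / len(frame)
        if energy >= speech_threshold:
            levels.append(energy)  # pauses and low-level noise are skipped
    return sum(levels) / len(levels) if levels else None
```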
Processing an audio bitstream signal

Patent number: 6535845

Abstract: A data processing apparatus includes an input terminal for receiving an audio signal, a 1-bit A/D converter for A/D converting the audio signal into a 1-bit bitstream signal, and a prediction unit for carrying out a prediction step on the bitstream signal so as to obtain a predicted bitstream signal. The data processing apparatus further includes a signal combination unit for combining the bitstream signal and the predicted bitstream signal so as to obtain a residue bitstream signal. A recording apparatus or a transmitter apparatus can use the data processing apparatus. The residue bitstream signal is data compressed by lossless encoding and then error encoded and channel encoded prior to transmission through a media.

Type: Grant

Filed: November 30, 2000

Date of Patent: March 18, 2003

Assignee: Koninklijke Philips Electronics N.V.

Inventors: Renatus J. Van Der Vleuten, Alphons A. M. L. Bruekers, Arnoldus W. J. Oomen
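
The prediction and residue steps in the abstract of patent 6535845 can be sketched as follows. The abstract does not specify the predictor or the combination unit, so both are assumptions here: the predictor simply repeats the previous bit, and the combiner is XOR. The function names are hypothetical.

```python
def predict(history):
    """Hypothetical predictor: guess that the next bit repeats the last one."""
    return history[-1] if history else 0


def to_residue(bits):
    """Combine each bit with its prediction via XOR (an assumed combiner)."""
    return [bit ^ predict(bits[:i]) for i, bit in enumerate(bits)]


def from_residue(residue):
    """Invert to_residue by re-running the same predictor on decoded bits."""
    bits = []
    for r in residue:
        bits.append(r ^ predict(bits))
    return bits
```

When the predictor is right most of the time, the residue is mostly zeros, which is what makes the subsequent lossless encoding worthwhile.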
Automatic detection of non-stationarity in speech signals

Patent number: 6535843

Abstract: When necessary to time scale a speech signal, it is advantageous to do it under influence of a signal that measures the small-window non-stationarity of the speech signal. Three measures of stationarity are disclosed: one that is based on time domain analysis, one that is based on frequency domain analysis, and one that is based on both time and frequency domain analysis.

Type: Grant

Filed: August 18, 1999

Date of Patent: March 18, 2003

Assignee: AT&T Corp.

Inventors: Ioannis G. Stylianou, David A. Kapilow, Juergen Schroeter
Apparatus and method of controlling a voice controlled operation

Patent number: 6532447

Abstract: An electronic device, e.g. a mobile telephone, includes speech recognition means for controlling the operation of the device in response to a voice command. The device includes a proximity sensor, e.g. a capacitive, inductive, or infrared (IR) proximity sensor, for providing a control signal indicative of whether an object is in proximity of the device, and control means for controlling the voice controlled operation of the device in response to the control signal.

Type: Grant

Filed: June 6, 2000

Date of Patent: March 11, 2003

Assignee: Telefonaktiebolaget LM Ericsson (publ)

Inventor: Nils Christensson
Server based speech recognition user interface for wireless devices

Patent number: 6532446

Abstract: A wireless communication system that utilizes a remote voice recognition server system to translate voice input received from serviced mobile devices into a symbolic data file (e.g. alpha-numeric or control characters) that can be processed by the mobile devices. The translation process begins by establishing a voice communication channel between the serviced mobile device and the voice recognition server. A user of the mobile device then begins speaking in a fashion that may be detected by the voice recognition server system. Upon detecting the user's speech, the voice recognition server system translates the speech into a symbolic data file, which is then forwarded to the user through a separate data communication channel. The user, upon receiving the symbolic data file at the mobile device, reviews and edits the content and further utilizes the file as desired.

Type: Grant

Filed: August 21, 2000

Date of Patent: March 11, 2003

Assignee: Openwave Systems Inc.

Inventor: Peter F. King
Identifying voice mail messages using speaker identification

Patent number: 6529870

Abstract: A method and apparatus for identifying voice mail messages uses speaker identification to identify a voice mail message. The method preferably includes comparing the voice mail message to existing voice samples in order to determine a matching coefficient. If the matching coefficient is within an allowed range, the voice mail message is categorized as a matched voice mail message and a name indicator is coupled to the voice mail message. The apparatus includes a user interface, a processing unit, and a storage media. The user interface receives the voice mail message and allows access to the voice mail message by a voice mail recipient. The processing unit compares the voice mail message to the existing voice samples, determines the matching coefficient, and assigns the name indicator. The storage media stores the voice mail message.

Type: Grant

Filed: October 4, 1999

Date of Patent: March 4, 2003

Assignee: Avaya Technology Corporation

Inventor: Rajendra Prasad Mikkilineni
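
The matching step in the abstract of patent 6529870 can be sketched as follows. The abstract does not say how the matching coefficient is computed, so cosine similarity over feature vectors is used purely as an assumption; the function names, the feature vectors, and the 0.9 threshold are likewise hypothetical.

```python
import math


def matching_coefficient(a, b):
    """Cosine similarity between two feature vectors (an assumed measure)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def label_message(message_features, voice_samples, threshold=0.9):
    """Return the best-matching name if its score is in range, else None."""
    best_name, best_score = None, 0.0
    for name, sample in voice_samples.items():
        score = matching_coefficient(message_features, sample)
        if score > best_score:
            best_name, best_score = name, score
    # Only attach a name indicator when the coefficient is within the allowed range.
    return best_name if best_score >= threshold else None
```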
Audio signal pitch adjustment apparatus and method

Patent number: 6519558

Abstract: A signal processing method and apparatus is disclosed, which is capable of reproducing a coded audio signal by decoding it while shifting its pitch, and reproducing, from an original sound, a sound having a sufficiently higher pitch than the original sound with few operations and less cost for the decoder used in the signal processing apparatus, and an information serving medium for serving a program which implements the signal decoding and pitch shifting. In one embodiment, the signal processing method for decoding a coded signal for reading includes setting a pitch for the coded signal, decoding only a low frequency portion of the coded signal according to the set pitch, and shifting the pitch of the decoded read signal based on the set pitch.

Type: Grant

Filed: May 19, 2000

Date of Patent: February 11, 2003

Assignee: Sony Corporation

Inventor: Kyoya Tsutsui
Generating synthesized voice and instrumental sound

Patent number: 6513007

Abstract: There is provided a synthesized sound generating apparatus and method which can achieve responsive and high-quality speech synthesis based on a real-time convolution operation. Coefficients are generated by using dynamic cutting to extract characteristic information from a first signal. A convolution operation is performed on a second signal using the generated coefficients to generate a synthesized signal. As the convolution operation, an interpolation process is performed on the coefficients to prevent a rapid change in level of the generated synthesized signal upon switching of the coefficients.

Type: Grant

Filed: July 20, 2000

Date of Patent: January 28, 2003

Assignee: Yamaha Corporation

Inventor: Akio Takahashi
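
The coefficient interpolation mentioned in the abstract of patent 6513007 can be sketched as follows. This is a minimal illustration under stated assumptions: the interpolation is a linear crossfade over a fixed number of samples, both coefficient sets have the same length, and the function name and parameters are hypothetical. The extraction of coefficients from the first signal is not shown.

```python
def convolve_with_crossfade(signal, old_coeffs, new_coeffs, fade_len):
    """FIR-filter `signal` while moving linearly from old to new coefficients.

    The blend weight reaches 1.0 after `fade_len` samples (fade_len > 0),
    so the output level changes gradually instead of jumping at the switch.
    """
    out = []
    for n in range(len(signal)):
        w = min(1.0, (n + 1) / fade_len)
        coeffs = [(1 - w) * o + w * c for o, c in zip(old_coeffs, new_coeffs)]
        acc = 0.0
        for k, c in enumerate(coeffs):
            if n - k >= 0:
                acc += c * signal[n - k]
        out.append(acc)
    return out
```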
Optimized local feature extraction for automatic speech recognition

Patent number: 6513004

Abstract: The acoustic speech signal is decomposed into wavelets arranged in an asymmetrical tree data structure from which individual nodes may be selected to best extract local features, as needed to model specific classes of sound units. The wavelet packet transformation is smoothed through integration and compressed to apply a non-linearity prior to discrete cosine transformation. The resulting subband features such as cepstral coefficients may then be used to construct the speech recognizer's speech models. Using the local feature information extracted in this manner allows a single recognizer to be optimized for several different classes of sound units, thereby eliminating the need for parallel path recognizers.

Type: Grant

Filed: November 24, 1999

Date of Patent: January 28, 2003

Assignee: Matsushita Electric Industrial Co., Ltd.

Inventors: Luca Rigazio, David Kryze, Ted Applebaum, Jean-Claude Junqua
Method and apparatus for separating processing for language-understanding from an application and its functionality

Patent number: 6513010

Abstract: The invention relates to a method and an apparatus for separating processing for language understanding from an application and its functionality. The application for example contains functionality within a provided domain, whereby the domain and its functionality belongs to a speech controlled device such as a car-radio, a car-AC, a CD-player or almost any program controlled device.

Type: Grant

Filed: May 30, 2000

Date of Patent: January 28, 2003

Assignee: Voxi AB

Inventors: Erland Lewin, Marten Stenius, Daniel Adler
Automatic control of household activity using speech recognition and natural language

Patent number: 6513006

Abstract: Speech recognition and natural language parsing components are used to extract the meaning of the user's spoken input. The system stores a semantic representation of an electronic activity guide, and the contents of the guide can be mapped into the grammars used by the natural language parser. Thus, when the user wishes to navigate through the complex menu structure of the electronic activity guide, he or she only needs to speak in natural language sentences. The system automatically filters the contents of the guide and supplies the user with on-screen display or synthesized speech responses to the user's request. The system allows the user to communicate in a natural way with a variety of devices communicating with the home network or home gateway.

Type: Grant

Filed: June 6, 2001

Date of Patent: January 28, 2003

Assignee: Matsushita Electric Industrial Co., Ltd.

Inventors: John Howard, Jean-Claude Junqua
Method and apparatus for recognizing tone languages using pitch information

Patent number: 6510410

Abstract: A method and an apparatus for automatic recognition of tone languages, employing the steps of converting the words of speech into an electrical signal, generating spectral features from the electrical signal, extracting pitch values from the electrical signal, combining said spectral features and the pitch values into acoustic feature vectors, comparing the acoustic feature vectors with prototypes of phonemes in an acoustic prototype database including prototypes of toned vowels to produce labels, and matching the labels to text using a decoder comprising a phonetic vocabulary and a language model database.

Type: Grant

Filed: July 28, 2000

Date of Patent: January 21, 2003

Assignee: International Business Machines Corporation

Inventors: Julian Chengjun Chen, Guo Kang Fu, Hai Ping Li, Li Qin Shen
Method and apparatus for performing refueling transactions using customer-issued speech commands

Patent number: 6510416

Abstract: A fuel delivery system performs a refueling operation in accordance with voice command signals audibly provided by a customer situated within a vehicle positioned for refueling. The customer generates audio signals representative of refueling transaction information, such as a request to initiate fuel dispensing activity, data indicative of the fuel type, fuel volume, purchase amount, and form of payment, and a request to terminate the refueling transaction. A microphone detects the customer voice signals and provides audio detection signals representative thereof. A speech recognition device converts the audio detection signals into corresponding fuel dispensing command signals. A controller generates fuel dispensing control signals in accordance with the fuel dispensing command signals and couples these control signals to the fuel dispenser assembly to effect operative control thereof.

Type: Grant

Filed: August 23, 1999

Date of Patent: January 21, 2003

Assignee: Tokheim Corporation

Inventor: Dinesh Lad
Speech band sampling rate expansion

Patent number: 6507820

Abstract: The present invention relates to a method for the band expansion of speech for telephones, in particular for mobile telephones, by increasing the effective sampling rate of the speech signal by the insertion of additional samples and subsequent filtering of the expanded bandwidth speech signal.

Type: Grant

Filed: July 3, 2000

Date of Patent: January 14, 2003

Assignee: Telefonaktiebolaget LM Ericsson

Inventor: Petra Deutgen
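
The two steps named in the abstract of patent 6507820 (inserting additional samples, then filtering) can be sketched as follows. The abstract does not say which samples are inserted or which filter is used, so zero insertion and a three-tap filter are assumptions, and the function names are hypothetical.

```python
def insert_samples(signal, factor=2):
    """Raise the sampling rate by inserting factor-1 zeros after each sample."""
    expanded = []
    for s in signal:
        expanded.append(s)
        expanded.extend([0.0] * (factor - 1))
    return expanded


def smooth(signal, taps=(0.5, 1.0, 0.5)):
    """Apply a centered FIR filter, treating samples past the edges as zero."""
    half = len(taps) // 2
    out = []
    for n in range(len(signal)):
        acc = 0.0
        for k, t in enumerate(taps):
            i = n + k - half
            if 0 <= i < len(signal):
                acc += t * signal[i]
        out.append(acc)
    return out
```

With a factor of 2 and these placeholder taps, the filter fills each inserted zero with the average of its two neighbors, i.e. linear interpolation; a real band-expansion filter would be designed for the target spectrum.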
Voice IP approval system using voice-enabled web based application server

Patent number: 6507817

Abstract: A web-based voice messaging system uses an application server, configured for executing a voice application defined by XML documents, that accesses subscriber attributes from a standardized information database server (such as LDAP), and messages from a standardized messaging server (such as IMAP). The application server, upon receiving a request from a browser serving a subscriber, accesses an IP based messaging server to obtain a stored message for the subscriber from a second party. The stored message identifies a text-based form. The application server converts the text-based form into an audible format. The application server generates an HTML document to present to the subscriber the form in audible format with audible prompts, based on the HTTP request and the text-based form. The application server then updates the text-based form based on an input from the subscriber. Thus, text-based forms can be retrieved and acted on via a telephone.

Type: Grant

Filed: June 27, 2000

Date of Patent: January 14, 2003

Assignee: Cisco Technology, Inc.

Inventors: William M. Wolfe, Lewis Dean Dodrill, Ryan Alan Danner
Speech recognition that adjusts automatically to input devices

Patent number: 6505161

Abstract: A system and method for customizing the operating characteristics of a speech recognition system with characteristics of a user device. The user device transmits information representing the operating characteristics of itself to the speech recognition system. The speech recognition system determines the speech characteristics of the user device from this information. The speech recognition system obtains the speech characteristics relating to the device from a database and configures the speech recognition system with these characteristics.

Type: Grant

Filed: May 1, 2000

Date of Patent: January 7, 2003

Assignee: Sprint Communications Company L.P.

Inventor: Douglas J. Brems
Network and method for providing an automatic recall telecommunications service with automatic speech recognition capability

Patent number: 6505163

Abstract: A network for providing a telecommunications service with automatic speech recognition to a telecommunications user, including a switch in communication with a telecommunications device associated with the telecommunications user for storing identifying information regarding a prior calling party to the telecommunications user, and for detecting a trigger specific to the service in response to a communication from the telecommunications device, and an intelligent resource server in communication with the switch for receiving from the switch the communication from the telecommunications device and a message including the identifying information, for playing an audible message for the telecommunications user in response to receiving the communication, the audible message containing the information regarding the prior calling party and prompting the telecommunications user to place an outgoing communication to the prior calling party, and for automatically recognizing a predetermined keyword spoken by the teleco

Type: Grant

Filed: August 9, 2000

Date of Patent: January 7, 2003

Assignee: BellSouth Intellectual Property Corporation

Inventors: Zeeman Zhang, Joseph Knoerle
Method and apparatus for using formant models in speech systems

Patent number: 6505152

Abstract: A model is provided for formants found in human speech. Under one aspect of the invention, the model is used in formant tracking by providing probabilities that describe the likelihood that a candidate formant is actually a formant in the speech signal. Other aspects of the invention use this formant tracking to improve the model by regenerating the model based on the formants detected by the formant tracker. Still other aspects of the invention use the formant tracking to compress a speech signal by removing some of the formants from the speech signal. A further aspect of the invention uses the formant model to synthesize speech. Under this aspect of the invention, the formant model is used to identify a most likely formant track for the synthesized speech. Based on this track, a series of resonators are used to introduce the formants into the speech signal.

Type: Grant

Filed: September 3, 1999

Date of Patent: January 7, 2003

Assignee: Microsoft Corporation

Inventor: Alejandro Acero
