Patents by Inventor Jean-Claude Junqua

Jean-Claude Junqua has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Constraint-based speech recognition system and method

Publication number: 20030115057

Abstract: A constraint-based speech recognition system for use with a form-filling application employed over a telephone system is disclosed. The system comprises an input signal, wherein the input signal includes both speech input and non-speech input of a type generated by a user via a manually operated device. The system further comprises a constraint module operable to access an information database containing information suitable for use with speech recognition, and to generate candidate information based on the non-speech input and the information database, wherein the candidate information corresponds to a portion of the information. The system further comprises a speech recognition module operable to recognize speech based on the speech input and the candidate information. In an exemplary embodiment, the manually operated device is a touch-tone telephone keypad, and the information database is a lexicon encoded according to classes defined by the keys of the keypad.

Type: Application

Filed: December 13, 2001

Publication date: June 19, 2003

Inventors: Jean-Claude Junqua, Matteo Contolini
System and interactive form filling with fusion of data from multiple unreliable information sources

Publication number: 20030115060

Abstract: An automated form filling system includes an input receptive of a plurality of information inputs from a plurality of information sources. An information fuser is operable to select information from the plurality of information inputs based on a comparison of the information inputs, and based on knowledge relating to reliability of the information sources. A form filler is operable to fill an electronic form with the selected information.

Type: Application

Filed: September 16, 2002

Publication date: June 19, 2003

Inventors: Jean-Claude Junqua, Kirill Stoimenov, Roland Kuhn
Apparatus for efficient dispatch and selection of information in law enforcement applications

Patent number: 6571174

Abstract: A navigation apparatus is disclosed which may be used by law enforcement personnel for rapid intervention to a location while adding safety and reliability to the process. The apparatus includes a computer system, having an operating system, memory and a user interface. The system further includes a positioning system, such as a GPS system for determining the position of a vehicle. The positioning system communicates with the operating system. An information database, communicating with the operating system, contains data related to routing information concerning routes for travel by the vehicle. The routing information includes safety information concerning route safety in the traveling region accessible by the vehicle. The apparatus further includes a routing system in communication with the operating system that determines a route based at least in part on the routing information.

Type: Grant

Filed: August 14, 2001

Date of Patent: May 27, 2003

Assignee: Matsushita Electric Industrial Co., Ltd.

Inventors: Luca Rigazio, Philippe R. Morin, Jean-Claude Junqua
Context-dependent acoustic models for medium and large vocabulary speech recognition with eigenvoice training

Patent number: 6571208

Abstract: A reduced dimensionality eigenvoice analytical technique is used during training to develop context-dependent acoustic models for allophones. The eigenvoice technique is also used during run time upon the speech of a new speaker. The technique removes individual speaker idiosyncrasies, to produce more universally applicable and robust allophone models. In one embodiment the eigenvoice technique is used to identify the centroid of each speaker, which may then be “subtracted out” of the recognition equation. In another embodiment maximum likelihood estimation techniques are used to develop common decision tree frameworks that may be shared across all speakers when constructing the eigenvoice representation of speaker space.

Type: Grant

Filed: November 29, 1999

Date of Patent: May 27, 2003

Assignee: Matsushita Electric Industrial Co., Ltd.

Inventors: Roland Kuhn, Jean-Claude Junqua, Matteo Contolini
Universal remote control allowing natural language modality for television and multimedia searches and requests

Patent number: 6553345

Abstract: The remote control unit supports multi-modal dialog with the user, through which the user can easily select programs for viewing or recording. The remote control houses a microphone into which the user can input natural language speech. The input speech is recognized and interpreted by a natural language parser that extracts the semantic content of the user's speech. The parser works in conjunction with an electronic program guide, through which the remote control system is able to ascertain what programs are available for viewing or recording and supply appropriate prompts to the user. In one embodiment, the remote control includes a touch screen display upon which the user may view prompts or make selections by pen input or tapping. Selections made on the touch screen automatically limit the context of the ongoing dialog between user and remote control, allowing the user to interact naturally with the unit.

Type: Grant

Filed: August 26, 1999

Date of Patent: April 22, 2003

Assignee: Matsushita Electric Industrial Co., Ltd.

Inventors: Roland Kuhn, Tony Davis, Jean-Claude Junqua, Yi Zhao, Weiying Li
Speaker and environment adaptation based on linear separation of variability sources

Publication number: 20030050780

Abstract: Linear approximation of the background noise is applied after feature extraction and prior to speaker adaptation to allow the speaker adaptation system to adapt the speech models to the enrolling user without distortion from background noise. The linear approximation is applied in the feature domain, such as in the cepstral domain. Any adaptation technique that is commutative in the feature domain may be used.

Type: Application

Filed: May 24, 2001

Publication date: March 13, 2003

Inventors: Luca Rigazio, Patrick Nguyen, David Kryze, Jean-Claude Junqua
Focused language models for improved speech input of structured documents

Publication number: 20030050778

Abstract: An e-mail message process is provided for use with a personal digital assistant which allows for the use of input speech messaging which is converted to text using a focused language model which is downloaded by a cellular phone connection to an Internet server which provides the focused language model based upon a topic for the intended e-mail message. The text that is generated from the input speech method can be summarized by the e-mail message processor and can be edited by the user. The generated e-mail message can then be transmitted again via cellular connection to an Internet e-mail server for transmitting the e-mail message to a recipient.

Type: Application

Filed: September 13, 2001

Publication date: March 13, 2003

Inventors: Patrick Nguyen, Luca Rigazio, Jean-Claude Junqua
Eigenvoice re-estimation technique of acoustic models for speech recognition, speaker identification and speaker verification

Publication number: 20030046068

Abstract: A reduced dimensionality eigenvoice analytical technique is used during training to develop context-dependent acoustic models for allophones. Re-estimation processes are performed to more strongly separate speaker-dependent and speaker-independent components of the speech model. The eigenvoice technique is also used during run time upon the speech of a new speaker. The technique removes individual speaker idiosyncrasies, to produce more universally applicable and robust allophone models. In one embodiment the eigenvoice technique is used to identify the centroid of each speaker, which may then be “subtracted out” of the recognition equation.

Type: Application

Filed: May 4, 2001

Publication date: March 6, 2003

Inventors: Florent Perronnin, Roland Kuhn, Patrick Nguyen, Jean-Claude Junqua
Method for noise adaptation in automatic speech recognition using transformed matrices

Patent number: 6529872

Abstract: The improved noise adaptation technique employs a linear or non-linear transformation to the set of Jacobian matrices corresponding to an initial noise condition. An &agr;-adaptation parameter or artificial intelligence operation is employed in a linear or non-linear way to increase the adaptation bias added to the speech models. This corrects shortcomings of conventional Jacobian adaptation, which tend to underestimate the effect of noise. The improved adaptation technique is further enhanced by a reduced dimensionality, principal component analysis technique that reduces the computational burden, making the adaptation technique beneficial in embedded recognition systems.

Type: Grant

Filed: April 18, 2000

Date of Patent: March 4, 2003

Assignee: Matsushita Electric Industrial Co., Ltd.

Inventors: Christophe Cerisara, Luca Rigazio, Robert Boman, Jean-Claude Junqua
Apparatus for efficient dispatch and selection of information in law enforcement applications

Publication number: 20030040865

Abstract: A navigation apparatus is disclosed which may be used by law enforcement personnel for rapid intervention to a location while adding safety and reliability to the process. The apparatus includes a computer system, having an operating system, memory and a user interface. The system further includes a positioning system, such as a GPS system for determining the position of a vehicle. The positioning system communicates with the operating system. An information database, communicating with the operating system, contains data related to routing information concerning routes for travel by the vehicle. The routing information includes safety information concerning route safety in the traveling region accessible by the vehicle. The apparatus further includes a routing system in communication with the operating system that determines a route based at least in part on the routing information.

Type: Application

Filed: August 14, 2001

Publication date: February 27, 2003

Inventors: Luca Rigazio, Philippe R. Morin, Jean-Claude Junqua
Discriminative clustering methods for automatic speech recognition

Patent number: 6526379

Abstract: The discriminative clustering technique tests a provided set of Gaussian distributions corresponding to an acoustic vector space. A distance metric, such as the Bhattacharyya distance, is used to assess which distributions are sufficiently proximal to be merged into a new distribution. Merging is accomplished by computing the centroid of the new distribution by minimizing the Bhattacharyya distance between the parameters of the Gaussian distributions being merged.

Type: Grant

Filed: November 29, 1999

Date of Patent: February 25, 2003

Assignee: Matsushita Electric Industrial Co., Ltd.

Inventors: Luca Rigazio, Brice Tsakam, Jean-Claude Junqua
Method for efficient, safe and reliable data entry by voice under adverse conditions

Publication number: 20030033146

Abstract: A method and apparatus for data entry by voice under adverse conditions is disclosed. More specifically it provides a way for efficient and robust form filling by voice. A form can typically contain one or several fields that must be filled in. The user communicates to a speech recognition system and word spotting is performed upon the utterance. The spotted words of an utterance form a phrase that can contain field-specific values and/or commands. Recognized values are echoed back to the speaker via a text-to-speech system. Unreliable or unsafe inputs for which the confidence measure is found to be low (e.g. ill-pronounced speech or noises) are rejected by the spotter. Speaker adaptation is furthermore performed transparently to improve speech recognition accuracy. Other input modalities can be additionally supported (e.g. keyboard and touch-screen). The system maintains a dialogue history to enable editing and correction operations on all active fields.

Type: Application

Filed: August 3, 2001

Publication date: February 13, 2003

Inventors: Philippe R. Morin, Jean-Claude Junqua, Luca Rigazio, Robert C. Boman, Peter Veprek
Method and tool for customization of speech synthesizer databases using hierarchical generalized speech templates

Patent number: 6513008

Abstract: A speech synthesizer customization system provides a mechanism for generating a hierarchical customized user database. The customization system has a template management tool for generating the templates based on customization data from a user and associated replicated dynamic synthesis data from a text-to-speech (TTS) synthesizer. The replicated dynamic synthesis data is arranged in a dynamic data structure having hierarchical levels. The customization system further includes a user database that supplements a standard database of the synthesizer. The tool populates the user database with the templates such that the templates enable the user database to uniformly override subsequently generated speech synthesis data at all hierarchical levels of the dynamic data structure.

Type: Grant

Filed: March 15, 2001

Date of Patent: January 28, 2003

Assignee: Matsushita Electric Industrial Co., Ltd.

Inventors: Steve Pearson, Peter Veprek, Jean-Claude Junqua
Automatic control of household activity using speech recognition and natural language

Patent number: 6513006

Abstract: Speech recognition and natural language parsing components are used to extract the meaning of the user's spoken input. The system stores a semantic representation of an electronic activity guide, and the contents of the guide can be mapped into the grammars used by the natural language parser. Thus, when the user wishes to navigate through the complex menu structure of the electronic activity guide, he or she only needs to speak in natural language sentences. The system automatically filters the contents of the guide and supplies the user with on-screen display or synthesized speech responses to the user's request. The system allows the user to communicate in a natural way with a variety of devices communicating with the home network or home gateway.

Type: Grant

Filed: June 6, 2001

Date of Patent: January 28, 2003

Assignee: Matsushita Electronic Industrial Co., Ltd.

Inventors: John Howard, Jean-Claude Junqua
Optimized local feature extraction for automatic speech recognition

Patent number: 6513004

Abstract: The acoustic speech signal is decomposed into wavelets arranged in an asymmetrical tree data structure from which individual nodes may be selected to best extract local features, as needed to model specific classes of sound units. The wavelet packet transformation is smoothed through integration and compressed to apply a non-linearity prior to discrete cosine transformation. The resulting subband features such as cepstral coefficients may then be used to construct the speech recognizer's speech models. Using the local feature information extracted in this manner allows a single recognizer to be optimized for several different classes of sound units, thereby eliminating the need for parallel path recognizers.

Type: Grant

Filed: November 24, 1999

Date of Patent: January 28, 2003

Assignee: Matsushita Electric Industrial Co., Ltd.

Inventors: Luca Rigazio, David Kryze, Ted Applebaum, Jean-Claude Junqua
Text selection and recording by feedback and adaptation for development of personalized text-to-speech systems

Publication number: 20020193994

Abstract: A new speaker provides speech from which comparison snippets are extracted. The comparison snippets are compared with initial snippets stored in a recorded snippet database that is associated with a concatenative synthesizer. The comparison of the snippets to the initial snippets produces required sound units. A greedy selection algorithm is performed with the required sound units for identifying the smallest subset of the input text that contains all of the text for the new speaker to read. The new speaker then reads the optimally selected text and sound units are extracted from the human speech such that the recorded snippet database is modified and the speech synthesized adopts the voice quality and characteristics of the new speaker.

Type: Application

Filed: March 30, 2001

Publication date: December 19, 2002

Inventors: Nicholas Kibre, Steven Pearson, Brian Hanson, Jean-Claude Junqua
Speech synthesis employing concatenated prosodic and acoustic templates for phrases of multiple words

Patent number: 6496801

Abstract: A speech synthesis system for generating voice dialog for a message frame having a fixed and a variable portion. A prosody module selects a prosodic template for each of the fixed and variable portions wherein at least one portion comprises a phrase of multiple words. An acoustic module selects an acoustic template for each of the fixed and variable portions wherein at least one portion comprises a phrase of multiple words. A frame generator concatenates the respective prosodic templates and acoustic templates. A sound module generates the voice dialog in accordance with the concatenated prosodic and acoustic templates.

Type: Grant

Filed: November 2, 1999

Date of Patent: December 17, 2002

Assignee: Matsushita Electric Industrial Co., Ltd.

Inventors: Peter Veprek, Steve Pearson, Jean-Claude Junqua
Automatic search of audio channels by matching viewer-spoken words against closed-caption/audio content for interactive television

Patent number: 6480819

Abstract: A method and apparatus is provided to enable a user watching and/or listening to a program to search for new information in the stream of a telecommunications data. The apparatus includes a voice recognition system that recognizes the user's request and causes a search to be performed in the long stream of data of at least one other telecommunication channel. The system includes a storage device for storing and processing the request. Upon recognition of the request, the incoming signal or signals are scanned for matches with the request. Upon finding the match between the request and the incoming signal, information related to the data is brought to the viewer's attention. This can be accomplished by either changing the viewer's station or by bringing in a split screen display forward into the display.

Type: Grant

Filed: February 25, 1999

Date of Patent: November 12, 2002

Assignee: Matsushita Electric Industrial Co., Ltd.

Inventors: Robert Boman, Jean-Claude Junqua
Speech detection for noisy conditions

Patent number: 6480823

Abstract: The input signal is transformed into the frequency domain and then subdivided into bands corresponding to different frequency ranges. Adaptive thresholds are applied to the data from each frequency band separately. Thus the short-term band-limited energies are tested for the presence or absence of a speech signal. The adaptive threshold values are independently updated for each of the signal paths, using a histogram data structure to accumulate long-term data representing the mean and variance of energy within the respective frequency band. Endpoint detection is performed by a state machine that transitions from the speech absent state to the speech present state, and vice versa, depending on the results of the threshold comparisons. A partial speech detection system handles cases in which the input signal is truncated.

Type: Grant

Filed: March 24, 1998

Date of Patent: November 12, 2002

Assignee: Matsushita Electric Industrial Co., Ltd.

Inventors: Yi Zhao, Jean-Claude Junqua
Method and apparatus for feature domain joint channel and additive noise compensation

Publication number: 20020165712

Abstract: A method for performing noise adaptation of a target speech signal input to a speech recognition system, where the target speech signal contains both additive and convolutional noises. The method includes estimating an additive noise bias and a convolutional noise bias; in the target speech signal; and jointly compensating the target speech signal for the additive and convolutional noise biases in a feature domain.

Type: Application

Filed: March 15, 2002

Publication date: November 7, 2002

Inventors: Younes Souilmi, Luca Rigazio, Patrick Nguyen, Jean-Claude Junqua

prev 1 2 3 4 5 6 7 next