Patents by Inventor Luca Rigazio

Luca Rigazio has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20050159952
    Abstract: A method is provided for improving pattern matching in a speech recognition system having a plurality of acoustic models (20). Similarity measures for acoustic feature vectors (54) are determined in groups that are then buffered into cache memory (59). To further reduce computational processing, the acoustic data may be partitioned amongst a plurality of processing nodes (66, 67, 68). In addition, a priori knowledge of the spoken order may be used to establish the access order (124) used to copy records from the main speech parameter table (120, 200) into a sub-table (130, 204). The sub-table is processed such that the entries are in contiguous memory locations (206) and sorted according to the processing order (208). The speech processing algorithm is then directed to operate upon the sub-table (210) which causes the processor to load the sub-table into high speed cache memory (104, 212).
    Type: Application
    Filed: March 19, 2003
    Publication date: July 21, 2005
    Applicant: MATSUSHITA ELECTRIC INDUSTRIAL CO., LTD
    Inventors: Patrick Nguyen, Luca Rigazio
  • Patent number: 6915259
    Abstract: Linear approximation of the background noise is applied after feature extraction and prior to speaker adaptation to allow the speaker adaptation system to adapt the speech models to the enrolling user without distortion from background noise. The linear approximation is applied in the feature domain, such as in the cepstral domain. Any adaptation technique that is commutative in the feature domain may be used.
    Type: Grant
    Filed: May 24, 2001
    Date of Patent: July 5, 2005
    Assignee: Matsushita Electric Industrial Co., Ltd.
    Inventors: Luca Rigazio, Patrick Nguyen, David Kryze, Jean-Claude Junqua
  • Patent number: 6907397
    Abstract: An embedded device for playing media files is capable of generating a play list of media files based on input speech from a user. It includes an indexer generating a plurality of speech recognition grammars. According to one aspect of the invention, the indexer generates speech recognition grammars based on contents of a media file header of the media file. According to another aspect of the invention, the indexer generates speech recognition grammars based on categories in a file path for retrieving the media file to a user location. When a speech recognizer receives an input speech from a user while in a selection mode, a media file selector compares the input speech received while in the selection mode to the plurality of speech recognition grammars, thereby selecting the media file.
    Type: Grant
    Filed: September 16, 2002
    Date of Patent: June 14, 2005
    Assignee: Matsushita Electric Industrial Co., Ltd.
    Inventors: David Kryze, Luca Rigazio, Patrick Nguyen, Jean-Claude Junqua
  • Patent number: 6901364
    Abstract: An e-mail message process is provided for use with a personal digital assistant which allows for the use of input speech messaging which is converted to text using a focused language model which is downloaded by a cellular phone connection to an Internet server which provides the focused language model based upon a topic for the intended e-mail message. The text that is generated from the input speech method can be summarized by the e-mail message processor and can be edited by the user. The generated e-mail message can then be transmitted again via cellular connection to an Internet e-mail server for transmitting the e-mail message to a recipient.
    Type: Grant
    Filed: September 13, 2001
    Date of Patent: May 31, 2005
    Assignee: Matsushita Electric Industrial Co., Ltd.
    Inventors: Patrick Nguyen, Luca Rigazio, Jean-Claude Junqua
  • Patent number: 6889189
    Abstract: System speakers are switched to function as sound input transducers to improve recognizer performance and to support recognizer features. A crossbar switch is selectively activated, either manually or under software control, to allow system loudspeakers to function as sound input transducers that supplement the recognition system microphone or microphone array. Using loudspeakers as “microphones” improves speech recognition in noisy environments, thus attaining better recognition performance with little added system cost. The loudspeakers, positioned in physically separate locations also provide spatial information that can be used to determine the location of the person speaking and thereby offer different functionality for different persons. Acoustic models are selected based on environmental and vehicle operating conditions and may be adapted dynamically using ambient information obtained using the loudspeakers as sound input transducers.
    Type: Grant
    Filed: September 26, 2003
    Date of Patent: May 3, 2005
    Assignee: Matsushita Electric Industrial Co., Ltd.
    Inventors: Robert Boman, Luca Rigazio, Brian Hanson, Rathinavelu Chengalvarayan
  • Patent number: 6879954
    Abstract: A method is provided for improving pattern matching in a speech recognition system having a plurality of acoustic models. The improved method includes: receiving continuous speech input; generating a sequence of acoustic feature vectors that represent temporal and spectral behavior of the speech input; loading a first group of acoustic feature vectors from the sequence of acoustic feature vectors into a memory workspace accessible to a processor; loading an acoustic model from the plurality of acoustic models into the memory workspace; and determining a similarity measure for each acoustic feature vector of the first group of acoustic feature vectors in relation to the acoustic model. Prior to retrieving another group of acoustic feature vectors, similarity measures are computed for the first group of acoustic feature vectors in relation to each of the acoustic models employed by the speech recognition system.
    Type: Grant
    Filed: April 22, 2002
    Date of Patent: April 12, 2005
    Assignee: Matsushita Electric Industrial Co., Ltd.
    Inventors: Patrick Nguyen, Luca Rigazio
  • Publication number: 20050075881
    Abstract: A media capture device has an audio input receptive of user speech relating to a media capture activity in close temporal relation to the media capture activity. A plurality of focused speech recognition lexica respectively relating to media capture activities are stored on the device, and a speech recognizer recognizes the user speech based on a selected one of the focused speech recognition lexica. A media tagger tags captured media with generated speech recognition text, and a media annotator annotates the captured media with a sample of the user speech that is suitable for input to a speech recognizer. Tagging and annotating are based on close temporal relation between receipt of the user speech and capture of the captured media. Annotations may be converted to tags during post processing, employed to edit a lexicon using letter-to-sound rules and spelled word input, or matched directly to speech to retrieve captured media.
    Type: Application
    Filed: October 2, 2003
    Publication date: April 7, 2005
    Inventors: Luca Rigazio, Robert Boman, Patrick Nguyen, Jean-Claude Junqua
  • Publication number: 20050071159
    Abstract: System speakers are switched to function as sound input transducers to improve recognizer performance and to support recognizer features. A crossbar switch is selectively activated, either manually or under software control, to allow system loudspeakers to function as sound input transducers that supplement the recognition system microphone or microphone array. Using loudspeakers as “microphones” improves speech recognition in noisy environments, thus attaining better recognition performance with little added system cost. The loudspeakers, positioned in physically separate locations also provide spatial information that can be used to determine the location of the person speaking and thereby offer different functionality for different persons. Acoustic models are selected based on environmental and vehicle operating conditions and may be adapted dynamically using ambient information obtained using the loudspeakers as sound input transducers.
    Type: Application
    Filed: September 26, 2003
    Publication date: March 31, 2005
    Inventors: Robert Boman, Luca Rigazio, Brian Hanson, Rathinavelu Chengalvarayan
  • Publication number: 20050038655
    Abstract: An improved method is provided for constructing compact acoustic models for use in a speech recognizer. The method includes: partitioning speech data from a plurality of training speakers according to at least one speech related criteria (i.e., vocal tract length); grouping together the partitioned speech data from training speakers having a similar speech characteristic; and training an acoustic bubble model for each group using the speech data within the group.
    Type: Application
    Filed: August 13, 2003
    Publication date: February 17, 2005
    Inventors: Ambroise Mutel, Patrick Nguyen, Luca Rigazio
  • Publication number: 20050010411
    Abstract: A speech data mining system for use in generating a rich transcription having utility in call center management includes a speech differentiation module differentiating between speech of interacting speakers, and a speech recognition module improving automatic recognition of speech of one speaker based on interaction with another speaker employed as a reference speaker. A transcript generation module generates a rich transcript based on recognized speech of the speakers. Focused, interactive language models improve recognition of a customer on a low quality channel using context extracted from speech of a call center operator on a high quality channel with a speech model adapted to the operator. Mined speech data includes number of interaction turns, customer frustration phrases, operator polity, interruptions, and/or contexts extracted from speech recognition results, such as topics, complaints, solutions, and resolutions.
    Type: Application
    Filed: July 9, 2003
    Publication date: January 13, 2005
    Inventors: Luca Rigazio, Patrick Nguyen, Jean-Claude Junqua, Robert Boman
  • Publication number: 20040054541
    Abstract: An embedded device for playing media files is capable of generating a play list of media files based on input speech from a user. It includes an indexer generating a plurality of speech recognition grammars. According to one aspect of the invention, the indexer generates speech recognition grammars based on contents of a media file header of the media file. According to another aspect of the invention, the indexer generates speech recognition grammars based on categories in a file path for retrieving the media file to a user location. When a speech recognizer receives an input speech from a user while in a selection mode, a media file selector compares the input speech received while in the selection mode to the plurality of speech recognition grammars, thereby selecting the media file.
    Type: Application
    Filed: September 16, 2002
    Publication date: March 18, 2004
    Inventors: David Kryze, Luca Rigazio, Patrick Nguyen, Jean-Claude Junqua
  • Patent number: 6691091
    Abstract: A noise adaptation system and method provide for noise adaptation in a speech recognition system. The method includes the steps of generating a reference model based on a training speech signal, and compensating the reference model for additive noise in the cepstral domain. The reference model is also compensated for convolutional noise in the cepstral domain. In one embodiment, the convolutional noise is compensated for by estimating a convolutional bias between the reference model and a target speech signal. The estimated convolutional bias is transformed with a channel adaptation matrix, and the transformed convolutional bias is added to the reference model in the cepstral domain.
    Type: Grant
    Filed: July 31, 2000
    Date of Patent: February 10, 2004
    Assignee: Matsushita Electric Industrial Co., Ltd.
    Inventors: Christophe Cerisara, Luca Rigazio, Robert Boman, Jean-Claude Junqua
  • Patent number: 6687672
    Abstract: Methods and apparatus for blind channel estimation of a speech signal corrupted by a communication channel are provided. One method includes converting a noisy speech signal into either a cepstral representation or a log-spectral representation; estimating a correlation of the representation of the noisy speech signal; determining an average of the noisy speech signal; constructing and solving, subject to a minimization constraint, a system of linear equations utilizing a correlation structure of a clean speech training signal, the correlation of the representation of the noisy speech signal, and the average of the noisy speech signal; and selecting a sign of the solution of the system of linear equations to estimate an average clean speech signal in a processing window.
    Type: Grant
    Filed: March 15, 2002
    Date of Patent: February 3, 2004
    Assignee: Matsushita Electric Industrial Co., Ltd.
    Inventors: Younes Souilmi, Luca Rigazio, Patrick Nguyen, Jean-Claude Junqua
  • Publication number: 20030200085
    Abstract: A method is provided for improving pattern matching in a speech recognition system having a plurality of acoustic models. The improved method includes: receiving continuous speech input; generating a sequence of acoustic feature vectors that represent temporal and spectral behavior of the speech input; loading a first group of acoustic feature vectors from the sequence of acoustic feature vectors into a memory workspace accessible to a processor; loading an acoustic model from the plurality of acoustic models into the memory workspace; and determining a similarity measure for each acoustic feature vector of the first group of acoustic feature vectors in relation to the acoustic model. Prior to retrieving another group of acoustic feature vectors, similarity measures are computed for the first group of acoustic feature vectors in relation to each of the acoustic models employed by the speech recognition system.
    Type: Application
    Filed: April 22, 2002
    Publication date: October 23, 2003
    Inventors: Patrick Nguyen, Luca Rigazio
  • Publication number: 20030177003
    Abstract: Methods and apparatus for blind channel estimation of a speech signal corrupted by a communication channel are provided. One method includes converting a noisy speech signal into either a cepstral representation or a log-spectral representation; estimating a correlation of the representation of the noisy speech signal; determining an average of the noisy speech signal; constructing and solving, subject to a minimization constraint, a system of linear equations utilizing a correlation structure of a clean speech training signal, the correlation of the representation of the noisy speech signal, and the average of the noisy speech signal; and selecting a sign of the solution of the system of linear equations to estimate an average clean speech signal in a processing window.
    Type: Application
    Filed: March 15, 2002
    Publication date: September 18, 2003
    Inventors: Younes Souilmi, Luca Rigazio, Patrick Nguyen, Jean-Claude Junqua
  • Patent number: 6571174
    Abstract: A navigation apparatus is disclosed which may be used by law enforcement personnel for rapid intervention to a location while adding safety and reliability to the process. The apparatus includes a computer system, having an operating system, memory and a user interface. The system further includes a positioning system, such as a GPS system for determining the position of a vehicle. The positioning system communicates with the operating system. An information database, communicating with the operating system, contains data related to routing information concerning routes for travel by the vehicle. The routing information includes safety information concerning route safety in the traveling region accessible by the vehicle. The apparatus further includes a routing system in communication with the operating system that determines a route based at least in part on the routing information.
    Type: Grant
    Filed: August 14, 2001
    Date of Patent: May 27, 2003
    Assignee: Matsushita Electric Industrial Co., Ltd.
    Inventors: Luca Rigazio, Philippe R. Morin, Jean-Claude Junqua
  • Publication number: 20030050778
    Abstract: An e-mail message process is provided for use with a personal digital assistant which allows for the use of input speech messaging which is converted to text using a focused language model which is downloaded by a cellular phone connection to an Internet server which provides the focused language model based upon a topic for the intended e-mail message. The text that is generated from the input speech method can be summarized by the e-mail message processor and can be edited by the user. The generated e-mail message can then be transmitted again via cellular connection to an Internet e-mail server for transmitting the e-mail message to a recipient.
    Type: Application
    Filed: September 13, 2001
    Publication date: March 13, 2003
    Inventors: Patrick Nguyen, Luca Rigazio, Jean-Claude Junqua
  • Publication number: 20030050780
    Abstract: Linear approximation of the background noise is applied after feature extraction and prior to speaker adaptation to allow the speaker adaptation system to adapt the speech models to the enrolling user without distortion from background noise. The linear approximation is applied in the feature domain, such as in the cepstral domain. Any adaptation technique that is commutative in the feature domain may be used.
    Type: Application
    Filed: May 24, 2001
    Publication date: March 13, 2003
    Inventors: Luca Rigazio, Patrick Nguyen, David Kryze, Jean-Claude Junqua
  • Patent number: 6529872
    Abstract: The improved noise adaptation technique employs a linear or non-linear transformation to the set of Jacobian matrices corresponding to an initial noise condition. An &agr;-adaptation parameter or artificial intelligence operation is employed in a linear or non-linear way to increase the adaptation bias added to the speech models. This corrects shortcomings of conventional Jacobian adaptation, which tend to underestimate the effect of noise. The improved adaptation technique is further enhanced by a reduced dimensionality, principal component analysis technique that reduces the computational burden, making the adaptation technique beneficial in embedded recognition systems.
    Type: Grant
    Filed: April 18, 2000
    Date of Patent: March 4, 2003
    Assignee: Matsushita Electric Industrial Co., Ltd.
    Inventors: Christophe Cerisara, Luca Rigazio, Robert Boman, Jean-Claude Junqua
  • Publication number: 20030040865
    Abstract: A navigation apparatus is disclosed which may be used by law enforcement personnel for rapid intervention to a location while adding safety and reliability to the process. The apparatus includes a computer system, having an operating system, memory and a user interface. The system further includes a positioning system, such as a GPS system for determining the position of a vehicle. The positioning system communicates with the operating system. An information database, communicating with the operating system, contains data related to routing information concerning routes for travel by the vehicle. The routing information includes safety information concerning route safety in the traveling region accessible by the vehicle. The apparatus further includes a routing system in communication with the operating system that determines a route based at least in part on the routing information.
    Type: Application
    Filed: August 14, 2001
    Publication date: February 27, 2003
    Inventors: Luca Rigazio, Philippe R. Morin, Jean-Claude Junqua