Patents by Inventor Michael Galler

Michael Galler has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 6233561
    Abstract: A computer-implemented method and apparatus is provided for processing a spoken request from a user. A speech recognizer converts the spoken request into a digital format. A frame data structure associates semantic components of the digitized spoken request with predetermined slots. The slots are indicative of data which are used to achieve a predetermined goal. A speech understanding module which is connected to the speech recognizer and to the frame data structure determines semantic components of the spoken request. The slots are populated based upon the determined semantic components. A dialog manager which is connected to the speech understanding module may determine at least one slot which is unpopulated based upon the determined semantic components and in a preferred embodiment may provide confirmation of the populated slots. A computer generated-request is formulated in order for the user to provide data related to the unpopulated slot.
    Type: Grant
    Filed: April 12, 1999
    Date of Patent: May 15, 2001
    Assignee: Matsushita Electric Industrial Co., Ltd.
    Inventors: Jean-Claude Junqua, Roland Kuhn, Matteo Contolini, Murat Karaorman, Ken Field, Michael Galler, Yi Zhao
  • Patent number: 6182039
    Abstract: The speech recognizer incorporates a language model that reduces the number of acoustic pattern matching sequences that must be performed by the recognizer. The language model is based on knowledge of a pre-defined set of syntactically defined content and includes a data structure that organizes the content according to acoustic confusability. A spelled name recognition system based on the recognizer employs a language model based on classes of letters that the recognizer frequently confuses for one another. The language model data structure is optionally an N-gram data structure, a tree data structure, or an incrementally configured network that is built during a training sequence. The incrementally configured network has nodes that are selected based on acoustic distance from a predetermined lexicon.
    Type: Grant
    Filed: March 24, 1998
    Date of Patent: January 30, 2001
    Assignee: Matsushita Electric Industrial Co., Ltd.
    Inventors: Luca Rigazio, Jean-Claude Junqua, Michael Galler
  • Patent number: 5991720
    Abstract: The input speech is segmented using plural grammar networks, including a network that includes a filler model designed to represent noise or extraneous speech. Recognition processing results in plural lists of candidates, each list containing the N-best candidates generated. The lists are then separately aligned with the dictionary of valid names to generate two lists of valid names. The final recognition pass combines these two lists of names into a dynamic grammar and this dynamic grammar may be used to find the best candidate name using Viterbi recognition. A telephone call routing application based on the recognition system selects the best candidate name corresponding to the name spelled by the user, whether the user pronounces the name prior to spelling, or not.
    Type: Grant
    Filed: April 16, 1997
    Date of Patent: November 23, 1999
    Assignee: Matsushita Electric Industrial Co., Ltd.
    Inventors: Michael Galler, Jean-Claude Junqua
  • Patent number: 5946649
    Abstract: The present invention eliminates injection noise in speech produced by esophageal speakers. A speech input signal is digitized. One copy of the digitized signal is used for analysis and the other is passed through a gain switch to an amplifier as output. A Fast Fourier Transform and a mean value of the digitized speech input signal is calculated. The Fast Fourier Transform (FFT) is passed through a morphological filter to produce a filtered spectrum. An occurrence of injection noise is detected by calculating a derivative of the filtered spectrum and determining from the mean value and the derivative a location and value of a largest peak and a second largest peak in the filtered spectrum. If the largest peak is lower in frequency than the second largest peak, and if all points above 2 KHz are less than the mean, then an occurrence of injection noise has been detected.
    Type: Grant
    Filed: April 16, 1997
    Date of Patent: August 31, 1999
    Assignee: Technology Research Association of Medical Welfare Apparatus
    Inventors: Hector Raul Javkin, Michael Galler, Nancy Niedzielski, Robert Boman
  • Patent number: 5890111
    Abstract: Injection noise and silence are detected in an input speech signal and an external amplifier is switched on or off based on the detected injection noise or silence. The input speech signal is digitized and a first copy of the digitized signal is preemphasized. After the input speech signal is preemphasized, a predetermined number of Mel-frequency cepstral coefficients (MFCCs) and difference cepstra are calculated for each window of the speech signal. A measure of signal energy and a measure of the rate of change of the signal energy is computed. A second copy of the digitized input speech signal is processed using amplitude summation or by differencing a center-clipped signal. The measures of signal energy, rate of change of the signal energy, the Mel coefficients, difference cepstra, and either the amplitude summation value or the differenced value are combined to form an observation vector.
    Type: Grant
    Filed: December 24, 1996
    Date of Patent: March 30, 1999
    Assignee: Technology Research Association of Medical Welfare Apparatus
    Inventors: Hector Raul Javkin, Michael Galler, Nancy Niedzielski
  • Patent number: 5799065
    Abstract: The call routing device plugs into existing extensions of the office telephone network or PBX system and acts as a "virtual" operator, prompting incoming callers to spell the name of the desired recipient. The speech recognizer uses a multipass procedure employing Hidden Markov Models and dynamic programming. The N-best hypotheses are propagated between passes, allowing the more computationally costly routines to be reserved until the final pass, when the size of the search space is significantly reduced. The routing device prompts the user to confirm that the selected name is correct, whereupon the device signals the telephone network to automatically switch the incoming call to the telephone extension of the selected recipient.
    Type: Grant
    Filed: May 6, 1996
    Date of Patent: August 25, 1998
    Assignee: Matsushita Electric Industrial Co., Ltd.
    Inventors: Jean-Claude Junqua, Michael Galler