Patents Examined by David Kovacek
  • Patent number: 8175730
    Abstract: In order to analyze an information signal, a significant short-time spectrum is extracted from the information signal, the means for extracting being configured to extract those short-time spectra which come closer to a specific characteristic than other short-time spectra of the information signal. The short-time spectra extracted are then decomposed into component signals using ICA analysis, a component signal spectrum representing a profile spectrum of a tone source which generates a tone corresponding to the characteristic sought. From a sequence of short-time spectra of the information signal and from the profile spectra determined, an amplitude envelope is finally calculated for each profile spectrum, the amplitude envelope indicating how a profile spectrum of a tone source changes overall over time.
    Type: Grant
    Filed: June 30, 2009
    Date of Patent: May 8, 2012
    Assignee: SONY Corporation
    Inventors: Christian Dittmar, Christian Uhle, Jürgen Herre
  • Patent number: 8170873
    Abstract: An approach to comparing events in word spotting, such as comparing putative and reference instances of a keyword, makes use of a set of models of subword units. For each of two acoustic events and for each of a series of times in each of the events, a probability associated with each of the models of the set of subword units is computed. Then, a quantity characterizing a comparison of the two acoustic events, one occurring in each of the two acoustic signals, is computed using the computed probabilities associated with each of the models.
    Type: Grant
    Filed: July 22, 2004
    Date of Patent: May 1, 2012
    Assignee: Nexidia Inc.
    Inventor: Robert W. Morris
  • Patent number: 8108207
    Abstract: Configurations herein provide a language processing mechanism operable to define a machine vocabulary and identify a machine language version of the words that preserves context and identifies the proper definition of the words by identifying and preserving context of a particular set of words, such as a sentence or paragraph. The machine vocabulary includes a definition section for each definition of a word. Each definition section includes a set of one or more definition elements. The definition elements include a predetermined format of definition fields, and each has a corresponding mask indicative of significant definition fields. The set of definition elements corresponding to a particular definition describe the usage of the word in a context matching that particular definition. Each definition element captures a characteristic of the definition according to fuzzy logic such that the definition elements collectively capture the context.
    Type: Grant
    Filed: September 1, 2009
    Date of Patent: January 31, 2012
    Assignee: Artificial Cognition Inc.
    Inventors: George H. Harvey, Donald R. Greenbaum, Charles H. Collins, Charles D. Harvey
  • Patent number: 8078465
    Abstract: Certain aspects and embodiments of the present invention are directed to systems and methods for monitoring and analyzing the language environment and the development of a key child. A key child's language environment and language development can be monitored without placing artificial limitations on the key child's activities or requiring a third party observer. The language environment can be analyzed to identify words, vocalizations, or other noises directed to or spoken by the key child, independent of content. The analysis can include the number of responses between the child and another, such as an adult, and the number of words spoken by the child and/or another, independent of content of the speech. One or more metrics can be determined based on the analysis and provided to assist in improving the language environment and/or tracking language development of the key child.
    Type: Grant
    Filed: January 23, 2008
    Date of Patent: December 13, 2011
    Assignee: LENA Foundation
    Inventors: Terrance Paul, Dongxin Xu, Umit Yapenel, Sharmistha Gray
  • Patent number: 8036893
    Abstract: A system for use in speech recognition includes an acoustic module accessing a plurality of distinct-language acoustic models, each based upon a different language; a lexicon module accessing at least one lexicon model; and a speech recognition output module. The speech recognition output module generates a first speech recognition output using a first model combination that combines one of the plurality of distinct-language acoustic models with the at least one lexicon model. In response to a threshold determination, the speech recognition output module generates a second speech recognition output using a second model combination that combines a different one of the plurality of distinct-language acoustic models with the at least one distinct-language lexicon model.
    Type: Grant
    Filed: July 22, 2004
    Date of Patent: October 11, 2011
    Assignee: Nuance Communications, Inc.
    Inventor: David E. Reich
  • Patent number: 7983906
    Abstract: There is provided a voice activity detection method for indicating an active voice mode and an inactive voice mode. The method comprises receiving a first portion of an input signal; determining that the first portion of the input signal includes an active voice signal; indicating the active voice mode in response to the determining that the first portion of the input signal includes the active voice signal; receiving a second portion of the input signal immediately following the first portion of the input signal; determining that the second portion of the input signal includes an inactive voice signal; extending the indicating the active voice mode for a period of time after determining that the second portion of the input signal includes the inactive voice signal, wherein the period of time varies based on one or more conditions; and indicating the inactive voice mode after expiration of the period of time.
    Type: Grant
    Filed: January 26, 2006
    Date of Patent: July 19, 2011
    Assignee: Mindspeed Technologies, Inc.
    Inventors: Yang Gao, Eyal Shlomot, Adil Benyassine
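The hangover behavior described in this abstract (keeping the active voice mode on for a variable period after speech stops) can be illustrated with a minimal sketch. It is not the patented method: the frame-level boolean decisions, the function names, and the rule that ties the hangover length to the length of the preceding voiced run are all assumptions made for illustration.

```python
def hangover_frames(voiced_run_length):
    # Hypothetical condition: a longer speech burst earns a longer hangover.
    return 3 if voiced_run_length >= 5 else 1


def vad_modes(raw_decisions):
    """Map raw per-frame voice decisions to mode flags (True = active voice mode)."""
    modes = []
    run = 0        # length of the current run of voiced frames
    remaining = 0  # hangover frames still available after speech ends
    for voiced in raw_decisions:
        if voiced:
            run += 1
            remaining = hangover_frames(run)
            modes.append(True)
        else:
            run = 0
            if remaining > 0:
                # Still inside the hangover period: keep indicating active mode.
                remaining -= 1
                modes.append(True)
            else:
                modes.append(False)
    return modes
```

With this rule, a two-frame burst is extended by one frame and a five-frame burst by three frames before the inactive voice mode is indicated.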
  • Patent number: 7966176
    Abstract: A system includes an acoustic input engine configured to accept a speech input, to recognize phonemes of the speech input, and to create word strings based on the recognized phonemes. The system includes a semantic engine coupled to the acoustic engine and operable to identify actions and to identify objects by parsing the word strings. The system also includes an action-object pairing system to identify a dominant entry from the identified actions and the identified objects, to select a complement to the dominant entry from the identified actions and the identified objects, and to form an action-object pair that includes the dominant entry and the complement. The system further includes an action-object routing table operable to provide a routing destination based on the action-object pair. The system also includes a call routing module to route a call to the routing destination.
    Type: Grant
    Filed: October 22, 2009
    Date of Patent: June 21, 2011
    Assignee: AT&T Intellectual Property I, L.P.
    Inventors: Robert R. Bushey, Michael Sabourin, Carl Potvin, Benjamin Anthony Knott, John Mills Martin
  • Patent number: 7962342
    Abstract: In one embodiment, the present invention is directed to reconfiguration of a communication device or other network node based on a determination that a user has a target (altered) physical condition.
    Type: Grant
    Filed: August 22, 2006
    Date of Patent: June 14, 2011
    Assignee: Avaya Inc.
    Inventors: Marc Coughlan, Alexander Forbes, Ciaran Gannon, Peter D. Runcie, Alexander Scholte
  • Patent number: 7925508
    Abstract: In one embodiment, the present invention is directed to a communication device that analyzes received utterances and detects when the speaker has a target physical condition.
    Type: Grant
    Filed: August 22, 2006
    Date of Patent: April 12, 2011
    Assignee: Avaya Inc.
    Inventor: Paul Roller Michaelis
  • Patent number: 7899671
    Abstract: Systems and techniques for analyzing voice recognition results in order to improve efficiency and accuracy of voice recognition. When a voice activated module undertakes a voice recognition attempt, it invokes a voice recognition module that constructs a list of voice recognition results. The list is analyzed by a results postprocessor that employs information relating to past recognition results and user information to make changes to the list. The results postprocessor may delete results that have been previously rejected during a current recognition transaction and may further alter and reorder the results list based on historical results. The results postprocessor may further alter and reorder the results list based on information relating to the user engaging in the recognition attempt.
    Type: Grant
    Filed: February 5, 2004
    Date of Patent: March 1, 2011
    Assignee: Avaya, Inc.
    Inventors: Robert S. Cooper, Derek Sanders, Vladimir Sergeyevich Tokarev
  • Patent number: 7869989
    Abstract: Configurations herein provide a language processing mechanism operable to define a machine vocabulary and identify a machine language version of the words that preserves context and identifies the proper definition of the words by identifying and preserving context of a particular set of words, such as a sentence or paragraph. The machine vocabulary includes a definition section for each definition of a word. Each definition section includes a set of one or more definition elements. The definition elements include a predetermined format of definition fields, and each has a corresponding mask indicative of significant definition fields. The set of definition elements corresponding to a particular definition describe the usage of the word in a context matching that particular definition. Each definition element captures a characteristic of the definition according to fuzzy logic such that the definition elements collectively capture the context.
    Type: Grant
    Filed: January 27, 2006
    Date of Patent: January 11, 2011
    Assignee: Artificial Cognition Inc.
    Inventors: George H. Harvey, Donald R. Greenbaum, Charles H. Collins, Charles D. Harvey
  • Patent number: 7835909
    Abstract: A method and apparatus for normalizing a histogram utilizing a backward cumulative histogram which can cumulate a probability distribution function in an order from a greatest to smallest value so as to estimate a noise robust histogram. A method of normalizing a speech feature vector includes: extracting the speech feature vector from a speech signal; calculating a probability distribution function using the extracted speech feature vector; calculating a backward cumulative distribution function by cumulating the probability distribution function in an order from a largest to smallest value; and normalizing a histogram using the backward cumulative distribution function.
    Type: Grant
    Filed: December 12, 2006
    Date of Patent: November 16, 2010
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: So-Young Jeong, Gil Jin Jang, Kwang Cheol Oh
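As a rough illustration of the backward cumulation step named in this abstract, the sketch below bins a list of scalar feature values, forms a probability distribution, and accumulates it from the largest bin down to the smallest. The bin layout, the function name, and the use of a single scalar feature are assumptions; the subsequent mapping onto a reference distribution is not shown.

```python
def backward_cdf(values, num_bins, lo, hi):
    """Return (pdf, backward_cdf) over equal-width bins spanning [lo, hi)."""
    if not values:
        raise ValueError("values must not be empty")
    width = (hi - lo) / num_bins
    counts = [0] * num_bins
    for v in values:
        # Clamp out-of-range values into the first or last bin.
        idx = int((v - lo) / width)
        idx = max(0, min(idx, num_bins - 1))
        counts[idx] += 1
    pdf = [c / len(values) for c in counts]

    # Accumulate from the largest-valued bin toward the smallest.
    bcdf = [0.0] * num_bins
    running = 0.0
    for i in range(num_bins - 1, -1, -1):
        running += pdf[i]
        bcdf[i] = running
    return pdf, bcdf
```

Entry `i` of the backward cumulative function is the fraction of samples falling in bin `i` or any higher bin, so the first entry is always 1.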
  • Patent number: 7822602
    Abstract: An audio input signal is filtered using an adaptive filter to generate a prediction output signal with reduced noise, wherein the filter is implemented using a plurality of coefficients to generate a plurality of prediction errors and to generate an error from the plurality of prediction errors, wherein the absolute values of the coefficients are continuously reduced by a plurality of reduction parameters.
    Type: Grant
    Filed: August 21, 2006
    Date of Patent: October 26, 2010
    Assignee: Trident Microsystems (Far East) Ltd.
    Inventor: Joern Fischer
  • Patent number: 7788095
    Abstract: A method and apparatus for indexing one or more audio signals using a speech to text engine and a phoneme detection engine, and generating a combined lattice comprising a text part and a phoneme part. A word to be searched is searched for in the text part and, if it is not found or is found with low certainty, it is divided into phonemes and searched for in the phoneme part of the lattice.
    Type: Grant
    Filed: November 18, 2007
    Date of Patent: August 31, 2010
    Assignee: Nice Systems, Ltd.
    Inventors: Moshe Wasserblant, Barak Eilam, Yuval Lubowich, Maor Nissan
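The two-stage lookup in this abstract (text part first, phoneme part as a fallback) can be sketched as follows. This is a simplified illustration only: a dictionary and a flat phoneme list stand in for the combined lattice, and the names `text_index`, `phoneme_seq`, `lexicon`, and the `min_confidence` threshold are hypothetical.

```python
def search(word, text_index, phoneme_seq, lexicon, min_confidence=0.6):
    """Look a word up in the text part, then fall back to the phoneme part.

    text_index:  word -> (position, confidence) from the speech-to-text pass
    phoneme_seq: flat list of detected phonemes (stand-in for a lattice)
    lexicon:     word -> list of phonemes used to split the query word
    """
    hit = text_index.get(word)
    if hit is not None and hit[1] >= min_confidence:
        return ("text", hit[0])

    # Not found, or found with low certainty: divide the word into phonemes.
    phonemes = lexicon.get(word)
    if not phonemes:
        return None
    n = len(phonemes)
    for start in range(len(phoneme_seq) - n + 1):
        if phoneme_seq[start:start + n] == phonemes:
            return ("phoneme", start)
    return None
```

A real lattice holds alternative hypotheses with scores, so an actual phoneme search would score partial matches rather than require an exact subsequence.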
  • Patent number: 7778836
    Abstract: A system and method are disclosed for switching contexts within a spoken dialog between a user and a spoken dialog system. The spoken dialog system utilizes modular subdialogs that are invoked by at least one flow controller that is a finite state model and that is associated with a dialog manager. The spoken dialog system includes a dialog manager with a flow controller and a reusable subdialog module. The method includes, while the spoken dialog is being controlled by the subdialog module that was invoked by the flow controller, receiving context-changing input associated with speech from a user that changes a dialog context and comparing the context-changing input to at least one context shift. If any of the context shifts is activated by the comparing step, control of the spoken dialog is passed to the flow controller with a context shift message and a destination state.
    Type: Grant
    Filed: August 19, 2008
    Date of Patent: August 17, 2010
    Assignee: AT&T Intellectual Property II, L.P.
    Inventors: Giuseppe Di Fabbrizio, Charles Alfred Lewis
  • Patent number: 7751535
    Abstract: A system for implementing voice services can include at least one virtual machine, such as a Java 2 Enterprise Edition (J2EE) virtual machine. The virtual machine can include a bean container for handling software beans, such as Enterprise Java Beans. The bean container can include a voice browser bean. The voice browser bean can include a VoiceXML browser.
    Type: Grant
    Filed: April 28, 2004
    Date of Patent: July 6, 2010
    Assignee: Nuance Communications, Inc.
    Inventors: Thomas E. Creamer, Victor S. Moore, Wendi L. Nusbickel, Ricardo Dos Santos, James J. Sliwa
  • Patent number: 7742580
    Abstract: Systems and techniques for improved user prompting. A system according to one aspect of the invention includes a central server hosting various modules providing services to users. The modules suitably employ voice recognition in order to interpret user inputs. Each module has access to user information that includes information indicating the user's experience with each function of each module. When a module needs to issue a prompt to the user, it retrieves and examines the user information to determine the user's experience with the module and function. Suitably, each module is operative to categorize a user as belonging to an experience category, such as novice, intermediate and expert based on the user's level of experience with the function. The module selects a prompt associated with the user's level of experience with the function and presents it to the user.
    Type: Grant
    Filed: February 5, 2004
    Date of Patent: June 22, 2010
    Assignee: Avaya, Inc.
    Inventors: Robert S. Cooper, Derek Sanders, Vladimir Sergeyevich Tokarev
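The experience-based prompt selection described in this abstract can be sketched in a few lines. The usage-count thresholds, the shape of the user information, and all names below are assumptions made for illustration; the abstract only states that users are categorized (for example as novice, intermediate, or expert) per function and that a matching prompt is chosen.

```python
def experience_category(use_count):
    # Hypothetical thresholds; the abstract does not specify any.
    if use_count < 3:
        return "novice"
    if use_count < 10:
        return "intermediate"
    return "expert"


def select_prompt(user_info, module, function, prompts):
    """Pick the prompt matching the user's experience with one function.

    user_info: (module, function) -> number of past uses by this user
    prompts:   category name -> prompt text for this function
    """
    uses = user_info.get((module, function), 0)
    return prompts[experience_category(uses)]
```

A user with no recorded history for a function falls into the novice category and receives the most detailed prompt.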
  • Patent number: 7698126
    Abstract: Embodiments of a localization system are disclosed. In one embodiment, a plurality of localization components provide localized data that is localized to one or more distinct markets. A translation matching component receives a localization request corresponding to input data to be localized. The translation matching component accesses the plurality of localization components based on the localization request. The translation matching component selects and outputs localized data from one or more of the plurality of localization components based on pre-determined criteria. In one embodiment, the translation matching component selects the localized data based on a time required to obtain the localized data. In another embodiment, the localization components provide confidence scores associated with the localized data, the translation matching component selecting the localized data based on the confidence scores.
    Type: Grant
    Filed: April 29, 2005
    Date of Patent: April 13, 2010
    Assignee: Microsoft Corporation
    Inventors: Bernhard Kohlmeier, Lori A. Brownell, Wei Wu, Shenghua (Ed) Ye, Jordi Mola Marti, Jan Anders Nelson, Mohammed El-Gammal, Julie D. Bennett
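One of the embodiments in this abstract selects localized data by confidence score. A minimal sketch of that selection, assuming each localization component is a callable that returns either `None` or a `(localized_text, confidence)` pair (an interface invented here for illustration), might look like this:

```python
def select_localized(text, components):
    """Query every component and return the highest-confidence result."""
    best = None
    for component in components:
        result = component(text)
        if result is None:
            continue  # this component has nothing for the input
        localized, confidence = result
        if best is None or confidence > best[1]:
            best = (localized, confidence)
    return best[0] if best is not None else None
```

The time-based embodiment mentioned in the same abstract would instead rank candidates by how long each component takes to respond, which this sketch does not model.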
  • Patent number: 7684978
    Abstract: The present invention overcomes problems of the tandem coding method, such as degraded speech quality and increased system latency and computation. An apparatus for trans-coding between code excited linear prediction (CELP) type codecs with different bandwidths includes: a formant parameter translating unit for generating output formant parameters by translating formant parameters from input CELP format to output CELP format; a formant parameter quantizing unit for receiving the output format formant parameters and quantizing the output format formant filter coefficients; an excitation parameter translating unit for generating output excitation parameters by translating excitation parameters from input CELP format to output CELP format; and an excitation quantizing unit for receiving the output format excitation parameters and quantizing the output format excitation parameters.
    Type: Grant
    Filed: October 30, 2003
    Date of Patent: March 23, 2010
    Assignee: Electronics and Telecommunications Research Institute
    Inventors: Jongmo Sung, Sang Taick Park, Do Young Kim, Bong Tae Kim
  • Patent number: 7680659
    Abstract: A method of training language model parameters trains discriminative model parameters in the language model based on a performance measure having discrete values.
    Type: Grant
    Filed: June 1, 2005
    Date of Patent: March 16, 2010
    Assignee: Microsoft Corporation
    Inventors: Jianfeng Gao, Hisami Suzuki