Patents by Inventor Mark A. Fanty

Mark A. Fanty has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 10832682
    Abstract: The method comprises receiving first audio comprising speech from a user of a computing device, detecting an end of speech in the first audio, generating an ASR result based, at least in part, on a portion of the first audio prior to the detected end of speech, determining whether a valid action can be performed by a speech-enabled application installed on the computing device using the ASR result, and processing second audio when it is determined that a valid action cannot be performed by the speech-enabled application using the ASR result.
    Type: Grant
    Filed: February 6, 2020
    Date of Patent: November 10, 2020
    Assignee: Nuance Communications, Inc.
    Inventor: Mark Fanty
  • Publication number: 20200175990
    Abstract: The method comprises receiving first audio comprising speech from a user of a computing device, detecting an end of speech in the first audio, generating an ASR result based, at least in part, on a portion of the first audio prior to the detected end of speech, determining whether a valid action can be performed by a speech-enabled application installed on the computing device using the ASR result, and processing second audio when it is determined that a valid action cannot be performed by the speech-enabled application using the ASR result.
    Type: Application
    Filed: February 6, 2020
    Publication date: June 4, 2020
    Applicant: Nuance Communications, Inc.
    Inventor: Mark Fanty
  • Patent number: 10559303
    Abstract: The method comprises receiving first audio comprising speech from a user of a computing device, detecting an end of speech in the first audio, generating an ASR result based, at least in part, on a portion of the first audio prior to the detected end of speech, determining whether a valid action can be performed by a speech-enabled application installed on the computing device using the ASR result, and processing second audio when it is determined that a valid action cannot be performed by the speech-enabled application using the ASR result.
    Type: Grant
    Filed: May 23, 2016
    Date of Patent: February 11, 2020
    Assignee: Nuance Communications, Inc.
    Inventor: Mark Fanty
  • Patent number: 10403272
    Abstract: An intelligent assistant can be used to facilitate an end user's participation in a virtual meeting. The intelligent assistant can receive recognized speech from a speech recognition engine, and then forward the recognized speech to a natural language understanding engine which in turn ascribes a meaning to the recognized speech. The ascribed meaning can include a semantic representation of the recognized speech and thereby a semantic representation of voice commands enshrined within the recognized speech. Using the ascribed meaning, the intelligent assistant then searches for meeting information associated with the virtual meeting and uses that meeting information to join the end user to the virtual meeting. Meeting information can include a conference call number, a participant code, a moderator code or a link to an online meeting.
    Type: Grant
    Filed: March 7, 2014
    Date of Patent: September 3, 2019
    Assignee: Nuance Communications, Inc.
    Inventors: Mark Fanty, Simona Gandrabur
  • Publication number: 20180174582
    Abstract: The method comprises receiving first audio comprising speech from a user of a computing device, detecting an end of speech in the first audio, generating an ASR result based, at least in part, on a portion of the first audio prior to the detected end of speech, determining whether a valid action can be performed by a speech-enabled application installed on the computing device using the ASR result, and processing second audio when it is determined that a valid action cannot be performed by the speech-enabled application using the ASR result.
    Type: Application
    Filed: May 23, 2016
    Publication date: June 21, 2018
    Applicant: Nuance Communications, Inc.
    Inventor: Mark Fanty
  • Patent number: 9666192
    Abstract: Methods and apparatus for reducing latency in speech recognition applications. The method comprises receiving first audio comprising speech from a user of a computing device, detecting an end of speech in the first audio, generating an ASR result based, at least in part, on a portion of the first audio prior to the detected end of speech, determining whether a valid action can be performed by a speech-enabled application installed on the computing device using the ASR result, and processing second audio when it is determined that a valid action cannot be performed by the speech-enabled application using the ASR result.
    Type: Grant
    Filed: May 26, 2015
    Date of Patent: May 30, 2017
    Assignee: Nuance Communications, Inc.
    Inventor: Mark Fanty
  • Publication number: 20160351196
    Abstract: Methods and apparatus for reducing latency in speech recognition applications. The method comprises receiving first audio comprising speech from a user of a computing device, detecting an end of speech in the first audio, generating an ASR result based, at least in part, on a portion of the first audio prior to the detected end of speech, determining whether a valid action can be performed by a speech-enabled application installed on the computing device using the ASR result, and processing second audio when it is determined that a valid action cannot be performed by the speech-enabled application using the ASR result.
    Type: Application
    Filed: May 26, 2015
    Publication date: December 1, 2016
    Applicant: Nuance Communications, Inc.
    Inventor: Mark Fanty
  • Patent number: 9183843
    Abstract: Techniques for combining the results of multiple recognizers in a distributed speech recognition architecture. Speech data input to a client device is encoded and processed both locally and remotely by different recognizers configured to be proficient at different speech recognition tasks. The client/server architecture is configurable to enable network providers to specify a policy directed to a trade-off between reducing recognition latency perceived by a user and usage of network resources. The results of the local and remote speech recognition engines are combined based, at least in part, on logic stored by one or more components of the client/server architecture. An indication of the availability of the remote speech recognition to perform speech recognition at a point in time may be provided to a user of the client device via a user interface of the client device.
    Type: Grant
    Filed: January 22, 2013
    Date of Patent: November 10, 2015
    Assignee: Nuance Communications, Inc.
    Inventors: Mark Fanty, Timothy Lynch, Michael J. Newman, Anthony Gillet, David Mark Krowitz, Michael D. Edgington
  • Patent number: 8990091
    Abstract: In one embodiment, a method comprises classifying a representation of audio data of a dialog turn in a dialog system to a classification. The method may further comprise taking a security action on the classified representation of the audio data of the dialog turn as a function of the classification. The security action can be suppressing the representation of the audio data, encrypting the representation of the audio data, releasing the representation of the audio data, partially suppressing the representation of the audio data, partially encrypting the representation of the audio data, partially releasing the representation of the audio data, or a command.
    Type: Grant
    Filed: July 27, 2012
    Date of Patent: March 24, 2015
    Assignee: Nuance Communications, Inc.
    Inventors: Solomon Z. Lerner, Mark Fanty
  • Publication number: 20140032219
    Abstract: In one embodiment, a method comprises classifying a representation of audio data of a dialog turn in a dialog system to a classification. The method may further comprise taking a security action on the classified representation of the audio data of the dialog turn as a function of the classification. The security action can be suppressing the representation of the audio data, encrypting the representation of the audio data, releasing the representation of the audio data, partially suppressing the representation of the audio data, partially encrypting the representation of the audio data, partially releasing the representation of the audio data, or a command.
    Type: Application
    Filed: July 27, 2012
    Publication date: January 30, 2014
    Inventors: Solomon Z. Lerner, Mark Fanty
  • Patent number: 6789062
    Abstract: A telephone-based interactive speech recognition system is retrained using variable weighting and incremental retraining. Variable weighting involves changing the relative influence of particular measurement data to be reflected in a statistical model. Statistical model data is determined based upon an initial set of measurement data determined from an initial set of speech utterances. When new statistical model data is to be generated to reflect new measurement data determined from new speech utterances, a weighting factor is applied to the new measurement data to generate weighted new measurement data. The new statistical model data is then determined based upon the initial set of measurement data and the weighted new measurement data. Incremental retraining involves generating new statistical model data using prior statistical model data to reduce the amount of prior measurement data that must be maintained and processed.
    Type: Grant
    Filed: February 25, 2000
    Date of Patent: September 7, 2004
    Assignee: SpeechWorks International, Inc.
    Inventors: Michael S. Phillips, Krishna K. Govindarajan, Mark Fanty, Etienne Barnard
  • Patent number: 6535851
    Abstract: Phonetic units are identified in a body of utterance data according to a novel segmentation approach. A body of received utterance data is processed and a set of candidate phonetic unit boundaries is determined that defines a set of candidate phonetic units. The set of candidate phonetic unit boundaries is determined based upon changes in Cepstral coefficient values, changes in utterance energy, changes in phonetic classification, broad category analysis (retroflex, back vowels, front vowels) and sonorant onset detection. The set of candidate phonetic unit boundaries is filtered by priority and proximity to other candidate phonetic units and by silence regions. The set of candidate phonetic units is filtered using no-cross region analysis to generate a set of filtered candidate phonetic units. No-cross region analysis generally involves discarding candidate phonetic units that completely span an energy up, energy down, dip or broad category type no-cross region.
    Type: Grant
    Filed: March 24, 2000
    Date of Patent: March 18, 2003
    Assignee: SpeechWorks International, Inc.
    Inventors: Mark Fanty, Michael S. Phillips
  • Patent number: 6405170
    Abstract: A method and apparatus are provided for improving the performance of an interactive speech application. The interactive speech application is developed and deployed for use by one or more callers. During execution, the interactive speech application stores, in a log, event information that describes each task carried out by the interactive speech application in response to interaction with the one or more callers. The application also stores one or more sets of audio information, in which each of the sets of audio information is associated with one or more utterances by one of the callers. Each of the sets of audio information is associated with one of the tasks represented in the log. After the log is established, an analytical report is displayed. The report describes selective actions taken by the interactive speech application while executing, and selective actions taken by one or more callers while interacting with the interactive speech application.
    Type: Grant
    Filed: September 22, 1998
    Date of Patent: June 11, 2002
    Assignee: SpeechWorks International, Inc.
    Inventors: Michael S. Phillips, Mark A. Fanty, Krishna K. Govindarajan
  • Patent number: 6389394
    Abstract: An approach for automatically modifying a pronunciation dictionary in a speech recognition system based on patterns of alternate pronunciations is described. A representation of the pronunciation dictionary, such as a plurality of dynamically linked phoneme values, is obtained. One or more pattern definitions are obtained. The pattern definitions specify zero or more phonemes to be substituted for zero or more phonemes of all words in the pronunciation dictionary. The linked phoneme values are modified by adding, for each path of each word, alternate paths that use each of the substitute phonemes according to the pattern definitions, thereby creating an expanded set of dynamically linked phoneme values. One or more example pronunciations of a particular word are then obtained. One or more best paths through the expanded set of phoneme values are determined for each of the example pronunciations and used to find the overall best path(s).
    Type: Grant
    Filed: February 9, 2000
    Date of Patent: May 14, 2002
    Assignee: SpeechWorks International, Inc.
    Inventor: Mark Fanty
  • Patent number: 5621857
    Abstract: An improved system and method for speaker-independent speech token recognition are described. The system is neural network-based and involves processing a sequence of spoken utterances, e.g. separately articulated letters of a name, to identify the same based upon a highest probability match of each utterance with learned speech tokens, e.g. the letters of the English language alphabet, and based upon a highest probability match of the uttered sequence with a defined utterance library, e.g. a list of names. First, the spoken utterance is digitized or captured and processed into a spectral representation. Second, discrete time frames of the DFT are classified phonetically. Third, the time-frame outputs are used by a modified Viterbi search to locate segment boundaries, near which lies the information that is needed to discriminate letters. Fourth, the segmented or bounded representation is reclassified using such information into individual hypothesized letters.
    Type: Grant
    Filed: December 20, 1991
    Date of Patent: April 15, 1997
    Assignee: Oregon Graduate Institute of Science and Technology
    Inventors: Ronald A. Cole, Mark A. Fanty
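
The variable weighting and incremental retraining described in the abstract for patent 6789062 above can be illustrated with a small sketch. This is not the patented method; it assumes, purely for illustration, that the statistical model is a one-dimensional Gaussian and that the "statistical model data" are running weighted sums. The class name GaussianStats, its fields, and the sample values are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class GaussianStats:
    """Hypothetical running statistics for a 1-D Gaussian model (an assumption)."""
    weight: float = 0.0    # total weighted count of measurements seen so far
    total: float = 0.0     # weighted sum of measurements
    total_sq: float = 0.0  # weighted sum of squared measurements

    def update(self, measurements, weighting_factor=1.0):
        """Fold new measurements into the statistics, scaled by a weighting factor.

        Only the three running sums are kept, so earlier measurements do not
        need to be stored or reprocessed (the incremental part).
        """
        for x in measurements:
            self.weight += weighting_factor
            self.total += weighting_factor * x
            self.total_sq += weighting_factor * x * x

    @property
    def mean(self):
        return self.total / self.weight

    @property
    def variance(self):
        return self.total_sq / self.weight - self.mean ** 2


stats = GaussianStats()
stats.update([1.0, 2.0, 3.0])               # initial set of measurements
stats.update([10.0], weighting_factor=3.0)  # new measurement, given triple influence
print(stats.mean, stats.variance)
```

With a weighting factor of 3, the single new measurement influences the model as if it had been observed three times, while the update itself touches only the stored sums rather than the original measurements.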