Patents by Inventor Girija Yegnanarayanan

Girija Yegnanarayanan has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20130041685
    Abstract: Techniques for presenting alternative hypotheses for medical facts may include identifying, using at least one statistical fact extraction model, a plurality of alternative hypotheses for a medical fact to be extracted from a portion of text documenting a patient encounter. At least two of the alternative hypotheses may be selected, and the selected hypotheses may be presented to a user documenting the patient encounter.
    Type: Application
    Filed: October 12, 2012
    Publication date: February 14, 2013
    Applicant: Nuance Communications, Inc.
    Inventor: Girija Yegnanarayanan
  • Publication number: 20130035961
    Abstract: Techniques for applying user corrections to medical fact extraction may include extracting a first set of one or more medical facts from a first portion of text documenting a patient encounter. A correction to the first set of medical facts may be received from a user. The correction may identify a fact that should be associated with the first portion of the text. A second set of one or more medical facts may be extracted from a second portion of the text based at least in part on the user's correction to the first set of medical facts. Extracting the second set of facts may include extracting one or more facts similar to the identified fact from the second portion of the text.
    Type: Application
    Filed: October 12, 2012
    Publication date: February 7, 2013
    Applicant: Nuance communications, Inc.
    Inventor: Girija Yegnanarayanan
  • Patent number: 8335688
    Abstract: A system is provided for training an acoustic model for use in speech recognition. In particular, such a system may be used to perform training based on a spoken audio stream and a non-literal transcript of the spoken audio stream. Such a system may identify text in the non-literal transcript which represents concepts having multiple spoken forms. The system may attempt to identify the actual spoken form in the audio stream which produced the corresponding text in the non-literal transcript, and thereby produce a revised transcript which more accurately represents the spoken audio stream. The revised, and more accurate, transcript may be used to train the acoustic model, thereby producing a better acoustic model than that which would be produced using conventional techniques, which perform training based directly on the original non-literal transcript.
    Type: Grant
    Filed: August 20, 2004
    Date of Patent: December 18, 2012
    Assignee: Multimodal Technologies, LLC
    Inventors: Girija Yegnanarayanan, Michael Finke, Juergen Fritsch, Detlef Koll, Monika Woszczyna
  • Publication number: 20120245961
    Abstract: An original text that is a representation of a narration of a patient encounter provided by a clinician may be received and re-formatted to produce a formatted text. One or more clinical facts may be extracted from the formatted text. A first fact of the clinical facts may be extracted from a first portion of the formatted text, and the first portion of the formatted text may be a formatted version of a first portion of the original text. A linkage may be maintained between the first fact and the first portion of the original text.
    Type: Application
    Filed: June 8, 2012
    Publication date: September 27, 2012
    Applicant: Nuance Communications, Inc.
    Inventor: Girija Yegnanarayanan
  • Publication number: 20120245926
    Abstract: An original text that is a representation of a narration of a patient encounter provided by a clinician may be received and re-formatted to produce a formatted text. One or more clinical facts may be extracted from the formatted text. A first fact of the clinical facts may be extracted from a first portion of the formatted text, and the first portion of the formatted text may be a formatted version of a first portion of the original text. A linkage may be maintained between the first fact and the first portion of the original text.
    Type: Application
    Filed: June 5, 2012
    Publication date: September 27, 2012
    Applicant: Nuance Communications, Inc.
    Inventors: Frank Montyne, David Decraene, Joeri Van der Vloet, Johan Raedemaeker, Ignace Desimpel, Frederik Coppens, Tom Deray, James R. Flanagan, Mariana Casella dos Santos, Marnix Holvoet, Maria van Gurp, David Hellman, Girija Yegnanarayanan, Karen Anne Doyle
  • Publication number: 20120212337
    Abstract: An original text that is a representation of a narration of a patient encounter provided by a clinician may be received and re-formatted to produce a formatted text. One or more clinical facts may be extracted from the formatted text. A first fact of the clinical facts may be extracted from a first portion of the formatted text, and the first portion of the formatted text may be a formatted version of a first portion of the original text. A linkage may be maintained between the first fact and the first portion of the original text.
    Type: Application
    Filed: February 18, 2011
    Publication date: August 23, 2012
    Applicant: Nuance Communications, Inc.
    Inventors: Frank Montyne, David Decraene, Joeri Van der Vloet, Johan Raedemaeker, Ignace Desimpel, Frederik Coppens, Tom Deray, James R. Flanagan, Mariana Casella dos Santos, Marnix Holvoet, Maria van Gurp, David Hellman, Girija Yegnanarayanan, Karen Anne Doyle
  • Publication number: 20100299135
    Abstract: Techniques are disclosed for automatically generating structured documents based on speech, including identification of relevant concepts and their interpretation. In one embodiment, a structured document generator uses an integrated process to generate a structured textual document (such as a structured textual medical report) based on a spoken audio stream. The spoken audio stream may be recognized using a language model which includes a plurality of sub-models arranged in a hierarchical structure. Each of the sub-models may correspond to a concept that is expected to appear in the spoken audio stream. Different portions of the spoken audio stream may be recognized using different sub-models. The resulting structured textual document may have a hierarchical structure that corresponds to the hierarchical structure of the language sub-models that were used to generate the structured textual document.
    Type: Application
    Filed: May 22, 2009
    Publication date: November 25, 2010
    Inventors: Juergen Fritsch, Michael Finke, Detlef Koll, Monika Woszczyna, Girija Yegnanarayanan
  • Patent number: 7584103
    Abstract: Techniques are disclosed for automatically generating structured documents based on speech, including identification of relevant concepts and their interpretation. In one embodiment, a structured document generator uses an integrated process to generate a structured textual document (such as a structured textual medical report) based on a spoken audio stream. The spoken audio stream may be recognized using a language model which includes a plurality of sub-models arranged in a hierarchical structure. Each of the sub-models may correspond to a concept that is expected to appear in the spoken audio stream. Different portions of the spoken audio stream may be recognized using different sub-models. The resulting structured textual document may have a hierarchical structure that corresponds to the hierarchical structure of the language sub-models that were used to generate the structured textual document.
    Type: Grant
    Filed: August 20, 2004
    Date of Patent: September 1, 2009
    Assignee: Multimodal Technologies, Inc.
    Inventors: Juergen Fritsch, Michael Finke, Detlef Koll, Monika Woszczyna, Girija Yegnanarayanan
  • Publication number: 20090048833
    Abstract: Techniques are disclosed for automatically generating structured documents based on speech, including identification of relevant concepts and their interpretation. In one embodiment, a structured document generator uses an integrated process to generate a structured textual document (such as a structured textual medical report) based on a spoken audio stream. The spoken audio stream may be recognized using a language model which includes a plurality of sub-models arranged in a hierarchical structure. Each of the sub-models may correspond to a concept that is expected to appear in the spoken audio stream. Different portions of the spoken audio stream may be recognized using different sub-models. The resulting structured textual document may have a hierarchical structure that corresponds to the hierarchical structure of the language sub-models that were used to generate the structured textual document.
    Type: Application
    Filed: October 17, 2008
    Publication date: February 19, 2009
    Inventors: Juergen Fritsch, Michael Finke, Detlef Koll, Monika Woszczyna, Girija Yegnanarayanan
  • Publication number: 20060074656
    Abstract: A system is provided for training an acoustic model for use in speech recognition. In particular, such a system may be used to perform training based on a spoken audio stream and a non-literal transcript of the spoken audio stream. Such a system may identify text in the non-literal transcript which represents concepts having multiple spoken forms. The system may attempt to identify the actual spoken form in the audio stream which produced the corresponding text in the non-literal transcript, and thereby produce a revised transcript which more accurately represents the spoken audio stream. The revised, and more accurate, transcript may be used to train the acoustic model using discriminative training techniques, thereby producing a better acoustic model than that which would be produced using conventional techniques, which perform training based directly on the original non-literal transcript.
    Type: Application
    Filed: September 16, 2005
    Publication date: April 6, 2006
    Inventors: Lambert Mathias, Girija Yegnanarayanan, Juergen Fritsch
  • Publication number: 20060041427
    Abstract: A system is provided for training an acoustic model for use in speech recognition. In particular, such a system may be used to perform training based on a spoken audio stream and a non-literal transcript of the spoken audio stream. Such a system may identify text in the non-literal transcript which represents concepts having multiple spoken forms. The system may attempt to identify the actual spoken form in the audio stream which produced the corresponding text in the non-literal transcript, and thereby produce a revised transcript which more accurately represents the spoken audio stream. The revised, and more accurate, transcript may be used to train the acoustic model, thereby producing a better acoustic model than that which would be produced using conventional techniques, which perform training based directly on the original non-literal transcript.
    Type: Application
    Filed: August 20, 2004
    Publication date: February 23, 2006
    Inventors: Girija Yegnanarayanan, Michael Finke, Juergen Fritsch, Detlef Koll, Monika Woszczyna
  • Publication number: 20060041428
    Abstract: Techniques are disclosed for automatically generating structured documents based on speech, including identification of relevant concepts and their interpretation. In one embodiment, a structured document generator uses an integrated process to generate a structured textual document (such as a structured textual medical report) based on a spoken audio stream. The spoken audio stream may be recognized using a language model which includes a plurality of sub-models arranged in a hierarchical structure. Each of the sub-models may correspond to a concept that is expected to appear in the spoken audio stream. Different portions of the spoken audio stream may be recognized using different sub-models. The resulting structured textual document may have a hierarchical structure that corresponds to the hierarchical structure of the language sub-models that were used to generate the structured textual document.
    Type: Application
    Filed: August 20, 2004
    Publication date: February 23, 2006
    Inventors: Juergen Fritsch, Michael Finke, Detlef Koll, Monika Woszczyna, Girija Yegnanarayanan
  • Patent number: 6490555
    Abstract: A method of a continuous speech recognition system is given for discriminatively training hidden Markov for a system recognition vocabulary. An input word phrase is converted into a sequence of representative frames. A correct state sequence alignment with the sequence of representative frames is determined, the correct state sequence alignment corresponding to models of words in the input word phrase. A plurality of incorrect recognition hypotheses is determined representing words in the recognition vocabulary that do not correspond to the input word phrase, each hypothesis being a state sequence based on the word models in the acoustic model database. A correct segment of the correct word model state sequence alignment is selected for discriminative training. A frame segment of frames in the sequence of representative frames is determined that corresponds to the correct segment.
    Type: Grant
    Filed: April 5, 2000
    Date of Patent: December 3, 2002
    Assignee: ScanSoft, Inc.
    Inventors: Girija Yegnanarayanan, Vladimir Sejnoha, Ramesh Sarukkai
  • Patent number: 5794196
    Abstract: In the speech recognition system disclosed herein, an input utterance is submitted to both a large vocabulary isolated word speech recognition module and a small vocabulary continuous speech recognition module. The small vocabulary contains command words which can be combined in sequences to define commands to an application program. The two recognition modules generate respective scores for identified large vocabulary models and for sequences of small vocabulary models. The score provided by the continuous speech recognizer is normalized on the basis of the length of the speech input utterance and an arbitration algorithm selects among the candidates identified by the recognition modules. Without requiring the user to switch modes, text is output if a score from the isolated word recognizer is selected and a command is output if a score from the continuous speech recognizer is selected.
    Type: Grant
    Filed: June 24, 1996
    Date of Patent: August 11, 1998
    Assignee: Kurzweil Applied Intelligence, Inc.
    Inventors: Girija Yegnanarayanan, John Armstrong, III, Dong Hsu
  • Patent number: 5386492
    Abstract: Preliminary screening of vocabulary models is provided by successively applying two different high speed distance measuring functions which provide progressively increasing measurement accuracy. Both distance measuring functions utilize subsampled representations of the unknown speech segment and the vocabulary models. The initial screening function achieves very high speed by eliminating certain usual time warping constraints and by precalculating a table of distance values which can be used for all vocabulary models. The second screening function yields improved accuracy in spite of possible endpointing errors by comparing extra frames, preceding and following the presumed unknown word, with noise models appended to each vocabulary model.
    Type: Grant
    Filed: June 29, 1992
    Date of Patent: January 31, 1995
    Assignee: Kurzweil Applied Intelligence, Inc.
    Inventors: Brian H. Wilson, Girija Yegnanarayanan, Vladimir Sejnoha, William F. Ganong