Patents by Inventor Larissa Lapshina

Larissa Lapshina has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20070233488
    Abstract: The invention involves the loading and unloading of dynamic section grammars and language models in a speech recognition system. The values of the sections of the structured document are either determined in advance from a collection of documents of the same domain, document type, and speaker; or collected incrementally from documents of the same domain, document type, and speaker; or added incrementally to an already existing set of values. Speech recognition in the context of the given field is constrained to the contents of these dynamic values. If speech recognition fails or produces a poor match within this grammar or section language model, speech recognition against a larger, more general vocabulary that is not constrained to the given section is performed.
    Type: Application
    Filed: March 29, 2006
    Publication date: October 4, 2007
    Applicant: Dictaphone Corporation
    Inventors: Alwin Carus, Larissa Lapshina, Raghu Vemula
  • Publication number: 20070203707
    Abstract: A system and method for filtering documents to determine section boundaries between dictated and non-dictated text. The system and method identifies portions of a text report that correspond to an original dictation and, correspondingly, those portions that are not part of the original dictation. The system and method include comparing tokenized and normalized forms of the original dictation and the final report, determining mismatches between the two forms, and applying machine-learning techniques to identify document headers, footers, page turns, macros, and lists automatically and accurately.
    Type: Application
    Filed: February 27, 2006
    Publication date: August 30, 2007
    Applicant: Dictaphone Corporation
    Inventors: Alwin Carus, Larissa Lapshina, Bernardo Rechea
  • Publication number: 20060235687
    Abstract: A method for adaptive automatic error and mismatch correction is disclosed for use with a system having an automatic error and mismatch correction learning module, an automatic error and mismatch correction model, and a classifier module. The learning module operates by receiving pairs of documents, identifying and selecting effective candidate errors and mismatches, and generating classifiers corresponding to these selected errors and mismatches. The correction model operates by receiving a string of interpreted speech into the automatic error and mismatch correction module, identifying target tokens in the string of interpreted speech, creating a set of classifier features according to requirements of the automatic error and mismatch correction model, comparing the target tokens against the classifier features to detect errors and mismatches in the string of interpreted speech, and modifying the string of interpreted speech based upon the classifier features.
    Type: Application
    Filed: April 14, 2005
    Publication date: October 19, 2006
    Applicant: Dictaphone Corporation
    Inventors: Alwin Carus, Larissa Lapshina, Bernardo Rechea, Amy Uhrbach
  • Publication number: 20060116862
    Abstract: The present invention pertains to a system and method for the tokenization of text. The featurizer may be configured to receive input text and convert the input text into tokens. According to one aspect of the invention, the tokens may include only one type of character, the characters selected from the group consisting of letters, numbers, and punctuation. The tokenizer may also include a classifier. The classifier may be configured to receive the tokens from the featurizer. Furthermore, the classifier may be configured to analyze the tokens received from the featurizer to determine if the tokens may be input into a predetermined classification model using a preclassifier. If one of the tokens passes the preclassifier, then the token is classified using the predetermined classification model. Additionally, according to a first aspect of the invention, the tokenizer may also include a finalizer. The finalizer may be configured to receive the tokens and may be configured to produce a final output.
    Type: Application
    Filed: December 1, 2004
    Publication date: June 1, 2006
    Applicant: Dictaphone Corporation
    Inventors: Jill Carrier, Alwin Carus, William Cote, John Dowd, Kathryn Femina, Alan Frankel, Wensheng Han, Larissa Lapshina, Bernardo Rechea, Ana Santisteban, Amy Uhrbach
  • Publication number: 20060026003
    Abstract: A system and method is disclosed for Report Confidence Modeling (RCM) including automatic adaptive classification of ASR output documents to determine the most efficient document edit workflow to convert dictation into finished output. The RCM according to the present invention may include a mechanism to predict recognition accuracy of a document generated by an ASR engine. Predicted accuracy of the document allows an ASR application to sort recognized documents based on their estimated accuracy or quality and route them appropriately for further processing, editing and/or formatting.
    Type: Application
    Filed: July 28, 2005
    Publication date: February 2, 2006
    Inventors: Alwin Carus, Larissa Lapshina, Elizabeth Lovance