Patents by Inventor Bernardo Rechea
Bernardo Rechea has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20110320189Abstract: A system and method for filtering documents to determine section boundaries between dictated and non-dictated text. The system and method identifies portions of a text report that correspond to an original dictation and, correspondingly, those portions that are not part of the original dictation. The system and method include comparing tokenized and normalized forms of the original dictation and the final report, determining mismatches between the two forms, and applying machine-learning techniques to identify document headers, footers, page turns, macros, and lists automatically and accurately.Type: ApplicationFiled: September 9, 2011Publication date: December 29, 2011Applicant: Dictaphone CorporationInventors: Alwin B. Carus, Larissa Lapshina, Bernardo Rechea
-
Patent number: 8036889Abstract: A system and method for filtering documents to determine section boundaries between dictated and non-dictated text. The system and method identifies portions of a text report that correspond to an original dictation and, correspondingly, those portions that are not part of the original dictation. The system and method include comparing tokenized and normalized forms of the original dictation and the final report, determining mismatches between the two forms, and applying machine-learning techniques to identify document headers, footers, page turns, macros, and lists automatically and accurately.Type: GrantFiled: February 27, 2006Date of Patent: October 11, 2011Assignee: Nuance Communications, Inc.Inventors: Alwin B. Carus, Larissa Lapshina, Bernardo Rechea
-
Patent number: 7937263Abstract: The present invention pertains to a system and method for the tokenization of text. The featurizer may be configured to receive input text and convert the input text into tokens. According to one aspect of the invention, the tokens may include only one type of character, the characters selected from the group consisting of letters, numbers, and punctuation. The tokenizer may also include a classifier. The classifier may be configured to receive the tokens from the featurizer. Furthermore, the classifier may be configured to analyze the tokens received from the featurizer to determine if the tokens may be input into a predetermined classification model using a preclassifier. If one of the tokens passes the preclassifier, then the token is classified using the predetermined classification model. Additionally, according to a first aspect of the invention, the tokenizer may also include a finalizer. The finalizer may be configured to receive the tokens and may be configured to produce a final output.Type: GrantFiled: December 1, 2004Date of Patent: May 3, 2011Assignee: Dictaphone CorporationInventors: Jill Carrier, Alwin B. Carus, William F. Cote, John Dowd, Kathryn Del La Femina, Alan Frankel, Wensheng(Vincent) Han, Larissa Lapshina, Bernardo Rechea, Ana Santisteban, Amy J. Uhrbach
-
Patent number: 7565282Abstract: A method for adaptive automatic error and mismatch correction is disclosed for use with a system having an automatic error and mismatch correction learning module, an automatic error and mismatch correction model, and a classifier module. The learning module operates by receiving pairs of documents, identifying and selecting effective candidate errors and mismatches, and generating classifiers corresponding to these selected errors and mismatches. The correction model operates by receiving a string of interpreted speech into the automatic error and mismatch correction module, identifying target tokens in the string of interpreted speech, creating a set of classifier features according to requirements of the automatic error and mismatch correction model, comparing the target tokens against the classifier features to detect errors and mismatches in the string of interpreted speech, and modifying the string of interpreted speech based upon the classifier features.Type: GrantFiled: April 14, 2005Date of Patent: July 21, 2009Assignee: Dictaphone CorporationInventors: Alwin B Carus, Larissa Lapshina, Bernardo Rechea, Amy J. Uhrbach
-
Publication number: 20070203707Abstract: A system and method for filtering documents to determine section boundaries between dictated and non-dictated text. The system and method identifies portions of a text report that correspond to an original dictation and, correspondingly, those portions that are not part of the original dictation. The system and method include comparing tokenized and normalized forms of the original dictation and the final report, determining mismatches between the two forms, and applying machine-learning techniques to identify document headers, footers, page turns, macros, and lists automatically and accurately.Type: ApplicationFiled: February 27, 2006Publication date: August 30, 2007Applicant: Dictaphone CorporationInventors: Alwin Carus, Larissa Lapshina, Bernardo Rechea
-
Publication number: 20060235687Abstract: A method for adaptive automatic error and mismatch correction is disclosed for use with a system having an automatic error and mismatch correction learning module, an automatic error and mismatch correction model, and a classifier module. The learning module operates by receiving pairs of documents, identifying and selecting effective candidate errors and mismatches, and generating classifiers corresponding to these selected errors and mismatches. The correction model operates by receiving a string of interpreted speech into the automatic error and mismatch correction module, identifying target tokens in the string of interpreted speech, creating a set of classifier features according to requirements of the automatic error and mismatch correction model, comparing the target tokens against the classifier features to detect errors and mismatches in the string of interpreted speech, and modifying the string of interpreted speech based upon the classifier features.Type: ApplicationFiled: April 14, 2005Publication date: October 19, 2006Applicant: Dictaphone CorporationInventors: Alwin Carus, Larissa Lapshina, Bernardo Rechea, Amy Uhrbach
-
Publication number: 20060116862Abstract: The present invention pertains to a system and method for the tokenization of text. The featurizer may be configured to receive input text and convert the input text into tokens. According to one aspect of the invention, the tokens may include only one type of character, the characters selected from the group consisting of letters, numbers, and punctuation. The tokenizer may also include a classifier. The classifier may be configured to receive the tokens from the featurizer. Furthermore, the classifier may be configured to analyze the tokens received from the featurizer to determine if the tokens may be input into a predetermined classification model using a preclassifier. If one of the tokens passes the preclassifier, then the token is classified using the predetermined classification model. Additionally, according to a first aspect of the invention, the tokenizer may also include a finalizer. The finalizer may be configured to receive the tokens and may be configured to produce a final output.Type: ApplicationFiled: December 1, 2004Publication date: June 1, 2006Applicant: Dictaphone CorporationInventors: Jill Carrier, Alwin Carus, William Cote, John Dowd, Kathryn Femina, Alan Frankel, Wensheng Han, Larissa Lapshina, Bernardo Rechea, Ana Santisteban, Amy Uhrbach