Patents Assigned to Multimodal Technologies, LLC

Document Editing Using Anchors

Publication number: 20150112677

Abstract: A user edits text in a draft document by providing input including left and right “anchor” text and replacement text. In response, a document editing system identifies an instance of the left anchor text followed by the right anchor text in the draft document, and replaces text between these instances with the replacement text specified by the user. For example, the user may type a string containing the left anchor text followed by the replacement text followed by the right anchor text, in response to which the system may perform the replacement just described. As a result, the user may specify both the location of, and a correction for, text in the draft document without using cursor keys or other navigation commands to navigate to the location of the text to be corrected, thereby increasing correction efficiency by avoiding the delay associated with such manual navigation.

Type: Application

Filed: December 30, 2014

Publication date: April 23, 2015

Applicant: MULTIMODAL TECHNOLOGIES, LLC

Inventor: Kjell Schubert
Decoding-Time Prediction of Non-Verbalized Tokens

Publication number: 20150095025

Abstract: Non-verbalized tokens, such as punctuation, are automatically predicted and inserted into a transcription of speech in which the tokens were not explicitly verbalized. Token prediction may be integrated with speech decoding, rather than performed as a post-process to speech decoding.

Type: Application

Filed: December 16, 2014

Publication date: April 2, 2015

Applicant: Multimodal Technologies, LLC

Inventors: Juergen Fritsch, Anoop Deoras, Detlef Koll
Document editing using anchors

Patent number: 8959433

Abstract: A user edits text in a draft document by providing input including left and right “anchor” text and replacement text. In response, a document editing system identifies an instance of the left anchor text followed by the right anchor text in the draft document, and replaces text between these instances with the replacement text specified by the user. For example, the user may type a string containing the left anchor text followed by the replacement text followed by the right anchor text, in response to which the system may perform the replacement just described. As a result, the user may specify both the location of, and a correction for, text in the draft document without using cursor keys or other navigation commands to navigate to the location of the text to be corrected, thereby increasing correction efficiency by avoiding the delay associated with such manual navigation.

Type: Grant

Filed: August 19, 2007

Date of Patent: February 17, 2015

Assignee: Multimodal Technologies, LLC

Inventor: Kjell Schubert
Decoding-time prediction of non-verbalized tokens

Patent number: 8918317

Abstract: Non-verbalized tokens, such as punctuation, are automatically predicted and inserted into a transcription of speech in which the tokens were not explicitly verbalized. Token prediction may be integrated with speech decoding, rather than performed as a post-process to speech decoding.

Type: Grant

Filed: September 25, 2009

Date of Patent: December 23, 2014

Assignee: Multimodal Technologies, LLC

Inventors: Juergen Fritsch, Anoop Deoras, Detlef Koll
Content-based audio playback emphasis

Patent number: 8768706

Abstract: Techniques are disclosed for facilitating the process of proofreading draft transcripts of spoken audio streams. In general, proofreading of a draft transcript is facilitated by playing back the corresponding spoken audio stream with an emphasis on those regions in the audio stream that are highly relevant or likely to have been transcribed incorrectly. Regions may be emphasized by, for example, playing them back more slowly than regions that are of low relevance and likely to have been transcribed correctly. Emphasizing those regions of the audio stream that are most important to transcribe correctly and those regions that are most likely to have been transcribed incorrectly increases the likelihood that the proofreader will accurately correct any errors in those regions, thereby improving the overall accuracy of the transcript.

Type: Grant

Filed: August 20, 2010

Date of Patent: July 1, 2014

Assignee: Multimodal Technologies, LLC

Inventors: Kjell Schubert, Juergen Fritsch, Michael Finke, Detlef Koll
Applying Service Levels to Transcripts

Publication number: 20140039880

Abstract: Speech is transcribed to produce a draft transcript of the speech. Portions of the transcript having a high priority are identified. For example, particular sections of the transcript may be identified as high-priority sections. As another example, portions of the transcript requiring human verification may be identified as high-priority sections. High-priority portions of the transcript are verified at a first time, without verifying other portions of the transcript. Such other portions may or may not be verified at a later time. Limiting verification, either initially or entirely, to high-priority portions of the transcript limits the time required to perform such verification, thereby making it feasible to verify the most important portions of the transcript at an early stage without introducing an undue delay into the transcription process. Verifying the other portions of the transcript later ensures that early verification of the high-priority portions does not sacrifice overall verification accuracy.

Type: Application

Filed: October 11, 2013

Publication date: February 6, 2014

Applicant: Multimodal Technologies, LLC

Inventors: Detlef Koll, Michael Finke
Applying service levels to transcripts

Patent number: 8560314

Abstract: Speech is transcribed to produce a draft transcript of the speech. Portions of the transcript having a high priority are identified. For example, particular sections of the transcript may be identified as high-priority sections. As another example, portions of the transcript requiring human verification may be identified as high-priority sections. High-priority portions of the transcript are verified at a first time, without verifying other portions of the transcript. Such other portions may or may not be verified at a later time. Limiting verification, either initially or entirely, to high-priority portions of the transcript limits the time required to perform such verification, thereby making it feasible to verify the most important portions of the transcript at an early stage without introducing an undue delay into the transcription process. Verifying the other portions of the transcript later ensures that early verification of the high-priority portions does not sacrifice overall verification accuracy.

Type: Grant

Filed: June 21, 2007

Date of Patent: October 15, 2013

Assignee: Multimodal Technologies, LLC

Inventors: Detlef Koll, Michael Finke
Discriminative Training of Document Transcription System

Publication number: 20130166297

Abstract: A system is provided for training an acoustic model for use in speech recognition. In particular, such a system may be used to perform training based on a spoken audio stream and a non-literal transcript of the spoken audio stream. Such a system may identify text in the non-literal transcript which represents concepts having multiple spoken forms. The system may attempt to identify the actual spoken form in the audio stream which produced the corresponding text in the non-literal transcript, and thereby produce a revised transcript which more accurately represents the spoken audio stream. The revised, and more accurate, transcript may be used to train the acoustic model using discriminative training techniques, thereby producing a better acoustic model than that which would be produced using conventional techniques, which perform training based directly on the original non-literal transcript.

Type: Application

Filed: February 22, 2013

Publication date: June 27, 2013

Applicant: MULTIMODAL TECHNOLOGIES, LLC

Inventor: MULTIMODAL TECHNOLOGIES, LLC
Document Transcription System Training

Publication number: 20130103400

Abstract: A system is provided for training an acoustic model for use in speech recognition. In particular, such a system may be used to perform training based on a spoken audio stream and a non-literal transcript of the spoken audio stream. Such a system my identify text in the non-literal transcript which represents concepts having multiple spoken forms. The system may attempt to identify the actual spoken form in the audio stream which produced the corresponding text in the non-literal transcript, and thereby produce a revised transcript which more accurately represents the spoken audio stream. The revised, and more accurate, transcript may be used to train the acoustic model, thereby producing a better acoustic model than that which would be produced using conventional techniques, which perform training based directly on the original non-literal transcript.

Type: Application

Filed: November 30, 2012

Publication date: April 25, 2013

Applicant: MULTIMODAL TECHNOLOGIES, LLC

Inventor: MULTIMODAL TECHNOLOGIES, LLC
Discriminative training of document transcription system

Patent number: 8412521

Abstract: A system is provided for training an acoustic model for use in speech recognition. In particular, such a system may be used to perform training based on a spoken audio stream and a non-literal transcript of the spoken audio stream. Such a system may identify text in the non-literal transcript which represents concepts having multiple spoken forms. The system may attempt to identify the actual spoken form in the audio stream which produced the corresponding text in the non-literal transcript, and thereby produce a revised transcript which more accurately represents the spoken audio stream. The revised, and more accurate, transcript may be used to train the acoustic model using discriminative training techniques, thereby producing a better acoustic model than that which would be produced using conventional techniques, which perform training based directly on the original non-literal transcript.

Type: Grant

Filed: September 16, 2005

Date of Patent: April 2, 2013

Assignee: Multimodal Technologies, LLC

Inventors: Lambert Mathias, Girija Yegnanarayanan, Juergen Fritsch
Document transcription system training

Patent number: 8335688

Abstract: A system is provided for training an acoustic model for use in speech recognition. In particular, such a system may be used to perform training based on a spoken audio stream and a non-literal transcript of the spoken audio stream. Such a system may identify text in the non-literal transcript which represents concepts having multiple spoken forms. The system may attempt to identify the actual spoken form in the audio stream which produced the corresponding text in the non-literal transcript, and thereby produce a revised transcript which more accurately represents the spoken audio stream. The revised, and more accurate, transcript may be used to train the acoustic model, thereby producing a better acoustic model than that which would be produced using conventional techniques, which perform training based directly on the original non-literal transcript.

Type: Grant

Filed: August 20, 2004

Date of Patent: December 18, 2012

Assignee: Multimodal Technologies, LLC

Inventors: Girija Yegnanarayanan, Michael Finke, Juergen Fritsch, Detlef Koll, Monika Woszczyna
Verification of extracted data

Patent number: 8321199

Abstract: Facts are extracted from speech and recorded in a document using codings. Each coding represents an extracted fact and includes a code and a datum. The code may represent a type of the extracted fact and the datum may represent a value of the extracted fact. The datum in a coding is rendered based on a specified feature of the coding. For example, the datum may be rendered as boldface text to indicate that the coding has been designated as an “allergy.” In this way, the specified feature of the coding (e.g., “allergy”-ness) is used to modify the manner in which the datum is rendered. A user inspects the rendering and provides, based on the rendering, an indication of whether the coding was accurately designated as having the specified feature. A record of the user's indication may be stored, such as within the coding itself.

Type: Grant

Filed: April 30, 2010

Date of Patent: November 27, 2012

Assignee: Multimodal Technologies, LLC

Inventors: Detlef Koll, Michael Finke
Hybrid speech recognition

Patent number: 8249877

Abstract: A hybrid speech recognition system uses a client-side speech recognition engine and a server-side speech recognition engine to produce speech recognition results for the same speech. An arbitration engine produces speech recognition output based on one or both of the client-side and server-side speech recognition results.

Type: Grant

Filed: September 24, 2010

Date of Patent: August 21, 2012

Assignee: Multimodal Technologies, LLC

Inventor: Detlef Koll
Distributed speech recognition using one way communication

Patent number: 8249878

Abstract: A speech recognition client sends a speech stream and control stream in parallel to a server-side speech recognizer over a network. The network may be an unreliable, low-latency network. The server-side speech recognizer recognizes a first portion of the speech stream and, if a predetermined criterion is satisfied by the speech recognition result, waits until the speech recognizer has been reconfigured before recognizing a second portion of the speech stream. The speech recognition client receives recognition results from the server-side recognizer in response to requests from the client. The client may remotely reconfigure the state of the server-side recognizer during recognition.

Type: Grant

Filed: August 2, 2011

Date of Patent: August 21, 2012

Assignee: Multimodal Technologies, LLC

Inventors: Eric Carraux, Detlef Koll
Automatic detection and application of editing patterns in draft documents

Patent number: 8086453

Abstract: An error detection and correction system extracts editing patterns and derives correction rules from them by observing differences between draft documents and corresponding edited documents, and/or by observing editing operations performed on the draft documents to produce the edited documents. The system develops classifiers that partition the space of all possible contexts into equivalence classes and assigns one or more correction rules to each such class). Once the system has been trained, it may be used to detect and (optionally) correct errors in new draft documents. When presented with a draft document, the system identifies first content (e.g., text) in the draft document and identifies a context of the first content. The system identifies a correction rule based on the first content and the first context. The system may use a classifier to identify the correction rule. The system applies the correction rule to the first content to produce second content.

Type: Grant

Filed: January 26, 2009

Date of Patent: December 27, 2011

Assignee: Multimodal Technologies, LLC

Inventors: Koll Detlef, Juergen Fritsch, Michael Finke
Audio signal de-identification

Patent number: 8086458

Abstract: Techniques are disclosed for automatically de-identifying spoken audio signals. In particular, techniques are disclosed for automatically removing personally identifying information from spoken audio signals and replacing such information with non-personally identifying information. De-identification of a spoken audio signal may be performed by automatically generating a report based on the spoken audio signal. The report may include concept content (e.g., text) corresponding to one or more concepts represented by the spoken audio signal. The report may also include timestamps indicating temporal positions of speech in the spoken audio signal that corresponds to the concept content. Concept content that represents personally identifying information is identified. Audio corresponding to the personally identifying concept content is removed from the spoken audio signal. The removed audio may be replaced with non-personally identifying audio.

Type: Grant

Filed: October 24, 2008

Date of Patent: December 27, 2011

Assignee: Multimodal Technologies, LLC

Inventors: Michael Finke, Detlef Koll