Patents by Inventor Detlef Koll

Detlef Koll has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Providing computable guidance to relevant evidence in question-answering systems

Patent number: 9424523

Abstract: A computer-based system includes a computer-processable definition of a region in a data set. The system identifies a region of the data set based on the definition of the region. The system provides output to a user representing a question and the identified region of the data set. The system may also automatically generate an answer to the question based on the question and the data set, and provide output to the user representing the answer. The system may generate the answer based on a subset of the data set, and provide output to the user representing the subset of the data set. The user may provide feedback on the first answer to the system, which the system may use to improve subsequent answers to the same and other questions, and to disable the system's automatic question-answering function in response to disagreement between the user and the system.

Type: Grant

Filed: June 22, 2015

Date of Patent: August 23, 2016

Assignee: MModal IP LLC

Inventors: Detlef Koll, Thomas Polzin
Document Transcription System Training

Publication number: 20160196821

Abstract: A system is provided for training an acoustic model for use in speech recognition. In particular, such a system may be used to perform training based on a spoken audio stream and a non-literal transcript of the spoken audio stream. Such a system may identify text in the non-literal transcript which represents concepts having multiple spoken forms. The system may attempt to identify the actual spoken form in the audio stream which produced the corresponding text in the non-literal transcript, and thereby produce a revised transcript which more accurately represents the spoken audio stream. The revised, and more accurate, transcript may be used to train the acoustic model, thereby producing a better acoustic model than that which would be produced using conventional techniques, which perform training based directly on the original non-literal transcript.

Type: Application

Filed: March 10, 2016

Publication date: July 7, 2016

Applicant: MModal IP LLC

Inventors: Girija Yegnanarayanan, Michael Finke, Juergen Fritsch, Detlef Koll, Monika Woszczyna
Document Extension in Dictation-Based Document Generation Workflow

Publication number: 20160179770

Abstract: An automatic speech recognizer is used to produce a structured document representing the contents of human speech. A best practice is applied to the structured document to produce a conclusion, such as a conclusion that required information is missing from the structured document. Content is inserted into the structured document based on the conclusion, thereby producing a modified document. The inserted content may be obtained by prompting a human user for the content and receiving input representing the content from the human user.

Type: Application

Filed: February 26, 2016

Publication date: June 23, 2016

Applicant: MModal IP LLC

Inventors: Detlef Koll, Juergen Fritsch, Michael Finke
Speech Recognition Using Loosely Coupled Components

Publication number: 20160086604

Abstract: An automatic speech recognition system includes an audio capture component, a speech recognition processing component, and a result processing component which are distributed among two or more logical devices and/or two or more physical devices. In particular, the audio capture component may be located on a different logical device and/or physical device from the result processing component. For example, the audio capture component may be on a computer connected to a microphone into which a user speaks, while the result processing component may be on a terminal server which receives speech recognition results from a speech recognition processing server.

Type: Application

Filed: December 3, 2015

Publication date: March 24, 2016

Applicant: MModal IP LLC

Inventors: Detlef Koll, Michael Finke
Document transcription system training

Patent number: 9286896

Abstract: A system is provided for training an acoustic model for use in speech recognition. In particular, such a system may be used to perform training based on a spoken audio stream and a non-literal transcript of the spoken audio stream. Such a system may identify text in the non-literal transcript which represents concepts having multiple spoken forms. The system may attempt to identify the actual spoken form in the audio stream which produced the corresponding text in the non-literal transcript, and thereby produce a revised transcript which more accurately represents the spoken audio stream. The revised, and more accurate, transcript may be used to train the acoustic model, thereby producing a better acoustic model than that which would be produced using conventional techniques, which perform training based directly on the original non-literal transcript.

Type: Grant

Filed: May 16, 2014

Date of Patent: March 15, 2016

Assignee: MModal IP LLC

Inventors: Girija Yegnanarayanan, Michael Finke, Juergen Fritsch, Detlef Koll, Monika Woszczyna
Document extension in dictation-based document generation workflow

Patent number: 9275643

Abstract: An automatic speech recognizer is used to produce a structured document representing the contents of human speech. A best practice is applied to the structured document to produce a conclusion, such as a conclusion that required information is missing from the structured document. Content is inserted into the structured document based on the conclusion, thereby producing a modified document. The inserted content may be obtained by prompting a human user for the content and receiving input representing the content from the human user.

Type: Grant

Filed: July 9, 2014

Date of Patent: March 1, 2016

Assignee: MModal IP LLC

Inventors: Detlef Koll, Juergen Fritsch, Michael Finke
Content-Based Audio Playback Emphasis

Publication number: 20160005402

Abstract: Techniques are disclosed for facilitating the process of proofreading draft transcripts of spoken audio streams. In general, proofreading of a draft transcript is facilitated by playing back the corresponding spoken audio stream with an emphasis on those regions in the audio stream that are highly relevant or likely to have been transcribed incorrectly. Regions may be emphasized by, for example, playing them back more slowly than regions that are of low relevance and likely to have been transcribed correctly. Emphasizing those regions of the audio stream that are most important to transcribe correctly and those regions that are most likely to have been transcribed incorrectly increases the likelihood that the proofreader will accurately correct any errors in those regions, thereby improving the overall accuracy of the transcript.

Type: Application

Filed: September 11, 2015

Publication date: January 7, 2016

Applicant: MModal IP LLC

Inventors: Kjell Schubert, Juergen Fritsch, Michael Finke, Detlef Koll
Providing Computable Guidance to Relevant Evidence in Question-Answering Systems

Publication number: 20150371145

Abstract: A computer-based system includes a computer-processable definition of a region in a data set. The system identifies a region of the data set based on the definition of the region. The system provides output to a user representing a question and the identified region of the data set. The system may also automatically generate an answer to the question based on the question and the data set, and provide output to the user representing the answer. The system may generate the answer based on a subset of the data set, and provide output to the user representing the subset of the data set. The user may provide feedback on the first answer to the system, which the system may use to improve subsequent answers to the same and other questions, and to disable the system's automatic question-answering function in response to disagreement between the user and the system.

Type: Application

Filed: June 22, 2015

Publication date: December 24, 2015

Applicant: MMODAL IP LLC

Inventors: Detlef Koll, Thomas Polzin
Speech recognition using loosely coupled components

Patent number: 9208786

Abstract: An automatic speech recognition system includes an audio capture component, a speech recognition processing component, and a result processing component which are distributed among two or more logical devices and/or two or more physical devices. In particular, the audio capture component may be located on a different logical device and/or physical device from the result processing component. For example, the audio capture component may be on a computer connected to a microphone into which a user speaks, while the result processing component may be on a terminal server which receives speech recognition results from a speech recognition processing server.

Type: Grant

Filed: March 3, 2015

Date of Patent: December 8, 2015

Assignee: MModal IP LLC

Inventors: Detlef Koll, Michael Finke
Content-based audio playback emphasis

Patent number: 9135917

Abstract: Techniques are disclosed for facilitating the process of proofreading draft transcripts of spoken audio streams. In general, proofreading of a draft transcript is facilitated by playing back the corresponding spoken audio stream with an emphasis on those regions in the audio stream that are highly relevant or likely to have been transcribed incorrectly. Regions may be emphasized by, for example, playing them back more slowly than regions that are of low relevance and likely to have been transcribed correctly. Emphasizing those regions of the audio stream that are most important to transcribe correctly and those regions that are most likely to have been transcribed incorrectly increases the likelihood that the proofreader will accurately correct any errors in those regions, thereby improving the overall accuracy of the transcript.

Type: Grant

Filed: June 27, 2014

Date of Patent: September 15, 2015

Assignee: MModal IP LLC

Inventors: Kjell Schubert, Juergen Fritsch, Michael Finke, Detlef Koll
Speech recognition using loosely coupled components

Patent number: 9082408

Abstract: An automatic speech recognition system includes an audio capture component, a speech recognition processing component, and a result processing component which are distributed among two or more logical devices and/or two or more physical devices. In particular, the audio capture component may be located on a different logical device and/or physical device from the result processing component. For example, the audio capture component may be on a computer connected to a microphone into which a user speaks, while the result processing component may be on a terminal server which receives speech recognition results from a speech recognition processing server.

Type: Grant

Filed: June 8, 2012

Date of Patent: July 14, 2015

Assignee: MModal IP LLC

Inventors: Detlef Koll, Michael Finke
Providing computable guidance to relevant evidence in question-answering systems

Patent number: 9082310

Abstract: A computer-based system includes a computer-processable definition of a region in a data set. The system identifies a region of the data set based on the definition of the region. The system provides output to a user representing a question and the identified region of the data set. The system may also automatically generate an answer to the question based on the question and the data set, and provide output to the user representing the answer. The system may generate the answer based on a subset of the data set, and provide output to the user representing the subset of the data set. The user may provide feedback on the first answer to the system, which the system may use to improve subsequent answers to the same and other questions, and to disable the system's automatic question-answering function in response to disagreement between the user and the system.

Type: Grant

Filed: February 10, 2011

Date of Patent: July 14, 2015

Assignee: MModal IP LLC

Inventors: Detlef Koll, Thomas Polzin
Speech Recognition Using Loosely Coupled Components

Publication number: 20150179172

Abstract: An automatic speech recognition system includes an audio capture component, a speech recognition processing component, and a result processing component which are distributed among two or more logical devices and/or two or more physical devices. In particular, the audio capture component may be located on a different logical device and/or physical device from the result processing component. For example, the audio capture component may be on a computer connected to a microphone into which a user speaks, while the result processing component may be on a terminal server which receives speech recognition results from a speech recognition processing server.

Type: Application

Filed: March 3, 2015

Publication date: June 25, 2015

Applicant: MModal IP LLC

Inventors: Detlef Koll, Michael Finke
Distributed Speech Recognition Using One Way Communication

Publication number: 20150170647

Abstract: A speech recognition client sends a speech stream and control stream in parallel to a server-side speech recognizer over a network. The network may be an unreliable, low-latency network. The server-side speech recognizer recognizes the speech stream continuously. The speech recognition client receives recognition results from the server-side recognizer in response to requests from the client. The client may remotely reconfigure the state of the server-side recognizer during recognition.

Type: Application

Filed: February 20, 2015

Publication date: June 18, 2015

Applicant: MModal IP LLC

Inventors: Eric Carraux, Detlef Koll
Structured Searching of Dynamic Structured Document Corpuses

Publication number: 20150154168

Abstract: A system includes a document corpus containing structured documents, which contain both text and annotations of the text. The system also includes a search engine which is adapted to perform structured searches of the structured documents. As new types of annotations are added to the system, the search engine is updated automatically to become capable of performing structured searches for the new types of annotations. For example, if a new natural language processing (NLP) component, adapted to generate annotations of a new type, is added to the system, then the system automatically updates a query language to include a definition of the new type of annotation. The search engine may then immediately be capable of processing structured queries which refer to the new type of annotation.

Type: Application

Filed: February 3, 2015

Publication date: June 4, 2015

Applicant: MMODAL IP LLC

Inventors: Detlef Koll, Juergen Fritsch
Decoding-Time Prediction of Non-Verbalized Tokens

Publication number: 20150095025

Abstract: Non-verbalized tokens, such as punctuation, are automatically predicted and inserted into a transcription of speech in which the tokens were not explicitly verbalized. Token prediction may be integrated with speech decoding, rather than performed as a post-process to speech decoding.

Type: Application

Filed: December 16, 2014

Publication date: April 2, 2015

Applicant: Multimodal Technologies, LLC

Inventors: Juergen Fritsch, Anoop Deoras, Detlef Koll
Structured searching of dynamic structured document corpuses

Patent number: 8959102

Abstract: A system includes a document corpus containing structured documents, which contain both text and annotations of the text. The system also includes a search engine which is adapted to perform structured searches of the structured documents. As new types of annotations are added to the system, the search engine is updated automatically to become capable of performing structured searches for the new types of annotations. For example, if a new natural language processing (NLP) component, adapted to generate annotations of a new type, is added to the system, then the system automatically updates a query language to include a definition of the new type of annotation. The search engine may then immediately be capable of processing structured queries which refer to the new type of annotation.

Type: Grant

Filed: October 8, 2011

Date of Patent: February 17, 2015

Assignee: MModal IP LLC

Inventors: Detlef Koll, Juergen Fritsch
Decoding-time prediction of non-verbalized tokens

Patent number: 8918317

Abstract: Non-verbalized tokens, such as punctuation, are automatically predicted and inserted into a transcription of speech in which the tokens were not explicitly verbalized. Token prediction may be integrated with speech decoding, rather than performed as a post-process to speech decoding.

Type: Grant

Filed: September 25, 2009

Date of Patent: December 23, 2014

Assignee: Multimodal Technologies, LLC

Inventors: Juergen Fritsch, Anoop Deoras, Detlef Koll
Document Extension in Dictation-Based Document Generation Workflow

Publication number: 20140324423

Abstract: An automatic speech recognizer is used to produce a structured document representing the contents of human speech. A best practice is applied to the structured document to produce a conclusion, such as a conclusion that required information is missing from the structured document. Content is inserted into the structured document based on the conclusion, thereby producing a modified document. The inserted content may be obtained by prompting a human user for the content and receiving input representing the content from the human user.

Type: Application

Filed: July 9, 2014

Publication date: October 30, 2014

Inventors: Detlef Koll, Juergen Fritsch, Michael Finke
Verification of Extracted Data

Publication number: 20140316772

Abstract: Facts are extracted from speech and recorded in a document using codings. Each coding represents an extracted fact and includes a code and a datum. The code may represent a type of the extracted fact and the datum may represent a value of the extracted fact. The datum in a coding is rendered based on a specified feature of the coding. For example, the datum may be rendered as boldface text to indicate that the coding has been designated as an “allergy.” In this way, the specified feature of the coding (e.g., “allergy”-ness) is used to modify the manner in which the datum is rendered. A user inspects the rendering and provides, based on the rendering, an indication of whether the coding was accurately designated as having the specified feature. A record of the user's indication may be stored, such as within the coding itself.

Type: Application

Filed: June 27, 2014

Publication date: October 23, 2014

Inventors: Detlef Koll, Michael Finke

prev 1 2 3 4 5 6 7 next