Patents Assigned to Nuance Communications Austria GmbH

Method for changing over from a first adaptive data processing version to a second adaptive data processing version

Patent number: 9009695

Abstract: The invention relates to a method and to a system for changing over from a first adaptive data processing version (V1) on data processing means using at least one data model (dm) which is continuously adapted on the basis of data processing results to a second adaptive data processing version (V2) also using at least one data model (DM) to be continuously adapted, characterized in that, in a first phase, the second adaptive data processing version (V2) is used in parallel to the first data processing version (V1), thereby continuously adapting said at least one data model (dm) related to the first version (V1) as well as that data model (DM) related to the second version (V2), and in that the performance of data processing by means of the second version (V2) is checked to comply with a quality criterion, whereafter in a second phase, as soon as said criterion is met, the results of the data processing by means of the second version (V2) are outputted to be used.

Type: Grant

Filed: May 9, 2007

Date of Patent: April 14, 2015

Assignee: Nuance Communications Austria GmbH

Inventor: Johannes Unfried
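
The two-phase changeover described in this abstract can be illustrated with a minimal sketch. It is not the patented implementation: the class names, the toy model update (counting results) and the quality criterion are all hypothetical, and stopping V1 after the switch is an assumption the abstract does not state.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict


@dataclass
class AdaptiveVersion:
    """Hypothetical stand-in for an adaptive data processing version."""
    name: str
    process: Callable[[str, Dict[str, int]], str]
    model: Dict[str, int] = field(default_factory=dict)

    def run(self, item: str) -> str:
        result = self.process(item, self.model)
        # Toy adaptation: the data model counts the results it has produced.
        self.model[result] = self.model.get(result, 0) + 1
        return result


class Changeover:
    """Runs V1 and V2 in parallel until V2 meets the quality criterion."""

    def __init__(self, v1: AdaptiveVersion, v2: AdaptiveVersion,
                 quality_ok: Callable[[AdaptiveVersion], bool]) -> None:
        self.v1, self.v2, self.quality_ok = v1, v2, quality_ok
        self.switched = False

    def handle(self, item: str) -> str:
        if self.switched:
            # Second phase: only V2 results are output (assumption: V1 stops).
            return self.v2.run(item)
        # First phase: both versions process the item and adapt their models.
        r1 = self.v1.run(item)
        r2 = self.v2.run(item)
        if self.quality_ok(self.v2):
            self.switched = True
            return r2
        return r1


# Hypothetical usage: V2 counts as "good enough" after adapting on 3 items.
v1 = AdaptiveVersion("V1", lambda s, m: s.upper())
v2 = AdaptiveVersion("V2", lambda s, m: s.lower())
co = Changeover(v1, v2, lambda v: sum(v.model.values()) >= 3)
outputs = [co.handle("Ab") for _ in range(4)]
```

In this sketch the caller keeps receiving V1 results while V2 adapts in the background, and sees V2 results from the first item on which the criterion holds.
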
Text segmentation and label assignment with user interaction by means of topic specific language models and topic-specific label statistics

Patent number: 8688448

Abstract: The invention relates to a method, a computer program product, a segmentation system and a user interface for structuring an unstructured text by making use of statistical models trained on annotated training data. The method performs text segmentation into text sections and assigns labels to text sections as section headings. The performed segmentation and assignment is provided to a user for general review. Additionally, alternative segmentations and label assignments are provided to the user being capable to select alternative segmentations and alternative labels as well as to enter a user defined segmentation and user defined label. In response to the modifications introduced by the user, a plurality of different actions are initiated incorporating the re-segmentation and re-labeling of successive parts of the document or the entire document.

Type: Grant

Filed: September 14, 2012

Date of Patent: April 1, 2014

Assignee: Nuance Communications Austria GmbH

Inventors: Jochen Peters, Evgeny Matusov, Carsten Meyer, Dietrich Klakow
SPEECH RECOGNITION SYSTEM WITH HUGE VOCABULARY

Publication number: 20130185073

Abstract: The invention deals with speech recognition, such as a system for recognizing words in continuous speech. A speech recognition system is disclosed which is capable of recognizing a huge number of words, and in principle even an unlimited number of words. The speech recognition system comprises a word recognizer for deriving a best path through a word graph, and wherein words are assigned to the speech based on the best path. The word score being obtained from applying a phonemic language model to each word of the word graph. Moreover, the invention deals with an apparatus and a method for identifying words from a sound block and to computer readable code for implementing the method.

Type: Application

Filed: March 6, 2013

Publication date: July 18, 2013

Applicant: Nuance Communications Austria GmbH

Inventor: Nuance Communications Austria GmbH
SYNCHRONISE AN AUDIO CURSOR AND A TEXT CURSOR DURING EDITING

Publication number: 20130166304

Abstract: A speech recognition device (1) processes speech data (SD) of a dictation and thus establishes recognized text information (ETI) and link information (LI) of the dictation. In a synchronous playback mode of the speech recognition device (1), during the acoustic playback of the dictation a correction device (10) synchronously marks the word of the recognized text information (ETI) which word relates to the speech data (SD) just played back marked by the link information (LI) is marked synchronously, the just marked word featuring the position of an audio cursor (AC). When a user of the speech recognition device (1) recognizes an incorrect word, he positions a text cursor (TC) at the incorrect word and corrects it. Cursor synchronization means (15) now make it possible to synchronize the text cursor (TC) with the audio cursor (AC) or the audio cursor (AC) with the text cursor (TC) so that the positioning of the respective cursor (AC, TC) is simplified considerably.

Type: Application

Filed: January 17, 2013

Publication date: June 27, 2013

Applicant: Nuance Communications Austria GmbH

Inventor: Nuance Communications Austria GmbH
Method and system for processing dictated information

Patent number: 8452594

Abstract: A method and a system for processing dictated information into a dynamic form are disclosed. The method comprises presenting an image (3) belonging to an image category to a user, dictating a first section of speech associated with the image category, retrieving an electronic document having a previously defined document structure (4) associated with the first section of speech, thus associating the document structure (4) with the image (3), wherein the document structure comprises at least one text field, presenting at least a part of the electronic document having the document structure (4) on a presenting unit (5), dictating a second section of speech and processing the second section of speech in a speech recognition engine (6) into dictated text and associating the dictated text with the text field.

Type: Grant

Filed: October 16, 2006

Date of Patent: May 28, 2013

Assignee: Nuance Communications Austria GmbH

Inventor: Mehmet Mert Oz
System for speech recognition and correction, correction device and method for creating a lexicon of alternatives

Patent number: 8447602

Abstract: In a speech recognition and correction system which comprises at least one speech recognition device (1) to which a spoken text (GT) can be fed, it being possible for said spoken text to be transcribed into a recognized text (ET), and a correction device (3) for correcting the text (ET) recognized by the at least one speech recognition device (1), said correction device being connected to the at least one speech recognition device (1) via a data network (2) for the transmission of the recognized text (ET) and where appropriate of the spoken text (GT), the correction device (3) has a lexicon of alternatives (23) which contains word parts, words and word sequences that can be displayed (22) by the correction device (3) as alternatives to individual word parts, words and word sequences of the recognized text.

Type: Grant

Filed: March 22, 2004

Date of Patent: May 21, 2013

Assignee: Nuance Communications Austria GmbH

Inventors: Heinrich Franz Bartosik, Carsten Meyer
Method and system for creating or updating entries in a speech recognition lexicon

Patent number: 8447606

Abstract: In a method and a system (20) for creating or updating entries in a speech recognition (SR) lexicon (7) of a speech recognition system, said entries mapping speech recognition (SR) phoneme sequences to words, said method comprising entering a respective word, and in the case that the word is a new word to be added to the SR lexicon, also entering at least one associated SR phoneme sequence through input means (26), it is provided that the SR phoneme sequence associated with the respective word is converted into speech by phoneme to speech conversion means (4.4), and the speech is played back by playback means (28), to control the match of the phoneme sequence and the word.

Type: Grant

Filed: February 4, 2008

Date of Patent: May 21, 2013

Assignee: Nuance Communications Austria GmbH

Inventors: Andreas Neubacher, Gerhard Grobauer
METHOD AND SYSTEM FOR SPEECH BASED DOCUMENT HISTORY TRACKING

Publication number: 20130103401

Abstract: A method and a system of history tracking corrections in a speech based document are disclosed. The speech based document comprises one or more sections of text recognized or transcribed from sections of speech, wherein the sections of speech are dictated by a user and processed by a speech recognizer in a speech recognition system into corresponding sections of text of the speech based document.

Type: Application

Filed: December 14, 2012

Publication date: April 25, 2013

Applicant: Nuance Communications Austria GmbH

Inventor: Nuance Communications Austria GmbH
Speech recognition system with huge vocabulary

Patent number: 8417528

Abstract: The invention deals with speech recognition, such as a system for recognizing words in continuous speech. A speech recognition system is disclosed which is capable of recognizing a huge number of words, and in principle even an unlimited number of words. The speech recognition system comprises a word recognizer for deriving a best path through a word graph, and wherein words are assigned to the speech based on the best path. The word score being obtained from applying a phonemic language model to each word of the word graph. Moreover, the invention deals with an apparatus and a method for identifying words from a sound block and to computer readable code for implementing the method.

Type: Grant

Filed: February 3, 2012

Date of Patent: April 9, 2013

Assignee: Nuance Communications Austria GmbH

Inventor: Zsolt Saffer
TEXT SEGMENTATION AND LABEL ASSIGNMENT WITH USER INTERACTION BY MEANS OF TOPIC SPECIFIC LANGUAGE MODELS AND TOPIC-SPECIFIC LABEL STATISTICS

Publication number: 20130066625

Abstract: The invention relates to a method, a computer program product, a segmentation system and a user interface for structuring an unstructured text by making use of statistical models trained on annotated training data. The method performs text segmentation into text sections and assigns labels to text sections as section headings. The performed segmentation and assignment is provided to a user for general review. Additionally, alternative segmentations and label assignments are provided to the user being capable to select alternative segmentations and alternative labels as well as to enter a user defined segmentation and user defined label. In response to the modifications introduced by the user, a plurality of different actions are initiated incorporating the re-segmentation and re-labeling of successive parts of the document or the entire document.

Type: Application

Filed: September 14, 2012

Publication date: March 14, 2013

Applicant: Nuance Communications Austria GmbH

Inventors: Jochen Peters, Evgeny Matusov, Carsten Meyer, Dietrich Klakow
Synchronise an audio cursor and a text cursor during editing

Patent number: 8380509

Abstract: A speech recognition device (1) processes speech data (SD) of a dictation and establishes recognized text information (ETI) and link information (LI) of the dictation. In a synchronous playback mode of the speech recognition device (1), during acoustic playback of the dictation a correction device (10) synchronously marks the word of the recognized text information (ETI) which word relates to speech data (SD) just played back marked by link information (LI) is marked synchronously, the just marked word featuring the position of an audio cursor (AC). When a user of the speech recognition device (1) recognizes an incorrect word, he positions a text cursor (TC) at the incorrect word and corrects it. Cursor synchronization means (15) makes it possible to synchronize text cursor (TC) with audio cursor (AC) or audio cursor (AC) with text cursor (TC) so the positioning of the respective cursor (AC, TC) is simplified considerably.

Type: Grant

Filed: February 13, 2012

Date of Patent: February 19, 2013

Assignee: Nuance Communications Austria GmbH

Inventor: Wolfgang Gschwendtner
Method and system for speech based document history tracking

Patent number: 8364489

Abstract: A method and a system of history tracking corrections in a speech based document. The speech based document comprises one or more sections of text recognized or transcribed from sections of speech, wherein the sections of speech are dictated by a user and processed by a speech recognizer in a speech recognition system into corresponding sections of text of the speech based document. The method comprises associating at least one speech attribute to each section of text in the speech based document, said speech attribute comprising information related to said section of text, respectively; presenting said speech based document on a presenting unit; detecting an action being performed within any of said sections of text; and updating information of said speech attributes related to the kind of action detected on one of said sections of text for updating said speech based document.

Type: Grant

Filed: February 3, 2012

Date of Patent: January 29, 2013

Assignee: Nuance Communications Austria GmbH

Inventors: Gerhard Grobauer, Miklos Papai
Text segmentation and label assignment with user interaction by means of topic specific language models and topic-specific label statistics

Patent number: 8332221

Abstract: The invention relates to a method, a computer program product, a segmentation system and a user interface for structuring an unstructured text by making use of statistical models trained on annotated training data. The method performs text segmentation into text sections and assigns labels to text sections as section headings. The performed segmentation and assignment is provided to a user for general review. Additionally, alternative segmentations and label assignments are provided to the user being capable to select alternative segmentations and alternative labels as well as to enter a user defined segmentation and user defined label. In response to the modifications introduced by the user, a plurality of different actions are initiated incorporating the re-segmentation and re-labeling of successive parts of the document or the entire document.

Type: Grant

Filed: August 15, 2011

Date of Patent: December 11, 2012

Assignee: Nuance Communications Austria GmbH

Inventors: Jochen Peters, Evgeny Matusov, Carsten Meyer, Dietrich Klakow
METHOD AND SYSTEM FOR SPEECH BASED DOCUMENT HISTORY TRACKING

Publication number: 20120185249

Abstract: A method and a system of history tracking corrections in a speech based document. The speech based document comprises one or more sections of text recognized or transcribed from sections of speech, wherein the sections of speech are dictated by a user and processed by a speech recognizer in a speech recognition system into corresponding sections of text of the speech based document. The method comprises associating at least one speech attribute to each section of text in the speech based document, said speech attribute comprising information related to said section of text, respectively; presenting said speech based document on a presenting unit; detecting an action being performed within any of said sections of text; and updating information of said speech attributes related to the kind of action detected on one of said sections of text for updating said speech based document.

Type: Application

Filed: February 3, 2012

Publication date: July 19, 2012

Applicant: Nuance Communications Austria GmbH

Inventors: Gerhard Grobauer, Miklos Papai
SYNCHRONISE AN AUDIO CURSOR AND A TEXT CURSOR DURING EDITING

Publication number: 20120158405

Abstract: A speech recognition device (1) processes speech data (SD) of a dictation and establishes recognized text information (ETI) and link information (LI) of the dictation. In a synchronous playback mode of the speech recognition device (1), during acoustic playback of the dictation a correction device (10) synchronously marks the word of the recognized text information (ETI) which word relates to speech data (SD) just played back marked by link information (LI) is marked synchronously, the just marked word featuring the position of an audio cursor (AC). When a user of the speech recognition device (1) recognizes an incorrect word, he positions a text cursor (TC) at the incorrect word and corrects it. Cursor synchronization means (15) makes it possible to synchronize text cursor (TC) with audio cursor (AC) or audio cursor (AC) with text cursor (TC) so the positioning of the respective cursor (AC, TC) is simplified considerably.

Type: Application

Filed: February 13, 2012

Publication date: June 21, 2012

Applicant: Nuance Communications Austria GmbH

Inventor: Wolfgang Gschwendtner
Text segmentation and label assignment with user interaction by means of topic specific language models and topic-specific label statistics

Patent number: 8200487

Abstract: The invention relates to a method, a computer program product, a segmentation system and a user interface for structuring an unstructured text by making use of statistical models trained on annotated training data. The method performs text segmentation into text sections and assigns labels to text sections as section headings. The performed segmentation and assignment is provided to a user for general review. Additionally, alternative segmentations and label assignments are provided to the user being capable to select alternative segmentations and alternative labels as well as to enter a user defined segmentation and user defined label. In response to the modifications introduced by the user, a plurality of different actions are initiated incorporating the re-segmentation and re-labelling of successive parts of the document or the entire document.

Type: Grant

Filed: November 12, 2004

Date of Patent: June 12, 2012

Assignee: Nuance Communications Austria GmbH

Inventors: Jochen Peters, Evgeny Matusov, Carsten Meyer, Dietrich Klakow
SPEECH RECOGNITION SYSTEM WITH HUGE VOCABULARY

Publication number: 20120136662

Abstract: The invention deals with speech recognition, such as a system for recognizing words in continuous speech. A speech recognition system is disclosed which is capable of recognizing a huge number of words, and in principle even an unlimited number of words. The speech recognition system comprises a word recognizer for deriving a best path through a word graph, and wherein words are assigned to the speech based on the best path. The word score being obtained from applying a phonemic language model to each word of the word graph. Moreover, the invention deals with an apparatus and a method for identifying words from a sound block and to computer readable code for implementing the method.

Type: Application

Filed: February 3, 2012

Publication date: May 31, 2012

Applicant: Nuance Communications Austria GmbH

Inventor: Zsolt Saffer
TEXT SEGMENTATION AND LABEL ASSIGNMENT WITH USER INTERACTION BY MEANS OF TOPIC SPECIFIC LANGUAGE MODELS AND TOPIC-SPECIFIC LABEL STATISTICS

Publication number: 20120095751

Abstract: The invention relates to a method, a computer program product, a segmentation system and a user interface for structuring an unstructured text by making use of statistical models trained on annotated training data. The method performs text segmentation into text sections and assigns labels to text sections as section headings. The performed segmentation and assignment is provided to a user for general review. Additionally, alternative segmentations and label assignments are provided to the user being capable to select alternative segmentations and alternative labels as well as to enter a user defined segmentation and user defined label. In response to the modifications introduced by the user, a plurality of different actions are initiated incorporating the re-segmentation and re-labeling of successive parts of the document or the entire document.

Type: Application

Filed: August 15, 2011

Publication date: April 19, 2012

Applicant: Nuance Communications Austria GmbH

Inventors: Jochen Peters, Evgeny Matusov, Carsten Meyer, Dietrich Klakow
Arrangement and method for reproducing audio data as well as computer program product for this

Patent number: 8150691

Abstract: During the replaying of audio data stored in a, which audio data corresponds to text data from a text composed of words, the replaying of the audio data in forward and reverse modes is controlled. Starting from a particular momentary replay position in the audio data, a backward jump over a return distance corresponding to the length of about at least two words, to a target position, is automatically initiated for the replaying of the audio data in the reverse mode. Then, starting from the particular target position, a replay of the audio data in the forward sequence for just one part of the return distance is undertaken.

Type: Grant

Filed: October 13, 2003

Date of Patent: April 3, 2012

Assignee: Nuance Communications Austria GmbH

Inventor: Kwaku Frimpong-Ansah
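
The reverse-mode replay in this abstract (jump back about two words, then play forward for only part of that distance) can be sketched as follows. This is an illustrative reading, not the patented method: the function name, the use of word start times as jump targets, and the default of jumping two words and playing one are assumptions.

```python
from typing import List, Tuple


def reverse_replay_segments(word_starts: List[float], current: int,
                            jump_words: int = 2,
                            play_words: int = 1) -> List[Tuple[float, float]]:
    """Return the (start, end) time spans played forward during reverse mode.

    word_starts: start time of each word in seconds (hypothetical input).
    current:     index of the word at the momentary replay position.
    """
    if jump_words < 2 or not 0 < play_words < jump_words:
        raise ValueError("jump at least two words and play only part of the jump")
    segments = []
    pos = current
    while pos > 0:
        # Backward jump over the return distance to the target position.
        target = max(pos - jump_words, 0)
        # Forward replay for just one part of the return distance.
        end = min(target + play_words, pos)
        segments.append((word_starts[target], word_starts[end]))
        if target == 0:
            break  # reached the beginning of the audio data
        pos = end
    return segments


# Hypothetical word start times for a five-word dictation.
starts = [0.0, 0.4, 0.9, 1.5, 2.0]
spans = reverse_replay_segments(starts, current=4)
```

With these defaults each step moves the position back by one word net, so the listener hears the words in reverse order while every word is itself played forward.
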
Speech recognition system with huge vocabulary

Patent number: 8140336

Abstract: The invention deals with speech recognition, such as a system for recognizing words in continuous speech. A speech recognition system is disclosed which is capable of recognizing a huge number of words, and in principle even an unlimited number of words. The speech recognition system comprises a word recognizer for deriving a best path through a word graph, and wherein words are assigned to the speech based on the best path. The word score being obtained from applying a phonemic language model to each word of the word graph. Moreover, the invention deals with an apparatus and a method for identifying words from a sound block and to computer readable code for implementing the method.

Type: Grant

Filed: December 6, 2006

Date of Patent: March 20, 2012

Assignee: Nuance Communications Austria GmbH

Inventor: Zsolt Saffer