Patents Assigned to Nuance Communications Austria GmbH
-
Patent number: 9009695Abstract: The invention relates to a method and to a system for changing over from a first adaptive data processing version (V1) on data processing means using at least one data model (dm) which is continuously adapted on the basis of data processing results to a second adaptive data processing version (V2) also using at least one data model (DM) to be continuously adapted, characterized in that, in a first phase, the second adaptive data processing version (V2) is used in parallel to the first data processing version (V1), thereby continuously adapting said at least one data model (dm) related to the first version (V1) as well as that data model (DM) related to the second version (V2), and in that the performance of data processing by means of the second version (V2) in checked to comply with a quality criterion, where after in a second phase, as soon as said criterion is met, the results of the data processing by means of the second version (V2) are outputted to be used.Type: GrantFiled: May 9, 2007Date of Patent: April 14, 2015Assignee: Nuance Communications Austria GmbHInventor: Johannes Unfried
-
Patent number: 8688448Abstract: The invention relates to a method, a computer program product, a segmentation system and a user interface for structuring an unstructured text by making use of statistical models trained on annotated training data. The method performs text segmentation into text sections and assigns labels to text sections as section headings. The performed segmentation and assignment is provided to a user for general review. Additionally, alternative segmentations and label assignments are provided to the user being capable to select alternative segmentations and alternative labels as well as to enter a user defined segmentation and user defined label. In response to the modifications introduced by the user, a plurality of different actions are initiated incorporating the re-segmentation and re-labeling of successive parts of the document or the entire document.Type: GrantFiled: September 14, 2012Date of Patent: April 1, 2014Assignee: Nuance Communications Austria GmbHInventors: Jochen Peters, Evgeny Matusov, Carsten Meyer, Dietrich Klakow
-
Publication number: 20130185073Abstract: The invention deals with speech recognition, such as a system for recognizing words in continuous speech. A speech recognition system is disclosed which is capable of recognizing a huge number of words, and in principle even an unlimited number of words. The speech recognition system comprises a word recognizer for deriving a best path through a word graph, and wherein words are assigned to the speech based on the best path. The word score being obtained from applying a phonemic language model to each word of the word graph. Moreover, the invention deals with an apparatus and a method for identifying words from a sound block and to computer readable code for implementing the method.Type: ApplicationFiled: March 6, 2013Publication date: July 18, 2013Applicant: Nuance Communications Austria GmbHInventor: Nuance Communications Austria GmbH
-
Publication number: 20130166304Abstract: A speech recognition device (1) processes speech data (SD) of a dictation and thus establishes recognized text information (ETI) and link information (LI) of the dictation. In a synchronous playback mode of the speech recognition device (1), during the acoustic playback of the dictation a correction device (10) synchronously marks the word of the recognized text information (ETI) which word relates to the speech data (SD) just played back marked by the link information (LD is marked synchronously, the just marked word featuring the position of an audio cursor (AC). When a user of the speech recognition device (1) recognizes an incorrect word, he positions a text cursor (TC) at the incorrect word and corrects it. Cursor synchronization means (15) now make it possible to synchronize the text cursor (TC) with the audio cursor (AC) or the audio cursor (AC) with the text cursor (TC) so that the positioning of the respective cursor (AC, TC) is simplified considerably.Type: ApplicationFiled: January 17, 2013Publication date: June 27, 2013Applicant: Nuance Communications Austria GmbHInventor: Nuance Communications Austria GmbH
-
Patent number: 8452594Abstract: A method and a system for processing dictated information into a dynamic form are disclosed. The method comprises presenting an image (3) belonging to an image category to a user, dictating a first section of speech associated with the image category, retrieving an electronic document having a previously defined document structure (4) associated with the first section of speech, thus associating the document structure (4) with the image (3), wherein the document structure comprises at least one text field, presenting at least a part of the electronic document having the document structure (4) on a presenting unit (5), dictating a second section of speech and processing the second section of speech in a speech recognition engine (6) into dictated text and associating the dictated text with the text field.Type: GrantFiled: October 16, 2006Date of Patent: May 28, 2013Assignee: Nuance Communications Austria GmbHInventor: Mehmet Mert Oz
-
Patent number: 8447602Abstract: In a speech recognition and correction system which comprises at least one speech recognition device (1) to which a spoken text (GT) can be fed, it being possible for said spoken text to be transcribe into a recognized text (ET), and a correction device (3) for correcting the text (ET) recognized by the at least one speech recognition device (1), said correction device being connected to the at least one speech recognition device (1) via a data network (2) for the transmission of the recognized text (ET) and where appropriate of the spoken text (GT), the correction device (3) has a lexicon of alternatives (23) which contains word parts, words and word sequences that can be displayed (22) by the correction device (3) as alternatives to individual word parts, words and word sequences of the recognized text.Type: GrantFiled: March 22, 2004Date of Patent: May 21, 2013Assignee: Nuance Communications Austria GmbHInventors: Heinrich Franz Bartosik, Carsten Meyer
-
Patent number: 8447606Abstract: In a method and a system (20) for creating or updating entries in a speech recognition (SR) lexicon (7) of a speech recognition system, said entries mapping speech recognition (SR) phoneme sequences to words, said method comprising entering a respective word, and in the case that the word is a new word to be added to the SR lexicon, also entering at least one associated SR phoneme sequence through input means (26), it is provided that the SR phoneme sequence associated with the respective word is converted into speech by phoneme to speech conversion means (4.4), and the speech is played back by playback means (28), to control the match of the phoneme sequence and the word.Type: GrantFiled: February 4, 2008Date of Patent: May 21, 2013Assignee: Nuance Communications Austria GmbHInventors: Andreas Neubacher, Gerhard Grobauer
-
Publication number: 20130103401Abstract: A method and a system of history tracking corrections in a speech based document are disclosed. The speech based document comprises one or more sections of text recognized or transcribed from sections of speech, wherein the sections of speech are dictated by a user and processed by a speech recognizer in a speech recognition system into corresponding sections of text of the speech based document.Type: ApplicationFiled: December 14, 2012Publication date: April 25, 2013Applicant: Nuance Communications Austria GmbHInventor: Nuance Communications Austria GmbH
-
Patent number: 8417528Abstract: The invention deals with speech recognition, such as a system for recognizing words in continuous speech. A speech recognition system is disclosed which is capable of recognizing a huge number of words, and in principle even an unlimited number of words. The speech recognition system comprises a word recognizer for deriving a best path through a word graph, and wherein words are assigned to the speech based on the best path. The word score being obtained from applying a phonemic language model to each word of the word graph. Moreover, the invention deals with an apparatus and a method for identifying words from a sound block and to computer readable code for implementing the method.Type: GrantFiled: February 3, 2012Date of Patent: April 9, 2013Assignee: Nuance Communications Austria GmbHInventor: Zsolt Saffer
-
Publication number: 20130066625Abstract: The invention relates to a method, a computer program product, a segmentation system and a user interface for structuring an unstructured text by making use of statistical models trained on annotated training data. The method performs text segmentation into text sections and assigns labels to text sections as section headings. The performed segmentation and assignment is provided to a user for general review. Additionally, alternative segmentations and label assignments are provided to the user being capable to select alternative segmentations and alternative labels as well as to enter a user defined segmentation and user defined label. In response to the modifications introduced by the user, a plurality of different actions are initiated incorporating the re-segmentation and re-labeling of successive parts of the document or the entire document.Type: ApplicationFiled: September 14, 2012Publication date: March 14, 2013Applicant: Nuance Communications Austria GmbHInventors: Jochen Peters, Evgeny Matusov, Carsten Meyer, Dietrich Klakow
-
Patent number: 8380509Abstract: A speech recognition device (1) processes speech data (SD) of a dictation and establishes recognized text information (ETI) and link information (LI) of the dictation. In a synchronous playback mode of the speech recognition device (1), during acoustic playback of the dictation a correction device (10) synchronously marks the word of the recognized text information (ETI) which word relates to speech data (SD) just played back marked by link information (LI) is marked synchronously, the just marked word featuring the position of an audio cursor (AC). When a user of the speech recognition device (1) recognizes an incorrect word, he positions a text cursor (TC) at the incorrect word and corrects it. Cursor synchronization means (15) makes it possible to synchronize text cursor (TC) with audio cursor (AC) or audio cursor (AC) with text cursor (TC) so the positioning of the respective cursor (AC, TC) is simplified considerably.Type: GrantFiled: February 13, 2012Date of Patent: February 19, 2013Assignee: Nuance Communications Austria GmbHInventor: Wolfgang Gschwendtner
-
Patent number: 8364489Abstract: A method and a system of history tracking corrections in a speech based document. The speech based document comprises one or more sections of text recognized or transcribed from sections of speech, wherein the sections of speech are dictated by a user and processed by a speech recognizer in a speech recognition system into corresponding sections of text of the speech based document. The method comprises associating at least one speech attribute to each section of text in the speech based document, said speech attribute comprising information related to said section of text, respectively; presenting said speech based document on a presenting unit; detecting an action being performed within any of said sections of text; and updating information of said speech attributes related to the kind of action detected on one of said sections of text for updating said speech based document.Type: GrantFiled: February 3, 2012Date of Patent: January 29, 2013Assignee: Nuance Communications Austria GmbHInventors: Gerhard Grobauer, Miklos Papai
-
Patent number: 8332221Abstract: The invention relates to a method, a computer program product, a segmentation system and a user interface for structuring an unstructured text by making use of statistical models trained on annotated training data. The method performs text segmentation into text sections and assigns labels to text sections as section headings. The performed segmentation and assignment is provided to a user for general review. Additionally, alternative segmentations and label assignments are provided to the user being capable to select alternative segmentations and alternative labels as well as to enter a user defined segmentation and user defined label. In response to the modifications introduced by the user, a plurality of different actions are initiated incorporating the re-segmentation and re-labeling of successive parts of the document or the entire document.Type: GrantFiled: August 15, 2011Date of Patent: December 11, 2012Assignee: Nuance Communications Austria GmbHInventors: Jochen Peters, Evgeny Matusov, Carsten Meyer, Dietrich Klakow
-
Publication number: 20120185249Abstract: A method and a system of history tracking corrections in a speech based document. The speech based document comprises one or more sections of text recognized or transcribed from sections of speech, wherein the sections of speech are dictated by a user and processed by a speech recognizer in a speech recognition system into corresponding sections of text of the speech based document. The method comprises associating at least one speech attribute to each section of text in the speech based document, said speech attribute comprising information related to said section of text, respectively; presenting said speech based document on a presenting unit; detecting an action being performed within any of said sections of text; and updating information of said speech attributes related to the kind of action detected on one of said sections of text for updating said speech based document.Type: ApplicationFiled: February 3, 2012Publication date: July 19, 2012Applicant: Nuance Communications Austria GMBHInventors: Gerhard Grobauer, Miklos Papai
-
Publication number: 20120158405Abstract: A speech recognition device (1) processes speech data (SD) of a dictation and establishes recognized text information (ETI) and link information (LI) of the dictation. In a synchronous playback mode of the speech recognition device (1), during acoustic playback of the dictation a correction device (10) synchronously marks the word of the recognized text information (ETI) which word relates to speech data (SD) just played back marked by link information (LI) is marked synchronously, the just marked word featuring the position of an audio cursor (AC). When a user of the speech recognition device (1) recognizes an incorrect word, he positions a text cursor (TC) at the incorrect word and corrects it. Cursor synchronization means (15) makes it possible to synchronize text cursor (TC) with audio cursor (AC) or audio cursor (AC) with text cursor (TC) so the positioning of the respective cursor (AC, TC) is simplified considerably.Type: ApplicationFiled: February 13, 2012Publication date: June 21, 2012Applicant: Nuance Communications Austria GmbHInventor: Wolfgang Gschwendtner
-
Patent number: 8200487Abstract: The invention relates to a method, a computer program product, a segmentation system and a user interface for structuring an unstructured text by making use of statistical models trained on annotated training data. The method performs text segmentation into text sections and assigns labels to text sections as section headings. The performed segmentation and assignment is provided to a user for general review. Additionally, alternative segmentations and label assignments are provided to the user being capable to select alternative segmentations and alternative labels as well as to enter a user defined segmentation and user defined label. In response to the modifications introduced by the user, a plurality of different actions are initiated incorporating the re-segmentation and re-labelling of successive parts of the document or the entire document.Type: GrantFiled: November 12, 2004Date of Patent: June 12, 2012Assignee: Nuance Communications Austria GmbHInventors: Jochen Peters, Evgeny Matusov, Carsten Meyer, Dietrich Klakow
-
Publication number: 20120136662Abstract: The invention deals with speech recognition, such as a system for recognizing words in continuous speech. A speech recognition system is disclosed which is capable of recognizing a huge number of words, and in principle even an unlimited number of words. The speech recognition system comprises a word recognizer for deriving a best path through a word graph, and wherein words are assigned to the speech based on the best path. The word score being obtained from applying a phonemic language model to each word of the word graph. Moreover, the invention deals with an apparatus and a method for identifying words from a sound block and to computer readable code for implementing the method.Type: ApplicationFiled: February 3, 2012Publication date: May 31, 2012Applicant: Nuance Communications Austria GMBHInventor: Zsolt Saffer
-
Publication number: 20120095751Abstract: The invention relates to a method, a computer program product, a segmentation system and a user interface for structuring an unstructured text by making use of statistical models trained on annotated training data. The method performs text segmentation into text sections and assigns labels to text sections as section headings. The performed segmentation and assignment is provided to a user for general review. Additionally, alternative segmentations and label assignments are provided to the user being capable to select alternative segmentations and alternative labels as well as to enter a user defined segmentation and user defined label. In response to the modifications introduced by the user, a plurality of different actions are initiated incorporating the re-segmentation and re-labeling of successive parts of the document or the entire document.Type: ApplicationFiled: August 15, 2011Publication date: April 19, 2012Applicant: Nuance Communications Austria GMBHInventors: Jochen PETERS, Evgeny MATUSOV, Carsten MEYER, Dietrich KLAKOW
-
Patent number: 8150691Abstract: During the replaying of audio data stored in a, which audio data corresponds to text data from a text composed of words, the replaying of the audio data in forward and reverse modes is controlled. Starting from particular momentary replay position in the audio data, a backward jump over a return distance corresponding to the length of about at least two words, to a target position, is automatically initiated for the replaying of the audio data in the reverse mode. Then, starting from the particular target position, a replay of the audio data in the forward sequence for just one part of the return distance is undertaken.Type: GrantFiled: October 13, 2003Date of Patent: April 3, 2012Assignee: Nuance Communications Austria GmbHInventor: Kwaku Frimpong-Ansah
-
Patent number: 8140336Abstract: The invention deals with speech recognition, such as a system for recognizing words in continuous speech. A speech recognition system is disclosed which is capable of recognizing a huge number of words, and in principle even an unlimited number of words. The speech recognition system comprises a word recognizer for deriving a best path through a word graph, and wherein words are assigned to the speech based on the best path. The word score being obtained from applying a phonemic language model to each word of the word graph. Moreover, the invention deals with an apparatus and a method for identifying words from a sound block and to computer readable code for implementing the method.Type: GrantFiled: December 6, 2006Date of Patent: March 20, 2012Assignee: Nuance Communications Austria GmbHInventor: Zsolt Saffer