Patents by Inventor Werner Kriechbaum

Werner Kriechbaum has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Method and apparatus for linking representation and realization data

Patent number: 7954044

Abstract: A method and apparatus for creating links between a representation, (e.g. text data,) and a realization, (e.g. corresponding audio data,) is provided. According to the invention the realization is structured by combining a time-stamped version of the representation generated from the realization with structural information from the representation. Thereby so called hyper links between representation and realization are created. These hyper links are used for performing search operations in realization data equivalent to those which are possible in representation data, enabling an improved access to the realization (e.g. via audio databases).

Type: Grant

Filed: May 23, 2008

Date of Patent: May 31, 2011

Assignee: International Business Machines Corporation

Inventors: Uwe Fischer, Stefan Hoffmann, Werner Kriechbaum, Gerhard Stenzel
Method and apparatus for the automatic separating and indexing of multi-speaker conversations

Patent number: 7496510

Abstract: Disclosed are a method and apparatus for processing a continuous audio stream containing human speech in order to locate a particular speech-based transaction in the audio stream, applying both known speaker recognition and speech recognition techniques. Only the utterances of a particular predetermined speaker are transcribed thus providing an index and a summary of the underlying dialogue(s). In a first scenario, an incoming audio stream, e.g. a speech call from outside, is scanned in order to detect audio segments of the predetermined speaker. These audio segments are then indexed and only the indexed segments are transcribed into spoken or written language. In a second scenario, two or more speakers located in one room are using a multi-user speech recognition system (SRS). For each user there exists a different speaker model and optionally a different dictionary or vocabulary of words already known or trained by the speech or voice recognition system.

Type: Grant

Filed: November 30, 2001

Date of Patent: February 24, 2009

Assignee: International Business Machines Corporation

Inventors: Joachim Frank, Werner Kriechbaum, Gerhard Stenzel
METHOD AND APPARATUS FOR LINKING REPRESENTATION AND REALIZATION DATA

Publication number: 20080228490

Abstract: A method and apparatus for creating links between a representation, (e.g. text data,) and a realization, (e.g. corresponding audio data,) is provided. According to the invention the realization is structured by combining a time-stamped version of the representation generated from the realization with structural information from the representation. Thereby so called hyper links between representation and realization are created. These hyper links are used for performing search operations in realization data equivalent to those which are possible in representation data, enabling an improved access to the realization (e.g. via audio databases).

Type: Application

Filed: May 23, 2008

Publication date: September 18, 2008

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Uwe Fischer, Stefan Hoffmann, Werner Kriechbaum, Gerhard Stenzel
Method and apparatus for linking representation and realization data

Patent number: 7412643

Abstract: A method and apparatus for creating links between a representation, (e.g. text data,) and a realization, (e.g. corresponding audio data,) is provided. According to the invention the realization is structured by combining a time-stamped version of the representation generated from the realization with structural information from the representation. Thereby so called hyper links between representation and realization are created. These hyper links are used for performing search operations in realization data equivalent to those which are possible in representation data, enabling an improved access to the realization (e.g. via audio databases).

Type: Grant

Filed: November 23, 1999

Date of Patent: August 12, 2008

Assignee: International Business Machines Corporation

Inventors: Uwe Fischer, Stefan Hoffmann, Werner Kriechbaum, Gerhard Stenzel
Method and computer system for encoding of information into a representation

Patent number: 7213151

Abstract: The present invention relates to a computer system and to a method for encoding of information into a representation comprising a plurality of segments, the order of the segments in the representation being irrelevant for a rendering of the representation, the method comprising the steps of: identification of the segments, permutation of the segments to encode the information.

Type: Grant

Filed: June 27, 2002

Date of Patent: May 1, 2007

Assignee: International Business Machines Corporation

Inventors: Carsten Guenther, Werner Kriechbaum, Siegfried Kunzmann, Bernhard Hubert Zeller
Method and system for the automatic segmentation of an audio stream into semantic or syntactic units

Patent number: 7120575

Abstract: A digitized speech signal (600) is input to an F0 (fundamental frequency) processor that computes (610) a continuous F0 data from the speech signal. By the criterion voicing state transition (voiced/unvoiced transitions) the speech signal is presegmented (620) into segments. For each segment (630) it is evaluated (640) whether F0 is defined or not defined i.e. whether F0 is ON or OFF. In case of F0=OFF a candidate segment boundary is assumed as described above and, starting from that boundary, prosodic features are computed (650). The feature values are input into a classification tree and each candidate segment is classified thereby revealing, as a result, the existence or non-existence of a semantic or syntactic speech unit.

Type: Grant

Filed: August 2, 2001

Date of Patent: October 10, 2006

Assignee: International Business Machines Corporation

Inventors: Martin Haase, Werner Kriechbaum, Gerhard Stenzel
Method and system for the automatic generation of multi-lingual synchronized sub-titles for audiovisual data

Patent number: 7117231

Abstract: The present invention provides a method and system for computerized synchronization of an audio stream having a first synchronized textual representation usable for subtitling of the audio stream in the original language, with a second synchronized textual representation which can be used as an alternative subtitle including a transcription of the original language into another language. Time synchronous links can be built between the audio stream and, for instance, the textual representations of the words spoken in the audio stream. More particularly, the second representation can inherit the synchronization between the audio stream and the first representation using structure association information determined between the first and the second representation.

Type: Grant

Filed: November 27, 2001

Date of Patent: October 3, 2006

Assignee: International Business Machines Corporation

Inventors: Uwe Fischer, Stefan Hoffmann, Werner Kriechbaum, Gerhard Stenzel
Method of establishing a communication channel to intelligent support for ebusiness applications

Patent number: 7003090

Abstract: The present invention relates to method and system for providing online information in a networked user environment in which an end-user runs an application program and transmits data to an online server while running the application program. It is proposed to provide a request-button at the end-user application program dedicated to requesting information, and in particular help-information. When a help request is received at the communication server, a communication channel is promptly established between end-user and an agent. Information about the user activities sent in one or more transaction parts of an end-user intended business process and performed in the current application program session is read from the storage in the application server and is provided to the terminal of said agent in the help center. Advantageously, the same communication channel as used for performing the transactions is used for voice transmission for providing help or other information to the end-user.

Type: Grant

Filed: May 3, 2002

Date of Patent: February 21, 2006

Assignee: International Business Machines Corporation

Inventors: Werner Kriechbaum, Ronald Pfeifer, Gerhard Stenzel
Method and system for the automatic amendment of speech recognition vocabularies

Patent number: 6975985

Abstract: The present invention provides a method and system to improve speech recognition using an existing audio realization of a spoken text and a true textual representation of the spoken text. The audio realization and the true textual representation can be aligned to reveal time stamps. A speech recognition can be performed on the audio realization to provide a hypothesis textual representation for the audio realization. The aligned true textual representation can be compared with the hypothesis textual representation. Single word pairs from the true and the hypothesis textual representations can be selected where the representations are different. Similarly, single word pairs can be selected from each representation where the representations are identical. A word or pronunciation database can be updated using the selected single word pairs together with the corresponding aligned audio realization.

Type: Grant

Filed: November 26, 2001

Date of Patent: December 13, 2005

Assignee: International Business Machines Corporation

Inventors: Werner Kriechbaum, Gerhard Stenzel
Method and system for generating a characteristic identifier for digital data and for detecting identical digital data

Patent number: 6799158

Abstract: A characteristic identifier for digital data is generated. Thereby, the information contained in a digital data set is reduced such that the resulting identifier is made comparable to another identifier made in the same manner. The generated identifiers are used for detecting identical digital data or to determine inexact copies of digital data. In one embodiment of the invention, the digital data is a digital audio signal and the characteristic identifier is called an audio signature. The comparison of identical audio data according to the invention can be carried out without a person actually listening to the audio data. The present invention can be used to establish automated processes to find potential unauthorized copies of audio data, e.g., music recordings, and therefore enables a better enforcement of copyrights in the audio industry.

Type: Grant

Filed: December 18, 2000

Date of Patent: September 28, 2004

Assignee: International Business Machines Corporation

Inventors: Uwe Fischer, Stefan Hoffmann, Werner Kriechbaum, Gerhard Stenzel
Method of generating a link between a note of a digital score and a realization of the score

Patent number: 6768046

Abstract: A system and method of generating a link between a note of a digital score and a realization of the score are provided. To do so, a digital score is processed to generate an onset curve. The onset curve is then filtered to generate a first series of first time intervals, which each have a significant number of onsets. A realization of the digital score is also processed to generate a second series of second time intervals, which each have a significant dynamic change of the realization. The first and the second series of time intervals are then correlated to produce the link.

Type: Grant

Filed: November 14, 2002

Date of Patent: July 27, 2004

Assignee: International Business Machines Corporation

Inventors: Werner Kriechbaum, Gerhard Stenzel
Method and system for the automatic detection of similar or identical segments in audio recordings

Publication number: 20040093202

Abstract: Disclosed are a computerized method and system for the identification of identical or similar audio recordings or segments of audio recordings. Identity or similarity between a first audio segment of a first audio stream and at least a second audio segment of an at least second audio stream is determined by digitizing at least the first audio segment and the at least second audio segment of said audio streams, calculating characteristic signatures from at least one local feature of the first audio segment and the at least second audio segment, aligning the at least two characteristic signatures, comparing the at least two aligned characteristic signatures and calculating a distance between the aligned characteristic signatures and determining identity or similarity between the at least two audio segments based on the determined distance.

Type: Application

Filed: September 12, 2003

Publication date: May 13, 2004

Inventors: Uwe Fischer, Stefan Hoffmann, Werner Kriechbaum, Gerhard Stenzel
Method of generating a link between a note of a digital score and a realization of the score

Publication number: 20030188626

Abstract: The invention relates to a method of generating a link between a note of a digital score and a realization of the score, the method comprising the steps of:

Type: Application

Filed: November 14, 2002

Publication date: October 9, 2003

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Werner Kriechbaum, Gerhard Stenzel
Method for creating a database index for a piece of music and for retrieval of piece of music

Publication number: 20030120679

Abstract: A method for creating a database index and for storing of a piece of music in a database includes extracting at least one property of the piece of music from a digital score of the piece of music, and creating the database index for the piece of music using the property.

Type: Application

Filed: October 16, 2002

Publication date: June 26, 2003

Applicant: International Business Machines Corporation

Inventors: Werner Kriechbaum, Gerhard Stenzel
Method and computer system for encoding of information into a representation

Publication number: 20030074561

Abstract: The present invention relates to a computer system and to a method for encoding of information into a representation comprising a plurality of segments, the order of the segments in the representation being irrelevant for a rendering of the representation, the method comprising the steps of:

Type: Application

Filed: June 27, 2002

Publication date: April 17, 2003

Applicant: International Business Machines Corporation

Inventors: Carsten Guenther, Werner Kriechbaum, Siegfried Kunzmann, Bernhard Hubert Zeller
Method of establishing a communication channel to intelligent support for ebusiness applications

Publication number: 20030043991

Abstract: The present invention relates to method and system for providing online information in a networked user environment in which an end-user runs an application program and transmits data to an online server while running the application program. It is proposed to provide a request-button at the end-user application program dedicated to requesting information, and in particular help-information. When a help request is received at the communication server, a communication channel is promptly established between end-user and an agent. Information about the user activities sent in one or more transaction parts of an end-user intended business process and performed in the current application program session is read from the storage in the application server and is provided to the terminal of said agent in the help center. Advantageously, the same communication channel as used for performing the transactions is used for voice transmission for providing help or other information to the end-user.

Type: Application

Filed: May 3, 2002

Publication date: March 6, 2003

Applicant: International Business Machines Corporation

Inventors: Werner Kriechbaum, Ronald Pfeifer, Gerhard Stenzel
Method and apparatus for computer input using the skin as sensory feedback

Publication number: 20030001874

Abstract: The present invention relates to method and system for entering information for processing by a computing device. In order to improve the manual entering of information specially in so-called “non-desktop” environments or for visually impaired users as for example blind persons or car drivers while driving a car it is proposed to use a part of the human skin (11) as a feedback organ when entering information. Therefore, an information entering device (10) is provided having a pressure-sensing area (12) for decoding exerted pressure patterns and transform them into an information input signal, and a contact area (16) able to be coupled between said pressure-sensing area (12) and a respective part of the human skin (11) or clothes covering it. The exerted pressure patterns are conducted to the skin (11) whereby a respective sensory feedback is provided to the person.

Type: Application

Filed: May 9, 2002

Publication date: January 2, 2003

Applicant: International Business Machines Corporation

Inventors: Werner Kriechbaum, Ronald Pfeifer, Gerhard Stenzel
Method and apparatus for providing authentication of a rendered realization

Publication number: 20020168089

Abstract: Disclosed are a method, apparatus, and program for providing authentication of a rendered multimedia realization. A renderer and a watermark generator are integrated wherein the renderer receives a symbolic stream, e.g. in the case of a text-to-speech system a text, and generates a realization, e.g. an audio signal representing a spoken version of the text. An identification is embedded into the signal by the watermark generator using standard steganographic methods. Such a serial integration of renderer and watermark generator is applicable to all known renderers and watermarking techniques. The mechanism enables inheritance of originality of the original representation or realization to the rendered realization.

Type: Application

Filed: May 9, 2002

Publication date: November 14, 2002

Applicant: International Business Machines Corporation

Inventors: Carsten Guenther, Werner Kriechbaum, Siegfried Kunzmann, Bernhard Hubert Zeller
Method and apparatus for the automatic separating and indexing of multi-speaker conversations

Publication number: 20020091517

Abstract: Disclosed are a method and apparatus for processing a continuous audio stream containing human speech in order to locate a particular speech-based transaction in the audio stream, applying both known speaker recognition and speech recognition techniques. Hereby it is enabled that only the utterances of a particular predetermined speaker are transcribed thus providing an index and a summary of the underlying dialogue(s).

Type: Application

Filed: November 30, 2001

Publication date: July 11, 2002

Applicant: IBM Corporation

Inventors: Joachim Frank, Werner Kriechbaum, Gerhard Stenzel
Method and system for the automatic generation of multi-lingual synchronized sub-titles for audiovisual data

Publication number: 20020087569

Abstract: The present invention provides a method and system for computerized synchronization of an audio stream having a first synchronized textual representation usable for subtitling of the audio stream in the original language, with a second synchronized textual representation which can be used as an alternative subtitle including a transcription of the original language into another language. Time synchronous links can be built between the audio stream and, for instance, the textual representations of the words spoken in the audio stream. More particularly, the second representation can inherit the synchronization between the audio stream and the first representation using structure association information determined between the first and the second representation.

Type: Application

Filed: November 27, 2001

Publication date: July 4, 2002

Applicant: International Business Machines Corporation

Inventors: Uwe Fischer, Stefan Hoffmann, Werner Kriechbaum, Gerhard Stenzel

1 2 next