Patents by Inventor Werner Kriechbaum
Werner Kriechbaum has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 7954044Abstract: A method and apparatus for creating links between a representation, (e.g. text data,) and a realization, (e.g. corresponding audio data,) is provided. According to the invention the realization is structured by combining a time-stamped version of the representation generated from the realization with structural information from the representation. Thereby so called hyper links between representation and realization are created. These hyper links are used for performing search operations in realization data equivalent to those which are possible in representation data, enabling an improved access to the realization (e.g. via audio databases).Type: GrantFiled: May 23, 2008Date of Patent: May 31, 2011Assignee: International Business Machines CorporationInventors: Uwe Fischer, Stefan Hoffmann, Werner Kriechbaum, Gerhard Stenzel
-
Patent number: 7496510Abstract: Disclosed are a method and apparatus for processing a continuous audio stream containing human speech in order to locate a particular speech-based transaction in the audio stream, applying both known speaker recognition and speech recognition techniques. Only the utterances of a particular predetermined speaker are transcribed thus providing an index and a summary of the underlying dialogue(s). In a first scenario, an incoming audio stream, e.g. a speech call from outside, is scanned in order to detect audio segments of the predetermined speaker. These audio segments are then indexed and only the indexed segments are transcribed into spoken or written language. In a second scenario, two or more speakers located in one room are using a multi-user speech recognition system (SRS). For each user there exists a different speaker model and optionally a different dictionary or vocabulary of words already known or trained by the speech or voice recognition system.Type: GrantFiled: November 30, 2001Date of Patent: February 24, 2009Assignee: International Business Machines CorporationInventors: Joachim Frank, Werner Kriechbaum, Gerhard Stenzel
-
Publication number: 20080228490Abstract: A method and apparatus for creating links between a representation, (e.g. text data,) and a realization, (e.g. corresponding audio data,) is provided. According to the invention the realization is structured by combining a time-stamped version of the representation generated from the realization with structural information from the representation. Thereby so called hyper links between representation and realization are created. These hyper links are used for performing search operations in realization data equivalent to those which are possible in representation data, enabling an improved access to the realization (e.g. via audio databases).Type: ApplicationFiled: May 23, 2008Publication date: September 18, 2008Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Uwe Fischer, Stefan Hoffmann, Werner Kriechbaum, Gerhard Stenzel
-
Patent number: 7412643Abstract: A method and apparatus for creating links between a representation, (e.g. text data,) and a realization, (e.g. corresponding audio data,) is provided. According to the invention the realization is structured by combining a time-stamped version of the representation generated from the realization with structural information from the representation. Thereby so called hyper links between representation and realization are created. These hyper links are used for performing search operations in realization data equivalent to those which are possible in representation data, enabling an improved access to the realization (e.g. via audio databases).Type: GrantFiled: November 23, 1999Date of Patent: August 12, 2008Assignee: International Business Machines CorporationInventors: Uwe Fischer, Stefan Hoffmann, Werner Kriechbaum, Gerhard Stenzel
-
Patent number: 7213151Abstract: The present invention relates to a computer system and to a method for encoding of information into a representation comprising a plurality of segments, the order of the segments in the representation being irrelevant for a rendering of the representation, the method comprising the steps of: identification of the segments, permutation of the segments to encode the information.Type: GrantFiled: June 27, 2002Date of Patent: May 1, 2007Assignee: International Business Machines CorporationInventors: Carsten Guenther, Werner Kriechbaum, Siegfried Kunzmann, Bernhard Hubert Zeller
-
Method and system for the automatic segmentation of an audio stream into semantic or syntactic units
Patent number: 7120575Abstract: A digitized speech signal (600) is input to an F0 (fundamental frequency) processor that computes (610) a continuous F0 data from the speech signal. By the criterion voicing state transition (voiced/unvoiced transitions) the speech signal is presegmented (620) into segments. For each segment (630) it is evaluated (640) whether F0 is defined or not defined i.e. whether F0 is ON or OFF. In case of F0=OFF a candidate segment boundary is assumed as described above and, starting from that boundary, prosodic features are computed (650). The feature values are input into a classification tree and each candidate segment is classified thereby revealing, as a result, the existence or non-existence of a semantic or syntactic speech unit.Type: GrantFiled: August 2, 2001Date of Patent: October 10, 2006Assignee: International Business Machines CorporationInventors: Martin Haase, Werner Kriechbaum, Gerhard Stenzel -
Patent number: 7117231Abstract: The present invention provides a method and system for computerized synchronization of an audio stream having a first synchronized textual representation usable for subtitling of the audio stream in the original language, with a second synchronized textual representation which can be used as an alternative subtitle including a transcription of the original language into another language. Time synchronous links can be built between the audio stream and, for instance, the textual representations of the words spoken in the audio stream. More particularly, the second representation can inherit the synchronization between the audio stream and the first representation using structure association information determined between the first and the second representation.Type: GrantFiled: November 27, 2001Date of Patent: October 3, 2006Assignee: International Business Machines CorporationInventors: Uwe Fischer, Stefan Hoffmann, Werner Kriechbaum, Gerhard Stenzel
-
Patent number: 7003090Abstract: The present invention relates to method and system for providing online information in a networked user environment in which an end-user runs an application program and transmits data to an online server while running the application program. It is proposed to provide a request-button at the end-user application program dedicated to requesting information, and in particular help-information. When a help request is received at the communication server, a communication channel is promptly established between end-user and an agent. Information about the user activities sent in one or more transaction parts of an end-user intended business process and performed in the current application program session is read from the storage in the application server and is provided to the terminal of said agent in the help center. Advantageously, the same communication channel as used for performing the transactions is used for voice transmission for providing help or other information to the end-user.Type: GrantFiled: May 3, 2002Date of Patent: February 21, 2006Assignee: International Business Machines CorporationInventors: Werner Kriechbaum, Ronald Pfeifer, Gerhard Stenzel
-
Patent number: 6975985Abstract: The present invention provides a method and system to improve speech recognition using an existing audio realization of a spoken text and a true textual representation of the spoken text. The audio realization and the true textual representation can be aligned to reveal time stamps. A speech recognition can be performed on the audio realization to provide a hypothesis textual representation for the audio realization. The aligned true textual representation can be compared with the hypothesis textual representation. Single word pairs from the true and the hypothesis textual representations can be selected where the representations are different. Similarly, single word pairs can be selected from each representation where the representations are identical. A word or pronunciation database can be updated using the selected single word pairs together with the corresponding aligned audio realization.Type: GrantFiled: November 26, 2001Date of Patent: December 13, 2005Assignee: International Business Machines CorporationInventors: Werner Kriechbaum, Gerhard Stenzel
-
Patent number: 6799158Abstract: A characteristic identifier for digital data is generated. Thereby, the information contained in a digital data set is reduced such that the resulting identifier is made comparable to another identifier made in the same manner. The generated identifiers are used for detecting identical digital data or to determine inexact copies of digital data. In one embodiment of the invention, the digital data is a digital audio signal and the characteristic identifier is called an audio signature. The comparison of identical audio data according to the invention can be carried out without a person actually listening to the audio data. The present invention can be used to establish automated processes to find potential unauthorized copies of audio data, e.g., music recordings, and therefore enables a better enforcement of copyrights in the audio industry.Type: GrantFiled: December 18, 2000Date of Patent: September 28, 2004Assignee: International Business Machines CorporationInventors: Uwe Fischer, Stefan Hoffmann, Werner Kriechbaum, Gerhard Stenzel
-
Patent number: 6768046Abstract: A system and method of generating a link between a note of a digital score and a realization of the score are provided. To do so, a digital score is processed to generate an onset curve. The onset curve is then filtered to generate a first series of first time intervals, which each have a significant number of onsets. A realization of the digital score is also processed to generate a second series of second time intervals, which each have a significant dynamic change of the realization. The first and the second series of time intervals are then correlated to produce the link.Type: GrantFiled: November 14, 2002Date of Patent: July 27, 2004Assignee: International Business Machines CorporationInventors: Werner Kriechbaum, Gerhard Stenzel
-
Publication number: 20040093202Abstract: Disclosed are a computerized method and system for the identification of identical or similar audio recordings or segments of audio recordings. Identity or similarity between a first audio segment of a first audio stream and at least a second audio segment of an at least second audio stream is determined by digitizing at least the first audio segment and the at least second audio segment of said audio streams, calculating characteristic signatures from at least one local feature of the first audio segment and the at least second audio segment, aligning the at least two characteristic signatures, comparing the at least two aligned characteristic signatures and calculating a distance between the aligned characteristic signatures and determining identity or similarity between the at least two audio segments based on the determined distance.Type: ApplicationFiled: September 12, 2003Publication date: May 13, 2004Inventors: Uwe Fischer, Stefan Hoffmann, Werner Kriechbaum, Gerhard Stenzel
-
Publication number: 20030188626Abstract: The invention relates to a method of generating a link between a note of a digital score and a realization of the score, the method comprising the steps of:Type: ApplicationFiled: November 14, 2002Publication date: October 9, 2003Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Werner Kriechbaum, Gerhard Stenzel
-
Publication number: 20030120679Abstract: A method for creating a database index and for storing of a piece of music in a database includes extracting at least one property of the piece of music from a digital score of the piece of music, and creating the database index for the piece of music using the property.Type: ApplicationFiled: October 16, 2002Publication date: June 26, 2003Applicant: International Business Machines CorporationInventors: Werner Kriechbaum, Gerhard Stenzel
-
Publication number: 20030074561Abstract: The present invention relates to a computer system and to a method for encoding of information into a representation comprising a plurality of segments, the order of the segments in the representation being irrelevant for a rendering of the representation, the method comprising the steps of:Type: ApplicationFiled: June 27, 2002Publication date: April 17, 2003Applicant: International Business Machines CorporationInventors: Carsten Guenther, Werner Kriechbaum, Siegfried Kunzmann, Bernhard Hubert Zeller
-
Publication number: 20030043991Abstract: The present invention relates to method and system for providing online information in a networked user environment in which an end-user runs an application program and transmits data to an online server while running the application program. It is proposed to provide a request-button at the end-user application program dedicated to requesting information, and in particular help-information. When a help request is received at the communication server, a communication channel is promptly established between end-user and an agent. Information about the user activities sent in one or more transaction parts of an end-user intended business process and performed in the current application program session is read from the storage in the application server and is provided to the terminal of said agent in the help center. Advantageously, the same communication channel as used for performing the transactions is used for voice transmission for providing help or other information to the end-user.Type: ApplicationFiled: May 3, 2002Publication date: March 6, 2003Applicant: International Business Machines CorporationInventors: Werner Kriechbaum, Ronald Pfeifer, Gerhard Stenzel
-
Publication number: 20030001874Abstract: The present invention relates to method and system for entering information for processing by a computing device. In order to improve the manual entering of information specially in so-called “non-desktop” environments or for visually impaired users as for example blind persons or car drivers while driving a car it is proposed to use a part of the human skin (11) as a feedback organ when entering information. Therefore, an information entering device (10) is provided having a pressure-sensing area (12) for decoding exerted pressure patterns and transform them into an information input signal, and a contact area (16) able to be coupled between said pressure-sensing area (12) and a respective part of the human skin (11) or clothes covering it. The exerted pressure patterns are conducted to the skin (11) whereby a respective sensory feedback is provided to the person.Type: ApplicationFiled: May 9, 2002Publication date: January 2, 2003Applicant: International Business Machines CorporationInventors: Werner Kriechbaum, Ronald Pfeifer, Gerhard Stenzel
-
Publication number: 20020168089Abstract: Disclosed are a method, apparatus, and program for providing authentication of a rendered multimedia realization. A renderer and a watermark generator are integrated wherein the renderer receives a symbolic stream, e.g. in the case of a text-to-speech system a text, and generates a realization, e.g. an audio signal representing a spoken version of the text. An identification is embedded into the signal by the watermark generator using standard steganographic methods. Such a serial integration of renderer and watermark generator is applicable to all known renderers and watermarking techniques. The mechanism enables inheritance of originality of the original representation or realization to the rendered realization.Type: ApplicationFiled: May 9, 2002Publication date: November 14, 2002Applicant: International Business Machines CorporationInventors: Carsten Guenther, Werner Kriechbaum, Siegfried Kunzmann, Bernhard Hubert Zeller
-
Publication number: 20020091517Abstract: Disclosed are a method and apparatus for processing a continuous audio stream containing human speech in order to locate a particular speech-based transaction in the audio stream, applying both known speaker recognition and speech recognition techniques. Hereby it is enabled that only the utterances of a particular predetermined speaker are transcribed thus providing an index and a summary of the underlying dialogue(s).Type: ApplicationFiled: November 30, 2001Publication date: July 11, 2002Applicant: IBM CorporationInventors: Joachim Frank, Werner Kriechbaum, Gerhard Stenzel
-
Publication number: 20020087569Abstract: The present invention provides a method and system for computerized synchronization of an audio stream having a first synchronized textual representation usable for subtitling of the audio stream in the original language, with a second synchronized textual representation which can be used as an alternative subtitle including a transcription of the original language into another language. Time synchronous links can be built between the audio stream and, for instance, the textual representations of the words spoken in the audio stream. More particularly, the second representation can inherit the synchronization between the audio stream and the first representation using structure association information determined between the first and the second representation.Type: ApplicationFiled: November 27, 2001Publication date: July 4, 2002Applicant: International Business Machines CorporationInventors: Uwe Fischer, Stefan Hoffmann, Werner Kriechbaum, Gerhard Stenzel