Patents by Inventor Werner Kriechbaum

Werner Kriechbaum has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 7954044
    Abstract: A method and apparatus for creating links between a representation, (e.g. text data,) and a realization, (e.g. corresponding audio data,) is provided. According to the invention the realization is structured by combining a time-stamped version of the representation generated from the realization with structural information from the representation. Thereby so called hyper links between representation and realization are created. These hyper links are used for performing search operations in realization data equivalent to those which are possible in representation data, enabling an improved access to the realization (e.g. via audio databases).
    Type: Grant
    Filed: May 23, 2008
    Date of Patent: May 31, 2011
    Assignee: International Business Machines Corporation
    Inventors: Uwe Fischer, Stefan Hoffmann, Werner Kriechbaum, Gerhard Stenzel
  • Patent number: 7496510
    Abstract: Disclosed are a method and apparatus for processing a continuous audio stream containing human speech in order to locate a particular speech-based transaction in the audio stream, applying both known speaker recognition and speech recognition techniques. Only the utterances of a particular predetermined speaker are transcribed thus providing an index and a summary of the underlying dialogue(s). In a first scenario, an incoming audio stream, e.g. a speech call from outside, is scanned in order to detect audio segments of the predetermined speaker. These audio segments are then indexed and only the indexed segments are transcribed into spoken or written language. In a second scenario, two or more speakers located in one room are using a multi-user speech recognition system (SRS). For each user there exists a different speaker model and optionally a different dictionary or vocabulary of words already known or trained by the speech or voice recognition system.
    Type: Grant
    Filed: November 30, 2001
    Date of Patent: February 24, 2009
    Assignee: International Business Machines Corporation
    Inventors: Joachim Frank, Werner Kriechbaum, Gerhard Stenzel
  • Publication number: 20080228490
    Abstract: A method and apparatus for creating links between a representation, (e.g. text data,) and a realization, (e.g. corresponding audio data,) is provided. According to the invention the realization is structured by combining a time-stamped version of the representation generated from the realization with structural information from the representation. Thereby so called hyper links between representation and realization are created. These hyper links are used for performing search operations in realization data equivalent to those which are possible in representation data, enabling an improved access to the realization (e.g. via audio databases).
    Type: Application
    Filed: May 23, 2008
    Publication date: September 18, 2008
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Uwe Fischer, Stefan Hoffmann, Werner Kriechbaum, Gerhard Stenzel
  • Patent number: 7412643
    Abstract: A method and apparatus for creating links between a representation, (e.g. text data,) and a realization, (e.g. corresponding audio data,) is provided. According to the invention the realization is structured by combining a time-stamped version of the representation generated from the realization with structural information from the representation. Thereby so called hyper links between representation and realization are created. These hyper links are used for performing search operations in realization data equivalent to those which are possible in representation data, enabling an improved access to the realization (e.g. via audio databases).
    Type: Grant
    Filed: November 23, 1999
    Date of Patent: August 12, 2008
    Assignee: International Business Machines Corporation
    Inventors: Uwe Fischer, Stefan Hoffmann, Werner Kriechbaum, Gerhard Stenzel
  • Patent number: 7213151
    Abstract: The present invention relates to a computer system and to a method for encoding of information into a representation comprising a plurality of segments, the order of the segments in the representation being irrelevant for a rendering of the representation, the method comprising the steps of: identification of the segments, permutation of the segments to encode the information.
    Type: Grant
    Filed: June 27, 2002
    Date of Patent: May 1, 2007
    Assignee: International Business Machines Corporation
    Inventors: Carsten Guenther, Werner Kriechbaum, Siegfried Kunzmann, Bernhard Hubert Zeller
  • Patent number: 7120575
    Abstract: A digitized speech signal (600) is input to an F0 (fundamental frequency) processor that computes (610) a continuous F0 data from the speech signal. By the criterion voicing state transition (voiced/unvoiced transitions) the speech signal is presegmented (620) into segments. For each segment (630) it is evaluated (640) whether F0 is defined or not defined i.e. whether F0 is ON or OFF. In case of F0=OFF a candidate segment boundary is assumed as described above and, starting from that boundary, prosodic features are computed (650). The feature values are input into a classification tree and each candidate segment is classified thereby revealing, as a result, the existence or non-existence of a semantic or syntactic speech unit.
    Type: Grant
    Filed: August 2, 2001
    Date of Patent: October 10, 2006
    Assignee: International Business Machines Corporation
    Inventors: Martin Haase, Werner Kriechbaum, Gerhard Stenzel
  • Patent number: 7117231
    Abstract: The present invention provides a method and system for computerized synchronization of an audio stream having a first synchronized textual representation usable for subtitling of the audio stream in the original language, with a second synchronized textual representation which can be used as an alternative subtitle including a transcription of the original language into another language. Time synchronous links can be built between the audio stream and, for instance, the textual representations of the words spoken in the audio stream. More particularly, the second representation can inherit the synchronization between the audio stream and the first representation using structure association information determined between the first and the second representation.
    Type: Grant
    Filed: November 27, 2001
    Date of Patent: October 3, 2006
    Assignee: International Business Machines Corporation
    Inventors: Uwe Fischer, Stefan Hoffmann, Werner Kriechbaum, Gerhard Stenzel
  • Patent number: 7003090
    Abstract: The present invention relates to method and system for providing online information in a networked user environment in which an end-user runs an application program and transmits data to an online server while running the application program. It is proposed to provide a request-button at the end-user application program dedicated to requesting information, and in particular help-information. When a help request is received at the communication server, a communication channel is promptly established between end-user and an agent. Information about the user activities sent in one or more transaction parts of an end-user intended business process and performed in the current application program session is read from the storage in the application server and is provided to the terminal of said agent in the help center. Advantageously, the same communication channel as used for performing the transactions is used for voice transmission for providing help or other information to the end-user.
    Type: Grant
    Filed: May 3, 2002
    Date of Patent: February 21, 2006
    Assignee: International Business Machines Corporation
    Inventors: Werner Kriechbaum, Ronald Pfeifer, Gerhard Stenzel
  • Patent number: 6975985
    Abstract: The present invention provides a method and system to improve speech recognition using an existing audio realization of a spoken text and a true textual representation of the spoken text. The audio realization and the true textual representation can be aligned to reveal time stamps. A speech recognition can be performed on the audio realization to provide a hypothesis textual representation for the audio realization. The aligned true textual representation can be compared with the hypothesis textual representation. Single word pairs from the true and the hypothesis textual representations can be selected where the representations are different. Similarly, single word pairs can be selected from each representation where the representations are identical. A word or pronunciation database can be updated using the selected single word pairs together with the corresponding aligned audio realization.
    Type: Grant
    Filed: November 26, 2001
    Date of Patent: December 13, 2005
    Assignee: International Business Machines Corporation
    Inventors: Werner Kriechbaum, Gerhard Stenzel
  • Patent number: 6799158
    Abstract: A characteristic identifier for digital data is generated. Thereby, the information contained in a digital data set is reduced such that the resulting identifier is made comparable to another identifier made in the same manner. The generated identifiers are used for detecting identical digital data or to determine inexact copies of digital data. In one embodiment of the invention, the digital data is a digital audio signal and the characteristic identifier is called an audio signature. The comparison of identical audio data according to the invention can be carried out without a person actually listening to the audio data. The present invention can be used to establish automated processes to find potential unauthorized copies of audio data, e.g., music recordings, and therefore enables a better enforcement of copyrights in the audio industry.
    Type: Grant
    Filed: December 18, 2000
    Date of Patent: September 28, 2004
    Assignee: International Business Machines Corporation
    Inventors: Uwe Fischer, Stefan Hoffmann, Werner Kriechbaum, Gerhard Stenzel
  • Patent number: 6768046
    Abstract: A system and method of generating a link between a note of a digital score and a realization of the score are provided. To do so, a digital score is processed to generate an onset curve. The onset curve is then filtered to generate a first series of first time intervals, which each have a significant number of onsets. A realization of the digital score is also processed to generate a second series of second time intervals, which each have a significant dynamic change of the realization. The first and the second series of time intervals are then correlated to produce the link.
    Type: Grant
    Filed: November 14, 2002
    Date of Patent: July 27, 2004
    Assignee: International Business Machines Corporation
    Inventors: Werner Kriechbaum, Gerhard Stenzel
  • Publication number: 20040093202
    Abstract: Disclosed are a computerized method and system for the identification of identical or similar audio recordings or segments of audio recordings. Identity or similarity between a first audio segment of a first audio stream and at least a second audio segment of an at least second audio stream is determined by digitizing at least the first audio segment and the at least second audio segment of said audio streams, calculating characteristic signatures from at least one local feature of the first audio segment and the at least second audio segment, aligning the at least two characteristic signatures, comparing the at least two aligned characteristic signatures and calculating a distance between the aligned characteristic signatures and determining identity or similarity between the at least two audio segments based on the determined distance.
    Type: Application
    Filed: September 12, 2003
    Publication date: May 13, 2004
    Inventors: Uwe Fischer, Stefan Hoffmann, Werner Kriechbaum, Gerhard Stenzel
  • Publication number: 20030188626
    Abstract: The invention relates to a method of generating a link between a note of a digital score and a realization of the score, the method comprising the steps of:
    Type: Application
    Filed: November 14, 2002
    Publication date: October 9, 2003
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Werner Kriechbaum, Gerhard Stenzel
  • Publication number: 20030120679
    Abstract: A method for creating a database index and for storing of a piece of music in a database includes extracting at least one property of the piece of music from a digital score of the piece of music, and creating the database index for the piece of music using the property.
    Type: Application
    Filed: October 16, 2002
    Publication date: June 26, 2003
    Applicant: International Business Machines Corporation
    Inventors: Werner Kriechbaum, Gerhard Stenzel
  • Publication number: 20030074561
    Abstract: The present invention relates to a computer system and to a method for encoding of information into a representation comprising a plurality of segments, the order of the segments in the representation being irrelevant for a rendering of the representation, the method comprising the steps of:
    Type: Application
    Filed: June 27, 2002
    Publication date: April 17, 2003
    Applicant: International Business Machines Corporation
    Inventors: Carsten Guenther, Werner Kriechbaum, Siegfried Kunzmann, Bernhard Hubert Zeller
  • Publication number: 20030043991
    Abstract: The present invention relates to method and system for providing online information in a networked user environment in which an end-user runs an application program and transmits data to an online server while running the application program. It is proposed to provide a request-button at the end-user application program dedicated to requesting information, and in particular help-information. When a help request is received at the communication server, a communication channel is promptly established between end-user and an agent. Information about the user activities sent in one or more transaction parts of an end-user intended business process and performed in the current application program session is read from the storage in the application server and is provided to the terminal of said agent in the help center. Advantageously, the same communication channel as used for performing the transactions is used for voice transmission for providing help or other information to the end-user.
    Type: Application
    Filed: May 3, 2002
    Publication date: March 6, 2003
    Applicant: International Business Machines Corporation
    Inventors: Werner Kriechbaum, Ronald Pfeifer, Gerhard Stenzel
  • Publication number: 20030001874
    Abstract: The present invention relates to method and system for entering information for processing by a computing device. In order to improve the manual entering of information specially in so-called “non-desktop” environments or for visually impaired users as for example blind persons or car drivers while driving a car it is proposed to use a part of the human skin (11) as a feedback organ when entering information. Therefore, an information entering device (10) is provided having a pressure-sensing area (12) for decoding exerted pressure patterns and transform them into an information input signal, and a contact area (16) able to be coupled between said pressure-sensing area (12) and a respective part of the human skin (11) or clothes covering it. The exerted pressure patterns are conducted to the skin (11) whereby a respective sensory feedback is provided to the person.
    Type: Application
    Filed: May 9, 2002
    Publication date: January 2, 2003
    Applicant: International Business Machines Corporation
    Inventors: Werner Kriechbaum, Ronald Pfeifer, Gerhard Stenzel
  • Publication number: 20020168089
    Abstract: Disclosed are a method, apparatus, and program for providing authentication of a rendered multimedia realization. A renderer and a watermark generator are integrated wherein the renderer receives a symbolic stream, e.g. in the case of a text-to-speech system a text, and generates a realization, e.g. an audio signal representing a spoken version of the text. An identification is embedded into the signal by the watermark generator using standard steganographic methods. Such a serial integration of renderer and watermark generator is applicable to all known renderers and watermarking techniques. The mechanism enables inheritance of originality of the original representation or realization to the rendered realization.
    Type: Application
    Filed: May 9, 2002
    Publication date: November 14, 2002
    Applicant: International Business Machines Corporation
    Inventors: Carsten Guenther, Werner Kriechbaum, Siegfried Kunzmann, Bernhard Hubert Zeller
  • Publication number: 20020091517
    Abstract: Disclosed are a method and apparatus for processing a continuous audio stream containing human speech in order to locate a particular speech-based transaction in the audio stream, applying both known speaker recognition and speech recognition techniques. Hereby it is enabled that only the utterances of a particular predetermined speaker are transcribed thus providing an index and a summary of the underlying dialogue(s).
    Type: Application
    Filed: November 30, 2001
    Publication date: July 11, 2002
    Applicant: IBM Corporation
    Inventors: Joachim Frank, Werner Kriechbaum, Gerhard Stenzel
  • Publication number: 20020087569
    Abstract: The present invention provides a method and system for computerized synchronization of an audio stream having a first synchronized textual representation usable for subtitling of the audio stream in the original language, with a second synchronized textual representation which can be used as an alternative subtitle including a transcription of the original language into another language. Time synchronous links can be built between the audio stream and, for instance, the textual representations of the words spoken in the audio stream. More particularly, the second representation can inherit the synchronization between the audio stream and the first representation using structure association information determined between the first and the second representation.
    Type: Application
    Filed: November 27, 2001
    Publication date: July 4, 2002
    Applicant: International Business Machines Corporation
    Inventors: Uwe Fischer, Stefan Hoffmann, Werner Kriechbaum, Gerhard Stenzel