Patents by Inventor Krzysztof Marasek
Krzysztof Marasek has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 8635065Abstract: The present invention discloses an apparatus for automatic extraction of important events in audio signals comprising: signal input means for supplying audio signals; audio signal fragmenting means for partitioning audio signals supplied by the signal input means into audio fragments of a predetermined length and for allocating a sequence of one or more audio fragments to a respective audio window; feature extracting means for analyzing acoustic characteristics of the audio signals comprised in the audio fragments and for analyzing acoustic characteristics of the audio signals comprised in the audio windows; and important event extraction means for extracting important events in audio signals supplied by the audio signal fragmenting means based on predetermined important event classifying rules depending on acoustic characteristics of the audio signals comprised in the audio fragments and on acoustic characteristics of the audio signals comprised in the audio windows, wherein each important event extractedType: GrantFiled: November 10, 2004Date of Patent: January 21, 2014Assignee: Sony Deutschland GmbHInventors: Silke Goronzy-Thomae, Thomas Kemp, Ralf Kompe, Yin Hay Lam, Krzysztof Marasek, Raquel Tato
-
Patent number: 7962330Abstract: An apparatus for automatic dissection of segmented audio signals, wherein at least one information signal for identifying programs included in said audio signals and for identifying contents included in said programs. Content detection device detects programs and contents belonging to the respective programs in the information signal. Program weighting device weights each program includes in the information signal based on the contents of the respective program detected by the content detection device. Program ranking device indentifies programmers of the same category and ranking said programs based on a weighting result for each program provided by the program weighting device.Type: GrantFiled: November 10, 2004Date of Patent: June 14, 2011Assignee: Sony Deutschland GmbHInventors: Silke Goronzy, Thomas Kemp, Ralf Kompe, Yin Hay Lam, Krzysztof Marasek, Raquel Tato
-
Patent number: 7680654Abstract: An audio data segmentation apparatus for segmenting of audio data including for supplying audio data, dividing the audio data supplied into audio clips of a predetermined length, discriminating the audio clips into predetermined audio classes, the audio classes identifying a kind of audio data included in the respective audio clip and segmenting for segmenting the audio data into audio meta patterns based on a sequence of audio classes of consecutive audio clips, each meta pattern being allocated to a predetermined type of contents of the audio data. It is difficult to achieve good results with known methods for segmentation of audio data into meta patterns since the rules for the allocation of the meta patterns are dissatisfying.Type: GrantFiled: November 10, 2004Date of Patent: March 16, 2010Assignee: Sony Deutschland GmbHInventors: Silke Goronzy, Thomas Kemp, Ralf Kompe, Yin Hay Lam, Krzysztof Marasek, Raquel Tato
-
Patent number: 7292981Abstract: A method for predicting a misrecognition in a speech recognition system, is based on; the insight that variations in a speech input signal are different depending on the origin of the signal being a speech or a non-speech event. The method comprises steps for receiving a speech input signal, extracting at least one signal variation feature of the speech input signal, and applying a signal variation meter to the speech input signal for deriving a signal variation measure.Type: GrantFiled: October 4, 2004Date of Patent: November 6, 2007Assignee: Sony Deutschland GmbHInventors: Thomas Kemp, Yin Hay Lam, Krzysztof Marasek
-
Publication number: 20050160449Abstract: Apparatus and method for automatic dissection of segmented audio signals According to the present invention, an apparatus for automatic dissection of segmented audio signals, wherein at least one information signal for identifying programmes included in said audio signals and for identifying contents included in said programmes is provided, comprises: content detection means for detecting programmes and contents belonging to the respective programmes in the information signal; programme weighting means for weighting each programme comprised in the information signal based on the contents of the respective programme detected by the content detection means; and programme ranking means for identifying programmes of the same category and ranking said programmes based on a weighting result for each programme provided by the programme weighting means.Type: ApplicationFiled: November 10, 2004Publication date: July 21, 2005Inventors: Silke Goronzy, Thomas Kemp, Ralf Kompe, Yin Lam, Krzysztof Marasek, Raquel Tato
-
Publication number: 20050131688Abstract: An apparatus for classifying audio signals comprises audio signal clipping means for partitioning audio signals into audio clips, and class discrimination means for discriminating the audio clips provided by the audio signal clipping means into predetermined audio classes based on predetermined audio class classifying rules, by analysing acoustic characteristics of the audio signals comprised in the audio clips, wherein a predetermined audio class classifying rule is provided for each audio class, and each audio class represents a respective kind of audio signals comprised in the corresponding audio clip. The determination process to find acceptable audio class classifying rules for each audio class according to the prior art is depending on both the used raw audio signals and the personal experience of the person conducting the determination process. Thus, the determination process usually is very difficult, time consuming and subjective.Type: ApplicationFiled: November 10, 2004Publication date: June 16, 2005Inventors: Silke Goronzy, Thomas Kemp, Ralf Kompe, Yin Lam, Krzysztof Marasek, Raquel Tato
-
Publication number: 20050120368Abstract: A method and an apparatus for effecting the method are proposed that allow to define a subset of video signals from a source set of video signals on the basis of meta data available for the source set of video signals. The meta data assign a generic term to a sub-section of the audio channel of the source set of video signals, a class description to one or more sub-units of the sub-section for classifying the origin of the respective sub-unit, a category allocation to a segment, which is formed by a string of one or more classified sub-units of a sub-section, and a rating value to the segment for rating the reliability of the category allocation of the segment. The method includes steps for selecting segments of a sub-section with a rating value above a defined threshold value, assigning a priority value to each category, and specifying a first subset of video signals by defining an arrangement of selected segments by an order based on the respective priority and rating values related to each segment.Type: ApplicationFiled: November 10, 2004Publication date: June 2, 2005Inventors: Silke Goronzy, Thomas Kemp, Ralf Kompe, Yin Lam, Krzysztof Marasek, Raquel Tato
-
Publication number: 20050114388Abstract: An audio data segmentation apparatus for segmenting of audio data comprises audio data input means for supplying audio data, audio data clipping means for dividing the audio data supplied by the audio data input means into audio clips of a predetermined length, class discrimination means for discriminating the audio clips supplied by the audio data clipping means into predetermined audio classes, the audio classes identifying a kind of audio data included in the respective audio clip and segmenting means for segmenting the audio data into audio meta patterns based on a sequence of audio classes of consecutive audio clips, each meta pattern being allocated to a predetermined type of contents of the audio data. It is difficult to achieve good results with known methods for segmentation of audio data into meta patterns since the rules for the allocation of the meta patterns are dissatisfying.Type: ApplicationFiled: November 10, 2004Publication date: May 26, 2005Inventors: Silke Goronzy, Thomas Kemp, Ralf Kompe, Yin Lam, Krzysztof Marasek, Raquel Tato
-
Publication number: 20050114135Abstract: Based on the insight that variations in a speech input signal are different depending on the origin of the signal being a speech or a non-speech event, the present invention proposes method for predicting a misrecognition in a speech recognition system with steps for receiving a speech input signal, extracting at least one signal variation feature of the speech input signal, and applying a signal variation meter to the speech input signal for deriving a signal variation measure.Type: ApplicationFiled: October 4, 2004Publication date: May 26, 2005Inventors: Thomas Kemp, Yin Hay Lam, Krzysztof Marasek
-
Publication number: 20050102135Abstract: The present invention discloses an apparatus for automatic extraction of important events in audio signals comprising: signal input means for supplying audio signals; audio signal fragmenting means for partitioning audio signals supplied by the signal input means into audio fragments of a predetermined length and for allocating a sequence of one or more audio fragments to a respective audio window; feature extracting means for analysing acoustic characteristics of the audio signals comprised in the audio fragments and for analysing acoustic characteristics of the audio signals comprised in the audio windows; and important event extraction means for extracting important events in audio signals supplied by the audio signal fragmenting means based on predetermined important event classifying rules depending on acoustic characteristics of the audio signals comprised in the audio fragments and on acoustic characteristics of the audio signals comprised in the audio windows, wherein each important event extracted byType: ApplicationFiled: November 10, 2004Publication date: May 12, 2005Inventors: Silke Goronzy, Thomas Kemp, Ralf Kompe, Yin Lam, Krzysztof Marasek, Raquel Tato
-
Patent number: 6882972Abstract: To avoid an over-adaptation of a current acoustic model (CAM) to certain and frequently occuring words for speech phrases during on-line speaker adaptation of speech recognizers it is suggested to count adaptation numbers (aj) for each of said speech phrases (SPj) as numbers of times in that a distinct speech phrase (SPj) has been used as a basis for adapting said current acoustic model (CAM) and further to make the strength of adaptation of the current acoustic model (CAM) on the basis of said distinct speech phrase (SPj) dependent on its specific adaptation number (aj) so as to decrease the influence of frequent speech phrases (SPj) in the received speech flow on the adaptation process.Type: GrantFiled: October 5, 2001Date of Patent: April 19, 2005Assignee: Sony International (Europe) GmbHInventors: Ralf Kompe, Silke Goronzy, Krzysztof Marasek
-
Publication number: 20030069728Abstract: To detect and determine a current emotional state (CES) of a human being from a spoken speech input (SI), it is suggested in a method for detecting emotions to identify first and second feature classes (A, E) with, in particular distinct, dimensions of an underlying emotional manifold (EM) or emotional space (ES) and/or with subspaces thereof.Type: ApplicationFiled: October 4, 2002Publication date: April 10, 2003Inventors: Raquel Tato, Thomas Kemp, Krzysztof Marasek
-
Publication number: 20020082833Abstract: To increase the performance rate of large vocabulary continuous speech recognition applications it is suggested to first give only a rough estimation on whether or not recognized utterance (U) is accepted or rejected in its entirety. In the case of an acceptance of the utterance (U) a thorough reanalysis is performed afterwards to extract the meaning, intention, contained key-phrases/keywords and the confidence of the contained key-phrases/keywords. Therefore, the computational burden is focussed on the important sections of the utterance (U), namely on the key-phrases/keywords.Type: ApplicationFiled: November 14, 2001Publication date: June 27, 2002Inventors: Krzysztof Marasek, Thomas Kemp, Silke Goronzy, Ralf Kompe
-
Publication number: 20020072894Abstract: To avoid an over-adaptation of a current acoustic model (CAM) to certain and frequently occuring words for speech phrases during on-line speaker adaptation of speech recognizers it is suggested to count adaptation numbers (aj) for each of said speech phrases (SPj) as numbers of times in that a distinct speech phrase (SPj) has been used as a basis for adapting said current acoustic model (CAM) and further to make the strength of adaptation of the current acoustic model (CAM) on the basis of said distinct speech phrase (SPj) dependent on its specific adaptation number (aj) so as to decrease the influence of frequent speech phrases (SPj) in the received speech flow on the adaptation process.Type: ApplicationFiled: October 5, 2001Publication date: June 13, 2002Inventors: Ralf Kompe, Silke Goronzy, Krzysztof Marasek