Abstract: A method and apparatus is provided for annotating video content with metadata generated using speech recognition technology. The method begins by rendering video content on a display device. A segment of speech is received from a user such that the speech segment annotates a portion of the video content currently being rendered. The speech segment is converted to a text-segment and the text-segment is associated with the rendered portion of the video content. The text segment is stored in a selectively retrievable manner so that it is associated with the rendered portion of the video content.
Abstract: A method and apparatus is provided for annotating video content with metadata generated using speech recognition technology. The method begins by rendering video content on a display device. A segment of speech is received from a user such that the speech segment annotates a portion of the video content currently being rendered. The speech segment is converted to a text-segment and the text-segment is associated with the rendered portion of the video content. The text segment is stored in a selectively retrievable manner so that it is associated with the rendered portion of the video content.
Abstract: A video processing device includes a background audio change detector that detects background audio changes in audio data corresponding to particular video data. The video processing device detects semantically meaningful video scenes using detected background audio changes and delimits segments of the video data.
Type:
Grant
Filed:
December 14, 2001
Date of Patent:
September 4, 2007
Assignee:
Hewlett-Packard Development Company, L.P.