Patents by Inventor Jonathan T. Foote

Jonathan T. Foote has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Intersection detection in panoramic video

Publication number: 20040004659

Abstract: Provides a system for detecting an intersection between more than one panoramic video sequence and detecting the orientation of the sequences forming the intersection. Video images and corresponding location data are received. If required, the images and location data is processed to ensure the images contain location data. An intersection between two paths is then derived from the video images by deriving a rough intersection between two images, determining a neighborhood for the two images, and dividing each image in the neighborhood into strips. An identifying value is derived from each strip to create a row of strip values which are then converted to the frequency domain. A distance measure is taken between strips in the frequency domain, and the intersection is determined from the images having the smallest distance measure between them.

Type: Application

Filed: July 2, 2002

Publication date: January 8, 2004

Inventors: Jonathan T. Foote, Donald Kimber, Xinding Sun, John Adcock
Method and system for retrieving and sequencing music by rhythmic similarity

Publication number: 20030205124

Abstract: A method for measuring the similarity between the beat spectra of two or more audio works. A distance formula is used to measure the similarity by rhythm and tempo between shortened beat spectra B1(L) and B2(L). The result is a vector which measures the similarity of rhythm and tempo. A distance formula is used to measure the rhythmic similarity between the scaled beat spectra B1(L) and B2(L). The result is a measure of rhythmically similar music regardless of the tempo. The method can be used in a wide variety of applications, including concatenating music with similar tempos, automatic music sequencing, classification of music into genres, search for music with similar rhythmic structures, search for music with similar rhythmic and tempo structures, and ranking music according to a similarity measure.

Type: Application

Filed: April 1, 2003

Publication date: November 6, 2003

Inventors: Jonathan T. Foote, Matthew L. Cooper
Method for automatically producing optimal summaries of linear media

Publication number: 20030161396

Abstract: Optimal summaries of a linear media source are automatically produced by parameterizing a linear media source. The parameterized linear media source is used to create a similarity array in which each array element includes the value of a similarity measurement between a two portions of the parameterized media signal. A segment fitness function, adapted for measuring the similarity between a segment of the parameterized media signal and the entire parameterized media signal, is optimized to find an optimal segment location. The portion of the linear media source corresponding to the optimal segment location is selected as the optimal summary. This method produces optimal summaries of any type of linear media, such as video, audio, or text information.

Type: Application

Filed: February 28, 2002

Publication date: August 28, 2003

Inventors: Jonathan T. Foote, John Boreczky
Video production and compaction with collage picture frame user interface

Publication number: 20030095720

Abstract: A method, system, and apparatus for easily creating a video collage from a video is provided. By segmenting the video into a set number of video segments and providing an interface for a user to select images which represent the video segments and insert the selected images into a video collage template, a video collage may be easily created in a short amount of time. The system is designed to assign values to the video inserted in a video collage and compact the video based on these values thereby creating a small file which may be easily stored or transmitted.

Type: Application

Filed: November 16, 2001

Publication date: May 22, 2003

Inventors: Patrick Chiu, Shingo Uchihashi, John S. Boreczky, Jonathan T. Foote, Andreas Girgensohn, Lynn D. Wilcox
Systems and methods for the automatic extraction of audio excerpts

Publication number: 20030083871

Abstract: A method of extracting audio excerpts comprises: segmenting audio data into a plurality of audio data segments; setting a fitness criteria for the plurality of audio data segments; analyzing the plurality of audio data segments based on the fitness criteria; and selecting one of the plurality of audio data segments that satisfies the fitness criteria. In various exemplary embodiments, the method of extracting audio excerpts further comprises associating the selected one of the plurality of audio data segments with video data. In such embodiments, associating the selected one of the plurality of audio data segments with video data may comprise associating the selected one of the plurality of audio data segments with a keyframe.

Type: Application

Filed: November 1, 2001

Publication date: May 1, 2003

Applicant: Fuji Xerox Co., LTD.

Inventors: Jonathan T. Foote, Matthew L. Cooper, Lynn D. Wilcox
Systems and methods for providing a spatially indexed panoramic video

Publication number: 20030063133

Abstract: Systems and methods generate a video for virtual reality wherein the video is both panoramic and spatially indexed. In embodiments, a video system includes a controller, a database including spatial data, and a user interface in which a video is rendered in response to a specified action. The video includes a plurality of images retrieved from the database. Each of the images is panoramic and spatially indexed in accordance with a predetermined position along a virtual path in a virtual environment.

Type: Application

Filed: April 5, 2002

Publication date: April 3, 2003

Applicant: FUJI XEROX CO., LTD.

Inventors: Jonathan T. Foote, Donald G. Kimber
Automatic video summarization using a measure of shot importance and a frame-packing method

Patent number: 6535639

Abstract: A measure of importance is calculated for segmented parts of a video. The segmented parts are determined by segmenting the video into component shots and then merging by iteration the component shots based on similarity or other factors. Segmentation may also be determined by clustering frames of the video, and creating segments from the same cluster ID. The measure of importance is calculated based on a normalized weight of each segment and on length and rarity of each shot/segmented part. The importance measure may be utilized to generate a video summary by selecting the most important segments and generating representative frames for the selected segments. A thresholding process is applied to the importance score to provide a predetermined number or an appropriate number generated on the fly of shots or segments to be represented by frames. The representative frames are then packed into the video summary.

Type: Grant

Filed: March 12, 1999

Date of Patent: March 18, 2003

Assignees: Fuji Xerox Co., Ltd., Xerox Corporation

Inventors: Shingo Uchihachi, Jonathan T. Foote, Lynn Wilcox
Systems and methods for the automatic segmentation and clustering of ordered information

Publication number: 20030048946

Abstract: Techniques segmenting ordered information such as audio, video and text are provided by windowing and parameterizing an ordered information stream and storing of the parameterized and windowed information into a two-dimensional representation such as a matrix. The similarity between the parameter vectors is determined and an orthogonal matrix decomposition such as singular value decomposition is applied to the similarity matrix. The singular values or eigenvalues of the resulting decomposition indicate major components or segments of the ordered information. The boundaries of the major components may be determined using the determined singular vectors to provide, for example, smart cut-and-paste of ordered information in which boundaries are automatically identified by the singular vectors; automatic categorization and retrieval of ordered information and automatic summarization of ordered information.

Type: Application

Filed: September 7, 2001

Publication date: March 13, 2003

Applicant: Fuji Xerox Co., LTD.

Inventors: Jonathan T. Foote, Matthew Cooper
Systems and methods for embedding data by dimensional compression and expansion

Publication number: 20020172395

Abstract: The systems and methods of this invention watermark an original data file using dimensional compression and expansion. The original data file extends along a given dimension and has portions that extend along that given dimension. The information is embedded into the data file by selectively dimensionally compressing or expanding a size of each of some or all of the portions along the given dimension, which can be space or time. The portions of the data file are selectively dimensionally expanded or compressed according to a given encoding scheme. This encoding scheme can use the kind of modification, the relationships between the type of modification between adjacent portions, or the duration or degree of compression or expansion to store a portion of the embedded information. The portions of the embedded information can be individual bits of binary or trinary information, or can be a portion of analog information.

Type: Application

Filed: March 25, 2002

Publication date: November 21, 2002

Applicant: FUJI XEROX CO., LTD.

Inventors: Jonathan T. Foote, John E. Adcock
Method and system for compensating for parallax in multiple camera systems

Publication number: 20020122113

Abstract: A camera array captures plural component images which are combined into a single scene. In one embodiment, each camera of the array is a fixed digital camera. The images from each camera are warped to a common coordinate system and the disparity between overlapping images is reduced using disparity estimation techniques.

Type: Application

Filed: November 20, 2001

Publication date: September 5, 2002

Inventor: Jonathan T. Foote
Methods and apparatuses for segmenting an audio-visual recording using image similarity searching and audio speaker recognition

Patent number: 6404925

Abstract: Methods for segmenting audio-video recording of meetings containing slide presentations by one or more speakers are described. These segments serve as indexes into the recorded meeting. If an agenda is provided for the meeting, these segments can be labeled using information from the agenda. The system automatically detects intervals of video that correspond to presentation slides. Under the assumption that only one person is speaking during an interval when slides are displayed in the video, possible speaker intervals are extracted from the audio soundtrack by finding these regions. Since the same speaker may talk across multiple slide intervals, the acoustic data from these intervals is clustered to yield an estimate of the number of distinct speakers and their order. Clustering the audio data from these intervals yields an estimate of the number of different speakers and their order.

Type: Grant

Filed: March 11, 1999

Date of Patent: June 11, 2002

Assignees: Fuji Xerox Co., Ltd., Xerox Corporation

Inventors: Jonathan T. Foote, Lynn Wilcox
Media browser using multimodal analysis

Patent number: 6366296

Abstract: A media browser, graphical user interface and method for browsing a media file wherein a user selects at least one feature in a media file and is provided with information regarding the existence of the selected feature in the media file. Based on the information, the user can identify and playback portions of interest in a media file. Features in a media file, such as a speaker's identity, applause, silence, motion, or video cuts, are preferably automatically time-wise evaluated in the media file using known methods. Metadata generated based on the time-wise feature evaluation are preferably mapped to confidence score values that represent a probability of a corresponding feature's existence in the media file. Confidence score information is preferably presented graphically to a user as part of a graphical user interface, and is used to interactively browse the media file.

Type: Grant

Filed: September 11, 1998

Date of Patent: April 2, 2002

Assignees: Xerox Corporation, Fuji Xerox Co., Ltd.

Inventors: John S. Boreczky, Andreas Girgensohn, Jonathan T. Foote
METHODS AND APPARATUSES FOR VIDEO SEGMENTATION, CLASSIFICATION, AND RETRIEVAL USING IMAGE CLASS STATISTICAL MODELS

Publication number: 20020028021

Abstract: Techniques for classifying video frames using statistical models of transform coefficients are disclosed. After optionally being decimated in time and space, image frames are transformed using a discrete cosine transform or Hadamard transform. The methods disclosed model image composition and operate on grayscale images. The resulting transform matrices are reduced using truncation, principal component analysis, or linear discriminant analysis to produce feature vectors. Feature vectors of training images for image classes are used to compute image class statistical models. Once image class statistical models are derived, individual frames are classified by the maximum likelihood resulting from the image class statistical models. Thus, the probabilities that a feature vector derived from a frame would be produced from each of the image class statistical models are computed.

Type: Application

Filed: March 11, 1999

Publication date: March 7, 2002

Inventors: JONATHAN T. FOOTE, LYNN WILCOX, ANDREAS GIRGENSOHN
Tactile computer interface

Patent number: 6219034

Abstract: A computer system includes a computer processor, an operating system operative in connection with the computer processor, and a display responsive to the operating system. The system also has a pointing device that includes a position sensor and a tactile actuator. A pointing device driver is responsive to the position sensor, and the tactile actuator is responsive to the pointing device driver. A general-purpose application is responsive to the pointing device driver and to the operating system and in communication with the display, and the pointing device driver is also responsive to the general purpose application. The system further includes a profile that maps region changes associated with material displayed on the screen to tactile signals to be sent to the tactile actuator.

Type: Grant

Filed: February 23, 1998

Date of Patent: April 17, 2001

Inventors: Kristofer E. Elbing, Jonathan T. Foote

prev 1 2 3