Patents by Inventor Shih-Fu Chang

Shih-Fu Chang has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

SYSTEMS AND METHODS FOR MOBILE SEARCH USING BAG OF HASH BITS AND BOUNDARY RERANKING

Publication number: 20130254191

Abstract: Determining ranked candidate media in response to media query data corresponding to a query media includes receiving the media query data including feature data of the query media, coordinate data, and boundary data, matching the features with corresponding features of an media database using the feature data to identify features in the media database within a predetermined hamming distance in a hash table from the corresponding features of the query media to obtain matched features in the media database, determining candidate media whose number of matched features exceeds a matched feature threshold, generating a geometry similarity score between the query media and each candidate media using the feature data and the coordinate data, generating a boundary similarity score between the query media and each candidate media using the boundary data, ranking the candidate media based on the numbers of matched features, the geometry similarity scores and the boundary similarity scores.

Type: Application

Filed: December 6, 2012

Publication date: September 26, 2013

Applicant: THE TRUSTEES OF COLUMBIA UNIVERSITY IN THE CITY OF NEW YORK

Inventors: Junfeng He, Shih-Fu Chang, Tai-Hsu Lin
System and method for extracting text captions from video and generating video summaries

Patent number: 8488682

Abstract: Caption boxes which are embedded in video content can be located and the text within the caption boxes decoded. Real time processing is enhanced by locating caption box regions in the compressed video domain and performing pixel based processing operations within the region of the video frame in which a caption box is located. The captions boxes are further refined by identifying word regions within the caption boxes and then applying character and word recognition processing to the identified word regions. Domain based models are used to improve text recognition results. The extracted caption box text can be used to detect events of interest in the video content and a semantic model applied to extract a segment of video of the event of interest.

Type: Grant

Filed: December 19, 2007

Date of Patent: July 16, 2013

Assignee: The Trustees of Columbia University in the City of New York

Inventors: Shih-Fu Chang, Dongqing Zhang
SYSTEMS AND METHODS FOR IDENTIFICATION OF FLUID AND SUBSTRATE COMPOSITION OR PHYSICO-CHEMICAL PROPERTIES

Publication number: 20130073221

Abstract: Techniques for identifying a composition of a target fluid using a set of vectors representing known residue patterns for a two or more fluids including said target fluid is provided. An exemplary method includes storing one or more digital measurements of residue for the target fluid, extracting one or more descriptive features from the measurements; and processing descriptive features to identify the composition of the target fluid. The processing includes using a machine learning algorithm trained with data linking residue morphology to fluid composition. A distance between a vector representing said one or more descriptive features and said set of vectors representing known residue patterns is determined, and a residue is assigned to one or more of the known residue patterns.

Type: Application

Filed: September 14, 2012

Publication date: March 21, 2013

Inventors: Daniel Attinger, Frederic Zenhausern, Cedric Hurth, Shih-Fu Chang, Zhenguo Li
Video description system and method

Patent number: 8370869

Abstract: Systems and methods for describing video content establish video description records which include an object set (24), an object hierarchy (26) and entity relation graphs (28). Video objects can include global objects, segment objects and local objects. The video objects are further defined by a number of features organized in classes, which in turn are further defined by a number of feature descriptors (36, 38, and 40). The relationships (44) between and among the objects in the object set (24) are defined by the object hierarchy (26) and entity relation graphs (28). The video description records provide a standard vehicle for describing the content and context of video information for subsequent access and processing by computer applications such as search engines, filters and archive systems.

Type: Grant

Filed: June 6, 2006

Date of Patent: February 5, 2013

Assignee: The Trustees of Columbia University in the City of New York

Inventors: Seungyup Paek, Ana Benitez, Shih-Fu Chang, Atul Puri, Qian Huang, Chung-Sheng Li, John R. Smith, Lawrence Bergman
System and method for dynamically and interactively searching media data

Patent number: 8364673

Abstract: Systems and methods for searching a database of media content wherein the user can dynamically and interactively perform searches and navigate search results. One or more search anchors are received, and at least one of the search anchors is associated with an anchor cell on a navigation map. One or more documents assigned to at least one cell on the navigation map can be determined, and the cells are populated with search results based at least in part on the search anchors. At least one of the documents is then displayed to a user.

Type: Grant

Filed: December 15, 2010

Date of Patent: January 29, 2013

Inventors: Shih-Fu Chang, Eric Zavesky
Method and system for optimal video transcoding based on utility function descriptors

Patent number: 8218617

Abstract: Techniques for generating utility-based descriptors from compressed multimedia information are disclosed. A preferred method includes the steps of receiving least a segment of compressed multimedia information, determining two or more portions of utility based descriptor information based on one or more adaptation operations, each corresponding to a unique target rate, adapting the compressed multimedia segment by each the portions of utility based descriptor information to generate adapted multimedia segments, using a quality management method to generate measurement for each adapted multimedia segment, and generating a utility based descriptors based on the portions of utility based descriptor information and corresponding quality measurements.

Type: Grant

Filed: October 14, 2004

Date of Patent: July 10, 2012

Assignee: The Trustees of Columbia University in the City of New York

Inventors: Jae-Gon Kim, Yong Wang, Shih-Fu Chang, Kyeongok Kang, Jinwoong Kim
RAPID IMAGE ANNOTATION VIA BRAIN STATE DECODING AND VISUAL PATTERN MINING

Publication number: 20120089552

Abstract: Human visual perception is able to recognize a wide range of targets but has limited throughput. Machine vision can process images at a high speed but suffers from inadequate recognition accuracy of general target classes. Systems and methods are provided that combine the strengths of both systems and improve upon existing multimedia processing systems and methods to provide enhanced multimedia labeling, categorization, searching, and navigation.

Type: Application

Filed: August 8, 2011

Publication date: April 12, 2012

Inventors: Shih-Fu Chang, Jun Wang, Paul Sajda, Eric Pohlmeyer, Barbara Hanna, David Jangraw
Video concept classification using audio-visual atoms

Patent number: 8135221

Abstract: A method for determining a classification for a video segment, comprising the steps of: breaking the video segment into a plurality of short-term video slices, each including a plurality of video frames and an audio signal; analyzing the video frames for each short-term video slice to form a plurality of region tracks; analyzing each region track to form a visual feature vector and a motion feature vector; analyzing the audio signal for each short-term video slice to determine an audio feature vector; forming a plurality of short-term audio-visual atoms for each short-term video slice by combining the visual feature vector and the motion feature vector for a particular region track with the corresponding audio feature vector; and using a classifier to determine a classification for the video segment responsive to the short-term audio-visual atoms.

Type: Grant

Filed: October 7, 2009

Date of Patent: March 13, 2012

Assignees: Eastman Kodak Company, Columbia University

Inventors: Wei Jiang, Courtenay Cotton, Shih-Fu Chang, Daniel P. Ellis, Alexander C. Loui
System And Method For Annotating And Searching Media

Publication number: 20110314367

Abstract: A system and method for labeling and classifying multimedia data is provided that includes novel label propagation techniques and classification function characteristics. The system and method corrects and propagates a small number of potentially erroneous labels to a large amount of multimedia data and generate optimal ways of ranking, classification, and presentation of the data sets. The disclosed systems and methods improve upon prior systems and methods and provide an improved approach to the problems of imbalanced data sets and incorrect label data.

Type: Application

Filed: June 21, 2011

Publication date: December 22, 2011

Applicant: THE TRUSTEES OF COLUMBIA UNIVERSITY IN THE CITY OF NEW YORK

Inventors: Shih-Fu Chang, Jun Wang, Tony Jebara
METHODS AND ARCHITECTURE FOR INDEXING AND EDITING COMPRESSED VIDEO OVER THE WORLD WIDE WEB

Publication number: 20110255605

Abstract: A system and method is provided for editing and parsing compressed digital information. The compressed digital information may include visual information which is edited and parsed in the compressed domain. In a preferred embodiment, the present invention provides a method for detecting moving objects in a compressed digital bitstream which represents a sequence of fields or frames of video information for one or more captured scenes of video.

Type: Application

Filed: April 1, 2011

Publication date: October 20, 2011

Inventors: Shih-Fu CHANG, Horace J. MENG
MULTIMEDIA INTEGRATION DESCRIPTION SCHEME, METHOD AND SYSTEM FOR MPEG-7

Publication number: 20110258189

Abstract: The invention provides a system and method for integrating multimedia descriptions in a way that allows humans, software components or devices to easily identify, represent, manage, retrieve, and categorize the multimedia content. In this manner, a user who may be interested in locating a specific piece of multimedia content from a database, Internet, or broadcast media, for example, may search for and find the multimedia content. In this regard, the invention provides a system and method that receives multimedia content and separates the multimedia content into separate components which are assigned to multimedia categories, such as image, video, audio, synthetic and text. Within each of the multimedia categories, the multimedia content is classified and descriptions of the multimedia content are generated. The descriptions are then formatted, integrated, using a multimedia integration description scheme, and the multimedia integration description is generated for the multimedia content.

Type: Application

Filed: June 27, 2011

Publication date: October 20, 2011

Applicants: Columbia University in the City of New York, AT&T Intellectual Property II, L.P.

Inventors: Ana Belen Benitez, Shih-Fu Chang, Qian Huang, Seungyup Paek, Atul Puri
Multimedia integration description scheme, method and system for MPEG-7

Patent number: 7970822

Abstract: The invention provides a system and method for integrating multimedia descriptions in a way that allows humans, software components or devices to easily identify, represent, manage, retrieve, and categorize the multimedia content. In this manner, a user who may be interested in locating a specific piece of multimedia content from a database, Internet, or broadcast media, for example, may search for and find the multimedia content. In this regard, the invention provides a system and method that receives multimedia content and separates the multimedia content into separate components which are assigned to multimedia categories, such as image, video, audio, synthetic and text. Within each of the multimedia categories, the multimedia content is classified and descriptions of the multimedia content are generated. The descriptions are then formatted, integrated, using a multimedia integration description scheme, and the multimedia integration description is generated for the multimedia content.

Type: Grant

Filed: February 17, 2009

Date of Patent: June 28, 2011

Assignees: AT&T Intellectual Property II, L.P., Columbia University in the City of New York

Inventors: Ana Belen Benitez, Shih-Fu Chang, Qian Huang, Seungyup Paek, Atul Puri
SYSTEM AND METHOD FOR DYNAMICALLY AND INTERACTIVELY SEARCHING MEDIA DATA

Publication number: 20110145232

Abstract: The disclosed subject matter is directed to systems and methods for searching a database of media content wherein the user can dynamically and interactively perform searches and navigate search results. One or more search anchors are received, and at least one of the search anchors is associated with an anchor cell on a navigation map. One or more documents assigned to at least one cell on the navigation map can be determined, and the cells are populated with search results based at least in part on the search anchors. At least one of the documents is then displayed to a user.

Type: Application

Filed: December 15, 2010

Publication date: June 16, 2011

Applicant: The Trustees of Columbia University In The City Of New York

Inventors: Shih-Fu Chang, Eric Zavesky
VIDEO CONCEPT CLASSIFICATION USING AUDIO-VISUAL ATOMS

Publication number: 20110081082

Abstract: A method for determining a classification for a video segment, comprising the steps of: breaking the video segment into a plurality of short-term video slices, each including a plurality of video frames and an audio signal; analyzing the video frames for each short-term video slice to form a plurality of region tracks; analyzing each region track to form a visual feature vector and a motion feature vector; analyzing the audio signal for each short-term video slice to determine an audio feature vector; forming a plurality of short-term audio-visual atoms for each short-term video slice by combining the visual feature vector and the motion feature vector for a particular region track with the corresponding audio feature vector; and using a classifier to determine a classification for the video segment responsive to the short-term audio-visual atoms.

Type: Application

Filed: October 7, 2009

Publication date: April 7, 2011

Inventors: Wei Jiang, Courtenay Cotton, Shih-Fu Chang, Daniel P. Ellis, Alexander C. Loui
METHODS AND ARCHITECTURE FOR INDEXING AND EDITING COMPRESSED VIDEO OVER THE WORLD WIDE WEB

Publication number: 20110064136

Abstract: A system and method is provided for editing and parsing compressed digital information. The compressed digital information may include visual information which is edited and parsed in the compressed domain. In a preferred embodiment, the present invention provides a method for detecting moving objects in a compressed digital bitstream which represents a sequence of fields or frames of video information for one or more captured scenes of video.

Type: Application

Filed: September 2, 2010

Publication date: March 17, 2011

Inventors: Shih-Fu Chang, Horace J. Meng
SYSTEMS AND METHODS FOR IMAGE ARCHEOLOGY

Publication number: 20110025710

Abstract: Systems and methods are described for determining manipulation history among a plurality of images. The described techniques include selecting a pair of images from the plurality of images, detecting one or more manipulations operable to transform one of the images to the other, and based on the manipulations detected, determining a parent-child relationship between the pair or pairs of images. The described techniques can further include repeating the selecting two images, detecting manipulations, and determining the parent-child relationship for each pairs of images in the plurality of images, constructing a visual migration map for the images, and presenting the visual migration map in a user readable format.

Type: Application

Filed: August 23, 2010

Publication date: February 3, 2011

Applicant: The Trustees of Columbia University In The City Of New York

Inventors: Lyndon Kennedy, Shih-Fu Chang
Methods and architecture for indexing and editing compressed video over the world wide web

Patent number: 7817722

Abstract: A system and method is provided for editing and parsing compressed digital information. The compressed digital information may include visual information which is edited and parsed in the compressed domain. In a preferred embodiment, the present invention provides a method for detecting moving objects in a compressed digital bitstream which represents a sequence of fields or frames of video information for one or more captured scenes of video.

Type: Grant

Filed: December 4, 2003

Date of Patent: October 19, 2010

Assignee: The Trustees of Columbia University in the City of New York

Inventors: Shih-Fu Chang, Horace J. Meng
Multimedia integration description scheme, method and system for MPEG-7

Patent number: 7809760

Abstract: The invention provides a system and method for integrating multimedia descriptions in a way that allows humans, software components or devices to easily identify, represent, manage, retrieve, and categorize the multimedia content. In this manner, a user who may be interested in locating a specific piece of multimedia content from a database, Internet, or broadcast media, for example, may search for and find the multimedia content. In this regard, the invention provides a system and method that receives multimedia content and separates the multimedia content into separate components which are assigned to multimedia categories, such as image, video, audio, synthetic and text. Within each of the multimedia categories, the multimedia content is classified and descriptions of the multimedia content are generated. The descriptions are then formatted, integrated, using a multimedia integration description scheme, and the multimedia integration description is generated for the multimedia content.

Type: Grant

Filed: September 15, 2009

Date of Patent: October 5, 2010

Assignees: AT&T Intellectual Property II, L.P., Columbia University in the City of New York

Inventors: Ana Benitez, Shih-Fu Chang, Qian Huang, Seungyup Paek, Atul Puri
Active context-based concept fusion

Patent number: 7720851

Abstract: A context-based concept fusion method detects a first concept in an image record. The method includes automatically determining at least one other concept in the image record which has a contextual relationship with the first concept and which is to be labeled by a user of the method; and labeling the at least one other concept by the user with a ground truth label to be used in the context-based concept fusion method to improve detection of the first concept in the image record.

Type: Grant

Filed: December 22, 2006

Date of Patent: May 18, 2010

Assignee: Eastman Kodak Company

Inventors: Shih-Fu Chang, Wei Jiang, Alexander C. Loui
Systems and methods for interoperable multimedia content descriptions

Patent number: 7653635

Abstract: Systems and methods for generating standard description records from multimedia information are provided. The system includes at least one multimedia information input interface (180) receiving multimedia information, a computer processor, and a data storage system (150), operatively coupled to said processor, for storing said at least one description record. The processor performs object extraction processing to generate multimedia object descriptions (200, 201, 205) from the multimedia information, and object hierarchy processing (410, 420) to generate multimedia object hierarchy descriptions, to generate at least one description record including the multimedia object descriptions (200, 201, 205) and multimedia object hierarchy descriptions for content embedded within the multimedia information.

Type: Grant

Filed: November 5, 1999

Date of Patent: January 26, 2010

Assignee: The Trustees of Columbia University in the City of New York

Inventors: Seungup Paek, Ana Benitez, Shih-Fu Chang

prev 1 2 3 4 next