Patents by Inventor Shih-Fu Chang
Shih-Fu Chang has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20130254191Abstract: Determining ranked candidate media in response to media query data corresponding to a query media includes receiving the media query data including feature data of the query media, coordinate data, and boundary data, matching the features with corresponding features of an media database using the feature data to identify features in the media database within a predetermined hamming distance in a hash table from the corresponding features of the query media to obtain matched features in the media database, determining candidate media whose number of matched features exceeds a matched feature threshold, generating a geometry similarity score between the query media and each candidate media using the feature data and the coordinate data, generating a boundary similarity score between the query media and each candidate media using the boundary data, ranking the candidate media based on the numbers of matched features, the geometry similarity scores and the boundary similarity scores.Type: ApplicationFiled: December 6, 2012Publication date: September 26, 2013Applicant: THE TRUSTEES OF COLUMBIA UNIVERSITY IN THE CITY OF NEW YORKInventors: Junfeng He, Shih-Fu Chang, Tai-Hsu Lin
-
Patent number: 8488682Abstract: Caption boxes which are embedded in video content can be located and the text within the caption boxes decoded. Real time processing is enhanced by locating caption box regions in the compressed video domain and performing pixel based processing operations within the region of the video frame in which a caption box is located. The captions boxes are further refined by identifying word regions within the caption boxes and then applying character and word recognition processing to the identified word regions. Domain based models are used to improve text recognition results. The extracted caption box text can be used to detect events of interest in the video content and a semantic model applied to extract a segment of video of the event of interest.Type: GrantFiled: December 19, 2007Date of Patent: July 16, 2013Assignee: The Trustees of Columbia University in the City of New YorkInventors: Shih-Fu Chang, Dongqing Zhang
-
Publication number: 20130073221Abstract: Techniques for identifying a composition of a target fluid using a set of vectors representing known residue patterns for a two or more fluids including said target fluid is provided. An exemplary method includes storing one or more digital measurements of residue for the target fluid, extracting one or more descriptive features from the measurements; and processing descriptive features to identify the composition of the target fluid. The processing includes using a machine learning algorithm trained with data linking residue morphology to fluid composition. A distance between a vector representing said one or more descriptive features and said set of vectors representing known residue patterns is determined, and a residue is assigned to one or more of the known residue patterns.Type: ApplicationFiled: September 14, 2012Publication date: March 21, 2013Inventors: Daniel Attinger, Frederic Zenhausern, Cedric Hurth, Shih-Fu Chang, Zhenguo Li
-
Patent number: 8370869Abstract: Systems and methods for describing video content establish video description records which include an object set (24), an object hierarchy (26) and entity relation graphs (28). Video objects can include global objects, segment objects and local objects. The video objects are further defined by a number of features organized in classes, which in turn are further defined by a number of feature descriptors (36, 38, and 40). The relationships (44) between and among the objects in the object set (24) are defined by the object hierarchy (26) and entity relation graphs (28). The video description records provide a standard vehicle for describing the content and context of video information for subsequent access and processing by computer applications such as search engines, filters and archive systems.Type: GrantFiled: June 6, 2006Date of Patent: February 5, 2013Assignee: The Trustees of Columbia University in the City of New YorkInventors: Seungyup Paek, Ana Benitez, Shih-Fu Chang, Atul Puri, Qian Huang, Chung-Sheng Li, John R. Smith, Lawrence Bergman
-
Patent number: 8364673Abstract: Systems and methods for searching a database of media content wherein the user can dynamically and interactively perform searches and navigate search results. One or more search anchors are received, and at least one of the search anchors is associated with an anchor cell on a navigation map. One or more documents assigned to at least one cell on the navigation map can be determined, and the cells are populated with search results based at least in part on the search anchors. At least one of the documents is then displayed to a user.Type: GrantFiled: December 15, 2010Date of Patent: January 29, 2013Inventors: Shih-Fu Chang, Eric Zavesky
-
Patent number: 8218617Abstract: Techniques for generating utility-based descriptors from compressed multimedia information are disclosed. A preferred method includes the steps of receiving least a segment of compressed multimedia information, determining two or more portions of utility based descriptor information based on one or more adaptation operations, each corresponding to a unique target rate, adapting the compressed multimedia segment by each the portions of utility based descriptor information to generate adapted multimedia segments, using a quality management method to generate measurement for each adapted multimedia segment, and generating a utility based descriptors based on the portions of utility based descriptor information and corresponding quality measurements.Type: GrantFiled: October 14, 2004Date of Patent: July 10, 2012Assignee: The Trustees of Columbia University in the City of New YorkInventors: Jae-Gon Kim, Yong Wang, Shih-Fu Chang, Kyeongok Kang, Jinwoong Kim
-
Publication number: 20120089552Abstract: Human visual perception is able to recognize a wide range of targets but has limited throughput. Machine vision can process images at a high speed but suffers from inadequate recognition accuracy of general target classes. Systems and methods are provided that combine the strengths of both systems and improve upon existing multimedia processing systems and methods to provide enhanced multimedia labeling, categorization, searching, and navigation.Type: ApplicationFiled: August 8, 2011Publication date: April 12, 2012Inventors: Shih-Fu Chang, Jun Wang, Paul Sajda, Eric Pohlmeyer, Barbara Hanna, David Jangraw
-
Patent number: 8135221Abstract: A method for determining a classification for a video segment, comprising the steps of: breaking the video segment into a plurality of short-term video slices, each including a plurality of video frames and an audio signal; analyzing the video frames for each short-term video slice to form a plurality of region tracks; analyzing each region track to form a visual feature vector and a motion feature vector; analyzing the audio signal for each short-term video slice to determine an audio feature vector; forming a plurality of short-term audio-visual atoms for each short-term video slice by combining the visual feature vector and the motion feature vector for a particular region track with the corresponding audio feature vector; and using a classifier to determine a classification for the video segment responsive to the short-term audio-visual atoms.Type: GrantFiled: October 7, 2009Date of Patent: March 13, 2012Assignees: Eastman Kodak Company, Columbia UniversityInventors: Wei Jiang, Courtenay Cotton, Shih-Fu Chang, Daniel P. Ellis, Alexander C. Loui
-
Publication number: 20110314367Abstract: A system and method for labeling and classifying multimedia data is provided that includes novel label propagation techniques and classification function characteristics. The system and method corrects and propagates a small number of potentially erroneous labels to a large amount of multimedia data and generate optimal ways of ranking, classification, and presentation of the data sets. The disclosed systems and methods improve upon prior systems and methods and provide an improved approach to the problems of imbalanced data sets and incorrect label data.Type: ApplicationFiled: June 21, 2011Publication date: December 22, 2011Applicant: THE TRUSTEES OF COLUMBIA UNIVERSITY IN THE CITY OF NEW YORKInventors: Shih-Fu Chang, Jun Wang, Tony Jebara
-
Publication number: 20110255605Abstract: A system and method is provided for editing and parsing compressed digital information. The compressed digital information may include visual information which is edited and parsed in the compressed domain. In a preferred embodiment, the present invention provides a method for detecting moving objects in a compressed digital bitstream which represents a sequence of fields or frames of video information for one or more captured scenes of video.Type: ApplicationFiled: April 1, 2011Publication date: October 20, 2011Inventors: Shih-Fu CHANG, Horace J. MENG
-
Publication number: 20110258189Abstract: The invention provides a system and method for integrating multimedia descriptions in a way that allows humans, software components or devices to easily identify, represent, manage, retrieve, and categorize the multimedia content. In this manner, a user who may be interested in locating a specific piece of multimedia content from a database, Internet, or broadcast media, for example, may search for and find the multimedia content. In this regard, the invention provides a system and method that receives multimedia content and separates the multimedia content into separate components which are assigned to multimedia categories, such as image, video, audio, synthetic and text. Within each of the multimedia categories, the multimedia content is classified and descriptions of the multimedia content are generated. The descriptions are then formatted, integrated, using a multimedia integration description scheme, and the multimedia integration description is generated for the multimedia content.Type: ApplicationFiled: June 27, 2011Publication date: October 20, 2011Applicants: Columbia University in the City of New York, AT&T Intellectual Property II, L.P.Inventors: Ana Belen Benitez, Shih-Fu Chang, Qian Huang, Seungyup Paek, Atul Puri
-
Patent number: 7970822Abstract: The invention provides a system and method for integrating multimedia descriptions in a way that allows humans, software components or devices to easily identify, represent, manage, retrieve, and categorize the multimedia content. In this manner, a user who may be interested in locating a specific piece of multimedia content from a database, Internet, or broadcast media, for example, may search for and find the multimedia content. In this regard, the invention provides a system and method that receives multimedia content and separates the multimedia content into separate components which are assigned to multimedia categories, such as image, video, audio, synthetic and text. Within each of the multimedia categories, the multimedia content is classified and descriptions of the multimedia content are generated. The descriptions are then formatted, integrated, using a multimedia integration description scheme, and the multimedia integration description is generated for the multimedia content.Type: GrantFiled: February 17, 2009Date of Patent: June 28, 2011Assignees: AT&T Intellectual Property II, L.P., Columbia University in the City of New YorkInventors: Ana Belen Benitez, Shih-Fu Chang, Qian Huang, Seungyup Paek, Atul Puri
-
Publication number: 20110145232Abstract: The disclosed subject matter is directed to systems and methods for searching a database of media content wherein the user can dynamically and interactively perform searches and navigate search results. One or more search anchors are received, and at least one of the search anchors is associated with an anchor cell on a navigation map. One or more documents assigned to at least one cell on the navigation map can be determined, and the cells are populated with search results based at least in part on the search anchors. At least one of the documents is then displayed to a user.Type: ApplicationFiled: December 15, 2010Publication date: June 16, 2011Applicant: The Trustees of Columbia University In The City Of New YorkInventors: Shih-Fu Chang, Eric Zavesky
-
Publication number: 20110081082Abstract: A method for determining a classification for a video segment, comprising the steps of: breaking the video segment into a plurality of short-term video slices, each including a plurality of video frames and an audio signal; analyzing the video frames for each short-term video slice to form a plurality of region tracks; analyzing each region track to form a visual feature vector and a motion feature vector; analyzing the audio signal for each short-term video slice to determine an audio feature vector; forming a plurality of short-term audio-visual atoms for each short-term video slice by combining the visual feature vector and the motion feature vector for a particular region track with the corresponding audio feature vector; and using a classifier to determine a classification for the video segment responsive to the short-term audio-visual atoms.Type: ApplicationFiled: October 7, 2009Publication date: April 7, 2011Inventors: Wei Jiang, Courtenay Cotton, Shih-Fu Chang, Daniel P. Ellis, Alexander C. Loui
-
Publication number: 20110064136Abstract: A system and method is provided for editing and parsing compressed digital information. The compressed digital information may include visual information which is edited and parsed in the compressed domain. In a preferred embodiment, the present invention provides a method for detecting moving objects in a compressed digital bitstream which represents a sequence of fields or frames of video information for one or more captured scenes of video.Type: ApplicationFiled: September 2, 2010Publication date: March 17, 2011Inventors: Shih-Fu Chang, Horace J. Meng
-
Publication number: 20110025710Abstract: Systems and methods are described for determining manipulation history among a plurality of images. The described techniques include selecting a pair of images from the plurality of images, detecting one or more manipulations operable to transform one of the images to the other, and based on the manipulations detected, determining a parent-child relationship between the pair or pairs of images. The described techniques can further include repeating the selecting two images, detecting manipulations, and determining the parent-child relationship for each pairs of images in the plurality of images, constructing a visual migration map for the images, and presenting the visual migration map in a user readable format.Type: ApplicationFiled: August 23, 2010Publication date: February 3, 2011Applicant: The Trustees of Columbia University In The City Of New YorkInventors: Lyndon Kennedy, Shih-Fu Chang
-
Patent number: 7817722Abstract: A system and method is provided for editing and parsing compressed digital information. The compressed digital information may include visual information which is edited and parsed in the compressed domain. In a preferred embodiment, the present invention provides a method for detecting moving objects in a compressed digital bitstream which represents a sequence of fields or frames of video information for one or more captured scenes of video.Type: GrantFiled: December 4, 2003Date of Patent: October 19, 2010Assignee: The Trustees of Columbia University in the City of New YorkInventors: Shih-Fu Chang, Horace J. Meng
-
Patent number: 7809760Abstract: The invention provides a system and method for integrating multimedia descriptions in a way that allows humans, software components or devices to easily identify, represent, manage, retrieve, and categorize the multimedia content. In this manner, a user who may be interested in locating a specific piece of multimedia content from a database, Internet, or broadcast media, for example, may search for and find the multimedia content. In this regard, the invention provides a system and method that receives multimedia content and separates the multimedia content into separate components which are assigned to multimedia categories, such as image, video, audio, synthetic and text. Within each of the multimedia categories, the multimedia content is classified and descriptions of the multimedia content are generated. The descriptions are then formatted, integrated, using a multimedia integration description scheme, and the multimedia integration description is generated for the multimedia content.Type: GrantFiled: September 15, 2009Date of Patent: October 5, 2010Assignees: AT&T Intellectual Property II, L.P., Columbia University in the City of New YorkInventors: Ana Benitez, Shih-Fu Chang, Qian Huang, Seungyup Paek, Atul Puri
-
Patent number: 7720851Abstract: A context-based concept fusion method detects a first concept in an image record. The method includes automatically determining at least one other concept in the image record which has a contextual relationship with the first concept and which is to be labeled by a user of the method; and labeling the at least one other concept by the user with a ground truth label to be used in the context-based concept fusion method to improve detection of the first concept in the image record.Type: GrantFiled: December 22, 2006Date of Patent: May 18, 2010Assignee: Eastman Kodak CompanyInventors: Shih-Fu Chang, Wei Jiang, Alexander C. Loui
-
Patent number: 7653635Abstract: Systems and methods for generating standard description records from multimedia information are provided. The system includes at least one multimedia information input interface (180) receiving multimedia information, a computer processor, and a data storage system (150), operatively coupled to said processor, for storing said at least one description record. The processor performs object extraction processing to generate multimedia object descriptions (200, 201, 205) from the multimedia information, and object hierarchy processing (410, 420) to generate multimedia object hierarchy descriptions, to generate at least one description record including the multimedia object descriptions (200, 201, 205) and multimedia object hierarchy descriptions for content embedded within the multimedia information.Type: GrantFiled: November 5, 1999Date of Patent: January 26, 2010Assignee: The Trustees of Columbia University in the City of New YorkInventors: Seungup Paek, Ana Benitez, Shih-Fu Chang