Patents by Inventor Shih-Fu Chang

Shih-Fu Chang has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20130254191
    Abstract: Determining ranked candidate media in response to media query data corresponding to a query media includes receiving the media query data including feature data of the query media, coordinate data, and boundary data, matching the features with corresponding features of a media database using the feature data to identify features in the media database within a predetermined Hamming distance in a hash table from the corresponding features of the query media to obtain matched features in the media database, determining candidate media whose number of matched features exceeds a matched feature threshold, generating a geometry similarity score between the query media and each candidate media using the feature data and the coordinate data, generating a boundary similarity score between the query media and each candidate media using the boundary data, and ranking the candidate media based on the numbers of matched features, the geometry similarity scores and the boundary similarity scores.
    Type: Application
    Filed: December 6, 2012
    Publication date: September 26, 2013
    Applicant: THE TRUSTEES OF COLUMBIA UNIVERSITY IN THE CITY OF NEW YORK
    Inventors: Junfeng He, Shih-Fu Chang, Tai-Hsu Lin
  • Patent number: 8488682
    Abstract: Caption boxes which are embedded in video content can be located and the text within the caption boxes decoded. Real-time processing is enhanced by locating caption box regions in the compressed video domain and performing pixel-based processing operations within the region of the video frame in which a caption box is located. The caption boxes are further refined by identifying word regions within the caption boxes and then applying character and word recognition processing to the identified word regions. Domain-based models are used to improve text recognition results. The extracted caption box text can be used to detect events of interest in the video content and a semantic model applied to extract a segment of video of the event of interest.
    Type: Grant
    Filed: December 19, 2007
    Date of Patent: July 16, 2013
    Assignee: The Trustees of Columbia University in the City of New York
    Inventors: Shih-Fu Chang, Dongqing Zhang
  • Publication number: 20130073221
    Abstract: Techniques for identifying a composition of a target fluid using a set of vectors representing known residue patterns for two or more fluids including said target fluid are provided. An exemplary method includes storing one or more digital measurements of residue for the target fluid, extracting one or more descriptive features from the measurements, and processing the descriptive features to identify the composition of the target fluid. The processing includes using a machine learning algorithm trained with data linking residue morphology to fluid composition. A distance between a vector representing said one or more descriptive features and said set of vectors representing known residue patterns is determined, and a residue is assigned to one or more of the known residue patterns.
    Type: Application
    Filed: September 14, 2012
    Publication date: March 21, 2013
    Inventors: Daniel Attinger, Frederic Zenhausern, Cedric Hurth, Shih-Fu Chang, Zhenguo Li
  • Patent number: 8370869
    Abstract: Systems and methods for describing video content establish video description records which include an object set (24), an object hierarchy (26) and entity relation graphs (28). Video objects can include global objects, segment objects and local objects. The video objects are further defined by a number of features organized in classes, which in turn are further defined by a number of feature descriptors (36, 38, and 40). The relationships (44) between and among the objects in the object set (24) are defined by the object hierarchy (26) and entity relation graphs (28). The video description records provide a standard vehicle for describing the content and context of video information for subsequent access and processing by computer applications such as search engines, filters and archive systems.
    Type: Grant
    Filed: June 6, 2006
    Date of Patent: February 5, 2013
    Assignee: The Trustees of Columbia University in the City of New York
    Inventors: Seungyup Paek, Ana Benitez, Shih-Fu Chang, Atul Puri, Qian Huang, Chung-Sheng Li, John R. Smith, Lawrence Bergman
  • Patent number: 8364673
    Abstract: Systems and methods for searching a database of media content wherein the user can dynamically and interactively perform searches and navigate search results. One or more search anchors are received, and at least one of the search anchors is associated with an anchor cell on a navigation map. One or more documents assigned to at least one cell on the navigation map can be determined, and the cells are populated with search results based at least in part on the search anchors. At least one of the documents is then displayed to a user.
    Type: Grant
    Filed: December 15, 2010
    Date of Patent: January 29, 2013
    Inventors: Shih-Fu Chang, Eric Zavesky
  • Patent number: 8218617
    Abstract: Techniques for generating utility-based descriptors from compressed multimedia information are disclosed. A preferred method includes the steps of receiving at least a segment of compressed multimedia information, determining two or more portions of utility-based descriptor information based on one or more adaptation operations, each corresponding to a unique target rate, adapting the compressed multimedia segment by each of the portions of utility-based descriptor information to generate adapted multimedia segments, using a quality measurement method to generate a quality measurement for each adapted multimedia segment, and generating a utility-based descriptor based on the portions of utility-based descriptor information and corresponding quality measurements.
    Type: Grant
    Filed: October 14, 2004
    Date of Patent: July 10, 2012
    Assignee: The Trustees of Columbia University in the City of New York
    Inventors: Jae-Gon Kim, Yong Wang, Shih-Fu Chang, Kyeongok Kang, Jinwoong Kim
  • Publication number: 20120089552
    Abstract: Human visual perception is able to recognize a wide range of targets but has limited throughput. Machine vision can process images at a high speed but suffers from inadequate recognition accuracy of general target classes. Systems and methods are provided that combine the strengths of both systems and improve upon existing multimedia processing systems and methods to provide enhanced multimedia labeling, categorization, searching, and navigation.
    Type: Application
    Filed: August 8, 2011
    Publication date: April 12, 2012
    Inventors: Shih-Fu Chang, Jun Wang, Paul Sajda, Eric Pohlmeyer, Barbara Hanna, David Jangraw
  • Patent number: 8135221
    Abstract: A method for determining a classification for a video segment, comprising the steps of: breaking the video segment into a plurality of short-term video slices, each including a plurality of video frames and an audio signal; analyzing the video frames for each short-term video slice to form a plurality of region tracks; analyzing each region track to form a visual feature vector and a motion feature vector; analyzing the audio signal for each short-term video slice to determine an audio feature vector; forming a plurality of short-term audio-visual atoms for each short-term video slice by combining the visual feature vector and the motion feature vector for a particular region track with the corresponding audio feature vector; and using a classifier to determine a classification for the video segment responsive to the short-term audio-visual atoms.
    Type: Grant
    Filed: October 7, 2009
    Date of Patent: March 13, 2012
    Assignees: Eastman Kodak Company, Columbia University
    Inventors: Wei Jiang, Courtenay Cotton, Shih-Fu Chang, Daniel P. Ellis, Alexander C. Loui
  • Publication number: 20110314367
    Abstract: A system and method for labeling and classifying multimedia data is provided that includes novel label propagation techniques and classification function characteristics. The system and method corrects and propagates a small number of potentially erroneous labels to a large amount of multimedia data and generates optimal ways of ranking, classification, and presentation of the data sets. The disclosed systems and methods improve upon prior systems and methods and provide an improved approach to the problems of imbalanced data sets and incorrect label data.
    Type: Application
    Filed: June 21, 2011
    Publication date: December 22, 2011
    Applicant: THE TRUSTEES OF COLUMBIA UNIVERSITY IN THE CITY OF NEW YORK
    Inventors: Shih-Fu Chang, Jun Wang, Tony Jebara
  • Publication number: 20110255605
    Abstract: A system and method is provided for editing and parsing compressed digital information. The compressed digital information may include visual information which is edited and parsed in the compressed domain. In a preferred embodiment, the present invention provides a method for detecting moving objects in a compressed digital bitstream which represents a sequence of fields or frames of video information for one or more captured scenes of video.
    Type: Application
    Filed: April 1, 2011
    Publication date: October 20, 2011
    Inventors: Shih-Fu Chang, Horace J. Meng
  • Publication number: 20110258189
    Abstract: The invention provides a system and method for integrating multimedia descriptions in a way that allows humans, software components or devices to easily identify, represent, manage, retrieve, and categorize the multimedia content. In this manner, a user who may be interested in locating a specific piece of multimedia content from a database, Internet, or broadcast media, for example, may search for and find the multimedia content. In this regard, the invention provides a system and method that receives multimedia content and separates the multimedia content into separate components which are assigned to multimedia categories, such as image, video, audio, synthetic and text. Within each of the multimedia categories, the multimedia content is classified and descriptions of the multimedia content are generated. The descriptions are then formatted and integrated using a multimedia integration description scheme, and the multimedia integration description is generated for the multimedia content.
    Type: Application
    Filed: June 27, 2011
    Publication date: October 20, 2011
    Applicants: Columbia University in the City of New York, AT&T Intellectual Property II, L.P.
    Inventors: Ana Belen Benitez, Shih-Fu Chang, Qian Huang, Seungyup Paek, Atul Puri
  • Patent number: 7970822
    Abstract: The invention provides a system and method for integrating multimedia descriptions in a way that allows humans, software components or devices to easily identify, represent, manage, retrieve, and categorize the multimedia content. In this manner, a user who may be interested in locating a specific piece of multimedia content from a database, Internet, or broadcast media, for example, may search for and find the multimedia content. In this regard, the invention provides a system and method that receives multimedia content and separates the multimedia content into separate components which are assigned to multimedia categories, such as image, video, audio, synthetic and text. Within each of the multimedia categories, the multimedia content is classified and descriptions of the multimedia content are generated. The descriptions are then formatted and integrated using a multimedia integration description scheme, and the multimedia integration description is generated for the multimedia content.
    Type: Grant
    Filed: February 17, 2009
    Date of Patent: June 28, 2011
    Assignees: AT&T Intellectual Property II, L.P., Columbia University in the City of New York
    Inventors: Ana Belen Benitez, Shih-Fu Chang, Qian Huang, Seungyup Paek, Atul Puri
  • Publication number: 20110145232
    Abstract: The disclosed subject matter is directed to systems and methods for searching a database of media content wherein the user can dynamically and interactively perform searches and navigate search results. One or more search anchors are received, and at least one of the search anchors is associated with an anchor cell on a navigation map. One or more documents assigned to at least one cell on the navigation map can be determined, and the cells are populated with search results based at least in part on the search anchors. At least one of the documents is then displayed to a user.
    Type: Application
    Filed: December 15, 2010
    Publication date: June 16, 2011
    Applicant: The Trustees of Columbia University In The City Of New York
    Inventors: Shih-Fu Chang, Eric Zavesky
  • Publication number: 20110081082
    Abstract: A method for determining a classification for a video segment, comprising the steps of: breaking the video segment into a plurality of short-term video slices, each including a plurality of video frames and an audio signal; analyzing the video frames for each short-term video slice to form a plurality of region tracks; analyzing each region track to form a visual feature vector and a motion feature vector; analyzing the audio signal for each short-term video slice to determine an audio feature vector; forming a plurality of short-term audio-visual atoms for each short-term video slice by combining the visual feature vector and the motion feature vector for a particular region track with the corresponding audio feature vector; and using a classifier to determine a classification for the video segment responsive to the short-term audio-visual atoms.
    Type: Application
    Filed: October 7, 2009
    Publication date: April 7, 2011
    Inventors: Wei Jiang, Courtenay Cotton, Shih-Fu Chang, Daniel P. Ellis, Alexander C. Loui
  • Publication number: 20110064136
    Abstract: A system and method is provided for editing and parsing compressed digital information. The compressed digital information may include visual information which is edited and parsed in the compressed domain. In a preferred embodiment, the present invention provides a method for detecting moving objects in a compressed digital bitstream which represents a sequence of fields or frames of video information for one or more captured scenes of video.
    Type: Application
    Filed: September 2, 2010
    Publication date: March 17, 2011
    Inventors: Shih-Fu Chang, Horace J. Meng
  • Publication number: 20110025710
    Abstract: Systems and methods are described for determining manipulation history among a plurality of images. The described techniques include selecting a pair of images from the plurality of images, detecting one or more manipulations operable to transform one of the images to the other, and based on the manipulations detected, determining a parent-child relationship between the pair or pairs of images. The described techniques can further include repeating the selecting of two images, detecting manipulations, and determining the parent-child relationship for each pair of images in the plurality of images, constructing a visual migration map for the images, and presenting the visual migration map in a user-readable format.
    Type: Application
    Filed: August 23, 2010
    Publication date: February 3, 2011
    Applicant: The Trustees of Columbia University In The City Of New York
    Inventors: Lyndon Kennedy, Shih-Fu Chang
  • Patent number: 7817722
    Abstract: A system and method is provided for editing and parsing compressed digital information. The compressed digital information may include visual information which is edited and parsed in the compressed domain. In a preferred embodiment, the present invention provides a method for detecting moving objects in a compressed digital bitstream which represents a sequence of fields or frames of video information for one or more captured scenes of video.
    Type: Grant
    Filed: December 4, 2003
    Date of Patent: October 19, 2010
    Assignee: The Trustees of Columbia University in the City of New York
    Inventors: Shih-Fu Chang, Horace J. Meng
  • Patent number: 7809760
    Abstract: The invention provides a system and method for integrating multimedia descriptions in a way that allows humans, software components or devices to easily identify, represent, manage, retrieve, and categorize the multimedia content. In this manner, a user who may be interested in locating a specific piece of multimedia content from a database, Internet, or broadcast media, for example, may search for and find the multimedia content. In this regard, the invention provides a system and method that receives multimedia content and separates the multimedia content into separate components which are assigned to multimedia categories, such as image, video, audio, synthetic and text. Within each of the multimedia categories, the multimedia content is classified and descriptions of the multimedia content are generated. The descriptions are then formatted and integrated using a multimedia integration description scheme, and the multimedia integration description is generated for the multimedia content.
    Type: Grant
    Filed: September 15, 2009
    Date of Patent: October 5, 2010
    Assignees: AT&T Intellectual Property II, L.P., Columbia University in the City of New York
    Inventors: Ana Benitez, Shih-Fu Chang, Qian Huang, Seungyup Paek, Atul Puri
  • Patent number: 7720851
    Abstract: A context-based concept fusion method detects a first concept in an image record. The method includes automatically determining at least one other concept in the image record which has a contextual relationship with the first concept and which is to be labeled by a user of the method; and labeling the at least one other concept by the user with a ground truth label to be used in the context-based concept fusion method to improve detection of the first concept in the image record.
    Type: Grant
    Filed: December 22, 2006
    Date of Patent: May 18, 2010
    Assignee: Eastman Kodak Company
    Inventors: Shih-Fu Chang, Wei Jiang, Alexander C. Loui
  • Patent number: 7653635
    Abstract: Systems and methods for generating standard description records from multimedia information are provided. The system includes at least one multimedia information input interface (180) receiving multimedia information, a computer processor, and a data storage system (150), operatively coupled to said processor, for storing said at least one description record. The processor performs object extraction processing to generate multimedia object descriptions (200, 201, 205) from the multimedia information, and object hierarchy processing (410, 420) to generate multimedia object hierarchy descriptions, to generate at least one description record including the multimedia object descriptions (200, 201, 205) and multimedia object hierarchy descriptions for content embedded within the multimedia information.
    Type: Grant
    Filed: November 5, 1999
    Date of Patent: January 26, 2010
    Assignee: The Trustees of Columbia University in the City of New York
    Inventors: Seungup Paek, Ana Benitez, Shih-Fu Chang
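
Illustrative note on publication 20130254191: the abstract describes matching query features against database features that fall within a predetermined Hamming distance, keeping candidates whose matched-feature count exceeds a threshold, and then ranking them. The Python sketch below shows only that matching and counting step under simplifying assumptions. It is not the applicants' implementation: the function names, the dictionary layout of the database, and the parameters `max_distance` and `min_matches` are hypothetical, a linear scan stands in for the hash-table lookup, and the geometry and boundary similarity scores named in the abstract are omitted.

```python
def hamming(a: int, b: int) -> int:
    """Number of differing bits between two integer-encoded binary codes."""
    return bin(a ^ b).count("1")


def rank_candidates(query_codes, database, max_distance=2, min_matches=1):
    """Rank media ids by the number of query features they match.

    `database` maps a media id to a list of integer hash codes
    (a hypothetical layout chosen for this sketch). A query feature
    counts as matched when at least one database feature lies within
    `max_distance` bits of it.
    """
    counts = {}
    for media_id, codes in database.items():
        matched = sum(
            1
            for q in query_codes
            if any(hamming(q, c) <= max_distance for c in codes)
        )
        # Keep only candidates whose match count exceeds the threshold.
        if matched > min_matches:
            counts[media_id] = matched
    # Highest match count first; ties broken by media id for determinism.
    return sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))


# Toy data: 4-bit codes for three hypothetical media items.
database = {
    "a": [0b1010, 0b1111],
    "b": [0b0000],
    "c": [0b1011, 0b0111, 0b0001],
}
query = [0b1010, 0b1110, 0b0001]
ranking = rank_candidates(query, database, max_distance=1, min_matches=1)
print(ranking)
```

In a fuller version, the final ordering would also combine the geometry and boundary similarity scores that the abstract lists alongside the matched-feature counts.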