Patents by Inventor Prateek Sarkar

Prateek Sarkar has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 8943099
    Abstract: Methods, and systems, including computer programs encoded on computer-readable mediums, including a method for determining codomain relationship measures between domains pairs. The method includes accessing domain data specifying first and second domains; for each first domain: determining a total number of image links on web pages on the first domain, each image link being a link to an image on a second domain; identifying second domains hosting images to which image links on web pages on the first domain link; for each second domain: determining a number of image links from web pages on the first domain linking to images on the second domain; determining a codomain relationship measure for the first and second domain based on the number of image links from web pages on the first domain linking to images on the second domain and the total number of image links on web pages on the first domain.
    Type: Grant
    Filed: February 22, 2013
    Date of Patent: January 27, 2015
    Assignee: Google Inc.
    Inventors: Prateek Sarkar, Arcot J. Preetham, Hyung-Jin Kim, Gabriel Wolosin
  • Patent number: 8706660
    Abstract: Methods and system employing the same for optimizing an objective function are provided. The objective function assesses the quality of a candidate solution. One or more variables of an objective function are selected as pivot variables. Each of the variables include one or more candidate values. An upper bound function is generated from the objective function, where the pivot variables are held fixed. For each combination of the candidate values, one or more candidate solutions are searched using the upper bound function. One or more optimal solutions are selected from among the solutions to the searches.
    Type: Grant
    Filed: February 8, 2011
    Date of Patent: April 22, 2014
    Assignee: Palo Alto Research Center Incorporated
    Inventors: Evgeniy Bart, Prateek Sarkar, John T. Maxwell, III
  • Patent number: 8625886
    Abstract: Methods and system employing the same for finding repeated structure for data extraction from document images are provided. A reference record and one or more reference fields thereof are identified from a document image. One or more candidate fields are generated for each of the reference fields. One or more best candidate records from the candidate fields are selected using a probabilistic model and an optimal record set is determined from the best candidate records.
    Type: Grant
    Filed: February 8, 2011
    Date of Patent: January 7, 2014
    Assignee: Palo Alto Research Center Incorporated
    Inventors: Evgeniy Bart, Prateek Sarkar, Eric Saund
  • Patent number: 8452086
    Abstract: A user interface and method is embodied on a computer readable medium and executable on a computer. The user interface is a labeler which labels only foreground pixels of an image stored in a computing environment. The labeler operates in a Region mode/state and Brush mode/state, and includes a Tentative mode that permits an assigned label to be changed after pixels have been selected. Groups of pixels may be selected for labeling at once by a point-and-click command, and a pixel may belong to one or more groups of pixels which are stored in memory as image layers. The groups are formed dynamically by user selection actions, and/or through automatic recognition algorithms. Pixels already labeled with certain labels may be locked to not be altered by additional labeling operations. Unassigned pixels may be highlighted to increase the ease at which they are identified in an image. Comparisons between labeled images are undertaken to indicate differences between different groundtruth labeling.
    Type: Grant
    Filed: July 10, 2009
    Date of Patent: May 28, 2013
    Assignee: Palo Alto Research Center Incorporated
    Inventors: Eric Saund, Jing Lin, Prateek Sarkar
  • Patent number: 8442319
    Abstract: Methods and systems for classifying markings on images in a document are undertaken according to marking types. The document containing the images is supplied to a segmenter which breaks the images into fragments of foreground pixel structures that are identified as being likely to be of the same marking type by finding connected components, extracting near-horizontal or -vertical rule lines and subdividing some connected components to obtain the fragments. The fragments are then supplied to a classifier, where the classifier provides a category score for each fragment, wherein the classifier is trained from the groundtruth images whose pixels are labeled according to known marking types. Thereafter, a same label is assigned to all pixels in a particular fragment, when the fragment is classified by the classifier.
    Type: Grant
    Filed: July 10, 2009
    Date of Patent: May 14, 2013
    Assignee: Palo Alto Research Center Incorporated
    Inventors: Prateek Sarkar, Eric Saund
  • Patent number: 8340363
    Abstract: The present application is a method and system of interpreting an image by finding a configuration of multiple variables which optimizes an objective function with a factorizable upper bound, by applying an iterative algorithm that relies on efficient dynamic ordering of candidate configurations, in a priority queue, in a descending order of an upper bound score. As an example, consider a constellation model for an object. It specifies the appearance models for individual parts of objects, as well as spatial relations among these parts. These are combined into a single function whose value represents the likeness of the object in an image. To find the configuration in which the object is present in the image, we maximize this function over all candidate configurations. The purpose of the iterative algorithm mentioned above is to find such optimal configurations efficiently.
    Type: Grant
    Filed: May 27, 2010
    Date of Patent: December 25, 2012
    Assignee: Palo Alto Research Center Incorporated
    Inventors: Prateek Sarkar, Evgeniy Bart
  • Patent number: 8285074
    Abstract: Methods of generating image anchor templates from low variance regions of document images of a first class are provided. The methods select a document image from the document images of the first class and align the other document images of the first class to the selected document image. Low variance regions are then determined by comparing the aligned document images and the selected document image and used to generate image anchor templates.
    Type: Grant
    Filed: September 1, 2010
    Date of Patent: October 9, 2012
    Assignees: Palo Alto Research Center Incorporated, Xerox Corporation
    Inventors: Eric Saund, Prateek Sarkar, Alejandro E. Brito, Marshall W. Bern, Francois Ragnet
  • Patent number: 8285058
    Abstract: Methods, and corresponding systems, of generating one or more image anchor templates for extracting data from a data field of a first class of documents are provided. The methods include generating one or more candidate image anchor templates from at least one of one or more exemplars of the first class; determining a quality score for each of the one or more candidate image anchor templates using a computer processor and known locations of the data field within the one or more exemplars of the first class; ranking the one or more candidate image anchor templates according to quality score; and selecting one or more of the most highly ranked image anchor templates.
    Type: Grant
    Filed: May 14, 2010
    Date of Patent: October 9, 2012
    Assignee: Palo Alto Research Center Incorporated
    Inventor: Prateek Sarkar
  • Patent number: 8285057
    Abstract: Methods, and corresponding systems, of generating one or more image anchor templates for discriminating between documents of a first class and documents of other classes are provided. The methods include generating one or more candidate image anchor templates; determining, using a computer processor, a quality score for each of the one or more candidate image anchor templates; ranking the one or more candidate image anchor templates according to the quality scores of the one or more candidate image anchor templates; and selecting one or more of the most highly ranked image anchor templates.
    Type: Grant
    Filed: May 14, 2010
    Date of Patent: October 9, 2012
    Assignee: Palo Alto Research Center Incorporated
    Inventor: Prateek Sarkar
  • Publication number: 20120201457
    Abstract: Methods and system employing the same for finding repeated structure for data extraction from document images are provided. A reference record and one or more reference fields thereof are identified from a document image. One or more candidate fields are generated for each of the reference fields. One or more best candidate records from the candidate fields are selected using a probabilistic model and an optimal record set is determined from the best candidate records.
    Type: Application
    Filed: February 8, 2011
    Publication date: August 9, 2012
    Applicant: PALO ALTO RESEARCH CENTER INCORPORATED
    Inventors: Evgeniy Bart, Prateek Sarkar, Eric Saund
  • Publication number: 20120203721
    Abstract: Methods and system employing the same for optimizing an objective function are provided. The objective function assesses the quality of a candidate solution. One or more variables of an objective function are selected as pivot variables. Each of the variables include one or more candidate values. An upper bound function is generated from the objective function, where the pivot variables are held fixed. For each combination of the candidate values, one or more candidate solutions are searched using the upper bound function. One or more optimal solutions are selected from among the solutions to the searches.
    Type: Application
    Filed: February 8, 2011
    Publication date: August 9, 2012
    Applicant: PALO ALTO RESEARCH CENTER INCORPORATED
    Inventors: Evgeniy Bart, Prateek Sarkar, John T. Maxwell, III
  • Publication number: 20120051649
    Abstract: Methods of generating image anchor templates from low variance regions of document images of a first class are provided. The methods select a document image from the document images of the first class and align the other document images of the first class to the selected document image. Low variance regions are then determined by comparing the aligned document images and the selected document image and used to generate image anchor templates.
    Type: Application
    Filed: September 1, 2010
    Publication date: March 1, 2012
    Applicants: XEROX CORPORATION, PALO ALTO RESEARCH CENTER INCORPORATED
    Inventors: Eric Saund, Prateek Sarkar, Alejandro E. Brito, Marshall W. Bern, Francois Ragnet
  • Publication number: 20110293187
    Abstract: The present application is a method and system of interpreting an image by finding a configuration of multiple variables which optimizes an objective function with a factorizable upper bound, by applying an iterative algorithm that relies on efficient dynamic ordering of candidate configurations, in a priority queue, in a descending order of an upper bound score. As an example, consider a constellation model for an object. It specifies the appearance models for individual parts of objects, as well as spatial relations among these parts. These are combined into a single function whose value represents the likeness of the object in an image. To find the configuration in which the object is present in the image, we maximize this function over all candidate configurations. The purpose of the iterative algorithm mentioned above is to find such optimal configurations efficiently.
    Type: Application
    Filed: May 27, 2010
    Publication date: December 1, 2011
    Applicant: PALO ALTO RESEARCH CENTER INCORPORATED
    Inventors: Prateek Sarkar, Evgeniy Bart
  • Publication number: 20110280451
    Abstract: Methods, and corresponding systems, of generating one or more image anchor templates for discriminating between documents of a first class and documents of other classes are provided. The methods include generating one or more candidate image anchor templates; determining, using a computer processor, a quality score for each of the one or more candidate image anchor templates; ranking the one or more candidate image anchor templates according to the quality scores of the one or more candidate image anchor templates; and selecting one or more of the most highly ranked image anchor templates.
    Type: Application
    Filed: May 14, 2010
    Publication date: November 17, 2011
    Applicant: PALO ALTO RESEARCH CENTER INCORPORATED
    Inventor: Prateek Sarkar
  • Publication number: 20110280490
    Abstract: Methods, and corresponding systems, of generating one or more image anchor templates for extracting data from a data field of a first class of documents are provided. The methods include generating one or more candidate image anchor templates from at least one of one or more exemplars of the first class; determining a quality score for each of the one or more candidate image anchor templates using a computer processor and known locations of the data field within the one or more exemplars of the first class; ranking the one or more candidate image anchor templates according to quality score; and selecting one or more of the most highly ranked image anchor templates.
    Type: Application
    Filed: May 14, 2010
    Publication date: November 17, 2011
    Applicant: PALO ALTO RESEARCH CENTER INCORPORATED
    Inventor: Prateek Sarkar
  • Patent number: 8000538
    Abstract: A system and method for image performing classification through generative models of features occurring in an image. Category-conditional probability distributions of features occurring in a plurality of training images are maintained. Each distribution is assigned to a category. The features occurring in an unclassified image are identified. Category-conditional likelihoods for the identified features are determined using the category-conditional probability distributions for each category. The unclassified image is assigned to one such category based upon the category-conditional likelihoods.
    Type: Grant
    Filed: December 22, 2006
    Date of Patent: August 16, 2011
    Assignee: Palo Alto Research Center Incorporated
    Inventor: Prateek Sarkar
  • Publication number: 20110007366
    Abstract: Methods and systems for classifying markings on images in a document are undertaken according to marking types. The document containing the images is supplied to a segmenter which breaks the images into fragments of foreground pixel structures that are identified as being likely to be of the same marking type by finding connected components, extracting near-horizontal or -vertical rule lines and subdividing some connected components to obtain the fragments. The fragments are then supplied to a classifier, where the classifier provides a category score for each fragment, wherein the classifier is trained from the groundtruth images whose pixels are labeled according to known marking types. Thereafter, a same label is assigned to all pixels in a particular fragment, when the fragment is classified by the classifier.
    Type: Application
    Filed: July 10, 2009
    Publication date: January 13, 2011
    Applicant: Palo Alto Research Center Incorporated
    Inventors: Prateek Sarkar, Eric Saund
  • Publication number: 20110007964
    Abstract: A user interface and method is embodied on a computer readable medium and executable on a computer. The user interface is a labeler which labels only foreground pixels of an image stored in a computing environment. The labeler operates in a Region mode/state and Brush mode/state, and includes a Tentative mode that permits an assigned label to be changed after pixels have been selected. Groups of pixels may be selected for labeling at once by a point-and-click command, and a pixel may belong to one or more groups of pixels which are stored in memory as image layers. The groups are formed dynamically by user selection actions, and/or through automatic recognition algorithms. Pixels already labeled with certain labels may be locked to not be altered by additional labeling operations. Unassigned pixels may be highlighted to increase the ease at which they are identified in an image. Comparisons between labeled images are undertaken to indicate differences between different groundtruth labeling.
    Type: Application
    Filed: July 10, 2009
    Publication date: January 13, 2011
    Applicant: PALO ALTO RESEARCH CENTER INCORPORATED
    Inventors: Eric Saund, Jing Lin, Prateek Sarkar
  • Publication number: 20080152238
    Abstract: A system and method for image performing classification through generative models of features occurring in an image. Category-conditional probability distributions of features occurring in a plurality of training images are maintained. Each distribution is assigned to a category. The features occurring in an unclassified image are identified. Category-conditional likelihoods for the identified features are determined using the category-conditional probability distributions for each category. The unclassified image is assigned to one such category based upon the category-conditional likelihoods.
    Type: Application
    Filed: December 22, 2006
    Publication date: June 26, 2008
    Inventor: Prateek Sarkar
  • Patent number: 7224836
    Abstract: Techniques are provided to classify patterns in isogenous pattern sources. Techniques are provided to determine a computationally inexpensive upperbound on the true score or joint probability of the field label and field features over all field labels. Candidate field labels associated with promising upperbound scores are dynamically queued. True scores are computed for a subset of the candidates fields resulting in reduced computations to determine a field label. Techniques are also provided to determine optimal variables for any system with shared constraints.
    Type: Grant
    Filed: December 20, 2002
    Date of Patent: May 29, 2007
    Assignee: Palo Alto Research Center Incorporated
    Inventor: Prateek Sarkar