Patents by Inventor Jonathan T. Betz

Jonathan T. Betz has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 10223406
    Abstract: Systems and methods for normalizing entities via name normalization are disclosed. In some implementations, a computer-implemented method of identifying duplicate objects in a plurality of objects is provided. Each object in the plurality of objects is associated with one or more facts, and each of the one or more facts having a value. The method includes: using a computer processor to perform: associating facts extracted from web documents with a plurality of objects; and for each of the plurality of objects, normalizing the value of a name fact, the name fact being among one or more facts associated with the object; processing the plurality of objects in accordance with the normalized value of the name facts of the plurality of objects. In some implementations, normalizing the value of the name fact is optionally carried out by applying a group of normalization rules to the value of the name fact.
    Type: Grant
    Filed: June 29, 2017
    Date of Patent: March 5, 2019
    Assignee: Google LLC
    Inventor: Jonathan T. Betz
  • Publication number: 20170300524
    Abstract: Systems and methods for normalizing entities via name normalization are disclosed. In some implementations, a computer-implemented method of identifying duplicate objects in a plurality of objects is provided. Each object in the plurality of objects is associated with one or more facts, and each of the one or more facts having a value. The method includes: using a computer processor to perform: associating facts extracted from web documents with a plurality of objects; and for each of the plurality of objects, normalizing the value of a name fact, the name fact being among one or more facts associated with the object; processing the plurality of objects in accordance with the normalized value of the name facts of the plurality of objects. In some implementations, normalizing the value of the name fact is optionally carried out by applying a group of normalization rules to the value of the name fact.
    Type: Application
    Filed: June 29, 2017
    Publication date: October 19, 2017
    Inventor: Jonathan T. Betz
  • Patent number: 9710549
    Abstract: Systems and methods for normalizing entities via name normalization are disclosed. In some implementations, a computer-implemented method of identifying duplicate objects in a plurality of objects is provided. Each object in the plurality of objects is associated with one or more facts, and each of the one or more facts having a value. The method includes: using a computer processor to perform: associating facts extracted from web documents with a plurality of objects; and for each of the plurality of objects, normalizing the value of a name fact, the name fact being among one or more facts associated with the object; processing the plurality of objects in accordance with the normalized value of the name facts of the plurality of objects. In some implementations, normalizing the value of the name fact is optionally carried out by applying a group of normalization rules to the value of the name fact.
    Type: Grant
    Filed: March 28, 2014
    Date of Patent: July 18, 2017
    Assignee: Google Inc.
    Inventor: Jonathan T. Betz
  • Patent number: 9558186
    Abstract: A system and method for extracting facts from documents. A fact is extracted from a first document. The attribute and value of the fact extracted from the first document are used as a seed attribute-value pair. A second document containing the seed attribute-value pair is analyzed to determine a contextual pattern used in the second document. The contextual pattern is used to extract other attribute-value pairs from the second document. The extracted attributes and values are stored as facts.
    Type: Grant
    Filed: August 14, 2014
    Date of Patent: January 31, 2017
    Assignee: Google Inc.
    Inventors: Jonathan T. Betz, Shubin Zhao
  • Patent number: 9208229
    Abstract: A system and method for corroborating a set of facts. If the anchor text of the references to a document matches the name of a set of facts, the referenced document is used to corroborate the set of facts. By analyzing the anchor text of the references to the document, the system is capable of determining if a document is relevant to the set of facts. These documents can then be used to corroborate or refute the facts, thereby improving their overall quality.
    Type: Grant
    Filed: March 31, 2006
    Date of Patent: December 8, 2015
    Assignee: Google Inc.
    Inventors: Jonathan T. Betz, Shubin Zhao
  • Patent number: 9092495
    Abstract: Systems and methods for automatic object reference identification and linking in a browseable fact repository database are provided. In some implementations, a method includes, identifying a set of values from a plurality of facts associated with an entity. The plurality of facts are stored in a fact repository, and a respective fact includes: an attribute and a corresponding value. The method further includes, responsive to a search for a first value included in a first fact in the plurality of facts: identifying a second fact associated with the entity; and causing to be displayed to a user: a link associated with the second fact, and information representing a confidence value associated with the second fact. The link, when selected, invokes a search of the fact repository in accordance with one or more search parameters, which include a value corresponding to an attribute included in the second fact.
    Type: Grant
    Filed: February 28, 2014
    Date of Patent: July 28, 2015
    Assignee: Google Inc.
    Inventors: Andrew W. Hogue, Jonathan T. Betz
  • Patent number: 8996470
    Abstract: Methods and systems for maintaining the internal consistency of a fact repository are described. Accessed objects are checked for attribute-value pairs that have links to other objects. For any link to an object, the name of the linked-to object is inserted into the attribute-value pair having the link. The accessed objects are filtered to remove attribute-value pairs meeting predefined criteria, possibly resulting in null objects. Links to null objects are identified and removed.
    Type: Grant
    Filed: May 31, 2005
    Date of Patent: March 31, 2015
    Assignee: Google Inc.
    Inventors: Andrew William Hogue, Robert Joseph Siemborski, Jonathan T. Betz
  • Publication number: 20140372473
    Abstract: A system and method for extracting facts from documents. A fact is extracted from a first document. The attribute and value of the fact extracted from the first document are used as a seed attribute-value pair. A second document containing the seed attribute-value pair is analyzed to determine a contextual pattern used in the second document. The contextual pattern is used to extract other attribute-value pairs from the second document. The extracted attributes and values are stored as facts.
    Type: Application
    Filed: August 14, 2014
    Publication date: December 18, 2014
    Inventors: Jonathan T. Betz, Shubin Zhao
  • Patent number: 8825471
    Abstract: A system and method for extracting facts from documents. A fact is extracted from a first document. The attribute and value of the fact extracted from the first document are used as a seed attribute-value pair. A second document containing the seed attribute-value pair is analyzed to determine a contextual pattern used in the second document. The contextual pattern is used to extract other attribute-value pairs from the second document. The extracted attributes and values are stored as facts.
    Type: Grant
    Filed: March 31, 2006
    Date of Patent: September 2, 2014
    Assignee: Google Inc.
    Inventors: Jonathan T. Betz, Shubin Zhao
  • Publication number: 20140214778
    Abstract: Systems and methods for normalizing entities via name normalization are disclosed. In some implementations, a computer-implemented method of identifying duplicate objects in a plurality of objects is provided. Each object in the plurality of objects is associated with one or more facts, and each of the one or more facts having a value. The method includes: using a computer processor to perform: associating facts extracted from web documents with a plurality of objects; and for each of the plurality of objects, normalizing the value of a name fact, the name fact being among one or more facts associated with the object; processing the plurality of objects in accordance with the normalized value of the name facts of the plurality of objects. In some implementations, normalizing the value of the name fact is optionally carried out by applying a group of normalization rules to the value of the name fact.
    Type: Application
    Filed: March 28, 2014
    Publication date: July 31, 2014
    Applicant: GOOGLE INC.
    Inventor: Jonathan T. Betz
  • Publication number: 20140195520
    Abstract: Systems and methods for automatic object reference identification and linking in a browseable fact repository database are provided. In some implementations, a method includes, identifying a set of values from a plurality of facts associated with an entity. The plurality of facts are stored in a fact repository, and a respective fact includes: an attribute and a corresponding value. The method further includes, responsive to a search for a first value included in a first fact in the plurality of facts: identifying a second fact associated with the entity; and causing to be displayed to a user: a link associated with the second fact, and information representing a confidence value associated with the second fact. The link, when selected, invokes a search of the fact repository in accordance with one or more search parameters, which include a value corresponding to an attribute included in the second fact.
    Type: Application
    Filed: February 28, 2014
    Publication date: July 10, 2014
    Applicant: GOOGLE INC.
    Inventors: Andrew W. Hogue, Jonathan T. Betz
  • Patent number: 8738643
    Abstract: A repository contains objects representing entities. The objects also include facts about the represented entities. The facts are derived from source documents. A synonymous name of an object is determined by identifying a source document from which one or more facts of the entity represented by the object were derived, identifying a plurality of linking documents that link to the source document through hyperlinks, each hyperlink having an anchor text, processing the anchor texts in the plurality of linking documents to generate a collection of synonym candidates for the entity represented by the object, and selecting a synonymous name for the entity represented by the object from the collection of synonym candidates.
    Type: Grant
    Filed: August 2, 2007
    Date of Patent: May 27, 2014
    Assignee: Google Inc.
    Inventors: Krzysztof Czuba, Jonathan T. Betz, Jeffrey C. Reynar
  • Patent number: 8719260
    Abstract: A method and system for identifying a subject of a document and facts included within are described. A source document that includes facts and linking documents that include hyperlinks to the source document are identified. The anchor texts of the hyperlinks are identified and candidate labels are generated based on the anchor texts. One of the candidate labels is selected as the unifying subject of the facts. An information set, distinct from the source document, is stored in a facts database, wherein the stored information set includes the unifying subject, and source document information associating the source document with the information set.
    Type: Grant
    Filed: November 22, 2011
    Date of Patent: May 6, 2014
    Assignee: Google Inc.
    Inventor: Jonathan T. Betz
  • Patent number: 8700568
    Abstract: A system and method identifying duplicate objects from a plurality of objects. The system and method normalizes name values of objects, groups objects into buckets based at least in part on the normalized name values, matches objects within the same bucket based on a selected matcher, and identifies the matching objects as duplicate objects.
    Type: Grant
    Filed: March 31, 2006
    Date of Patent: April 15, 2014
    Assignee: Google Inc.
    Inventor: Jonathan T. Betz
  • Patent number: 8682913
    Abstract: A method and system for corroborating facts extracted as attribute-value pairs from multiple sources. Facts associated with a common subject are identified and extracted from multiple sources. From the extracted facts, attributes satisfying a first corroboration requirement are identified. Attribute-value pairs satisfying a second corroboration requirement and having any of the identified attributes that satisfy the first corroboration requirement are identified. Predefined actions may be performed with respect to the identified attribute-value pairs and/or with respect to their status in a fact database.
    Type: Grant
    Filed: March 31, 2005
    Date of Patent: March 25, 2014
    Assignee: Google Inc.
    Inventor: Jonathan T. Betz
  • Patent number: 8682891
    Abstract: Links between facts associated with objects are automatically created and maintained in a fact repository. Names of objects are automatically identified in the facts, and collected into a list of names. The facts are then processed to identifying such names in the facts. Identified names are used as anchor text for search links. A search link includes a search query for a service engine which search the fact repository for facts associated with objects having the same name.
    Type: Grant
    Filed: September 4, 2012
    Date of Patent: March 25, 2014
    Assignee: Google Inc.
    Inventors: Andrew W. Hogue, Jonathan T. Betz
  • Publication number: 20130060746
    Abstract: Links between facts associated with objects are automatically created and maintained in a fact repository. Names of objects are automatically identified in the facts, and collected into a list of names. The facts are then processed to identifying such names in the facts. Identified names are used as anchor text for search links. A search link includes a search query for a service engine which search the fact repository for facts associated with objects having the same name.
    Type: Application
    Filed: September 4, 2012
    Publication date: March 7, 2013
    Inventors: Andrew Hogue, Jonathan T. Betz
  • Publication number: 20130041910
    Abstract: A system and method determines whether two objects are duplicate objects. The system and method matches common facts of the two objects based on a match measure, combines the entropies of the matching common facts, and determines whether the two objects are duplicate objects based on the sum of entropies.
    Type: Application
    Filed: August 14, 2012
    Publication date: February 14, 2013
    Inventors: Jonathan T. Betz, Vivek Menezes
  • Patent number: 8260785
    Abstract: Links between facts associated with objects are automatically created and maintained in a fact repository. Names of objects are automatically identified in the facts, and collected into a list of names. The facts are then processed to identifying such names in the facts. Identified names are used as anchor text for search links. A search link includes a search query for a service engine which search the fact repository for facts associated with objects having the same name.
    Type: Grant
    Filed: February 17, 2006
    Date of Patent: September 4, 2012
    Assignee: Google Inc.
    Inventors: Andrew W. Hogue, Jonathan T. Betz
  • Patent number: 8244689
    Abstract: A system and method determines whether two objects are duplicate objects. The system and method matches common facts of the two objects based on a match measure, combines the entropies of the matching common facts, and determines whether the two objects are duplicate objects based on the sum of entropies.
    Type: Grant
    Filed: February 17, 2006
    Date of Patent: August 14, 2012
    Assignee: Google Inc.
    Inventors: Jonathan T. Betz, Vivek Menezes