Patents by Inventor Mauricio A. Hernandez-Sherrington

Mauricio A. Hernandez-Sherrington has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11531717
    Abstract: Data records are linked across a plurality of datasets. Each dataset contains at least one data record, and each data record is associated with an entity and includes one or more attributes of that entity and a value for each attribute. Values associated with attributes are compared across datasets, and matching attributes having values that satisfy a predetermined similarity threshold are identified. In addition, linkage points between pairs of datasets are identified. Each linkage point links one or more pairs of data records. Each data record in the pair of data records is contained in one of a given pair of datasets, and each pair of data records is associated with a common entity having matching attributes in the given pair of datasets. Data records associated with the common entities are linked across datasets using the identified linkage points.
    Type: Grant
    Filed: February 19, 2020
    Date of Patent: December 20, 2022
    Assignee: International Business Machines Corporation
    Inventors: Oktie Hassanzadeh, Mauricio A. Hernandez-Sherrington, Ching-Tien Ho, Lucian Popa
  • Publication number: 20200183995
    Abstract: Data records are linked across a plurality of datasets. Each dataset contains at least one data record, and each data record is associated with an entity and includes one or more attributes of that entity and a value for each attribute. Values associated with attributes are compared across datasets, and matching attributes having values that satisfy a predetermined similarity threshold are identified. In addition, linkage points between pairs of datasets are identified. Each linkage point links one or more pairs of data records. Each data record in the pair of data records is contained in one of a given pair of datasets, and each pair of data records is associated with a common entity having matching attributes in the given pair of datasets. Data records associated with the common entities are linked across datasets using the identified linkage points.
    Type: Application
    Filed: February 19, 2020
    Publication date: June 11, 2020
    Inventors: Oktie Hassanzadeh, Mauricio A. Hernandez-Sherrington, Ching-Tien Ho, Lucian Popa
  • Patent number: 10585986
    Abstract: Methods, systems, and computer program products for entity structured representation and variant generation are provided herein. A computer-implemented method includes automatically parsing instances of a given entity type into semantic components by implementing a parser based at least in part on (i) the given entity type and (ii) items of information relevant to the given entity type; generating, based at least in part on (i) the semantic components and (ii) information pertaining to one or more valid component-specific variants, one or more variants of the semantic components; creating, based at least in part on the one or more variants of the one or more semantic components, one or more variants of at least one instance of an entity associated with the given entity type; and outputting, to at least one user, the one or more variants of the at least one instance of the entity.
    Type: Grant
    Filed: August 20, 2018
    Date of Patent: March 10, 2020
    Assignee: International Business Machines Corporation
    Inventors: Nikita Bhutani, Mauricio Hernandez-Sherrington, Yunyao Li, Min Li, Kun Qian
  • Publication number: 20200057806
    Abstract: Methods, systems, and computer program products for entity structured representation and variant generation are provided herein. A computer-implemented method includes automatically parsing instances of a given entity type into semantic components by implementing a parser based at least in part on (i) the given entity type and (ii) items of information relevant to the given entity type; generating, based at least in part on (i) the semantic components and (ii) information pertaining to one or more valid component-specific variants, one or more variants of the semantic components; creating, based at least in part on the one or more variants of the one or more semantic components, one or more variants of at least one instance of an entity associated with the given entity type; and outputting, to at least one user, the one or more variants of the at least one instance of the entity.
    Type: Application
    Filed: August 20, 2018
    Publication date: February 20, 2020
    Inventors: Nikita Bhutani, Mauricio Hernandez-Sherrington, Yunyao Li, Min Li, Kun Qian
  • Patent number: 9996607
    Abstract: Described herein are methods, systems and computer program products for entity resolution. Entity resolution, also known as entity matching or record linkage, seeks to identify equivalent data objects between or among datasets. An example method includes creating a deterministic model by defining an entity to be resolved, selecting two datasets for comparison, defining matching predicates for attributes of the datasets to select a set of candidate matches, and defining a precedence rule for the candidate matches to select a subset of the candidate matches. The method includes running the deterministic model on the two datasets. Running the deterministic model includes applying the matching predicates and the precedence rule to data in the datasets that correspond to the attributes. The method also includes applying a cardinality rule to results of the running, and outputting the matching candidates for which the cardinality rule is satisfied.
    Type: Grant
    Filed: October 31, 2014
    Date of Patent: June 12, 2018
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Bogdan Alexe, Douglas R. Burdick, Mauricio A. Hernandez-Sherrington, Hima P. Karanam, Rajasekar Krishnamurthy, Lucian Popa, Shivakumar Vaithyanathan
  • Patent number: 9703817
    Abstract: Embodiments of the present invention relate to a declarative framework for efficient incremental information integration. In one embodiment, a method of and computer program product for information integration is provided. An integration rule is received. A first data set is accessed. A first representation of the first data is generated set based on the plurality of integration rules. The first representation is flat and includes a plurality of records. At least one index is generated. The index encodes at least one hierarchical relationship among the plurality of records. A second representation is generated of the first representation based on the at least one index. The second representation comprising nested data.
    Type: Grant
    Filed: August 4, 2014
    Date of Patent: July 11, 2017
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Mauricio A. Hernandez-Sherrington, Lucian Popa, Li Qian
  • Publication number: 20160125067
    Abstract: Embodiments relate to entity resolution. One aspect includes creating a deterministic model by defining an entity to be resolved, selecting two datasets for comparison, defining matching predicates for attributes of the datasets to select a set of candidate matches, and defining a precedence rule for the candidate matches to select a subset of the candidate matches. An aspect further includes running the deterministic model on the two datasets. Running the deterministic model includes applying the matching predicates and the precedence rule to data in the datasets that correspond to the attributes. An aspect also includes applying a cardinality rule to results of the running, and outputting the matching candidates for which the cardinality rule is satisfied.
    Type: Application
    Filed: October 31, 2014
    Publication date: May 5, 2016
    Inventors: Bogdan Alexe, Douglas R. Burdick, Mauricio A. Hernandez-Sherrington, Hima P. Karanam, Rajasekar Krishnamurthy, Lucian Popa, Shivakumar Vaithyanathan
  • Publication number: 20160034478
    Abstract: Embodiments of the present invention relate to a declarative framework for efficient incremental information integration. In one embodiment, a method of and computer program product for information integration is provided. An integration rule is received. A first data set is accessed. A first representation of the first data is generated set based on the plurality of integration rules. The first representation is flat and includes a plurality of records. At least one index is generated. The index encodes at least one hierarchical relationship among the plurality of records. A second representation is generated of the first representation based on the at least one index. The second representation comprising nested data.
    Type: Application
    Filed: August 4, 2014
    Publication date: February 4, 2016
    Inventors: Mauricio A. Hernandez-Sherrington, Lucian Popa, Li Qian
  • Publication number: 20070179962
    Abstract: A method and system for specifying, in a schema mapping framework, a mapping between a source schema and a target schema. The source and target schemas are schemas included in respective groups of registered, heterogeneous schemas. The source and target schemas may be of different types. Serialized versions of the source and target schemas include source objects and target objects, respectively. A mapping model is serialized into mapping objects that include logical references representing the source objects and logical references representing the target objects. The logical references are resolved to the source objects and target objects, thereby storing pointers to the source objects and to the target objects. After resolving the logical references, the mapping model includes the logical references and the pointers to the source and target objects.
    Type: Application
    Filed: January 31, 2006
    Publication date: August 2, 2007
    Applicant: International Business Machines Corporation
    Inventors: Mauricio Hernandez-Sherrington, Lucian Popa, Mary Roth, Craig Salter
  • Publication number: 20070174231
    Abstract: A method and system for generating a query implementing a schema mapping. A mapping M is provided from a schema S to a schema T, where M relates S to T, and M includes a plurality of constraints. Schemas S and T each include one or more elements, and T includes at least one set type element. Mapping M is expressed in terms of at least one nested tuple-generating dependency. A query Q is generated where Q is capable of applying M to an input instance I to result in an output instance J, where I conforms to S, J conforms to T, and I and J satisfy the plurality of constraints. Instance J is in partitioned normal form (i.e., satisfies minimal union semantics) and includes no duplicate element instances.
    Type: Application
    Filed: January 6, 2006
    Publication date: July 26, 2007
    Applicant: International Business Machines Corporation
    Inventors: Mauricio Hernandez-Sherrington, Ching-Tien Ho, Lucian Popa
  • Publication number: 20060282454
    Abstract: Techniques are provided for viewing mappings between objects. A main view is displayed, wherein the main view shows one or more source objects, one or more target objects, and zero or more mappings between the one or more source objects and the one or more target objects. Input selecting a type of view to be displayed in the main view is received, wherein each type of view provides a different amount of detail regarding the mappings. In response to receiving the input, the selected type of view is created, and the created view is displayed. Additionally, techniques are provided for viewing objects. One or more objects along with mappings between the one or more objects are displayed. View filters are provided that may be applied to the one or more objects, wherein the view filters enable hiding at least one of mapped or unmapped objects. Moreover, techniques are provided for viewing nodes. A structure is displayed that includes one or more nodes.
    Type: Application
    Filed: June 10, 2005
    Publication date: December 14, 2006
    Inventors: Mauricio Hernandez-Sherrington, Robert LaVerne Hobbs, Kiranmayi Potu, Daina Pupons Wickham, Lingling Yan
  • Publication number: 20060282429
    Abstract: Various embodiments of a method, system and article of manufacture to discover relationships among a first set of elements and a second set of elements are provided. At least one metric algorithm is identified based on a metric selection parameter. A raw result is determined based on the at least one metric algorithm, a first specified structural description of the first set of elements and a second specified structural description of the second set of elements. The raw result comprises a plurality of relationship measurements and the raw result is ordered. In some embodiments, a balanced result is produced based on the raw result and a matching strategy algorithm. In other embodiments, the matching strategy algorithm is identified based on a matching strategy selection parameter.
    Type: Application
    Filed: June 10, 2005
    Publication date: December 14, 2006
    Inventors: Mauricio Hernandez-Sherrington, Ching-Tien Ho, Mary Roth, Lingling Yan