Patents by Inventor Ching-Tien Ho

Ching-Tien Ho has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11531717
    Abstract: Data records are linked across a plurality of datasets. Each dataset contains at least one data record, and each data record is associated with an entity and includes one or more attributes of that entity and a value for each attribute. Values associated with attributes are compared across datasets, and matching attributes having values that satisfy a predetermined similarity threshold are identified. In addition, linkage points between pairs of datasets are identified. Each linkage point links one or more pairs of data records. Each data record in the pair of data records is contained in one of a given pair of datasets, and each pair of data records is associated with a common entity having matching attributes in the given pair of datasets. Data records associated with the common entities are linked across datasets using the identified linkage points.
    Type: Grant
    Filed: February 19, 2020
    Date of Patent: December 20, 2022
    Assignee: International Business Machines Corporation
    Inventors: Oktie Hassanzadeh, Mauricio A. Hernandez-Sherrington, Ching-Tien Ho, Lucian Popa
  • Publication number: 20200183995
    Abstract: Data records are linked across a plurality of datasets. Each dataset contains at least one data record, and each data record is associated with an entity and includes one or more attributes of that entity and a value for each attribute. Values associated with attributes are compared across datasets, and matching attributes having values that satisfy a predetermined similarity threshold are identified. In addition, linkage points between pairs of datasets are identified. Each linkage point links one or more pairs of data records. Each data record in the pair of data records is contained in one of a given pair of datasets, and each pair of data records is associated with a common entity having matching attributes in the given pair of datasets. Data records associated with the common entities are linked across datasets using the identified linkage points.
    Type: Application
    Filed: February 19, 2020
    Publication date: June 11, 2020
    Inventors: Oktie Hassanzadeh, Mauricio A. Hernandez-Sherrington, Ching-Tien Ho, Lucian Popa
  • Patent number: 10599732
    Abstract: Data records are linked across a plurality of datasets. Each dataset contains at least one data record, and each data record is associated with an entity and includes one or more attributes of that entity and a value for each attribute. Values associated with attributes are compared across datasets, and matching attributes having values that satisfy a predetermined similarity threshold are identified. In addition, linkage points between pairs of datasets are identified. Each linkage point links one or more pairs of data records. Each data record in the pair of data records is contained in one of a given pair of datasets, and each pair of data records is associated with a common entity having matching attributes in the given pair of datasets. Data records associated with the common entities are linked across datasets using the identified linkage points.
    Type: Grant
    Filed: February 23, 2017
    Date of Patent: March 24, 2020
    Assignee: International Business Machines Corporation
    Inventors: Oktie Hassanzadeh, Mauricio A Hernandez, Ching-Tien Ho, Lucian Popa
  • Patent number: 10019437
    Abstract: A method includes receiving one or more natural language dependency parse trees as input. A hardware processor is used for processing the dependency parse trees by creating a mapping from nodes of the one or more dependency parse trees into actions, roles and contextual predicates. The mapping is used for information extraction. The actions include the verbs along with attributes of the verbs. The roles include arguments for the verbs. The contextual predicates include modifiers for the verbs.
    Type: Grant
    Filed: February 23, 2015
    Date of Patent: July 10, 2018
    Assignee: International Business Machines Corporation
    Inventors: Ching-Tien Ho, Benny Kimelfeld, Yunyao Li, Shivakumar Vaithyanathan
  • Patent number: 9792351
    Abstract: Various embodiments of a method, system and article of manufacture to discover relationships among a first set of elements and a second set of elements are provided. At least one metric algorithm is identified based on a metric selection parameter. A raw result is determined based on the at least one metric algorithm, a first specified structural description of the first set of elements and a second specified structural description of the second set of elements. The raw result comprises a plurality of relationship measurements and the raw result is ordered. In some embodiments, a balanced result is produced based on the raw result and a matching strategy algorithm. In other embodiments, the matching strategy algorithm is identified based on a matching strategy selection parameter.
    Type: Grant
    Filed: June 10, 2005
    Date of Patent: October 17, 2017
    Assignee: International Business Machines Corporation
    Inventors: Mauricio Antonio Hernandez-Sherrington, Ching-Tien Ho, Mary Ann Roth, Lingling Yan
  • Patent number: 9710534
    Abstract: Data records are linked across a plurality of datasets. Each dataset contains at least one data record, and each data record is associated with an entity and includes one or more attributes of that entity and a value for each attribute. Values associated with attributes are compared across datasets, and matching attributes having values that satisfy a predetermined similarity threshold are identified. In addition, linkage points between pairs of datasets are identified. Each linkage point links one or more pairs of data records. Each data record in the pair of data records is contained in one of a given pair of datasets, and each pair of data records is associated with a common entity having matching attributes in the given pair of datasets. Data records associated with the common entities are linked across datasets using the identified linkage points.
    Type: Grant
    Filed: May 7, 2013
    Date of Patent: July 18, 2017
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Oktie Hassanzadeh, Mauricio A. Hernandez, Ching-Tien Ho, Lucian Popa
  • Publication number: 20170161396
    Abstract: Data records are linked across a plurality of datasets. Each dataset contains at least one data record, and each data record is associated with an entity and includes one or more attributes of that entity and a value for each attribute. Values associated with attributes are compared across datasets, and matching attributes having values that satisfy a predetermined similarity threshold are identified. In addition, linkage points between pairs of datasets are identified. Each linkage point links one or more pairs of data records. Each data record in the pair of data records is contained in one of a given pair of datasets, and each pair of data records is associated with a common entity having matching attributes in the given pair of datasets. Data records associated with the common entities are linked across datasets using the identified linkage points.
    Type: Application
    Filed: February 23, 2017
    Publication date: June 8, 2017
    Inventors: Oktie HASSANZADEH, Mauricio A HERNANDEZ, Ching-Tien HO, Lucian POPA
  • Publication number: 20160246779
    Abstract: A method includes receiving one or more natural language dependency parse trees as input. A hardware processor is used for processing the dependency parse trees by creating a mapping from nodes of the one or more dependency parse trees into actions, roles and contextual predicates. The mapping is used for information extraction. The actions include the verbs along with attributes of the verbs. The roles include arguments for the verbs. The contextual predicates include modifiers for the verbs.
    Type: Application
    Filed: February 23, 2015
    Publication date: August 25, 2016
    Inventors: Ching-Tien Ho, Benny Kimelfeld, Yunyao Li, Shivakumar Vaithyanathan
  • Publication number: 20140337331
    Abstract: Data records are linked across a plurality of datasets. Each dataset contains at least one data record, and each data record is associated with an entity and includes one or more attributes of that entity and a value for each attribute. Values associated with attributes are compared across datasets, and matching attributes having values that satisfy a predetermined similarity threshold are identified. In addition, linkage points between pairs of datasets are identified. Each linkage point links one or more pairs of data records. Each data record in the pair of data records is contained in one of a given pair of datasets, and each pair of data records is associated with a common entity having matching attributes in the given pair of datasets. Data records associated with the common entities are linked across datasets using the identified linkage points.
    Type: Application
    Filed: May 7, 2013
    Publication date: November 13, 2014
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Oktie Hassanzadeh, Mauricio Antonio Hernandez-Sherrington, Ching-Tien Ho, Lucian Popa
  • Patent number: 8862636
    Abstract: A computer-implemented method for use with an extensible markup language (XML) document includes inputting a high-level mapping specification for a schema mapping; and generating a target XML document based on the mapping. The method may perform schema mapping-based XML transformation as a three-phase process comprising tuple extraction, XML-fragment generation, and data merging. The tuple extraction phase may be adapted to handle streamed XML data (as well as stored/indexed XML data). The data merging phase may use a hybrid method that can dynamically switch between main memory-based and disk-based algorithms based on the size of the XML data to be merged.
    Type: Grant
    Filed: June 5, 2008
    Date of Patent: October 14, 2014
    Assignee: International Business Machines Corporation
    Inventors: Wook-Shin Han, Ching-Tien Ho, Haifeng Jiang, Lucian Popa
  • Patent number: 8630989
    Abstract: Described herein are methods, systems, apparatuses and products for automatically discovering patterns in a text corpus. An aspect provides extracting at least one context string related to at least one annotator from the at least one text corpus; analyzing the at least one context string for at least one sequence, the at least one sequence comprised of at least one subsequence; determining at least one sequence signature for each at least one sequence by applying applicable rules to the at least one sequence; and grouping the at least one sequence signature into at least one group.
    Type: Grant
    Filed: May 27, 2011
    Date of Patent: January 14, 2014
    Assignee: International Business Machines Corporation
    Inventors: Sebastian Johannes Blohm, Vivian Yaw-Wen Chu, Ching-Tien Ho, Yunyao Li, Huaiyu Zhu
  • Publication number: 20120303661
    Abstract: Described herein are methods, systems, apparatuses and products for automatically discovering patterns in a text corpus. An aspect provides extracting at least one context string related to at least one annotator from the at least one text corpus; analyzing the at least one context string for at least one sequence, the at least one sequence comprised of at least one subsequence; determining at least one sequence signature for each at least one sequence by applying applicable rules to the at least one sequence; and grouping the at least one sequence signature into at least one group.
    Type: Application
    Filed: May 27, 2011
    Publication date: November 29, 2012
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Sebastian Johannes Blohm, Vivian Yaw-Wen Chu, Ching-Tien Ho, Yunyao Li, Huaiyu Zhu
  • Patent number: 7512642
    Abstract: A method and system for generating a query implementing a schema mapping. A mapping M is provided from a schema S to a schema T, where M relates S to T, and M includes a plurality of constraints. Schemas S and T each include one or more elements, and T includes at least one set type element. Mapping M is expressed in terms of at least one nested tuple-generating dependency. A query Q is generated where Q is capable of applying M to an input instance I to result in an output instance J, where I conforms to S, J conforms to T, and I and J satisfy the plurality of constraints. Instance J is in partitioned normal form (i.e., satisfies minimal union semantics) and includes no duplicate element instances.
    Type: Grant
    Filed: January 6, 2006
    Date of Patent: March 31, 2009
    Assignee: International Business Machines Corporation
    Inventors: Mauricio Antonio Hernandez-Sherrington, Ching-Tien Ho, Lucian Popa
  • Publication number: 20090043806
    Abstract: A method and apparatus are disclosed for querying streaming extensible markup language (XML) data comprising: routing elements to query nodes, the elements derived from the streaming extensible markup language data; filtering out elements not conforming to one or more predetermined path query patterns; adding remaining elements to one or more dynamic element lists; accessing a decision table to select and return a query node related to a cursor element from the dynamic element lists; and processing the cursor element related to the returned query node to produce an extracted tuple output.
    Type: Application
    Filed: June 5, 2008
    Publication date: February 12, 2009
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Wook-Shin Han, Ching-Tien Ho, Haifeng Jiang, Quanzhong Li
  • Publication number: 20090043736
    Abstract: A method and apparatus are disclosed for querying streaming extensible markup language (XML) data comprising: routing elements to query nodes, the elements derived from the streaming extensible markup language data; filtering out elements not conforming to one or more predetermined path query patterns; adding remaining elements to one or more dynamic element lists; accessing a decision table to select and return a query node related to a cursor element from the dynamic element lists; and processing the cursor element related to the returned query node to produce an extracted tuple output.
    Type: Application
    Filed: August 8, 2007
    Publication date: February 12, 2009
    Inventors: Wook-Shin Han, Ching-Tien Ho, Haifeng Jiang, Quanzhong Li
  • Publication number: 20080275907
    Abstract: A computer-implemented method for use with an extensible markup language (XML) document includes inputting a high-level mapping specification for a schema mapping; and generating a target XML document based on the mapping. The method may perform schema mapping-based XML transformation as a three-phase process comprising tuple extraction, XML-fragment generation, and data merging. The tuple extraction phase may be adapted to handle streamed XML data (as well as stored/indexed XML data). The data merging phase may use a hybrid method that can dynamically switch between main memory-based and disk-based algorithms based on the size of the XML data to be merged.
    Type: Application
    Filed: June 5, 2008
    Publication date: November 6, 2008
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: WOOK-SHIN HAN, CHING-TIEN HO, HAIFENG JIANG, LUCIAN POPA
  • Publication number: 20080243765
    Abstract: A method for generating nested mapping specifications and transformation queries based thereon. Basic mappings are generated based on source and target schemas and correspondences between elements of the schemas. A directed acyclic graph (DAG) is constructed whose edges represent ways in which each basic mapping is nestable under any of the other basic mappings. Any transitively implied edges are removed from the DAG. Root mappings of the DAG are identified. Trees of mappings are automatically extracted from the DAG, where each tree of mappings is rooted at a root mapping and expresses a nested mapping specification.
    Type: Application
    Filed: March 29, 2007
    Publication date: October 2, 2008
    Inventors: Ariel Fuxman, Mauricio Antonio Hernandez-Sherrington, Ching-Tien Ho, Renee J. Miller, Paolo Papotti, Lucian Popa
  • Publication number: 20080243772
    Abstract: A method and system for generating nested mapping specifications and transformation queries based thereon. Basic mappings are generated based on source and target schemas and correspondences between elements of the schemas. A directed acyclic graph (DAG) is constructed whose edges represent ways in which each basic mapping is nestable under any of the other basic mappings. Any transitively implied edges are removed from the DAG. Root mappings of the DAG are identified. Trees of mappings are automatically extracted from the DAG, where each tree of mappings is rooted at a root mapping and expresses a nested mapping specification. A transformation query is generated from the nested mapping specification by generating a first query for transforming source data into flat views of the target and a second query for nesting flat view data according to the target format. Generating the first query includes applying default Skolemization to the specification.
    Type: Application
    Filed: May 28, 2008
    Publication date: October 2, 2008
    Inventors: Ariel Fuxman, Mauricio Antonio Hernandez-Sherrington, Ching-Tien Ho, Renee J. Miller, Paolo Papotti, Lucian Popa
  • Publication number: 20080235260
    Abstract: A computer-implemented method for use with an extensible markup language (XML) document includes inputting a high-level mapping specification for a schema mapping; and generating a target XML document based on the mapping. The method may perform schema mapping-based XML transformation as a three-phase process comprising tuple extraction, XML-fragment generation, and data merging. The tuple extraction phase may be adapted to handle streamed XML data (as well as stored/indexed XML data). The data merging phase may use a hybrid method that can dynamically switch between main memory-based and disk-based algorithms based on the size of the XML data to be merged.
    Type: Application
    Filed: March 23, 2007
    Publication date: September 25, 2008
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: WOOK-SHIN HAN, CHING-TIEN HO, HAIFENG JIANG, LUCIAN POPA
  • Publication number: 20070174231
    Abstract: A method and system for generating a query implementing a schema mapping. A mapping M is provided from a schema S to a schema T, where M relates S to T, and M includes a plurality of constraints. Schemas S and T each include one or more elements, and T includes at least one set type element. Mapping M is expressed in terms of at least one nested tuple-generating dependency. A query Q is generated where Q is capable of applying M to an input instance I to result in an output instance J, where I conforms to S, J conforms to T, and I and J satisfy the plurality of constraints. Instance J is in partitioned normal form (i.e., satisfies minimal union semantics) and includes no duplicate element instances.
    Type: Application
    Filed: January 6, 2006
    Publication date: July 26, 2007
    Applicant: International Business Machines Corporation
    Inventors: Mauricio Hernandez-Sherrington, Ching-Tien Ho, Lucian Popa