Patents by Inventor Lipyeow Lim

Lipyeow Lim has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20110106836
    Abstract: A method of semantic link discovery through translation of basic declarative language includes receiving a set of linkage specifications, receiving a set of data sources related to the linkage specifications, the set of data sources and the set of linkage requirements forming a basic declarative language query, translating the basic declarative language query into a standard language query, executing the standard language query, and returning results of the standard language query in response to the executing.
    Type: Application
    Filed: October 30, 2009
    Publication date: May 5, 2011
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Oktie Hassanzadeh, Anastasios Kementsietsidis, Lipyeow Lim, Min Wang
  • Publication number: 20110106821
    Abstract: A method of semantic-aware record matching includes receiving source and target string record specifications associated with a source string record and a target string record, receiving semantic knowledge referring to tokens of the source string record and target string record, creating a first set of tokens for the source string record and a second set of tokens for the target string record based on the semantic knowledge, assigning a similarity score to the source string record and the target string record based on a semantic relationship between the first set of tokens and the second set of tokens, and matching the source string record and the target string record based on the similarity score.
    Type: Application
    Filed: October 30, 2009
    Publication date: May 5, 2011
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Oktie Hassanzadeh, Anastasios Kementsietsidis, Lipyeow Lim, Min Wang
  • Publication number: 20110078187
    Abstract: A computer-implemented method, system, and computer program product for producing a semantic query by example are provided. The method includes receiving examples of potential results from querying a database table with an associated ontology, and extracting features from the database table and the examples based on the associated ontology. The method further includes training a classifier based on the examples and the extracted features, and applying the classifier to the database table to obtain a semantic query result. The method also includes outputting the semantic query result to a user interface, and requesting user feedback of satisfaction with the semantic query result. The method additionally includes updating the classifier and the semantic query result iteratively in response to the user feedback.
    Type: Application
    Filed: September 25, 2009
    Publication date: March 31, 2011
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Lipyeow Lim, Haixun Wang, Min Wang
  • Patent number: 7895171
    Abstract: A method, information processing system, and computer readable storage product estimate a compression factor. A set of key values within an index are analyzed. Each key value is associated with a record identifier (“RID”) list comprising a set of RIDs. The index is in an uncompressed format and includes a total byte length. A number of RIDs associated with each key value is estimated for each key value in the set of key values. A total byte length for all RID deltas between each at least two consecutive RIDs within a RID list is estimated for each RID list based on the number of RIDs that have been determined. The total byte length estimated for each RID list is accumulated. A compression factor associated with the index is determined by dividing the total byte length that has been accumulated by the byte length of the index.
    Type: Grant
    Filed: March 27, 2008
    Date of Patent: February 22, 2011
    Assignee: International Business Machines Corporation
    Inventors: Bishwaranjan Bhattacharjee, Lipyeow Lim, George A. Mihaila
  • Patent number: 7895152
    Abstract: A metric network provides a descriptive model that explicitly expresses the relationships among all metrics of a business enterprise. Performance of each single business entity in the operational level is measured by a set of primitive metrics, each of which measures a specific aspect of the business entity. The primitive metrics construct the base on which the whole metric network is built.
    Type: Grant
    Filed: November 19, 2008
    Date of Patent: February 22, 2011
    Assignee: International Business Machines Corporation
    Inventors: Pu Huang, Hui Lei, Lipyeow Lim
  • Publication number: 20100312779
    Abstract: A method, information processing system, and computer program storage product retrieve data from a database. A search request is received from a user for a set of data in at least one database. An ontology query over is performed over at least one ontology associated with at least one database resulting in an ontological dataset associated with the search request in response to receiving the search request from the user. The ontological dataset includes at least one of a set of synonyms, a set of hypernyms, and a set of hyponyms, associated with the search request. A data query is performed over data in the at least one database using the ontological dataset in response to performing the ontology query. The set of data is returned to the user based on the data query that has been performed.
    Type: Application
    Filed: June 9, 2009
    Publication date: December 9, 2010
    Applicant: International Business Machines Corporation
    Inventors: Lipyeow LIM, Anastasios KEMENTSIETSIDIS, Min WANG
  • Patent number: 7752192
    Abstract: The present invention provides a computer implemented method, an apparatus, and a computer usable program product for indexing data. A controller identifies a set of data to be indexed, wherein a set of data structure trees represents the set of data. The controller merges the set of data structure trees to form a unified tree, wherein the unified tree contains a node for each unit of data in the set of data. The controller assigns an identifier to the node for each unit of data in the set of data that describes the node within the unified tree. The controller then serializes the unified tree to form a set of sequential series that represents the set of data structure trees, wherein the set of sequential series forms an index for the set of data.
    Type: Grant
    Filed: March 2, 2007
    Date of Patent: July 6, 2010
    Assignee: International Business Machines Corporation
    Inventors: Xiaohui Gu, Lipyeow Lim, Haixun Wang, Min Wang
  • Publication number: 20100161930
    Abstract: A method, system, and computer readable medium for collecting statistics associated with data in a database are disclosed. The method comprises determining an amount of memory needed to collect statistics for data associated with a defined data type in a relational database. The defined data type is based upon a mark-up language using a tree structure with one or more root-to-node paths therein. The amount of memory is allocated as determined for collecting the statistics for the data of the defined data type. A statistics collection is performed for the data of the defined data type in a single pass through the database and within the amount of memory which has been allocated. The performing includes at least determining a total number of instances of at least one path-identifier associated with a given value within a given set of documents.
    Type: Application
    Filed: December 22, 2008
    Publication date: June 24, 2010
    Applicant: International Business Machines Corp.
    Inventors: LIPYEOW LIM, George Andrei Mihaila, Min Wang
  • Publication number: 20100145986
    Abstract: A method, apparatus, and computer program for querying data and an associated ontology in a database. An ontology is associated with data in database. Responsive to receiving a query from a requestor, relational data in the database is identified using the query to form identified relational data. Ontological knowledge in the ontology is identified using the identified relational data and the ontology. A result is returned to the requestor.
    Type: Application
    Filed: February 24, 2010
    Publication date: June 10, 2010
    Applicant: International Business Machines Corporation
    Inventors: Lipyeow Lim, Haixun Wang, Min Wang
  • Patent number: 7730098
    Abstract: A method for supporting semantic matching queries in a database management system (DBMS) by extracting and storing the transitive/subsumption relationships from a given ontology data in a DBMS with native XML support. These transitive relationships are transformed into a set of XML documents that are natural mappings of the hierarchical structure of the transitive relationships. A table function construct expresses semantic matching queries in a declarative manner. The semantic matching queried are automatically rewritten or translated into standard SQL/XML search operators such as XQuery, XPath and XMLExists, and executed by the SQL/XML DBMS on the given instance data and the extracted transitive relationships data.
    Type: Grant
    Filed: March 2, 2007
    Date of Patent: June 1, 2010
    Assignee: International Business Machines Corporation
    Inventors: Lipyeow Lim, Haixun Wang, Min Wang
  • Patent number: 7693812
    Abstract: A method, apparatus, and computer program for querying data and an associated ontology in a database. An ontology is associated with data in database. Responsive to receiving a query from a requestor, relational data in the database is identified using the query to form identified relational data. Ontological knowledge in the ontology is identified using the identified relational data and the ontology. A result is returned to the requestor.
    Type: Grant
    Filed: January 17, 2007
    Date of Patent: April 6, 2010
    Assignee: International Business Machines Corporation
    Inventors: Lipyeow Lim, Haixun Wang, Min Wang
  • Publication number: 20100082545
    Abstract: A method, information processing system, and computer program storage product for compressing sorted values is disclosed. At least a first prefix and a second prefix in a plurality of prefixes are compared. Each prefix comprises at least a portion of a plurality of sorted values. A respective prefix comprises a set of consecutive characters including at least a first character of a respective sorted value. The respective sorted value further comprising a respective suffix comprising consecutive characters of the respective sorted value that are after the respective prefix. At least a respective first character of the first prefix and a respective first character of the second prefix are determined to be substantially identical. The first prefix is merged with the second prefix into a single prefix comprising the first character. A set of suffixes associated with the first prefix is updated to reflect an association with the second prefix.
    Type: Application
    Filed: September 30, 2008
    Publication date: April 1, 2010
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: BISHWARANJAN BHATTACHARJEE, Sherman Lau, Lipyeow Lim, Timothy R. Malkemus, Cathy Mcarthur, George A. Mihaila, Reza Sherkat, Zoltan G. Toth
  • Patent number: 7630993
    Abstract: A method, information processing system, and computer readable medium for generating a plurality of candidate database schemas including relational and mark-up language elements. An information model comprising a plurality of entities and at least one relationship defined there between is received. The information model has been annotated with at least one semantic characteristic, operational characteristic, and evolutional characteristic. The information model that has been annotated is analyzed. A score is associated with each entity based at least in part on attributes associated with each entity. Each entity is classified as one of a relational element and a mark-up language element. The information model that has been annotated is partitioned into a plurality of relational element mappings and a plurality of mark-up language element mappings. A plurality of database schemas associated with the information model that has been annotated is generated.
    Type: Grant
    Filed: May 29, 2007
    Date of Patent: December 8, 2009
    Assignee: International Business Machines Corporation
    Inventors: Sharon C. Adler, Yuan-Chi Chang, Lipyeow Lim, Mirella M. Moro
  • Patent number: 7613682
    Abstract: Disclosed are a method for collecting statistics associated with data in a database. The method comprises determining an amount of memory needed to collect statistics for data associated with a defined data type in a relational database. The defined data type is based upon a mark-up language using a tree structure with one or more root-to-node paths therein. The amount of memory as determined is allocated for collecting the statistics for the data of the defined data type. A statistics collection is performed for the data of the defined data type in a single pass through the database and within the amount of memory which has been allocated.
    Type: Grant
    Filed: May 16, 2006
    Date of Patent: November 3, 2009
    Assignee: International Business Machines Corporation
    Inventors: Lipyeow Lim, George Andrei Mihaila, Min Wang
  • Publication number: 20090248724
    Abstract: Techniques are disclosed for encoding a variable length structure such that it facilitates forward and reverse scans of a list of such structures as needed. While the techniques are applicable to a wide variety of applications, they are particularly well-suited for use with structures such as those found in compressed database indexes. For example, a computer-implemented method for processing one or more variable length data structures includes the following steps. Each variable length data structure is obtained. Each variable length structure comprises one or more data block. A variable length encoding process is applied to the one or more blocks of each variable length data structure which comprises setting a continuation data value in each block to a first value or a second value, wherein the setting of the continuation data values enables bi-directional scanning of each variable length structure.
    Type: Application
    Filed: March 27, 2008
    Publication date: October 1, 2009
    Inventors: Bishwaranjan Bhattacharjee, Lipyeow Lim, Timothy Ray Malkemus, George Andrei Mihaila
  • Publication number: 20090248723
    Abstract: Techniques are disclosed for selecting a delete-safe compression method for a plurality of delta encoded data values (e.g., delta encoded integers or deltas). For example, a computer-implemented method for selecting an optimal delete-safe compression algorithm from among two or more compression algorithms for use on a plurality of delta encoded data values includes the following steps. The maximum number of data values eliminated by each of the two or more compression algorithms is computed. For the plurality of delta encoded data values to be compressed, the minimum size of the plurality of delta encoded data values before compression thereof is computed. A delete-safe threshold value is computed based on the minimum size of the plurality of delta encoded data values. Then, the compression algorithm is selected from the two or more compression algorithms that achieves the delete-safe threshold value.
    Type: Application
    Filed: March 27, 2008
    Publication date: October 1, 2009
    Inventors: Bishwaranjan Bhattacharjee, Lipyeow Lim, Timothy Ray Malkemus, George Andrei Mihaila
  • Publication number: 20090248725
    Abstract: A method, information processing system, and computer readable storage product estimate a compression factor. A set of key values within an index are analyzed. Each key value is associated with a record identifier (“RID”) list comprising a set of RIDs. The index is in an uncompressed format and includes a total byte length. A number of RIDs associated with each key value is estimated for each key value in the set of key values. A total byte length for all RID deltas between each at least two consecutive RIDs within a RID list is estimated for each RID list based on the number of RIDs that have been determined. The total byte length estimated for each RID list is accumulated. A compression factor associated with the index is determined by dividing the total byte length that has been accumulated by the byte length of the index.
    Type: Application
    Filed: March 27, 2008
    Publication date: October 1, 2009
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: BISHWARANJAN BHATTACHARJEE, Lipyeow Lim, George A. Mihaila
  • Publication number: 20090138549
    Abstract: A metric network provides a descriptive model that explicitly expresses the relationships among all metrics of a business enterprise. Performance of each single business entity in the operational level is measured by a set of primitive metrics, each of which measures a specific aspect of the business entity. The primitive metrics construct the base on which the whole metric network is built.
    Type: Application
    Filed: November 19, 2008
    Publication date: May 28, 2009
    Applicant: International Business Machines Corporation
    Inventors: Pu Huang, Hui Lei, Lipyeow Lim
  • Patent number: 7533085
    Abstract: A method for searching deep web services is provided. The method in one aspect allows organizing communities, sources and schema attributes in a multi-tier containment relationship; searching representative schema attributes in one or more communities; searching representative services in one or more communities; searching for related schema attributes; and searching for related communities.
    Type: Grant
    Filed: August 14, 2006
    Date of Patent: May 12, 2009
    Assignee: International Business Machines Corporation
    Inventors: Yuan-chi Chang, Lipyeow Lim, Min Wang, Zhen Zhang
  • Patent number: 7529740
    Abstract: A method for organizing deep Web services is provided. In one aspect, the method obtains a collection of sources and their associated attributes and/or input modes, for instance, using a crawling algorithm. The method uses this information to organize the sources into communities. A mining algorithm such as the hyperclique mining algorithm is used to obtain cliques of highly correlated attributes. A clustering algorithm such as the hierarchical agglomerative clustering algorithm is used to further cluster the cliques of attributes into larger cliques, which in the present disclosure is referred to as signatures. The sources that are associated with each signature form a community and a graph representation of the communities is constructed, where the vertices are communities and the edges are the shared attributes.
    Type: Grant
    Filed: August 14, 2006
    Date of Patent: May 5, 2009
    Assignee: International Business Machines Corporation
    Inventors: Yuan-chi Chang, Lipyeow Lim, Min Wang, Zhen Zhang