Patents by Inventor Lipyeow Lim

Lipyeow Lim has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Semantic Link Discovery

Publication number: 20110106836

Abstract: A method of semantic link discovery through translation of basic declarative language includes receiving a set of linkage specifications, receiving a set of data sources related to the linkage specifications, the set of data sources and the set of linkage requirements forming a basic declarative language query, translating the basic declarative language query into a standard language query, executing the standard language query, and returning results of the standard language query in response to the executing.

Type: Application

Filed: October 30, 2009

Publication date: May 5, 2011

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Oktie Hassanzadeh, Anastasios Kementsietsidis, Lipyeow Lim, Min Wang
Semantic-Aware Record Matching

Publication number: 20110106821

Abstract: A method of semantic-aware record matching includes receiving source and target string record specifications associated with a source string record and a target string record, receiving semantic knowledge referring to tokens of the source string record and target string record, creating a first set of tokens for the source string record and a second set of tokens for the target string record based on the semantic knowledge, assigning a similarity score to the source string record and the target string record based on a semantic relationship between the first set of tokens and the second set of tokens, and matching the source string record and the target string record based on the similarity score.

Type: Application

Filed: October 30, 2009

Publication date: May 5, 2011

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Oktie Hassanzadeh, Anastasios Kementsietsidis, Lipyeow Lim, Min Wang
SEMANTIC QUERY BY EXAMPLE

Publication number: 20110078187

Abstract: A computer-implemented method, system, and computer program product for producing a semantic query by example are provided. The method includes receiving examples of potential results from querying a database table with an associated ontology, and extracting features from the database table and the examples based on the associated ontology. The method further includes training a classifier based on the examples and the extracted features, and applying the classifier to the database table to obtain a semantic query result. The method also includes outputting the semantic query result to a user interface, and requesting user feedback of satisfaction with the semantic query result. The method additionally includes updating the classifier and the semantic query result iteratively in response to the user feedback.

Type: Application

Filed: September 25, 2009

Publication date: March 31, 2011

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Lipyeow Lim, Haixun Wang, Min Wang
Compressibility estimation of non-unique indexes in a database management system

Patent number: 7895171

Abstract: A method, information processing system, and computer readable storage product estimate a compression factor. A set of key values within an index are analyzed. Each key value is associated with a record identifier (“RID”) list comprising a set of RIDs. The index is in an uncompressed format and includes a total byte length. A number of RIDs associated with each key value is estimated for each key value in the set of key values. A total byte length for all RID deltas between each at least two consecutive RIDs within a RID list is estimated for each RID list based on the number of RIDs that have been determined. The total byte length estimated for each RID list is accumulated. A compression factor associated with the index is determined by dividing the total byte length that has been accumulated by the byte length of the index.

Type: Grant

Filed: March 27, 2008

Date of Patent: February 22, 2011

Assignee: International Business Machines Corporation

Inventors: Bishwaranjan Bhattacharjee, Lipyeow Lim, George A. Mihaila
Method, apparatus and system for business performance monitoring and analysis using metric network

Patent number: 7895152

Abstract: A metric network provides a descriptive model that explicitly expresses the relationships among all metrics of a business enterprise. Performance of each single business entity in the operational level is measured by a set of primitive metrics, each of which measures a specific aspect of the business entity. The primitive metrics construct the base on which the whole metric network is built.

Type: Grant

Filed: November 19, 2008

Date of Patent: February 22, 2011

Assignee: International Business Machines Corporation

Inventors: Pu Huang, Hui Lei, Lipyeow Lim
ONTOLOGY-BASED SEARCHING IN DATABASE SYSTEMS

Publication number: 20100312779

Abstract: A method, information processing system, and computer program storage product retrieve data from a database. A search request is received from a user for a set of data in at least one database. An ontology query over is performed over at least one ontology associated with at least one database resulting in an ontological dataset associated with the search request in response to receiving the search request from the user. The ontological dataset includes at least one of a set of synonyms, a set of hypernyms, and a set of hyponyms, associated with the search request. A data query is performed over data in the at least one database using the ontological dataset in response to performing the ontology query. The set of data is returned to the user based on the data query that has been performed.

Type: Application

Filed: June 9, 2009

Publication date: December 9, 2010

Applicant: International Business Machines Corporation

Inventors: Lipyeow LIM, Anastasios KEMENTSIETSIDIS, Min WANG
Method and system for indexing and serializing data

Patent number: 7752192

Abstract: The present invention provides a computer implemented method, an apparatus, and a computer usable program product for indexing data. A controller identifies a set of data to be indexed, wherein a set of data structure trees represents the set of data. The controller merges the set of data structure trees to form a unified tree, wherein the unified tree contains a node for each unit of data in the set of data. The controller assigns an identifier to the node for each unit of data in the set of data that describes the node within the unified tree. The controller then serializes the unified tree to form a set of sequential series that represents the set of data structure trees, wherein the set of sequential series forms an index for the set of data.

Type: Grant

Filed: March 2, 2007

Date of Patent: July 6, 2010

Assignee: International Business Machines Corporation

Inventors: Xiaohui Gu, Lipyeow Lim, Haixun Wang, Min Wang
STATISTICS COLLECTION USING PATH-VALUE PAIRS FOR RELATIONAL DATABASES

Publication number: 20100161930

Abstract: A method, system, and computer readable medium for collecting statistics associated with data in a database are disclosed. The method comprises determining an amount of memory needed to collect statistics for data associated with a defined data type in a relational database. The defined data type is based upon a mark-up language using a tree structure with one or more root-to-node paths therein. The amount of memory is allocated as determined for collecting the statistics for the data of the defined data type. A statistics collection is performed for the data of the defined data type in a single pass through the database and within the amount of memory which has been allocated. The performing includes at least determining a total number of instances of at least one path-identifier associated with a given value within a given set of documents.

Type: Application

Filed: December 22, 2008

Publication date: June 24, 2010

Applicant: International Business Machines Corp.

Inventors: LIPYEOW LIM, George Andrei Mihaila, Min Wang
Querying Data and an Associated Ontology in a Database Management System

Publication number: 20100145986

Abstract: A method, apparatus, and computer program for querying data and an associated ontology in a database. An ontology is associated with data in database. Responsive to receiving a query from a requestor, relational data in the database is identified using the query to form identified relational data. Ontological knowledge in the ontology is identified using the identified relational data and the ontology. A result is returned to the requestor.

Type: Application

Filed: February 24, 2010

Publication date: June 10, 2010

Applicant: International Business Machines Corporation

Inventors: Lipyeow Lim, Haixun Wang, Min Wang
Method for supporting ontology-related semantic queries in DBMSs with XML support

Patent number: 7730098

Abstract: A method for supporting semantic matching queries in a database management system (DBMS) by extracting and storing the transitive/subsumption relationships from a given ontology data in a DBMS with native XML support. These transitive relationships are transformed into a set of XML documents that are natural mappings of the hierarchical structure of the transitive relationships. A table function construct expresses semantic matching queries in a declarative manner. The semantic matching queried are automatically rewritten or translated into standard SQL/XML search operators such as XQuery, XPath and XMLExists, and executed by the SQL/XML DBMS on the given instance data and the extracted transitive relationships data.

Type: Grant

Filed: March 2, 2007

Date of Patent: June 1, 2010

Assignee: International Business Machines Corporation

Inventors: Lipyeow Lim, Haixun Wang, Min Wang
Querying data and an associated ontology in a database management system

Patent number: 7693812

Abstract: A method, apparatus, and computer program for querying data and an associated ontology in a database. An ontology is associated with data in database. Responsive to receiving a query from a requestor, relational data in the database is identified using the query to form identified relational data. Ontological knowledge in the ontology is identified using the identified relational data and the ontology. A result is returned to the requestor.

Type: Grant

Filed: January 17, 2007

Date of Patent: April 6, 2010

Assignee: International Business Machines Corporation

Inventors: Lipyeow Lim, Haixun Wang, Min Wang
COMPRESSION OF SORTED VALUE INDEXES USING COMMON PREFIXES

Publication number: 20100082545

Abstract: A method, information processing system, and computer program storage product for compressing sorted values is disclosed. At least a first prefix and a second prefix in a plurality of prefixes are compared. Each prefix comprises at least a portion of a plurality of sorted values. A respective prefix comprises a set of consecutive characters including at least a first character of a respective sorted value. The respective sorted value further comprising a respective suffix comprising consecutive characters of the respective sorted value that are after the respective prefix. At least a respective first character of the first prefix and a respective first character of the second prefix are determined to be substantially identical. The first prefix is merged with the second prefix into a single prefix comprising the first character. A set of suffixes associated with the first prefix is updated to reflect an association with the second prefix.

Type: Application

Filed: September 30, 2008

Publication date: April 1, 2010

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: BISHWARANJAN BHATTACHARJEE, Sherman Lau, Lipyeow Lim, Timothy R. Malkemus, Cathy Mcarthur, George A. Mihaila, Reza Sherkat, Zoltan G. Toth
Generating database schemas for relational and markup language data from a conceptual model

Patent number: 7630993

Abstract: A method, information processing system, and computer readable medium for generating a plurality of candidate database schemas including relational and mark-up language elements. An information model comprising a plurality of entities and at least one relationship defined there between is received. The information model has been annotated with at least one semantic characteristic, operational characteristic, and evolutional characteristic. The information model that has been annotated is analyzed. A score is associated with each entity based at least in part on attributes associated with each entity. Each entity is classified as one of a relational element and a mark-up language element. The information model that has been annotated is partitioned into a plurality of relational element mappings and a plurality of mark-up language element mappings. A plurality of database schemas associated with the information model that has been annotated is generated.

Type: Grant

Filed: May 29, 2007

Date of Patent: December 8, 2009

Assignee: International Business Machines Corporation

Inventors: Sharon C. Adler, Yuan-Chi Chang, Lipyeow Lim, Mirella M. Moro
Statistics collection using path-identifiers for relational databases

Patent number: 7613682

Abstract: Disclosed are a method for collecting statistics associated with data in a database. The method comprises determining an amount of memory needed to collect statistics for data associated with a defined data type in a relational database. The defined data type is based upon a mark-up language using a tree structure with one or more root-to-node paths therein. The amount of memory as determined is allocated for collecting the statistics for the data of the defined data type. A statistics collection is performed for the data of the defined data type in a single pass through the database and within the amount of memory which has been allocated.

Type: Grant

Filed: May 16, 2006

Date of Patent: November 3, 2009

Assignee: International Business Machines Corporation

Inventors: Lipyeow Lim, George Andrei Mihaila, Min Wang
Method and Apparatus for Encoding List of Variable Length Structures to Support Bi-Directional Scans

Publication number: 20090248724

Abstract: Techniques are disclosed for encoding a variable length structure such that it facilitates forward and reverse scans of a list of such structures as needed. While the techniques are applicable to a wide variety of applications, they are particularly well-suited for use with structures such as those found in compressed database indexes. For example, a computer-implemented method for processing one or more variable length data structures includes the following steps. Each variable length data structure is obtained. Each variable length structure comprises one or more data block. A variable length encoding process is applied to the one or more blocks of each variable length data structure which comprises setting a continuation data value in each block to a first value or a second value, wherein the setting of the continuation data values enables bi-directional scanning of each variable length structure.

Type: Application

Filed: March 27, 2008

Publication date: October 1, 2009

Inventors: Bishwaranjan Bhattacharjee, Lipyeow Lim, Timothy Ray Malkemus, George Andrei Mihaila
Method and Apparatus for Selecting an Optimal Delete-Safe Compression Method on List of Delta Encoded Integers

Publication number: 20090248723

Abstract: Techniques are disclosed for selecting a delete-safe compression method for a plurality of delta encoded data values (e.g., delta encoded integers or deltas). For example, a computer-implemented method for selecting an optimal delete-safe compression algorithm from among two or more compression algorithms for use on a plurality of delta encoded data values includes the following steps. The maximum number of data values eliminated by each of the two or more compression algorithms is computed. For the plurality of delta encoded data values to be compressed, the minimum size of the plurality of delta encoded data values before compression thereof is computed. A delete-safe threshold value is computed based on the minimum size of the plurality of delta encoded data values. Then, the compression algorithm is selected from the two or more compression algorithms that achieves the delete-safe threshold value.

Type: Application

Filed: March 27, 2008

Publication date: October 1, 2009

Inventors: Bishwaranjan Bhattacharjee, Lipyeow Lim, Timothy Ray Malkemus, George Andrei Mihaila
COMPRESSABILITY ESTIMATION OF NON-UNIQUE INDEXES IN A DATABASE MANAGEMENT SYSTEM

Publication number: 20090248725

Abstract: A method, information processing system, and computer readable storage product estimate a compression factor. A set of key values within an index are analyzed. Each key value is associated with a record identifier (“RID”) list comprising a set of RIDs. The index is in an uncompressed format and includes a total byte length. A number of RIDs associated with each key value is estimated for each key value in the set of key values. A total byte length for all RID deltas between each at least two consecutive RIDs within a RID list is estimated for each RID list based on the number of RIDs that have been determined. The total byte length estimated for each RID list is accumulated. A compression factor associated with the index is determined by dividing the total byte length that has been accumulated by the byte length of the index.

Type: Application

Filed: March 27, 2008

Publication date: October 1, 2009

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: BISHWARANJAN BHATTACHARJEE, Lipyeow Lim, George A. Mihaila
Method, Apparatus and System for Business Performance Monitoring and Analysis Using Metric Network

Publication number: 20090138549

Abstract: A metric network provides a descriptive model that explicitly expresses the relationships among all metrics of a business enterprise. Performance of each single business entity in the operational level is measured by a set of primitive metrics, each of which measures a specific aspect of the business entity. The primitive metrics construct the base on which the whole metric network is built.

Type: Application

Filed: November 19, 2008

Publication date: May 28, 2009

Applicant: International Business Machines Corporation

Inventors: Pu Huang, Hui Lei, Lipyeow Lim
Method for searching deep web services

Patent number: 7533085

Abstract: A method for searching deep web services is provided. The method in one aspect allows organizing communities, sources and schema attributes in a multi-tier containment relationship; searching representative schema attributes in one or more communities; searching representative services in one or more communities; searching for related schema attributes; and searching for related communities.

Type: Grant

Filed: August 14, 2006

Date of Patent: May 12, 2009

Assignee: International Business Machines Corporation

Inventors: Yuan-chi Chang, Lipyeow Lim, Min Wang, Zhen Zhang
Method and apparatus for organizing data sources

Patent number: 7529740

Abstract: A method for organizing deep Web services is provided. In one aspect, the method obtains a collection of sources and their associated attributes and/or input modes, for instance, using a crawling algorithm. The method uses this information to organize the sources into communities. A mining algorithm such as the hyperclique mining algorithm is used to obtain cliques of highly correlated attributes. A clustering algorithm such as the hierarchical agglomerative clustering algorithm is used to further cluster the cliques of attributes into larger cliques, which in the present disclosure is referred to as signatures. The sources that are associated with each signature form a community and a graph representation of the communities is constructed, where the vertices are communities and the edges are the shared attributes.

Type: Grant

Filed: August 14, 2006

Date of Patent: May 5, 2009

Assignee: International Business Machines Corporation

Inventors: Yuan-chi Chang, Lipyeow Lim, Min Wang, Zhen Zhang

prev 1 2 3 4 next