Patents by Inventor Guogen Zhang

Guogen Zhang has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Efficient heuristic approach in selection of materialized views when there are multiple matchings to an SQL query

Patent number: 7089225

Abstract: A heuristic approach is used to order materialized view (MW) candidates in a list based on descending order of their reduction power. A query (e.g., SQL query) is then matched with the MVs in the list order, wherein searching is stopped when matching has been found. The query is matched with materialized views in the ordered list by identifying a materialized view candidate as follows: identifying an MV that is not locked by a REFRESH process; identifying a matching MV that does not require a regroup; identifying a matching MV that does not require a rejoin; identifying a matching MV that does not require a residual join; or identifying an MV with largest reduction power from the list of candidates.

Type: Grant

Filed: November 25, 2003

Date of Patent: August 8, 2006

Assignee: International Business Machines Corporation

Inventors: Ruiping Li, Irene Ching-Hua Liu, Lee-Chin Hsu Liu, Guogen Zhang
Streaming XPath algorithm for XPath value index key generation

Publication number: 20060106758

Abstract: A method generates hierarchical path index keys for single and multiple indexes with one scan of a document. Each data node of the document is scanned and matches to query nodes are identified. A data node matches a query node if the three conditions hold: if it is not the root step, there is a match for the query node in the previous step of the query; the data node matches the query node of the current step; and the edges of the data and query nodes match. A sub-tree of a data node can be skipped if the data node is not matched and its level is less than the fixed levels of the query. The matched data node is then placed in the match stacks corresponding to the match query nodes. The method uses transitivity properties among matching units to reduce the number of states that need to be tracked and to improve the evaluation of path expressions significantly.

Type: Application

Filed: November 16, 2004

Publication date: May 18, 2006

Inventors: Yao-Ching Chen, Irene Liu, Demai Ni, Guogen Zhang, Qinghua Zou
An Efficient Locking Protocol for Sub-Document Concurrency Control Using Prefix Encoded Node Identifiers in XML Databases

Publication number: 20060004758

Abstract: A system and method for concurrency control of hierarchically structured data is provided. Lock requests on a target node are processed by exploiting ancestor-descendant information encoded into prefix encoded node identifiers (IDs). A set of implicit locks on ancestor nodes along a path from an immediate parent of a target node to a root node is derived from an explicit lock request on a target node. A logical lock tree describing existing lock modes for ancestor nodes is consulted to determine compatibility with the derived set of implicit locks. If existing lock modes for ancestor nodes are compatible with the derived set of implicit locks, a lock request on a target node is granted. Otherwise, the lock request is denied. A lock release request follows the reverse process; a target node in a particular transaction is released, as are subsequent locks on its ancestors made by the same transaction.

Type: Application

Filed: May 4, 2004

Publication date: January 5, 2006

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: James Teng, Brian Vickery, Guogen Zhang
Self-Adaptive Prefix Encoding for Stable Node Identifiers

Publication number: 20060004858

Abstract: A variable-length binary string is utilized to encode node identifiers in a tree for an XML document object model. A general prefix encoding scheme is followed; a node identifier is generated by the concatenation of encodings at each level of a tree along a path from a root node to another particular node. Arbitrary insertions are supported without change to existing node identifier encodings. In addition, the method provides for document order when unsigned binary string comparison is used to compare encoded node identifiers. In support of sub-document concurrency control, prefix encoding provides a way to derive ancestor-descendant relationships among nodes in a tree. Lastly, the encoding method provides a natural pre-order clustering sequence, also known as depth-first clustering. If a prefix is applied to an encoding with a level number, starting with zero at the root, width-first clustering will result. A mixed clustering can also be supported.

Type: Application

Filed: May 4, 2004

Publication date: January 5, 2006

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Brian Tran, Guogen Zhang
Efficient type annontation of XML schema-validated XML documents without schema validation

Publication number: 20050177578

Abstract: Type annotation record information storage for annotated automaton encoding for high-performance XML schema validation is optimized in a space efficient aspect. Subsequent to type annotation record information organization, type annotation records are used for type annotation of validated XML documents, either by implementing annotation records and type annotation part of an algorithm only, or by skipping one or more validation steps in a full validation implementation. Given a schema context, a type annotation may be performed for a validated XML fragment as opposed to an entire document. In addition, default features such as attribute and type are supported.

Type: Application

Filed: February 10, 2004

Publication date: August 11, 2005

Inventors: Yao-Ching Chen, Ning Wang, Guogen Zhang
Efficient XML schema validation of XML fragments using annotated automaton encoding

Publication number: 20050177543

Abstract: A method and system for Extensible Markup Language (XML) schema validation, includes: loading an XML document into a runtime validation engine, where the runtime validation engine includes an XML schema validation parser; loading an annotated automaton encoding (AAE) for an XML schema definition into the XML schema validation parser; and validating the XML document against the XML schema definition by the XML schema validation parser utilizing the annotated automaton encoding. Each XML schema definition is compiled once into the AAE format, rather than being compiled each time an XML document is validated, and thus significant time is saved. The code for the runtime validation engine is fixed and does not vary depending on the XML schema definition, rather than varying for each XML schema definition, and thus space overhead is minimized. Flexibility in the validation process is provided without compromising performance.

Type: Application

Filed: February 10, 2004

Publication date: August 11, 2005

Inventors: Yao-Ching Chen, Fen-Ling Lin, Ning Wang, Guogen Zhang
Eliminating superfluous namespace declarations and undeclaring default namespaces in XML serialization processing

Publication number: 20050166140

Abstract: In one embodiment, at least a portion of an object model having at least one namespace is serialized. An ancestor namespace is searched for based on a current namespace declaration. The ancestor namespace is associated with an ancestor prefix and an ancestor uniform resource indicator (URI). The current namespace is associated with a current prefix and current URI. The search is performed to find an ancestor prefix matches the current prefix. When the current namespace is an implicit no default namespace and the ancestor namespace is an explicit default namespace based on, at least in part, the ancestor prefix, a serialized namespace declaration is generated for the current namespace.

Type: Application

Filed: January 27, 2004

Publication date: July 28, 2005

Inventors: Mengchu Cai, Jason Cu, Yue Huang, Guogen Zhang
Efficient heuristic approach in selection of materialized views when there are multiple matchings to an SQL query

Publication number: 20050114307

Abstract: A heuristic approach is used to order materialized view (MW) candidates in a list based on descending order of their reduction power. A query (e.g., SQL query) is then matched with the MVs in the list order, wherein searching is stopped as soon as good enough matching has been found. The query is matched with materialized views in the ordered list by identifying a materialized view candidate as follows: identifying an MV that is not locked by a REFRESH process; identifying a matching MV that does not require a regroup; identifying a matching MV that does not require a rejoin; identifying a matching MV that does not require a residual join; or identifying an MV with largest reduction power from the list of candidates.

Type: Application

Filed: November 25, 2003

Publication date: May 26, 2005

Inventors: Ruiping Li, Irene Liu, Lee-Chin Liu, Guogen Zhang
Query transformation for union all view join queries using join predicates for pruning and distribution

Publication number: 20050065926

Abstract: A method, apparatus, and article of manufacture for optimizing a query in a computer system, wherein the query is performed by the computer system to retrieve data from a database stored on the computer system. The optimization includes: (a) combining join predicates from a query with local predicates from each branch of one or more UNION ALL views referenced by the query; (b) analyzing the combined predicates; and (c) not generating the join when the analysis step indicates that the combined predicates lead to an empty result.

Type: Application

Filed: September 24, 2003

Publication date: March 24, 2005

Inventors: Yao-Ching Chen, Ding-Wei Chieh, Huong Tran, Yumi Tsuji, Guogen Zhang
Dynamic selection of optimal grouping sequence at runtime for grouping sets, rollup and cube operations in SQL query processing

Publication number: 20050027690

Abstract: A method, apparatus, and article of manufacture for optimizing a query in a computer system. During compilation of the query, a GROUP BY clause with one or more GROUPING SETS, ROLLUP or CUBE operations is maintained in its original form until after query rewrite. The GROUP BY clause with the GROUPING SETS, ROLLUP or CUBE operations is then translated into a plurality of levels having one or more grouping sets. After compilation of the query, a grouping sets sequence is dynamically determined for the GROUP BY clause with the GROUPING SETS, ROLLUP or CUBE operations based on intermediate grouping sets, in order to optimize the grouping sets sequence. The execution of the grouping sets sequence is optimized by selecting a smallest grouping set from a previous one of the levels as an input to a grouping set on a next one of the levels. Finally, a UNION ALL operation is performed on the grouping sets.

Type: Application

Filed: July 29, 2003

Publication date: February 3, 2005

Inventors: Guogen Zhang, Fen-Ling Lin, Jung-Hsin Hu, Yao-Ching Chen, Yun Wang, Glenn Yuki
Method, system, and program for optimizing the processing of queries involving set operators

Patent number: 6792420

Abstract: Provided is a method, system, and program for processing a query including a query operation on a table derived from a set operation on two result tables. The query operation is performed on each result table separately to produce two intermediate result tables. The set operator is then applied to the two intermediate result tables to produce a final result table that is a same result table that would have been produced by performing the query operation on the table derived from the set operation performed on the two result tables.

Type: Grant

Filed: June 29, 2001

Date of Patent: September 14, 2004

Assignee: International Business Machines Corporation

Inventors: Yao Ching Stephen Chen, Yumi Kimura Tsuji, Yun Wang, Guogen Zhang
Materialized view signature and efficient identification of materialized view candidates for queries

Publication number: 20040122804

Abstract: A method and system for efficiently identifying materialized view candidates for queries filters materialized views using certain criteria, using the materialized view signatures. This filtering rejects some of the unqualified materialized views prior to the performance of the query rewrite matching algorithm, resulting in a group of materialized view candidates. The query rewrite matching algorithm is then performed on the materialized view candidates. By first filtering the materialized views based on their signatures, the number of materialized views on which the query rewrite matching algorithm is performed is significantly reduced, improving performance.

Type: Application

Filed: December 19, 2002

Publication date: June 24, 2004

Inventors: Guogen Zhang, Irene Ching-Hua Liu, Ding-Wei Chieh, Lee-Chin Hsu Liu, Yao-Ching Stephen Chen, Hsuiying Yen Cheng, Brian Thinh-Vinh Tran, Yun Wang, Ruiping Li
Matching groupings, re-aggregation avoidance and comprehensive aggregate function derivation rules in query rewrites using materialized views

Publication number: 20040122814

Abstract: A method, apparatus, and article of manufacture for optimizing a query in a computer system, wherein the query is performed by the computer system to retrieve data from a database stored on the computer system. The optimization includes: identifying a materialized view candidate in the computer system, matching a grouping of the materialized view with a grouping of the query using column equivalence and functional dependency, in order to determine whether re-aggregation is necessary, deriving one or more aggregate functions requested by the query from the materialized view and any remaining tables in the query based on the matched groupings, and rewriting the query based on the matched groupings.

Type: Application

Filed: December 18, 2002

Publication date: June 24, 2004

Applicant: International Business Machines Corporation

Inventors: Guogen Zhang, Ruiping Li, Lee-Chin Hsu Liu, Yun Wang
Method, system, and program for optimizing aggregate processing

Publication number: 20040122815

Abstract: Disclosed is a method, system, and program for processing an aggregate function. Rows that contain a reference to intermediate result structures are grouped to form groups. For each group, aggregate element structures are formed from the intermediate result structures and, if the aggregate function specifies ordering, the aggregate element structures are sorted based on a sort key.

Type: Application

Filed: December 19, 2002

Publication date: June 24, 2004

Applicant: International Business Machines Corporation

Inventors: Yun Wang, Fen-Ling Lin, Yao Ching Stephen Chen, Mengchu Cai, Jason Alexander Cu, Allan Benjamin Lebovitz, Li-Mey Lee, Guogen Zhang
Method, system, and program for optimizing processing of nested functions

Publication number: 20040122795

Abstract: Disclosed is a method, system, and program for processing a function. A set of nested functions are received. A composite function is generated for the set of nested functions. A tagging template is generated for the set of nested functions that corresponds to the composite function. A result is produced by evaluating the composite function using the tagging template.

Type: Application

Filed: December 19, 2002

Publication date: June 24, 2004

Inventors: Fen-Ling Lin, Yao Ching Stephen Chen, Yun Wang, Guogen Zhang, Mengchu Cai, Jason Alexander Cu, Li-Mey Lee
Annotated automaton encoding of XML schema for high performance schema validation

Publication number: 20040073870

Abstract: A method and system for Extensible Markup Language (XML) schema validation, includes: loading an XML document into a runtime validation engine, where the runtime validation engine includes an XML schema validation parser; loading an annotated automaton encoding (AAE) for an XML schema definition into the XML schema validation parser; and validating the XML document against the XML schema definition by the XML schema validation parser utilizing the annotated automaton encoding. Each XML schema definition is compiled once into the AAE format, rather than being compiled each time an XML document is validated, and thus significant time is saved. The code for the runtime validation engine is fixed and does not vary depending on the XML schema definition, rather than varying for each XML schema definition, and thus space overhead is minimized. Flexibility in the validation process is provided without compromising performance.

Type: Application

Filed: April 16, 2003

Publication date: April 15, 2004

Inventors: You-Chin Fuh, Ning Wang, Yun Wang, Guogen Zhang
Method, system, and program for optimizing the processing of queries involving set operators

Publication number: 20030055814

Abstract: Provided is a method, system, and program for processing a query including a query operation on a table derived from a set operation on two result tables. The query operation is performed on each result table separately to produce two intermediate result tables. The set operator is then applied to the two intermediate result tables to produce a final result table that is a same result table that would have been produced by performing the query operation on the table derived from the set operation performed on the two result tables.

Type: Application

Filed: June 29, 2001

Publication date: March 20, 2003

Applicant: International Business Machines Corporation

Inventors: Yao Ching Stephen Chen, Yumi Kimura Tsuji, Yun Wang, Guogen Zhang
Method, system, and program for a join operation on a multi-column table and satellite tables including duplicate values

Patent number: 6374235

Abstract: Disclosed is a method, system, and program for performing a join operation on a multi-column table and at least two satellite tables having a join condition. Each satellite table is comprised of multiple rows and at least one join column. The multi-column table is comprised of multiple rows and at least one column corresponding to the join column in each satellite table. A join operation is performed on the rows of the satellite tables to generate concatenated rows of the satellite tables. One of the concatenated rows is joined to the multi-column table and a returned entry from the multi-column table is received. A determination is then made as to whether the returned entry matches the search criteria. If so, a determination is made as to whether one of the satellite tables has duplicates of values in the join column of the returned matching entry or the multi-column table has duplicate entries in the join columns.

Type: Grant

Filed: June 25, 1999

Date of Patent: April 16, 2002

Assignee: International Business Machines Corporation

Inventors: Stephen Yao Ching Chen, Patrick Michael Malone, Jerry Mukai, Jeffrey Ralph Ruggles, Desai Paramesh Sampatrai, Yun Wang, Guogen Zhang

prev 1 2 3 4 5