Patents by Inventor Guogen Zhang

Guogen Zhang has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 7089225
    Abstract: A heuristic approach is used to order materialized view (MW) candidates in a list based on descending order of their reduction power. A query (e.g., SQL query) is then matched with the MVs in the list order, wherein searching is stopped when matching has been found. The query is matched with materialized views in the ordered list by identifying a materialized view candidate as follows: identifying an MV that is not locked by a REFRESH process; identifying a matching MV that does not require a regroup; identifying a matching MV that does not require a rejoin; identifying a matching MV that does not require a residual join; or identifying an MV with largest reduction power from the list of candidates.
    Type: Grant
    Filed: November 25, 2003
    Date of Patent: August 8, 2006
    Assignee: International Business Machines Corporation
    Inventors: Ruiping Li, Irene Ching-Hua Liu, Lee-Chin Hsu Liu, Guogen Zhang
  • Publication number: 20060106758
    Abstract: A method generates hierarchical path index keys for single and multiple indexes with one scan of a document. Each data node of the document is scanned and matches to query nodes are identified. A data node matches a query node if the three conditions hold: if it is not the root step, there is a match for the query node in the previous step of the query; the data node matches the query node of the current step; and the edges of the data and query nodes match. A sub-tree of a data node can be skipped if the data node is not matched and its level is less than the fixed levels of the query. The matched data node is then placed in the match stacks corresponding to the match query nodes. The method uses transitivity properties among matching units to reduce the number of states that need to be tracked and to improve the evaluation of path expressions significantly.
    Type: Application
    Filed: November 16, 2004
    Publication date: May 18, 2006
    Inventors: Yao-Ching Chen, Irene Liu, Demai Ni, Guogen Zhang, Qinghua Zou
  • Publication number: 20060004758
    Abstract: A system and method for concurrency control of hierarchically structured data is provided. Lock requests on a target node are processed by exploiting ancestor-descendant information encoded into prefix encoded node identifiers (IDs). A set of implicit locks on ancestor nodes along a path from an immediate parent of a target node to a root node is derived from an explicit lock request on a target node. A logical lock tree describing existing lock modes for ancestor nodes is consulted to determine compatibility with the derived set of implicit locks. If existing lock modes for ancestor nodes are compatible with the derived set of implicit locks, a lock request on a target node is granted. Otherwise, the lock request is denied. A lock release request follows the reverse process; a target node in a particular transaction is released, as are subsequent locks on its ancestors made by the same transaction.
    Type: Application
    Filed: May 4, 2004
    Publication date: January 5, 2006
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: James Teng, Brian Vickery, Guogen Zhang
  • Publication number: 20060004858
    Abstract: A variable-length binary string is utilized to encode node identifiers in a tree for an XML document object model. A general prefix encoding scheme is followed; a node identifier is generated by the concatenation of encodings at each level of a tree along a path from a root node to another particular node. Arbitrary insertions are supported without change to existing node identifier encodings. In addition, the method provides for document order when unsigned binary string comparison is used to compare encoded node identifiers. In support of sub-document concurrency control, prefix encoding provides a way to derive ancestor-descendant relationships among nodes in a tree. Lastly, the encoding method provides a natural pre-order clustering sequence, also known as depth-first clustering. If a prefix is applied to an encoding with a level number, starting with zero at the root, width-first clustering will result. A mixed clustering can also be supported.
    Type: Application
    Filed: May 4, 2004
    Publication date: January 5, 2006
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Brian Tran, Guogen Zhang
  • Publication number: 20050177578
    Abstract: Type annotation record information storage for annotated automaton encoding for high-performance XML schema validation is optimized in a space efficient aspect. Subsequent to type annotation record information organization, type annotation records are used for type annotation of validated XML documents, either by implementing annotation records and type annotation part of an algorithm only, or by skipping one or more validation steps in a full validation implementation. Given a schema context, a type annotation may be performed for a validated XML fragment as opposed to an entire document. In addition, default features such as attribute and type are supported.
    Type: Application
    Filed: February 10, 2004
    Publication date: August 11, 2005
    Inventors: Yao-Ching Chen, Ning Wang, Guogen Zhang
  • Publication number: 20050177543
    Abstract: A method and system for Extensible Markup Language (XML) schema validation, includes: loading an XML document into a runtime validation engine, where the runtime validation engine includes an XML schema validation parser; loading an annotated automaton encoding (AAE) for an XML schema definition into the XML schema validation parser; and validating the XML document against the XML schema definition by the XML schema validation parser utilizing the annotated automaton encoding. Each XML schema definition is compiled once into the AAE format, rather than being compiled each time an XML document is validated, and thus significant time is saved. The code for the runtime validation engine is fixed and does not vary depending on the XML schema definition, rather than varying for each XML schema definition, and thus space overhead is minimized. Flexibility in the validation process is provided without compromising performance.
    Type: Application
    Filed: February 10, 2004
    Publication date: August 11, 2005
    Inventors: Yao-Ching Chen, Fen-Ling Lin, Ning Wang, Guogen Zhang
  • Publication number: 20050166140
    Abstract: In one embodiment, at least a portion of an object model having at least one namespace is serialized. An ancestor namespace is searched for based on a current namespace declaration. The ancestor namespace is associated with an ancestor prefix and an ancestor uniform resource indicator (URI). The current namespace is associated with a current prefix and current URI. The search is performed to find an ancestor prefix matches the current prefix. When the current namespace is an implicit no default namespace and the ancestor namespace is an explicit default namespace based on, at least in part, the ancestor prefix, a serialized namespace declaration is generated for the current namespace.
    Type: Application
    Filed: January 27, 2004
    Publication date: July 28, 2005
    Inventors: Mengchu Cai, Jason Cu, Yue Huang, Guogen Zhang
  • Publication number: 20050114307
    Abstract: A heuristic approach is used to order materialized view (MW) candidates in a list based on descending order of their reduction power. A query (e.g., SQL query) is then matched with the MVs in the list order, wherein searching is stopped as soon as good enough matching has been found. The query is matched with materialized views in the ordered list by identifying a materialized view candidate as follows: identifying an MV that is not locked by a REFRESH process; identifying a matching MV that does not require a regroup; identifying a matching MV that does not require a rejoin; identifying a matching MV that does not require a residual join; or identifying an MV with largest reduction power from the list of candidates.
    Type: Application
    Filed: November 25, 2003
    Publication date: May 26, 2005
    Inventors: Ruiping Li, Irene Liu, Lee-Chin Liu, Guogen Zhang
  • Publication number: 20050065926
    Abstract: A method, apparatus, and article of manufacture for optimizing a query in a computer system, wherein the query is performed by the computer system to retrieve data from a database stored on the computer system. The optimization includes: (a) combining join predicates from a query with local predicates from each branch of one or more UNION ALL views referenced by the query; (b) analyzing the combined predicates; and (c) not generating the join when the analysis step indicates that the combined predicates lead to an empty result.
    Type: Application
    Filed: September 24, 2003
    Publication date: March 24, 2005
    Inventors: Yao-Ching Chen, Ding-Wei Chieh, Huong Tran, Yumi Tsuji, Guogen Zhang
  • Publication number: 20050027690
    Abstract: A method, apparatus, and article of manufacture for optimizing a query in a computer system. During compilation of the query, a GROUP BY clause with one or more GROUPING SETS, ROLLUP or CUBE operations is maintained in its original form until after query rewrite. The GROUP BY clause with the GROUPING SETS, ROLLUP or CUBE operations is then translated into a plurality of levels having one or more grouping sets. After compilation of the query, a grouping sets sequence is dynamically determined for the GROUP BY clause with the GROUPING SETS, ROLLUP or CUBE operations based on intermediate grouping sets, in order to optimize the grouping sets sequence. The execution of the grouping sets sequence is optimized by selecting a smallest grouping set from a previous one of the levels as an input to a grouping set on a next one of the levels. Finally, a UNION ALL operation is performed on the grouping sets.
    Type: Application
    Filed: July 29, 2003
    Publication date: February 3, 2005
    Inventors: Guogen Zhang, Fen-Ling Lin, Jung-Hsin Hu, Yao-Ching Chen, Yun Wang, Glenn Yuki
  • Patent number: 6792420
    Abstract: Provided is a method, system, and program for processing a query including a query operation on a table derived from a set operation on two result tables. The query operation is performed on each result table separately to produce two intermediate result tables. The set operator is then applied to the two intermediate result tables to produce a final result table that is a same result table that would have been produced by performing the query operation on the table derived from the set operation performed on the two result tables.
    Type: Grant
    Filed: June 29, 2001
    Date of Patent: September 14, 2004
    Assignee: International Business Machines Corporation
    Inventors: Yao Ching Stephen Chen, Yumi Kimura Tsuji, Yun Wang, Guogen Zhang
  • Publication number: 20040122804
    Abstract: A method and system for efficiently identifying materialized view candidates for queries filters materialized views using certain criteria, using the materialized view signatures. This filtering rejects some of the unqualified materialized views prior to the performance of the query rewrite matching algorithm, resulting in a group of materialized view candidates. The query rewrite matching algorithm is then performed on the materialized view candidates. By first filtering the materialized views based on their signatures, the number of materialized views on which the query rewrite matching algorithm is performed is significantly reduced, improving performance.
    Type: Application
    Filed: December 19, 2002
    Publication date: June 24, 2004
    Inventors: Guogen Zhang, Irene Ching-Hua Liu, Ding-Wei Chieh, Lee-Chin Hsu Liu, Yao-Ching Stephen Chen, Hsuiying Yen Cheng, Brian Thinh-Vinh Tran, Yun Wang, Ruiping Li
  • Publication number: 20040122814
    Abstract: A method, apparatus, and article of manufacture for optimizing a query in a computer system, wherein the query is performed by the computer system to retrieve data from a database stored on the computer system. The optimization includes: identifying a materialized view candidate in the computer system, matching a grouping of the materialized view with a grouping of the query using column equivalence and functional dependency, in order to determine whether re-aggregation is necessary, deriving one or more aggregate functions requested by the query from the materialized view and any remaining tables in the query based on the matched groupings, and rewriting the query based on the matched groupings.
    Type: Application
    Filed: December 18, 2002
    Publication date: June 24, 2004
    Applicant: International Business Machines Corporation
    Inventors: Guogen Zhang, Ruiping Li, Lee-Chin Hsu Liu, Yun Wang
  • Publication number: 20040122815
    Abstract: Disclosed is a method, system, and program for processing an aggregate function. Rows that contain a reference to intermediate result structures are grouped to form groups. For each group, aggregate element structures are formed from the intermediate result structures and, if the aggregate function specifies ordering, the aggregate element structures are sorted based on a sort key.
    Type: Application
    Filed: December 19, 2002
    Publication date: June 24, 2004
    Applicant: International Business Machines Corporation
    Inventors: Yun Wang, Fen-Ling Lin, Yao Ching Stephen Chen, Mengchu Cai, Jason Alexander Cu, Allan Benjamin Lebovitz, Li-Mey Lee, Guogen Zhang
  • Publication number: 20040122795
    Abstract: Disclosed is a method, system, and program for processing a function. A set of nested functions are received. A composite function is generated for the set of nested functions. A tagging template is generated for the set of nested functions that corresponds to the composite function. A result is produced by evaluating the composite function using the tagging template.
    Type: Application
    Filed: December 19, 2002
    Publication date: June 24, 2004
    Inventors: Fen-Ling Lin, Yao Ching Stephen Chen, Yun Wang, Guogen Zhang, Mengchu Cai, Jason Alexander Cu, Li-Mey Lee
  • Publication number: 20040073870
    Abstract: A method and system for Extensible Markup Language (XML) schema validation, includes: loading an XML document into a runtime validation engine, where the runtime validation engine includes an XML schema validation parser; loading an annotated automaton encoding (AAE) for an XML schema definition into the XML schema validation parser; and validating the XML document against the XML schema definition by the XML schema validation parser utilizing the annotated automaton encoding. Each XML schema definition is compiled once into the AAE format, rather than being compiled each time an XML document is validated, and thus significant time is saved. The code for the runtime validation engine is fixed and does not vary depending on the XML schema definition, rather than varying for each XML schema definition, and thus space overhead is minimized. Flexibility in the validation process is provided without compromising performance.
    Type: Application
    Filed: April 16, 2003
    Publication date: April 15, 2004
    Inventors: You-Chin Fuh, Ning Wang, Yun Wang, Guogen Zhang
  • Publication number: 20030055814
    Abstract: Provided is a method, system, and program for processing a query including a query operation on a table derived from a set operation on two result tables. The query operation is performed on each result table separately to produce two intermediate result tables. The set operator is then applied to the two intermediate result tables to produce a final result table that is a same result table that would have been produced by performing the query operation on the table derived from the set operation performed on the two result tables.
    Type: Application
    Filed: June 29, 2001
    Publication date: March 20, 2003
    Applicant: International Business Machines Corporation
    Inventors: Yao Ching Stephen Chen, Yumi Kimura Tsuji, Yun Wang, Guogen Zhang
  • Patent number: 6374235
    Abstract: Disclosed is a method, system, and program for performing a join operation on a multi-column table and at least two satellite tables having a join condition. Each satellite table is comprised of multiple rows and at least one join column. The multi-column table is comprised of multiple rows and at least one column corresponding to the join column in each satellite table. A join operation is performed on the rows of the satellite tables to generate concatenated rows of the satellite tables. One of the concatenated rows is joined to the multi-column table and a returned entry from the multi-column table is received. A determination is then made as to whether the returned entry matches the search criteria. If so, a determination is made as to whether one of the satellite tables has duplicates of values in the join column of the returned matching entry or the multi-column table has duplicate entries in the join columns.
    Type: Grant
    Filed: June 25, 1999
    Date of Patent: April 16, 2002
    Assignee: International Business Machines Corporation
    Inventors: Stephen Yao Ching Chen, Patrick Michael Malone, Jerry Mukai, Jeffrey Ralph Ruggles, Desai Paramesh Sampatrai, Yun Wang, Guogen Zhang