Patents by Inventor Guogen Zhang

Guogen Zhang has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20090112858
    Abstract: A system and method is provided for query processing comprises: creating an index of a database and ordering a set of index candidates from the index into a list based on a set of heuristic rules. A query defining a query path is then reduced into a list of single path expressions. Each index candidate is matched against the list of single path expressions according to the ordering of the index candidates. The matched candidate nodes are also verified to insure that they satisfy the query path.
    Type: Application
    Filed: October 25, 2007
    Publication date: April 30, 2009
    Applicant: International Business Machines Corporation
    Inventors: Guogen Zhang, Ruiping Li, Mengchu Cai
  • Patent number: 7493603
    Abstract: A method and system for Extensible Markup Language (XML) schema validation, includes: loading an XML document into a runtime validation engine, where the runtime validation engine includes an XML schema validation parser; loading an annotated automaton encoding (AAE) for an XML schema definition into the XML schema validation parser; and validating the XML document against the XML schema definition by the XML schema validation parser utilizing the annotated automaton encoding. Each XML schema definition is compiled once into the AAE format, rather than being compiled each time an XML document is validated, and thus significant time is saved. The code for the runtime validation engine is fixed and does not vary depending on the XML schema definition, rather than varying for each XML schema definition, and thus space overhead is minimized. Flexibility in the validation process is provided without compromising performance.
    Type: Grant
    Filed: April 16, 2003
    Date of Patent: February 17, 2009
    Assignee: International Business Machines Corporation
    Inventors: You-Chin Fuh, Ning Wang, Yun Wang, Guogen Zhang
  • Publication number: 20080313234
    Abstract: An XML schema is compiled into an annotated automaton encoding, which includes a parsing table for structural information and annotation for type information. The representation is extended to include a mapping from schema types to states in a parsing table. To validate a fragment against a schema type, it is necessary simply to determine the state corresponding to the schema type, and start the validation process from that state. When the process returns to the state, fragment validation has reached successful completion. This approach is more efficient than a general tree representation. Only the data representation of the schema information is handled, making it much easier than manipulating validation parser code generated by a parser generator. In addition, only one representation is needed for schema information for both document and fragment validation. This approach also provides a basis for incremental validation after update.
    Type: Application
    Filed: August 25, 2008
    Publication date: December 18, 2008
    Applicant: International Business Machines Corporation
    Inventors: Yao-Ching Stephen CHEN, Fen-Ling Lin, Ning Wang, Guogen Zhang
  • Publication number: 20080313136
    Abstract: Methods, systems, and computer program products for answering a query to be executed on a database comprising a first table and a second table, and in which the second table includes data from the first table. In one implementation, the method includes refreshing the data in the second table with corresponding data in the first table at a first time, receiving a first query on the first table at a second time that is subsequent to the first time, rewriting the first query into a second query on the first table and a third query on the second table, and executing the second query and the third query respectively on the first table and the second table to generate results for the first query.
    Type: Application
    Filed: August 30, 2007
    Publication date: December 18, 2008
    Applicant: International Business Machines Corporation
    Inventor: Guogen ZHANG
  • Publication number: 20080256045
    Abstract: A new sub-document versioning method for record storages of XML documents which uses virtual cutting points to ensure that a search tree is able to support multiple versions of sub-documents and provide efficient mechanisms for XML updating. Record boundaries and virtual cut points divide the two-dimensional space, the horizontal axis representing node identifiers in document order and vertical axis representing version numbers, into rectangles. The bottom corner of the rectangle is used to represent the rectangles and the corresponding information of the corner is added to the search tree index.
    Type: Application
    Filed: May 23, 2008
    Publication date: October 16, 2008
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: James Zu-Chia TENG, Guogen Zhang, Qinghua Zou
  • Patent number: 7437374
    Abstract: An XML schema is compiled into an annotated automaton encoding, which includes a parsing table for structural information and annotation for type information. The representation is extended to include a mapping from schema types to states in a parsing table. To validate a fragment against a schema type, it is necessary simply to determine the state corresponding to the schema type, and start the validation process from that state. When the process returns to the state, fragment validation has reached successful completion. This approach is more efficient than a general tree representation. Only the data representation of the schema information is handled, making it much easier than manipulating validation parser code generated by a parser generator. In addition, only one representation is needed for schema information for both document and fragment validation. This approach also provides a basis for incremental validation after update.
    Type: Grant
    Filed: February 10, 2004
    Date of Patent: October 14, 2008
    Assignee: International Business Machines Corporation
    Inventors: Yao-Ching Stephen Chen, Fen-Ling Lin, Ning Wang, Guogen Zhang
  • Publication number: 20080222176
    Abstract: A method and system for evaluating a path query are disclosed. The path query corresponds to a query tree including a plurality of query nodes. At least one query node corresponds to at least one predicate and is at a level. The predicate(s) are evaluated for previous query node(s). The method and system include scanning data nodes of a document and determining if the data nodes match the query nodes. The method and system also include placing data related to the data node in match stacks corresponding to matched query nodes. The data for the query node(s) include attribute(s) corresponding to the predicate(s). The method and system further include propagating a matching of the at least one query node backward to a matching of the at least one previous query node.
    Type: Application
    Filed: May 19, 2008
    Publication date: September 11, 2008
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Mengchu CAI, Jason Alexander Cu, Fen-Ling Lin, Guogen Zhang, Qinghua Zou
  • Patent number: 7346609
    Abstract: A method generates hierarchical path index keys for single and multiple indexes with one scan of a document. Each data node of the document is scanned and matches to query nodes are identified. A data node matches a query node if the three conditions hold: if it is not the root step, there is a match for the query node in the previous step of the query; the data node matches the query node of the current step; and the edges of the data and query nodes match. A sub-tree of a data node can be skipped if the data node is not matched and its level is less than the fixed levels of the query. The matched data node is then placed in the match stacks corresponding to the match query nodes. The method uses transitivity properties among matching units to reduce the number of states that need to be tracked and to improve the evaluation of path expressions significantly.
    Type: Grant
    Filed: November 16, 2004
    Date of Patent: March 18, 2008
    Assignee: International Business Machines Corporation
    Inventors: Yao-Ching Stephen Chen, Irene Ching-Hua Liu, Demai Ni, Guogen Zhang, Qinghua Zou
  • Publication number: 20070198479
    Abstract: A method and system for evaluating a path query are disclosed. The path query corresponds to a query tree including a plurality of query nodes. At least one query node corresponds to at least one predicate and is at a level. The predicate(s) are evaluated for previous query node(s). The method and system include scanning data nodes of a document and determining if the data nodes match the query nodes. The method and system also include placing data related to the data node in match stacks corresponding to matched query nodes. The data for the query node(s) include attribute(s) corresponding to the predicate(s). The method and system further include propagating a matching of the at least one query node backward to a matching of the at least one previous query node.
    Type: Application
    Filed: February 16, 2006
    Publication date: August 23, 2007
    Applicant: International Business Machines Corporation
    Inventors: Mengchu Cai, Jason Cu, Fen-Ling Lin, Guogen Zhang, Qinghua Zou
  • Publication number: 20070192285
    Abstract: Disclosed is method for processing an aggregate function. Rows that contain a reference to intermediate result structures are grouped to form groups. For each group, aggregate element structures are formed from the intermediate result structures and, if the aggregate function specifies ordering, the aggregate element structures are sorted based on a sort key.
    Type: Application
    Filed: April 20, 2007
    Publication date: August 16, 2007
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Yun WANG, Fen-Ling LIN, Yao Ching CHEN, Mengchu CAI, Jason CU, Allan LEBOVITZ, Li-Mey LEE, Guogen ZHANG
  • Patent number: 7246115
    Abstract: A method and system for efficiently identifying materialized view candidates for queries filters materialized views using certain criteria, using the materialized view signatures. This filtering rejects some of the unqualified materialized views prior to the performance of the query rewrite matching algorithm, resulting in a group of materialized view candidates. The query rewrite matching algorithm is then performed on the materialized view candidates. By first filtering the materialized views based on their signatures, the number of materialized views on which the query rewrite matching algorithm is performed is significantly reduced, improving performance.
    Type: Grant
    Filed: December 19, 2002
    Date of Patent: July 17, 2007
    Assignee: International Business Machines Corporation
    Inventors: Guogen Zhang, Irene Ching-Hua Liu, Ding-Wei Chieh, Lee-Chin Hsu Liu, Yao-Ching Stephen Chen, Hsuiying Yen Cheng, Brian Thinh-Vinh Tran, Yun Wang, Ruiping Li
  • Patent number: 7243098
    Abstract: Disclosed is a method, system, and program for processing an aggregate function. Rows that contain a reference to intermediate result structures are grouped to form groups. For each group, aggregate element structures are formed from the intermediate result structures and, if the aggregate function specifies ordering, the aggregate element structures are sorted based on a sort key.
    Type: Grant
    Filed: December 19, 2002
    Date of Patent: July 10, 2007
    Assignee: International Business Machines Corporation
    Inventors: Yun Wang, Fen-Ling Lin, Yao Ching Stephen Chen, Mengchu Cai, Jason Alexander Cu, Allan Benjamin Lebovitz, Li-Mey Lee, Guogen Zhang
  • Patent number: 7188098
    Abstract: A method, apparatus, and article of manufacture for optimizing a query in a computer system, wherein the query is performed by the computer system to retrieve data from a database stored on the computer system. The optimization includes: (a) combining join predicates from a query with local predicates from each branch of one or more UNION ALL views referenced by the query; (b) analyzing the combined predicates; and (c) not generating the join when the analysis step indicates that the combined predicates lead to an empty result.
    Type: Grant
    Filed: September 24, 2003
    Date of Patent: March 6, 2007
    Assignee: International Business Machines Corporation
    Inventors: Yao-Ching S. Chen, Ding-Wei L. Chieh, Huong T. Tran, Yumi K. Tsuji, Guogen Zhang
  • Publication number: 20070050436
    Abstract: A method for conversion between a decimal floating-point number and an order-preserving format has been disclosed. The method encodes numbers in the decimal floating-point format into a format which preserves value ordering. This encoding allows for fast and direct string comparison of two values. Such an encoding provides normalized representations for decimal floating-point numbers and supports type-insensitive comparisons. Type-insensitive comparisons are often used in database management systems, where the data type is not specified for values to compare. In addition, the original decimal floating-point format can be recovered from the order-preserving format.
    Type: Application
    Filed: August 26, 2005
    Publication date: March 1, 2007
    Applicant: International Business Machines Corporation
    Inventors: Yao-Ching Chen, Michael Cowlishaw, Christopher Crone, Fung Lee, Ronald Smith, Guogen Zhang, Qinghua Zou
  • Publication number: 20070043686
    Abstract: A new sub-document versioning method for record storages of XML documents which uses virtual cutting points to ensure that a search tree is able to support multiple versions of sub-documents and provide efficient mechanisms for XML updating. Record boundaries and virtual cut points divide the two-dimensional space, the horizontal axis representing node identifiers in document order and vertical axis representing version numbers, into rectangles. The bottom corner of the rectangle is used to represent the rectangles and the corresponding information of the corner is added to the search tree index.
    Type: Application
    Filed: August 22, 2005
    Publication date: February 22, 2007
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: JAMES TENG, GUOGEN ZHANG, QINGHUA ZOU
  • Publication number: 20070043751
    Abstract: A method and system for providing a scalable storage scheme for native hierarchically structured data of relational tables, includes a base table with indicator columns with information pertaining to hierarchically structured data of a document, data tables for storing the hierarchically structured data corresponding to the indicator columns, and node identifier indexes corresponding to the data tables for mapping between the indicator columns and the hierarchically structured data in the data tables. In an embodiment, actual data for each hierarchically structured data (such as XML) column is stored in a separate data table, and each data table has a separate node identifier index. The node identifier index is searched with a key containing the document identifier and a logical node identifier is used, and a record identifier of a record in the data table containing the node assigned the logical node identifier is retrieved.
    Type: Application
    Filed: August 22, 2005
    Publication date: February 22, 2007
    Applicant: International Business Machines Corporation
    Inventors: Yao-Ching Chen, Fen-Ling Lin, Jerry Mukai, Kalpana Shyam, James Teng, Guogen Zhang
  • Publication number: 20070043743
    Abstract: A storage of nodes of hierarchically structured data uses logical node identifiers to reference the nodes stored within and across record data structures. A node identifier index is used to map each logical node identifier to a record identifier for the record that contains the node. When a sub-tree is stored in a separate record, a proxy node is used to represent the sub-tree in the parent record. The mapping in the node identifier index reflects the storage of the sub-tree nodes in the separate record. Since the references between the records are through logical node identifiers, there is no limitation to the moving of records across pages, as long as the indices are updated or rebuilt to maintain synchronization with the resulting data pages. This approach is highly scalable and has a much smaller storage consumption than approaches that use explicit references between nodes.
    Type: Application
    Filed: August 22, 2005
    Publication date: February 22, 2007
    Inventors: Yao-Ching Chen, Yue Huang, Fen-Ling Lin, Brian Tran, Guogen Zhang
  • Patent number: 7124137
    Abstract: Disclosed is a method, system, and program for processing a function. A set of nested functions are received. A composite function is generated for the set of nested functions. A tagging template is generated for the set of nested functions that corresponds to the composite function. A result is produced by evaluating the composite function using the tagging template.
    Type: Grant
    Filed: December 19, 2002
    Date of Patent: October 17, 2006
    Assignee: International Business Machines Corporation
    Inventors: Fen-Ling Lin, Yao Ching Stephen Chen, Yun Wang, Guogen Zhang, Mengchu Cai, Jason Alexander Cu, Li-Mey Lee
  • Patent number: 7120864
    Abstract: In one embodiment, at least a portion of an object model having at least one namespace is serialized. An ancestor namespace is searched for based on a current namespace declaration. The ancestor namespace is associated with an ancestor prefix and an ancestor uniform resource indicator (URI). The current namespace is associated with a current prefix and current URI. The search is performed to find an ancestor prefix matches the current prefix. When the current namespace is an implicit no default namespace and the ancestor namespace is an explicit default namespace based on, at least in part, the ancestor prefix, a serialized namespace declaration is generated for the current namespace.
    Type: Grant
    Filed: January 27, 2004
    Date of Patent: October 10, 2006
    Assignee: International Business Machines Corporation
    Inventors: Mengchu Cai, Jason Alexander Cu, Yue Huang, Guogen Zhang
  • Patent number: 7089225
    Abstract: A heuristic approach is used to order materialized view (MW) candidates in a list based on descending order of their reduction power. A query (e.g., SQL query) is then matched with the MVs in the list order, wherein searching is stopped when matching has been found. The query is matched with materialized views in the ordered list by identifying a materialized view candidate as follows: identifying an MV that is not locked by a REFRESH process; identifying a matching MV that does not require a regroup; identifying a matching MV that does not require a rejoin; identifying a matching MV that does not require a residual join; or identifying an MV with largest reduction power from the list of candidates.
    Type: Grant
    Filed: November 25, 2003
    Date of Patent: August 8, 2006
    Assignee: International Business Machines Corporation
    Inventors: Ruiping Li, Irene Ching-Hua Liu, Lee-Chin Hsu Liu, Guogen Zhang