Patents by Inventor David Everett Simmen

David Everett Simmen has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Collaborative planning for accelerating analytic queries

Patent number: 11023443

Abstract: A system and method for determining optimal query plans within a distributed database system employing table operators for performing analytic operations for storing and processing multi-structured data. The optimization of a query plan proceeds through a collaborative exchange between a database system optimizer, or planner, and a table operator, wherein multiple communications between said optimizer and said table operator are conducted to exchange input and output information relevant to optimizing execution of the query and table operator.

Type: Grant

Filed: February 16, 2016

Date of Patent: June 1, 2021

Assignee: Teradata US, Inc.

Inventors: Derrick Poo-Ray Kondo, Tongxin Bai, Anjali Betawadkar-Norwood, Aditi Subodh Pandit, David Everett Simmen
COLLABORATIVE PLANNING FOR ACCELERATING ANALYTIC QUERIES

Publication number: 20160239544

Abstract: A system and method for determining optimal query plans within a distributed database system employing table operators for performing analytic operations for storing and processing multi-structured data. The optimization of a query plan proceeds through a collaborative exchange between a database system optimizer, or planner, and a table operator, wherein multiple communications between said optimizer and said table operator are conducted to exchange input and output information relevant to optimizing execution of the query and table operator.

Type: Application

Filed: February 16, 2016

Publication date: August 18, 2016

Applicant: Teradata US, Inc.

Inventors: Derrick Poo-Ray Kondo, Tongxin Bai, Anjali Betawadkar-Norwood, Aditi Subodh Pandit, David Everett Simmen
Extensible system and method for information extraction in a data processing system

Patent number: 9418069

Abstract: A data mashup system having information extraction capabilities for receiving multiple streams of textual data, at least one of which contains unstructured textual data. A repository stores annotators that describe how to analyze the streams of textual data for specified unstructured data components. The annotators are applied to the data streams to identify and extract the specified data components according to the annotators. The extracted data components are tagged to generate structured data components and the specified unstructured data components in the input data streams are replaced with the tagged data components. The system then combines the tagged data from the multiple streams to form a mashup output data stream.

Type: Grant

Filed: May 26, 2010

Date of Patent: August 16, 2016

Assignee: International Business Machines Corporation

Inventors: Yunyao Li, Frederick Ralph Reiss, David Everett Simmen, Suresh Thalamati
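
The extraction flow described in the abstract for patent 9418069 (apply stored annotators to each text stream, replace the matched unstructured spans with tagged components, then combine the streams) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the abstract does not say how annotators are represented, so this sketch models each one as a regular expression keyed by a tag name, and the function names `annotate` and `mashup` are hypothetical.

```python
import re


def annotate(text, annotators):
    """Replace every span matched by an annotator with a tagged version of it.

    `annotators` maps a tag name to a regular expression (an assumed,
    simplified stand-in for the repository of annotators in the abstract).
    """
    for tag, pattern in annotators.items():
        # Bind the tag as a default argument so each pattern uses its own tag.
        text = re.sub(pattern, lambda m, t=tag: f"<{t}>{m.group(0)}</{t}>", text)
    return text


def mashup(streams, annotators):
    """Annotate every item of every input stream and merge into one output list."""
    return [annotate(item, annotators) for stream in streams for item in stream]


# Hypothetical annotators and input streams, for illustration only.
ANNOTATORS = {
    "phone": r"\b\d{3}-\d{4}\b",
    "email": r"\b[\w.]+@[\w.]+\.\w+\b",
}
STREAMS = [
    ["call 555-1234 today"],
    ["write to ada@example.org", "no contact details here"],
]
OUTPUT = mashup(STREAMS, ANNOTATORS)
```

Regular expressions keep the sketch short; the patented system is described only at the level of the abstract above, so the real annotator format may differ considerably.
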
Extensible system and method for information extraction in a data processing system

Patent number: 8805834

Abstract: A data mashup system having information extraction capabilities for receiving multiple streams of textual data, at least one of which contains unstructured textual data. A repository stores annotators that describe how to analyze the streams of textual data for specified unstructured data components. The annotators are applied to the data streams to identify and extract the specified data components according to the annotators. The extracted data components are tagged to generate structured data components and the specified unstructured data components in the input data streams are replaced with the tagged data components. The system then combines the tagged data from the multiple streams to form a mashup output data stream.

Type: Grant

Filed: March 7, 2012

Date of Patent: August 12, 2014

Assignee: International Business Machines Corporation

Inventors: Yunyao Li, Frederick Ralph Reiss, David Everett Simmen, Suresh Thalamati
Efficient processing of queries in federated database systems

Patent number: 8538985

Abstract: Methods and apparatus, including computer program products, implementing and using techniques for processing a federated query in a federated database system. A federated query is received at a federated database server. A federated query execution plan is generated based on the received federated query. The federated query execution plan defines one or more source servers of the federated database and a unique subquery to be executed on each of the source servers. The subqueries are distributed to the source servers in accordance with the federated query execution plan. The respective subqueries are executed asynchronously at the source servers. The subquery results are passed to a first designated source server defined in the federated query execution plan. The subquery results are joined and aggregated at the first designated source server into a final query result. The final query result is returned to the federated database server.

Type: Grant

Filed: March 11, 2008

Date of Patent: September 17, 2013

Assignee: International Business Machines Corporation

Inventors: Anjali Betawadkar-Norwood, Hamid Pirahesh, David Everett Simmen
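
The execution flow in the abstract for patent 8538985 (distribute one subquery per source server, run the subqueries asynchronously, then join and aggregate the results in one place) can be sketched roughly as follows. This is an illustrative assumption, not the patented implementation: source servers are modeled as plain Python callables, a thread pool stands in for asynchronous remote execution, and the `combine` callback stands in for the designated source server that joins and aggregates. The names `run_federated` and `join_and_sum` are hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor


def run_federated(plan, combine):
    """Run every subquery in `plan` concurrently, then combine the results.

    `plan` maps a (hypothetical) server name to a zero-argument callable that
    plays the role of the subquery executed on that server. It must be non-empty.
    """
    with ThreadPoolExecutor(max_workers=len(plan)) as pool:
        futures = {name: pool.submit(subquery) for name, subquery in plan.items()}
        results = {name: future.result() for name, future in futures.items()}
    return combine(results)


def join_and_sum(results):
    """Join orders to customers on customer id and total the amounts per name."""
    totals = {}
    for customer_id, amount in results["orders_server"]:
        name = results["customers_server"][customer_id]
        totals[name] = totals.get(name, 0) + amount
    return totals


# Hypothetical plan with two source servers, for illustration only.
PLAN = {
    "orders_server": lambda: [("c1", 10), ("c2", 5), ("c1", 7)],
    "customers_server": lambda: {"c1": "Ada", "c2": "Lin"},
}
FINAL_RESULT = run_federated(PLAN, join_and_sum)
```

In the abstract the join happens at a source server and only the final result returns to the federated server; the sketch collapses that network topology into a single process to show the control flow.
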
System and method for the derivation and application of sub-iteration contexts in a transformation operation in a data integration system

Patent number: 8140596

Abstract: Methods and systems for improving a data transformation operation that converts a source data instance containing repeating elements into a target data instance having a user-specified structure, based solely on a user's specification of a target template. The methods and systems derive and calculate sub-iteration contexts by applying a selected heuristic to the source data instance and the target template, and use these sub-iteration contexts to create a target data instance having a repeating structure that agrees with the user-specified target template. The methods and systems can be customized by the selection of heuristic, and by the specification of explicit sub-iteration contexts that may override the derived contexts.

Type: Grant

Filed: October 15, 2009

Date of Patent: March 20, 2012

Assignee: International Business Machines Corporation

Inventors: Armageddon Rhabdizo Brown, David Everett Simmen
Automatically and adaptively determining execution plans for queries with parameter markers

Patent number: 7958113

Abstract: A method and system for automatically and adaptively determining query execution plans for parametric queries. A first classifier trained by an initial set of training points is generated. A query workload and/or database statistics are dynamically updated. A new set of training points is collected off-line. Using the new set of training points, the first classifier is modified into a second classifier. A database query is received at a runtime subsequent to the off-line phase. The query includes predicates having parameter markers bound to actual values. The predicates are associated with selectivities. A mapping of the selectivities into a plan determines the query execution plan. The determined query execution plan is included in an augmented set of training points, where the augmented set includes the initial set and the new set.

Type: Grant

Filed: May 22, 2008

Date of Patent: June 7, 2011

Assignee: International Business Machines Corporation

Inventors: Wei Fan, Guy Maring Lohman, Volker Gerhard Markl, Nimrod Megiddo, Jun Rao, David Everett Simmen, Julia Stoyanovich
System and Method for the Derivation and Application of Sub-iteration Contexts in a Transformation Operation in a Data Integration System

Publication number: 20110093514

Abstract: Methods and systems for improving a data transformation operation that converts a source data instance containing repeating elements into a target data instance having a user-specified structure, based solely on a user's specification of a target template. The methods and systems derive and calculate sub-iteration contexts by applying a selected heuristic to the source data instance and the target template, and use these sub-iteration contexts to create a target data instance having a repeating structure that agrees with the user-specified target template. The methods and systems can be customized by the selection of heuristic, and by the specification of explicit sub-iteration contexts that may override the derived contexts.

Type: Application

Filed: October 15, 2009

Publication date: April 21, 2011

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Armageddon Rhabdizo Brown, David Everett Simmen
System, method, and computer program product for progressive query processing

Patent number: 7716215

Abstract: A method, system, and computer program product to make query processing more robust in the face of optimization errors. The invention validates the statistics and assumptions used for compiling a query as the query is executed and, when necessary, progressively re-optimizes the query in mid-execution based on the knowledge learned during its partial execution. The invention selectively places a number of CHECK operators in a query execution plan to validate the optimizer's cardinality estimates against actual cardinalities. Errors beyond a threshold trigger re-optimization, and the optimizer decides whether the old plan is still optimal and whether to re-use previously computed results. The invention addresses arbitrary SQL queries whose plans can contain sub-queries, updates, trigger checking, and view maintenance operations.

Type: Grant

Filed: November 14, 2007

Date of Patent: May 11, 2010

Assignee: International Business Machines Corporation

Inventors: Guy Maring Lohman, Marki Volker, Mir Hamid Pirahesh, Vijayshankar Raman, David Everett Simmen
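
The CHECK mechanism in the abstract for patent 7716215 (compare the optimizer's cardinality estimate with the actual cardinality during execution and trigger re-optimization when the error exceeds a threshold) can be illustrated with a small sketch. The names `check`, `execute`, and `ReoptimizeSignal` are hypothetical, the threshold is treated as a simple multiplicative error factor, and the fallback plan is chosen unconditionally, whereas the abstract says the optimizer decides whether the old plan is still optimal.

```python
class ReoptimizeSignal(Exception):
    """Raised by check() when a cardinality estimate is too far off."""

    def __init__(self, estimated, actual, rows):
        super().__init__(f"estimated {estimated} rows, saw {actual}")
        self.estimated = estimated
        self.actual = actual
        self.rows = rows  # intermediate result kept so it can be reused


def check(rows, estimated, threshold):
    """Materialize `rows` and validate the estimate against the actual count."""
    rows = list(rows)
    actual = len(rows)
    # Error in either direction by more than a factor of `threshold` fails.
    if actual > estimated * threshold or actual * threshold < estimated:
        raise ReoptimizeSignal(estimated, actual, rows)
    return rows


def execute(source, estimated, original_plan, fallback_plan, threshold=2.0):
    """Run the original plan unless the CHECK fails, then switch plans.

    The fallback plan receives the already computed rows, mirroring the idea
    of reusing previously computed results after re-optimization.
    """
    try:
        rows = check(source, estimated, threshold)
    except ReoptimizeSignal as signal:
        return "reoptimized", fallback_plan(signal.rows)
    return "original", original_plan(rows)
```

With an estimate of 10 rows and a threshold of 2.0, an input of 10 rows stays on the original plan, while an input of 100 rows switches to the fallback.
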
EFFICIENT PROCESSING OF QUERIES IN FEDERATED DATABASE SYSTEMS

Publication number: 20090234799

Abstract: Methods and apparatus, including computer program products, implementing and using techniques for processing a federated query in a federated database system. A federated query is received at a federated database server. A federated query execution plan is generated based on the received federated query. The federated query execution plan defines one or more source servers of the federated database and a unique subquery to be executed on each of the source servers. The subqueries are distributed to the source servers in accordance with the federated query execution plan. The respective subqueries are executed asynchronously at the source servers. The subquery results are passed to a first designated source server defined in the federated query execution plan. The subquery results are joined and aggregated at the first designated source server into a final query result. The final query result is returned to the federated database server.

Type: Application

Filed: March 11, 2008

Publication date: September 17, 2009

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Anjali Betawadkar-Norwood, Hamid Pirahesh, David Everett Simmen
AUTOMATICALLY AND ADAPTIVELY DETERMINING EXECUTION PLANS FOR QUERIES WITH PARAMETER MARKERS

Publication number: 20080222093

Abstract: A method and system for automatically and adaptively determining query execution plans for parametric queries. A first classifier trained by an initial set of training points is generated. A query workload and/or database statistics are dynamically updated. A new set of training points is collected off-line. Using the new set of training points, the first classifier is modified into a second classifier. A database query is received at a runtime subsequent to the off-line phase. The query includes predicates having parameter markers bound to actual values. The predicates are associated with selectivities. A mapping of the selectivities into a plan determines the query execution plan. The determined query execution plan is included in an augmented set of training points, where the augmented set includes the initial set and the new set.

Type: Application

Filed: May 22, 2008

Publication date: September 11, 2008

Inventors: Wei Fan, Guy Maring Lohman, Volker Gerhard Markl, Nimrod Megiddo, Jun Rao, David Everett Simmen, Julia Stoyanovich
AUTOMATICALLY AND ADAPTIVELY DETERMINING EXECUTION PLANS FOR QUERIES WITH PARAMETER MARKERS

Publication number: 20080195577

Abstract: A method for automatically and adaptively determining query execution plans for parametric queries. A first classifier trained by an initial set of training points is generated using a set of random decision trees (RDTs). A query workload and/or database statistics are dynamically updated. A new set of training points collected off-line is used to modify the first classifier into a second classifier. A database query is received at a runtime subsequent to the off line phase. The query includes predicates having parameter markers bound to actual values. The predicates are associated with selectivities. The query execution plan is determined by identifying an optimal average of posterior probabilities obtained across a set of RDTs and mapping the selectivities to a plan. The determined query execution plan is included in an augmented set of training points that includes the initial set and the new set.

Type: Application

Filed: February 9, 2007

Publication date: August 14, 2008

Inventors: Wei Fan, Guy Maring Lohman, Volker Gerhard Markl, Nimrod Megiddo, Jun Rao, David Everett Simmen, Julia Stoyanovich
SYSTEM, METHOD, AND COMPUTER PROGRAM PRODUCT FOR PROGRESSIVE QUERY PROCESSING

Publication number: 20080177722

Abstract: A method, system, and computer program product to make query processing more robust in the face of optimization errors. The invention validates the statistics and assumptions used for compiling a query as the query is executed and, when necessary, progressively re-optimizes the query in mid-execution based on the knowledge learned during its partial execution. The invention selectively places a number of CHECK operators in a query execution plan to validate the optimizer's cardinality estimates against actual cardinalities. Errors beyond a threshold trigger re-optimization, and the optimizer decides whether the old plan is still optimal and whether to re-use previously computed results. The invention addresses arbitrary SQL queries whose plans can contain sub-queries, updates, trigger checking, and view maintenance operations.

Type: Application

Filed: November 14, 2007

Publication date: July 24, 2008

Applicant: International Business Machines Corp.

Inventors: Guy Maring Lohman, Marki Volker, Mir Hamid Pirahesh, Vijayshankar Raman, David Everett Simmen
System, method, and computer program product for progressive query processing

Patent number: 7383246

Abstract: A method, system, and computer program product to make query processing more robust in the face of optimization errors. The invention validates the statistics and assumptions used for compiling a query as the query is executed and, when necessary, progressively re-optimizes the query in mid-execution based on the knowledge learned during its partial execution. The invention selectively places a number of CHECK operators in a query execution plan to validate the optimizer's cardinality estimates against actual cardinalities. Errors beyond a threshold trigger re-optimization, and the optimizer decides whether the old plan is still optimal and whether to re-use previously computed results. The invention addresses arbitrary SQL queries whose plans can contain sub-queries, updates, trigger checking, and view maintenance operations.

Type: Grant

Filed: October 31, 2003

Date of Patent: June 3, 2008

Assignee: International Business Machines Corporation

Inventors: Guy Maring Lohman, Marki Volker, Mir Hamid Pirahesh, Vijayshankar Raman, David Everett Simmen
System and method for reverse routing materialized query tables in a database

Patent number: 7185004

Abstract: A reverse routing system optimizes execution of a query that accesses data stored in one or more materialized query tables in a database of a computer system. The system receives a query directly referencing the materialized query table. The system identifies the referenced materialized query tables in a catalogue of materialized query tables and a defining query associated with the referenced materialized query table. The system substitutes the defining query for the referenced materialized query table in the received query. The system adds the referenced materialized query table to the set of eligible materialized query tables that are selected using query matching algorithms so that they can be considered for routing by the query optimizer.

Type: Grant

Filed: December 9, 2005

Date of Patent: February 27, 2007

Assignee: International Business Machines Corporation

Inventors: David Everett Simmen, Mir Hamid Pirahesh
Efficient sampling of a relational database

Patent number: 6993516

Abstract: A system, method and computer readable medium for sampling data from a relational database are disclosed, where an information processing system chooses rows from a table in a relational database for sampling, wherein data values are arranged into rows, rows are arranged into pages, and pages are arranged into tables. Pages are chosen for sampling according to a probability P and rows in a selected page are chosen for sampling according to a probability R, so that the overall probability of choosing a row for sampling is Q=PR. The probabilities P and R are based on the desired precision of estimates computed from a sample, as well as processing speed. The probabilities P and R are further based on either catalog statistics of the relational database or a pilot sample of rows from the relational database.

Type: Grant

Filed: December 26, 2002

Date of Patent: January 31, 2006

Assignee: International Business Machines Corporation

Inventors: Peter Jay Haas, Guy Maring Lohman, Mir Hamid Pirahesh, David Everett Simmen, Ashutosh Vir Vikram Singh, Michael Jeffrey Winer, Markos Zaharioudakis
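
The two-level scheme in the abstract for patent 6993516 (choose pages with probability P, then rows within a chosen page with probability R, for an overall row inclusion probability Q=PR) can be sketched as below. The function names and the in-memory representation of a table as a list of pages are assumptions for illustration; the abstract's method for choosing P and R from catalog statistics or a pilot sample is not reproduced here.

```python
import random


def bilevel_sample(pages, p, r, rng=None):
    """Keep each page with probability p, then each row of a kept page with probability r.

    `pages` is a list of pages, each page being a list of rows.
    """
    rng = rng or random.Random()
    sample = []
    for page in pages:
        if rng.random() < p:          # page-level Bernoulli trial
            for row in page:
                if rng.random() < r:  # row-level Bernoulli trial
                    sample.append(row)
    return sample


def inclusion_probability(p, r):
    """Overall probability Q = P * R that any given row ends up in the sample."""
    return p * r


# Hypothetical table: 4 pages of 5 rows each.
TABLE = [[(page, slot) for slot in range(5)] for page in range(4)]
SAMPLE = bilevel_sample(TABLE, p=0.5, r=0.4, rng=random.Random(0))
```

Setting P=1 reduces this to row-level Bernoulli sampling and R=1 to page-level sampling, which is the trade-off between precision and processing speed that the abstract mentions.
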
Efficient sampling of a relational database

Publication number: 20040128290

Abstract: A system, method and computer readable medium for sampling data from a relational database are disclosed, where an information processing system chooses rows from a table in a relational database for sampling, wherein data values are arranged into rows, rows are arranged into pages, and pages are arranged into tables. Pages are chosen for sampling according to a probability P and rows in a selected page are chosen for sampling according to a probability R, so that the overall probability of choosing a row for sampling is Q=PR. The probabilities P and R are based on the desired precision of estimates computed from a sample, as well as processing speed. The probabilities P and R are further based on either catalog statistics of the relational database or a pilot sample of rows from the relational database.

Type: Application

Filed: December 26, 2002

Publication date: July 1, 2004

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Peter Jay Haas, Guy Maring Lohman, Mir Hamid Pirahesh, David Everett Simmen, Ashutosh Vir Vikram Singh, Michael Jeffrey Winer, Markos Zaharioudakis
System and method for identifying and constructing star joins for execution by bitmap ANDing

Patent number: 6105020

Abstract: A system and method for a relational database system for identifying star joins in a query and for breaking the query down for bitmap ANDing. The fact table of the star join is located, and cycles between and within dimension tables are broken. Then, the minimal set of tables necessary to execute the star join is identified, and the dimension tables that should appear in the bitmap ANDing plan are also identified. A bitmap ANDing plan is then generated, or, if the query does not qualify for bitmap ANDing, a conventional execution plan is generated.

Type: Grant

Filed: October 11, 1999

Date of Patent: August 15, 2000

Assignee: International Business Machines Corporation

Inventors: Bruce Gilbert Lindsay, Eugene Jon Shekita, David Everett Simmen, Kaarel Truuvert
Shared nothing parallel execution of procedural constructs in SQL

Patent number: 6081801

Abstract: An automated methodology, and an apparatus for practicing the methodology, which enables the power and flexibility inherent in shared nothing parallel database systems (MPP) to be utilized on complex queries which have, heretofore, contained query elements requiring local computation or local coordination of data computation performed across the nodes of the distributed system. The present invention provides these features and advantages by identifying and marking the subgraphs containing these types of query elements as "no TQ zones" in the preparation phase prior to optimization. When the optimizer sees the markings, it builds a plan that will force the computation of the marked subgraphs to be in the same section. This preparation phase also provides the partitioning information for all inputs to the "no TQ zones". This allows the bottom-up optimizer to correctly plan the partitioning for the "no TQ zones".

Type: Grant

Filed: June 30, 1997

Date of Patent: June 27, 2000

Assignee: International Business Machines Corporation

Inventors: Roberta Jo Cochrane, George Lapis, Mir Hamid Pirahesh, Richard Sefton Sidle, David Everett Simmen, Tuong Chanh Truong, Monica Sachiye Urata
Star/join query optimization

Patent number: 5960428

Abstract: Unwieldy star/join queries are performed more efficiently using a filtered fact table. Suitable queries include star/join queries with a large fact table joined with multiple subsidiary dimension tables, where indices exist over fact table join columns. The query is analyzed to prepare a query plan for the dimension table accesses. This plan is supplemented by adding nested loop join operations, where the inner table is a dimension table plan and the outer table is an index scan performed over a fact table index of the join column with the dimension table. The plan is also supplemented by filtering records resulting from the nested loop joins using a sequence of dynamic bit vectors, ultimately yielding a list of probable fact table records. The plan is further supplemented by fetching these records to construct a distilled fact table which is used, instead of the large original table, to execute the query in considerably less time.

Type: Grant

Filed: August 28, 1997

Date of Patent: September 28, 1999

Assignee: International Business Machines Corporation

Inventors: Bruce Gilbert Lindsay, Guy Maring Lohman, Mir Hamid Pirahesh, Eugene Jon Shekita, David Everett Simmen, Monica Sachiye Urata
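
The filtering idea in the abstract for patent 5960428 (probe fact table indexes with the qualifying dimension keys, intersect the resulting record sets with bit vectors, and fetch only the surviving records) can be sketched as follows. This is a simplified assumption: the sketch uses exact bitsets indexed by row position, whereas the abstract speaks of dynamic bit vectors that yield "probable" records, and the names `build_index` and `distilled_fact_rows` are hypothetical.

```python
def build_index(fact, column):
    """Index over one fact table join column: key -> list of row ids."""
    index = {}
    for rid, row in enumerate(fact):
        index.setdefault(row[column], []).append(rid)
    return index


def distilled_fact_rows(fact, fact_indexes, dimension_keys):
    """Return the fact rows that join with every filtered dimension.

    `dimension_keys` maps a join column to the keys produced by that
    dimension's plan. Each dimension contributes one bit vector of matching
    row ids, and the vectors are ANDed together.
    """
    candidates = None
    for column, keys in dimension_keys.items():
        bits = 0
        for key in keys:
            for rid in fact_indexes[column].get(key, ()):
                bits |= 1 << rid
        candidates = bits if candidates is None else candidates & bits
    if candidates is None:  # no dimension filters: nothing to distill
        return list(fact)
    return [row for rid, row in enumerate(fact) if (candidates >> rid) & 1]


# Hypothetical fact table, for illustration only.
FACT = [
    {"store": 1, "product": "a", "amount": 10},
    {"store": 2, "product": "a", "amount": 20},
    {"store": 1, "product": "b", "amount": 30},
    {"store": 3, "product": "b", "amount": 40},
]
INDEXES = {column: build_index(FACT, column) for column in ("store", "product")}
DISTILLED = distilled_fact_rows(FACT, INDEXES, {"store": [1, 2], "product": ["a"]})
```

The remaining query then runs against the small distilled list instead of the full fact table.
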
