Patents by Inventor David E. Simmen
David E. Simmen has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11188550
Abstract: The disclosed embodiments include a method performed by a data intake and query system. The method includes ingesting each metric including at least one key value and a measured value taken of a computing resource, and storing each metric in an index of a metrics store, where the index defines at least one dimension populated with the at least one key value and a measure populated with the measured value. The method further includes cataloging metadata in a metrics catalog, where the metadata is related to the metrics stored in the metrics store, performing an analysis of metrics data included in the metrics store and/or the metrics catalog to obtain results, and causing display of the results or an indication of the results on a display device.
Type: Grant
Filed: October 31, 2016
Date of Patent: November 30, 2021
Assignee: SPLUNK INC.
Inventors: Thomas Allan Haggie, Clint Sharp, Alexander Douglas James, David Ryan Marquardt, Hailun Yan, Christopher Pride, Vishal Patel, Amrittpal Singh Bath, Pratiksha Shah, Murugan Kandaswamy, Steve Yu Zhang, Ledion Bitincka, David E. Simmen, Marc Andre Chene, Esguerra Ma Kharisma, Igor Stojanovski
-
Patent number: 11055300
Abstract: The disclosed embodiments include a method performed by a data intake and query system. The method includes receiving a real-time search query including search criteria, and receiving a stream of metrics, where each metric includes a measured value taken of a computing device. The method further includes filtering the metrics to obtain filtered metrics satisfying the search criteria, creating an in-memory summarization data structure based on the filtered metrics, communicating the summarization data to a search head, and providing search results including the summarization data, where the summarization data or data indicative of the summarization data is displayed on a display of a display device.
Type: Grant
Filed: October 31, 2016
Date of Patent: July 6, 2021
Assignee: SPLUNK INC.
Inventors: Steve Yu Zhang, Ledion Bitincka, Vishal Patel, David E. Simmen
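The filter-then-summarize flow described in this abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the metric layout (plain dictionaries with `name` and `value` keys), the `Summary` class, and the `summarize` function are all hypothetical names chosen for illustration, and the search criteria are assumed to be simple equality tests on metric fields.

```python
from dataclasses import dataclass


@dataclass
class Summary:
    """Hypothetical in-memory summary of one metric's measured values."""
    count: int = 0
    total: float = 0.0
    minimum: float = float("inf")
    maximum: float = float("-inf")

    def add(self, value: float) -> None:
        # Fold one measured value into the running aggregates.
        self.count += 1
        self.total += value
        self.minimum = min(self.minimum, value)
        self.maximum = max(self.maximum, value)


def summarize(stream, criteria):
    """Keep metrics whose fields equal every criterion, then aggregate by name."""
    summaries = {}
    for metric in stream:
        # Assumption: criteria are field == value tests.
        if all(metric.get(key) == wanted for key, wanted in criteria.items()):
            summaries.setdefault(metric["name"], Summary()).add(metric["value"])
    return summaries


stream = [
    {"name": "cpu", "host": "a", "value": 10.0},
    {"name": "cpu", "host": "b", "value": 90.0},
    {"name": "cpu", "host": "a", "value": 30.0},
]
result = summarize(stream, {"host": "a"})
```

In this sketch only the compact `Summary` objects, rather than the raw metrics, would need to be sent on to whatever component assembles the search results.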
-
Publication number: 20180089289
Abstract: The disclosed embodiments include a method performed by a data intake and query system. The method includes receiving a real-time search query including search criteria, and receiving a stream of metrics, where each metric includes a measured value taken of a computing device. The method further includes filtering the metrics to obtain filtered metrics satisfying the search criteria, creating an in-memory summarization data structure based on the filtered metrics, communicating the summarization data to a search head, and providing search results including the summarization data, where the summarization data or data indicative of the summarization data is displayed on a display of a display device.
Type: Application
Filed: October 31, 2016
Publication date: March 29, 2018
Inventors: Steve Yu Zhang, Ledion Bitincka, Vishal Patel, David E. Simmen
-
Publication number: 20180089290
Abstract: The disclosed embodiments include a method performed by a data intake and query system. The method includes ingesting each metric including at least one key value and a measured value taken of a computing resource, and storing each metric in an index of a metrics store, where the index defines at least one dimension populated with the at least one key value and a measure populated with the measured value. The method further includes cataloging metadata in a metrics catalog, where the metadata is related to the metrics stored in the metrics store, performing an analysis of metrics data included in the metrics store and/or the metrics catalog to obtain results, and causing display of the results or an indication of the results on a display device.
Type: Application
Filed: October 31, 2016
Publication date: March 29, 2018
Inventors: Thomas Allan Haggie, Clint Sharp, Alexander Douglas James, David Ryan Marquardt, Hailun Yan, Christopher Pride, Vishal Patel, Amrittpal Singh Bath, Pratiksha Shah, Murugan Kandaswamy, Steve Yu Zhang, Ledion Bitincka, David E. Simmen, Marc Andre Chene, Esguerra Ma Kharisma, Igor Stojanovski
-
Patent number: 8386450
Abstract: A technique for optimizing execution of a query that accesses data stored on a data store connected to a computer. Statistics on one or more pre-defined queries are used to determine an optimal query execution plan for the query. In particular, improved cardinality estimates are generated for one or more query execution plans for the query using statistics of one or more pre-defined queries that vertically overlap the query. These cardinality estimates are used to make more accurate cost estimates, thus improving the likelihood of determining the optimal query execution plan.
Type: Grant
Filed: March 24, 2004
Date of Patent: February 26, 2013
Assignee: International Business Machines Corporation
Inventor: David E. Simmen
-
Publication number: 20120209844
Abstract: A data mashup system having information extraction capabilities for receiving multiple streams of textual data, at least one of which contains unstructured textual data. A repository stores annotators that describe how to analyze the streams of textual data for specified unstructured data components. The annotators are applied to the data streams to identify and extract the specified data components according to the annotators. The extracted data components are tagged to generate structured data components and the specified unstructured data components in the input data streams are replaced with the tagged data components. The system then combines the tagged data from the multiple streams to form a mashup output data stream.
Type: Application
Filed: March 7, 2012
Publication date: August 16, 2012
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventors: Yunyao Li, Frederick R. Reiss, David E. Simmen, Suresh Thalamati
-
Publication number: 20110295853
Abstract: A data mashup system having information extraction capabilities for receiving multiple streams of textual data, at least one of which contains unstructured textual data. A repository stores annotators that describe how to analyze the streams of textual data for specified unstructured data components. The annotators are applied to the data streams to identify and extract the specified data components according to the annotators. The extracted data components are tagged to generate structured data components and the specified unstructured data components in the input data streams are replaced with the tagged data components. The system then combines the tagged data from the multiple streams to form a mashup output data stream.
Type: Application
Filed: May 26, 2010
Publication date: December 1, 2011
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventors: Yunyao Li, Frederick R. Reiss, David E. Simmen, Suresh Thalamati
-
Patent number: 7890491
Abstract: A technique for optimizing execution of a query that accesses data stored on a data store connected to a computer. Statistics on one or more automatic summary tables are used to determine an optimal query execution plan for the query. In particular, improved cardinality estimates are generated for one or more query execution plans for the query using statistics of one or more automatic summary tables that vertically overlap the query. These cardinality estimates are used to make more accurate cost estimates, thus improving the likelihood of determining the optimal query execution plan.
Type: Grant
Filed: September 26, 2000
Date of Patent: February 15, 2011
Assignee: International Business Machines Corporation
Inventor: David E. Simmen
-
Patent number: 7783625
Abstract: Data in materialized query tables (MQTs) are used as statistics for determining the optimal execution plan for a query. When an MQT is defined, it is examined to determine whether its data provides statistics for determining an optimal execution plan for a query. If so, then the MQT is identified, in the RDBMS, as a source for statistics. Information needed to exploit the MQT data as statistics is cataloged in the RDBMS. This information includes a characterization of the type of statistics provided by the MQT, the table and column distributions represented by those statistics, and a query for later retrieving relevant data from the MQT during the query optimization process. When a query is accepted for execution, the cataloged relevant information about MQTs is examined to determine whether an MQT exists that provides statistics relevant to optimization of the query. If such an MQT exists, then the relevant data is retrieved from the MQT using the cataloged query.
Type: Grant
Filed: October 21, 2005
Date of Patent: August 24, 2010
Assignee: International Business Machines Corporation
Inventor: David E. Simmen
-
Publication number: 20090327323
Abstract: Data integration in a data processing system is provided. A data mashup specification is received and an interleaved sequence of operations as defined by the data mashup specification is executed. The interleaved sequence of operations comprises at least one of an import operation, an augment operation, or a publish operation. In executing the interleaved sequence of operations a determination is made as to the next operation to execute. An outer context is formed and added to a binding context of the next operation. If the next operation is an import operation, a data resource is imported from a data source and an input generic feed is generated. If the next operation is an augment operation, a set of augmented generic feeds is produced from a set of input generic feeds. If the next operation is a publish operation, a new data resource is produced from a specified augmented generic feed.
Type: Application
Filed: June 27, 2008
Publication date: December 31, 2009
Applicant: International Business Machines Corporation
Inventors: Mehmet Altinel, Volker G. Markl, David E. Simmen, Ashutosh Singh
-
System and Method for Optimizing Query Access to a Database Comprising Hierarchically-Organized Data
Publication number: 20080222087
Abstract: A cost-based optimizer optimizes access to at least a portion of hierarchically-organized documents, such as those formatted using eXtensible Markup Language (XML), by estimating a number of results produced by the access of the hierarchically-organized documents. Estimating the number of results comprises computing the cardinality of each operator executing query language expressions and further computing a sequence size of sequences of hierarchically-organized nodes produced by the query language expressions. Access to the hierarchically-organized documents is optimized using the structure of the query expression and/or path statistics involving the hierarchically-organized data. The cardinality and the sequence size are used to calculate a cost estimation for execution of alternate query execution plans. Based on the cost estimation, an optimal query execution plan is selected from among the alternate query execution plans.
Type: Application
Filed: May 15, 2006
Publication date: September 11, 2008
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventors: Andrey Balmin, Tom Eliaz, Guy M. Lohman, David E. Simmen, Chun Zhang
-
Patent number: 7080062
Abstract: A method, apparatus, and article of manufacture for optimizing database queries using automatic summary tables. Query execution plans derived from an automatic summary table can be used to generate results for the query if a comparison of the query requirements with an automatic summary table definition determines that the automatic summary table overlaps the query, and if an optimization process determines that using the summary table will lower the cost of the query. The optimization process involves enumerating a plurality of query execution plans for the query, wherein the query execution plans enumerated include those that access combinations of query and summary tables. Each such query execution plan is assigned a cost representing an estimation of its execution characteristics, and the least costly query execution plan is selected for the query.
Type: Grant
Filed: February 11, 2000
Date of Patent: July 18, 2006
Assignee: International Business Machines Corporation
Inventors: Ting Yu Leung, David E. Simmen, Yang Sun
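The enumerate-cost-select loop in this abstract can be sketched in a few lines of Python. This is a toy model under stated assumptions, not the patented optimizer: the overlap test is reduced to "the summary table contains every column the query needs", costs are supplied as fixed numbers instead of being estimated, and `enumerate_plans`, `cheapest`, and the table names are hypothetical.

```python
def enumerate_plans(query, base_cost, summary_tables):
    """List candidate plans: the base-table plan plus one per usable summary table."""
    plans = [("base tables", base_cost)]
    for name, table in summary_tables.items():
        # Assumption: a summary table overlaps the query when it holds
        # every column the query needs.
        if set(query["columns"]) <= set(table["columns"]):
            plans.append((f"summary table {name}", table["cost"]))
    return plans


def cheapest(plans):
    """Select the plan with the lowest estimated cost."""
    return min(plans, key=lambda plan: plan[1])


query = {"columns": ["region", "total"]}
summary_tables = {
    "sales_by_region": {"columns": ["region", "total"], "cost": 12.0},
    "sales_by_day": {"columns": ["day", "total"], "cost": 5.0},
}
plans = enumerate_plans(query, base_cost=400.0, summary_tables=summary_tables)
best = cheapest(plans)
```

Here `sales_by_day` is cheaper but never becomes a candidate because it lacks the `region` column, so the selected plan reads `sales_by_region`.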
-
Patent number: 7007006
Abstract: The invention herein provides method and apparatus, including software for determining a set of materialized views or indices of the contents or a subset of the contents of a database in a data processing system to be created for one or more users of the database. The method and apparatus provide method and means for evaluating a workload presented by a user to the database; evaluating the data processing system characteristics; evaluating the database characteristics; and, using the above evaluations for recommending a set of suitable materialized views or indices to the user.
Type: Grant
Filed: June 21, 2002
Date of Patent: February 28, 2006
Assignee: International Business Machines Corporation
Inventors: Daniel C. Zilio, Gary G. Valentin, Guy M. Lohman, Calisto P. Zuzarte, Roberta J. Cochrane, Hamid Pirahesh, Markos Zaharioudakis, Kit Man Cheung, Kevin S. Beyer, David E. Simmen, Ting Y. Leung, Samuel S. Lightstone
-
Publication number: 20040181521
Abstract: A technique for optimizing execution of a query that accesses data stored on a data store connected to a computer. Statistics on one or more pre-defined queries are used to determine an optimal query execution plan for the query. In particular, improved cardinality estimates are generated for one or more query execution plans for the query using statistics of one or more pre-defined queries that vertically overlap the query. These cardinality estimates are used to make more accurate cost estimates, thus improving the likelihood of determining the optimal query execution plan.
Type: Application
Filed: March 24, 2004
Publication date: September 16, 2004
Inventor: David E. Simmen
-
Patent number: 6738755
Abstract: A method, apparatus, and article of manufacture for incrementally estimating the cardinality of a derived relation when statistically correlated predicates are applied. A plurality of query execution plans (QEPs) are generated for the query. During the generation of the QEPs, a cardinality is computed for any of the QEPs in which two or more predicates are correlated to each other. The cardinality comprises a number of rows expected to be returned by the QEP and is computed in an incremental fashion for each operator of the QEP. The computations include calculations that may be done prior to the generation of the QEPs and calculations that are necessarily done as each operator of a QEP is added to that QEP. Thereafter, one of the QEPs is chosen to satisfy the query in a manner that minimizes an estimated cost metric, wherein the cost metric is computed using the cardinality.
Type: Grant
Filed: February 29, 2000
Date of Patent: May 18, 2004
Assignee: International Business Machines Corporation
Inventors: Johann Christoph Freytag, Guy Maring Lohman, David E. Simmen, Calisto Paul Zuzarte
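To see why correlated predicates need special handling, consider a hedged Python sketch of incremental cardinality estimation. It is one simple way to illustrate the idea and is not taken from the patent: each predicate is first applied under the usual independence assumption, and once every member of a known correlated group has been applied, the independence product is swapped for a joint selectivity. The function name, the selectivity tables, and the car example are all hypothetical.

```python
def incremental_cardinality(base_rows, predicates, selectivity, joint):
    """Return the estimated row count after each predicate is applied in order.

    selectivity maps a predicate to its individual selectivity.
    joint maps a frozenset of correlated predicates to their combined selectivity.
    """
    applied = set()
    cardinality = float(base_rows)
    history = []
    for predicate in predicates:
        # Independence assumption first: multiply by the individual selectivity.
        cardinality *= selectivity[predicate]
        applied.add(predicate)
        for group, joint_selectivity in joint.items():
            # When this predicate completes a correlated group, replace the
            # product of individual selectivities with the joint selectivity.
            if predicate in group and group <= applied:
                independent = 1.0
                for member in group:
                    independent *= selectivity[member]
                cardinality *= joint_selectivity / independent
        history.append(cardinality)
    return history


selectivity = {"make = 'Honda'": 0.1, "model = 'Accord'": 0.01}
# Every Accord is a Honda, so the pair is only as selective as the model test.
joint = {frozenset(selectivity): 0.01}
history = incremental_cardinality(
    1000, ["make = 'Honda'", "model = 'Accord'"], selectivity, joint
)
```

Under independence the estimate for both predicates would be about 1 row out of 1000; with the correction it is about 10 rows, and the final figure does not depend on the order in which the two predicates are applied.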
-
Patent number: 6625593
Abstract: A method, apparatus, and article of manufacture for optimizing database queries. The query is analyzed to determine whether at least a portion of the query can be evaluated using a plurality of parallel operations without data redistribution. If so, then the most efficient query execution plan that uses these parallel operations is constructed and executed.
Type: Grant
Filed: September 13, 1999
Date of Patent: September 23, 2003
Assignee: International Business Machines Corporation
Inventors: Ting Yu Leung, Mir Hamid Pirahesh, David E. Simmen, Tuong Chanh Truong
-
Publication number: 20030088541
Abstract: The invention herein provides method and apparatus, including software for determining a set of materialized views or indices of the contents or a subset of the contents of a database in a data processing system to be created for one or more users of the database. The method and apparatus provide method and means for evaluating a workload presented by a user to the database; evaluating the data processing system characteristics; evaluating the database characteristics; and, using the above evaluations for recommending a set of suitable materialized views or indices to the user.
Type: Application
Filed: June 21, 2002
Publication date: May 8, 2003
Inventors: Daniel C. Zilio, Gary G. Valentin, Guy M. Lohman, Calisto P. Zuzarte, Roberta J. Cochrane, Hamid Pirahesh, Markos Zaharioudakis, Kit Man Cheung, Kevin S. Beyer, David E. Simmen, Ting Y. Leung, Samuel S. Lightstone
-
Patent number: 6513034
Abstract: A method, apparatus, and article of manufacture for optimizing a query by deriving uniqueness for indices on the summary tables. The query is analyzed to determine whether a summary table can be used to answer the query or a summary table is directly referenced in the query. A determination is made whether a non-unique index of the summary table is unique based on a query definition of the summary table. If the non-unique index of the summary table is unique, then query optimization techniques can be applied.
Type: Grant
Filed: December 21, 1999
Date of Patent: January 28, 2003
Assignee: International Business Machines Corporation
Inventors: Ting Yu Leung, David E. Simmen
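One familiar case of deriving uniqueness from a table's defining query can be sketched in Python. The abstract does not say which rules the patent uses, so the GROUP BY rule below is an assumption chosen for illustration: a grouped query yields exactly one row per distinct combination of grouping columns, so any index whose key contains all grouping columns cannot hold duplicate keys, even if it was declared non-unique. The function and column names are hypothetical.

```python
def index_is_effectively_unique(index_columns, group_by_columns):
    """Report whether an index key must be unique on a GROUP BY summary table.

    Assumption: the summary table is defined by a query with a GROUP BY
    clause, so each combination of grouping columns appears in one row only.
    """
    if not group_by_columns:
        # No grouping columns were supplied, so this rule proves nothing.
        return False
    return set(group_by_columns) <= set(index_columns)


# Summary table assumed to be defined as:
#   SELECT region, year, SUM(amount) AS total FROM sales GROUP BY region, year
group_by = ["region", "year"]
covering_index = index_is_effectively_unique(["year", "region"], group_by)
partial_index = index_is_effectively_unique(["region"], group_by)
```

In this sketch an index on (year, region) is provably unique while an index on (region) alone is not, since several years can share one region.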
-
Method and system for look ahead query evaluation planning based on interesting partition properties
Patent number: 6345267
Abstract: A relational data base management system includes a query processor that uses a query operator partition property to perform QEP pruning and to ensure that data input to a query operator is partitioned appropriately for the operation. The partition property indicates the group of network nodes across which a table is distributed. The query processor also makes use of partition classes that are designated “interesting classes” to perform preoptimization planning and query pruning, and to perform look-ahead partitioning based on partition classes that are identified as being of interest to future operations, thereby more efficiently evaluating complex query statements in an MPP, shared-nothing environment.
Type: Grant
Filed: March 2, 2000
Date of Patent: February 5, 2002
Assignee: International Business Machines Corporation
Inventors: Guy Maring Lohman, Eugene Jon Shekita, David E. Simmen, Monica Sachiye Urata
-
Patent number: 6339769
Abstract: A method, apparatus, and article of manufacture for optimizing database queries using a materialized view for a table referenced in the query, wherein the materialized view has different properties than the referenced table. The materialized view may be replicated across multiple processors of the computer system, so that some or all of the query can be executed in a local fashion wherein there is no data movement required to perform the operations. The materialized view also may be partitioned across multiple processors of the computer system using a different partitioning key than the referenced table. The materialized view may be a vertical and/or horizontal subset of the table, so that only selected columns and/or tuples from the table are present therein. Columns may be added to the materialized view to contain pre-computed results of complex expressions, and indices may be created on the columns of the materialized view.
Type: Grant
Filed: September 14, 1998
Date of Patent: January 15, 2002
Assignee: International Business Machines Corporation
Inventors: Roberta Jo Cochrane, George Lapis, Ting Yu Leung, Mir Hamid Pirahesh, Monica Sachiye Urata, Richard Sefton Sidle, David E. Simmen, Calisto Paul Zuzarte