Based On Joins Patents (Class 707/714)
  • Patent number: 11163757
    Abstract: Systems, methods, and devices for querying over an external table are disclosed. A method includes connecting a database platform to an external table such that the database platform has read access for the external table and does not have write access for the external table. The method includes receiving a query comprising a predicate, the query directed at least to data in the external table. The method includes determining, based on metadata, one or more partitions in the external table comprising data satisfying the predicate. The method includes pruning, based on the metadata, all partitions in the external table that do not comprise any data satisfying the predicate. The method includes generating a query plan comprising a plurality of discrete subtasks. The method includes assigning, based on the metadata, the plurality of discrete subtasks to one or more nodes in an execution platform.
    Type: Grant
    Filed: March 12, 2021
    Date of Patent: November 2, 2021
    Assignee: Snowflake Inc.
    Inventors: Subramanian Muralidhar, Benoit Dageville, Thierry Cruanes, Nileema Shingte, Saurin Shah, Torsten Grabs, Istvan Cseri
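The metadata-based pruning and subtask assignment described in this abstract can be illustrated with a minimal sketch. It assumes the metadata is a per-partition minimum and maximum for a single column and that subtasks are assigned round-robin; `PartitionMeta`, `prune_partitions`, and `assign_subtasks` are hypothetical names chosen for illustration and do not come from the patent.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PartitionMeta:
    """Hypothetical per-partition metadata: min and max of one column."""
    name: str
    min_value: int
    max_value: int


def prune_partitions(partitions, low, high):
    """Keep partitions whose [min, max] range can overlap low <= x <= high."""
    return [p for p in partitions if p.max_value >= low and p.min_value <= high]


def assign_subtasks(partitions, nodes):
    """Assign one scan subtask per surviving partition, round-robin (assumed policy)."""
    plan = {node: [] for node in nodes}
    for i, partition in enumerate(partitions):
        plan[nodes[i % len(nodes)]].append(partition.name)
    return plan


partitions = [
    PartitionMeta("p0", 0, 99),
    PartitionMeta("p1", 100, 199),
    PartitionMeta("p2", 200, 299),
]
# Predicate: 150 <= x <= 220. Partition p0 cannot contain a match and is pruned.
survivors = prune_partitions(partitions, 150, 220)
plan = assign_subtasks(survivors, ["node-a", "node-b"])
print(plan)
```

Only the metadata is consulted during pruning, so the external table's data is never read for partitions that cannot satisfy the predicate.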
  • Patent number: 11163782
    Abstract: Techniques are provided for storing generic time series data using a dynamic schema. One method comprises receiving a request to store a data point into a time series, wherein the data point comprises a key-value pair; in response to the received request, (i) evaluating whether a current dynamic schema supports the data point, and (ii) updating the current dynamic schema to support the data point responsive to the current dynamic schema not supporting the data point, wherein the current dynamic schema supports a plurality of data types; encoding the data point using an encoding technique that depends on the data type of at least one key-value pair of the data point; and storing the data point in a time series storage buffer.
    Type: Grant
    Filed: March 10, 2020
    Date of Patent: November 2, 2021
    Assignee: EMC IP Holding Company LLC
    Inventor: Dominique Prunier
  • Patent number: 11163769
    Abstract: A computer-implemented method for joining two data tables on a join attribute, where the data tables have at least a first and a second attribute and the second attribute is the join attribute. The method provides a function for associating a computing node to a given record. The function may be used to determine the associated computing node. The records of the two data tables may be distributed to the respective determined computing nodes. The relationship between the values of the first and second attributes may be modelled using a predefined dataset. For each record of the two data tables the values of the first attribute may be re-determined using the corresponding values of the second attribute. The function may be used to re-determine the associated computing node.
    Type: Grant
    Filed: June 18, 2019
    Date of Patent: November 2, 2021
    Assignee: International Business Machines Corporation
    Inventors: Michal Bodziony, Konrad K. Skibski, Tomasz Kazalski, Artur M. Gruszecki, Lukasz Gaza
  • Patent number: 11157516
    Abstract: A method and apparatus for managing a set of processors for a set of queries is described. In an exemplary embodiment, a device receives a set of queries for a data warehouse, the set of queries including one or more queries to be processed by the data warehouse. The device further provisions a set of processors from a first plurality of processors, where the set of processors is to process the set of queries, and a set of storage resources to store data for the set of queries. In addition, the device monitors a utilization of the set of processors as the set of processors processes the set of queries. The device additionally updates a number of the processors in the set of processors provisioned based on the utilization. Furthermore, the device processes the set of queries using the updated set of processors.
    Type: Grant
    Filed: January 4, 2021
    Date of Patent: October 26, 2021
    Assignee: Snowflake Inc.
    Inventors: Thierry Cruanes, Benoit Dageville, Marcin Zukowski
  • Patent number: 11151166
    Abstract: Context-based multi-granularity intent discovery, in relation to determining the intent of a user-problem associated with one of a plurality of products, includes identifying each of the plurality of products in a base intent level of a hierarchical intent data structure, creating a first granularity level below the base intent level, and creating a subsequent granularity level. The first granularity level includes groupings of the plurality of user-problems created according to a first common intent feature; the common intent feature is related to a respective one of the plurality of products. The subsequent granularity level includes sub-groupings of the user-problems of a respective grouping with each sub-grouping created according to a second common intent feature. The second common intent feature of each sub-grouping is determined without consideration of the first common intent feature.
    Type: Grant
    Filed: July 3, 2019
    Date of Patent: October 19, 2021
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Ziliu Li, Junaid Ahmed
  • Patent number: 11138197
    Abstract: Disclosed is a computer program stored in a computer-readable medium according to an exemplary embodiment of the present disclosure. When the computer program is executed by one or more processors, the computer program allows the following operations for data caching to be performed and the operations may include an operation of generating a cache, an operation of caching at least some of the segments and at least one of records to the cache, an operation of reading the cache when receiving an inquiry request from a client, an operation of determining to read a cursor based on whether there is a segment or record corresponding to the inquiry request in the cache; and an operation of determining whether a query for at least one of a relational database and a hierarchical database is generated based on whether there is the segment or record in the cursor.
    Type: Grant
    Filed: April 12, 2019
    Date of Patent: October 5, 2021
    Assignee: TMAXSOFT CO., LTD
    Inventors: Donghyeon Kim, Minhyeok Lee, Wonjun Jeong, Byungmin Ahn
  • Patent number: 11120020
    Abstract: A method for performing analysis may use a minimal set of indices for an input query including identifying the input query including primitive searches that are accelerated using indices, computing a minimal set of indices for the input query using an optimal, polynomial-time algorithm, and performing relational data analysis using the minimal set of indices for input queries.
    Type: Grant
    Filed: December 4, 2015
    Date of Patent: September 14, 2021
    Assignee: Oracle International Corporation
    Inventors: Bernhard F. Scholz, Pavle Subotic
  • Patent number: 11113282
    Abstract: Techniques are provided for merging (a) statistics associated with data added to a table in a bulk load operation with (b) statistics associated with data that existed in the table before the bulk load operation. The statistics associated with the bulk load data are generated on-the-fly during the bulk load, and are merged with the pre-existing statistics as part of the same transaction that is used to perform the bulk load operation. Consequently, execution plans for queries that are assigned snapshot times after the commit time of the bulk load transaction will be selected based on the new statistics, while execution plans for queries that are assigned snapshot times before the commit time of the bulk load transaction will be selected based on the pre-existing statistics.
    Type: Grant
    Filed: September 28, 2018
    Date of Patent: September 7, 2021
    Assignee: ORACLE INTERNATIONAL CORPORATION
    Inventors: Sunil P. Chakkappen, Hong Su, Mohamed Zait, Suratna Budalakoti
  • Patent number: 11113273
    Abstract: Managed materialized views may be generated from across heterogeneous data sources. A request to create a materialized view may be received and performed by a materialized view management platform, which may obtain and generate the materialized view from different data sources and store the materialized view in a target system. Changes to the data sources may be obtained at the materialized view management platform and updates to the materialized view may be determined. The materialized view in the target system may be updated.
    Type: Grant
    Filed: November 29, 2019
    Date of Patent: September 7, 2021
    Assignee: Amazon Technologies, Inc.
    Inventors: Rohan Deshpande, Yannis Papakonstantinou, Omer Ahmed Zaki, Karthik Gurumoorthy Subramanya Bharathy, Rajib Dugar, Shivateja Reddy Ummenthala
  • Patent number: 11106680
    Abstract: Example implementations described herein are directed to systems and methods for managing a relationship between real-time analysis processes and applications, where each of the applications are configured to utilize output from one or more of the corresponding real-time analysis processes. In an example implementation, resource adjustment is applied to the real-time analysis process based on a determined priority.
    Type: Grant
    Filed: November 8, 2016
    Date of Patent: August 31, 2021
    Assignee: HITACHI, LTD.
    Inventors: Hiroaki Shikano, Yukinori Sakashita
  • Patent number: 11108835
    Abstract: A computer system receives an input corresponding to one or more data pipeline metrics and generates a first plurality of data windows from a first portion of a plurality of streaming data, the first plurality of data windows corresponding to a first data pipeline metric. The computer system generates a first plurality of data aggregations corresponding to the first plurality of data windows, wherein generating the first plurality of data aggregations includes aggregating a data associated with each of the first plurality of windows based on the first data pipeline metric. The computer system accesses a database to determine a first set of reference data aggregations that corresponds to a first data aggregation of the first plurality of data aggregations and determines whether the first data aggregation corresponds to an anomaly based on utilizing a model to compare the first set of reference data aggregations to the first data aggregation.
    Type: Grant
    Filed: March 29, 2019
    Date of Patent: August 31, 2021
    Assignee: PayPal, Inc.
    Inventors: Guy Gerson-Golan, Uri Silberstein, Yiftach Schlamm
  • Patent number: 11106670
    Abstract: Various examples are directed to systems and methods for utilizing local identifiers in a distributed database system. A database management system server may receive first execution unit data describing a first execution unit of a first database query. The first execution unit data may comprise first operand data describing a set of operands for the first execution unit. The database management system server may determine that the first operand data describes at least one operand from a first partition and at least one operand from a second partition. The database management system server may generate a first set of local identifiers describing rows at the first partition and a first set of local identifiers describing rows at the second partition. The database management system server may execute the first execution unit based at least in part on the first set of local identifiers describing rows at the first partition and the first set of local identifiers describing rows at the second partition.
    Type: Grant
    Filed: September 27, 2017
    Date of Patent: August 31, 2021
    Assignee: SAP SE
    Inventors: Andreas Bauer, Maik Goergens
  • Patent number: 11099108
    Abstract: A non-transitory storage medium having stored thereon logic is disclosed. The logic is executable by one or more processors to perform operations including: determining, according to first user input received via a user interface (UI), a type of sampling to be performed, and generating (i) a randomized sampling plan for a substance stored in a first container, and (ii) a corresponding data object representing a three-dimensional (3D) model of the first container, wherein the 3D model includes a plurality of increments, wherein a subset of the plurality of increments to be sampled includes a visual indicator. A first type of sampling to be performed includes bulk sampling, and a second type of sampling to be performed includes packaged sampling. The logic, when executed by the one or more processors, may perform further operations including receiving additional user input, via the UI, corresponding to information pertaining to sampling environment conditions.
    Type: Grant
    Filed: November 21, 2018
    Date of Patent: August 24, 2021
    Inventors: Tyler Autera, Brian Lannon, Swetha Kaul, Thomas Autera
  • Patent number: 11093514
    Abstract: A query comprising a graph pattern comprising a plurality of graph triples of node-edge-node is received. A graph database comprising a plurality of graph patterns is accessed and a plurality of graph patterns in the graph database that match the received query are identified. An inverse frequency for each graph triple of the received query in the accessed graph database is calculated and from this a score for each graph pattern in the graph database that matches the received query is also calculated, the score comprising a sum of the inverse frequencies for each graph triple contained within the respective graph pattern. The plurality of graph patterns in the graph database that match the received query are then ranked according to their respective calculated scores.
    Type: Grant
    Filed: July 23, 2018
    Date of Patent: August 17, 2021
    Assignee: International Business Machines Corporation
    Inventors: Robert Tucker, Robert Kerr, Martin Nathan, Dominique Martindale
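The scoring scheme in this abstract can be sketched as follows. The sketch assumes that the inverse frequency of a triple is 1 divided by the number of stored patterns containing it, and that a stored pattern "matches" when it shares at least one triple with the query; both are illustrative assumptions, and `rank_patterns` is a hypothetical name.

```python
from collections import Counter


def rank_patterns(query_triples, patterns):
    """Rank stored graph patterns by the summed inverse frequency of shared triples."""
    query = set(query_triples)

    # Count, for each query triple, how many stored patterns contain it.
    frequency = Counter()
    for triples in patterns.values():
        frequency.update(query & set(triples))

    # Assumed definition: inverse frequency = 1 / number of containing patterns.
    inverse = {triple: 1.0 / count for triple, count in frequency.items()}

    scores = {}
    for name, triples in patterns.items():
        matched = query & set(triples)
        if matched:
            scores[name] = sum(inverse[t] for t in matched)

    # Highest score first.
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


KNOWS = ("alice", "knows", "bob")
WORKS = ("bob", "works_at", "acme")
patterns = {
    "g1": [KNOWS, WORKS],
    "g2": [KNOWS],
    "g3": [KNOWS, ("bob", "lives_in", "paris")],
}
ranked = rank_patterns([KNOWS, WORKS], patterns)
print(ranked)
```

Under these assumptions a rare triple contributes more to a pattern's score than a common one, so `g1`, the only pattern containing the rare `WORKS` triple, ranks first.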
  • Patent number: 11093524
    Abstract: A method and apparatus for managing a set of processors for a set of queries is described. In an exemplary embodiment, a device receives a set of queries for a data warehouse, the set of queries including one or more queries to be processed by the data warehouse. The device further provisions a set of processors from a first plurality of processors, where the set of processors is to process the set of queries, and a set of storage resources to store data for the set of queries. In addition, the device monitors a utilization of the set of processors as the set of processors processes the set of queries. The device additionally updates an amount of the processor resources in the set of processors provisioned based on the utilization. Furthermore, the device processes the set of queries using the updated set of processors.
    Type: Grant
    Filed: March 5, 2020
    Date of Patent: August 17, 2021
    Assignee: Snowflake Inc.
    Inventors: Thierry Cruanes, Benoit Dageville, Marcin Zukowski
  • Patent number: 11086900
    Abstract: A method and apparatus for managing a set of processors for a set of queries is described. In an exemplary embodiment, a device receives a set of queries for a data warehouse, the set of queries including one or more queries to be processed by the data warehouse. The device further provisions a set of processors from a first plurality of processors, where the set of processors is to process the set of queries, and a set of storage resources to store data for the set of queries. In addition, the device monitors a utilization of the set of processors as the set of processors processes the set of queries. The device additionally updates a number of the processors in the set of processors provisioned based on the utilization. Furthermore, the device processes the set of queries using the updated set of processors.
    Type: Grant
    Filed: March 5, 2020
    Date of Patent: August 10, 2021
    Assignee: Snowflake Inc.
    Inventors: Thierry Cruanes, Benoit Dageville, Marcin Zukowski
  • Patent number: 11086870
    Abstract: A data store system includes an array of persistent storage devices configured to store a plurality of data store tables. The data store system includes a processor in communication with the storage device. The processor may receive a query comprising an aggregate function and identify structure of an argument of the aggregate function. The subset of data store tables may be associated with the argument. The processor may partially-execute the aggregate function on each data store table in the subset involved in the argument of the aggregate function to create partially-executed results for each data store table of the subset of data store tables. The processor may join the partially-executed results based on join conditions contained in the aggregate function. The processor may complete execution of the aggregate function on the partially-executed results to generate a final result of the aggregate function. A method and computer-readable medium are also disclosed.
    Type: Grant
    Filed: December 30, 2015
    Date of Patent: August 10, 2021
    Assignee: Teradata US, Inc.
    Inventors: Anantha B. Subramanian, Sanjay Nair, Yi Xia, Grace Kwan-On Au, Kuorong Chiang
  • Patent number: 11080281
    Abstract: A method, system, and computer program product for graph-based searching for one or more data streams is disclosed. A computer-implemented method comprises extracting a plurality of tuples from one or more data streams. The method further comprises generating a graph for the plurality of tuples in which a node represents a tuple of the plurality of tuples and an edge represents a correlation between the node and another node, and the edge is generated based at least partly on one or more predetermined queries for the one or more data streams. The method further comprises traversing the graph based on the one or more predetermined queries. Accordingly, embodiments of the present disclosure can improve the search speed by use of the graph-based searching for one or more data streams.
    Type: Grant
    Filed: May 17, 2019
    Date of Patent: August 3, 2021
    Assignee: International Business Machines Corporation
    Inventors: Xiao Xing Liang, Cheng Luo, Wayne B. Riley, Shaw-Ben Shi, Meng Hong Tsai, Qingyan Wang, Yu Ling Zheng, Yu Chen Zhou
  • Patent number: 11080276
    Abstract: In some implementations, there is provided an apparatus having at least one data processor and at least one memory storing instructions which, when executed by the at least one data processor, cause executing at least a portion of a query execution plan; determining, at an edge including an intermediate result, a cardinality; proceeding with the execution of the query execution plan, when the determined cardinality is within an optimality range associated with the edge; and selecting an alternative query execution plan for execution, when the determined cardinality is outside the optimality range associated with the edge. Related systems and articles of manufacture are also provided.
    Type: Grant
    Filed: February 23, 2018
    Date of Patent: August 3, 2021
    Assignee: SAP SE
    Inventors: Florian Wolf, Norman May, Paul Willems, Kai-Uwe Sattler
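The runtime check described in this abstract can be illustrated with a minimal sketch: execute part of a plan, measure the cardinality of the intermediate result, and keep the current plan only if that cardinality falls inside the optimality range. The function name, the plan labels, and the use of a simple filter as the "portion of the plan" are assumptions made for illustration.

```python
def run_with_cardinality_check(rows, predicate, optimality_range):
    """Execute a filter step, then pick a plan based on the observed cardinality."""
    # Executed portion of the plan: a filter producing the intermediate result.
    intermediate = [row for row in rows if predicate(row)]
    cardinality = len(intermediate)

    low, high = optimality_range
    if low <= cardinality <= high:
        chosen = "current_plan"      # estimate held; continue as planned
    else:
        chosen = "alternative_plan"  # estimate was off; switch plans
    return chosen, cardinality


# The optimizer (hypothetically) expected at most 10 matching rows.
print(run_with_cardinality_check(range(100), lambda r: r % 2 == 0, (0, 10)))
# A range that brackets the true cardinality keeps the current plan.
print(run_with_cardinality_check(range(100), lambda r: r % 2 == 0, (40, 60)))
```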
  • Patent number: 11074258
    Abstract: A system includes reception of a first query including a first temporary materialized view, where the first temporary materialized view is associated with a first view and a first temporary table. The first temporary table is refreshed if it is determined that the first temporary materialized view is stale, and the first query is executed using a result set cached in the first temporary table. The first temporary table caches the result set for the remainder of the current database session.
    Type: Grant
    Filed: May 15, 2019
    Date of Patent: July 27, 2021
    Assignee: SAP SE
    Inventors: Andrzej Sarapuk, Swapnil Kasaliwal, Abhishek Gupta, Purva Joshi
  • Patent number: 11068504
    Abstract: A relational database storage method according to one embodiment generates a join graph including tables and edges based on either a database schema including referential constraints among database tables or a query workload containing a set of join predicates, classifies the type of tables based on the cost of a join operation between the tables included in the join graph, classifies the type of edges in the join graph on the basis of the type of the tables connected to each other by the edge, adds a virtual edge between two tables satisfying the indirect join condition, and generates a database partitioning scheme for the join graph based on the set of tables in the join graph and their type, the set of edges in the join graph and their type, and virtual edges.
    Type: Grant
    Filed: January 11, 2017
    Date of Patent: July 20, 2021
    Assignee: DAEGU GYEONGBUK INSTITUTE OF SCIENCE AND TECHNOLOGY
    Inventors: Yoon Min Nam, Dong Hyoung Han, Min Soo Kim
  • Patent number: 11068520
    Abstract: A method is executed at a computer system to retrieve data from a database. Upon receiving a database query, a database engine of the computer system parses the query to form an operator tree including a plurality of join operators. For each of the plurality of clauses, the database engine adds to the operator tree a respective node that specifies a mark join operator, a single join operator, an inner join operator, or an outer join operator. Specifically, the database engine adds the mark join operator when the respective clause includes one of a predetermined set of predicate subqueries, and adds the single join operator when the respective clause includes a scalar subquery. The database engine performs one or more optimization passes on the operator tree to form an optimized execution plan, and executes the optimized execution plan to retrieve a result set from the database.
    Type: Grant
    Filed: February 6, 2018
    Date of Patent: July 20, 2021
    Assignee: TABLEAU SOFTWARE, INC.
    Inventors: Thomas Neumann, Viktor Leis, Alfons Kemper, Tobias Muehlbauer, Jan Finis
  • Patent number: 11055284
    Abstract: A database engine receives a database query that specifies retrieving data from a data source. The database engine parses the query to build an operator tree that includes a TableScan operator configured to scan a table from the data source to produce outputs corresponding to a single data field from the table, and includes a GroupBy operator that groups rows of the table according to the data field. The database engine generates and executes code corresponding to the operator tree to retrieve a result set. When the TableScan operator is a child of the GroupBy operator and the outputs are independent of duplicate input rows from the table, execution of the TableScan operator comprises, for each storage block of rows from the table: determining a storage compression scheme for encoding the data field and, for certain encodings, using the encoding to produce the outputs without duplication of rows.
    Type: Grant
    Filed: March 30, 2018
    Date of Patent: July 6, 2021
    Assignee: TABLEAU SOFTWARE, INC.
    Inventors: Adrian Vogelsgesang, Manuel Then, Jan Finis, Tobias Muehlbauer, Richard Wesley, Pawel Terlecki
  • Patent number: 11036733
    Abstract: A method includes: obtaining, by an inner table node that stores a portion of an inner table, a portion of an outer table from each of a plurality of outer table nodes, wherein a size of the outer table is less than a size of the inner table; reading, by the inner table node, the outer table by table joining threads or table joining processes of the inner table node; matching, by the table joining threads or the table joining processes, the outer table with the portion of the inner table stored at the inner table node; and outputting, by the inner table node, data from the outer table and the portion of the inner table that match each other.
    Type: Grant
    Filed: May 26, 2020
    Date of Patent: June 15, 2021
    Assignee: Ant Financial (Hang Zhou) Network Technology Co., Ltd.
    Inventors: Bin Liu, Yi Pan, Bo Zhang, Huang Yu
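The join strategy in this abstract resembles what is commonly called a broadcast join: each node holding part of the large inner table gathers the whole, smaller outer table and joins locally. The sketch below is a single-process illustration under that reading; it omits the joining threads or processes mentioned in the abstract, and `broadcast_join` and the sample data are hypothetical.

```python
def broadcast_join(outer_fragments, inner_partitions, key):
    """Join a small outer table against each node's portion of the inner table."""
    # Each inner-table node obtains every fragment of the smaller outer table.
    outer_table = [row for fragment in outer_fragments for row in fragment]

    # Index the outer table by join key so each inner row is matched in O(1).
    lookup = {}
    for row in outer_table:
        lookup.setdefault(row[key], []).append(row)

    results = []
    for node, inner_rows in inner_partitions.items():
        for inner in inner_rows:
            for outer in lookup.get(inner[key], []):
                results.append((node, outer, inner))
    return results


outer_fragments = [[{"id": 1, "name": "a"}], [{"id": 2, "name": "b"}]]
inner_partitions = {
    "n1": [{"id": 1, "amount": 10}, {"id": 3, "amount": 5}],
    "n2": [{"id": 2, "amount": 7}],
}
joined = broadcast_join(outer_fragments, inner_partitions, "id")
print(joined)
```

Moving the smaller table avoids redistributing the larger inner table across the network, which is the usual motivation for this kind of join.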
  • Patent number: 11036735
    Abstract: Techniques for efficient execution of queries. A query plan generated for the query is optimized and rewritten as an enhanced query plan, which when executed, uses fewer CPU cycles and thus executes faster than the original query plan. The query for which the enhanced query plan is generated thus executes faster without compromising the results obtained or the data being queried. Optimization includes identifying a set of one or more fact scan operations in the original query plan and then, in the rewritten enhanced query plan, associating one or more dimension context predicate conditions with one or more of the set of fact scan operations. This reduces the overall cost of scanning and/or processing fact records in the enhanced query plan compared to the original query plan and makes the enhanced query plan execute faster than the original query plan.
    Type: Grant
    Filed: January 15, 2019
    Date of Patent: June 15, 2021
    Assignee: Oracle International Corporation
    Inventor: Harish Butani
  • Patent number: 11030256
    Abstract: A user selects a data source, and a computer displays a data visualization in a data visualization user interface according to the data source. The data visualization includes visual data marks representing data from the data source. The user selects a subset of the visual data marks. In response, the computer displays a view data window having a summary of the selected data marks. The computer obtains a data model encoding the data source as a tree of logical tables, each including one or more logical fields. Each logical field corresponds to either a data field or a calculation that spans one or more logical tables. Each edge of the tree connects two logical tables that are related. The computer identifies aggregate measures corresponding to the selected data marks, and displays each aggregate measure in the view data window.
    Type: Grant
    Filed: November 5, 2019
    Date of Patent: June 8, 2021
    Assignee: Tableau Software, Inc.
    Inventors: Justin Talbot, Amy Nicole Forstrom, Daniel Cory, Christian Gabriel Eubank, Jeffrey Mark Booth, Jr., Nicolas Borden
  • Patent number: 11030212
    Abstract: A computer-implemented method includes: receiving, by a computer device, a database query of a database from a database client; determining, by the computer device and based on the database query, a to-be-masked portion of data; sending, by the computer device, the database query to a federated database system for the to-be-masked portion of the data; receiving, by the computer device and in response to the sending, masked resultant data from the federated database system; and forwarding, by the computer device, the masked resultant data to the database client. The to-be-masked portion of the data is masked in at least one view of the database residing on the federated database system.
    Type: Grant
    Filed: September 6, 2018
    Date of Patent: June 8, 2021
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Michal Bodziony, Marcin Filip, Monika Piatek, Lukasz S. Studzienny, Andrzej Laskawiec, Tomasz Zatorski, Marcin Luczynski
  • Patent number: 11030186
    Abstract: Systems, methods, and devices for incrementally refreshing a materialized view are disclosed. A method includes generating a materialized view based on a source table. The method includes merging the source table and the materialized view to generate a merged table to identify whether an update has been executed on the source table that is not reflected in the materialized view. The method includes, in response to detecting an update made to the source table that is not reflected in the materialized view, applying the update to the materialized view.
    Type: Grant
    Filed: October 24, 2019
    Date of Patent: June 8, 2021
    Assignee: Snowflake Inc.
    Inventors: Thierry Cruanes, Benoit Dageville, Prasanna Rajaperumal, Jiaqi Yan
  • Patent number: 11023324
    Abstract: Harmonization of failure domains in a distributed system is provided herein. A data storage system as described herein can include a memory that stores computer executable components and a processor that executes computer executable components stored in the memory. The computer executable components can include a node assignment component that assigns computing nodes in the data storage system to respective failure domains of a set of data failure domains, a hashing component that generates hashing information associated with respective computing nodes of the computing nodes assigned to a selected failure domain of the set of failure domains, and a group update component that designates at least one computing node of the computing nodes in the selected data failure domain to perform an operation associated with the hashing information.
    Type: Grant
    Filed: January 31, 2019
    Date of Patent: June 1, 2021
    Assignee: EMC IP HOLDING COMPANY LLC
    Inventors: Suraj Raju, Travis Lane, Douglas Kilpatrick
  • Patent number: 11016947
    Abstract: A system has a distributed database with database partitions distributed across worker nodes connected by a network. An analytical view recommendation engine defines an analytical view comprising attributes and measures defined prior to the receipt of a query. The analytical view is maintained as a data unit separate from the distributed database. The analytical view recommendation engine includes instructions executed by a processor to identify a poorly performing report, evaluate queries associated with the poorly performing report, create a recommended analytical view to enhance the performance of the poorly performing report, and deploy the recommended analytical view.
    Type: Grant
    Filed: December 20, 2016
    Date of Patent: May 25, 2021
    Assignee: Cloudera, Inc.
    Inventors: Priyank Patel, Anjali Betawadkar-Norwood, Douglas J. Cameron, Shant Hovsepian, Sushil Thomas
  • Patent number: 11010407
    Abstract: A method and apparatus for managing a set of processors for a set of queries is described. In an exemplary embodiment, a device receives a set of queries for a data warehouse, the set of queries including one or more queries to be processed by the data warehouse. The device further provisions a set of processors from a first plurality of processors, where the set of processors is to process the set of queries, and a set of storage resources to store data for the set of queries. In addition, the device monitors a utilization of the set of processors as the set of processors processes the set of queries. The device additionally updates a number of the processors in the set of processors provisioned based on the utilization. Furthermore, the device processes the set of queries using the updated set of processors.
    Type: Grant
    Filed: June 18, 2020
    Date of Patent: May 18, 2021
    Assignee: Snowflake Inc.
    Inventors: Thierry Cruanes, Benoit Dageville, Marcin Zukowski
  • Patent number: 11010387
    Abstract: Embodiments relate to efficiently performing an inner join on segments from a data source, where the segments have wildcards in join columns. An algorithm for an inner segment join operator may build a location map that describes locations, by segment and column, of values in the join columns of rows in the segments. A map of wildcard locations, by segment and column, may also be constructed. The location map may be supplemented with information from the wildcard map. The data source is scanned and contents of rows in the data source are queried against the location map to join rows with segments that match due to either matching values or wildcard presence. A user interface may be provided to facilitate construction of joins using the inner segment join operator.
    Type: Grant
    Filed: October 6, 2017
    Date of Patent: May 18, 2021
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Benjamin Mark Zhitomirsky, Roy Ben Ofer, Adi Eldar
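    Illustration: one possible reading of the location map and wildcard map in the abstract above, as a minimal sketch. The `"*"` marker, the dictionary layout, and the function names are assumptions for the example, not details from the patent.

```python
from collections import defaultdict

WILDCARD = "*"  # assumed marker for a wildcard in a join column

def build_maps(segments, join_cols):
    """Index segments by (column, value) and record wildcard positions."""
    locations = defaultdict(set)   # (column, value) -> segment ids
    wildcards = defaultdict(set)   # column -> segment ids holding a wildcard
    for seg_id, seg in enumerate(segments):
        for col in join_cols:
            if seg[col] == WILDCARD:
                wildcards[col].add(seg_id)
            else:
                locations[(col, seg[col])].add(seg_id)
    return locations, wildcards

def segment_join(rows, segments, join_cols):
    """Join each row to every segment that matches by value or by wildcard."""
    locations, wildcards = build_maps(segments, join_cols)
    joined = []
    for row in rows:
        matches = set(range(len(segments)))
        for col in join_cols:
            by_value = locations.get((col, row[col]), set())
            matches &= by_value | wildcards.get(col, set())
        joined.extend((row, segments[i]) for i in sorted(matches))
    return joined
```

    Each row is checked against the maps only, so the segments themselves are scanned once, when the maps are built.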
  • Patent number: 11010362
    Abstract: In a method for caching a generated query plan for time series data, a query plan for time series data is generated based on a query comprising an expression, the query plan including a path of execution for resolving the query. The path of execution of the query plan to resolve the query is executed. A result of the path of execution is returned as a response to the query. The query plan is cached for retrieval and execution responsive to receiving another query that matches the query, such that the query plan can be used to resolve the another query.
    Type: Grant
    Filed: April 12, 2018
    Date of Patent: May 18, 2021
    Assignee: VMware, Inc.
    Inventor: Clement Pang
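    Illustration: the plan caching described above could look roughly like this. It is a sketch under stated assumptions: `PlanCache`, `toy_planner`, and the rule that two queries "match" when they are equal after whitespace normalization are all hypothetical.

```python
class PlanCache:
    """Cache executable plans keyed by normalized query text."""

    def __init__(self, planner):
        self._planner = planner   # callable: query text -> executable plan
        self._plans = {}
        self.misses = 0           # number of times a plan had to be generated

    def execute(self, query, data):
        key = " ".join(query.split())   # assumed matching rule
        plan = self._plans.get(key)
        if plan is None:
            self.misses += 1
            plan = self._planner(key)
            self._plans[key] = plan
        return plan(data)

def toy_planner(query):
    """Hypothetical planner that supports only sum(<series>) and max(<series>)."""
    func, _, rest = query.partition("(")
    series = rest.rstrip(")").strip()
    op = {"sum": sum, "max": max}[func.strip()]
    return lambda data: op(data[series])
```

    A second query that matches a cached one reuses the stored plan, so `misses` does not increase.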
  • Patent number: 10997201
    Abstract: A method, apparatus, and system for join operations of a plurality of relations that are distributed over a plurality of storage locations over a network of computing components.
    Type: Grant
    Filed: July 19, 2018
    Date of Patent: May 4, 2021
    Assignee: SNOWFLAKE INC.
    Inventors: Benoit Dageville, Thierry Cruanes, Marcin Zukowski, Allison Waingold Lee, Philipp Thomas Unterbrunner
  • Patent number: 10986185
    Abstract: The present disclosure relates to systems, methods, and computer-readable media for implementing a device management system that controls a plurality of remotely managed computing devices by way of one or more delta proxies. The systems disclosed herein can set up, generate, or otherwise implement a delta proxy that includes interfaces for facilitating communication between a master control system and any number of managed devices. The systems disclosed herein can implement a delta proxy for controlling multiple managed devices via a single instance of the delta proxy. The delta proxy may include a plug-in layer including software plug-ins that facilitate control of devices of different types and functionalities. The delta proxy can additionally include a namespace layer for allocating memory on a per-device basis.
    Type: Grant
    Filed: November 16, 2018
    Date of Patent: April 20, 2021
    Assignee: SALTSTACK, INC.
    Inventors: Thomas S. Hatch, Charles R. Oldham
  • Patent number: 10983976
    Abstract: Systems and methods are disclosed for optimizing full-spectrum cardinality approximations on big data by exploiting an underlying relationship between LogLog counting estimation techniques and order statistics-based estimation techniques. To accomplish the foregoing, a multiset of objects that each corresponds to one of a plurality of objects associated with a resource are obtained by a computing device. A compound data object is populated by the computing device with data that is derived based on generated hash values that correspond to each object in the obtained multiset. The populated compound data object is processed utilizing a processor with a full-spectrum unified estimation operation that can accurately determine a cardinality estimate for the obtained multiset, utilizing considerably less resources when compared to traditional and state of the art techniques.
    Type: Grant
    Filed: January 24, 2017
    Date of Patent: April 20, 2021
    Assignee: Verizon Media Inc.
    Inventor: Jason Jinshui Qin
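    Illustration: the abstract above refers to order statistics-based estimation. The sketch below shows one well-known estimator of that family, k-minimum values (KMV), and is not the unified technique the patent claims. The hash choice (SHA-256 truncated to 64 bits) and the default `k` are assumptions.

```python
import hashlib
import heapq

def hash64(item):
    """Map an item to a 64-bit integer that behaves like a uniform draw."""
    digest = hashlib.sha256(str(item).encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big")

def kmv_estimate(items, k=256):
    """Estimate the number of distinct items from the k smallest hashes."""
    heap = []     # negated hashes, so heap[0] is the largest hash kept
    kept = set()  # hashes currently in the heap, to skip duplicates
    for item in items:
        h = hash64(item)
        if h in kept:
            continue
        if len(heap) < k:
            heapq.heappush(heap, -h)
            kept.add(h)
        elif h < -heap[0]:
            evicted = -heapq.heappushpop(heap, -h)
            kept.discard(evicted)
            kept.add(h)
    if len(heap) < k:
        return float(len(heap))   # fewer than k distinct hashes: exact count
    kth_smallest = -heap[0]
    return (k - 1) * 2**64 / kth_smallest
```

    Memory stays bounded by `k` no matter how large the multiset is, and repeated items never change the result.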
  • Patent number: 10970282
    Abstract: Systems, methods, and devices, for managing data skew during a join operation are disclosed. A method includes computing a hash value for a join operation and detecting data skew on a probe side of the join operation at a runtime of the join operation using a lightweight sketch data structure. The method includes identifying a frequent probe-side join key on the probe side of the join operation during a probe phase of the join operation. The method includes identifying a frequent build-side row having a build-side join key corresponding with the frequent probe-side join key. The method includes asynchronously distributing the frequent build-side row to one or more remote servers.
    Type: Grant
    Filed: June 11, 2018
    Date of Patent: April 6, 2021
    Assignee: Snowflake Inc.
    Inventors: Florian Andreas Funke, Thierry Cruanes, Benoit Dageville, Marcin Zukowski
  • Patent number: 10970283
    Abstract: Systems, methods, and devices, for managing data skew during a join operation are disclosed. A method includes computing a hash value for a join operation and detecting data skew on a probe side of the join operation at a runtime of the join operation using a lightweight sketch data structure. The method includes identifying a frequent probe-side join key on the probe side of the join operation during a probe phase of the join operation. The method includes identifying a frequent build-side row having a build-side join key corresponding with the frequent probe-side join key. The method includes asynchronously distributing the frequent build-side row to one or more remote servers.
    Type: Grant
    Filed: December 17, 2019
    Date of Patent: April 6, 2021
    Assignee: Snowflake Inc.
    Inventors: Florian Andreas Funke, Thierry Cruanes, Benoit Dageville, Marcin Zukowski
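    Illustration: the abstract above does not say which lightweight sketch is used. The example below substitutes a Misra-Gries summary to find frequent probe-side keys and then selects the matching build-side rows. The function names, the `"key"` field, and the default `k` are assumptions, and the asynchronous distribution step is left out.

```python
def frequent_keys(probe_keys, k):
    """Misra-Gries summary using at most k - 1 counters.

    Any key occurring in more than 1/k of the stream is guaranteed to be
    among the returned candidates; some infrequent keys may appear as well.
    """
    counters = {}
    for key in probe_keys:
        if key in counters:
            counters[key] += 1
        elif len(counters) < k - 1:
            counters[key] = 1
        else:
            # No free counter: decrement all and drop those that reach zero.
            for other in list(counters):
                counters[other] -= 1
                if counters[other] == 0:
                    del counters[other]
    return counters

def rows_to_broadcast(build_rows, probe_keys, k=4):
    """Pick build-side rows whose join key is a frequent probe-side key."""
    hot = frequent_keys(probe_keys, k)
    return [row for row in build_rows if row["key"] in hot]
```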
  • Patent number: 10963480
    Abstract: Methods, systems, and computer program products for a database processing system are described. A core database comprises common procedures for processing information, at least one of the common procedures configured to provide a common functionality to an analysis application. A content import database comprises content-specific procedures, at least one of the content-specific procedures configured to provide access for at least one of the common procedures to content of the content import database that conforms to a content-specific data model.
    Type: Grant
    Filed: May 5, 2017
    Date of Patent: March 30, 2021
    Assignee: SAP SE
    Inventors: Apoorv Bhargava, Madathiveetil Bipin, Markus Schmidt-Karaca, Ismail Basha, Gonda Marcusse, Anthony Arun B, Vishnu Kare, Naveed Mohammed
  • Patent number: 10929400
    Abstract: A method, a computer program product and a computer system are provided. Attribute value information contains at least a minimum value representing a smallest value of a first attribute and a maximum value representing a largest value of the first attribute, thereby defining a first range of values of the first attribute. A received query against a data table requests one or more values of at least the first attribute that are covered by the first range of values. The attribute value information may be used for selecting a data block of the data table as a candidate potentially including at least part of the requested one or more values and scanning the data block. In response to determining that the data block does not include the one or more requested values, the attribute value information may be updated accordingly.
    Type: Grant
    Filed: October 26, 2016
    Date of Patent: February 23, 2021
    Assignee: International Business Machines Corporation
    Inventors: Felix O. Beier, Andreas Brodt, Namik Hrle, Oliver Schiller
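    Illustration: a minimal sketch of min/max block pruning with a metadata update after a fruitless scan, as the abstract above outlines. Recording the missed value in a `known_absent` set is an assumed form of the update; the patent may refine the metadata differently.

```python
class Block:
    """A data block with min/max metadata for one attribute."""

    def __init__(self, values):
        self.values = list(values)
        self.lo, self.hi = min(self.values), max(self.values)
        self.known_absent = set()   # assumed form of the updated metadata
        self.scans = 0              # how often the block was actually scanned

    def lookup(self, wanted):
        in_range = self.lo <= wanted <= self.hi
        if not in_range or wanted in self.known_absent:
            return False                    # pruned without scanning
        self.scans += 1
        found = wanted in self.values       # stands in for the block scan
        if not found:
            self.known_absent.add(wanted)   # remember the false positive
        return found
```

    A value that falls inside the range but is missing costs one scan the first time and none afterwards.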
  • Patent number: 10922294
    Abstract: Methods and apparatuses for determining set-membership using Single Instruction Multiple Data (“SIMD”) architecture are presented herein. Specifically, methods and apparatuses are discussed for determining, in parallel, whether multiple values in a first set of values are members of a second set of values. Many of the methods and systems discussed herein are applied to determining whether one or more rows in a dictionary-encoded column of a database table satisfy one or more conditions based on the dictionary-encoded column. However, the methods and systems discussed herein may apply to many applications executed on a SIMD processor using set-membership tests.
    Type: Grant
    Filed: January 17, 2018
    Date of Patent: February 16, 2021
    Assignee: Oracle International Corporation
    Inventors: Shasank Kisan Chavan, Phumpong Watanaprakornkul
  • Patent number: 10915514
    Abstract: Methods and apparatuses for determining set-membership using Single Instruction Multiple Data (“SIMD”) architecture are presented herein. Specifically, methods and apparatuses are discussed for determining, in parallel, whether multiple values in a first set of values are members of a second set of values. Many of the methods and systems discussed herein are applied to determining whether one or more rows in a dictionary-encoded column of a database table satisfy one or more conditions based on the dictionary-encoded column. However, the methods and systems discussed herein may apply to many applications executed on a SIMD processor using set-membership tests.
    Type: Grant
    Filed: January 17, 2018
    Date of Patent: February 9, 2021
    Assignee: Oracle International Corporation
    Inventors: Shasank Kisan Chavan, Phumpong Watanaprakornkul
  • Patent number: 10909119
    Abstract: Examples disclosed herein relate to accessing electronic databases. Some examples disclosed herein may include partitioning a computation task into subtasks. A processing node of a computation engine may generate a database query for retrieving an electronic data segment associated with at least one of the subtasks from a database. The database query may include pre-processing instructions for a database management system (DBMS) associated with the database to pre-process the electronic data segment before providing the electronic data segment to the processing node. The pre-processing instructions may include at least one of: filtering, projection, join, aggregation, count, and user-defined instructions. The generated query may be provided to the DBMS.
    Type: Grant
    Filed: July 6, 2016
    Date of Patent: February 2, 2021
    Assignee: MICRO FOCUS LLC
    Inventors: Rui Liu, Qiming Chen, Meichun Hsu, Jeffrey LeFevre, Maria Castellanos
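    Illustration: the idea of sending pre-processing instructions to the DBMS with each subtask query can be sketched with Python's built-in `sqlite3` module. The `sales` table, its columns, and the partitioning by `id` range are hypothetical; filtering, aggregation, and count run inside the database, and the processing node only merges the partial results.

```python
import sqlite3

def make_subtask_queries(id_ranges):
    """Build one query per subtask, with filter and aggregation pushed down."""
    sql = ("SELECT region, SUM(amount), COUNT(*) FROM sales "
           "WHERE id >= ? AND id < ? GROUP BY region ORDER BY region")
    return [(sql, bounds) for bounds in id_ranges]

def run_subtasks(conn, id_ranges):
    """Run every subtask query and merge the pre-aggregated segments."""
    totals = {}
    for sql, bounds in make_subtask_queries(id_ranges):
        for region, amount, count in conn.execute(sql, bounds):
            prev_amount, prev_count = totals.get(region, (0, 0))
            totals[region] = (prev_amount + amount, prev_count + count)
    return totals
```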
  • Patent number: 10909481
    Abstract: A method including calculating processing cost data of the general data table of the data common layer; determining a general data table on which the external data table of the non-data common layer is dependent; and calculating use cost data of the external data table according to the processing cost data of the general data table. In evaluation of the cost of each general data table of the data common layer, instead of taking into account the storage and computation consumption of the current data table in isolation, several upstream data tables and multiple siblings of the data table are also taken into account comprehensively. The processing cost of the general data table is reasonably and accurately assessed, thus reflecting the advantages and disadvantages of data model construction of the data common layer, and providing decision-making support for the optimization and operation of the models of the data common layer.
    Type: Grant
    Filed: July 20, 2018
    Date of Patent: February 2, 2021
    Assignee: Alibaba Group Holding Limited
    Inventors: Wei Wang, Jinpeng Luo, Min Pan
  • Patent number: 10901990
    Abstract: A database engine receives a human-readable database query that includes a plurality of conditional expressions. The database engine parses the database query to build an operator tree that includes a subtree corresponding to each of the conditional expressions. The database engine identifies a subexpression that appears in two or more of the conditional expressions. The subexpression is executed conditionally. The database engine hoists the subexpression outside of the conditional expression so that it is executed unconditionally. The database engine modifies the operator tree to specify computation of a value for the subexpression a first time and to reuse the computed value when the subexpression is subsequently encountered. The database engine executes the modified operator tree to form a result set corresponding to the database query, thereby evaluating the subexpression only a single time for each row of input data and returns the result set.
    Type: Grant
    Filed: January 31, 2018
    Date of Patent: January 26, 2021
    Assignee: Tableau Software, Inc.
    Inventors: Adrian Vogelsgesang, Jan Finis, Manuel Then
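    Illustration: a toy version of hoisting a repeated subexpression so that it is computed once per row. The tuple-based expression format, the operator table, and the function names are assumptions for the example and do not reflect the database engine's actual operator tree.

```python
from collections import Counter

OPS = {"+": lambda a, b: a + b, "*": lambda a, b: a * b, ">": lambda a, b: a > b}

def subtrees(expr):
    """Yield every operator node (tuple) in the expression tree."""
    if isinstance(expr, tuple):
        yield expr
        for child in expr[1:]:
            yield from subtrees(child)

def evaluate(expr, row, cache, calls):
    if isinstance(expr, str):          # column reference
        return row[expr]
    if not isinstance(expr, tuple):    # literal
        return expr
    if expr in cache:                  # hoisted value, reuse it
        return cache[expr]
    if expr[0] == "if":
        _, cond, then, otherwise = expr
        chosen = then if evaluate(cond, row, cache, calls) else otherwise
        return evaluate(chosen, row, cache, calls)
    op, left, right = expr
    calls[expr] += 1                   # count real computations of this node
    return OPS[op](evaluate(left, row, cache, calls),
                   evaluate(right, row, cache, calls))

def run(expr, row):
    """Hoist repeated subexpressions, then evaluate the whole tree."""
    counts = Counter(subtrees(expr))
    shared = sorted((e for e, n in counts.items() if n > 1 and e[0] != "if"),
                    key=lambda e: sum(1 for _ in subtrees(e)))
    cache, calls = {}, Counter()
    for sub in shared:                 # computed unconditionally, smallest first
        cache[sub] = evaluate(sub, row, cache, calls)
    return evaluate(expr, row, cache, calls), calls
```

    In `("if", (">", t, 100), ("*", t, 0.9), t)` with `t = ("*", "price", "qty")`, the product appears three times but is computed only once per row.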
  • Patent number: 10878493
    Abstract: A big data sampling system indexed by at least one key and including a big data table comprising complete historical data corresponding to at least one financial instrument, a query table indexed by a query identifier associated with each of the at least one financial instrument and including references to the complete historical data of the big data table, and an inner join corresponding to the query table and the big data table. The inner join takes in a sample of the complete historical data corresponding to an input query. The sample is extracted by byte range download from a file server and is associated with the query identifier of the query table. The sample corresponds with the complete historical data of the at least one financial instrument. An output of the inner join identifies the complete historical data of the financial instrument in response to the query.
    Type: Grant
    Filed: March 13, 2018
    Date of Patent: December 29, 2020
    Inventor: Erland Wittkotter
  • Patent number: 10872095
    Abstract: A system and computer implemented method for execution of aggregation expressions on a distributed non-relational database system is provided. According to an aspect, an operation is provided to perform the aggregation operation. An indication of at least one foreign collection of documents is accepted. An indication of at least one variable for at least one local field of at least one local collection of documents is accepted. An indication of a pipeline for execution on the at least one local collection of documents and the at least one foreign collection of documents is accepted. An operation comparing the local field of the at least one local collection of documents and the foreign field of the at least one foreign collection of documents is performed. Responsive to the operation, data from the at least one local collection of documents and the at least one foreign collection of documents is aggregated.
    Type: Grant
    Filed: June 20, 2018
    Date of Patent: December 22, 2020
    Assignee: MongoDB, Inc.
    Inventors: Eliot Horowitz, David Lenox Storch, Charles William Swanson
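    Illustration: the aggregation in the abstract above takes a foreign collection, variables bound to local fields, and a pipeline. A plain-Python sketch of that shape follows. The function `lookup`, its parameters, and the use of predicate functions as pipeline stages are assumptions for the example and are not the database's API.

```python
def lookup(local_docs, foreign_docs, let, pipeline, as_field):
    """Attach to each local document the foreign documents passing the pipeline.

    let:      variable name -> local field name
    pipeline: list of predicates taking (foreign_doc, variables)
    """
    results = []
    for doc in local_docs:
        variables = {name: doc[field] for name, field in let.items()}
        matched = foreign_docs
        for stage in pipeline:
            matched = [f for f in matched if stage(f, variables)]
        results.append({**doc, as_field: matched})
    return results

orders = [{"item": "a", "qty": 2}, {"item": "b", "qty": 5}]
stock = [{"sku": "a", "instock": 3}, {"sku": "b", "instock": 1},
         {"sku": "a", "instock": 1}]
joined = lookup(
    orders, stock,
    let={"order_item": "item", "order_qty": "qty"},
    pipeline=[lambda f, v: f["sku"] == v["order_item"],
              lambda f, v: f["instock"] >= v["order_qty"]],
    as_field="stockdata",
)
```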
  • Patent number: 10838979
    Abstract: A method, apparatus, and system for join operations of a plurality of relations that are distributed over a plurality of storage locations over a network of computing components.
    Type: Grant
    Filed: April 24, 2020
    Date of Patent: November 17, 2020
    Assignee: Snowflake Inc.
    Inventors: Benoit Dageville, Thierry Cruanes, Marcin Zukowski, Allison Waingold Lee, Philipp Thomas Unterbrunner
  • Patent number: 10824620
    Abstract: In an example, a database system may be configured to receive configuration data including a mapping of objects of a relational database to query generation parameters; compare values in a received first query to the mapping to identify column definition data of the query generation parameters; ascertain whether to identify table definition data of the configuration data based on the column definition data; determine whether to create one or more subqueries for one or more tables based on the table definition data; in response to determining to create the one or more subqueries, join a table of the one or more tables to a root table based on table relationship data of the configuration data; and build a second relational datastore query using the one or more subqueries.
    Type: Grant
    Filed: October 21, 2016
    Date of Patent: November 3, 2020
    Assignee: SALESFORCE.COM, INC.
    Inventor: Jonathan Menzies-Smith
  • Patent number: 10824622
    Abstract: A system, computer-implemented method, and a computer-readable storage medium for determining estimations to statistical questions. Data statistics used in evaluating estimations to statistical questions are gathered from multiple sources in a database management system. A statistical question comprising, for example, cardinality estimations is received, wherein the question is posed by different database components, such as query optimizer, or execution engine. Estimation for the statistical question is computed, where the estimation is based on multiple data statistics and without using the data referred to in the question.
    Type: Grant
    Filed: February 25, 2014
    Date of Patent: November 3, 2020
    Assignee: SAP SE
    Inventors: Anisoara Nica, Anil Kumar Goel