Patents by Inventor Marcin Zukowski

Marcin Zukowski has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20210019335
    Abstract: The subject technology creates one or more partitions based on changes to a table, at least one of the one or more partitions overlapping with respect to values of one or more attributes with at least one of another partition and a previous partition, the creating of the one or more partitions resulting in a decrease in a degree of clustering of the table. The subject technology determines, after creating the one or more partitions, that the degree of clustering of the table is below a clustering threshold, the clustering threshold corresponding to a clustering ratio, the clustering ratio determined by at least a proportion of rows in a layout of the table that satisfy an ordering criteria based at least in part a particular attribute of the one or more attributes. The subject technology increases the degree of clustering of the table by reclustering partitions of the table.
    Type: Application
    Filed: September 24, 2020
    Publication date: January 21, 2021
    Inventors: Thierry Cruanes, Marcin Zukowski, Benoit Dageville, Jiaqi Yan
  • Publication number: 20210019336
    Abstract: The subject technology creates partitions based on changes to a table, at least one of the one or more partitions overlapping with respect to values of one or more attributes with at least one of another partition and a previous partition. The subject technology maintains states for the partitions, each state from the plurality of states representing a particular degree of clustering of the table. The subject technology determines a number of overlapping partitions and a depth of the overlapping partitions, and determines a clustering ratio based at least n part on the number of overlapping partitions and the depth. The subject technology reclusters partitions of the table to increase the clustering ratio, the clustering ratio determined by at least a proportion of rows in a layout of the table that satisfy an ordering criteria based at least in part a particular attribute of the one or more attributes.
    Type: Application
    Filed: September 24, 2020
    Publication date: January 21, 2021
    Inventors: Thierry Cruanes, Marcin Zukowski, Benoit Dageville, Jiaqi Yan
  • Patent number: 10853346
    Abstract: A computer system hosting a column-store database engine is responsive to database requests for the update and retrieval of data from within a stable data table and providing for the storage of database tuples within a column-store organized database structure. A positional delta tree data structure is implemented in the memory space of the database engine and is operatively coupled in an update data transfer path between a database engine interface and the stable data table. The positional delta tree data structure includes a differential data storage layer operative to store differential update data values in positionally defined relative reference to database tuples stored by the stable data table.
    Type: Grant
    Filed: March 10, 2012
    Date of Patent: December 1, 2020
    Assignee: ACTIAN NETHERLANDS B.V.
    Inventors: Sandor ABC Heman, Peter A. Boncz, Marcin Zukowski, Nicolaas J. Nes
  • Publication number: 20200364237
    Abstract: Example resource management systems and methods are described. In one implementation, a resource manager is configured to manage data processing tasks associated with multiple data elements. An execution platform is coupled to the resource manager and includes multiple execution nodes configured to store data retrieved from multiple remote storage devices. Each execution node includes a cache and a processor, where the cache and processor are independent of the remote storage devices. A metadata manager is configured to access metadata associated with at least a portion of the multiple data elements.
    Type: Application
    Filed: July 31, 2020
    Publication date: November 19, 2020
    Inventors: Thierry Cruanes, Benoit Dageville, Marcin Zukowski
  • Publication number: 20200364236
    Abstract: Example resource management systems and methods are described. In one implementation, a resource manager is configured to manage data processing tasks associated with multiple data elements. An execution platform is coupled to the resource manager and includes multiple execution nodes configured to store data retrieved from multiple remote storage devices. Each execution node includes a cache and a processor, where the cache and processor are independent of the remote storage devices. A metadata manager is configured to access metadata associated with at least a portion of the multiple data elements.
    Type: Application
    Filed: July 31, 2020
    Publication date: November 19, 2020
    Inventors: Thierry Cruanes, Benoit Dageville, Marcin Zukowski
  • Patent number: 10838979
    Abstract: A method, apparatus, and system for join operations of a plurality of relations that are distributed over a plurality of storage locations over a network of computing components.
    Type: Grant
    Filed: April 24, 2020
    Date of Patent: November 17, 2020
    Assignee: Snowflake Inc.
    Inventors: Benoit Dageville, Thierry Cruanes, Marcin Zukowski, Allison Waingold Lee, Philipp Thomas Unterbrunner
  • Patent number: 10838978
    Abstract: A method, apparatus, and system for join operations of a plurality of relations that are distributed over a plurality of storage locations over a network of computing components.
    Type: Grant
    Filed: April 24, 2020
    Date of Patent: November 17, 2020
    Assignee: Snowflake Inc.
    Inventors: Benoit Dageville, Thierry Cruanes, Marcin Zukowski, Allison Waingold Lee, Philipp Thomas Unterbrunner
  • Patent number: 10817540
    Abstract: A method includes storing table data for a table in a plurality of partitions and for maintaining approximate or good enough clustering. The method includes creating one or more new partitions based on changes to the table, wherein at least one of the one or more new partitions overlap with each other or previous partitions resulting in a decrease in a degree of clustering of the table. The method includes determining that a degree of clustering of the table data is below a clustering threshold. The method further includes reclustering one or more partitions of the table to improve the degree of clustering of the table in response to one or more of: determining that the degree of clustering has fallen below the clustering threshold, an explicit user command from a user, and/or as part of a DML command. Reclustering may be performed in incremental steps to iteratively improve clustering.
    Type: Grant
    Filed: September 1, 2017
    Date of Patent: October 27, 2020
    Assignee: Snowflake Inc.
    Inventors: Thierry Cruanes, Marcin Zukowski, Benoit Dageville, Jiaqi Yan
  • Publication number: 20200320096
    Abstract: A method and apparatus managing a set of processors for a set of queries is described. In an exemplary embodiment, a device receives a set of queries for a data warehouse, the set of queries including one or more queries to be processed by the data warehouse. The device further provisions a set of processors from a first plurality of processors, where the set of processors to process the set of queries, and a set of storage resources to store data for the set of queries. In addition, the device monitors a utilization of the set of processors as the set of processors processes the set of queries. The device additionally updates a number of the processors in the set of processors provisioned based on the utilization/ Furthermore, the device processes the set of queries using the updated set of processors.
    Type: Application
    Filed: June 18, 2020
    Publication date: October 8, 2020
    Inventors: Thierry CRUANES, Benoit DAGEVILLE, Marcin ZUKOWSKI
  • Patent number: 10776388
    Abstract: A system is described inkling a cache memory and a hardware processor. The hardware processor to determine that at least one of additional data storage capacity and additional processing capacity are needed based on data processing requests of a plurality of computerized query sources to be executed by an execution platform comprising a plurality of nodes, where the data processing requests are directed to database data stored on a plurality of shared storage devices collectively storing the database data. The hardware processor further to increase, in response to the determining, an amount of the data cached by the execution platform by adding one or more nodes to the plurality of nodes.
    Type: Grant
    Filed: January 13, 2020
    Date of Patent: September 15, 2020
    Assignee: Snowflake Inc.
    Inventors: Benoit Dageville, Thierry Cruanes, Marcin Zukowski
  • Patent number: 10776389
    Abstract: A method, apparatus, and system for join operations of a plurality of relations that are distributed over a plurality of storage locations over a network of computing components.
    Type: Grant
    Filed: March 18, 2020
    Date of Patent: September 15, 2020
    Assignee: Snowflake Inc.
    Inventors: Benoit Dageville, Thierry Cruanes, Marcin Zukowski, Allison Waingold Lee, Philipp Thomas Unterbrunner
  • Patent number: 10776390
    Abstract: A method, apparatus, and system for join operations of a plurality of relations that are distributed over a plurality of storage locations over a network of computing components.
    Type: Grant
    Filed: March 18, 2020
    Date of Patent: September 15, 2020
    Assignee: Snowflake Inc.
    Inventors: Benoit Dageville, Thierry Cruanes, Marcin Zukowski, Allison Waingold Lee, Phillip Thomas Unterbrunner
  • Publication number: 20200265066
    Abstract: Example caching systems and methods are described. In one implementation, a method identifies multiple files used to process a query and distributes each of the multiple files to a particular execution node to execute the query. Each execution node determines whether the distributed file is stored in the execution node's cache. If the execution node determines that the file is stored in the cache, it processes the query using the cached file. If the file is not stored in the cache, the execution node retrieves the file from a remote storage device, stores the file in the execution node's cache, and processes the query using the file.
    Type: Application
    Filed: April 28, 2020
    Publication date: August 20, 2020
    Inventors: Thierry CRUANES, Benoit DAGEVILLE, Marcin ZUKOWSKI
  • Publication number: 20200257658
    Abstract: A system and method for pruning data based on metadata. The method may include receiving a query comprising a plurality of predicates and identifying one or more applicable files comprising database data satisfying at least one of the plurality of predicates. The identifying the one or more applicable files including reading metadata stored in a metadata store that is separate from the database data. The method further includes pruning inapplicable files comprising database data that does not satisfy at least one of the plurality of predicates to create a reduced set of files and reading the reduced set of files to execute the query.
    Type: Application
    Filed: April 29, 2020
    Publication date: August 13, 2020
    Inventors: Thierry CRUANES, Benoit DAGEVILLE, Ashish MOTIVALA, Marcin ZUKOWSKI
  • Publication number: 20200257703
    Abstract: Example resource provisioning systems and methods are described. In one implementation, multiple processing resources are provided within a data warehouse. The processing resources include at least one processor and at least one storage device. At least one query to process database data is received. At least some of the processing resources may process the database data. When a processing capacity of the processing resources has reached a threshold processing capacity, the processing capacity is automatically scaled by adding at least one additional processor to the data warehouse.
    Type: Application
    Filed: April 29, 2020
    Publication date: August 13, 2020
    Inventors: Benoit DAGEVILLE, Thierry CRUANES, Marcin ZUKOWSKI
  • Publication number: 20200257701
    Abstract: A method, apparatus, and system for join operations of a plurality of relations that are distributed over a plurality of storage locations over a network of computing components.
    Type: Application
    Filed: April 24, 2020
    Publication date: August 13, 2020
    Inventors: Benoit DAGEVILLE, Thierry CRUANES, Marcin Zukowski, Allison Waingold LEE, Philipp Thomas Unterbrunner
  • Publication number: 20200257702
    Abstract: A method, apparatus, and system for join operations of a plurality of relations that are distributed over a plurality of storage locations over a network of computing components.
    Type: Application
    Filed: April 24, 2020
    Publication date: August 13, 2020
    Inventors: Benoit DAGEVILLE, Thierry CRUANES, Marcin Zukowski, Allison Waingold LEE, Philipp Thomas Unterbrunner
  • Publication number: 20200226148
    Abstract: Example resource provisioning systems and methods are described. In one implementation, multiple processing resources are provided within a data warehouse. The processing resources include at least one processor and at least one storage device. At least one query to process database data is received. At least some of the processing resources may process the database data. When a processing capacity of the processing resources has reached a threshold processing capacity, the processing capacity is automatically scaled by adding at least one additional processor to the data warehouse.
    Type: Application
    Filed: March 23, 2020
    Publication date: July 16, 2020
    Inventors: Benoit DAGEVILLE, Thierry CRUANES, Marcin ZUKOWSKI
  • Publication number: 20200226147
    Abstract: A method, apparatus, and system for join operations of a plurality of relations that are distributed over a plurality of storage locations over a network of computing components.
    Type: Application
    Filed: March 18, 2020
    Publication date: July 16, 2020
    Inventors: Benoit DAGEVILLE, Thierry CRUANES, Marcin Zukowski, Allison Waingold LEE, Philipp Thomas Unterbrunner
  • Publication number: 20200218733
    Abstract: A method, apparatus, and system for join operations of a plurality of relations that are distributed over a plurality of storage locations over a network of computing components.
    Type: Application
    Filed: March 18, 2020
    Publication date: July 9, 2020
    Inventors: Benoit DAGEVILLE, Thierry CRUANES, Marcin Zukowski, Allison Waingold LEE, Phillip Thomas Unterbrunner