Patents by Inventor Marcin Zukowski

Marcin Zukowski has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20200210450
    Abstract: Example resource management systems and methods are described. In one implementation, a resource manager is configured to receive a query to process database data. A plurality of virtual processors is operatively coupled to the resource manager. Each virtual processor comprises at least one CPU core and one storage device. The resource manager is configured to allocate computing resources from the plurality of virtual processors to process the database data in response to the query.
    Type: Application
    Filed: March 11, 2020
    Publication date: July 2, 2020
    Inventors: Benoit Dageville, Thierry Cruanes, Marcin Zukowski
  • Publication number: 20200201883
    Abstract: A method and apparatus managing a set of processors for a set of queries is described. In an exemplary embodiment, a device receives a set of queries for a data warehouse, the set of queries including one or more queries to be processed by the data warehouse. The device further provision a set of processors from a first plurality of processors, where the set of processors to process the set of queries and a set of storage resources to store data for the set of queries. In addition, the device monitors a utilization of the set of processors as the set of processors processes the set of queries. The device additionally updates an amount of the processor resources in the set of processors provisioned based on the utilization. Furthermore, the device processes the set of queries using the updated set of processors.
    Type: Application
    Filed: March 5, 2020
    Publication date: June 25, 2020
    Inventors: Thierry CRUANES, Benoit DAGEVILLE, Marcin ZUKOWSKI
  • Publication number: 20200201880
    Abstract: Example caching systems and methods are described. In one implementation, a method identifies multiple files used to process a query and distributes each of the multiple files to a particular execution node to execute the query. Each execution node determines whether the distributed file is stored in the execution node's cache. If the execution node determines that the file is stored in the cache, it processes the query using the cached file. If the file is not stored in the cache, the execution node retrieves the file from a remote storage device, stores the file in the execution node's cache, and processes the query using the file.
    Type: Application
    Filed: February 28, 2020
    Publication date: June 25, 2020
    Inventors: Thierry CRUANES, Benoit DAGEVILLE, Marcin ZUKOWSKI
  • Publication number: 20200201882
    Abstract: A method and apparatus managing a set of processors for a set of queries is described. In an exemplary embodiment, a device receives a set of queries for a data warehouse, the set of queries including one or more queries to be processed by the data warehouse. The device further provisions a set of processors from a first plurality of processors, where the set of processors to process the set of queries, and a set of storage resources to store data for the set of queries. In addition, the device monitors a utilization of the set of processors as the set of processors processes the set of queries. The device additionally updates a number of the processors in the set of processors provisioned based on the utilization/Furthermore, the device processes the set of queries using the updated set of processors.
    Type: Application
    Filed: March 5, 2020
    Publication date: June 25, 2020
    Inventors: Thierry CRUANES, Benoit DAGEVILLE, Marcin ZUKOWSKI
  • Publication number: 20200201881
    Abstract: Example caching systems and methods are described. In one implementation, a method identifies multiple files used to process a query and distributes each of the multiple files to a particular execution node to execute the query. Each execution node determines whether the distributed file is stored in the execution node's cache. If the execution node determines that the file is stored in the cache, it processes the query using the cached file. If the file is not stored in the cache, the execution node retrieves the file from a remote storage device, stores the file in the execution node's cache, and processes the query using the file.
    Type: Application
    Filed: February 28, 2020
    Publication date: June 25, 2020
    Inventors: Thierry CRUANES, Benoit DAGEVILLE, Marcin ZUKOWSKI
  • Patent number: 10678753
    Abstract: A system and method for pruning data based on metadata. The method may include receiving a query that includes a plurality of predicates and identifying one or more applicable files including database data satisfying at least one of the plurality of predicates. The identifying the one or more applicable files including reading metadata stored in a metadata store that is separate from the database data. The method further includes pruning inapplicable files comprising database data that does not satisfy at least one of the plurality of predicates to create a reduced set of files and reading the reduced set of files to execute the query.
    Type: Grant
    Filed: November 19, 2019
    Date of Patent: June 9, 2020
    Assignee: Snowflake Inc.
    Inventors: Marcin Zukowski, Benoit Dageville, Thierry Cruanes, Ashish Motivala
  • Publication number: 20200167315
    Abstract: A system, apparatus, and method for processing queries wherein the query includes a request to access or delete data and accessing metadata associated with the set of data, the metadata defining data characteristics of the set of data and identifying at least sets of data that need or not need to be accessed or deleted based on the metadata without accessing the actual data in the set of data; also methods to optimize processing of some operations based on the collected metadata on data.
    Type: Application
    Filed: January 22, 2020
    Publication date: May 28, 2020
    Inventors: Benoit Dageville, Thierry Cruanes, Marcin Zukowski, Allison Waingold Lee, Philipp Thomas Unterbrunner
  • Publication number: 20200151192
    Abstract: A system is described inkling a cache memory and a hardware processor. The hardware processor to determine that at least one of additional data storage capacity and additional processing capacity are needed based on data processing requests of a plurality of computerized query sources to be executed by an execution platform comprising a plurality of nodes, where the data processing requests are directed to database data stored on a plurality of shared storage devices collectively storing the database data. The hardware processor further to increase, in response to the determining, an amount of the data cached by the execution platform by adding one or more nodes to the plurality of nodes.
    Type: Application
    Filed: January 13, 2020
    Publication date: May 14, 2020
    Inventors: Benoit Dageville, Thierry Cruanes, Marcin Zukowski
  • Publication number: 20200125567
    Abstract: Systems, methods, and devices, for managing data skew during a join operation are disclosed. A method includes computing a hash value for a join operation and detecting data skew on a probe side of the join operation at a runtime of the join operation using a lightweight sketch data structure. The method includes identifying a frequent probe-side join key on the probe side of the join operation during a probe phase of the join operation. The method includes identifying a frequent build-side row having a build-side join key corresponding with the frequent probe-side join key. The method includes asynchronously distributing the frequent build-side row to one or more remote servers.
    Type: Application
    Filed: December 17, 2019
    Publication date: April 23, 2020
    Inventors: Florian Andreas Funke, Thierry Cruanes, Benoit Dageville, Marcin Zukowski
  • Publication number: 20200089655
    Abstract: A system and method for pruning data based on metadata. The method may include receiving a query comprising a plurality of predicates and identifying one or more applicable files comprising database data satisfying at least one of the plurality of predicates. The identifying the one or more applicable files including reading metadata stored in a metadata. store that is separate from the database data. The method further includes pruning inapplicable files comprising database data that does not satisfy at least one of the plurality of predicates to create a reduced set of files and reading the reduced set of files to execute the query.
    Type: Application
    Filed: November 19, 2019
    Publication date: March 19, 2020
    Inventors: Marcin Zukowski, Benoit Dageville, Thierry Cruanes, Ashish Motivala
  • Patent number: 10545917
    Abstract: A system, apparatus, and method for managing data storage and data access with querying data and filtering value ranges using only a constant amount of computer memory in the implementation of bloom filters based on a first consumption of a relation.
    Type: Grant
    Filed: October 13, 2015
    Date of Patent: January 28, 2020
    Assignee: Snowflake Inc.
    Inventors: Benoit Dageville, Thierry Cruanes, Marcin Zukowski, Allison Waingold Lee, Philipp Thomas Unterbrunner
  • Patent number: 10534794
    Abstract: Example resource provisioning systems and methods are described. In one implementation, an execution platform accesses multiple remote storage devices. The execution platform includes multiple virtual warehouses, each of which includes a cache to store data retrieved from the remote storage devices and a processor that is independent of the remote storage devices. A resource manager is coupled to the execution platform and monitors received data processing requests and resource utilization. The resource manager also determines whether additional virtual warehouses are needed based on the data processing requests and the resource utilization. If additional virtual warehouses are needed, the resource manager provisions a new virtual warehouse.
    Type: Grant
    Filed: January 11, 2017
    Date of Patent: January 14, 2020
    Assignee: Snowflake Inc.
    Inventors: Benoit Dageville, Thierry Cruanes, Marcin Zukowski
  • Publication number: 20190377813
    Abstract: Systems, methods, and devices, for managing data skew during a join operation are disclosed. A method includes computing a hash value for a join operation and detecting data skew on a probe side of the join operation at a runtime of the join operation using a lightweight sketch data structure. The method includes identifying a frequent probe-side join key on the probe side of the join operation during a probe phase of the join operation. The method includes identifying a frequent build-side row having a build-side join key corresponding with the frequent probe-side join key. The method includes asynchronously distributing the frequent build-side row to one or more remote servers.
    Type: Application
    Filed: June 11, 2018
    Publication date: December 12, 2019
    Inventors: Florian Andreas Funke, Thierry Cruanes, Benoit Dageville, Marcin Zukowski
  • Patent number: 10437780
    Abstract: A system, apparatus, and method for processing queries wherein the query includes a request to access or delete data and accessing metadata associated with the set of data, the metadata defining data characteristics of the set of data and identifying at least sets of data that need or not need to be accessed or deleted based on the metadata without accessing the actual data in the set of data; also methods to optimize processing of some operations based on the collected metadata on data.
    Type: Grant
    Filed: July 14, 2016
    Date of Patent: October 8, 2019
    Assignee: SNOWFLAKE INC.
    Inventors: Marcin Zukowski, Benoit Dageville, Thierry Cruanes, Ashish Motivala
  • Publication number: 20190303389
    Abstract: Example resource management systems and methods are described. In one implementation, a resource manager is configured to manage data processing tasks associated with multiple data elements. An execution platform is coupled to the resource manager and includes multiple execution nodes configured to store data retrieved from multiple remote storage devices. Each execution node includes a cache and a processor, where the cache and processor are independent of the remote storage devices. A metadata manager is configured to access metadata associated with at least a portion of the multiple data elements.
    Type: Application
    Filed: June 20, 2019
    Publication date: October 3, 2019
    Inventors: Benoit Dageville, Thierry Cruanes, Marcin Zukowski
  • Publication number: 20190258607
    Abstract: A system, apparatus, and method for processing queries wherein the query includes a request to access or delete data and accessing metadata associated with the set of data, the metadata defining data characteristics of the set of data and identifying at least sets of data that need or not need to be accessed or deleted based on the metadata without accessing the actual data in the set of data; also methods to optimize processing of some operations based on the collected metadata on data.
    Type: Application
    Filed: April 30, 2019
    Publication date: August 22, 2019
    Inventors: Marcin Zukowski, Benoit Dageville, Thierry Cruanes, Ashish Motivala
  • Publication number: 20190236080
    Abstract: Example resource provisioning systems and methods are described. In one implementation, an execution platform accesses multiple remote storage devices. The execution platform includes multiple virtual warehouses, each of which includes a cache to store data retrieved from the remote storage devices and a processor that is independent of the remote storage devices. A resource manager is coupled to the execution platform and monitors received data processing requests and resource utilization. The resource manager also determines whether additional virtual warehouses are needed based on the data processing requests and the resource utilization. If additional virtual warehouses are needed, the resource manager provisions a new virtual warehouse.
    Type: Application
    Filed: April 8, 2019
    Publication date: August 1, 2019
    Inventors: Benoit Dageville, Thierry Cruanes, Marcin Zukowski
  • Patent number: 10366102
    Abstract: Example resource management systems and methods are described. In one implementation, a resource manager is configured to manage data processing tasks associated with multiple data elements. An execution platform is coupled to the resource manager and includes multiple execution nodes configured to store data retrieved from multiple remote storage devices. Each execution node includes a cache and a processor, where the cache and processor are independent of the remote storage devices. A metadata manager is configured to access metadata associated with at least a portion of the multiple data elements.
    Type: Grant
    Filed: October 20, 2014
    Date of Patent: July 30, 2019
    Assignee: SNOWFLAKE INC.
    Inventors: Benoit Dageville, Thierry Cruanes, Marcin Zukowski
  • Publication number: 20190197039
    Abstract: A system and method for just in time compilation for hash table probing are disclosed. In one implementation, the method of using just-in-time compilation is used to combine the stages of hash value computation and initial lookup into a single compound operation. In another implementation, the method of using just-in-time compilation is used to combine the stages of a hash table record and input record checking to detect if these records are equal, and if not, fetching the next possible record in the linked list of the per-bucket records.
    Type: Application
    Filed: July 30, 2018
    Publication date: June 27, 2019
    Inventors: Juliusz Sompolski, Marcin Zukowski, Peter Boncz
  • Patent number: 10325032
    Abstract: Example resource provisioning systems and methods are described. In one implementation, an execution platform accesses multiple remote storage devices. The execution platform includes multiple virtual warehouses, each of which includes a cache to store data retrieved from the remote storage devices and a processor that is independent of the remote storage devices. A resource manager is coupled to the execution platform and monitors received data processing requests and resource utilization. The resource manager also determines whether additional virtual warehouses are needed based on the data processing requests and the resource utilization. If additional virtual warehouses are needed, the resource manager provisions a new virtual warehouse.
    Type: Grant
    Filed: October 20, 2014
    Date of Patent: June 18, 2019
    Assignee: SNOWFLAKE INC.
    Inventors: Benoit Dageville, Thierry Cruanes, Marcin Zukowski