Patents by Inventor Benoit Dageville

Benoit Dageville has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20220222247
    Abstract: Systems, methods, and devices for tracking a series of changes to database data are disclosed. A method includes executing a transaction to modify data in a micro-partition of a table of a database by generating a new micro-partition that embodies the transaction. The method includes associating transaction data with the new micro-partition, wherein the transaction data comprises a timestamp when the transaction was fully executed, and further includes associating modification data with the new micro-partition that comprises an indication of one or more rows of the table that were modified by the transaction. The method includes joining the transaction data with the modification data to generate joined data and querying the joined data to determine a listing of intermediate modifications made to the table between a first timestamp and a second timestamp.
    Type: Application
    Filed: March 29, 2022
    Publication date: July 14, 2022
    Inventors: Torsten Grabs, Istvan Cseri, Benoit Dageville
  • Patent number: 11386220
    Abstract: A method for sharing data in a multi-tenant database includes receiving, by a target account of a multiple tenant database, access rights of a share object in a first account of the multiple tenant database, wherein the share object having access rights to a database object of the first account and wherein access to the database object of the first account by the target account is based on the access rights of the share object. The method also includes receiving, by one or more processors of the target account, access rights to an alias object, wherein the alias object references the database object of the first account.
    Type: Grant
    Filed: November 30, 2021
    Date of Patent: July 12, 2022
    Assignee: Snowflake Inc.
    Inventors: Benoit Dageville, Thierry Cruanes, Martin Hentschel, Peter Povinec
  • Publication number: 20220215041
    Abstract: Replication and failover of database data is disclosed. A method includes copying a first dataset of a first account stored in a primary deployment to a secondary deployment. The method includes determining that the first dataset stored in the primary deployment is unavailable. The method includes executing a first transaction on the first dataset at the secondary deployment in response to determining that the first dataset stored in the primary deployment is unavailable. The method includes executing a second transaction on a second dataset of a second account at the primary deployment while the first dataset of the first account stored in the primary deployment is unavailable.
    Type: Application
    Filed: March 24, 2022
    Publication date: July 7, 2022
    Inventors: Benoit Dageville, Eric Robinson, Martin Hentschel
  • Publication number: 20220215033
    Abstract: A method for accessing listings in a data exchange includes creating a first listing in a data exchange, the first listing referencing a first database of a plurality of databases and specifying identity-based sharing of the first database, creating a second listing in the data exchange, the second listing referencing a second database of the plurality of databases and data of the first database shared according to the identity-based sharing of the first database, and receiving an instruction from a user of the data exchange, the instruction referencing the second listing and instructing the addition of the second listing to a set of consumed data shares accessible by the user.
    Type: Application
    Filed: March 25, 2022
    Publication date: July 7, 2022
    Inventors: Pui Kei Johnston Chu, Benoit Dageville, Matthew J. Glickman, Christian Kleinerman, Prasanna Krishnan, Justin Langseth
  • Publication number: 20220215027
    Abstract: A query directed at a source table organized into a set of batch units is received. The query includes a pattern matching predicate that specifies a search pattern. A set of N-grams are generated based on the search pattern. A pruning index associated with the source table is accessed. The pruning index comprises a set of filters that index distinct N-grams in each column of the source table. The pruning index is used to identify a subset of batch units to scan for matching data based on the set of N-grams generated for the search pattern. The query is processed by scanning the subset of batch units.
    Type: Application
    Filed: March 29, 2022
    Publication date: July 7, 2022
    Inventors: Thierry Cruanes, Benoit Dageville, Ismail Oukid, Stefan Richter
  • Publication number: 20220217150
    Abstract: Systems and methods for managing membership in a private data exchange are provided herein. In one embodiment, the method includes generating a plurality of listings in a data exchange. A first listing of the plurality of listings may include a reference to a first version of shared data within a first database, where the first version is one of a plurality of versions of the shared data within the first database. The method further includes receiving a request from a member of the data exchange for access to the first listing and limiting, by a processing device, access for the member to the first version of the shared data of the first database referenced by the first listing.
    Type: Application
    Filed: March 23, 2022
    Publication date: July 7, 2022
    Inventors: Pui Kei Johnston Chu, Benoit Dageville, Matthew J. Glickman, Christian Kleinerman, Prasanna Krishnan, Justin Langseth
  • Publication number: 20220206992
    Abstract: A system and method for pruning data based on metadata. The method may include receiving a query that includes a plurality of predicates and identifying one or more applicable files including database data satisfying at least one of the plurality of predicates. The identifying the one or more applicable files including reading metadata stored in a metadata store that is separate from the database data. The method further includes pruning inapplicable files including database data that does not satisfy at least one of the plurality of predicates to create a reduced set of files and reading the reduced set of files to execute the query.
    Type: Application
    Filed: March 16, 2022
    Publication date: June 30, 2022
    Inventors: Marcin ZUKOWSKI, Benoit DAGEVILLE, Thierry CRUANES, Ashish MOTIVALA
  • Publication number: 20220207169
    Abstract: A method of sharing data in a multi-tenant database includes inspecting, by a processing device of a multiple tenant database, a sharer account to determine a presence of a grant to a second role object, in a target account, of access rights to a first role object included in the sharer account. The method includes granting the second role object, in the target account, access rights to an alias object. The first role object having one or more grants to the one or more resources of the sharer account. The target account accesses the one or more resources using the one or more grants of the first role object and using the alias object without at least one of copying the one or more resources or transmitting the one or more resources.
    Type: Application
    Filed: March 17, 2022
    Publication date: June 30, 2022
    Inventors: Benoit Dageville, Thierry Cruanes, Martin Hentschel, Peter Povinec
  • Publication number: 20220207054
    Abstract: A method, apparatus, and system for join operations of a plurality of relations that are distributed over a plurality of storage locations over a network of computing components.
    Type: Application
    Filed: March 18, 2022
    Publication date: June 30, 2022
    Inventors: Benoit Dageville, Thierry Cruanes, Marcin Zukowski, Allison Waingold Lee, Philipp Thomas Unterbrunner
  • Patent number: 11372888
    Abstract: A method, apparatus, and system for join operations of a plurality of relations that are distributed over a plurality of storage locations over a network of computing components. An example method includes: receiving a relational join query for a join operation associated with a first relation and a second relation; generating at least one build operator and at least one probe operator to perform build operations and probe operations, respectively, of the join operation; and managing a state of one or more communication links between the at least one build operator and the at least one probe operator based on a size of the second relation as determined by the at least one build operator and an estimated size of the first relation.
    Type: Grant
    Filed: November 11, 2021
    Date of Patent: June 28, 2022
    Assignee: Snowflake Inc.
    Inventors: Benoit Dageville, Thierry Cruanes, Marcin Zukowski, Allison Waingold Lee, Philipp Thomas Unterbrunner
  • Patent number: 11372740
    Abstract: Systems, methods, and devices for retrying a query. A method includes receiving, by a first database query manager, a query directed to database data from a client account. The method includes assigning an original execution of the query to one or more execution nodes of an execution platform. The method includes determining the original execution of the query was unsuccessful. The method includes transferring the query to a second database query manager configured to manage internal tasks for improving operation of a database platform that are not received from client accounts. The method includes assigning, by the second database query manager, a retry execution of the query to one or more execution nodes of an execution platform.
    Type: Grant
    Filed: August 31, 2020
    Date of Patent: June 28, 2022
    Assignee: Snowflake Inc.
    Inventors: Benoit Dageville, Johan Harjono, Simon Holm Jensen, Kunal Prafulla Nabar, Steven James Pelley
  • Publication number: 20220197872
    Abstract: Provided herein are systems and methods for an efficient method of replicating share objects to remote deployments. For example, the method may include generating a global representation of a share object of a first database account located in a first region. The share object includes grant metadata associated with a set of objects of a database located in the first region and associated with the first database account. The method may further include, in response to a database refresh command received from a second database account associated with a database replica located in a second region, replicating the set of objects of the database to the database replica. The method may further include, in response to a share refresh command received from the second database account, replicating the grant metadata to a share object replica located in the second region.
    Type: Application
    Filed: March 14, 2022
    Publication date: June 23, 2022
    Inventors: Pui Kei Johnston Chu, Benoit Dageville, Shreyas Narendra Desai, German Alberto Gil Echeverri, Prasanna Krishnan, Vishnu Dutt Paladugu, Bowen Zhang
  • Patent number: 11366920
    Abstract: Embodiments of the present disclosure provide systems and methods for defining database roles to allow sharing of the objects within a database in a more granular fashion. A database role is created within a database, and privileges are granted to the database role by a provider account. The database role may be granted to a share object, which would result in the share object being indirectly granted all privileges that have been granted to the database role. Once the share object has been hydrated in the consumer account, the local administrator can then decide which local roles are granted usage on the shared database role, thus ultimately allowing some level of filtering as to which shared objects can be accessed by which local roles.
    Type: Grant
    Filed: September 1, 2021
    Date of Patent: June 21, 2022
    Assignee: Snowflake Inc.
    Inventors: Damien Carru, Pui Kei Johnston Chu, Benoit Dageville, Shreyas Narendra Desai, Subramanian Muralidhar, Bowen Zhang
  • Publication number: 20220188287
    Abstract: A system includes one or more processors and data storage containing instructions executable by the one or more processors to perform operations. The operations include storing table data in a plurality of partitions of a storage device. Metadata is retrieved from a first partition of the plurality of partitions. The metadata includes a plurality of change tracking entries stored as a change tracking stream. A lineage of modifications made to the table data is determined using the plurality of change tracking entries. A report of one or more transactions performed on the table data is generated. The one or more transactions are included in the lineage of modifications.
    Type: Application
    Filed: March 4, 2022
    Publication date: June 16, 2022
    Inventors: Subramanian Muralidhar, Istvan Cseri, Torsten Grabs, Benoit Dageville
  • Publication number: 20220188297
    Abstract: A method includes generating a task using a plurality of logical statements embedded in a database, the plurality of logical statements corresponding to a data modification. Database data is ingested into a staging table that is configured within the database. The task is executed based on applying the data modification to a first set of partitions storing the database data and generating a second set of partitions. The second set of partitions store modified data corresponding to the database data. A stream of committed transactions is advanced at least in part by adding an entry into the stream. The entry corresponds to committed transactions performed on the first set of partitions during the data modification. A data processing task is scheduled for execution on the modified data based on the advancing of the stream offset.
    Type: Application
    Filed: March 4, 2022
    Publication date: June 16, 2022
    Inventors: Istvan Cseri, Torsten Grabs, Benoit Dageville
  • Patent number: 11360995
    Abstract: A data warehouse storing databases for a plurality of users, including service providers hosting data for other users of the data warehouse may implement a data exchange. The data warehouse to verify identity of users and execute instructions with respect to databases of the data warehouse.
    Type: Grant
    Filed: April 6, 2021
    Date of Patent: June 14, 2022
    Assignee: Snowflake Inc.
    Inventors: Pui Kei Johnston Chu, Benoit Dageville, Matthew J. Glickman, Christian Kleinerman, Prasanna Krishnan, Justin Langseth
  • Publication number: 20220179846
    Abstract: Systems, methods, and devices for batch ingestion of data into a table of a database. A method includes determining a notification indicating a presence of a user file received from a client account to be ingested into a database. The method includes identifying data in the user file and identifying a target table of the database to receive the data in the user file. The method includes generating an ingest task indicating the data and the target table. The method includes assigning the ingest task to an execution node of an execution platform, wherein the execution platform comprises a plurality of execution nodes operating independent of a plurality of shared storage devices collectively storing database data. The method includes registering metadata concerning the target table in a metadata store after the data has been fully committed to the target table by the execution node.
    Type: Application
    Filed: February 25, 2022
    Publication date: June 9, 2022
    Inventors: Jiansheng Huang, Jiaxing Liang, Scott Ziegler, Haowei Yu, Benoit Dageville, Varun Ganesh
  • Patent number: 11354334
    Abstract: Example systems and methods for cloning catalog objects are described. In one implementation, a method identifies an original catalog object associated with data and creates a duplicate copy of the original catalog object without copying the data itself. The method allows access to the data using the duplicate catalog object and supports modifying the data associated with the original catalog object independently of the duplicate catalog object. The duplicate catalog object can be deleted upon completion of modifying the data associated with the original catalog object.
    Type: Grant
    Filed: March 19, 2021
    Date of Patent: June 7, 2022
    Assignee: Snowflake Inc.
    Inventors: Ashish Motivala, Benoit Dageville
  • Patent number: 11354316
    Abstract: Disclosed herein are systems and methods for selective scanning of external partitions. In an embodiment, a database platform receives a query directed at least in part to an external table stored on an external data storage platform. The external table is partitioned into partitions corresponding to storage locations in the external data storage platform. The database platform prunes, using external-table metadata that is stored by the database platform and that maps the partitions of the external table to the storage locations in the external data storage platform, those partitions that do not potentially contain data that satisfies the query. The database platform identifies data that satisfies the query by scanning any one or more of the partitions of the external table that were not pruned, and responds to the query at least in part with the identified data that satisfies the query.
    Type: Grant
    Filed: December 23, 2021
    Date of Patent: June 7, 2022
    Assignee: Snowflake Inc.
    Inventors: Subramanian Muralidhar, Benoit Dageville, Thierry Cruanes, Nileema Shingte, Saurin Shah, Torsten Grabs, Istvan Cseri
  • Patent number: 11347738
    Abstract: Systems, methods, and devices, for managing data skew during a join operation are disclosed. A method includes computing a hash value for a join operation and detecting data skew on a probe side of the join operation at a runtime of the join operation using a lightweight sketch data structure. The method includes identifying a frequent probe-side join key on the probe side of the join operation during a probe phase of the join operation. The method includes identifying a frequent build-side row having a build-side join key corresponding with the frequent probe-side join key. The method includes asynchronously distributing the frequent build-side row to one or more remote servers.
    Type: Grant
    Filed: October 15, 2021
    Date of Patent: May 31, 2022
    Assignee: Snowflake Inc.
    Inventors: Florian Andreas Funke, Thierry Cruanes, Benoit Dageville, Marcin Zukowski