Patents Assigned to SNOWFLAKE INC.
-
Patent number: 11687436Abstract: Techniques described herein can monitor various data metrics. The auto-insight techniques can further detect and rank data segments that contributed to, or counteracted, shifts in data and detect when such shifts occurred. Thus, the techniques described herein can detect and identify root causes in shifts in different metrics. The techniques include pruning and ranking causes to identify the root causes and identify non-relevant factors, as well.Type: GrantFiled: March 25, 2022Date of Patent: June 27, 2023Assignee: Snowflake Inc.Inventors: Christian Kleinerman, Christopher F. Pouliot, Jacobus J. P. Van Ryswyck
-
Patent number: 11687506Abstract: Affinity-based listing recommendations are created and used in a public data exchange. Listings can be evaluated against one another for affinity or similarity such that users working with a particular dataset can be presented with other datasets that share an affinity. Affinity can be determined from both the dataset metadata as well as information from the dataset content. Calculation of affinity scores can be pre-computed and stored, in advance of use, or determined on-the-fly. Presentation of most-similar listings can be deterministic, can contain randomization, can employ time-decay, can be weighted, and can make use of a tiered-sum approach.Type: GrantFiled: July 25, 2022Date of Patent: June 27, 2023Assignee: Snowflake Inc.Inventors: Orestis Kostakis, Prasanna V. Krishnan, Subramanian Muralidhar, Shakhina Pulatova, Megan Marie Schoendorf
-
Patent number: 11681708Abstract: A query directed at a source table organized into a set of batch units is received. The query comprises a regular expression search pattern. The regular expression search pattern is converted to a pruning index predicate comprising a set of substring literals extracted from the regular expression search pattern. A set of N-grams is generated based on the set of substring literals extracted from the regular expression search pattern. A pruning index associated with the source table is accessed. The pruning index indexes distinct N-grams in each column of the source table. A subset of batch units to scan for data matching the query are identified based on the pruning index and the set of N-grams. The query is processed by scanning the subset of batch units.Type: GrantFiled: September 23, 2022Date of Patent: June 20, 2023Assignee: Snowflake Inc.Inventors: Thierry Cruanes, Ismail Oukid, Stefan Richter, Alejandro Salinger
-
Patent number: 11681583Abstract: A shared database platform can interface with a cluster computing platform over a network through a database connector and one or more cluster connectors. The data transferred over the network can include telemetry metadata that can be distributed to execution nodes of the cluster computing platform for generation and transmission of cluster data to the shared database platform.Type: GrantFiled: January 6, 2022Date of Patent: June 20, 2023Assignee: Snowflake Inc.Inventors: Bing Li, Zihan Li, Edward Ma, Mingli Rui, Srinath Shankar
-
Patent number: 11675780Abstract: Disclosed herein are embodiments of systems and methods for partition-based scanning of external tables for query processing. In an example embodiment, a database platform receives a query that includes one or more predicates, where the query is directed at least to data in an external table that is stored in an external storage platform that is external to the database platform. The database platform identifies, based on metadata that summarizes the data in the external table, one or more partitions of the external table that potentially include data that satisfies the one or more predicates. The database platform also identifies, from the one or more identified partitions, data that satisfies the one or more predicates. The database platform sends a response to the query to the client, the response comprising the data satisfying the one or more predicates.Type: GrantFiled: February 9, 2022Date of Patent: June 13, 2023Assignee: Snowflake Inc.Inventors: Subramanian Muralidhar, Benoit Dageville, Thierry Cruanes, Nileema Shingte, Saurin Shah, Torsten Grabs, Istvan Cseri
-
Patent number: 11675815Abstract: A method for a multi-cluster warehouse includes allocating processing units as part of a data warehouse. The processing units access data within one or more databases in cloud storage resources. The method also includes providing one or more queries to each processing unit within the data warehouse. In response to the queries, each processing unit performs database operations on a particular portion of a database table. The method also includes monitoring a workload of the processing units to determine that a processing capacity of the processing units has reached a threshold processing capacity. The method also includes dynamically adding additional processing units to and removing processing units from the data warehouse as needed based on a workload of the processing units.Type: GrantFiled: January 27, 2023Date of Patent: June 13, 2023Assignee: Snowflake Inc.Inventors: Thierry Cruanes, Benoit Dageville, Florian Andreas Funke, Peter Povinec
-
Patent number: 11675784Abstract: A query referencing an external table function provided by a remote software component is received. Requests to execute the external table function on input data are sent to a proxy service. A first request includes a batch of input rows from the input data. A first response to the first request received from the proxy service includes a first portion of result data and a pagination token. The pagination token indicates that at least a second portion of the result data corresponding to the first batch of input rows is to be obtained from the remote software component. Based on the pagination token, a second request is sent to obtain the second portion of the result data. One or more responses are received from the proxy service that comprise at least the second portion of the result data. The result data is processed according to the query.Type: GrantFiled: August 30, 2022Date of Patent: June 13, 2023Assignee: Snowflake Inc.Inventors: Srilakshmi Chintala, Istvan Cseri, Albert L. Hu, Isaac Kunen, Nitya Kumar Sharma, Igor Zinkovsky
-
Patent number: 11675640Abstract: A query referencing a function associated with a remote software component is received by a network-based data warehouse system. Temporary security credentials corresponding to a role at a cloud computing service platform are obtained. The role has permission to send calls to a web endpoint corresponding to the remote software component. A request comprising input data and electronically signed using the temporary security credentials is sent to a web Application Programming Interface (API) management system of the cloud computing service platform. The request, when received by the web API management system, causes the web API management system to invoke external functionality provided by the remote software component at the web endpoint with respect to the input data. A response comprising a result of invoking the external functionality is received from the web API management system, and the result data is processed according to the query.Type: GrantFiled: January 10, 2022Date of Patent: June 13, 2023Assignee: Snowflake Inc.Inventors: Istvan Cseri, Isaac Kunen, Igor Zinkovsky
-
Patent number: 11675806Abstract: A hybrid network-based database system for handling OLTP and OLAP queries using decoupled compute and storage devices. A set of decoupled compute instances perform transactions on an OLTP database, and the data is replicated to an OLAP database, which is managed by another set of decoupled compute instances. Further, in response to queries, the database system can retrieve data from the OLTP and OLAP database for merging and processing according to the query.Type: GrantFiled: March 5, 2021Date of Patent: June 13, 2023Assignee: Snowflake Inc.Inventors: Leonidas Galanis, Alexander Miller, William Waddington, Khaled Yagoub
-
Patent number: 11671491Abstract: Provided herein are systems and methods to provide a way to share metrics regarding shared data access and accesses associated with data providers for different data listings of the data exchange. For example, the method may comprise detecting one or more client interactions with a set of data listings of a data exchange, the set of data listings associated with one or data providers. The method may further comprise collecting metrics corresponding to the one or more client interactions. In addition, the method may share metrics relevant to the one or more data providers with the one or more data providers.Type: GrantFiled: December 22, 2022Date of Patent: June 6, 2023Assignee: Snowflake Inc.Inventors: Edmond T. Chan, Pui Kei Johnston Chu, Chao Ren, Stephanie Stillman, Dangfu Wang
-
Patent number: 11669630Abstract: A data dictionary generation system utilizes a background service that is programmed to automatically populate and update a data dictionary for listings offering shared data. A data dictionary includes metadata describing the shared data overall as well as the individual objects included in the listing, such as the individual tables, schemas, views, and functions. To generate the data dictionary, the data dictionary generation system analyzes the shared data to identify objects, identifies a set of data fields associated with each identified object and populates the set of data fields associated with each identified object based on the shared data offered by the listing. To ensure that a data dictionary for each listing remains up to date, the data dictionary generation system periodically scans the listings to identify any changes to share access granted to the listings.Type: GrantFiled: October 31, 2022Date of Patent: June 6, 2023Assignee: Snowflake Inc.Inventors: Durga Mahesh Arikatla, Subramanian Muralidhar, Vishnu Dutt Paladugu, Shakhina Pulatova, Di Wu, Ziqi Xu
-
Patent number: 11671459Abstract: The disclosure relates generally to methods, systems, and apparatuses for managing network connections. A system for managing network connections includes a storage component, a decoding component, a rule manager component, and a notification component. The storage component is configured to store a list of expected connections for a plurality of networked machines, wherein each connection in the list of expected connections defines a start point and an end point for the connection. The decoding component is configured to decode messages from the plurality of networked machines indicating one or more connections for a corresponding machine. The rule manager component is configured to identify an unexpected presence or absence of a connection on at least one of the plurality of network machines based on the list of expected connections. The notification component is configured to provide a notification or indication of the unexpected presence or absence.Type: GrantFiled: November 15, 2021Date of Patent: June 6, 2023Assignee: Snowflake Inc.Inventors: James Calvin Armstrong, Jonathan Claybaugh
-
Patent number: 11669427Abstract: Disclosed herein are embodiments of systems and methods for query-attempt processing in a database environment. In an embodiment, a database platform receives a query, and operates a first query manager of the database platform to manage a first attempt to execute the query. Based on determining that the attempt to execute the query was unsuccessful, the database platform identifies a second query manager of the database platform as implementing a same version of the database platform as the first query manager, and operates the second query manager to manage a first retry attempt to execute the query.Type: GrantFiled: March 31, 2022Date of Patent: June 6, 2023Assignee: Snowflake Inc.Inventors: Benoit Dageville, Johan Harjono, Simon Holm Jensen, Kunal Prafulla Nabar, Steven James Pelley
-
Patent number: 11651287Abstract: Embodiments of the present disclosure may provide a data sharing system implemented as a local application in a consumer database of a distributed database. The local application can include a training function and a scoring function to train a machine learning model on provider and consumer data, and generate output data by applying the trained machine learning model on input data. The input data can include data portions from a consumer database and a provider database that are joined to create a joined dataset for scoring.Type: GrantFiled: July 31, 2022Date of Patent: May 16, 2023Assignee: Snowflake Inc.Inventors: Orestis Kostakis, Justin Langseth
-
Patent number: 11645243Abstract: A file access system for user defined functions (UDFs) can be implemented on a distributed database system. The system can store UDF signatures and interfaces (e.g., classes, sub-classes) that can be called by other users. Upon a UDF being called, one or more interface objects (e.g., InputStream) can be created and requests transferred to a execution node via a network channel. The execution node can implement multiple threads that are authorized and download file data from a staging location (e.g., internal stage, external stage) concurrently.Type: GrantFiled: November 9, 2021Date of Patent: May 9, 2023Assignee: Snowflake Inc.Inventors: Elliott Brossard, Sukruth Komarla Sukumar, Isaac Kunen, Ju-yi Kuo, Jonathan Lee Leang, Edward Ma, Schuyler James Manchester, Polita Paulus, Saurin Shah, Igor Zinkovsky
-
Patent number: 11645262Abstract: The subject technology executes a change on an existing micro-partition of a table of a database, the executing of the change comprising generating a new micro-partition that embodies the change. The subject technology receives a request for a delta for the table between a first timestamp and a second timestamp. The subject technology queries at least one change tracking column to determine the delta between the first timestamp and the second timestamp, the delta including information indicating at least one database operation that was performed to at least one row of a set of rows of the table, without including information as to intermediate changes made to at least one row of the set of rows of the table between the first timestamp and the second timestamp, that facilitates a reduction in storage of historical versions of the set of rows of the table.Type: GrantFiled: January 29, 2021Date of Patent: May 9, 2023Assignee: Snowflake Inc.Inventors: Istvan Cseri, Torsten Grabs, Benoit Dageville
-
Patent number: 11645413Abstract: Techniques for document sharing include configuring a secure view of shared data as a listing in a data exchange of a data provider. The listing includes metadata with a first uniform resource locator (URL) of the shared data. An invitation to view the listing for is encoded transmission to a client device. A user defined function (UDF) client is invoked to access a data object of the shared data, based on detecting an acceptance of the invitation by the client device. A security function is applied to the data object to generate a modified data object. The metadata is updated to include a second URL, which is associated with a storage location storing the modified data object. The client device is granted access to the modified data object using the second URL, based on detecting a request by the client device to view the listing in the data exchange.Type: GrantFiled: July 26, 2022Date of Patent: May 9, 2023Assignee: Snowflake Inc.Inventors: Abhiram Arikapudi, Saurin Shah
-
Patent number: 11645305Abstract: Example resource management systems and methods are described. In one implementation, a resource manager is configured to manage data processing tasks associated with multiple data elements. An execution platform is coupled to the resource manager and includes multiple execution nodes configured to store data retrieved from multiple remote storage devices. Each execution node includes a cache and a processor, where the cache and processor are independent of the remote storage devices. A metadata manager is configured to access metadata associated with at least a portion of the multiple data elements.Type: GrantFiled: May 16, 2022Date of Patent: May 9, 2023Assignee: Snowflake Inc.Inventors: Thierry Cruanes, Benoit Dageville, Marcin Zukowski
-
Patent number: 11645306Abstract: Techniques for database configurations for remote deployments include a method performed by a data platform executing instructions on at least one processor. The method includes provisioning by at least one hardware processor, a remote deployment of a data platform with a plurality of objects. The plurality of objects includes at least one task object associated with a primary deployment of the data platform. The method further includes detecting using the at least one task object of the plurality of objects, a request to replicate a database stored at the primary deployment of the data platform at the remote deployment. Responsive to the request, database data is pushed from the database stored at the primary deployment to at least a second object of the plurality of objects provisioned at the remote deployment.Type: GrantFiled: May 11, 2022Date of Patent: May 9, 2023Assignee: Snowflake Inc.Inventors: Pui Kei Johnston Chu, Benoit Dageville, Shreyas Narendra Desai, Sameer Deshpande, Subramanian Muralidhar, Di Wu
-
Patent number: 11645275Abstract: Aspects of the present disclosure address systems, methods, and devices for tracking object dependencies in a cloud database system. An object dependency created between a referencing object and a referenced object is detected. Based on detecting the object dependency, a dependency record is generated. The dependency record includes dependency information describing the object dependency between the reference object and the referenced object. The dependency record is stored in a database of dependency records.Type: GrantFiled: April 19, 2022Date of Patent: May 9, 2023Assignee: Snowflake Inc.Inventors: Raja Suresh Krishna Balakrishnan, Khalid Zaman Bijon, Subramanian Muralidhar, David Schultz, Jian Xu