Patents by Inventor Polita Paulus
Polita Paulus has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11899646Abstract: The subject technology receives a command to commit a table in a different table format on an external volume. The subject technology generates a first snapshot of the table on internal storage. The subject technology generates a first list of metadata files on the internal storage. The subject technology generates, based on the first list of metadata files, a first set of metadata files on the internal storage. The subject technology generates a second snapshot of the table on the external volume. The subject technology generates a second list of metadata files on the external volume. The subject technology generates, based on the second list of metadata files, a second set of metadata files on the external volume. The subject technology generates a first set of data files in a different file format on the external volume.Type: GrantFiled: March 30, 2023Date of Patent: February 13, 2024Assignee: Snowflake Inc.Inventors: Selcuk Aya, Thierry Cruanes, Istvan Cseri, Benoit Dageville, Marcia Feitel, Steven P. Herbert, Xinglian Liu, James Malone, Subramanian Muralidhar, Muthunagappan Muthuraman, Polita Paulus, Marianne Shaw, Nileema Shingte, Wai Sing Wong, Jiaqi Yan
-
Patent number: 11876802Abstract: A command to load or unload data at a storage location is received. In response to the command, a storage integration object associated with the storage location is identified. The storage integration object identifies a cloud identity object that corresponds to a cloud identity that is associated with a proxy identity object corresponding to a proxy identity granted permission to access the storage location. The data is loaded or unloaded at the storage location by assuming the proxy identity.Type: GrantFiled: November 11, 2022Date of Patent: January 16, 2024Assignee: Snowflake Inc.Inventors: Polita Paulus, Peter Povinec, Saurin Shah, Srinidhi Karthik Bisthavalli Srinivasa
-
Publication number: 20230409636Abstract: Disclosed herein are embodiments of systems and methods for synchronizing file-catalog table with a file stage. In an embodiment, a data platform receives a notification of a modification to one or more files in a file stage. The file stage includes data storage having a storage location. The data platform updates, based on receiving the notification of the modification, a first file-catalog table for the file stage based on the modification. The first file-catalog table includes a row for each file in the file stage, as well as a column for each of one or more metadata properties of the one or more files in the file stage.Type: ApplicationFiled: September 6, 2023Publication date: December 21, 2023Inventors: Polita Paulus, Aravind Ramarathinam, Saurin Shah, Sukruth Komarla Sukumar
-
Publication number: 20230401329Abstract: Provided herein are systems and methods for sharing unstructured data in stages. For example, a method includes generating a share object at an account of a data provider. The share object identifies an account of a data consumer and at least one unstructured data file shared with the account of the data consumer. The share object is configured with access privileges to the at least one unstructured data file. A notification of the share object is communicated to the account of the data consumer.Type: ApplicationFiled: September 20, 2022Publication date: December 14, 2023Inventors: Subramanian Muralidhar, Polita Paulus, Saurin Shah, Srinidhi Karthik Bisthavalli Srinivasa
-
Publication number: 20230401197Abstract: In one aspect, a method includes forming, at a database platform, a file stage in a user account on the database platform, the user account being associated with a client, forming, at the database platform, a hidden stage directory table and a data pipeline for the file stage, and binding the data pipeline to the file stage. The method also includes receiving a notification of an event of the file stage. The method also includes identifying the data pipeline corresponding to the file stage. The method also includes updating the hidden stage directory table of the identified data pipeline.Type: ApplicationFiled: June 26, 2023Publication date: December 14, 2023Inventors: Polita Paulus, Aravind Ramarathinam, Saurin Shah
-
Publication number: 20230401193Abstract: The subject technology receives a command to commit a table in a different table format on an external volume. The subject technology generates a first snapshot of the table on internal storage. The subject technology generates a first list of metadata files on the internal storage. The subject technology generates, based on the first list of metadata files, a first set of metadata files on the internal storage. The subject technology generates a second snapshot of the table on the external volume. The subject technology generates a second list of metadata files on the external volume. The subject technology generates, based on the second list of metadata files, a second set of metadata files on the external volume. The subject technology generates a first set of data files in a different file format on the external volume.Type: ApplicationFiled: March 30, 2023Publication date: December 14, 2023Inventors: Selcuk Aya, Thierry Cruanes, Istvan Cseri, Benoit Dageville, Marcia Feitel, Steven P. Herbert, Xinglian Liu, James Malone, Subramanian Muralidhar, Muthunagappan Muthuraman, Polita Paulus, Marianne Shaw, Nileema Shingte, Wai Sing Wong, Jiaqi Yan
-
Patent number: 11797608Abstract: Disclosed herein are embodiments of systems and methods for synchronizing file-catalog table with a file stage. In an embodiment, a data platform receives a notification of a modification to one or more files in a file stage. The file stage includes data storage having a storage location. The data platform updates, based on receiving the notification of the modification, a first file-catalog table for the file stage based on the modification. The first file-catalog table includes a row for each file in the file stage, as well as a column for each of one or more metadata properties of the one or more files in the file stage.Type: GrantFiled: October 26, 2022Date of Patent: October 24, 2023Assignee: Snowflake Inc.Inventors: Polita Paulus, Aravind Ramarathinam, Saurin Shah, Sukruth Komarla Sukumar
-
Patent number: 11748338Abstract: In one aspect, a method includes forming, at a database platform, a file stage in a user account on the database platform, the user account being associated with a client, forming, at the database platform, a hidden stage directory table and a data pipeline for the file stage, and binding the data pipeline to the file stage. The method also includes receiving a notification of an event of the file stage. The method also includes identifying the data pipeline corresponding to the file stage. The method also includes updating the hidden stage directory table of the identified data pipeline.Type: GrantFiled: August 25, 2022Date of Patent: September 5, 2023Assignee: Snowflake Inc.Inventors: Polita Paulus, Aravind Ramarathinam, Saurin Shah
-
Publication number: 20230214405Abstract: The embodiments described herein provide means for replicating external stages between deployments of e.g., a cloud data lake using a modified storage integration. The modified storage integration may be defined with a set of storage locations, wherein the storage integration comprises a base URL for each of the set of storage locations and wherein each storage location identifies a remote deployment where a cloud platform is hosted and a geographic region of the remote deployment. An external stage object may be bound to the storage integration, wherein the external stage object facilitates a data loading operation that is currently in progress on the first storage location. In response to detecting an outage at the first storage location, the data loading operation that was in progress on the first storage location may be continued at the second storage location using the storage integration.Type: ApplicationFiled: March 9, 2023Publication date: July 6, 2023Inventors: Subramanian Muralidhar, Polita Paulus, Sahaj Saini, Saurin Shah, Srinidhi Karthik Bisthavalli Srinivasa
-
Patent number: 11645243Abstract: A file access system for user defined functions (UDFs) can be implemented on a distributed database system. The system can store UDF signatures and interfaces (e.g., classes, sub-classes) that can be called by other users. Upon a UDF being called, one or more interface objects (e.g., InputStream) can be created and requests transferred to a execution node via a network channel. The execution node can implement multiple threads that are authorized and download file data from a staging location (e.g., internal stage, external stage) concurrently.Type: GrantFiled: November 9, 2021Date of Patent: May 9, 2023Assignee: Snowflake Inc.Inventors: Elliott Brossard, Sukruth Komarla Sukumar, Isaac Kunen, Ju-yi Kuo, Jonathan Lee Leang, Edward Ma, Schuyler James Manchester, Polita Paulus, Saurin Shah, Igor Zinkovsky
-
Patent number: 11620307Abstract: Described herein are techniques for replicating external stages between deployments of e.g., a cloud data lake using a modified storage integration. The modified storage integration may be defined with multiple storage locations that it can point to, as well as a designation of an active storage location. The storage integration may also be defined with base file paths for each storage location as well as a relative file path which together may serve to synchronize data loading operations between deployments when e.g., a fail-over occurs from one deployment to another. The storage integration may be replicated from a first deployment to a second deployment, and when database replication occurs, an external stage may be replicated to the second deployment and bound to the replicated storage integration. Thus, a fail-over to the second deployment may result in a seamless transition of data loading processes to the second deployment.Type: GrantFiled: August 6, 2021Date of Patent: April 4, 2023Assignee: Snowflake Inc.Inventors: Subramanian Muralidhar, Polita Paulus, Sahaj Saini, Saurin Shah, Srinidhi Karthik Bisthavalli Srinivasa
-
Publication number: 20230097756Abstract: A method includes decoding, by at least one hardware processor, a request for a user-defined function (UDF). The request includes a reference to one or more files. The method further includes generating, by the at least one hardware processor, the UDF based on the request. The UDF includes a file reference object with file path information corresponding to the reference. The file path information identifies a file path to the one or more files. A UDF call into the UDF is detected. The UDF call specifies the file path information. The UDF call is processed to generate result data using the one or more files.Type: ApplicationFiled: December 8, 2022Publication date: March 30, 2023Inventors: Elliott Brossard, Sukruth Komarla Sukumar, Isaac Kunen, Ju-yi Kuo, Jonathan Lee Leang, Edward ma, Schuyler James Manchester, Polita Paulus, Saurin Shah, Igor Zinkovsky
-
Publication number: 20230070935Abstract: A command to load or unload data at a storage location is received. In response to the command, a storage integration object associated with the storage location is identified. The storage integration object identifies a cloud identity object that corresponds to a cloud identity that is associated with a proxy identity object corresponding to a proxy identity granted permission to access the storage location. The data is loaded or unloaded at the storage location by assuming the proxy identity.Type: ApplicationFiled: November 11, 2022Publication date: March 9, 2023Inventors: Polita Paulus, Peter Povinec, Saurin Shah, Srinidhi Karthik Bisthavalli Srinivasa
-
Publication number: 20230070255Abstract: Disclosed herein are embodiments of systems and methods for synchronizing file-catalog table with a file stage. In an embodiment, a data platform receives a notification of a modification to one or more files in a file stage. The file stage includes data storage having a storage location. The data platform updates, based on receiving the notification of the modification, a first file-catalog table for the file stage based on the modification. The first file-catalog table includes a row for each file in the file stage, as well as a column for each of one or more metadata properties of the one or more files in the file stage.Type: ApplicationFiled: October 26, 2022Publication date: March 9, 2023Inventors: Polita Paulus, Aravind Ramarathinam, Saurin Shah, Sukruth Komarla Sukumar
-
Patent number: 11550845Abstract: A file access system for user defined functions (UDFs) can be implemented on a distributed database system. The system can store UDF interfaces and file reference objects that can be called by other users. Upon a UDF being called, files on a stage, one or more interface objects (e.g., InputStream), and file reference objects can be implemented by execution nodes of the distributed database system. The execution nodes can implement multiple threads that are authenticated and can download file data from a staging location concurrently.Type: GrantFiled: March 31, 2022Date of Patent: January 10, 2023Assignee: Snowflake Inc.Inventors: Elliott Brossard, Sukruth Komarla Sukumar, Isaac Kunen, Ju-yi Kuo, Jonathan Lee Leang, Edward Ma, Schuyler James Manchester, Polita Paulus, Saurin Shah, Igor Zinkovsky
-
Publication number: 20220391408Abstract: The embodiments described herein provide means for replicating external stages between deployments of e.g., a cloud data lake using a modified storage integration. The modified storage integration may be defined with multiple storage locations that it can point to, as well as a designation of an active storage location. The storage integration may also be defined with base file paths for each storage location as well as a relative file path which together may serve to synchronize data loading operations between deployments when e.g., a fail-over occurs from one deployment to another. The storage integration may be replicated from a first deployment to a second deployment, and when database replication occurs, an external stage may be replicated to the second deployment and bound to the replicated storage integration. Thus, a fail-over to the second deployment may result in a seamless transition of data loading processes to the second deployment.Type: ApplicationFiled: August 6, 2021Publication date: December 8, 2022Inventors: Subramanian Muralidhar, Polita Paulus, Sahaj Saini, Saurin Shah, Srinidhi Karthik Bisthavalli Srinivasa
-
Publication number: 20220391357Abstract: A file access system for user defined functions (UDFs) can be implemented on a distributed database system. The system can store UDF signatures and interfaces (e.g., classes, sub-classes) that can be called by other users. Upon a UDF being called, one or more interface objects (e.g., InputStream) can be created and requests transferred to a execution node via a network channel. The execution node can implement multiple threads that are authorized and download file data from a staging location (e.g., internal stage, external stage) concurrently.Type: ApplicationFiled: November 9, 2021Publication date: December 8, 2022Inventors: Elliott Brossard, Sukruth Komarla Sukumar, Isaac Kunen, Ju-yi Kuo, Jonathan Lee Leang, Edward Ma, Schuyler James Manchester, Polita Paulus, Saurin Shah, Igor Zinkovsky
-
Patent number: 11522860Abstract: A command to load or unload data at a storage location is received. In response to the command, a storage integration object associated with the storage location is identified. The storage integration object identifies a cloud identity object that corresponds to a cloud identity that is associated with a proxy identity object corresponding to a proxy identity granted permission to access the storage location. The data is loaded or unloaded at the storage location by assuming the proxy identity.Type: GrantFiled: January 28, 2022Date of Patent: December 6, 2022Assignee: Snowflake Inc.Inventors: Polita Paulus, Peter Povinec, Saurin Shah, Srinidhi Karthik Bisthavalli Srinivasa
-
Patent number: 11494438Abstract: Disclosed herein are systems and methods for population of a file-catalog table for a file stage in a user account on a data platform. In an embodiment, a data platform receives, from a client associated with a user account, a request to populate a file-catalog table of the user account based on a plurality of files stored in a file stage of the user account. The data platform responsively executes a list-files table function with respect to the file stage to generate a database-table object having a row for each file stored in the file stage. The data platform populates the file-catalog table of the user account based on the database-table object generated by the list-files table function.Type: GrantFiled: December 21, 2021Date of Patent: November 8, 2022Assignee: Snowflake Inc.Inventors: Polita Paulus, Aravind Ramarathinam, Saurin Shah, Sukruth Komarla Sukumar
-
Patent number: 11423081Abstract: A file access system for user defined functions (UDFs) can be implemented on a distributed database system. The system can store UDF signatures and interfaces (e.g., classes, sub-classes) that can be called by other users. Upon a UDF being called, one or more interface objects (e.g., InputStream) can be created and requests transferred to a execution node via a network channel. The execution node can implement multiple threads that are authorized and download file data from a staging location (e.g., internal stage, external stage) concurrently.Type: GrantFiled: August 1, 2021Date of Patent: August 23, 2022Assignee: Snowflake Inc.Inventors: Elliott Brossard, Sukruth Komarla Sukumar, Isaac Kunen, Ju-Yi Kuo, Jonathan Leang, Edward Ma, Schuyler James Manchester, Polita Paulus, Saurin Shah, Igor Zinkovsky