Patents by Inventor Aleksandras Surna

Aleksandras Surna has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Performance Optimizations for Row-Level Security Filters

Publication number: 20230169121

Abstract: A method for optimizing a database query includes receiving a database query from a user requesting a database to conditionally return one or more rows stored at the database and characterizing one or more user filters. Each user filter filters rows from the database. The method includes determining that a security filter restricts the user from accessing one or more rows of the database. For each respective user filter, the method includes classifying the respective user filter as a safe user filter incapable of reporting an error or an unsafe user filter capable of reporting an error. The method also includes determining a filter execution order of the one or more user filters and the security filter based on the classified one or more user filters and executing, using the determined filter execution order, the one or more user filters and the security filter.

Type: Application

Filed: December 1, 2021

Publication date: June 1, 2023

Applicant: Google LLC

Inventors: Jonah Tang Soon Yuen, Mosha Pasumansky, Kwanho Ryu, Daniel Thurman Asheghian, Aleksandras Surna, Thibaud Baptiste Hottelier
Execution-Time Dynamic Range Partitioning Transformations

Publication number: 20220358142

Abstract: An example method includes receiving a data load request requesting loading and partitioning of an unknown quantity of user data for storage at a data storage system. The user data including a partitioning key; a total data size of the user data; a plurality of rows, each row of the plurality of rows associated with a value defined by the partitioning key; and one or more columns. The method also includes identifying one or more storage constraints for the data storage system. The method further includes, after receiving the user data, determining a plurality of partitioning quantiles defining respective ranges of values of the partitioning key based on the user data and the one or more storage constraints for the data storage system; and range partitioning each row of the user data into files based on the value associated with the row defined by the partitioning key, and the respective ranges of the values of the partitioning key defined by the plurality of partitioning quantiles.

Type: Application

Filed: July 25, 2022

Publication date: November 10, 2022

Applicant: Google LLC

Inventors: Seyed Omid Fatemieh, Mikhail Entin, Adrian Baras, Pavan Edara, Aleksandras Surna
Execution-time dynamic range partitioning transformations

Patent number: 11423049

Abstract: A method for execution-time dynamic range partitioning includes receiving user data including a partitioning key and a clustering key. The user data includes a respective number of total rows defining a total data size for the user data. The method also includes identifying storage constraints for the data storage system. The storage constraints include a target file size and a target number of rows per file. The method further includes determining a plurality of split points for the user data based on the storage constraints. The method also includes generating partitioning quantiles from the plurality of split points that define a range between each split point of the plurality of split points. The method further includes range partitioning each row of the user data into files using the partitioning quantiles.

Type: Grant

Filed: May 11, 2020

Date of Patent: August 23, 2022

Assignee: Google LLC

Inventors: Seyed Omid Fatemieh, Mikhail Entin, Adrian Baras, Pavan Edara, Aleksandras Surna
Execution-Time Dynamic Range Partitioning Transformations

Publication number: 20210349648

Abstract: A method for execution-time dynamic range partitioning includes receiving user data including a partitioning key and a clustering key. The user data includes a respective number of total rows defining a total data size for the user data. The method also includes identifying storage constraints for the data storage system. The storage constraints include a target file size and a target number of rows per file. The method further includes determining a plurality of split points for the user data based on the storage constraints. The method also includes generating partitioning quantiles from the plurality of split points that define a range between each split point of the plurality of split points. The method further includes range partitioning each row of the user data into files using the partitioning quantiles.

Type: Application

Filed: May 11, 2020

Publication date: November 11, 2021

Applicant: Google LLC

Inventors: Seyed Omid Fatemieh, Mikhail Entin, Adrian Baras, Pavan Edara, Aleksandras Surna
Scalable index build techniques for column stores

Patent number: 10860556

Abstract: Architecture that includes an index creation algorithm that utilizes available resources and dynamically adjusts to successfully scale with increased resources and be able to do so for any data distribution. The resources can be processing resources, memory, and/or input/output, for example. A finer level of granularity, called a segment, is utilized to process tuples in a partition while creating an index. The segment also aligns with compression techniques for the index. By choosing an appropriate size for a segment and using load balancing the overall time for index creation can be reduced. Each segment can then be processed by a single thread thereby limiting segment skew. Skew is further limited by breaking down the work done by a thread into parallelizable stages.

Type: Grant

Filed: January 10, 2019

Date of Patent: December 8, 2020

Assignee: Microsoft Technology Licensing, LLC

Inventors: Artem Oks, Qingqing Zhou, Srikumar Rangarajan, Aleksandras Surna, Wey Guy, Cipri Clinciu, Ashit Gosalia
Scaling computing clusters in a distributed computing system

Patent number: 10394611

Abstract: A currently operating computing cluster that has multiple nodes storing cluster data may be scaled. A cluster scaling request may be received for the current cluster indicating a change in a number or type of nodes in the current cluster. In response to receiving the cluster scaling request, a new cluster may be created as indicated in the cluster scaling request, a copy of the cluster data from the current cluster to the nodes in the new cluster may be initiated, a network endpoint for the current cluster may be moved to the new cluster, and the current cluster may be disabled. The current cluster may, in some embodiments, respond to read access requests during the copy of the cluster data.

Type: Grant

Filed: November 26, 2012

Date of Patent: August 27, 2019

Assignee: Amazon Technologies, Inc.

Inventors: Venkata Harish Mallipeddi, Aleksandras Surna
SCALABLE INDEX BUILD TECHNIQUES FOR COLUMN STORES

Publication number: 20190205291

Abstract: Architecture that includes an index creation algorithm that utilizes available resources and dynamically adjusts to successfully scale with increased resources and be able to do so for any data distribution. The resources can be processing resources, memory, and/or input/output, for example. A finer level of granularity, called a segment, is utilized to process tuples in a partition while creating an index. The segment also aligns with compression techniques for the index. By choosing an appropriate size for a segment and using load balancing the overall time for index creation can be reduced. Each segment can then be processed by a single thread thereby limiting segment skew. Skew is further limited by breaking down the work done by a thread into parallelizable stages.

Type: Application

Filed: January 10, 2019

Publication date: July 4, 2019

Inventors: Artem Oks, Qingqing Zhou, Srikumar Rangarajan, Aleksandras Surna, Wey Guy, Cipri Clinciu, Ashit Gosalia
Global query hint specification

Patent number: 10311055

Abstract: A query optimizer may receive a query (e.g., from a source that generated the query). Input that specifies both a query hint string and a hint may be received to a hint specification interface. The hint may be applied to the query, from outside the query, to optimize a query execution plan. Applying the hint may be based, at least in part, on a query hint string. For example, which query block is associated with the query hint string may be determined. Upon such a determination, the hint may be applied to the determined query block.

Type: Grant

Filed: May 9, 2016

Date of Patent: June 4, 2019

Assignee: Amazon Technologies, Inc.

Inventors: Neil Thombre, Anurag Windlass Gupta, Stefano Stefani, Aleksandras Surna
Scalable index build techniques for column stores

Patent number: 10216777

Abstract: Architecture that includes an index creation algorithm that utilizes available resources and dynamically adjusts to successfully scale with increased resources and be able to do so for any data distribution. The resources can be processing resources, memory, and/or input/output, for example. A finer level of granularity, called a segment, is utilized to process tuples in a partition while creating an index. The segment also aligns with compression techniques for the index. By choosing an appropriate size for a segment and using load balancing the overall time for index creation can be reduced. Each segment can then be processed by a single thread thereby limiting segment skew. Skew is further limited by breaking down the work done by a thread into parallelizable stages.

Type: Grant

Filed: January 16, 2017

Date of Patent: February 26, 2019

Assignee: Microsoft Technology Licensing, LLC

Inventors: Artem Oks, Qingqing Zhou, Srikumar Rangarajan, Aleksandras Surna, Wey Guy, Cipri Clinciu, Ashit Gosalia
SCALABLE INDEX BUILD TECHNIQUES FOR COLUMN STORES

Publication number: 20170124128

Abstract: Architecture that includes an index creation algorithm that utilizes available resources and dynamically adjusts to successfully scale with increased resources and be able to do so for any data distribution. The resources can be processing resources, memory, and/or input/output, for example. A finer level of granularity, called a segment, is utilized to process tuples in a partition while creating an index. The segment also aligns with compression techniques for the index. By choosing an appropriate size for a segment and using load balancing the overall time for index creation can be reduced. Each segment can then be processed by a single thread thereby limiting segment skew. Skew is further limited by breaking down the work done by a thread into parallelizable stages.

Type: Application

Filed: January 16, 2017

Publication date: May 4, 2017

Inventors: Artem Oks, Qingqing Zhou, Srikumar Rangarajan, Aleksandras Surna, Wey Guy, Cipri Clinciu, Ashit Gosalia
NORMALIZING DATA FOR FAST SUPERSCALAR PROCESSING

Publication number: 20170124125

Abstract: A data normalization system is described herein that represents multiple data types that are common within database systems in a normalized form that can be processed uniformly to achieve faster processing of data on superscalar CPU architectures. The data normalization system includes changes to internal data representations of a database system as well as functional processing changes that leverage normalized internal data representations for a high density of independently executable CPU instructions. Because most data in a database is small, a majority of data can be represented by the normalized format. Thus, the data normalization system allows for fast superscalar processing in a database system in a variety of common cases, while maintaining compatibility with existing data sets.

Type: Application

Filed: January 18, 2017

Publication date: May 4, 2017

Inventors: Aleksandras Surna, Erik Ismert, Srikumar Rangarajan, Nimish Khanolkar
Normalizing data for fast superscalar processing

Patent number: 9582467

Abstract: A data normalization system is described herein that represents multiple data types that are common within database systems in a normalized form that can be processed uniformly to achieve faster processing of data on superscalar CPU architectures. The data normalization system includes changes to internal data representations of a database system as well as functional processing changes that leverage normalized internal data representations for a high density of independently executable CPU instructions. Because most data in a database is small, a majority of data can be represented by the normalized format. Thus, the data normalization system allows for fast superscalar processing in a database system in a variety of common cases, while maintaining compatibility with existing data sets.

Type: Grant

Filed: May 3, 2015

Date of Patent: February 28, 2017

Assignee: Microsoft Technology Licensing, LLC

Inventors: Aleksandras Surna, Erik Ismert, Srikumar Rangarajan, Nimish S. Khanolkar
Normalizing data for fast superscalar processing

Patent number: 9547674

Abstract: A data normalization system is described herein that represents multiple data types that are common within database systems in a normalized form that can be processed uniformly to achieve faster processing of data on superscalar CPU architectures. The data normalization system includes changes to internal data representations of a database system as well as functional processing changes that leverage normalized internal data representations for a high density of independently executable CPU instructions. Because most data in a database is small, a majority of data can be represented by the normalized format. Thus, the data normalization system allows for fast superscalar processing in a database system in a variety of common cases, while maintaining compatibility with existing data sets.

Type: Grant

Filed: January 13, 2014

Date of Patent: January 17, 2017

Assignee: Microsoft Technology Licensing, LLC

Inventors: Aleksandras Surna, Erik Ismert, Srikumar Rangarajan, Nimish S. Khanolkar
Scalable index build techniques for column stores

Patent number: 9547677

Abstract: Architecture that includes an index creation algorithm that utilizes available resources and dynamically adjusts to successfully scale with increased resources and be able to do so for any data distribution. The resources can be processing resources, memory, and/or input/output, for example. A finer level of granularity, called a segment, is utilized to process tuples in a partition while creating an index. The segment also aligns with compression techniques for the index. By choosing an appropriate size for a segment and using load balancing the overall time for index creation can be reduced. Each segment can then be processed by a single thread thereby limiting segment skew. Skew is further limited by breaking down the work done by a thread into parallelizable stages.

Type: Grant

Filed: March 18, 2015

Date of Patent: January 17, 2017

Assignee: Microsoft Technology Licensing, LLC

Inventors: Artem Oks, Qingqing Zhou, Srikumar Rangarajan, Aleksandras Surna, Wey Guy, Cipri Clinciu, Ashit Gosalia
Streaming restore of a database from a backup system

Patent number: 9449038

Abstract: A distributed data warehouse system may maintain data blocks on behalf of clients in multiple clusters in a data store. Each cluster may include a single leader node and multiple compute nodes, each including multiple disks storing data. The warehouse system may store primary and secondary copies of each data block on different disks or nodes in a cluster. Each node may include a data structure that maintains metadata about each data block stored on the node, including its unique identifier. The warehouse system may back up data blocks in a remote key-value backup storage system with high durability. A streaming restore operation may be used to retrieve data blocks from backup storage using their unique identifiers as keys. The warehouse system may service incoming queries (and may satisfy some queries by retrieving data from backup storage on an as-needed basis) prior to completion of the restore operation.

Type: Grant

Filed: March 11, 2013

Date of Patent: September 20, 2016

Assignee: Amazon Technologies, Inc.

Inventors: Anurag Windlass Gupta, Jakub Kulesza, Deepak Agarwal, Aleksandras Surna, Tushar Jain, Zelaine Fong, Stefano Stefani
GLOBAL QUERY HINT SPECIFICATION

Publication number: 20160253385

Abstract: A query optimizer may receive a query (e.g., from a source that generated the query). Input that specifies both a query hint string and a hint may be received to a hint specification interface. The hint may be applied to the query, from outside the query, to optimize a query execution plan. Applying the hint may be based, at least in part, on a query hint string. For example, which query block is associated with the query hint string may be determined. Upon such a determination, the hint may be applied to the determined query block.

Type: Application

Filed: May 9, 2016

Publication date: September 1, 2016

Applicant: Amazon Technologies, Inc.

Inventors: Neil Thombre, Anurag Windlass Gupta, Stefano Stefani, Aleksandras Surna
Global query hint specification

Patent number: 9336272

Abstract: A query optimizer may receive a query (e.g., from a source that generated the query). Input that specifies both a query hint string and a hint may be received to a hint specification interface. The hint may be applied to the query, from outside the query, to optimize a query execution plan. Applying the hint may be based, at least in part, on a query hint string. For example, which query block is associated with the query hint string may be determined. Upon such a determination, the hint may be applied to the determined query block.

Type: Grant

Filed: February 13, 2013

Date of Patent: May 10, 2016

Assignee: Amazon Technologies, Inc.

Inventors: Neil Thombre, Anurag Windlass Gupta, Stefano Stefani, Aleksandras Surna
NORMALIZING DATA FOR FAST SUPERSCALAR PROCESSING

Publication number: 20150234778

Abstract: A data normalization system is described herein that represents multiple data types that are common within database systems in a normalized form that can be processed uniformly to achieve faster processing of data on superscalar CPU architectures. The data normalization system includes changes to internal data representations of a database system as well as functional processing changes that leverage normalized internal data representations for a high density of independently executable CPU instructions. Because most data in a database is small, a majority of data can be represented by the normalized format. Thus, the data normalization system allows for fast superscalar processing in a database system in a variety of common cases, while maintaining compatibility with existing data sets.

Type: Application

Filed: May 3, 2015

Publication date: August 20, 2015

Inventors: Aleksandras Surna, Erik Ismert, Srikumar Rangarajan, Nimish S. Khanolkar
Scalable index build techniques for column stores

Patent number: 8990216

Abstract: Architecture that includes an index creation algorithm that utilizes available resources and dynamically adjusts to successfully scale with increased resources and be able to do so for any data distribution. The resources can be processing resources, memory, and/or input/output, for example. A finer level of granularity, called a segment, is utilized to process tuples in a partition while creating an index. The segment also aligns with compression techniques for the index. By choosing an appropriate size for a segment and using load balancing the overall time for index creation can be reduced. Each segment can then be processed by a single thread thereby limiting segment skew. Skew is further limited by breaking down the work done by a thread into parallelizable stages.

Type: Grant

Filed: March 19, 2010

Date of Patent: March 24, 2015

Assignee: Microsoft Corporation

Inventors: Artem Oks, Qingqing Zhou, Srikumar Rangarajan, Aleksandras Surna, Wey Guy, Cipri Clinciu, Ashit Gosalia
NORMALIZING DATA FOR FAST SUPERSCALAR PROCESSING

Publication number: 20140129525

Abstract: A data normalization system is described herein that represents multiple data types that are common within database systems in a normalized form that can be processed uniformly to achieve faster processing of data on superscalar CPU architectures. The data normalization system includes changes to internal data representations of a database system as well as functional processing changes that leverage normalized internal data representations for a high density of independently executable CPU instructions. Because most data in a database is small, a majority of data can be represented by the normalized format. Thus, the data normalization system allows for fast superscalar processing in a database system in a variety of common cases, while maintaining compatibility with existing data sets.

Type: Application

Filed: January 13, 2014

Publication date: May 8, 2014

Applicant: Microsoft Corporation

Inventors: Aleksandras Surna, Erik Ismert, Srikumar Rangarajan, Nimish S. Khanolkar

1 2 next