Patents by Inventor Shrikumar Hariharasubrahmanian

Shrikumar Hariharasubrahmanian has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

ANISOTROPIC COMPRESSION AS APPLIED TO COLUMNAR STORAGE FORMATS

Publication number: 20230101153

Abstract: Herein are spatially scalable techniques for anisotropic compression of shared entropy between alternate representations of same data. In an embodiment, a computer compresses an uncompressed independent column into a compressed independent column. Based on the compressed independent column, an uncompressed dependent column is compressed into a compressed dependent column. The compressed independent column and the compressed dependent column are stored in a same file. In an embodiment, a computer stores, in metadata, an encrypted private key for decrypting an encrypted column. The encrypted column and the metadata are stored in a file. A request to read the encrypted column is received. Based on a public key and the file, the encrypted private key is decrypted into a decrypted private key. The public key is contained in the request and/or the file. The request is executed by decrypting, based on the decrypted private key and the file, the encrypted column.

Type: Application

Filed: December 2, 2022

Publication date: March 30, 2023

Inventors: Shrikumar Hariharasubrahmanian, Jean-Pierre Dijcks, Jacco Draaijer
Anisotropic compression as applied to columnar storage formats

Patent number: 11562085

Abstract: Herein are spatially scalable techniques for anisotropic compression of shared entropy between alternate representations of same data. In an embodiment, a computer compresses an uncompressed independent column into a compressed independent column. Based on the compressed independent column, an uncompressed dependent column is compressed into a compressed dependent column. The compressed independent column and the compressed dependent column are stored in a same file. In an embodiment, a computer stores, in metadata, an encrypted private key for decrypting an encrypted column. The encrypted column and the metadata are stored in a file. A request to read the encrypted column is received. Based on a public key and the file, the encrypted private key is decrypted into a decrypted private key. The public key is contained in the request and/or the file. The request is executed by decrypting, based on the decrypted private key and the file, the encrypted column.

Type: Grant

Filed: October 17, 2019

Date of Patent: January 24, 2023

Assignee: Oracle International Corporation

Inventors: Shrikumar Hariharasubrahmanian, Jean-Pierre Dijcks, Jacco Draaijer
Probabilistic text index for semi-structured data in columnar analytics storage formats

Patent number: 11514697

Abstract: Herein is a probabilistic indexing technique for searching semi-structured text documents in columnar storage formats such as Parquet, using columnar input/output (I/O) avoidance, and needing minimal storage overhead. In an embodiment, a computer associates columns with text strings that occur in semi-structured documents. Text words that occur in the text strings are detected. Respectively for each text word, a bitmap, of a plurality of bitmaps, that contains a respective bit for each column is generated. Based on at least one of the bitmaps, some of the columns or some of the semi-structured documents are accessed.

Type: Grant

Filed: July 15, 2020

Date of Patent: November 29, 2022

Assignee: Oracle International Corporation

Inventors: Jian Wen, Hamed Ahmadi, Sanjay Jinturkar, Nipun Agarwal, Lijian Wan, Shrikumar Hariharasubrahmanian
Personal information indexing for columnar data storage format

Patent number: 11238035

Abstract: Techniques are described herein for indexing personal information in columnar data storage format based files. In an embodiment, row groups of rows that comprise a plurality of columns are stored in a set of files. Each column of a row group is stored in a chunk of column pages in the set of files. A regular expression index that indexes a particular column in the set of files is stored for each row group. The regular expression index identifies column pages in the chunk of the particular column that include a particular column value that satisfies a regular expression specified in a query. The regular expression specified in the query in evaluated against the particular column using the regular expression index.

Type: Grant

Filed: March 10, 2020

Date of Patent: February 1, 2022

Assignee: ORACLE INTERNATIONAL CORPORATION

Inventors: Hamed Ahmadi, Jian Wen, Shrikumar Hariharasubrahmanian, Sanjay Jinturkar, Nipun Agarwal
PROBABILISTIC TEXT INDEX FOR SEMI-STRUCTURED DATA IN COLUMNAR ANALYTICS STORAGE FORMATS

Publication number: 20220019784

Abstract: Herein is a probabilistic indexing technique for searching semi-structured text documents in columnar storage formats such as Parquet, using columnar input/output (I/O) avoidance, and needing minimal storage overhead. In an embodiment, a computer associates columns with text strings that occur in semi-structured documents. Text words that occur in the text strings are detected. Respectively for each text word, a bitmap, of a plurality of bitmaps, that contains a respective bit for each column is generated. Based on at least one of the bitmaps, some of the columns or some of the semi-structured documents are accessed.

Type: Application

Filed: July 15, 2020

Publication date: January 20, 2022

Inventors: Jian Wen, Hamed Ahmadi, Sanjay Jinturkar, Nipun Agarwal, Lijian Wan, Shrikumar Hariharasubrahmanian
PERSONAL INFORMATION INDEXING FOR COLUMNAR DATA STORAGE FORMAT

Publication number: 20210286806

Abstract: Techniques are described herein for indexing personal information in columnar data storage format based files. In an embodiment, row groups of rows that comprise a plurality of columns are stored in a set of files. Each column of a row group is stored in a chunk of column pages in the set of files. A regular expression index that indexes a particular column in the set of files is stored for each row group. The regular expression index identifies column pages in the chunk of the particular column that include a particular column value that satisfies a regular expression specified in a query. The regular expression specified in the query in evaluated against the particular column using the regular expression index.

Type: Application

Filed: March 10, 2020

Publication date: September 16, 2021

Inventors: Hamed Ahmadi, Jian Wen, Shrikumar Hariharasubrahmanian, Sanjay Jinturkar, Nipun Agarwal
Map of operations for ingesting external data

Patent number: 11074248

Abstract: Techniques are described for generating a map of operations to ingest external data. In an embodiment, DBMS receives a query schema that specifies an external data source and client-defined columns to arrange queried data from data fields of the external data source. DBMS generates an ingest schema for the queried data of the external data source such that the ingest schema specifies ingest data fields from the data fields of the external data source to be mapped to the one or more client-defined columns. Additionally, the DBMS identifies surplus data in the external data source, which has not been specified by the ingest schema and is different from the queried data defined by the client-requested query schema. The process not only ingests the ingest data fields from the external data source but also the surplus data from the external data source. In response to receiving a query according to a different query schema, DBMS generates a result set that based on the surplus data of the external data source.

Type: Grant

Filed: March 31, 2019

Date of Patent: July 27, 2021

Assignee: ORACLE INTERNATIONAL CORPORATION

Inventors: Shrikumar Hariharasubrahmanian, Michael Delorimier, Jarod Wen
MAP OF OPERATIONS FOR INGESTING EXTERNAL DATA

Publication number: 20200311063

Abstract: Techniques are described for generating a map of operations to ingest external data. In an embodiment, DBMS receives a query schema that specifies an external data source and client-defined columns to arrange queried data from data fields of the external data source. DBMS generates an ingest schema for the queried data of the external data source such that the ingest schema specifies ingest data fields from the data fields of the external data source to be mapped to the one or more client-defined columns. Additionally, the DBMS identifies surplus data in the external data source, which has not been specified by the ingest schema and is different from the queried data defined by the client-requested query schema. The process not only ingests the ingest data fields from the external data source but also the surplus data from the external data source. In response to receiving a query according to a different query schema, DBMS generates a result set that based on the surplus data of the external data source.

Type: Application

Filed: March 31, 2019

Publication date: October 1, 2020

Inventors: SHRIKUMAR HARIHARASUBRAHMANIAN, MICHAEL DELORIMIER, JAROD WEN
Entropy sharing across multiple compression streams

Patent number: 10681106

Abstract: According to one or more embodiments, multiple related data streams are compressed jointly, such that substantive similarities between the multiple related streams are leveraged to reduce the overall size of the resulting compressed data. Specifically, given the compressed version of a primary data stream, a secondary data stream may be highly compressed by utilizing portions of the compressed version of the primary data stream to represent similar portions of the secondary data stream. A compression application is configured to receive (either concurrently or separately) information identifying multiple related input streams. For each input stream provided to the compression application, the application outputs one output stream. The computing system writes the output streams of the compression application to storage. Embodiments allow the size of the compressed version of a secondary data stream to be greatly reduced compared to the size of the stream compressed without reference to a primary data stream.

Type: Grant

Filed: February 28, 2018

Date of Patent: June 9, 2020

Assignee: Oracle International Corporation

Inventors: Shrikumar Hariharasubrahmanian, Michael Delorimier
ANISOTROPIC COMPRESSION AS APPLIED TO COLUMNAR STORAGE FORMATS

Publication number: 20200125751

Abstract: Herein are spatially scalable techniques for anisotropic compression of shared entropy between alternate representations of same data. In an embodiment, a computer compresses an uncompressed independent column into a compressed independent column. Based on the compressed independent column, an uncompressed dependent column is compressed into a compressed dependent column. The compressed independent column and the compressed dependent column are stored in a same file. In an embodiment, a computer stores, in metadata, an encrypted private key for decrypting an encrypted column. The encrypted column and the metadata are stored in a file. A request to read the encrypted column is received. Based on a public key and the file, the encrypted private key is decrypted into a decrypted private key. The public key is contained in the request and/or the file. The request is executed by decrypting, based on the decrypted private key and the file, the encrypted column.

Type: Application

Filed: October 17, 2019

Publication date: April 23, 2020

Inventors: Shrikumar Hariharasubrahmanian, Jean-Pierre Dijcks, Jacco Draaijer
ENTROPY SHARING ACROSS MULTIPLE COMPRESSION STREAMS

Publication number: 20190098064

Abstract: According to one or more embodiments, multiple related data streams are compressed jointly, such that substantive similarities between the multiple related streams are leveraged to reduce the overall size of the resulting compressed data. Specifically, given the compressed version of a primary data stream, a secondary data stream may be highly compressed by utilizing portions of the compressed version of the primary data stream to represent similar portions of the secondary data stream. A compression application is configured to receive (either concurrently or separately) information identifying multiple related input streams. For each input stream provided to the compression application, the application outputs one output stream. The computing system writes the output streams of the compression application to storage. Embodiments allow the size of the compressed version of a secondary data stream to be greatly reduced compared to the size of the stream compressed without reference to a primary data stream.

Type: Application

Filed: February 28, 2018

Publication date: March 28, 2019

Inventors: Shrikumar Hariharasubrahmanian, Michael Delorimier
Systems and methods for combined protocol processing protocols

Patent number: 7197046

Abstract: Methods and systems for processing data packets enable a communication device having limited memory to participate in network protocols. One communicating device formats packets in accordance with the specifications of one or two or more communication protocols, with one or more additional constraints, and transmits it to an other communication device. The other communication device performs a specified processing of the received packet to generate a reply packet in response to the received packet, which conforms to the one of two or more communication protocols, while also satisfying one or more additional formatting constraints.

Type: Grant

Filed: August 7, 2000

Date of Patent: March 27, 2007

Inventor: Shrikumar Hariharasubrahmanian
Virtual memory systems and methods

Patent number: 7185114

Abstract: A network (100) includes a first communication device (110) and a second communication device (120). The first communication device (110) receives a first packet containing information for the first communication device (110) and determines whether adequate resources exist for handling the packet. When inadequate resources are determined to exist, the first communication device (110) generates a second packet, stores the information in the second packet, and transmits the second packet. The second communication device (120) receives the second packet, generates a third packet, transfers the information to the third packet, and transmits the third packet to the first communication device (110). The first communication device (110) receives the third packet and processes the information in the third packet.

Type: Grant

Filed: August 4, 2000

Date of Patent: February 27, 2007

Inventor: Shrikumar Hariharasubrahmanian
Systems and methods for transmitting data packets

Patent number: 7009967

Abstract: Methods and systems for transmitting data packets enable a communication device having limited memory to transmit data files in a packet-based network. The communication device partitions the sequence number field transmitted with a data packet into multiple portions. The communication device then generates a sequence number corresponding to the portions, where at least one portion identifies a particular segment of the data file. The communication device then transmits a data packet including a segment of a data file and the sequence number to a receiving device. When the communication device receives the acknowledgement packet from the receiving device, the communication device may determine which segment of the data file to transmit next.

Type: Grant

Filed: August 4, 2000

Date of Patent: March 7, 2006

Inventor: Shrikumar Hariharasubrahmanian
Systems and methods for predicting fields in a data packet

Patent number: 6961777

Abstract: A system predicts one or more fields of a packet that includes multiple fields. Each of the fields stores a value. The system receives one or more of the fields of the packet and determines the value of at least one of the received fields (710). The system predicts a value of at least one other one of the fields of the packet based on the value of the received field before the other field is received (715). The system then processes the packet based on the received fields and the predicted field(s) (720).

Type: Grant

Filed: August 4, 2000

Date of Patent: November 1, 2005

Inventor: Shrikumar Hariharasubrahmanian
Systems and methods for predicting data fields in layered protocols

Patent number: 6819681

Abstract: A system for predicting one or more fields of a packet that includes multiple fields, the packet belonging to a set of packets broken up into one or more packets by the process of fragmentation or segmentation. Each of the fields stores a value. The system receives one or more of the fields of the packet and determines the value of at least one of the received fields. The system predicts a value of at least one other one of the fields of the packet based on the value of the received field before the other field is received. The system then processes the packet based on the received fields and the predicted field(s).

Type: Grant

Filed: August 7, 2000

Date of Patent: November 16, 2004

Inventor: Shrikumar Hariharasubrahmanian