Patents by Inventor Shrikumar Hariharasubrahmanian

Shrikumar Hariharasubrahmanian has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20230101153
    Abstract: Herein are spatially scalable techniques for anisotropic compression of shared entropy between alternate representations of same data. In an embodiment, a computer compresses an uncompressed independent column into a compressed independent column. Based on the compressed independent column, an uncompressed dependent column is compressed into a compressed dependent column. The compressed independent column and the compressed dependent column are stored in a same file. In an embodiment, a computer stores, in metadata, an encrypted private key for decrypting an encrypted column. The encrypted column and the metadata are stored in a file. A request to read the encrypted column is received. Based on a public key and the file, the encrypted private key is decrypted into a decrypted private key. The public key is contained in the request and/or the file. The request is executed by decrypting, based on the decrypted private key and the file, the encrypted column.
    Type: Application
    Filed: December 2, 2022
    Publication date: March 30, 2023
    Inventors: Shrikumar Hariharasubrahmanian, Jean-Pierre Dijcks, Jacco Draaijer
  • Patent number: 11562085
    Abstract: Herein are spatially scalable techniques for anisotropic compression of shared entropy between alternate representations of same data. In an embodiment, a computer compresses an uncompressed independent column into a compressed independent column. Based on the compressed independent column, an uncompressed dependent column is compressed into a compressed dependent column. The compressed independent column and the compressed dependent column are stored in a same file. In an embodiment, a computer stores, in metadata, an encrypted private key for decrypting an encrypted column. The encrypted column and the metadata are stored in a file. A request to read the encrypted column is received. Based on a public key and the file, the encrypted private key is decrypted into a decrypted private key. The public key is contained in the request and/or the file. The request is executed by decrypting, based on the decrypted private key and the file, the encrypted column.
    Type: Grant
    Filed: October 17, 2019
    Date of Patent: January 24, 2023
    Assignee: Oracle International Corporation
    Inventors: Shrikumar Hariharasubrahmanian, Jean-Pierre Dijcks, Jacco Draaijer
  • Patent number: 11514697
    Abstract: Herein is a probabilistic indexing technique for searching semi-structured text documents in columnar storage formats such as Parquet, using columnar input/output (I/O) avoidance, and needing minimal storage overhead. In an embodiment, a computer associates columns with text strings that occur in semi-structured documents. Text words that occur in the text strings are detected. Respectively for each text word, a bitmap, of a plurality of bitmaps, that contains a respective bit for each column is generated. Based on at least one of the bitmaps, some of the columns or some of the semi-structured documents are accessed.
    Type: Grant
    Filed: July 15, 2020
    Date of Patent: November 29, 2022
    Assignee: Oracle International Corporation
    Inventors: Jian Wen, Hamed Ahmadi, Sanjay Jinturkar, Nipun Agarwal, Lijian Wan, Shrikumar Hariharasubrahmanian
  • Patent number: 11238035
    Abstract: Techniques are described herein for indexing personal information in columnar data storage format based files. In an embodiment, row groups of rows that comprise a plurality of columns are stored in a set of files. Each column of a row group is stored in a chunk of column pages in the set of files. A regular expression index that indexes a particular column in the set of files is stored for each row group. The regular expression index identifies column pages in the chunk of the particular column that include a particular column value that satisfies a regular expression specified in a query. The regular expression specified in the query in evaluated against the particular column using the regular expression index.
    Type: Grant
    Filed: March 10, 2020
    Date of Patent: February 1, 2022
    Assignee: ORACLE INTERNATIONAL CORPORATION
    Inventors: Hamed Ahmadi, Jian Wen, Shrikumar Hariharasubrahmanian, Sanjay Jinturkar, Nipun Agarwal
  • Publication number: 20220019784
    Abstract: Herein is a probabilistic indexing technique for searching semi-structured text documents in columnar storage formats such as Parquet, using columnar input/output (I/O) avoidance, and needing minimal storage overhead. In an embodiment, a computer associates columns with text strings that occur in semi-structured documents. Text words that occur in the text strings are detected. Respectively for each text word, a bitmap, of a plurality of bitmaps, that contains a respective bit for each column is generated. Based on at least one of the bitmaps, some of the columns or some of the semi-structured documents are accessed.
    Type: Application
    Filed: July 15, 2020
    Publication date: January 20, 2022
    Inventors: Jian Wen, Hamed Ahmadi, Sanjay Jinturkar, Nipun Agarwal, Lijian Wan, Shrikumar Hariharasubrahmanian
  • Publication number: 20210286806
    Abstract: Techniques are described herein for indexing personal information in columnar data storage format based files. In an embodiment, row groups of rows that comprise a plurality of columns are stored in a set of files. Each column of a row group is stored in a chunk of column pages in the set of files. A regular expression index that indexes a particular column in the set of files is stored for each row group. The regular expression index identifies column pages in the chunk of the particular column that include a particular column value that satisfies a regular expression specified in a query. The regular expression specified in the query in evaluated against the particular column using the regular expression index.
    Type: Application
    Filed: March 10, 2020
    Publication date: September 16, 2021
    Inventors: Hamed Ahmadi, Jian Wen, Shrikumar Hariharasubrahmanian, Sanjay Jinturkar, Nipun Agarwal
  • Patent number: 11074248
    Abstract: Techniques are described for generating a map of operations to ingest external data. In an embodiment, DBMS receives a query schema that specifies an external data source and client-defined columns to arrange queried data from data fields of the external data source. DBMS generates an ingest schema for the queried data of the external data source such that the ingest schema specifies ingest data fields from the data fields of the external data source to be mapped to the one or more client-defined columns. Additionally, the DBMS identifies surplus data in the external data source, which has not been specified by the ingest schema and is different from the queried data defined by the client-requested query schema. The process not only ingests the ingest data fields from the external data source but also the surplus data from the external data source. In response to receiving a query according to a different query schema, DBMS generates a result set that based on the surplus data of the external data source.
    Type: Grant
    Filed: March 31, 2019
    Date of Patent: July 27, 2021
    Assignee: ORACLE INTERNATIONAL CORPORATION
    Inventors: Shrikumar Hariharasubrahmanian, Michael Delorimier, Jarod Wen
  • Publication number: 20200311063
    Abstract: Techniques are described for generating a map of operations to ingest external data. In an embodiment, DBMS receives a query schema that specifies an external data source and client-defined columns to arrange queried data from data fields of the external data source. DBMS generates an ingest schema for the queried data of the external data source such that the ingest schema specifies ingest data fields from the data fields of the external data source to be mapped to the one or more client-defined columns. Additionally, the DBMS identifies surplus data in the external data source, which has not been specified by the ingest schema and is different from the queried data defined by the client-requested query schema. The process not only ingests the ingest data fields from the external data source but also the surplus data from the external data source. In response to receiving a query according to a different query schema, DBMS generates a result set that based on the surplus data of the external data source.
    Type: Application
    Filed: March 31, 2019
    Publication date: October 1, 2020
    Inventors: SHRIKUMAR HARIHARASUBRAHMANIAN, MICHAEL DELORIMIER, JAROD WEN
  • Patent number: 10681106
    Abstract: According to one or more embodiments, multiple related data streams are compressed jointly, such that substantive similarities between the multiple related streams are leveraged to reduce the overall size of the resulting compressed data. Specifically, given the compressed version of a primary data stream, a secondary data stream may be highly compressed by utilizing portions of the compressed version of the primary data stream to represent similar portions of the secondary data stream. A compression application is configured to receive (either concurrently or separately) information identifying multiple related input streams. For each input stream provided to the compression application, the application outputs one output stream. The computing system writes the output streams of the compression application to storage. Embodiments allow the size of the compressed version of a secondary data stream to be greatly reduced compared to the size of the stream compressed without reference to a primary data stream.
    Type: Grant
    Filed: February 28, 2018
    Date of Patent: June 9, 2020
    Assignee: Oracle International Corporation
    Inventors: Shrikumar Hariharasubrahmanian, Michael Delorimier
  • Publication number: 20200125751
    Abstract: Herein are spatially scalable techniques for anisotropic compression of shared entropy between alternate representations of same data. In an embodiment, a computer compresses an uncompressed independent column into a compressed independent column. Based on the compressed independent column, an uncompressed dependent column is compressed into a compressed dependent column. The compressed independent column and the compressed dependent column are stored in a same file. In an embodiment, a computer stores, in metadata, an encrypted private key for decrypting an encrypted column. The encrypted column and the metadata are stored in a file. A request to read the encrypted column is received. Based on a public key and the file, the encrypted private key is decrypted into a decrypted private key. The public key is contained in the request and/or the file. The request is executed by decrypting, based on the decrypted private key and the file, the encrypted column.
    Type: Application
    Filed: October 17, 2019
    Publication date: April 23, 2020
    Inventors: Shrikumar Hariharasubrahmanian, Jean-Pierre Dijcks, Jacco Draaijer
  • Publication number: 20190098064
    Abstract: According to one or more embodiments, multiple related data streams are compressed jointly, such that substantive similarities between the multiple related streams are leveraged to reduce the overall size of the resulting compressed data. Specifically, given the compressed version of a primary data stream, a secondary data stream may be highly compressed by utilizing portions of the compressed version of the primary data stream to represent similar portions of the secondary data stream. A compression application is configured to receive (either concurrently or separately) information identifying multiple related input streams. For each input stream provided to the compression application, the application outputs one output stream. The computing system writes the output streams of the compression application to storage. Embodiments allow the size of the compressed version of a secondary data stream to be greatly reduced compared to the size of the stream compressed without reference to a primary data stream.
    Type: Application
    Filed: February 28, 2018
    Publication date: March 28, 2019
    Inventors: Shrikumar Hariharasubrahmanian, Michael Delorimier
  • Patent number: 7197046
    Abstract: Methods and systems for processing data packets enable a communication device having limited memory to participate in network protocols. One communicating device formats packets in accordance with the specifications of one or two or more communication protocols, with one or more additional constraints, and transmits it to an other communication device. The other communication device performs a specified processing of the received packet to generate a reply packet in response to the received packet, which conforms to the one of two or more communication protocols, while also satisfying one or more additional formatting constraints.
    Type: Grant
    Filed: August 7, 2000
    Date of Patent: March 27, 2007
    Inventor: Shrikumar Hariharasubrahmanian
  • Patent number: 7185114
    Abstract: A network (100) includes a first communication device (110) and a second communication device (120). The first communication device (110) receives a first packet containing information for the first communication device (110) and determines whether adequate resources exist for handling the packet. When inadequate resources are determined to exist, the first communication device (110) generates a second packet, stores the information in the second packet, and transmits the second packet. The second communication device (120) receives the second packet, generates a third packet, transfers the information to the third packet, and transmits the third packet to the first communication device (110). The first communication device (110) receives the third packet and processes the information in the third packet.
    Type: Grant
    Filed: August 4, 2000
    Date of Patent: February 27, 2007
    Inventor: Shrikumar Hariharasubrahmanian
  • Patent number: 7009967
    Abstract: Methods and systems for transmitting data packets enable a communication device having limited memory to transmit data files in a packet-based network. The communication device partitions the sequence number field transmitted with a data packet into multiple portions. The communication device then generates a sequence number corresponding to the portions, where at least one portion identifies a particular segment of the data file. The communication device then transmits a data packet including a segment of a data file and the sequence number to a receiving device. When the communication device receives the acknowledgement packet from the receiving device, the communication device may determine which segment of the data file to transmit next.
    Type: Grant
    Filed: August 4, 2000
    Date of Patent: March 7, 2006
    Inventor: Shrikumar Hariharasubrahmanian
  • Patent number: 6961777
    Abstract: A system predicts one or more fields of a packet that includes multiple fields. Each of the fields stores a value. The system receives one or more of the fields of the packet and determines the value of at least one of the received fields (710). The system predicts a value of at least one other one of the fields of the packet based on the value of the received field before the other field is received (715). The system then processes the packet based on the received fields and the predicted field(s) (720).
    Type: Grant
    Filed: August 4, 2000
    Date of Patent: November 1, 2005
    Inventor: Shrikumar Hariharasubrahmanian
  • Patent number: 6819681
    Abstract: A system for predicting one or more fields of a packet that includes multiple fields, the packet belonging to a set of packets broken up into one or more packets by the process of fragmentation or segmentation. Each of the fields stores a value. The system receives one or more of the fields of the packet and determines the value of at least one of the received fields. The system predicts a value of at least one other one of the fields of the packet based on the value of the received field before the other field is received. The system then processes the packet based on the received fields and the predicted field(s).
    Type: Grant
    Filed: August 7, 2000
    Date of Patent: November 16, 2004
    Inventor: Shrikumar Hariharasubrahmanian