Patents by Inventor Stratos Idreos

Stratos Idreos has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11675694
    Abstract: Embodiments of the invention utilize an improved LSM-tree-based key-value approach to strike the optimal balance between the costs of updates and lookups and storage space. The improved approach involves use of a new merge policy that removes merge operations from all but the largest levels of LSM-tree. In addition, the improved approach may include an improved LSM-tree that allows separate control over the frequency of merge operations for the largest level and for all other levels. By adjusting various parameters, such as the storage capacity of the largest level, the storage capacity of the other smaller levels, and/or the size ratio between adjacent levels in the improved LSM-tree, the improved LSM-tree-based key-value approach may maximize throughput for a particular workload.
    Type: Grant
    Filed: July 15, 2021
    Date of Patent: June 13, 2023
    Assignee: President and Fellows of Harvard College
    Inventors: Stratos Idreos, Niv Dayan
  • Patent number: 11567952
    Abstract: Embodiments of the invention utilize a “data canopy” that breaks statistical measures down to basic primitives for various data portions and stores the basic aggregates in a library within an in-memory data structure. When a queried statistical measure involves a basic aggregate stored in the library over a data portion that at least partially overlaps the data portion associated with the basic aggregate, the basic aggregate may be reused in the statistical computation of the queried measure.
    Type: Grant
    Filed: April 23, 2018
    Date of Patent: January 31, 2023
    Assignee: PRESIDENT AND FELLOWS OF HARVARD COLLEGE
    Inventors: Stratos Idreos, Wasay Abdul, Niv Dayan
  • Patent number: 11563803
    Abstract: Embodiments of the invention utilize an optimized key-value storage engine to strike the optimal balance between cloud-cost and performance and supports queries, including updates, lookups, range queries, inserts, and read-modify-writes. Cloud cost is manifested in purchasing both storage and processing resources. The improved approach has the ability to self-design and instantiate holistic configurations given a workload, a cloud budget, and optionally performance goals and a set of Service Level Agreement (SLA) specifications. A configuration reflects an optimized storage engine design in terms of, for example, the individual data structures design (in-memory and on-disk) in the engine as well as their algorithms and interactions, a cloud provider, and the exact virtual machines to be used.
    Type: Grant
    Filed: November 10, 2021
    Date of Patent: January 24, 2023
    Assignee: PRESIDENT AND FELLOWS OF HARVARD COLLEGE
    Inventors: Stratos Idreos, Subarna Chatterjee, Meena Jagadeesan, Wilson Qin
  • Patent number: 11397712
    Abstract: Various approaches for accelerating data access to a computer memory and predicate evaluation includes storing, in the computer memory, (i) base data as multiple base columns, (ii) multiple sketched columns each corresponding to a base column in the base data and having smaller code values compared thereto, and (iii) a compression map for mapping one or more base columns to the corresponding sketched column; applying the compression map to a query having a predicate; determining data on the sketched column that satisfies the predicate; and evaluating the predicate based at least in part on the determined data on the sketched column without accessing the base column in the base data.
    Type: Grant
    Filed: April 22, 2019
    Date of Patent: July 26, 2022
    Assignee: PRESIDENT AND FELLOWS OF HARVARD COLLEGE
    Inventors: Stratos Idreos, Brian Hentschel, Michael Kester
  • Patent number: 11392644
    Abstract: Embodiments of the invention utilize an improved LSM-tree-based key-value approach to strike the optimal balance between the costs of updates and lookups with any given main memory budget. The improved approach involves allocating memory to Bloom filters differently across different levels so as to minimize the sum of the false positive rates associated with the Bloom filters. In addition, the improved approach may predict the impact of the system design parameter(s) and/or environmental parameter(s) on the lookup performance. Subsequently, the improved approach may “self-tune” the system design parameter(s) and/or environment parameter(s) to maximize the throughput.
    Type: Grant
    Filed: January 9, 2018
    Date of Patent: July 19, 2022
    Assignee: PRESIDENT AND FELLOWS OF HARVARD COLLEGE
    Inventors: Stratos Idreos, Niv Dayan, Manoussos Gavriil Athanassoulis
  • Patent number: 11372823
    Abstract: Embodiments of the present invention provide a new multi-level data structure, log-structured merge bush (LSM-bush), to alleviate the performance compromise between LSH-table and LSM-tree data structures. Similar to LSM-tree, LSM-bush may buffer writes in memory, merge the writes as sorted runs across multiple levels in storage, and use in-memory fence pointers and Bloom filters to facilitate lookups. LSM-bush differs from LSM-tree in that it allows newer data to be merged more “lazily” than LSM-tree. This can be achieved by allowing larger numbers of runs to be collected at the smaller levels before merging them.
    Type: Grant
    Filed: January 27, 2020
    Date of Patent: June 28, 2022
    Assignee: President and Fellows of Harvard College
    Inventors: Niv Dayan, Stratos Idreos
  • Publication number: 20210365422
    Abstract: Various approaches for accelerating data access to a computer memory and predicate evaluation includes storing, in the computer memory, (i) base data as multiple base columns, (ii) multiple sketched columns each corresponding to a base column in the base data and having smaller code values compared thereto, and (iii) a compression map for mapping one or more base columns to the corresponding sketched column; applying the compression map to a query having a predicate; determining data on the sketched column that satisfies the predicate; and evaluating the predicate based at least in part on the determined data on the sketched column without accessing the base column in the base data.
    Type: Application
    Filed: April 22, 2019
    Publication date: November 25, 2021
    Inventors: Stratos IDREOS, Brian HENTSCHEL, Michael KESTER
  • Publication number: 20210342259
    Abstract: Embodiments of the invention utilize an improved LSM-tree-based key-value approach to strike the optimal balance between the costs of updates and lookups and storage space. The improved approach involves use of a new merge policy that removes merge operations from all but the largest levels of LSM-tree. In addition, the improved approach may include an improved LSM-tree that allows separate control over the frequency of merge operations for the largest level and for all other levels. By adjusting various parameters, such as the storage capacity of the largest level, the storage capacity of the other smaller levels, and/or the size ratio between adjacent levels in the improved LSM-tree, the improved LSM-tree-based key-value approach may maximize throughput for a particular workload.
    Type: Application
    Filed: July 15, 2021
    Publication date: November 4, 2021
    Inventors: Stratos IDREOS, Niv DAYAN
  • Patent number: 11151028
    Abstract: Embodiments of the invention utilize an improved LSM-tree-based key-value approach to strike the optimal balance between the costs of updates and lookups and storage space. The improved approach involves use of a new merge policy that removes merge operations from all but the largest levels of LSM-tree. In addition, the improved approach may include an improved LSM-tree that allows separate control over the frequency of merge operations for the largest level and for all other levels. By adjusting various parameters, such as the storage capacity of the largest level, the storage capacity of the other smaller levels, and/or the size ratio between adjacent levels in the improved LSM-tree, the improved LSM-tree-based key-value approach may maximize throughput for a particular workload.
    Type: Grant
    Filed: January 22, 2019
    Date of Patent: October 19, 2021
    Assignee: PRESIDENT AND FELLOWS OF HARVARD COLLEGE
    Inventors: Stratos Idreos, Niv Dayan
  • Publication number: 20210279237
    Abstract: Embodiments of the invention utilize a “data canopy” that breaks statistical measures down to basic primitives for various data portions and stores the basic aggregates in a library within an in-memory data structure. When a queried statistical measure involves a basic aggregate stored in the library over a data portion that at least partially overlaps the data portion associated with the basic aggregate, the basic aggregate may be reused in the statistical computation of the queried measure.
    Type: Application
    Filed: April 23, 2018
    Publication date: September 9, 2021
    Inventors: Stratos IDREOS, Wasay ABDUL, Niv DAYAN
  • Publication number: 20210097044
    Abstract: Various approaches for determining the operation cost of a computational workload that is executed on a computational apparatus and accesses data stored in a data structure include decomposing the data structure into multiple data layout primitives, each data layout primitive corresponding to a smallest, fundamental layout aspect of the data structure; decomposing the computational workload into multiple data access primitives, each data access primitive corresponding to a computational mechanism for accessing the data stored in the data structure; determining a hardware profile associated with the apparatus; and computing the operation cost of the computational workload on the apparatus based at least in part on the data layout primitives, the data access primitives, and the hardware profile.
    Type: Application
    Filed: April 22, 2019
    Publication date: April 1, 2021
    Inventors: Stratos IDREOS, Kostas ZOUMPATIANOS, Brian HENTSCHEL, Michael KESTER
  • Publication number: 20200341889
    Abstract: Embodiments of the invention utilize an improved LSM-tree-based key-value approach to strike the optimal balance between the costs of updates and lookups and storage space. The improved approach involves use of a new merge policy that removes merge operations from all but the largest levels of LSM-tree. In addition, the improved approach may include an improved LSM-tree that allows separate control over the frequency of merge operations for the largest level and for all other levels. By adjusting various parameters, such as the storage capacity of the largest level, the storage capacity of the other smaller levels, and/or the size ratio between adjacent levels in the improved LSM-tree, the improved LSM-tree-based key-value approach may maximize throughput for a particular workload.
    Type: Application
    Filed: January 22, 2019
    Publication date: October 29, 2020
    Inventors: Stratos IDREOS, Niv DAYAN
  • Publication number: 20200250148
    Abstract: Embodiments of the present invention provide a new multi-level data structure, log-structured merge bush (LSM-bush), to alleviate the performance compromise between LSH-table and LSM-tree data structures. Similar to LSM-tree, LSM-bush may buffer writes in memory, merge the writes as sorted runs across multiple levels in storage, and use in-memory fence pointers and Bloom filters to facilitate lookups. LSM-bush differs from LSM-tree in that it allows newer data to be merged more “lazily” than LSM-tree. This can be achieved by allowing larger numbers of runs to be collected at the smaller levels before merging them.
    Type: Application
    Filed: January 27, 2020
    Publication date: August 6, 2020
    Inventors: Niv DAYAN, Stratos IDREOS
  • Publication number: 20200057782
    Abstract: Embodiments of the invention utilize an improved LSM-tree-based key-value approach to strike the optimal balance between the costs of updates and lookups with any given main memory budget. The improved approach involves allocating memory to Bloom filters differently across different levels so as to minimize the sum of the false positive rates associated with the Bloom filters. In addition, the improved approach may predict the impact of the system design parameter(s) and/or environmental parameter(s) on the lookup performance. Subsequently, the improved approach may “self-tune” the system design parameter(s) and/or environment parameter(s) to maximize the throughput.
    Type: Application
    Filed: January 9, 2018
    Publication date: February 20, 2020
    Inventors: Stratos Idreos, Niv Dayan, Manos Athanassoulis
  • Patent number: 9298754
    Abstract: A database system maintains a feature set of a modern database system while operating directly on raw data files. Systems may use an adaptive indexing mechanism that maintains positional information to provide efficient access to raw data files, a flexible caching structure, and techniques for selective parsing and selective tokenizing. In doing so, possible performance bottlenecks associated with repeated parsing, tokenizing, and expensive data type conversion costs can be overcome.
    Type: Grant
    Filed: November 15, 2012
    Date of Patent: March 29, 2016
    Assignee: Ecole Polytechnique Federale de Lausanne (EPFL) (027559)
    Inventors: Anastasia Ailamaki, Stratos Idreos, Ioannis Alagiannis, Renata Borovica, Miguel Sergio De Oliveira Branco
  • Publication number: 20140136513
    Abstract: A database system maintains a feature set of a modern database system while operating directly on raw data files. Systems may use an adaptive indexing mechanism that maintains positional information to provide efficient access to raw data files, a flexible caching structure, and techniques for selective parsing and selective tokenizing. In doing so, possible performance bottlenecks associated with repeated parsing, tokenizing, and expensive data type conversion costs can be overcome.
    Type: Application
    Filed: November 15, 2012
    Publication date: May 15, 2014
    Applicant: Ecole Polytechnique Fédérale de Lausanne (EPFL)
    Inventors: Anastasia Ailamaki, Stratos Idreos, Ioannis Alagiannis, Renata Borovica, Miguel Sergio De Oliveira Branco