Patents by Inventor Stratos Idreos
Stratos Idreos has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11675694Abstract: Embodiments of the invention utilize an improved LSM-tree-based key-value approach to strike the optimal balance between the costs of updates and lookups and storage space. The improved approach involves use of a new merge policy that removes merge operations from all but the largest levels of LSM-tree. In addition, the improved approach may include an improved LSM-tree that allows separate control over the frequency of merge operations for the largest level and for all other levels. By adjusting various parameters, such as the storage capacity of the largest level, the storage capacity of the other smaller levels, and/or the size ratio between adjacent levels in the improved LSM-tree, the improved LSM-tree-based key-value approach may maximize throughput for a particular workload.Type: GrantFiled: July 15, 2021Date of Patent: June 13, 2023Assignee: President and Fellows of Harvard CollegeInventors: Stratos Idreos, Niv Dayan
-
Patent number: 11567952Abstract: Embodiments of the invention utilize a “data canopy” that breaks statistical measures down to basic primitives for various data portions and stores the basic aggregates in a library within an in-memory data structure. When a queried statistical measure involves a basic aggregate stored in the library over a data portion that at least partially overlaps the data portion associated with the basic aggregate, the basic aggregate may be reused in the statistical computation of the queried measure.Type: GrantFiled: April 23, 2018Date of Patent: January 31, 2023Assignee: PRESIDENT AND FELLOWS OF HARVARD COLLEGEInventors: Stratos Idreos, Wasay Abdul, Niv Dayan
-
Patent number: 11563803Abstract: Embodiments of the invention utilize an optimized key-value storage engine to strike the optimal balance between cloud-cost and performance and supports queries, including updates, lookups, range queries, inserts, and read-modify-writes. Cloud cost is manifested in purchasing both storage and processing resources. The improved approach has the ability to self-design and instantiate holistic configurations given a workload, a cloud budget, and optionally performance goals and a set of Service Level Agreement (SLA) specifications. A configuration reflects an optimized storage engine design in terms of, for example, the individual data structures design (in-memory and on-disk) in the engine as well as their algorithms and interactions, a cloud provider, and the exact virtual machines to be used.Type: GrantFiled: November 10, 2021Date of Patent: January 24, 2023Assignee: PRESIDENT AND FELLOWS OF HARVARD COLLEGEInventors: Stratos Idreos, Subarna Chatterjee, Meena Jagadeesan, Wilson Qin
-
Patent number: 11397712Abstract: Various approaches for accelerating data access to a computer memory and predicate evaluation includes storing, in the computer memory, (i) base data as multiple base columns, (ii) multiple sketched columns each corresponding to a base column in the base data and having smaller code values compared thereto, and (iii) a compression map for mapping one or more base columns to the corresponding sketched column; applying the compression map to a query having a predicate; determining data on the sketched column that satisfies the predicate; and evaluating the predicate based at least in part on the determined data on the sketched column without accessing the base column in the base data.Type: GrantFiled: April 22, 2019Date of Patent: July 26, 2022Assignee: PRESIDENT AND FELLOWS OF HARVARD COLLEGEInventors: Stratos Idreos, Brian Hentschel, Michael Kester
-
Patent number: 11392644Abstract: Embodiments of the invention utilize an improved LSM-tree-based key-value approach to strike the optimal balance between the costs of updates and lookups with any given main memory budget. The improved approach involves allocating memory to Bloom filters differently across different levels so as to minimize the sum of the false positive rates associated with the Bloom filters. In addition, the improved approach may predict the impact of the system design parameter(s) and/or environmental parameter(s) on the lookup performance. Subsequently, the improved approach may “self-tune” the system design parameter(s) and/or environment parameter(s) to maximize the throughput.Type: GrantFiled: January 9, 2018Date of Patent: July 19, 2022Assignee: PRESIDENT AND FELLOWS OF HARVARD COLLEGEInventors: Stratos Idreos, Niv Dayan, Manoussos Gavriil Athanassoulis
-
Patent number: 11372823Abstract: Embodiments of the present invention provide a new multi-level data structure, log-structured merge bush (LSM-bush), to alleviate the performance compromise between LSH-table and LSM-tree data structures. Similar to LSM-tree, LSM-bush may buffer writes in memory, merge the writes as sorted runs across multiple levels in storage, and use in-memory fence pointers and Bloom filters to facilitate lookups. LSM-bush differs from LSM-tree in that it allows newer data to be merged more “lazily” than LSM-tree. This can be achieved by allowing larger numbers of runs to be collected at the smaller levels before merging them.Type: GrantFiled: January 27, 2020Date of Patent: June 28, 2022Assignee: President and Fellows of Harvard CollegeInventors: Niv Dayan, Stratos Idreos
-
Publication number: 20210365422Abstract: Various approaches for accelerating data access to a computer memory and predicate evaluation includes storing, in the computer memory, (i) base data as multiple base columns, (ii) multiple sketched columns each corresponding to a base column in the base data and having smaller code values compared thereto, and (iii) a compression map for mapping one or more base columns to the corresponding sketched column; applying the compression map to a query having a predicate; determining data on the sketched column that satisfies the predicate; and evaluating the predicate based at least in part on the determined data on the sketched column without accessing the base column in the base data.Type: ApplicationFiled: April 22, 2019Publication date: November 25, 2021Inventors: Stratos IDREOS, Brian HENTSCHEL, Michael KESTER
-
Publication number: 20210342259Abstract: Embodiments of the invention utilize an improved LSM-tree-based key-value approach to strike the optimal balance between the costs of updates and lookups and storage space. The improved approach involves use of a new merge policy that removes merge operations from all but the largest levels of LSM-tree. In addition, the improved approach may include an improved LSM-tree that allows separate control over the frequency of merge operations for the largest level and for all other levels. By adjusting various parameters, such as the storage capacity of the largest level, the storage capacity of the other smaller levels, and/or the size ratio between adjacent levels in the improved LSM-tree, the improved LSM-tree-based key-value approach may maximize throughput for a particular workload.Type: ApplicationFiled: July 15, 2021Publication date: November 4, 2021Inventors: Stratos IDREOS, Niv DAYAN
-
Patent number: 11151028Abstract: Embodiments of the invention utilize an improved LSM-tree-based key-value approach to strike the optimal balance between the costs of updates and lookups and storage space. The improved approach involves use of a new merge policy that removes merge operations from all but the largest levels of LSM-tree. In addition, the improved approach may include an improved LSM-tree that allows separate control over the frequency of merge operations for the largest level and for all other levels. By adjusting various parameters, such as the storage capacity of the largest level, the storage capacity of the other smaller levels, and/or the size ratio between adjacent levels in the improved LSM-tree, the improved LSM-tree-based key-value approach may maximize throughput for a particular workload.Type: GrantFiled: January 22, 2019Date of Patent: October 19, 2021Assignee: PRESIDENT AND FELLOWS OF HARVARD COLLEGEInventors: Stratos Idreos, Niv Dayan
-
Publication number: 20210279237Abstract: Embodiments of the invention utilize a “data canopy” that breaks statistical measures down to basic primitives for various data portions and stores the basic aggregates in a library within an in-memory data structure. When a queried statistical measure involves a basic aggregate stored in the library over a data portion that at least partially overlaps the data portion associated with the basic aggregate, the basic aggregate may be reused in the statistical computation of the queried measure.Type: ApplicationFiled: April 23, 2018Publication date: September 9, 2021Inventors: Stratos IDREOS, Wasay ABDUL, Niv DAYAN
-
Publication number: 20210097044Abstract: Various approaches for determining the operation cost of a computational workload that is executed on a computational apparatus and accesses data stored in a data structure include decomposing the data structure into multiple data layout primitives, each data layout primitive corresponding to a smallest, fundamental layout aspect of the data structure; decomposing the computational workload into multiple data access primitives, each data access primitive corresponding to a computational mechanism for accessing the data stored in the data structure; determining a hardware profile associated with the apparatus; and computing the operation cost of the computational workload on the apparatus based at least in part on the data layout primitives, the data access primitives, and the hardware profile.Type: ApplicationFiled: April 22, 2019Publication date: April 1, 2021Inventors: Stratos IDREOS, Kostas ZOUMPATIANOS, Brian HENTSCHEL, Michael KESTER
-
Publication number: 20200341889Abstract: Embodiments of the invention utilize an improved LSM-tree-based key-value approach to strike the optimal balance between the costs of updates and lookups and storage space. The improved approach involves use of a new merge policy that removes merge operations from all but the largest levels of LSM-tree. In addition, the improved approach may include an improved LSM-tree that allows separate control over the frequency of merge operations for the largest level and for all other levels. By adjusting various parameters, such as the storage capacity of the largest level, the storage capacity of the other smaller levels, and/or the size ratio between adjacent levels in the improved LSM-tree, the improved LSM-tree-based key-value approach may maximize throughput for a particular workload.Type: ApplicationFiled: January 22, 2019Publication date: October 29, 2020Inventors: Stratos IDREOS, Niv DAYAN
-
Publication number: 20200250148Abstract: Embodiments of the present invention provide a new multi-level data structure, log-structured merge bush (LSM-bush), to alleviate the performance compromise between LSH-table and LSM-tree data structures. Similar to LSM-tree, LSM-bush may buffer writes in memory, merge the writes as sorted runs across multiple levels in storage, and use in-memory fence pointers and Bloom filters to facilitate lookups. LSM-bush differs from LSM-tree in that it allows newer data to be merged more “lazily” than LSM-tree. This can be achieved by allowing larger numbers of runs to be collected at the smaller levels before merging them.Type: ApplicationFiled: January 27, 2020Publication date: August 6, 2020Inventors: Niv DAYAN, Stratos IDREOS
-
Publication number: 20200057782Abstract: Embodiments of the invention utilize an improved LSM-tree-based key-value approach to strike the optimal balance between the costs of updates and lookups with any given main memory budget. The improved approach involves allocating memory to Bloom filters differently across different levels so as to minimize the sum of the false positive rates associated with the Bloom filters. In addition, the improved approach may predict the impact of the system design parameter(s) and/or environmental parameter(s) on the lookup performance. Subsequently, the improved approach may “self-tune” the system design parameter(s) and/or environment parameter(s) to maximize the throughput.Type: ApplicationFiled: January 9, 2018Publication date: February 20, 2020Inventors: Stratos Idreos, Niv Dayan, Manos Athanassoulis
-
Patent number: 9298754Abstract: A database system maintains a feature set of a modern database system while operating directly on raw data files. Systems may use an adaptive indexing mechanism that maintains positional information to provide efficient access to raw data files, a flexible caching structure, and techniques for selective parsing and selective tokenizing. In doing so, possible performance bottlenecks associated with repeated parsing, tokenizing, and expensive data type conversion costs can be overcome.Type: GrantFiled: November 15, 2012Date of Patent: March 29, 2016Assignee: Ecole Polytechnique Federale de Lausanne (EPFL) (027559)Inventors: Anastasia Ailamaki, Stratos Idreos, Ioannis Alagiannis, Renata Borovica, Miguel Sergio De Oliveira Branco
-
Publication number: 20140136513Abstract: A database system maintains a feature set of a modern database system while operating directly on raw data files. Systems may use an adaptive indexing mechanism that maintains positional information to provide efficient access to raw data files, a flexible caching structure, and techniques for selective parsing and selective tokenizing. In doing so, possible performance bottlenecks associated with repeated parsing, tokenizing, and expensive data type conversion costs can be overcome.Type: ApplicationFiled: November 15, 2012Publication date: May 15, 2014Applicant: Ecole Polytechnique Fédérale de Lausanne (EPFL)Inventors: Anastasia Ailamaki, Stratos Idreos, Ioannis Alagiannis, Renata Borovica, Miguel Sergio De Oliveira Branco