Patents by Inventor Mustafa Ozan OZEN

Mustafa Ozan OZEN has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11934409
    Abstract: Methods, systems, and computer-readable media for continuous functions in a time-series database are disclosed. A plurality of data points of a time series are stored into one or more storage tiers of a time-series database. The plurality of data points comprise a plurality of discrete measurements at respective timestamps. Using one or more query processors of the time-series database, a query of the time series is initiated. The query indicates a time range. Using the one or more query processors, a continuous function is determined that represents a segment of the time series in the time range. The continuous function is determined based at least in part on the plurality of data points. An operation is performed using the continuous function as an input.
    Type: Grant
    Filed: November 23, 2018
    Date of Patent: March 19, 2024
    Assignee: Amazon Technologies, Inc.
    Inventors: Lonnie J. Princehouse, Timothy A. Rath, Gaurav Gupta, Mustafa Ozan Ozen, Omer Ahmed Zaki, Karthik Gurumoorthy Subramanya Bharathy, Gaurav Saxena
  • Patent number: 11609933
    Abstract: Atomic partition scheme updates to partition items may be implemented by a time series database. A time threshold may be assigned to partition scheme update so that the time threshold may be applied across a set of ingestion nodes that may apply the partition scheme update the same. A request to store an item with a timestamp less than the time threshold may be stored in one partition of the time series database, while the item may be stored in a different partition of the time series database if the item has timestamp greater than or equal to the time threshold.
    Type: Grant
    Filed: July 18, 2018
    Date of Patent: March 21, 2023
    Assignee: Amazon Technologies, Inc.
    Inventors: Mustafa Ozan Ozen, Sandeep Bhatia, Lonnie J. Princehouse, Timothy A. Rath, Gaurav Saxena
  • Patent number: 11513854
    Abstract: Methods, systems, and computer-readable media for resource usage restrictions in a time-series database are disclosed. Elements of a plurality of time series are stored into one or more storage tiers of a time-series database. The time series are associated with a plurality of clients of the time-series database. Execution of tasks is initiated using one or more resources of one or more hosts. The time-series elements represent inputs to the tasks. The tasks comprise a first task and a second task. A usage of the one or more resources by the first task is determined to violate one or more resource usage restrictions. Based at least in part on the usage, one or more actions are performed to modify the execution of the first task. The one or more actions increase an amount of the one or more resources available to the second task.
    Type: Grant
    Filed: June 26, 2019
    Date of Patent: November 29, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Gaurav Saxena, Mustafa Ozan Ozen
  • Patent number: 11397752
    Abstract: Techniques for -memory ingestion for highly available distributed time-series databases are described. A method of in-memory ingestion may include obtaining, by a host of a time series database, time series data from one or more electronic devices, the time series database including a plurality of portions of the time series database spread across a plurality of hosts, the plurality of portions of the time series database including at least one hot portion and a plurality of cold portions, storing the time series data in a volatile storage location associated with the hot portion on the host, detecting an event to store the time series data associated with the hot portion to a non-volatile storage location on the host, and storing the time series data associated with the hot portion to the non-volatile storage location.
    Type: Grant
    Filed: June 27, 2019
    Date of Patent: July 26, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Dumanshu Goyal, Mustafa Ozan Ozen
  • Publication number: 20220171792
    Abstract: Methods, systems, and computer-readable media for ingestion partition auto-scaling in a time-series database are disclosed. A first set of one or more hosts divides elements of time-series data into a plurality of partitions. A second set of one or more hosts stores the elements of time-series data from the plurality of partitions into one or more storage tiers of a time-series database. An analyzer receives first data indicative of the resource usage of the time-series data at the first set of one or more hosts. The analyzer receives second data indicative of the resource usage of the time-series data at the second set of one or more hosts. Based at least in part on analysis of the first data and the second data, the analyzer initiates a split of an individual one of the partitions into two or more partitions.
    Type: Application
    Filed: February 18, 2022
    Publication date: June 2, 2022
    Applicant: Amazon Technologies, Inc.
    Inventors: Gaurav Saxena, Mustafa Ozan Ozen, Dumanshu Goyal, Gaurav Gupta, Sen Yue, Nabanita Maji
  • Patent number: 11263184
    Abstract: Methods, systems, and computer-readable media for partition splitting in a distributed database are disclosed. A partition of data is split into a first sub-partition and a second sub-partition. A first portion of the data is assigned to the first sub-partition, and a second portion of the data is assigned to the second sub-partition. One or more elements of the first portion of the data from the partition and an additional one or more elements of the first portion of the data from the first sub-partition are stored into a first node. One or more elements of the second portion of the data from the partition and an additional one or more elements of the second portion of the data from the second sub-partition are stored into a second node. The partition is prevented from receiving new data after the partition is split into the first and second sub-partitions.
    Type: Grant
    Filed: August 29, 2018
    Date of Patent: March 1, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Mustafa Ozan Ozen, Dumanshu Goyal, Lonnie J. Princehouse, Gaurav Saxena, Atilim Cetin, Gaurav Gupta, Sandeep Bhatia, Nilesh Shahdadpuri, Timothy A. Rath, Eric Coll, Nirmesh Khandelwal
  • Patent number: 11256719
    Abstract: Methods, systems, and computer-readable media for ingestion partition auto-scaling in a time-series database are disclosed. A first set of one or more hosts divides elements of time-series data into a plurality of partitions. A second set of one or more hosts stores the elements of time-series data from the plurality of partitions into one or more storage tiers of a time-series database. An analyzer receives first data indicative of the resource usage of the time-series data at the first set of one or more hosts. The analyzer receives second data indicative of the resource usage of the time-series data at the second set of one or more hosts. Based at least in part on analysis of the first data and the second data, the analyzer initiates a split of an individual one of the partitions into two or more partitions.
    Type: Grant
    Filed: June 27, 2019
    Date of Patent: February 22, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Gaurav Saxena, Mustafa Ozan Ozen, Dumanshu Goyal, Gaurav Gupta, Sen Yue, Nabanita Maji
  • Patent number: 11120052
    Abstract: Techniques are described for clustering data at the point of ingestion for storage using scalable storage resources. To cluster data at the point of ingestion, a data ingestion and query service uses a multilevel hash tree (MLHT)-based index to map a hierarchy of attribute values associated with each data element onto a point of a MLHT (which itself conceptually maps onto a continuous range of values). The total range of the MLHT is divided into one or more data partitions, each of which is mapped to one or more physical storage resources. A mapping algorithm uses the hierarchy of attribute fields to calculate the position of each data element ingested and, consequently, a physical storage resource at which to store the data element.
    Type: Grant
    Filed: June 28, 2018
    Date of Patent: September 14, 2021
    Assignee: Amazon Technologies, Inc.
    Inventors: Mustafa Ozan Ozen, Sandeep Bhatia, Atilim Cetin, Lonnie J. Princehouse, Timothy Andrew Rath, Gaurav Saxena
  • Patent number: 10884644
    Abstract: Techniques are described for clustering data at the point of ingestion for storage using scalable storage resources. The clustering techniques described herein are used to cluster time series data in a manner such that data that is likely to be queried together is localized to a same partition, or to a minimal set of partitions if the data set is large, where the partitions are mapped to physical storage resources where the data is to be stored for subsequent processing. Among other benefits, the clustered storage of the data at the physical storage resources can reduce an amount of data that needs to be filtered by many types of queries, thereby improving the performance of any applications or processes that rely on querying the data.
    Type: Grant
    Filed: June 28, 2018
    Date of Patent: January 5, 2021
    Assignee: Amazon Technologies, Inc.
    Inventors: Timothy Andrew Rath, Mustafa Ozan Ozen
  • Patent number: 10853373
    Abstract: A data storage and retrieval system receives data points for a time series. The data storage and retrieval system stores the data points for a first portion of the time series using a first data format. Based at least in part on an analysis of queries performed on the first portion of the time series, the data storage and retrieval system determines to store a second portion of the time series using a second data format. The data storage and retrieval system stores subsequently received data points of the time series in the second data format.
    Type: Grant
    Filed: May 15, 2018
    Date of Patent: December 1, 2020
    Assignee: Amazon Technologies, Inc.
    Inventors: Sandeep Bhatia, Timothy Andrew Rath, Mustafa Ozan Ozen, Atilim Cetin, Gaurav Gupta
  • Publication number: 20200167355
    Abstract: Methods, systems, and computer-readable media for edge processing in a distributed time-series database are disclosed. A first set of time-series data is generated by one or more client devices and is associated with one or more time series. A local time-series database stores the first set of time-series data into a local storage tier. The local time-series database generates a second set of time-series data derived from the first set of time-series data. A remote time-series database receives the second set of time-series data from the local time-series database via a network. The remote time-series database stores the second set of time-series data into one or more remote storage tiers.
    Type: Application
    Filed: November 23, 2018
    Publication date: May 28, 2020
    Applicant: Amazon Technologies, Inc.
    Inventors: Timothy A. Rath, Gaurav Gupta, Mustafa Ozan Ozen, Omer Ahmed Zaki
  • Publication number: 20200167360
    Abstract: Methods, systems, and computer-readable media for a scalable architecture for a distributed time-series database are disclosed. Using a fleet of ingestion routers, time-series data generated by a plurality of client devices is stored into a plurality of durable partitions. The time-series data comprises a plurality of time series, and an amount of the ingestion routers is determined based at least in part on an ingestion rate of the time-series data. Using a fleet of stream processors, the time-series data from the durable partitions is stored into a plurality of storage tiers including a first storage tier and a second storage tier. A retention period for the first storage tier differs from a retention period for the second storage tier. An amount of the stream processors is determined based at least in part on the time-series data in the durable partitions.
    Type: Application
    Filed: November 23, 2018
    Publication date: May 28, 2020
    Applicant: Amazon Technologies, Inc.
    Inventors: Timothy A. Rath, Gaurav Gupta, Mustafa Ozan Ozen, Omer Ahmed Zaki
  • Publication number: 20200167361
    Abstract: Methods, systems, and computer-readable media for continuous functions in a time-series database are disclosed. A plurality of data points of a time series are stored into one or more storage tiers of a time-series database. The plurality of data points comprise a plurality of discrete measurements at respective timestamps. Using one or more query processors of the time-series database, a query of the time series is initiated. The query indicates a time range. Using the one or more query processors, a continuous function is determined that represents a segment of the time series in the time range. The continuous function is determined based at least in part on the plurality of data points. An operation is performed using the continuous function as an input.
    Type: Application
    Filed: November 23, 2018
    Publication date: May 28, 2020
    Applicant: Amazon Technologies, Inc.
    Inventors: Lonnie J. Princehouse, Timothy A. Rath, Gaurav Gupta, Mustafa Ozan Ozen, Omer Ahmed Zaki, Karthik Gurumoorthy Subramanya Bharathy, Gaurav Saxena
  • Publication number: 20200004449
    Abstract: Techniques are described for clustering data at the point of ingestion for storage using scalable storage resources. The clustering techniques described herein are used to cluster time series data in a manner such that data that is likely to be queried together is localized to a same partition, or to a minimal set of partitions if the data set is large, where the partitions are mapped to physical storage resources where the data is to be stored for subsequent processing. Among other benefits, the clustered storage of the data at the physical storage resources can reduce an amount of data that needs to be filtered by many types of queries, thereby improving the performance of any applications or processes that rely on querying the data.
    Type: Application
    Filed: June 28, 2018
    Publication date: January 2, 2020
    Inventors: Timothy Andrew RATH, Mustafa Ozan OZEN