Patents by Inventor Alexandros Batsakis

Alexandros Batsakis has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20230147068
    Abstract: Systems and methods are described for establishing and managing components of a distributed computing framework implemented in a data intake and query system. The distributed computing framework may include a master and a plurality of worker nodes. The master may selectively operate on a search head captain that is chosen from the search heads of the data intake and query system. The search head captain may distribute configuration information for the master and the distributed computing framework to the other search heads, which in turn, may distribute that configuration information to indexers of the data intake and query system. Worker nodes may be selectively activated for operation on the indexers based on the configuration information, and the worker nodes may additionally use the configuration information to contact the master and join the distributed computing framework.
    Type: Application
    Filed: October 31, 2022
    Publication date: May 11, 2023
    Inventors: Balaji Rao, Jindrich Dinga, Kieran Cairney, Manuel Martinez, Nitilaksha Halakatti, Ningxuan He, Arindam Bhattacharjee, Sourav Pal, Alexandros Batsakis
  • Patent number: 11645210
    Abstract: Embodiments are disclosed for performing cache aware searching. In response to a search query, a first bucket and a second bucket in remote storage for processing the search query. A determination is made that a first file in the first bucket is present in a cache when the search query is received. In response to the search query, a search is performed using the first file based on the determination that the first file is present in the cache when the search query is received, and the search is performed using a second file from the second bucket once the second file is stored in the cache.
    Type: Grant
    Filed: February 25, 2022
    Date of Patent: May 9, 2023
    Assignee: Splunk Inc.
    Inventors: Ledion Bitincka, Alexandros Batsakis, Paul J. Lucas, Nicholas Robert Romito
  • Patent number: 11630695
    Abstract: Dynamic reassignment of search processes into workload pools includes receiving a search query to search at least one data store, assigning the search query to a first workload pool, and executing the search query using a first hardware resource in the first workload pool, the first hardware resource corresponding to a first portion of a hardware device. Dynamic reassignment further includes receiving, while executing the search query, an update command to move the search query to a second workload pool, moving, while executing the search query, the search query to the second workload pool; and continuing execution of the search query using a second hardware resource in the second workload pool. The second hardware resource corresponds to a second portion of the hardware device.
    Type: Grant
    Filed: January 29, 2021
    Date of Patent: April 18, 2023
    Assignee: Splunk Inc.
    Inventors: Bharath Kishore Reddy Aleti, Alexandros Batsakis, Mitchell Neuman Blank, Rama Gopalan, Hongxun Liu, Anish Shrigondekar
  • Patent number: 11620288
    Abstract: Systems and methods are disclosed for mapping search nodes to a search head in a data intake and query system based on a tenant identifier in order to execute a query received by the data intake and query system. The mapping may allow same or similar search nodes to be used to execute queries that are associated with a particular tenant identifier, in order to take advantage of caching and local data stored with those search nodes. In some cases, search nodes can be mapped based on the tenant identifier using a hashing algorithm, such as a consistent hashing algorithm.
    Type: Grant
    Filed: February 25, 2022
    Date of Patent: April 4, 2023
    Assignee: Splunk Inc.
    Inventors: Alexandros Batsakis, Scott Calvert, Alexander Douglas James, Bei Li, Ashish Mathew, James Monschke, Sogol Moshtaghi, Christopher Madden Pride, Xiaowei Wang
  • Patent number: 11620336
    Abstract: Systems and methods are disclosed for processing and executing queries in a data intake and query system. The data intake and query system receives raw machine data at an indexing system, and stores at least a portion of the raw machine data in buckets. Based on a determination that the size of multiple buckets satisfies a threshold size, the data intake and query system converts the buckets to non-editable buckets and stores the data in a remote shared storage system.
    Type: Grant
    Filed: April 30, 2018
    Date of Patent: April 4, 2023
    Assignee: Splunk Inc.
    Inventors: Alexandros Batsakis, Sourav Pal, Sai Krishna Sajja, Igor Stojanovski, Ledion Bitincka, John Nguyen
  • Patent number: 11609913
    Abstract: A data intake and query system can manage the search of large amounts of data using one or more processing nodes. When a new processing node is added or becomes available, the node coordinator can reassign duties from one or more processing nodes to the new processing node. The node coordinator can initially assign the new processing node one or more groups of data for backup purposes. At a later time, the node coordinator can reassign the new processing node to the one or more groups of data for searching purposes.
    Type: Grant
    Filed: January 29, 2021
    Date of Patent: March 21, 2023
    Assignee: Splunk Inc.
    Inventors: Tameem Anwar, Alexandros Batsakis, Tianyi Gou, Mehul Goyal, Ashish Mathew, Douglas Rapp, Sai Krishna Sajja, Anish Shrigondekar, Igor Stojanovski, Eric Woo, Zhenghui Xie, Ruochen Zhang, Sophia Rui Zhu
  • Patent number: 11567993
    Abstract: Systems and methods are disclosed for processing and executing queries in a data intake and query system. The data intake and query system receives a query identifying a set of data to be processed and a manner of processing the set of data. The data intake and query system identifies buckets that are to be searched and stores a copy of buckets in memory associated with one or more search nodes. A search node performs a search on buckets residing in its memory.
    Type: Grant
    Filed: April 30, 2018
    Date of Patent: January 31, 2023
    Assignee: Splunk Inc.
    Inventors: Alexandros Batsakis, Ashish Mathew, Christopher Madden Pride, Bharath Kishore Reddy Aleti, Sourav Pal, Arindam Bhattacharjee, James Monschke
  • Patent number: 11562023
    Abstract: Systems and methods are disclosed for processing and executing queries in a data intake and query system. An indexing system of the data intake and query system receives data and stores at least a portion of it in buckets, which are then stored in a shared storage system. The indexing system merges multiple buckets to generate merged buckets and uploads the merged buckets to the shared storage system.
    Type: Grant
    Filed: April 30, 2018
    Date of Patent: January 24, 2023
    Assignee: Splunk Inc.
    Inventors: Alexandros Batsakis, Sourav Pal, Sai Krishna Sajja, Hasan Alayli, Vishal Patel, Igor Stojanovski, Eric Woo, Steve Wong, Tameem Anwar
  • Patent number: 11550847
    Abstract: Systems and methods are disclosed for processing and executing queries in a data intake and query system. The data intake and query system receives a query identifying a set of data to be processed and a manner of processing the set of data. The data intake and query system identifies buckets that are to be searched. The data intake and query system performs a hash on bucket identifiers of the identified buckets to identify search nodes to search the buckets.
    Type: Grant
    Filed: April 30, 2018
    Date of Patent: January 10, 2023
    Assignee: Splunk Inc.
    Inventors: Alexandros Batsakis, Ashish Mathew, Christopher Madden Pride, Bharath Kishore Reddy Aleti, Sourav Pal, Arindam Bhattacharjee, James Monschke
  • Publication number: 20220382755
    Abstract: Systems and methods are disclosed for mapping search nodes to a search head in a data intake and query system based on a tenant identifier in order to execute a query received by the data intake and query system. The mapping may allow same or similar search nodes to be used to execute queries that are associated with a particular tenant identifier, in order to take advantage of caching and local data stored with those search nodes. In some cases, search nodes can be mapped based on the tenant identifier using a hashing algorithm, such as a consistent hashing algorithm.
    Type: Application
    Filed: February 25, 2022
    Publication date: December 1, 2022
    Inventors: Alexandros Batsakis, Scott Calvert, Alexander Douglas James, Bei Li, Ashish Mathew, James Monschke, Sogol Moshtaghi, Christopher Madden Pride, Xiaowei Wang
  • Patent number: 11500783
    Abstract: Systems and methods are disclosed for making space available in a local storage of a data intake and query system. A cache manager of the data intake and query system may determine an amount of storage space of a local data store that is available for use to perform a query. The cache manager may then use one or more eviction policies associated with content stored at the local data store to purge content items to evict from the local storage. The system may then retrieve content for performing the query from a remote storage and store the retrieved content at the local storage.
    Type: Grant
    Filed: July 21, 2021
    Date of Patent: November 15, 2022
    Assignee: Splunk Inc.
    Inventors: Bharath Aleti, Alexandros Batsakis, Paul J. Lucas, Igor Stojanovski
  • Patent number: 11494380
    Abstract: Systems and methods are described for establishing and managing components of a distributed computing framework implemented in a data intake and query system. The distributed computing framework may include a master and a plurality of worker nodes. The master may selectively operate on a search head captain that is chosen from the search heads of the data intake and query system. The search head captain may distribute configuration information for the master and the distributed computing framework to the other search heads, which in turn, may distribute that configuration information to indexers of the data intake and query system. Worker nodes may be selectively activated for operation on the indexers based on the configuration information, and the worker nodes may additionally use the configuration information to contact the master and join the distributed computing framework.
    Type: Grant
    Filed: October 18, 2019
    Date of Patent: November 8, 2022
    Assignee: Splunk Inc.
    Inventors: Balaji Rao, Jindrich Dinga, Kieran Cairney, Manuel Martinez, Nitilaksha Halakatti, Ningxuan He, Arindam Bhattacharjee, Sourav Pal, Alexandros Batsakis
  • Publication number: 20220292021
    Abstract: Embodiments are disclosed for performing cache aware searching. In response to a search query, a first bucket and a second bucket in remote storage for processing the search query. A determination is made that a first file in the first bucket is present in a cache when the search query is received. In response to the search query, a search is performed using the first file based on the determination that the first file is present in the cache when the search query is received, and the search is performed using a second file from the second bucket once the second file is stored in the cache.
    Type: Application
    Filed: February 25, 2022
    Publication date: September 15, 2022
    Inventors: Ledion Bitincka, Alexandros Batsakis, Paul J. Lucas, Nicholas Robert Romito
  • Patent number: 11436116
    Abstract: Systems and methods are described for improving data availability and/or resiliency of indexers of a data intake and query system. A data intake and query system can index large amounts of data using one or more indexers. An indexer can store a copy of the data that the indexer is assigned to process in the shared storage system, and a cluster master can track the storage of the data and the indexer assigned to process the data. In the event an indexer fails or is otherwise unable to index data that it has been assigned to index, the cluster master can assign one or more second indexers to process the data. The second indexer can download the data from the shared storage system.
    Type: Grant
    Filed: January 31, 2020
    Date of Patent: September 6, 2022
    Assignee: Splunk Inc.
    Inventors: Alexandros Batsakis, Mehul Goyal, Ashish Mathew, Douglas Rapp, Igor Stojanovski, Eric Woo
  • Publication number: 20220269727
    Abstract: Systems and methods are disclosed for processing and executing queries in a data intake and query system. The data intake and query system receives raw machine data at an indexing system, and stores at least a portion of the raw machine data in buckets using containerized indexing nodes instantiated in a containerized environment. The data intake and query system stores the buckets in a shared storage system.
    Type: Application
    Filed: January 3, 2022
    Publication date: August 25, 2022
    Inventors: Alexandros Batsakis, Sourav Pal, Sai Krishna Sajja, Igor Stojanovski, Tameem Anwar, Paul J. Lucas, Eric Woo, Steve Wong
  • Publication number: 20220261385
    Abstract: Systems and methods are disclosed for scalable bucket merging in a data intake and query system. Various components of a bucket manager can be used to monitor recently-created buckets of data in common storage that are associated with a particular tenant and a particular index, apply a comprehensive bucket merge policy to determine groups of buckets that qualify for merging, merge those group of buckets into merged buckets to be stored in the common storage, and update any information associated with the merged buckets and pre-merged buckets. These components may be shared across multiple tenants, and some of these components may be dynamically scalable based on need. This approach may also provide many additional benefits, including improved search performance from merged buckets, efficient resource utilization associated with discriminate merging, and redundancy in case of component failure.
    Type: Application
    Filed: April 29, 2022
    Publication date: August 18, 2022
    Inventors: Tameem Anwar, Tianyi Gou, Alexandros Batsakis, Abhinav Prasad Nekkanti, Sai Krishna Sajja, Jiahan Wang
  • Patent number: 11416465
    Abstract: Systems and methods are described for processing incoming data. The system can receive, from a first partition manager of a data intake and query system, first data that is associated with a first identifier, and can receive, from a second partition manager of the data intake and query system, second data that is associated with a second identifier. The system can process the first data and store first results of said processing the first data in one or more first buckets associated with the first tenant identifier. The system can process the second data and store second results of said processing the second data in one or more second buckets associated with the second tenant identifier.
    Type: Grant
    Filed: July 16, 2019
    Date of Patent: August 16, 2022
    Assignee: Splunk Inc.
    Inventors: Tameem Anwar, Alexandros Batsakis, Sai Krishna Sajja, Igor Stojanovski, Eric Woo
  • Publication number: 20220245093
    Abstract: Embodiments described herein facilitate enhancement of data model acceleration, including generating data model summaries and performing searches in an accelerated manner. In one implementation, obtaining a search query from a user device. A determination may be made to execute a search, in association with the search query, via an external computing service. As such, the search query, or a variant thereof, can be provided to the external computing service, wherein the external computing service executes the search using data model summaries stored in a remote data store that is separate from a set of events from which the data model summaries were generated. A set of search results are received from the external computing service, and such search results are provided to the user device.
    Type: Application
    Filed: January 29, 2021
    Publication date: August 4, 2022
    Inventors: Alexandros Batsakis, Ankit Jain, Manu Jose, Jonah Pan, Hailun Yan
  • Publication number: 20220245091
    Abstract: Embodiments described herein facilitate enhancement of data model acceleration, including generating data model summaries and performing searches in an accelerated manner. In one implementation, a set of events are indexed, each of the events having a corresponding index time representing a time at which the event was indexed in an indexer. Index time parameters including an index earliest time indicating a first index time at which to begin generating a data model summary and an index latest time indicating a second index time at which to complete generating the data model summary are obtained. Thereafter, a data model summary is generated. Such a data model summary summarizes events having corresponding index times between the index earliest time and the index latest time. The data model summary is provided to a remote data store that is separate from the indexer at which at least a portion of the events were indexed.
    Type: Application
    Filed: January 29, 2021
    Publication date: August 4, 2022
    Inventors: Alexandros Batsakis, Ankit Jain, Manu Jose, Jonah Pan, Hailun Yan
  • Patent number: 11385936
    Abstract: Achieving search and ingest isolation via resource management in a search and indexing system includes receiving a search query associated with at least one data store, assigning, in response to the search query being associated with the at least one data store, the search query to a first workload pool in a set of query workload pools, and processing the search query using a first hardware resource in the first workload pool. Achieving search and ingest isolation further includes receiving an ingest request comprising data associated with the at least one data store. The ingest request is assigned to a second workload pool in a set of ingest workload pools. The set of query workload pools and the set of ingest workload pools are disjoint. Achieving search and ingest isolation further includes processing the ingest request using a second hardware resource in the second workload pool.
    Type: Grant
    Filed: September 28, 2018
    Date of Patent: July 12, 2022
    Assignee: Splunk Inc.
    Inventors: Bharath Kishore Reddy Aleti, Alexandros Batsakis, Mitchell Neuman Blank, Rama Gopalan, Hongxun Liu, Anish Shrigondekar