Patents by Inventor Alexandros Batsakis
Alexandros Batsakis has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20250117373Abstract: According to an aspect, a search system includes an indexing tier and a search tier. The indexing tier is configured to generate an index structure from a plurality of documents, store the index structure in a local storage device, and transmit the index structure to an external object store, the external object store configured to store the index structure. The search tier is configured to receive a search query from a client device and communicate with the external object store or the indexing tier to retrieve at least a portion of the index structure to search data based on the search query.Type: ApplicationFiled: October 4, 2024Publication date: April 10, 2025Inventors: Henning Andersen, Alexandros Batsakis, Adrien Grand, David Christopher Turner, Jason Tedor, Tanguy Leroux, Timothy Regan Brooks
-
Publication number: 20250103604Abstract: A computing device can receive a query that identifies a set of data to be processed and determine that a portion of the set of data resides in an external data system. The query system can request data identifiers associated with data objects of the set of data from the external data system and communicate the data identifiers to a data queue. The computing device can instruct one or more search nodes to retrieve the identifiers from the data queue. The search nodes can use the data identifiers to retrieve data objects from the external data system and process the data objects according to instructions received from the computing device. The search nodes can provide results of the processing to the computing device.Type: ApplicationFiled: June 20, 2024Publication date: March 27, 2025Inventors: Alexandros Batsakis, Nitilaksha Satyaveera Halakatti, Ningxuan He, Prem Kumar Jayaraj, Manuel Gregorio Martinez, Balaji Rao, Jianming Zhang, Steve Yu Zhang
-
Publication number: 20240386053Abstract: Systems and methods are disclosed for processing and executing queries in a data intake and query system. The data intake and query system receives raw machine data at an indexing system, and stores at least a portion of the raw machine data in buckets using containerized indexing nodes instantiated in a containerized environment. The data intake and query system stores the buckets in a shared storage system.Type: ApplicationFiled: May 10, 2024Publication date: November 21, 2024Inventors: Alexandros Batsakis, Ashish Mathew, Christopher Madden Pride, Bharath Kishore Reddy Aleti, Sourav Pal, Arindam Bhattacharjee, James Monschke, Sai Krishna Sajja, Igor Stojanovski, Tameem Anwar, Paul J. Lucas, Eric Woo, Steve Wong
-
Patent number: 12105632Abstract: Embodiments are disclosed for performing cache aware searching. In response to a search query, a first bucket and a second bucket in remote storage for processing the search query. A determination is made that a first file in the first bucket is present in a cache when the search query is received. In response to the search query, a search is performed using the first file based on the determination that the first file is present in the cache when the search query is received, and the search is performed using a second file from the second bucket once the second file is stored in the cache.Type: GrantFiled: May 5, 2023Date of Patent: October 1, 2024Assignee: Splunk Inc.Inventors: Ledion Bitincka, Alexandros Batsakis, Paul J. Lucas, Nicholas Robert Romito
-
Patent number: 12093272Abstract: A computing device can receive a query that identifies a set of data to be processed and determine that a portion of the set of data resides in an external data system. The query system can request data identifiers associated with data objects of the set of data from the external data system and communicate the data identifiers to a data queue. The computing device can instruct one or more search nodes to retrieve the identifiers from the data queue. The search nodes can use the data identifiers to retrieve data objects from the external data system and process the data objects according to instructions received from the computing device. The search nodes can provide results of the processing to the computing device.Type: GrantFiled: April 29, 2022Date of Patent: September 17, 2024Assignee: Splunk Inc.Inventors: Alexandros Batsakis, Nitilaksha Satyaveera Halakatti, Ningxuan He, Prem Kumar Jayaraj, Manuel Gregorio Martinez, Balaji Rao, Jianming Zhang, Steve Yu Zhang
-
Patent number: 12072939Abstract: A data intake and query system can generate local data enrichment objects and receive federated data enrichment objects from another data intake and query system. In response to receiving a query, the data intake and query system can determine whether the query is subquery of a federated query. If the query is a subquery, the data intake and query system can use the federated data enrichment objects to execute the query.Type: GrantFiled: January 31, 2022Date of Patent: August 27, 2024Assignee: Splunk Inc.Inventors: Alexandros Batsakis, Nir Frenkel, Nitilaksha Halakatti, Balaji Rao, Anish Shrigondekar, Ruochen Zhang, Steve Yu Zhang
-
Patent number: 12019634Abstract: A data intake and query system can manage the search of large amounts of data using one or more processing nodes. When a new processing node is added or becomes available, the node coordinator can reassign duties from one or more processing nodes to the new processing node. The node coordinator can initially assign the new processing node one or more groups of data for backup purposes. At a later time, the node coordinator can reassign the new processing node to the one or more groups of data for searching purposes.Type: GrantFiled: March 20, 2023Date of Patent: June 25, 2024Assignee: Splunk Inc.Inventors: Tameem Anwar, Alexandros Batsakis, Tianyi Gou, Mehul Goyal, Ashish Mathew, Douglas Rapp, Sai Krishna Sajja, Anish Shrigondekar, Igor Stojanovski, Eric Woo, Zhenghui Xie, Ruochen Zhang, Sophia Rui Zhu
-
Patent number: 12013895Abstract: Systems and methods are disclosed for processing and executing queries in a data intake and query system. The data intake and query system receives raw machine data at an indexing system, and stores at least a portion of the raw machine data in buckets using containerized indexing nodes instantiated in a containerized environment. The data intake and query system stores the buckets in a shared storage system.Type: GrantFiled: June 2, 2023Date of Patent: June 18, 2024Assignee: Splunk Inc.Inventors: Alexandros Batsakis, Ashish Mathew, Christopher Madden Pride, Bharath Kishore Reddy Aleti, Sourav Pal, Arindam Bhattacharjee, James Monschke
-
Patent number: 12007996Abstract: Systems and methods are described for establishing and managing components of a distributed computing framework implemented in a data intake and query system. The distributed computing framework may include a master and a plurality of worker nodes. The master may selectively operate on a search head captain that is chosen from the search heads of the data intake and query system. The search head captain may distribute configuration information for the master and the distributed computing framework to the other search heads, which in turn, may distribute that configuration information to indexers of the data intake and query system. Worker nodes may be selectively activated for operation on the indexers based on the configuration information, and the worker nodes may additionally use the configuration information to contact the master and join the distributed computing framework.Type: GrantFiled: October 31, 2022Date of Patent: June 11, 2024Assignee: Splunk Inc.Inventors: Balaji Rao, Jindrich Dinga, Kieran Cairney, Manuel Martinez, Nitilaksha Halakatti, Ningxuan He, Arindam Bhattacharjee, Sourav Pal, Alexandros Batsakis
-
Patent number: 11892996Abstract: Systems and methods are described for monitoring indexing nodes, populating and maintaining a resource catalog with relevant information, receiving requests for indexing node availability or assignments, identifying indexing nodes that are available to process data, and/or communicating information relating to available indexing nodes. The system can maintain the resource catalog based on communications with each of the containerized indexing nodes. The system can receive, from a partition manager of a data intake and query system, a request for a containerized indexing node that the partition manager can assign to process data received by the partition manager. The system can identify an available containerized indexing node to process the data. The system can communicate, to the partition manager, an indexing node identifier associated with the available containerized indexing node.Type: GrantFiled: July 16, 2019Date of Patent: February 6, 2024Assignee: Splunk Inc.Inventors: Tameem Anwar, Alexandros Batsakis, Sai Krishna Sajja, Igor Stojanovski, Eric Woo
-
Patent number: 11892976Abstract: Embodiments described herein facilitate enhancement of data model acceleration, including generating data model summaries and performing searches in an accelerated manner. In one implementation, obtaining a search query from a user device. A determination may be made to execute a search, in association with the search query, via an external computing service. As such, the search query, or a variant thereof, can be provided to the external computing service, wherein the external computing service executes the search using data model summaries stored in a remote data store that is separate from a set of events from which the data model summaries were generated. A set of search results are received from the external computing service, and such search results are provided to the user device.Type: GrantFiled: January 29, 2021Date of Patent: February 6, 2024Assignee: Splunk Inc.Inventors: Alexandros Batsakis, Ankit Jain, Manu Jose, Jonah Pan, Hailun Yan
-
Patent number: 11874691Abstract: Systems and methods are disclosed for processing and executing queries in a data intake and query system. The data intake and query system receives a query identifying a set of data to be processed and a manner of processing the set of data. The data intake and query system identifies buckets that are to be searched and search nodes to execute the query. The data intake and query system maps the identified buckets to the search nodes and executes the query using the identified bucket and search nodes.Type: GrantFiled: June 5, 2018Date of Patent: January 16, 2024Assignee: Splunk Inc.Inventors: Alexandros Batsakis, Ashish Mathew, Christopher Pride, Bharath Kishore Reddy Aleti, Sourav Pal, Arindam Bhattacharjee, James Monschke
-
Patent number: 11860940Abstract: Systems and methods are disclosed for processing and executing queries in a data intake and query system. The data intake and query system receives a query identifying a set of data to be processed and a manner of processing the set of data. The data intake and query system uses a search node catalog to identify search nodes that are available to execute the query and uses a bucket catalog to identify buckets to be searched. The data intake and query system executes the query using the identified bucket and search nodes.Type: GrantFiled: April 16, 2021Date of Patent: January 2, 2024Assignee: Splunk Inc.Inventors: Alexandros Batsakis, Ashish Mathew, Christopher Madden Pride, Bharath Kishore Reddy Aleti, Sourav Pal, Arindam Bhattacharjee, James Monschke
-
Patent number: 11841827Abstract: Embodiments described herein facilitate enhancement of data model acceleration, including generating data model summaries and performing searches in an accelerated manner. In one implementation, a set of events are indexed, each of the events having a corresponding index time representing a time at which the event was indexed in an indexer. Index time parameters including an index earliest time indicating a first index time at which to begin generating a data model summary and an index latest time indicating a second index time at which to complete generating the data model summary are obtained. Thereafter, a data model summary is generated. Such a data model summary summarizes events having corresponding index times between the index earliest time and the index latest time. The data model summary is provided to a remote data store that is separate from the indexer at which at least a portion of the events were indexed.Type: GrantFiled: January 29, 2021Date of Patent: December 12, 2023Assignee: Splunk Inc.Inventors: Alexandros Batsakis, Ankit Jain, Manu Jose, Jonah Pan, Hailun Yan
-
Patent number: 11829415Abstract: Systems and methods are described for improving data availability and/or resiliency of indexers of a data intake and query system. Due to a lag between the time at which data is received and the time at which the data is available for searching, the data intake and query system may receive a query indicating that received (but unavailable for search) data is to be included as part of the query. A cluster master can dynamically track what data is available for searching by different indexers and map the data to filter criteria using a bucket map identifier. When a search head receives a query, it can request a bucket map identifier from the cluster master and send the bucket map identifier to the indexers that will be executing the query. The indexers can use the bucket map identifier to request the individual buckets that they are assigned to search.Type: GrantFiled: January 31, 2020Date of Patent: November 28, 2023Assignee: Splunk Inc.Inventors: Alexandros Batsakis, Mehul Goyal, Ashish Mathew, Douglas Rapp, Igor Stojanovski, Eric Woo
-
Publication number: 20230315641Abstract: Embodiments are disclosed for performing cache aware searching. In response to a search query, a first bucket and a second bucket in remote storage for processing the search query. A determination is made that a first file in the first bucket is present in a cache when the search query is received. In response to the search query, a search is performed using the first file based on the determination that the first file is present in the cache when the search query is received, and the search is performed using a second file from the second bucket once the second file is stored in the cache.Type: ApplicationFiled: May 5, 2023Publication date: October 5, 2023Inventors: Ledion Bitincka, Alexandros Batsakis, Paul J. Lucas, Nicholas Robert Romito
-
Publication number: 20230315785Abstract: Systems and methods are disclosed for processing and executing queries in a data intake and query system. The data intake and query system receives raw machine data at an indexing system, and stores at least a portion of the raw machine data in buckets using containerized indexing nodes instantiated in a containerized environment. The data intake and query system stores the buckets in a shared storage system.Type: ApplicationFiled: June 2, 2023Publication date: October 5, 2023Inventors: Alexandros Batsakis, Sourav Pal, Sai Krishna Sajja, Igor Stojanovski, Tameem Anwar, Paul J. Lucas, Eric Woo, Steve Wong
-
Patent number: 11768776Abstract: Systems and methods are disclosed for making space available in a local storage of a data intake and query system. A cache manager of the data intake and query system may determine an amount of storage space of a local data store that is available for use to perform a query. The cache manager may then use one or more eviction policies associated with content stored at the local data store to purge content items to evict from the local storage. The system may then retrieve content for performing the query from a remote storage and store the retrieved content at the local storage.Type: GrantFiled: October 10, 2022Date of Patent: September 26, 2023Assignee: Splunk Inc.Inventors: Bharath Aleti, Alexandros Batsakis, Paul J. Lucas, Igor Stojanovski
-
Patent number: 11720537Abstract: Systems and methods are disclosed for scalable bucket merging in a data intake and query system. Various components of a bucket manager can be used to monitor recently-created buckets of data in common storage that are associated with a particular tenant and a particular index, apply a comprehensive bucket merge policy to determine groups of buckets that qualify for merging, merge those group of buckets into merged buckets to be stored in the common storage, and update any information associated with the merged buckets and pre-merged buckets. These components may be shared across multiple tenants, and some of these components may be dynamically scalable based on need. This approach may also provide many additional benefits, including improved search performance from merged buckets, efficient resource utilization associated with discriminate merging, and redundancy in case of component failure.Type: GrantFiled: April 29, 2022Date of Patent: August 8, 2023Assignee: Splunk Inc.Inventors: Tameem Anwar, Tianyi Gou, Alexandros Batsakis, Abhinav Prasad Nekkanti, Sai Krishna Sajja, Jiahan Wang
-
Patent number: 11693710Abstract: Resource management includes storing, for multiple workload pools of a data intake and query system, a workload pool hierarchy arranged in multiple workload pool layers. After storing a processing request is assigned a selected subset of workload pools in a second layer of the workload pool hierarchy based on a type of processing request. The processing request is then assigned to an individual workload pool in the selected subset to obtain a selected workload pool. Execution of the processing request is initiated on the selected workload pool.Type: GrantFiled: January 31, 2022Date of Patent: July 4, 2023Assignee: Splunk Inc.Inventors: Bharath Kishore Reddy Aleti, Alexandros Batsakis, Mitchell Neuman Blank, Rama Gopalan, Hongxun Liu, Anish Shrigondekar