Patents by Inventor Ledion Bitincka

Ledion Bitincka has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11934418
    Abstract: Techniques and mechanisms are disclosed to optimize the size of index files to improve use of storage space available to indexers and other components of a data intake and query system. Index files of a data intake and query system may include, among other data, a keyword portion containing mappings between keywords and location references to event data containing the keywords. Optimizing an amount of storage space used by index files may include removing, modifying and/or recreating various components of index files in response to detecting one or more storage conditions related to the event data indexed by the index files. The optimization of index files generally may attempt to manage a tradeoff between an efficiency with which search requests can be processed using the index files and an amount of storage space occupied by the index files.
    Type: Grant
    Filed: September 14, 2021
    Date of Patent: March 19, 2024
    Assignee: Splunk, Inc.
    Inventors: Ashish Mathew, Ledion Bitincka, Igor Stojanovski, Dhruva Kumar Bhagi
  • Patent number: 11921602
    Abstract: In some aspects, an edge-based data collection system discovers, collects, processes, and forwards data in an observability pipeline system. In some implementations, an edge agent of the observability pipeline system runs on a computer node. The edge agent identifies processes running on the computer node; identifies files on the computer node that the processes have opened for writing; accesses log discovery parameters of the observability pipeline system; selects a plurality of files from the identified files according to the log discovery parameters; generates a list of discovered log files that includes a path and a name for each of the plurality of files; adds the list of discovered log files to a list of monitored log files to be monitored by the observability pipeline system; and then monitors the plurality of files to generate input for the observability pipeline system.
    Type: Grant
    Filed: February 10, 2023
    Date of Patent: March 5, 2024
    Assignee: Cribl, Inc.
    Inventors: Paul Avery Dugas, Dritan Bitincka, Ledion Bitincka
  • Patent number: 11914562
    Abstract: A method and system for managing searches of a data set that is partitioned based on a plurality of events. A structure of a search query may be analyzed to determine if logical computational actions performed on the data set is reducible. Data in each partition is analyzed to determine if at least a portion of the data in the partition is reducible. In response to a subsequent or reoccurring search request, intermediate summaries of reducible data and reducible search computations may be aggregated for each partition. Next, a search result may be generated based on at least one of the aggregated intermediate summaries, the aggregated reducible search computations, and a query of adhoc non-reducible data arranged in at least one of the plurality of partitions for the data set.
    Type: Grant
    Filed: February 8, 2023
    Date of Patent: February 27, 2024
    Assignee: SPLUNK INC.
    Inventors: Ledion Bitincka, Stephen Phillip Sorkin, Steve Yu Zhang
  • Publication number: 20230376491
    Abstract: In some aspects, search functionality is provided in an observability pipeline system. In some implementations, a search query is received at a computer node from a leader role of an observability pipeline system. The search query represents a request to search event data at the computer node and includes a first search operator that specifies a system state context criterion and a second search operator that specifies an event criterion. An observability pipeline process is configured according to the search query. Search results are generated based on applying the observability pipeline process at the computer node. A current system state of the computer node that matches the system state context criterion specified by the first search operator is determined; and a subset of event data on the computer node that matches the event criterion specified by the second search operator is identified. The search results including the subset of event data are sent to the leader role.
    Type: Application
    Filed: May 23, 2023
    Publication date: November 23, 2023
    Applicant: CribI, Inc.
    Inventors: Clint Sharp, Dritan Bitincka, Ledion Bitincka, Oliver Draese
  • Publication number: 20230376509
    Abstract: In some aspects, an interactive graphical user interface displays search data for an observability pipeline system. In some aspects, a method includes obtaining search results including events identified based on searching data generated by an observability pipeline system. The method includes identifying time bins based on the search results; generating first histogram data based on the time bins and the events; and generating second histogram data based on the time bins and the events. The method includes generating a graphical user interface including a first histogram representing the first histogram data and including a first set of bins, and a second histogram representing the second histogram data and including a second set of bins. The method includes updating the graphical user interface to include a visual indication of a selected bin in the first histogram and a visual indication of a corresponding bin in the second histogram.
    Type: Application
    Filed: May 22, 2023
    Publication date: November 23, 2023
    Applicant: Cribl, Inc.
    Inventors: Clint Sharp, Dritan Bitincka, Ledion Bitincka, Oliver Draese, Janusz Sobolewski
  • Publication number: 20230376483
    Abstract: In some aspects, search functionality is provided in an observability pipeline system. In some implementations, a method of searching remotely-stored data includes receiving a search query at a computer node residing at a node geolocation, the search query representing a request to search data stored at a storage geolocation; identifying compute geolocations each including computing resources, each of the compute geolocations being distinct from the node geolocation and the storage geolocation; obtaining latency data including latency values for the compute geolocations; selecting one of the compute geolocations based on the latency data; by operation of a coordinator agent on the computer node, initiating a dynamic computing resource at the selected compute geolocation, and receiving search results identified by the dynamic computing resource.
    Type: Application
    Filed: May 22, 2023
    Publication date: November 23, 2023
    Applicant: Cribl, Inc.
    Inventors: Clint Sharp, Dritan Bitincka, Ledion Bitincka, Oliver Draese
  • Publication number: 20230376498
    Abstract: In some aspects, search functionality is provided in an observability pipeline system. In some implementations, a search method includes receiving a search query from a leader role in an observability pipeline system. The search query represents a request to search event data at a computer resource. An observability pipeline process is configured to perform a search according to the search query, and search results are obtained based on applying the observability pipeline process to the event data. The search results include events from the event data. Provenance information is obtained for each of the events. The provenance information for each event includes an identification of the computer resource and a link to the computer resource. Augmented search results are generated by associating the provenance information with the respective events, and the augmented search results are communicated to the leader role.
    Type: Application
    Filed: May 23, 2023
    Publication date: November 23, 2023
    Applicant: Cribl, Inc.
    Inventors: Clint Sharp, Dritan Bitincka, Ledion Bitincka, Oliver Draese
  • Publication number: 20230315641
    Abstract: Embodiments are disclosed for performing cache aware searching. In response to a search query, a first bucket and a second bucket in remote storage for processing the search query. A determination is made that a first file in the first bucket is present in a cache when the search query is received. In response to the search query, a search is performed using the first file based on the determination that the first file is present in the cache when the search query is received, and the search is performed using a second file from the second bucket once the second file is stored in the cache.
    Type: Application
    Filed: May 5, 2023
    Publication date: October 5, 2023
    Inventors: Ledion Bitincka, Alexandros Batsakis, Paul J. Lucas, Nicholas Robert Romito
  • Patent number: 11748160
    Abstract: Load balancing processes are performed in an observability pipeline system comprising a plurality of computing resources. In some aspects, the observability pipeline system defines a leader role and worker roles. A plurality of computing jobs each include computing tasks associated with event data. The leader role dispatches the computing tasks to the worker roles according to a least in-flight task dispatch criteria, which includes iteratively: identifying an available worker role; identifying one or more incomplete computing jobs; selecting, from the one or more incomplete computing jobs, a computing job that has the least number of in-flight computing tasks currently being executed in the observability pipeline system; identifying a next computing task from the selected computing job; and dispatching the next computing task to the available worker role. The worker roles execute the computing tasks by applying an observability pipeline process to the event data associated with the respective computing task.
    Type: Grant
    Filed: June 14, 2021
    Date of Patent: September 5, 2023
    Assignee: Cribl, Inc.
    Inventors: Dritan Bitincka, Ledion Bitincka, Nicholas Robert Romito, Clint Sharp
  • Publication number: 20230259438
    Abstract: In some aspects, an edge-based data collection system discovers, collects, processes, and forwards data in an observability pipeline system. In some implementations, an edge agent of the observability pipeline system runs on a computer node. The edge agent identifies processes running on the computer node; identifies files on the computer node that the processes have opened for writing; accesses log discovery parameters of the observability pipeline system; selects a plurality of files from the identified files according to the log discovery parameters; generates a list of discovered log files that includes a path and a name for each of the plurality of files; adds the list of discovered log files to a list of monitored log files to be monitored by the observability pipeline system; and then monitors the plurality of files to generate input for the observability pipeline system.
    Type: Application
    Filed: February 10, 2023
    Publication date: August 17, 2023
    Applicant: Cribl, Inc.
    Inventors: Paul Avery Dugas, Dritan Bitincka, Ledion Bitincka
  • Patent number: 11645210
    Abstract: Embodiments are disclosed for performing cache aware searching. In response to a search query, a first bucket and a second bucket in remote storage for processing the search query. A determination is made that a first file in the first bucket is present in a cache when the search query is received. In response to the search query, a search is performed using the first file based on the determination that the first file is present in the cache when the search query is received, and the search is performed using a second file from the second bucket once the second file is stored in the cache.
    Type: Grant
    Filed: February 25, 2022
    Date of Patent: May 9, 2023
    Assignee: Splunk Inc.
    Inventors: Ledion Bitincka, Alexandros Batsakis, Paul J. Lucas, Nicholas Robert Romito
  • Patent number: 11620336
    Abstract: Systems and methods are disclosed for processing and executing queries in a data intake and query system. The data intake and query system receives raw machine data at an indexing system, and stores at least a portion of the raw machine data in buckets. Based on a determination that the size of multiple buckets satisfies a threshold size, the data intake and query system converts the buckets to non-editable buckets and stores the data in a remote shared storage system.
    Type: Grant
    Filed: April 30, 2018
    Date of Patent: April 4, 2023
    Assignee: Splunk Inc.
    Inventors: Alexandros Batsakis, Sourav Pal, Sai Krishna Sajja, Igor Stojanovski, Ledion Bitincka, John Nguyen
  • Patent number: 11604779
    Abstract: A method and system for managing searches of a data set that is partitioned based on a plurality of events. A structure of a search query may be analyzed to determine if logical computational actions performed on the data set is reducible. Data in each partition is analyzed to determine if at least a portion of the data in the partition is reducible. In response to a subsequent or reoccurring search request, intermediate summaries of reducible data and reducible search computations may be aggregated for each partition. Next, a search result may be generated based on at least one of the aggregated intermediate summaries, the aggregated reducible search computations, and a query of adhoc non-reducible data arranged in at least one of the plurality of partitions for the data set.
    Type: Grant
    Filed: May 10, 2021
    Date of Patent: March 14, 2023
    Assignee: SPLUNK INC.
    Inventors: Ledion Bitincka, Stephen Phillip Sorkin, Steve Yu Zhang
  • Publication number: 20220292021
    Abstract: Embodiments are disclosed for performing cache aware searching. In response to a search query, a first bucket and a second bucket in remote storage for processing the search query. A determination is made that a first file in the first bucket is present in a cache when the search query is received. In response to the search query, a search is performed using the first file based on the determination that the first file is present in the cache when the search query is received, and the search is performed using a second file from the second bucket once the second file is stored in the cache.
    Type: Application
    Filed: February 25, 2022
    Publication date: September 15, 2022
    Inventors: Ledion Bitincka, Alexandros Batsakis, Paul J. Lucas, Nicholas Robert Romito
  • Patent number: 11416505
    Abstract: In embodiments, a computer-implemented method may entail receiving a search request. A first data store and a second data store, that contains data archived from the first data store, may be identified. Data from the first data store may remain available in the first data store for a limited period of time once archived to the second data store. The first data store storing data in a first format and the second data store storing data in a second format, the first format and the second format being different from one another. Determining that a subset of data that has been archived into the second data store and is to be searched as part of the search request is still available from the first data store, and executing the search request on the subset of data utilizing the first data store. Additional embodiments are described and/or claimed.
    Type: Grant
    Filed: October 26, 2020
    Date of Patent: August 16, 2022
    Assignee: Splunk Inc.
    Inventors: Elizabeth Lin, Nils Petter Eriksson, Ledion Bitincka
  • Patent number: 11403350
    Abstract: A search request received at a computer of a search support system is processed by analyzing the received search request to identify request parameters and connecting to a system index of the search support system that is referenced in the request parameters. An external result provider (ERP) process is initiated that establishes communication between the search support system and a data source external to the search support system, for a virtual index referenced in the request parameters. Thus, the ERP process provides an interface between the search support system and external data sources, such as by third parties. The ERP process can operate in a streaming mode (providing real-time search results with minimal processing) and/or a reporting mode (providing results with a greater delay and processing extent) and can switch between modes. The search request results are received from the connected system indexes and the referenced virtual indexes.
    Type: Grant
    Filed: March 25, 2020
    Date of Patent: August 2, 2022
    Assignee: Splunk Inc.
    Inventors: Ledion Bitincka, Steve Zhang, Igor Stojanovski, Stephen Sorkin
  • Patent number: 11394767
    Abstract: In a computer-implemented method for configuring a distributed computer system comprising a plurality of nodes of a plurality of node classes, configuration files for a plurality of nodes of each of the plurality of node classes are stored in a central repository. The configuration files include information representing a desired system state of the distributed computer system, and the distributed computer system operates to keep an actual system state of the distributed computer system consistent with the desired system state. The plurality of node classes includes forwarder nodes for receiving data from an input source, indexer nodes for indexing the data, and search head nodes for searching the data. Responsive to receiving changes to the configuration files, the changes are propagated to nodes of the plurality of nodes impacted by the changes based on a node class of the nodes impacted by the changes.
    Type: Grant
    Filed: September 2, 2020
    Date of Patent: July 19, 2022
    Assignee: Splunk Inc.
    Inventors: Ledion Bitincka, Vishal Patel, Geoffrey Hendrey, Eric Woo
  • Patent number: 11392655
    Abstract: A search request received at a computer of a search support system is processed by analyzing the received search request to identify request parameters and connecting to a system index of the search support system that is referenced in the request parameters. An external result provider (ERP) process is initiated that establishes communication between the search support system and a data source external to the search support system, for a virtual index referenced in the request parameters. Thus, the ERP process provides an interface between the search support system and external data sources, such as by third parties. The ERP process can operate in a streaming mode (providing realtime search results with minimal processing) and/or a reporting mode (providing results with a greater delay and processing extent) and can switch between modes. The search request results are received from the connected system indexes and the referenced virtual indexes.
    Type: Grant
    Filed: October 26, 2020
    Date of Patent: July 19, 2022
    Assignee: Splunk Inc.
    Inventors: Ledion Bitincka, Steve Zhang, Igor Stojanovski, Stephen Sorkin
  • Patent number: 11263140
    Abstract: Embodiments are disclosed for performing cache aware searching. In response to a search query, a first bucket and a second bucket in remote storage for processing the search query. A determination is made that a first file in the first bucket is present in a cache when the search query is received. In response to the search query, a search is performed using the first file based on the determination that the first file is present in the cache when the search query is received, and the search is performed using a second file from the second bucket once the second file is stored in the cache.
    Type: Grant
    Filed: May 29, 2020
    Date of Patent: March 1, 2022
    Assignee: Splunk Inc.
    Inventors: Ledion Bitincka, Alexandros Batsakis, Paul J. Lucas, Nicholas Robert Romito
  • Publication number: 20210406258
    Abstract: Techniques and mechanisms are disclosed to optimize the size of index files to improve use of storage space available to indexers and other components of a data intake and query system. Index files of a data intake and query system may include, among other data, a keyword portion containing mappings between keywords and location references to event data containing the keywords. Optimizing an amount of storage space used by index files may include removing, modifying and/or recreating various components of index files in response to detecting one or more storage conditions related to the event data indexed by the index files. The optimization of index files generally may attempt to manage a tradeoff between an efficiency with which search requests can be processed using the index files and an amount of storage space occupied by the index files.
    Type: Application
    Filed: September 14, 2021
    Publication date: December 30, 2021
    Inventors: ASHISH MATHEW, LEDION BITINCKA, IGOR STOJANOVSKI, DHRUVA KUMAR BHAGI