Patents by Inventor Sourav Pal

Sourav Pal has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11481396
    Abstract: Systems and methods are disclosed for generating a distributed execution model with untrusted commands. The system can receive a query, and process the query to identify the untrusted commands. The system can use data associated with the untrusted command to identify one or more files associated with the untrusted command. Based on the files, the system can generate a data structure and include one or more identifiers associated with the data structure in the distributed execution model. The system can distribute the distributed execution model to one or more nodes in a distributed computing environment for execution.
    Type: Grant
    Filed: April 17, 2020
    Date of Patent: October 25, 2022
    Assignee: Splunk Inc.
    Inventors: Arindam Bhattacharjee, Sourav Pal, Alexander Douglas James
  • Publication number: 20220327149
    Abstract: Systems and methods are disclosed for processing and executing queries against one or more dataset sources, where the queries identify a set of data to be processed and a manner of processing the set of data. To query the dataset sources, a query coordinator generates a query processing scheme that includes a dynamic allocation of multiple layers of partitions. The query is then executed based on the query processing scheme.
    Type: Application
    Filed: March 17, 2022
    Publication date: October 13, 2022
    Inventors: Sourav Pal, Arindam Bhattacharjee, Kishore Reddy Ramasayam, Alexander Douglas James
  • Publication number: 20220327125
    Abstract: Systems and methods are described for scheduling a query for execution. The system receives and parses a query to identify one or more portions of the query. The system determines a resource allocation for each portion of the query, and determines an availability of compute resources for the different portions of the query. Based on the resource allocation and the availability of compute resources, the system schedules the query.
    Type: Application
    Filed: April 11, 2022
    Publication date: October 13, 2022
    Inventors: Sourav Pal, Arindam Bhattacharjee, Nikhil Roy
  • Patent number: 11461334
    Abstract: Systems and methods are disclosed for processing queries against one or more dataset sources utilizing dynamically allocated partitions operating on one or more worker nodes. The results of the processing are stored in a dataset destination. The queries can identify data in the one or more dataset sources for processing and a manner for processing the data. In addition, the queries can identify the dataset destination for storing results of the query. To process the query, a query coordinator can dynamically allocate partitions operating on worker nodes to retrieve data for processing, process the data, and communicate the data to the dataset sources. In addition, the query coordinator can dynamically allocate partitions based on an identification of the dataset destination.
    Type: Grant
    Filed: July 31, 2017
    Date of Patent: October 4, 2022
    Assignee: Splunk Inc.
    Inventors: Arindam Bhattacharjee, Sourav Pal, Alexander Douglas James, Christopher Pride
  • Publication number: 20220298532
    Abstract: Method for the production of ethanol from corn fibres. Invention relates to a process for the preparation of ethanol from a corn fibre containing feedstock by using a set of process steps specifically identified for the treatment of the said feedstock. It particularly relates to the use of a soaking followed by mix acid and enzyme treatment that release fermentable C5 and C6 sugars from said feedstock. It further relates to using a recombinant yeast to convert both C5 and C6 sugars to ethanol.
    Type: Application
    Filed: June 8, 2020
    Publication date: September 22, 2022
    Inventors: Siddhartha Sourav PAL, Prasanna Sham PAI, Ajit Prabhakar DESHMUKH, Sandip Uttamrao NALWADE, Nilesh Ankush BORAGE, Ghanshyam Baburao DESHPANDE, Pramod Shankar KUMBHAR
  • Patent number: 11442935
    Abstract: Systems and methods are described for determining a record generation estimate related to a particular processing task. The system obtains a sample set of data that includes multiple records. The system applies a processing task, such as a transform or regular expression rule to the sample set of data and determines how many records are generated by the processing task. Based on the number of records generated, the system determines a record generation estimate. The system can use the record generation estimate to allocate compute resources or determine a query execution time for at least a portion of the query based on the record generation estimate.
    Type: Grant
    Filed: April 29, 2019
    Date of Patent: September 13, 2022
    Assignee: Splunk Inc.
    Inventors: Sourav Pal, Arindam Bhattacharjee, Asha Andrade
  • Publication number: 20220269727
    Abstract: Systems and methods are disclosed for processing and executing queries in a data intake and query system. The data intake and query system receives raw machine data at an indexing system, and stores at least a portion of the raw machine data in buckets using containerized indexing nodes instantiated in a containerized environment. The data intake and query system stores the buckets in a shared storage system.
    Type: Application
    Filed: January 3, 2022
    Publication date: August 25, 2022
    Inventors: Alexandros Batsakis, Sourav Pal, Sai Krishna Sajja, Igor Stojanovski, Tameem Anwar, Paul J. Lucas, Eric Woo, Steve Wong
  • Patent number: 11416528
    Abstract: Systems and methods for a data index and query system that utilize a query acceleration data store. An example method includes receiving a query identifying a set of data to be processed and a manner of processing the set of data. A query processing scheme for obtaining and processing the set of data is defined. First partial results of the query stored in a data store are identified, with the first partial results corresponding to a first portion of the set of data. One or more partitions are dynamically allocated to obtain a second portion of the set of data from different data sources. The second portion of the set of data is processed to obtain second partial results. The first partial results and second partial results are combined. The query is executed based on the query processing scheme.
    Type: Grant
    Filed: July 31, 2017
    Date of Patent: August 16, 2022
    Assignee: Splunk Inc.
    Inventors: Sourav Pal, Arindam Bhattacharjee, Asha Andrade
  • Patent number: 11392654
    Abstract: Disclosed is a data fabric service system that can be implemented in a distributed computer network, such as a data intake and query system. The data index and query system can receive a search query and define a search scheme for applying the search query on distributed data storage systems including internal data storage and external data storage. The data index and query system may provide a portion of the search scheme to a search service of the data fabric service system, which can cause worker nodes of the data fabric service system to perform various functions—including applying the search query to the external data storage based on the portion of the search scheme in order to obtain search results.
    Type: Grant
    Filed: January 30, 2020
    Date of Patent: July 19, 2022
    Assignee: Splunk Inc.
    Inventors: Sourav Pal, Christopher Pride, Arindam Bhattacharjee, Xiaowei Wang, James Alasdair Robert Hodge, Mustafa Ahamed
  • Patent number: 11341131
    Abstract: Systems and methods are described for scheduling a query for execution. The system receives and parses a query to identify one or more portions of the query. The system determines a resource allocation for each portion of the query, and determines an availability of compute resources for the different portions of the query. Based on the resource allocation and the availability of compute resources, the system schedules the query.
    Type: Grant
    Filed: April 29, 2019
    Date of Patent: May 24, 2022
    Assignee: Splunk Inc.
    Inventors: Sourav Pal, Arindam Bhattacharjee, Nikhil Roy
  • Publication number: 20220156335
    Abstract: Systems and methods are disclosed for processing streaming data. The data can come from various sources. Worker nodes can be configured to process the streaming data, without delays that may be caused by indexing the data. The data can be filtered and/or transformed as it is processed. In some cases, data can be stored in a data store without transformation. The data in the data store can be accessed and processed at a later time.
    Type: Application
    Filed: January 31, 2022
    Publication date: May 19, 2022
    Inventors: Arindam Bhattacharjee, Alexander Douglas James, Sourav Pal
  • Patent number: 11321321
    Abstract: Systems and methods are described for processing records associated with a query that identifies an association between two data fields. The system can obtain a chunk of data that includes multiple records based on a query received by a data intake and query system. At least one record can include multiple sub-records that share a field value for at least one field. The system can generate a record from each sub-record and assign the generated records to one or more groups of partitions. The system can combine record data of generated records assigned to one partition of a group of partitions and then combine record data across the group of partitions. The system can process the results of the combination of records across the group of partitions based on the query.
    Type: Grant
    Filed: April 29, 2019
    Date of Patent: May 3, 2022
    Assignee: Splunk Inc.
    Inventors: Arindam Bhattacharjee, Sourav Pal, Wayne Patterson
  • Patent number: 11314753
    Abstract: Systems and methods are disclosed for receiving and executing a query received from a data intake and query system and providing results to a first group of worker nodes in a distributed execution environment. The query identifies a set of data to be processed and a manner of processing the set of data. Based on the query, the system defines a query processing scheme, and generates instructions for a second group of worker nodes to obtain the set of data from one or more dataset sources and to process the set of data. The system communicates results of the query to the first group of worker nodes.
    Type: Grant
    Filed: July 31, 2018
    Date of Patent: April 26, 2022
    Assignee: Splunk Inc.
    Inventors: Sourav Pal, Arindam Bhattacharjee
  • Patent number: 11310313
    Abstract: Multi-threaded processing of search responses returned by search peers is disclosed. An example method may include transmitting, by a computer system, a search request to a plurality of search peers of a data aggregation and analysis system; receiving, by a first processing thread, a plurality of data packets from the plurality of search peers; parsing, by a second processing thread operating asynchronously with respect to the first processing thread, one or more data packets of the plurality of data packets, to produce a partial response to the search request; splitting the partial response into two or more fields; and generating, based on the two or more fields of the partial response, an aggregated response to the search request.
    Type: Grant
    Filed: September 8, 2020
    Date of Patent: April 19, 2022
    Assignee: SPLUNK INC.
    Inventors: Sourav Pal, Christopher Madden Pride
  • Patent number: 11294941
    Abstract: Systems and methods are described for preprocessing data later ingested into an indexing system. The preprocessing can include receiving messages published to a first publish-subscribe messaging system, the messages containing raw machine data generated by one or more components in an information technology environment, performing one or more processing operations on at least some of the messages to generate preprocessed messages, republishing the preprocessed messages to a second publish-subscribe messaging system, and providing to the indexing system, a subset of the messages from the second publish-subscribe messaging system.
    Type: Grant
    Filed: June 5, 2018
    Date of Patent: April 5, 2022
    Assignee: Splunk Inc.
    Inventors: Eric Sammer, Sourav Pal, Joseph Gabriel Echeverria
  • Patent number: 11281706
    Abstract: Systems and methods are disclosed for processing and executing queries against one or more dataset sources, where the queries identify a set of data to be processed and a manner of processing the set of data. To query the dataset sources, a query coordinator generates a query processing scheme that includes a dynamic allocation of multiple layers of partitions. The query is then executed based on the query processing scheme.
    Type: Grant
    Filed: July 31, 2017
    Date of Patent: March 22, 2022
    Assignee: Splunk Inc.
    Inventors: Sourav Pal, Arindam Bhattacharjee, Kishore Reddy Ramasayam, Alexander Douglas James
  • Patent number: 11269939
    Abstract: Systems and methods are described for processing data by interactions with a publish-subscribe messaging system. The processing may include receiving messages published to at least one publish-subscribe messaging system, the messages containing data collected during operation of one or more source computing systems and iteratively processing the received messages through the publish-subscribe messaging system, wherein iteratively processing a message comprises serially republishing the message to the at least one publish-subscribe messaging system and altering the message between individual republishings according to a set of rules, and wherein a final republishing of the message during the serial republishing makes a version of the message available on the at least one publish-subscribe messaging system to a downstream component.
    Type: Grant
    Filed: April 30, 2018
    Date of Patent: March 8, 2022
    Assignee: Splunk Inc.
    Inventors: Eric Sammer, Sourav Pal, Joseph Gabriel Echeverria
  • Patent number: 11263470
    Abstract: A content saliency network is a machine-learned neural network that predicts the saliency of elements of a content item. The content saliency network may be used in a method that includes determining a set of elements in draft content and computing a first pixel-level vector for the content. The method may also include, for each element in the set of elements, computing a vector of simple features for the element, the simple features being computed from attributes of the element, computing a second pixel-level vector for the element, computing a third pixel-level vector for an intermediate context of the element, and providing the vectors to the content saliency network. The content saliency network provides a saliency score for the element. The method further includes generating an element-level saliency map of the content using the respective saliency scores for the set of elements and providing the saliency map to a requestor.
    Type: Grant
    Filed: November 15, 2017
    Date of Patent: March 1, 2022
    Assignee: ADOBE INC.
    Inventors: Prakhar Gupta, Shubh Gupta, Ritwik Sinha, Sourav Pal, Ajaykrishnan Jayagopal
  • Patent number: 11250056
    Abstract: Systems and methods are disclosed for processing and executing queries in a data intake and query system. An indexing system of the data intake and query system receives data from an ingestion buffer that includes a marker that indicates data that is made available to the indexing system. The data intake and query system stores at least a portion of the data in buckets and stores the buckets in a shared storage system. Based on the storage of the buckets in the shared storage system, the indexing system indicates to the ingestion buffer that the marker can be updated.
    Type: Grant
    Filed: April 30, 2018
    Date of Patent: February 15, 2022
    Assignee: Splunk Inc.
    Inventors: Alexandros Batsakis, Sourav Pal, Sai Krishna Sajja, Igor Stojanovski, Tameem Anwar, Eric Woo, Steve Wong
  • Patent number: 11243963
    Abstract: Systems and methods are disclosed for executing a query that includes an indication to process data managed by an external data system. The system identifies the external data system that manages the data to be processed, and generates a subquery for the external data system indicating that the results of the subquery are to be sent to multiple worker nodes. The system also generates instructions for multiple worker nodes to receive and process results of the subquery from the external data system.
    Type: Grant
    Filed: July 31, 2018
    Date of Patent: February 8, 2022
    Assignee: Splunk Inc.
    Inventors: Sourav Pal, Arindam Bhattacharjee