Patents by Inventor Arindam Bhattacharjee

Arindam Bhattacharjee has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20190163824
    Abstract: The disclosed embodiments include a technique to obtain search results from the application of transformation operations on partial search results obtained from across internal and/or external data sources. Examples of transformation operations include arithmetic operations such as an average, mean, count, or the like. Examples of reporting transformations include join operations, statistics, sort, top head. Hence, the search results of a search query can be derived from partial search result rather than include the actual partial search results. In this case, the ordering of the search results may be nonessential. An example of a search query that requires a transformation operation is a “batch” or “reporting” search query. The related disclosed techniques involve obtaining data stored in the bid data ecosystem, and returning that data or data derived from that data.
    Type: Application
    Filed: October 31, 2016
    Publication date: May 30, 2019
    Inventors: Sourav Pal, Arindam Bhattacharjee, Christopher Pride
  • Publication number: 20190163841
    Abstract: The capabilities of a data intake and query system can be improved by implementing the data fabric service (DFS) system in a co-located deployment with the data intake and query system. The DFS system can extend the capabilities of a data intake and query system by leveraging computing assets from anywhere in a big data ecosystem to collectively execute search queries on diverse data systems regardless of whether data stores are internal of the data intake and query system and/or external data stores that are communicatively coupled to the data intake and query system over a network.
    Type: Application
    Filed: October 31, 2016
    Publication date: May 30, 2019
    Inventors: Arindam Bhattacharjee, Sourav Pal
  • Publication number: 20190163796
    Abstract: The disclosed embodiments also include monitoring and metering services of the data fabric service (DFS) system. Specifically, these services can include techniques for monitoring and metering metrics of the DFS system. The metrics are standards for measuring use or misuse of the DFS system. Examples of the metrics include data or components of the DFS system. For example, a metric can include data stored or communicated by the DFS system or components of the DFS system that are used or reserved for exclusive use by customers. The metrics can be measured with respect to time or computing resources (e.g., CPU utilization, memory usage) of the DFS system. For example, a DFS service can include metering the usage of particular worker nodes by a customer over a threshold period of time.
    Type: Application
    Filed: October 31, 2016
    Publication date: May 30, 2019
    Inventors: James Alasdair Robert Hodge, Sourav Pal, Arindam Bhattacharjee, Mustafa Ahamed
  • Publication number: 20190163822
    Abstract: The disclosed embodiments include techniques to obtain ordered search results based on partial search results from across multiple diverse internal and/or external data sources. The ordering of the search results may be with respect to a parameter associated with the partial search results. An example of a parameter includes time. As such, the disclosed technique can provide a time-ordered search result based on partial search results obtained from across multiple internal and/or external data sources. Moreover, the disclosed technique can provide time-ordered search results regardless of whether the partial search results obtained from the diverse data sources are timestamped.
    Type: Application
    Filed: October 31, 2016
    Publication date: May 30, 2019
    Inventors: Arindam Bhattacharjee, Sourav Pal, Christopher Pride
  • Publication number: 20190163842
    Abstract: The performance and flexibility of a data intake and query system having capabilities extended by a fabric service (DFS) system can be improved with deployment on a cloud computing platform. The DFS system can extend the capabilities of a data intake and query system by leveraging computing assets from anywhere in a big data ecosystem to collectively execute search queries on diverse data systems regardless of whether data stores are internal of the data intake and query system and/or external data stores that are communicatively coupled to the data intake and query system over a network.
    Type: Application
    Filed: October 31, 2016
    Publication date: May 30, 2019
    Inventors: Sourav Pal, Arindam Bhattacharjee
  • Publication number: 20190147085
    Abstract: Systems and methods are disclosed for receiving, at a data intake and query system, a query that includes a subquery that is to be executed at an external data system that supports a different query language than the data intake and query system. The data intake and query system converts the subquery from the query language supported by the external data system to the query language supported by the data intake and query system. The data intake and query system then processes the query including the translated subquery. The translated subquery is then translated back to the language supported by the external data system including any processing or optimizations performed with respect to the subquery.
    Type: Application
    Filed: September 28, 2018
    Publication date: May 16, 2019
    Inventors: Sourav Pal, Arindam Bhattacharjee
  • Publication number: 20190147084
    Abstract: Systems and methods are disclosed for executing a query that includes an indication to process data managed by an external data system. The system identifies the external data system that manages the data to be processed and generates a subquery for the external data system indicating that the results of the subquery are to be sent to one worker node of multiple worker nodes. The system instructs the one worker node to distribute the results received from the external data system to multiple worker nodes for processing.
    Type: Application
    Filed: July 31, 2018
    Publication date: May 16, 2019
    Inventors: Sourav Pal, Arindam Bhattacharjee
  • Publication number: 20190147092
    Abstract: Systems and methods are disclosed for executing a query that includes an indication to process data managed by an external data system. The system identifies the external data system that manages the data to be processed, and generates a subquery for the external data system indicating that the results of the subquery are to be sent to multiple worker nodes. The system also generates instructions for multiple worker nodes to receive and process results of the subquery from the external data system.
    Type: Application
    Filed: July 31, 2018
    Publication date: May 16, 2019
    Inventors: Sourav Pal, Arindam Bhattacharjee
  • Publication number: 20190147086
    Abstract: Systems and methods are disclosed for receiving, at a data intake and query system, a query that includes an indication to process data managed by a third-party data storage and processing system that supports a different query language than the data intake and query system. The data intake and query system identifies a third-party data storage and processing system that manages the data to be processed and generates a subquery for execution by the third-party data storage and processing system, generates instructions for one or more worker nodes to receive and process results of the subquery from the third-party data storage and processing system, and instructs the worker nodes to provide results of the processing to the data intake and query system.
    Type: Application
    Filed: September 28, 2018
    Publication date: May 16, 2019
    Inventors: Sourav Pal, Arindam Bhattacharjee
  • Publication number: 20190138638
    Abstract: Systems and methods are disclosed for processing data chunks from different data sources at an execution node in a distributed execution environment. The execution node receives data chunks from different sources and combines data from groups of data chunks into partitions based on an associated data source. The execution node executes the partitions using one or more processors.
    Type: Application
    Filed: July 31, 2018
    Publication date: May 9, 2019
    Inventors: Sourav Pal, Arindam Bhattacharjee
  • Publication number: 20190138640
    Abstract: Systems and methods are disclosed for executing a query that includes an indication to process data managed by an external data system. The system identifies the external data system that manages the data to be processed, and obtained search configuration data from the external system. The system uses the search configuration data to generate a subquery for the external data system. The system also generates instructions for one or more worker nodes to receive and process results of the subquery from the external data system.
    Type: Application
    Filed: July 31, 2018
    Publication date: May 9, 2019
    Inventors: Sourav Pal, Arindam Bhattacharjee
  • Publication number: 20190138641
    Abstract: Systems and methods are disclosed for executing a query that includes an indication to process data managed by an external data system. The system identifies the external data system that manages the data to be processed and generates a subquery for the external data system. The system determines a data ingest estimate and uses the data ingest estimate to generate instructions for one or more worker nodes to receive and process results of the subquery from the external data system.
    Type: Application
    Filed: July 31, 2018
    Publication date: May 9, 2019
    Inventors: Sourav Pal, Arindam Bhattacharjee
  • Publication number: 20190138639
    Abstract: Systems and methods are disclosed for receiving, at a first data intake and query system, a query that includes an indication to process data managed by another data intake and query system. The first data intake and query system identifies a second data intake and query system that manages the data to be processed and generates a subquery for execution by the second data intake and query system, generates instructions for one or more worker nodes to receive and process results of the subquery from the second data intake and query system, and instructs the worker nodes to provide results of the processing to the first data intake and query system.
    Type: Application
    Filed: July 31, 2018
    Publication date: May 9, 2019
    Inventors: Sourav Pal, Arindam Bhattacharjee
  • Publication number: 20190138642
    Abstract: Systems and methods are disclosed for receiving and executing a query received from a data intake and query system and providing results to a first group of worker nodes in a distributed execution environment. The query identifies a set of data to be processed and a manner of processing the set of data. Based on the query, the system defines a query processing scheme, and generates instructions for a second group of worker nodes to obtain the set of data from one or more dataset sources and to process the set of data. The system communicates results of the query to the first group of worker nodes.
    Type: Application
    Filed: July 31, 2018
    Publication date: May 9, 2019
    Inventors: Sourav Pal, Arindam Bhattacharjee
  • Patent number: 10268645
    Abstract: A user uploads date sets through a client to a database. The data sets are provisioned in the database for in-database searching. The data sets are evaluated and classifications for the columns of the tables that include the data set are detected. Columns content may be classified into different analysis types, aggregation types, formats, categories, hierarchies, etc. Metadata is generated based on the evaluation of the data sets. A schema is used to store the metadata that describes the detected classification of the columns. The schema is stored in the database and is used when a search in the database is performed.
    Type: Grant
    Filed: April 4, 2013
    Date of Patent: April 23, 2019
    Assignee: SAP SE
    Inventors: Ganesh Vaitheeswaran, Rahul Kumar Jha, Suraj Pai Airody, Anup Kumar Rai, Arindam Bhattacharjee
  • Publication number: 20190095488
    Abstract: Systems and methods are disclosed for executing a distributed execution model with untrusted commands. The distributed execution model can be distributed to multiple nodes in a distributed computing environment. At least one node can process the distributed execution model to identify an untrusted command. The node can use data associated with the untrusted command to identify one or more files associated with the untrusted command. Based on the files, the node can generate a data structure, and execute at least a portion of the data structure.
    Type: Application
    Filed: September 25, 2017
    Publication date: March 28, 2019
    Inventors: Arindam Bhattacharjee, Sourav Pal, Alexander Douglas James
  • Publication number: 20190095491
    Abstract: Systems and methods are disclosed for generating a distributed execution model with untrusted commands. The system can receive a query, and process the query to identify the untrusted commands. The system can use data associated with the untrusted command to identify one or more files associated with the untrusted command. Based on the files, the system can generate a data structure and include one or more identifiers associated with the data structure in the distributed execution model. The system can distribute the distributed execution model to one or more nodes in a distributed computing environment for execution.
    Type: Application
    Filed: September 25, 2017
    Publication date: March 28, 2019
    Inventors: Arindam Bhattacharjee, Sourav Pal, Alexander Douglas James
  • Publication number: 20190095493
    Abstract: In an environment where multiple datasets are to be combined, systems and methods are disclosed for allocating a group of data entries from at least one dataset into multiple partitions. For a particular partition, the subgroup in the partition can be combined with data entries from the other dataset. In some cases, groups of data entries from each dataset are assigned to different partitions. For a particular partition, a subgroup is duplicated, some of the data entries of the subgroup are reassigned to other partitions, the subgroup is reformed to include data entries from other partitions, and the reformed subgroup is combined with the subgroup from the other dataset(s).
    Type: Application
    Filed: September 25, 2017
    Publication date: March 28, 2019
    Inventors: Arindam Bhattacharjee, Sourav Pal, Christopher Pride
  • Publication number: 20190095494
    Abstract: Systems and methods are disclosed for processing and executing queries against one or more dataset. As part of processing the query, the system determines whether the query is susceptible to a significantly imbalanced partition. In the event, the query is susceptible to an imbalanced partition, the system monitors the query and determines whether to perform a multi-partitioning determination to avoid a significantly imbalanced partition.
    Type: Application
    Filed: September 25, 2017
    Publication date: March 28, 2019
    Inventors: Arindam Bhattacharjee, Sourav Pal, Christopher Pride
  • Publication number: 20180089306
    Abstract: Systems and methods for a data index and query system that utilize a query acceleration data store. An example method includes receiving a query identifying a set of data to be processed and a manner of processing the set of data. A query processing scheme for obtaining and processing the set of data is defined. First partial results of the query stored in a data store are identified, with the first partial results corresponding to a first portion of the set of data. One or more partitions are dynamically allocated to obtain a second portion of the set of data from different data sources. The second portion of the set of data is processed to obtain second partial results. The first partial results and second partial results are combined. The query is executed based on the query processing scheme.
    Type: Application
    Filed: July 31, 2017
    Publication date: March 29, 2018
    Inventors: Sourav Pal, Arindam Bhattacharjee, Asha Andrade