Patents by Inventor Stephen Phillip Sorkin
Stephen Phillip Sorkin has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11947513Abstract: Methods and apparatus consistent with the invention provide the ability to organize, index, search, and present time series data based on searches. Time series data are sequences of time stamped records occurring in one or more usually continuous streams, representing some type of activity. In one embodiment, time series data is stored as discrete events time stamps. A search is received and relevant event information is retrieved based in whole or in part on the time stamp, a keyword indexing mechanism, or statistical indices calculated at the time of the search.Type: GrantFiled: December 21, 2022Date of Patent: April 2, 2024Assignee: Splunk Inc.Inventors: Michael Joseph Baum, R. David Carasso, Robin Kumar Das, Rory Greene, Bradley Hall, Nicholas Christian Mealy, Brian Philip Murphy, Stephen Phillip Sorkin, Andre David Stechert, Erik M. Swan
-
Patent number: 11940967Abstract: Embodiments are directed towards a method for searching data. The method comprises providing an inverted index that comprises at least one record, wherein the at least one record comprises at least one field name and a corresponding at least one field value. The at least one field name and corresponding value are extracted from time-stamped searchable events that are stored in a field searchable datastore and comprise portions of raw data. The at least one record further comprises a posting value that identifies a location in the field searchable datastore where an event associated with the at least one record is stored. The method further comprises evaluating an incoming search query that references a field name. Responsive to the evaluating, the method comprises determining results for the incoming search query by executing the incoming search query across the field searchable datastore or the inverted index.Type: GrantFiled: June 30, 2021Date of Patent: March 26, 2024Assignee: SPLUNK INC.Inventors: David Ryan Marquardt, Mitchell Neuman Blank, Jr., Stephen Phillip Sorkin
-
Patent number: 11914562Abstract: A method and system for managing searches of a data set that is partitioned based on a plurality of events. A structure of a search query may be analyzed to determine if logical computational actions performed on the data set is reducible. Data in each partition is analyzed to determine if at least a portion of the data in the partition is reducible. In response to a subsequent or reoccurring search request, intermediate summaries of reducible data and reducible search computations may be aggregated for each partition. Next, a search result may be generated based on at least one of the aggregated intermediate summaries, the aggregated reducible search computations, and a query of adhoc non-reducible data arranged in at least one of the plurality of partitions for the data set.Type: GrantFiled: February 8, 2023Date of Patent: February 27, 2024Assignee: SPLUNK INC.Inventors: Ledion Bitincka, Stephen Phillip Sorkin, Steve Yu Zhang
-
Patent number: 11914552Abstract: Embodiments are directed towards managing and tracking item identification of a plurality of items to determine if an item is a new or existing item, where an existing item has been previously processed. In some embodiments, two or more item identifiers may be generated. In one embodiment, generating the two or more item identifiers may include analyzing the item using a small item size characteristic, a compressed item, or for an identifier collision. The two or more item identifiers may be employed to determine if the item is a new or existing item. In one embodiment, the two or more item identifiers may be compared to a record about an existing item to determine if the item is a new or existing item. If the item is an existing item, then the item may be further processed to determine if the existing item has actually changed.Type: GrantFiled: January 26, 2023Date of Patent: February 27, 2024Assignee: Splunk Inc.Inventors: Amritpal Singh Bath, Mitchell Neuman Blank, Jr., Vishal Patel, Stephen Phillip Sorkin
-
Patent number: 11893010Abstract: Embodiments include generating data models that may give semantic meaning for unstructured or structured data that may include data generated and/or received by search engines, including a time series engine. A method includes generating a data model for data stored in a repository. Generating the data model includes generating an initial query string, executing the initial query string on the data, generating an initial result set based on the initial query string being executed on the data, determining one or more candidate fields from one or results of the initial result set, generating a candidate data model based on the one or more candidate fields, iteratively modifying the candidate data model until the candidate data model models the data, and using the candidate data model as the data model.Type: GrantFiled: May 2, 2022Date of Patent: February 6, 2024Assignee: SPLUNK INC.Inventors: Alice Emily Neels, Archana Sulochana Ganapathi, Marc Vincent Robichaud, Stephen Phillip Sorkin, Steve Yu Zhang
-
Patent number: 11886502Abstract: Embodiments are directed towards previewing results generated from indexing data raw data before the corresponding index data is added to an index store. Raw data may be received from a preview data source. After an initial set of configuration information may be established, the preview data may be submitted to an index processing pipeline. A previewing application may generate preview results based on the preview index data and the configuration information. The preview results may enable previewing how the data is being processed by the indexing application. If the preview results are not acceptable, the configuration information may be modified. The preview application enables modification of the configuration information until the generated preview results may be acceptable. If the configuration information is acceptable, the preview data may be processed and indexed in one or more index stores.Type: GrantFiled: May 8, 2020Date of Patent: January 30, 2024Assignee: Splunk Inc.Inventors: Mitchell Neuman Blank, Jr., Leonid Budchenko, David Carasso, Micah James Delfino, Johnvey Hwang, Stephen Phillip Sorkin, Eric Timothy Woo
-
Patent number: 11860881Abstract: A method, system, and processor-readable storage medium are directed towards generating a report derived from data, such as event data, stored on a plurality of distributed nodes. In one embodiment the analysis is generated using a “divide and conquer” algorithm, such that each distributed node analyzes locally stored event data while an aggregating node combines these analysis results to generate the report. In one embodiment, each distributed node also transmits a list of event data references associated with the analysis result to the aggregating node. The aggregating node may then generate a global ordered list of data references based on the list of event data references received from each distributed node. Subsequently, in response to a user selection of a range of global event data, the report may dynamically retrieve event data from one or more distributed nodes for display according to the global order.Type: GrantFiled: November 15, 2021Date of Patent: January 2, 2024Assignee: Splunk Inc.Inventors: Steve Yu Zhang, Stephen Phillip Sorkin
-
Publication number: 20230205791Abstract: Methods and apparatus consistent with the invention provide the ability to organize and build understandings of machine data generated by a variety of information-processing environments. Machine data is a product of information-processing systems (e.g., activity logs, configuration files, messages, database records) and represents the evidence of particular events that have taken place and been recorded in raw data format. In one embodiment, machine data is turned into a machine data web by organizing machine data into events and then linking events together.Type: ApplicationFiled: March 3, 2023Publication date: June 29, 2023Inventors: Michael Joseph Baum, R. David Carasso, Robin Kumar Das, Bradley Hall, Brian Philip Murphy, Stephen Phillip Sorkin, Andre David Stechert, Erik M. Swan, Rory Greene, Nicholas Christian Mealy, Christina Frances Regina Noren
-
Publication number: 20230205749Abstract: Methods and apparatus consistent with the invention provide the ability to organize, index, search, and present time series data based on searches. Time series data are sequences of time stamped records occurring in one or more usually continuous streams, representing some type of activity. In one embodiment, time series data is stored as discrete events time stamps. A search is received and relevant event information is retrieved based in whole or in part on the time stamp, a keyword indexing mechanism, or statistical indices calculated at the time of the search.Type: ApplicationFiled: December 21, 2022Publication date: June 29, 2023Inventors: Michael Joseph Baum, R. David Carasso, Robin Kumar Das, Rory Greene, Bradley Hall, Nicholas Christian Mealy, Brian Philip Murphy, Stephen Phillip Sorkin, Andre David Stechert, Erik M. Swan
-
Patent number: 11663244Abstract: Methods and apparatus consistent with the invention provide the ability to organize and build understandings of machine data generated by a variety of information-processing environments. Machine data is a product of information-processing systems (e.g., activity logs, configuration files, messages, database records) and represents the evidence of particular events that have taken place and been recorded in raw data format. In one embodiment, machine data is turned into a machine data web by organizing machine data into events and then linking events together.Type: GrantFiled: September 20, 2021Date of Patent: May 30, 2023Assignee: Splunk Inc.Inventors: Michael Joseph Baum, R. David Carasso, Robin Kumar Das, Bradley Hall, Brian Philip Murphy, Stephen Phillip Sorkin, Andre David Stechert, Erik M. Swan, Rory Greene, Nicholas Christian Mealy, Christina Frances Regina Noren
-
Patent number: 11604779Abstract: A method and system for managing searches of a data set that is partitioned based on a plurality of events. A structure of a search query may be analyzed to determine if logical computational actions performed on the data set is reducible. Data in each partition is analyzed to determine if at least a portion of the data in the partition is reducible. In response to a subsequent or reoccurring search request, intermediate summaries of reducible data and reducible search computations may be aggregated for each partition. Next, a search result may be generated based on at least one of the aggregated intermediate summaries, the aggregated reducible search computations, and a query of adhoc non-reducible data arranged in at least one of the plurality of partitions for the data set.Type: GrantFiled: May 10, 2021Date of Patent: March 14, 2023Assignee: SPLUNK INC.Inventors: Ledion Bitincka, Stephen Phillip Sorkin, Steve Yu Zhang
-
Patent number: 11599547Abstract: A method of data replication in a clustered computing environment comprises receiving, at a selected indexer within a plurality of indexers in a cluster, data from a forwarder indexer, wherein the selected indexer is designated as a primary indexer for the data, wherein the primary indexer has primary responsibility for responding to search queries pertaining to the data, wherein the cluster comprises a plurality of sites. The method further comprises receiving, at the selected indexer, data replication instructions, wherein the data replication instructions comprise a number of other indexers in the cluster for storing a replicated copy of the data and further comprise a number of sites from the plurality of sites across which to store a replicated copy of the data determined in accordance with a site replication factor.Type: GrantFiled: April 14, 2021Date of Patent: March 7, 2023Assignee: SPLUNK INC.Inventors: Vishal Patel, Mitchell Neuman Blank, Jr., Sundar Renegarajan Vasan, Stephen Phillip Sorkin
-
Patent number: 11599400Abstract: Methods and apparatus consistent with the invention provide the ability to organize and build understandings of machine data generated by a variety of information-processing environments. Machine data is a product of information-processing systems (e.g., activity logs, configuration files, messages, database records) and represents the evidence of particular events that have taken place and been recorded in raw data format. In one embodiment, machine data is turned into a machine data web by organizing machine data into events and then linking events together.Type: GrantFiled: September 10, 2021Date of Patent: March 7, 2023Assignee: Splunk Inc.Inventors: Michael Joseph Baum, R. David Carasso, Robin Kumar Das, Bradley Hall, Brian Philip Murphy, Stephen Phillip Sorkin, Andre David Stechert, Erik M. Swan, Rory Greene, Nicholas Christian Mealy, Christina Frances Regina Noren
-
Patent number: 11580071Abstract: Embodiments are directed towards managing and tracking item identification of a plurality of items to determine if an item is a new or existing item, where an existing item has been previously processed. In some embodiments, two or more item identifiers may be generated. In one embodiment, generating the two or more item identifiers may include analyzing the item using a small item size characteristic, a compressed item, or for an identifier collision. The two or more item identifiers may be employed to determine if the item is a new or existing item. In one embodiment, the two or more item identifiers may be compared to a record about an existing item to determine if the item is a new or existing item. If the item is an existing item, then the item may be further processed to determine if the existing item has actually changed.Type: GrantFiled: October 26, 2020Date of Patent: February 14, 2023Assignee: Splunk Inc.Inventors: Amritpal Singh Bath, Mitchell Neuman Blank, Jr., Vishal Patel, Stephen Phillip Sorkin
-
Patent number: 11561952Abstract: Methods and apparatus consistent with the invention provide the ability to organize, index, search, and present time series data based on searches. Time series data are sequences of time stamped records occurring in one or more usually continuous streams, representing some type of activity. In one embodiment, time series data is stored as discrete events time stamps. A search is received and relevant event information is retrieved based in whole or in part on the time stamp, a keyword indexing mechanism, or statistical indices calculated at the time of the search.Type: GrantFiled: December 17, 2020Date of Patent: January 24, 2023Assignee: Splunk Inc.Inventors: Michael Joseph Baum, R. David Carasso, Robin Kumar Das, Rory Greene, Bradley Hall, Nicholas Christian Mealy, Brian Philip Murphy, Stephen Phillip Sorkin, Andre David Stechert, Erik M. Swan
-
Patent number: 11550772Abstract: Methods and apparatus consistent with the invention provide the ability to organize, index, search, and present time series data based on searches. Time series data are sequences of time stamped records occurring in one or more usually continuous streams, representing some type of activity. In one embodiment, time series data is stored as discrete events time stamps. A search is received and relevant event information is retrieved based in whole or in part on the time stamp, a keyword indexing mechanism, or statistical indices calculated at the time of the search.Type: GrantFiled: April 29, 2021Date of Patent: January 10, 2023Assignee: Splunk Inc.Inventors: Michael Joseph Baum, R. David Carasso, Robin Kumar Das, Rory Greene, Bradley Hall, Nicholas Christian Mealy, Brian Philip Murphy, Stephen Phillip Sorkin, Andre David Stechert, Erik M. Swan
-
Patent number: 11537585Abstract: Methods and apparatus consistent with the invention provide the ability to organize, index, search, and present time series data based on searches. Time series data are sequences of time stamped records occurring in one or more usually continuous streams, representing some type of activity. In one embodiment, time series data is stored as discrete events time stamps. A search is received and relevant event information is retrieved based in whole or in part on the time stamp, a keyword indexing mechanism, or statistical indices calculated at the time of the search.Type: GrantFiled: April 29, 2021Date of Patent: December 27, 2022Assignee: Splunk Inc.Inventors: Michael Joseph Baum, R. David Carasso, Robin Kumar Das, Rory Greene, Bradley Hall, Nicholas Christian Mealy, Brian Philip Murphy, Stephen Phillip Sorkin, Andre David Stechert, Erik M. Swan
-
Patent number: 11526482Abstract: Methods and apparatus are disclosed to automatically timestamp events within streaming machine data. The streaming machine data is broken into a set of events using breaking rules. Each event can be analyzed by iterating over own time stamp format patterns from a list of known time stamp format patterns to determine whether a matching pattern exists in the event. When an individual event broken out from the streaming machine data includes time information according to at least one known time stamp format pattern of the list of known time stamp format patterns, a timestamp can be created for the event by extracting a time value from event ng the matching pattern determined to exist in the event.Type: GrantFiled: January 31, 2022Date of Patent: December 13, 2022Assignee: Splunk Inc.Inventors: Michael Joseph Baum, R. David Carasso, Robin Kumar Das, Rory Greene, Bradley Hall, Nicholas Christian Mealy, Brian Philip Murphy, Stephen Phillip Sorkin, Andre David Stechert, Erik M. Swan
-
Publication number: 20220156244Abstract: Methods and apparatus are disclosed to automatically timestamp events within streaming machine data. The streaming machine data is broken into a set of events using breaking rules. Each event can be analyzed by iterating over own time stamp format patterns from a list of known time stamp format patterns to determine whether a matching pattern exists in the event. When an individual event broken out from the streaming machine data includes time information according to at least one known time stamp format pattern of the list of known time stamp format patterns, a timestamp can be created for the event by extracting a time value from event ng the matching pattern determined to exist in the event.Type: ApplicationFiled: January 31, 2022Publication date: May 19, 2022Inventors: Michael Joseph Baum, R. David Carasso, Robin Kumar Das, Rory Greene, Bradley Hall, Nicholas Christian Mealy, Brian Philip Murphy, Stephen Phillip Sorkin, Andre David Stechert, Erik M. Swan
-
Patent number: 11321311Abstract: Embodiments include generating data models that may give semantic meaning for unstructured or structured data that may include data generated and/or received by search engines, including a time series engine. A method includes generating a data model for data stored in a repository. Generating the data model includes generating an initial query string, executing the initial query string on the data, generating an initial result set based on the initial query string being executed on the data, determining one or more candidate fields from one or results of the initial result set, generating a candidate data model based on the one or more candidate fields, iteratively modifying the candidate data model until the candidate data model models the data, and using the candidate data model as the data model.Type: GrantFiled: November 29, 2018Date of Patent: May 3, 2022Assignee: SPLUNK INC.Inventors: Alice Emily Neels, Archana Sulochana Ganapathi, Marc Vincent Robichaud, Stephen Phillip Sorkin, Steve Yu Zhang