Patents by Inventor Ram Sriharsha

Ram Sriharsha has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

ANOMALY AND OUTLIER EXPLANATION GENERATION FOR DATA INGESTED TO A DATA INTAKE AND QUERY SYSTEM

Publication number: 20220358124

Abstract: Systems and methods are described for processing ingested data, detecting anomalies in the ingested data, and providing explanations of a possible cause of the detected anomalies as the data is being ingested. For example, a token or field in the ingested data may have an anomalous value. Tokens or fields from another portion of the ingested data can be extracted and analyzed to determine whether there is any correlation between the values of the extracted tokens or fields and the anomalous token or field having an anomalous value. If a correlation is detected, this information can be surfaced to a user.

Type: Application

Filed: July 27, 2022

Publication date: November 10, 2022

Inventor: Ram Sriharsha
Anomaly and outlier explanation generation for data ingested to a data intake and query system

Patent number: 11475024

Abstract: Systems and methods are described for processing ingested data, detecting anomalies in the ingested data, and providing explanations of a possible cause of the detected anomalies as the data is being ingested. For example, a token or field in the ingested data may have an anomalous value. Tokens or fields from another portion of the ingested data can be extracted and analyzed to determine whether there is any correlation between the values of the extracted tokens or fields and the anomalous token or field having an anomalous value. If a correlation is detected, this information can be surfaced to a user.

Type: Grant

Filed: January 31, 2020

Date of Patent: October 18, 2022

Assignee: Splunk Inc.

Inventor: Ram Sriharsha
Systems and methods for detecting DNS communications through time-to-live analyses

Patent number: 11477161

Abstract: A computerized method is disclosed that includes accessing domain name server (DNS) record data including a plurality of DNS records spanning a first time period, performing a time-to-live (TTL) analysis to determine a TTL run length distribution for the DNS record data, wherein the TTL analysis includes: generating a vector of the TTL values of each DNS record ordered sequentially in time, parsing the vector of the TTL values into segments, where a segment consists of one or more TTL values where a current TTL value is less than an immediately preceding TTL value, and determining the TTL run length distribution, determining whether DNS beaconing is present based on a result of the TTL analysis and in response to determining that DNS beaconing is present, generating an alert for a system administrator.

Type: Grant

Filed: October 29, 2021

Date of Patent: October 18, 2022

Assignee: SPLUNK Inc.

Inventors: Abhinav Mishra, Giovanni Mola, Ram Sriharsha, Zhaohui Wang
DATA FIELD EXTRACTION MODEL TRAINING FOR A DATA INTAKE AND QUERY SYSTEM

Publication number: 20220035775

Abstract: Systems and methods are described for training an artificial intelligence model to extract one or more data fields from a log. For example, the artificial intelligence model may be a neural network. The neural network may be trained using training data obtained by iterating through a plurality of logs using active learning, and selecting a subset of the logs in the plurality to be labeled by a user. For example, the selected subset of logs may be logs that are not similar to other logs already labeled by a user. The user may be prompted to label the selected subset of logs to identify one or more data fields to extract. Once the selected subset of logs are labeled, these labeled logs can be used as the training data to train the neural network.

Type: Application

Filed: July 31, 2020

Publication date: February 3, 2022

Inventors: Ram Sriharsha, Zhaohui Wang
LOG SOURCETYPE INFERENCE MODEL TRAINING FOR A DATA INTAKE AND QUERY SYSTEM

Publication number: 20220036002

Abstract: Systems and methods are described for training an artificial intelligence model to infer a log sourcetype of a log. For example, logs may have different log sourcetypes, and logs having the same log sourcetypes may have different messagetypes. The artificial intelligence model may be a machine learning model, and can be trained using training data that includes logs with known log sourcetypes. Each log can be tokenized, filtered, converted into a vector, and applied to a machine learning model as an input to perform the training. The machine learning model may output an inferred log sourcetype, which can be compared with the known log sourcetype to update model parameters to improve the machine learning model accuracy. The trained machine learning model may be trained to infer a log sourcetype of a log regardless of the messagetype of the log.

Type: Application

Filed: July 31, 2020

Publication date: February 3, 2022

Inventors: Ram Sriharsha, Zhaohui Wang
DATA FIELD EXTRACTION BY A DATA INTAKE AND QUERY SYSTEM

Publication number: 20220036177

Abstract: Systems and methods are described for extracting data fields from logs ingested in a data processing pipeline or otherwise stored. For example, a log can be applied as an input to an artificial intelligence model trained to infer a log sourcetype of logs, and the artificial intelligence model can output an inferred log sourcetype of the log. The inferred log sourcetype can be used to select another artificial intelligence model trained to extract data fields from logs having the inferred log sourcetype, and the log can then be applied as an input to the other artificial intelligence model. The other artificial intelligence model may then output one or more data fields extracted from the log.

Type: Application

Filed: July 31, 2020

Publication date: February 3, 2022

Inventors: Ram Sriharsha, Zhaohui Wang
ONLINE MACHINE LEARNING ALGORITHM FOR A DATA INTAKE AND QUERY SYSTEM

Publication number: 20210117857

Abstract: Systems and methods are described for processing ingested data using an online machine learning algorithm as the data is being ingested. For example, the online machine learning algorithm can be an adaptive thresholding algorithm used to identify outliers in a moving window of data. As another example, the online machine learning algorithm can be a sequential outlier detector that detects anomalous sequences of logs or events. As another example, the online machine learning algorithm can be a sentiment analyzer that determines whether text has a positive, negative, or neutral sentiment. As another example, the online machine learning algorithm can be a drift detector that detects whether ingested data marks the start of a change in the distribution of a time-series.

Type: Application

Filed: January 31, 2020

Publication date: April 22, 2021

Inventor: Ram Sriharsha
DATA INGESTION PIPELINE ANOMALY DETECTION

Publication number: 20210117232

Abstract: Systems and methods are described for processing ingested pipeline metrics and ingested logs in an asynchronous manner as the data is being ingested to explain anomalies detected in the pipeline metrics using the ingested logs. For example, one or more streaming data processors can convert data as the data is ingested into a comparable data structure, determine whether the comparable data structure should be assigned to an existing data pattern or a new data pattern, and determine whether the logs corresponding to the comparable data structure is anomalous. Separately, the streaming data processor(s) can perform an outlier detection on the pipeline metrics to detect outliers. The streaming data processor(s) can then window the anomalous logs and the pipeline metric outliers to surface explanations for the pipeline metric outliers using the anomalous logs.

Type: Application

Filed: October 31, 2019

Publication date: April 22, 2021

Inventors: Ram Sriharsha, Mark Huang, Abhinav Mishra, Harsha Wasalathanthrige Don
SAMPLING-BASED PREVIEW MODE FOR A DATA INTAKE AND QUERY SYSTEM

Publication number: 20210117382

Abstract: Systems and methods are described for providing a user interface through which a user can program operation of a data processing pipeline by specifying a graph of nodes that transform data and interconnections that designate routing of data between individual nodes within the graph. In response to a user request, a preview mode can be activated that causes the data processing pipeline to retrieve data from at least one source specified by the graph, transform the data according to the nodes of the graph, sample the transformed data, and display the sampling of the transformed data to at least one node without writing the transformed data to at least one destination specified by the graph.

Type: Application

Filed: January 31, 2020

Publication date: April 22, 2021

Inventor: Ram Sriharsha
ANOMALY DETECTION IN DATA INGESTED TO A DATA INTAKE AND QUERY SYSTEM

Publication number: 20210117416

Abstract: Systems and methods are described for processing ingested data in an asynchronous manner as the data is being ingested to detect potential anomalies. For example, one or more streaming data processors can convert data as the data is ingested into a comparable data structure, determine whether the comparable data structure should be assigned to an existing data pattern or a new data pattern, and optionally update a characteristic of the data pattern to which the comparable data structure is assigned. The streaming data processor(s) can perform these operations automatically in real-time or in periodic batches. Once one or more comparable data structures have been assigned to one or more data patterns, the streaming data processor(s) can analyze the comparable data structures assigned to a particular data pattern to determine whether any of the comparable data structures appear to be anomalous.

Type: Application

Filed: January 31, 2020

Publication date: April 22, 2021

Inventors: Ram Sriharsha, Kristal Lyn Curtis, Iryna Vogler-Ivashchanka, Clark Eugene Mullen
ANOMALY AND OUTLIER EXPLANATION GENERATION FOR DATA INGESTED TO A DATA INTAKE AND QUERY SYSTEM

Publication number: 20210117415

Abstract: Systems and methods are described for processing ingested data, detecting anomalies in the ingested data, and providing explanations of a possible cause of the detected anomalies as the data is being ingested. For example, a token or field in the ingested data may have an anomalous value. Tokens or fields from another portion of the ingested data can be extracted and analyzed to determine whether there is any correlation between the values of the extracted tokens or fields and the anomalous token or field having an anomalous value. If a correlation is detected, this information can be surfaced to a user.

Type: Application

Filed: January 31, 2020

Publication date: April 22, 2021

Inventor: Ram Sriharsha
SWAPPABLE ONLINE MACHINE LEARNING ALGORITHMS IMPLEMENTED IN A DATA INTAKE AND QUERY SYSTEM

Publication number: 20210117868

Abstract: Systems and methods are described for testing one or more machine learning algorithms in parallel with an existing machine learning algorithm implemented within a data processing pipeline. Each machine learning algorithm can train a machine learning model that receives a live stream of raw machine data. The output of the machine learning model trained by the existing machine learning algorithm may be written to an external storage system, but the output of the machine learning model(s) trained by the test machine learning algorithm(s) may not be written to an external storage system. After some time, performance of the test machine learning algorithm(s) and the existing machine learning algorithm is evaluated. If the test machine learning algorithm performs better than the existing machine learning algorithm, then the machine learning algorithms can be swapped without any downtime and without needed to re-train a machine learning model using previously seen raw machine data.

Type: Application

Filed: January 31, 2020

Publication date: April 22, 2021

Inventor: Ram Sriharsha
Spark satellite clusters to HADOOP data stores

Patent number: 10902464

Abstract: An advertising and data analysis platform may need to mine through vast amounts of data to come up with insights into advertising effectiveness, and measure and improve the effectiveness of advertising reach. Distributed network data analytics may be applied to ad matching/targeting, such that an in-memory cluster computing environment may be used with advertising data. For example, HADOOP may be utilized for distributed processing of the vast amounts of data and the HADOOP distributed file system (HDFS) is used for organizing communications and storage of that data. Satellite clusters or nodes may be generated that also utilize HDFS. For example, a SPARK or SHARK satellite cluster may be arranged to further utilize the HDFS of the HADOOP clusters.

Type: Grant

Filed: August 27, 2014

Date of Patent: January 26, 2021

Assignee: Verizon Media Inc.

Inventors: Ram Sriharsha, Tim Tully, Supreeth Rao, Reynold Xin
METHOD OF LOG SCANNING

Publication number: 20160019578

Abstract: Example methods, apparatuses, and/or articles of manufacture are disclosed that may be implemented, in whole or in part, using one or more computing devices to provide analysis of the distribution of overlaps of logs of values.

Type: Application

Filed: July 17, 2014

Publication date: January 21, 2016

Inventor: Ram Sriharsha
ELASTICITY OF ENGAGEMENT TO AD QUALITY

Publication number: 20150356595

Abstract: Described herein are solutions for determining quality of online ads and matching the ads to content so that the content is not devalued by the ads. Such solutions may also identify relationships between ads and their influence on user engagement with host content. The solutions may also define and provide the relationships to advertisers, in forms of historical scores and projected scores. The historical scores may include historical elasticity scores and the projected scores may include projected elasticity scores. The scores may be determined per ad and content pair. The solutions can use the scores to influence ad pricing.

Type: Application

Filed: June 5, 2014

Publication date: December 10, 2015

Applicant: YAHOO! INC.

Inventors: Ram Sriharsha, Supreeth Hosur Nagesh Rao
SPARK SATELLITE CLUSTERS TO HADOOP DATA STORES

Publication number: 20150066646

Abstract: An advertising and data analysis platform may need to mine through vast amounts of data to come up with insights into advertising effectiveness, and measure and improve the effectiveness of advertising reach. Distributed network data analytics may be applied to ad matching/targeting, such that an in-memory cluster computing environment may be used with advertising data. For example, HADOOP may be utilized for distributed processing of the vast amounts of data and the HADOOP distributed file system (HDFS) is used for organizing communications and storage of that data. Satellite clusters or nodes may be generated that also utilize HDFS. For example, a SPARK or SHARK satellite cluster may be arranged to further utilize the HDFS of the HADOOP clusters.

Type: Application

Filed: August 27, 2014

Publication date: March 5, 2015

Applicant: Yahoo! Inc.

Inventors: Ram Sriharsha, Tim Tully, Supreeth Rao, Reynold Xin

prev 1 2