Patents by Inventor Ramesh Natarajan

Ramesh Natarajan has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

System and architecture for enterprise-scale, parallel data mining

Publication number: 20070174290

Abstract: A grid-based approach for enterprise-scale data mining that leverages database technology for I/O parallelism and on-demand compute servers for compute parallelism in the statistical computations is described. By enterprise-scale, we mean the highly-automated use of data mining in vertical business applications, where the data is stored on one or more relational database systems, and where a distributed architecture comprising of high-performance compute servers or a network of low-cost, commodity processors, is used to improve application performance, provide better quality data mining models, and for overall workload management. The approach relies on an algorithmic decomposition of the data mining kernel on the data and compute grids, which provides a simple way to exploit the parallelism on the respective grids, while minimizing the data transfer between them.

Type: Application

Filed: January 19, 2006

Publication date: July 26, 2007

Applicant: International Business Machines Corporation

Inventors: Inderpal Narang, Ramesh Natarajan, Radu Sioh
Method for ensemble predictive modeling by multiplicative adjustment of class probability: APM (adjusted probability model)

Patent number: 7020593

Abstract: A new method is used to model the class probability from data that is based on a novel multiplicative adjustment of the class probability by a plurality of items of evidence induced from training data. The optimal adjustment factors from each item of evidence can be determined by several techniques, a preferred embodiment thereof being the method of maximum likelihood. The evidence induced from the data can be any function of the feature variables, the simplest of which are the individual feature variables themselves. The adjustment factor of an item of evidence Ej is given by the ratio of the conditional probability P(C|Ej) of the class C given Ej to the prior class probability P(C), exponentiated by a parameter aj. The method provides a new and useful way to aggregate probabilistic evidence so that the final model output exhibits a low error rate for classification, and also gives a superior lift curve when distinguishing between any one class and the remaining classes.

Type: Grant

Filed: December 4, 2002

Date of Patent: March 28, 2006

Assignee: International Business Machines Corporation

Inventors: Se June Hong, Jonathan R. Hosking, Ramesh Natarajan
Method for ensemble predictive modeling by multiplicative adjustment of class probability: APM (adjusted probability model)

Publication number: 20040111169

Abstract: A new method is used to model the class probability from data that is based on a novel multiplicative adjustment of the class probability by a plurality of items of evidence induced from training data. The optimal adjustment factors from each item of evidence can be determined by several techniques, a preferred embodiment thereof being the method of maximum likelihood. The evidence induced from the data can be any function of the feature variables, the simplest of which are the individual feature variables themselves. The adjustment factor of an item of evidence E1 is given by the ratio of the conditional probability P(C|E1) of the class C given E1 to the prior class probability P(C), exponentiated by a parameter a1. The method provides a new and useful way to aggregate probabilistic evidence so that the final model Output exhibits a low error rate for classification, and also gives a superior lift curve when distinguishing between any one class and the remaining classes.

Type: Application

Filed: December 4, 2002

Publication date: June 10, 2004

Inventors: Se June Hong, Jonathan R. Hosking, Ramesh Natarajan
Method for constructing segmentation-based predictive models

Publication number: 20030176931

Abstract: The present invention generally relates to computer databases and, more particularly, to data mining and knowledge discovery. The invention specifically relates to a method for constructing segmentation-based predictive models, such as decision-tree classifiers, wherein data records are partitioned into a plurality of segments and separate predictive models are constructed for each segment. The present invention contemplates a computerized method for automatically building segmentation-based predictive models that substantially improves upon the modeling capabilities of decision trees and related technologies, and that automatically produces models that are competitive with, if not better than, those produced by data analysts and applied statisticians using traditional, labor-intensive statistical techniques. The invention achieves these properties by performing segmentation and multivariate statistical modeling within each segment simultaneously.

Type: Application

Filed: March 11, 2002

Publication date: September 18, 2003

Applicant: International Business Machines Corporation

Inventors: Edwin Peter Dawson Pednault, Ramesh Natarajan
Fast method for renewal and associated recommendations for market basket items

Publication number: 20020143613

Abstract: When a customer is in the process of filling a market basket for purchase on an Internet commerce site, a method makes prioritized recommendation of items so as to maximize the likelihood that the customer will add to the basket those items that are in the list with higher priorities. The method separately considers in turn preferences due to a current set of items in the market basket and also preferences due to a new choice independent of what is in the market basket. In this way, the method recognizes that not all items in the market basket are selected because of their affinity with some other item already in the basket. The two preferences are estimated separately from training data and combined in proper proportions to obtain an overall preference for item not yet in the market basket.

Type: Application

Filed: February 5, 2001

Publication date: October 3, 2002

Inventors: Se June Hong, Ramesh Natarajan, Ilana Belitskaya
Using simulated pseudo data to speed up statistical predictive modeling from massive data sets

Patent number: 6388592

Abstract: The computational cost of many statistical modeling algorithms is affected by the input/output (I/O) cost of accessing out-of-core training data. This is an important challenge for emerging data mining applications, where the amount of training data can be potentially enormous. A heuristic approach to this problem is described. This approach is based on constructing a simple probability model from the large training data set, and using this model to generate simulated pseudo data for some aspects of the statistical modeling procedure. This approach is illustrated in the context of building a Naive Bayes probability model with feature selection. Here, the usual algorithms would require numerous data scans over the massive training data set, but our heuristic obtains models of comparable accuracy with just two data scans.

Type: Grant

Filed: January 18, 2001

Date of Patent: May 14, 2002

Assignee: International Business Machines Corporation

Inventor: Ramesh Natarajan

prev 1 2 3 4

System and architecture for enterprise-scale, parallel data mining

Method for ensemble predictive modeling by multiplicative adjustment of class probability: APM (adjusted probability model)

Method for ensemble predictive modeling by multiplicative adjustment of class probability: APM (adjusted probability model)

Method for constructing segmentation-based predictive models

Fast method for renewal and associated recommendations for market basket items

Using simulated pseudo data to speed up statistical predictive modeling from massive data sets