Patents by Inventor Konstantinos (Constantin) F. Aliferis

Konstantinos (Constantin) F. Aliferis has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Data analysis computer system and method for fast discovery of multiple Markov boundaries

Patent number: 10303737

Abstract: Methods for discovery of a Markov boundary from data constitute one of the most important recent developments in pattern recognition and applied data analysis and modeling, primarily because they offer a principled solution to the variable/feature selection problem and give insight about local causal structure. Even though there is always a single Markov boundary of the response variable in faithful distributions, distributions with violations of the intersection property of probability theory may have multiple Markov boundaries. Such distributions are abundant in practical data-analytic applications, and there are several reasons why it is important to discover and extract all Markov boundaries from such data as a critical step of data analysis. The present invention is a novel fast generative method (termed Generalized-iTIE*) that can discover all Markov boundaries from a sample drawn from a distribution.

Type: Grant

Filed: March 17, 2014

Date of Patent: May 28, 2019

Inventors: Alexander Statnikov, Konstantinos (Constantin) F. Aliferis
Document coding computer system and method with integrated quality assurance

Patent number: 10296850

Abstract: The present invention consists of a computer-implemented system and method for automatically analyzing and coding documents into content categories suitable for high cost, high yield settings where quality and efficiency of classification are essential. A prototypical example application field is legal document predictive coding for purposes of e-discovery and litigation (or litigation readiness) where the automated classification of documents as “responsive” or not must be (a) efficient, (b) accurate, and (c) defensible in court. Many text classification technologies exist but they focus on the relatively simple steps of using a training method on training data, producing a model and testing it on test data. They invariably do not address effectively and simultaneously key quality assurance requirements. The invention applies several data design and validation steps that ensure quality and removal of all possible sources of document classification error or deficiencies.

Type: Grant

Filed: March 17, 2014

Date of Patent: May 21, 2019

Inventors: Konstantinos (Constantin) F. Aliferis, Yin Aphinyanaphongs, Alexander Statnikov, Lawrence Fu
Data analysis computer system and method employing local to global causal discovery

Patent number: 10289751

Abstract: Discovery of causal networks is essential for understanding and manipulating complex systems in numerous data analysis application domains. Several methods have been proposed in the last two decades for solving this problem. The inventive method uses local causal discovery methods for global causal network learning in a divide-and-conquer fashion. The usefulness of the invention is demonstrated in data capturing characteristics of several domains. The inventive method outputs more accurate networks compared to other discovery approaches.

Type: Grant

Filed: March 17, 2014

Date of Patent: May 14, 2019

Inventors: Konstantinos (Constantin) F. Aliferis, Alexander Statnikov
Data analysis computer system and method for conversion of predictive models to equivalent ones

Patent number: 9858533

Abstract: The present invention addresses two ubiquitous and pressing problems of modern data analytics technology. Many modern pattern recognition technologies produce models with excellent predictivity but (a) they are “black boxes”, that is they are opaque to the user; (b) they are too large, and/or expensive to execute in less powerful computing platforms. The invention “opens up” a black box model by converting it to a compact and understandable model that is functionally equivalent. The invention also converts a predictive model into a functionally equivalent model into a form that can be implemented and deployed more easily or efficiently in practice. The benefits include: model understandability and defensibility of modeling. A particularly interesting application is that of understanding the decision making of humans, comparison of the behavior of a human or computerized decision process against another and use to enhance education and guideline compliance/adherence detection and improvement.

Type: Grant

Filed: March 17, 2014

Date of Patent: January 2, 2018

Inventors: Konstantinos (Constantin) F. Aliferis, Alexander Statnikov, Lawrence Fu, Yin Aphinyanaphongs
Data analysis computer system and method for organizing, presenting, and optimizing predictive modeling

Patent number: 9772741

Abstract: Predictive modeling is an important class of data analytics with applications in numerous fields. Once a predictive model is built, validated, and applied on a set of objects, by a data analytics system (or even by manual modeling), consumers of the model information need assistance to navigate through the results. This is because both regression and classification models that output continuous values (eg, probability of belonging to a class) are often used to rank objects and then a thresholding of the ranked scores needs to be used to separate objects into a “positive” and a “negative” class. The choice of threshold greatly affects the true positive, false positive, true negative, and false negative results of the model's application. An ideal data analytics system should allow the user to understand the tradeoffs of different threshold values for different thresholds.

Type: Grant

Filed: March 17, 2014

Date of Patent: September 26, 2017

Inventors: Konstantinos (Constantin) F. Aliferis, Yin Aphinyanaphongs, Lawrence Fu, Alexander Statnikov
Data analysis computer system and method for parallelized and modularized analysis of big data

Patent number: 9720940

Abstract: The focus of the present invention is the modular analysis of Big Data encompassing parallelization, chunking, and distributed analysis applications.

Type: Grant

Filed: March 17, 2014

Date of Patent: August 1, 2017

Inventors: Konstantinos (Constantin) F. Aliferis, Alexander Statnikov
Data Analysis Computer System and Method For Fast Discovery Of Multiple Markov Boundaries

Publication number: 20140324752

Abstract: Methods for discovery of a Markov boundary from data constitute one of the most important recent developments in pattern recognition and applied data analysis and modeling, primarily because they offer a principled solution to the variable/feature selection problem and give insight about local causal structure. Even though there is always a single Markov boundary of the response variable in faithful distributions, distributions with violations of the intersection property of probability theory may have multiple Markov boundaries. Such distributions are abundant in practical data-analytic applications, and there are several reasons why it is important to discover and extract all Markov boundaries from such data as a critical step of data analysis. The present invention is a novel fast generative method (termed Generalized-iTIE*) that can discover all Markov boundaries from a sample drawn from a distribution.

Type: Application

Filed: March 17, 2014

Publication date: October 30, 2014

Inventors: Alexander Statnikov, Konstantinos (Constantin) F. Aliferis
Data Analysis Computer System and Method For Causal Discovery with Experimentation Optimization

Publication number: 20140289174

Abstract: Discovery of causal models via experimentation is essential in numerous applications fields. One of the primary objectives of the invention is to minimize the use of costly experimental resources while achieving high discovery accuracy. The invention provides new methods and processes to enable accurate discovery of local causal pathways by integrating high-throughput observational data with efficient experimentation strategies. At the core of these methods are computational causal discovery techniques that account for multiplicity (i.e., indistinguishability) of causal pathways consistent with observational data. The invention, when applied for discovery of local causal pathways from a combination of observational and experimental data, achieves higher discovery accuracy than existing observational approaches and uses fewer experimental resources than existing experimental approaches. Repeated application of the invention for each variable in the modeled system produces the full causal model.

Type: Application

Filed: March 17, 2014

Publication date: September 25, 2014

Inventors: Alexander Statnikov, Konstantinos (Constantin) F. Aliferis
Data Analysis Computer System and Method Employing Local to Global Causal Discovery

Publication number: 20140280361

Abstract: Discovery of causal networks is essential for understanding and manipulating complex systems in numerous data analysis application domains. Several methods have been proposed in the last two decades for solving this problem. The inventive method uses local causal discovery methods for global causal network learning in a divide-and-conquer fashion. The usefulness of the invention is demonstrated in data capturing characteristics of several domains. The inventive method outputs more accurate networks compared to other discovery approaches.

Type: Application

Filed: March 17, 2014

Publication date: September 18, 2014

Inventors: Konstantinos (Constantin) F. Aliferis, Alexander Statnikov
Computer System and Method That Determines Sample Size and Power Required For Complex Predictive and Causal Data Analysis

Publication number: 20140278339

Abstract: Established methods for statistical “power-size” analysis for statistical modeling are geared toward statistical hypothesis testing, and have serious shortcomings in modern complex predictive and causal modeling applications where the determination of sample size is affected by parameters not addressed by the standard statistical power-size analysis. The present invention provides a method and computer-implemented system for determining sufficient sample size for training predictive or causal models for a given application field or distribution type and desired performance level taking into account the critical factors that affect the needed sample size. The invention can be applied to practically any field where predictive modeling or causal modeling are desired.

Type: Application

Filed: March 17, 2014

Publication date: September 18, 2014

Inventors: Konstantinos (Constantin) F. Aliferis, Lawrence Fu, Alexander Statnikov, Yin Aphinyanaphongs
Data Analysis Computer System and Method For Parallelized and Modularized Analysis of Big Data

Publication number: 20140280257

Abstract: The focus of the present invention is the modular analysis of Big Data encompassing parallelization, chunking, and distributed analysis applications.

Type: Application

Filed: March 17, 2014

Publication date: September 18, 2014

Inventors: Konstantinos (Constantin) F. Aliferis, Alexander Statnikov
Data Analysis Computer System and Method For Conversion Of Predictive Models To Equivalent Ones

Publication number: 20140279760

Abstract: The present invention addresses two ubiquitous and pressing problems of modern data analytics technology. Many modern pattern recognition technologies produce models with excellent predictivity but (a) they are “black boxes”, that is they are opaque to the user; (b) they are too large, and/or expensive to execute in less powerful computing platforms. The invention “opens up” a black box model by converting it to a compact and understandable model that is functionally equivalent. The invention also converts a predictive model into a functionally equivalent model into a form that can be implemented and deployed more easily or efficiently in practice. The benefits include: model understandability and defensibility of modeling. A particularly interesting application is that of understanding the decision making of humans, comparison of the behavior of a human or computerized decision process against another and use to enhance education and guideline compliance/adherence detection and improvement.

Type: Application

Filed: March 17, 2014

Publication date: September 18, 2014

Inventors: Konstantinos (Constantin) F. Aliferis, Alexander Satnikov, Lawrence Fu, Yin Aphinyanaphongs
Data Analysis Computer System and Method for Organizing, Presenting, and Optimizing Predictive Modeling

Publication number: 20140279794

Abstract: Predictive modeling is an important class of data analytics with applications in numerous fields. Once a predictive model is built, validated, and applied on a set of objects, by a data analytics system (or even by manual modeling), consumers of the model information need assistance to navigate through the results. This is because both regression and classification models that output continuous values (eg, probability of belonging to a class) are often used to rank objects and then a thresholding of the ranked scores needs to be used to separate objects into a “positive” and a “negative” class. The choice of threshold greatly affects the true positive, false positive, true negative, and false negative results of the model's application. An ideal data analytics system should allow the user to understand the tradeoffs of different threshold values for different thresholds.

Type: Application

Filed: March 17, 2014

Publication date: September 18, 2014

Inventors: Konstantinos (Constantin) F. Aliferis, Yin Aphinyanaphongs, Lawrence Fu, Alexander Statnikov
Document Coding Computer System and Method With Integrated Quality Assurance

Publication number: 20140279761

Abstract: The present invention consists of a computer-implemented system and method for automatically analyzing and coding documents into content categories suitable for high cost, high yield settings where quality and efficiency of classification are essential. A prototypical example application field is legal document predictive coding for purposes of e-discovery and litigation (or litigation readiness) where the automated classification of documents as “responsive” or not must be (a) efficient, (b) accurate, and (c) defensible in court. Many text classification technologies exist but they focus on the relatively simple steps of using a training method on training data, producing a model and testing it on test data. They invariably do not address effectively and simultaneously key quality assurance requirements. The invention applies several data design and validation steps that ensure quality and removal of all possible sources of document classification error or deficiencies.

Type: Application

Filed: March 17, 2014

Publication date: September 18, 2014

Inventors: Konstantinos (Constantin) F. Aliferis, Yin Aphinyanaphongs, Alexander Statnikov, Lawrence Fu
Computer implemented method for the automatic classification of instrumental citations

Patent number: 8832002

Abstract: The learning method taught in this patent document is significantly different from previous methods for automatic classification of citations that are labor intensive and subject to human bias and error. The present invention automatically generates and avoids these limitations. A set of operational definitions and features uniquely suited to the scientific literature is disclosed along with their use with a learning method that is capable of analyzing the textual content of articles along with bibliometric data to accurately classify instrumental citations.

Type: Grant

Filed: November 6, 2009

Date of Patent: September 9, 2014

Inventors: Lawrence Fu, Konstantinos (Constantin) F. Aliferis
Computer implemented method for determining all markov boundaries and its application for discovering multiple maximally accurate and non-redundant predictive models

Patent number: 8805761

Abstract: Methods for discovery of a Markov boundary from data constitute one of the most important recent developments in pattern recognition and applied statistics, primarily because they offer a principled solution to the variable/feature selection problem and give insight about local causal structure. Even though there is always a single Markov boundary of the response variable in faithful distributions, distributions with violations of the intersection property of probability theory may have multiple Markov boundaries. Such distributions are abundant in practical data-analytic applications, and there are several reasons why it is important to discover all Markov boundaries from such data. The present invention is a novel computer implemented generative method (termed TIE*) that can discover all Markov boundaries from a data sample drawn from a distribution. TIE* can be instantiated to discover all and only Markov boundaries independent of data distribution.

Type: Grant

Filed: October 30, 2009

Date of Patent: August 12, 2014

Inventors: Alexander Statnikov, Konstantinos (Constantin) F. Aliferis
Local causal and Markov blanket induction method for causal discovery and feature selection from data

Patent number: 8655821

Abstract: Methods for discovery of local causes/effects and of Markov blankets enable discovery of causal relationships from large data sets and provide principled solutions to the variable/feature selection problem, an integral part of predictive modeling. The present invention provides a generative method for learning local causal structure around target variables of interest in the form of direct causes/effects and Markov blankets applicable to very large real world datasets even with small samples. The selected feature sets can be used for causal discovery, classification, and regression. The generative method GLL can be instantiated in many ways giving rise to novel method variants. The method transforms a dataset with many variables into either a minimal reduced dataset where all variables are needed for optimal prediction of the response variable, or a dataset where all variables are direct causes and direct effects or the Markov blanket of the response variable.

Type: Grant

Filed: February 4, 2010

Date of Patent: February 18, 2014

Inventors: Konstantinos (Constantin) F. Aliferis, Alexander Statnikov
Local Causal and Markov Blanket Induction Method for Causal Discovery and Feature Selection from Data

Publication number: 20110307437

Abstract: In many areas, recent developments have generated very large datasets from which it is desired to extract meaningful relationships between the dataset elements. However, to date, the finding of such relationships using prior art methods has proved extremely difficult especially in the biomedical arts. Methods for local causal learning and Markov blanket discovery are important recent developments in pattern recognition and applied statistics, primarily because they offer a principled solution to the variable/feature selection problem and give insight about local causal structure. The present invention provides a generative method for learning local causal structure around target variables of interest in the form of direct causes/effects and Markov blankets applicable to very large datasets and relatively small samples. The method is readily applicable to real-world data, and the selected feature sets can be used for causal discovery and classification.

Type: Application

Filed: February 4, 2010

Publication date: December 15, 2011

Inventors: Konstantinos (Constantin) F. Aliferis, Alexander Statnikov
Computer Implemented Method for Discovery of Markov Boundaries from Datasets with Hidden Variables

Publication number: 20110202322

Abstract: Methods for Markov boundary discovery are important recent developments in pattern recognition and applied statistics, primarily because they offer a principled solution to the variable/feature selection problem and give insight about local causal structure. Currently there exist two major local method families for identification of Markov boundaries from data: methods that directly implement the definition of the Markov boundary and newer compositional Markov boundary methods that are more sample efficient and thus often more accurate in practical applications. However, in the datasets with hidden (i.e., unmeasured or unobserved) variables compositional Markov boundary methods may miss some Markov boundary members.

Type: Application

Filed: January 19, 2010

Publication date: August 18, 2011

Inventors: Alexander Statnikov, Konstantinos (Constantin) F. Aliferis
Computer Implemented Method for the Automatic Classification of Instrumental Citations

Publication number: 20100217731

Abstract: The learning method taught in this patent document is significantly different from previous methods for automatic classification of citations that are labor intensive and subject to human bias and error. The present invention automatically generates and avoids these limitations. A set of operational definitions and features uniquely suited to the scientific literature is disclosed along with their use with a learning method that is capable of analyzing the textual content of articles along with bibliometric data to accurately classify instrumental citations.

Type: Application

Filed: November 6, 2009

Publication date: August 26, 2010

Inventors: Lawrence Fu, Konstantinos (Constantin) F. Aliferis

1 2 next