Patents by Inventor Konstantinos (Constantin) F. Aliferis

Konstantinos (Constantin) F. Aliferis has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 10303737
    Abstract: Methods for discovery of a Markov boundary from data constitute one of the most important recent developments in pattern recognition and applied data analysis and modeling, primarily because they offer a principled solution to the variable/feature selection problem and give insight about local causal structure. Even though there is always a single Markov boundary of the response variable in faithful distributions, distributions with violations of the intersection property of probability theory may have multiple Markov boundaries. Such distributions are abundant in practical data-analytic applications, and there are several reasons why it is important to discover and extract all Markov boundaries from such data as a critical step of data analysis. The present invention is a novel fast generative method (termed Generalized-iTIE*) that can discover all Markov boundaries from a sample drawn from a distribution.
    Type: Grant
    Filed: March 17, 2014
    Date of Patent: May 28, 2019
    Inventors: Alexander Statnikov, Konstantinos (Constantin) F. Aliferis
  • Patent number: 10296850
    Abstract: The present invention consists of a computer-implemented system and method for automatically analyzing and coding documents into content categories suitable for high cost, high yield settings where quality and efficiency of classification are essential. A prototypical example application field is legal document predictive coding for purposes of e-discovery and litigation (or litigation readiness) where the automated classification of documents as “responsive” or not must be (a) efficient, (b) accurate, and (c) defensible in court. Many text classification technologies exist but they focus on the relatively simple steps of using a training method on training data, producing a model and testing it on test data. They invariably do not address effectively and simultaneously key quality assurance requirements. The invention applies several data design and validation steps that ensure quality and removal of all possible sources of document classification error or deficiencies.
    Type: Grant
    Filed: March 17, 2014
    Date of Patent: May 21, 2019
    Inventors: Konstantinos (Constantin) F. Aliferis, Yin Aphinyanaphongs, Alexander Statnikov, Lawrence Fu
  • Patent number: 10289751
    Abstract: Discovery of causal networks is essential for understanding and manipulating complex systems in numerous data analysis application domains. Several methods have been proposed in the last two decades for solving this problem. The inventive method uses local causal discovery methods for global causal network learning in a divide-and-conquer fashion. The usefulness of the invention is demonstrated in data capturing characteristics of several domains. The inventive method outputs more accurate networks compared to other discovery approaches.
    Type: Grant
    Filed: March 17, 2014
    Date of Patent: May 14, 2019
    Inventors: Konstantinos (Constantin) F. Aliferis, Alexander Statnikov
  • Patent number: 9858533
    Abstract: The present invention addresses two ubiquitous and pressing problems of modern data analytics technology. Many modern pattern recognition technologies produce models with excellent predictivity but (a) they are “black boxes”, that is they are opaque to the user; (b) they are too large, and/or expensive to execute in less powerful computing platforms. The invention “opens up” a black box model by converting it to a compact and understandable model that is functionally equivalent. The invention also converts a predictive model into a functionally equivalent model into a form that can be implemented and deployed more easily or efficiently in practice. The benefits include: model understandability and defensibility of modeling. A particularly interesting application is that of understanding the decision making of humans, comparison of the behavior of a human or computerized decision process against another and use to enhance education and guideline compliance/adherence detection and improvement.
    Type: Grant
    Filed: March 17, 2014
    Date of Patent: January 2, 2018
    Inventors: Konstantinos (Constantin) F. Aliferis, Alexander Statnikov, Lawrence Fu, Yin Aphinyanaphongs
  • Patent number: 9772741
    Abstract: Predictive modeling is an important class of data analytics with applications in numerous fields. Once a predictive model is built, validated, and applied on a set of objects, by a data analytics system (or even by manual modeling), consumers of the model information need assistance to navigate through the results. This is because both regression and classification models that output continuous values (eg, probability of belonging to a class) are often used to rank objects and then a thresholding of the ranked scores needs to be used to separate objects into a “positive” and a “negative” class. The choice of threshold greatly affects the true positive, false positive, true negative, and false negative results of the model's application. An ideal data analytics system should allow the user to understand the tradeoffs of different threshold values for different thresholds.
    Type: Grant
    Filed: March 17, 2014
    Date of Patent: September 26, 2017
    Inventors: Konstantinos (Constantin) F. Aliferis, Yin Aphinyanaphongs, Lawrence Fu, Alexander Statnikov
  • Patent number: 9720940
    Abstract: The focus of the present invention is the modular analysis of Big Data encompassing parallelization, chunking, and distributed analysis applications.
    Type: Grant
    Filed: March 17, 2014
    Date of Patent: August 1, 2017
    Inventors: Konstantinos (Constantin) F. Aliferis, Alexander Statnikov
  • Publication number: 20140324752
    Abstract: Methods for discovery of a Markov boundary from data constitute one of the most important recent developments in pattern recognition and applied data analysis and modeling, primarily because they offer a principled solution to the variable/feature selection problem and give insight about local causal structure. Even though there is always a single Markov boundary of the response variable in faithful distributions, distributions with violations of the intersection property of probability theory may have multiple Markov boundaries. Such distributions are abundant in practical data-analytic applications, and there are several reasons why it is important to discover and extract all Markov boundaries from such data as a critical step of data analysis. The present invention is a novel fast generative method (termed Generalized-iTIE*) that can discover all Markov boundaries from a sample drawn from a distribution.
    Type: Application
    Filed: March 17, 2014
    Publication date: October 30, 2014
    Inventors: Alexander Statnikov, Konstantinos (Constantin) F. Aliferis
  • Publication number: 20140289174
    Abstract: Discovery of causal models via experimentation is essential in numerous applications fields. One of the primary objectives of the invention is to minimize the use of costly experimental resources while achieving high discovery accuracy. The invention provides new methods and processes to enable accurate discovery of local causal pathways by integrating high-throughput observational data with efficient experimentation strategies. At the core of these methods are computational causal discovery techniques that account for multiplicity (i.e., indistinguishability) of causal pathways consistent with observational data. The invention, when applied for discovery of local causal pathways from a combination of observational and experimental data, achieves higher discovery accuracy than existing observational approaches and uses fewer experimental resources than existing experimental approaches. Repeated application of the invention for each variable in the modeled system produces the full causal model.
    Type: Application
    Filed: March 17, 2014
    Publication date: September 25, 2014
    Inventors: Alexander Statnikov, Konstantinos (Constantin) F. Aliferis
  • Publication number: 20140280361
    Abstract: Discovery of causal networks is essential for understanding and manipulating complex systems in numerous data analysis application domains. Several methods have been proposed in the last two decades for solving this problem. The inventive method uses local causal discovery methods for global causal network learning in a divide-and-conquer fashion. The usefulness of the invention is demonstrated in data capturing characteristics of several domains. The inventive method outputs more accurate networks compared to other discovery approaches.
    Type: Application
    Filed: March 17, 2014
    Publication date: September 18, 2014
    Inventors: Konstantinos (Constantin) F. Aliferis, Alexander Statnikov
  • Publication number: 20140278339
    Abstract: Established methods for statistical “power-size” analysis for statistical modeling are geared toward statistical hypothesis testing, and have serious shortcomings in modern complex predictive and causal modeling applications where the determination of sample size is affected by parameters not addressed by the standard statistical power-size analysis. The present invention provides a method and computer-implemented system for determining sufficient sample size for training predictive or causal models for a given application field or distribution type and desired performance level taking into account the critical factors that affect the needed sample size. The invention can be applied to practically any field where predictive modeling or causal modeling are desired.
    Type: Application
    Filed: March 17, 2014
    Publication date: September 18, 2014
    Inventors: Konstantinos (Constantin) F. Aliferis, Lawrence Fu, Alexander Statnikov, Yin Aphinyanaphongs
  • Publication number: 20140280257
    Abstract: The focus of the present invention is the modular analysis of Big Data encompassing parallelization, chunking, and distributed analysis applications.
    Type: Application
    Filed: March 17, 2014
    Publication date: September 18, 2014
    Inventors: Konstantinos (Constantin) F. Aliferis, Alexander Statnikov
  • Publication number: 20140279760
    Abstract: The present invention addresses two ubiquitous and pressing problems of modern data analytics technology. Many modern pattern recognition technologies produce models with excellent predictivity but (a) they are “black boxes”, that is they are opaque to the user; (b) they are too large, and/or expensive to execute in less powerful computing platforms. The invention “opens up” a black box model by converting it to a compact and understandable model that is functionally equivalent. The invention also converts a predictive model into a functionally equivalent model into a form that can be implemented and deployed more easily or efficiently in practice. The benefits include: model understandability and defensibility of modeling. A particularly interesting application is that of understanding the decision making of humans, comparison of the behavior of a human or computerized decision process against another and use to enhance education and guideline compliance/adherence detection and improvement.
    Type: Application
    Filed: March 17, 2014
    Publication date: September 18, 2014
    Inventors: Konstantinos (Constantin) F. Aliferis, Alexander Satnikov, Lawrence Fu, Yin Aphinyanaphongs
  • Publication number: 20140279794
    Abstract: Predictive modeling is an important class of data analytics with applications in numerous fields. Once a predictive model is built, validated, and applied on a set of objects, by a data analytics system (or even by manual modeling), consumers of the model information need assistance to navigate through the results. This is because both regression and classification models that output continuous values (eg, probability of belonging to a class) are often used to rank objects and then a thresholding of the ranked scores needs to be used to separate objects into a “positive” and a “negative” class. The choice of threshold greatly affects the true positive, false positive, true negative, and false negative results of the model's application. An ideal data analytics system should allow the user to understand the tradeoffs of different threshold values for different thresholds.
    Type: Application
    Filed: March 17, 2014
    Publication date: September 18, 2014
    Inventors: Konstantinos (Constantin) F. Aliferis, Yin Aphinyanaphongs, Lawrence Fu, Alexander Statnikov
  • Publication number: 20140279761
    Abstract: The present invention consists of a computer-implemented system and method for automatically analyzing and coding documents into content categories suitable for high cost, high yield settings where quality and efficiency of classification are essential. A prototypical example application field is legal document predictive coding for purposes of e-discovery and litigation (or litigation readiness) where the automated classification of documents as “responsive” or not must be (a) efficient, (b) accurate, and (c) defensible in court. Many text classification technologies exist but they focus on the relatively simple steps of using a training method on training data, producing a model and testing it on test data. They invariably do not address effectively and simultaneously key quality assurance requirements. The invention applies several data design and validation steps that ensure quality and removal of all possible sources of document classification error or deficiencies.
    Type: Application
    Filed: March 17, 2014
    Publication date: September 18, 2014
    Inventors: Konstantinos (Constantin) F. Aliferis, Yin Aphinyanaphongs, Alexander Statnikov, Lawrence Fu
  • Patent number: 8832002
    Abstract: The learning method taught in this patent document is significantly different from previous methods for automatic classification of citations that are labor intensive and subject to human bias and error. The present invention automatically generates and avoids these limitations. A set of operational definitions and features uniquely suited to the scientific literature is disclosed along with their use with a learning method that is capable of analyzing the textual content of articles along with bibliometric data to accurately classify instrumental citations.
    Type: Grant
    Filed: November 6, 2009
    Date of Patent: September 9, 2014
    Inventors: Lawrence Fu, Konstantinos (Constantin) F. Aliferis
  • Patent number: 8805761
    Abstract: Methods for discovery of a Markov boundary from data constitute one of the most important recent developments in pattern recognition and applied statistics, primarily because they offer a principled solution to the variable/feature selection problem and give insight about local causal structure. Even though there is always a single Markov boundary of the response variable in faithful distributions, distributions with violations of the intersection property of probability theory may have multiple Markov boundaries. Such distributions are abundant in practical data-analytic applications, and there are several reasons why it is important to discover all Markov boundaries from such data. The present invention is a novel computer implemented generative method (termed TIE*) that can discover all Markov boundaries from a data sample drawn from a distribution. TIE* can be instantiated to discover all and only Markov boundaries independent of data distribution.
    Type: Grant
    Filed: October 30, 2009
    Date of Patent: August 12, 2014
    Inventors: Alexander Statnikov, Konstantinos (Constantin) F. Aliferis
  • Patent number: 8655821
    Abstract: Methods for discovery of local causes/effects and of Markov blankets enable discovery of causal relationships from large data sets and provide principled solutions to the variable/feature selection problem, an integral part of predictive modeling. The present invention provides a generative method for learning local causal structure around target variables of interest in the form of direct causes/effects and Markov blankets applicable to very large real world datasets even with small samples. The selected feature sets can be used for causal discovery, classification, and regression. The generative method GLL can be instantiated in many ways giving rise to novel method variants. The method transforms a dataset with many variables into either a minimal reduced dataset where all variables are needed for optimal prediction of the response variable, or a dataset where all variables are direct causes and direct effects or the Markov blanket of the response variable.
    Type: Grant
    Filed: February 4, 2010
    Date of Patent: February 18, 2014
    Inventors: Konstantinos (Constantin) F. Aliferis, Alexander Statnikov
  • Publication number: 20110307437
    Abstract: In many areas, recent developments have generated very large datasets from which it is desired to extract meaningful relationships between the dataset elements. However, to date, the finding of such relationships using prior art methods has proved extremely difficult especially in the biomedical arts. Methods for local causal learning and Markov blanket discovery are important recent developments in pattern recognition and applied statistics, primarily because they offer a principled solution to the variable/feature selection problem and give insight about local causal structure. The present invention provides a generative method for learning local causal structure around target variables of interest in the form of direct causes/effects and Markov blankets applicable to very large datasets and relatively small samples. The method is readily applicable to real-world data, and the selected feature sets can be used for causal discovery and classification.
    Type: Application
    Filed: February 4, 2010
    Publication date: December 15, 2011
    Inventors: Konstantinos (Constantin) F. Aliferis, Alexander Statnikov
  • Publication number: 20110202322
    Abstract: Methods for Markov boundary discovery are important recent developments in pattern recognition and applied statistics, primarily because they offer a principled solution to the variable/feature selection problem and give insight about local causal structure. Currently there exist two major local method families for identification of Markov boundaries from data: methods that directly implement the definition of the Markov boundary and newer compositional Markov boundary methods that are more sample efficient and thus often more accurate in practical applications. However, in the datasets with hidden (i.e., unmeasured or unobserved) variables compositional Markov boundary methods may miss some Markov boundary members.
    Type: Application
    Filed: January 19, 2010
    Publication date: August 18, 2011
    Inventors: Alexander Statnikov, Konstantinos (Constantin) F. Aliferis
  • Publication number: 20100217731
    Abstract: The learning method taught in this patent document is significantly different from previous methods for automatic classification of citations that are labor intensive and subject to human bias and error. The present invention automatically generates and avoids these limitations. A set of operational definitions and features uniquely suited to the scientific literature is disclosed along with their use with a learning method that is capable of analyzing the textual content of articles along with bibliometric data to accurately classify instrumental citations.
    Type: Application
    Filed: November 6, 2009
    Publication date: August 26, 2010
    Inventors: Lawrence Fu, Konstantinos (Constantin) F. Aliferis