Patents by Inventor David HAWS
David HAWS has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20140207436Abstract: Various embodiments select markers for modeling epistasis effects. In one embodiment, a processor receives a set of genetic markers and a phenotype. A relevance score is determined with respect to the phenotype for each of the set of genetic markers. A threshold is set based on the relevance score of a genetic marker with a highest relevancy score. A relevance score is determined for at least one genetic marker in the set of genetic markers for at least one interaction between the at least one genetic marker and at least one other genetic marker in the set of genetic markers. The at least one interaction is added to a top-k feature set based on the relevance score of the at least one interaction satisfying the threshold.Type: ApplicationFiled: September 18, 2013Publication date: July 24, 2014Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: David HAWS, Dan HE, Laxmi P. PARIDA
-
Publication number: 20140207711Abstract: Various embodiments select features from a feature space. In one embodiment, a set of training samples and a set of test samples are received. The set of training samples includes a set of features and a class value. The set of test samples includes the set of features absent the class value. A relevancy with respect to the class value is determined for each of a plurality of unselected features based on the set of training samples. A redundancy with respect to one or more of the set of features is determined for each of the plurality of unselected features in the first set of features based on the set of training samples and the set of test samples. A set of features is selected from the plurality of unselected features based on the relevancy and the redundancy determined for each of the plurality of unselected features.Type: ApplicationFiled: January 21, 2013Publication date: July 24, 2014Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: David HAWS, Dan HE, Laxmi P. PARIDA, Irina RISH
-
Publication number: 20140207713Abstract: Various embodiments select features from a feature space. In one embodiment, a set of training samples and a set of test samples are received. The set of training samples includes a set of features and a class value. The set of test samples includes the set of features absent the class value. A relevancy with respect to the class value is determined for each of a plurality of unselected features based on the set of training samples. A redundancy with respect to one or more of the set of features is determined for each of the plurality of unselected features in the first set of features based on the set of training samples and the set of test samples. A set of features is selected from the plurality of unselected features based on the relevancy and the redundancy determined for each of the plurality of unselected features.Type: ApplicationFiled: September 18, 2013Publication date: July 24, 2014Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: David HAWS, Dan HE, Laxmi P. PARIDA, Irina RISH
-
Publication number: 20140207765Abstract: Various embodiments select features from a feature space. In one embodiment a set of features and a class value are received. A redundancy score is obtained for a feature that was previously selected from the set of features. A redundancy score is determined, for each of a plurality of unselected features in the set of features, based on the redundancy score that has been obtained, and a redundancy between the unselected feature and the feature that was previously selected. A relevance to the class value is determined for each of the unselected features. A feature from the plurality of unselected features with a highest relevance to the class value and a lowest redundancy score is selected.Type: ApplicationFiled: September 18, 2013Publication date: July 24, 2014Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: David HAWS, Dan HE, Laxmi P. PARIDA
-
Publication number: 20140207800Abstract: Various embodiments select features from a feature space. In one embodiment a candidate feature set of k? features is selected from at least one set of features based on maximum relevancy and minimum redundancy (MRMR) criteria. A target feature set of k features is identified from the candidate feature set, where k?>k. Each a plurality of features in the target feature set is iteratively updated with each of a plurality of k??k features from the candidate feature set. The feature from the plurality of k??k features is maintained in the target feature set, for at least one iterative update, based on a current MRMR score of the target feature set satisfying a threshold. The target feature set is stored as a top-k feature set of the at least one set of features after a given number of iterative updates.Type: ApplicationFiled: September 18, 2013Publication date: July 24, 2014Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: David HAWS, Dan HE, Laxmi P. PARIDA
-
Publication number: 20140207799Abstract: Various embodiments select features from a feature space. In one embodiment a candidate feature set of k? features is selected from at least one set of features based on maximum relevancy and minimum redundancy (MRMR) criteria. A target feature set of k features is identified from the candidate feature set, where k?>k. Each a plurality of features in the target feature set is iteratively updated with each of a plurality of k??k features from the candidate feature set. The feature from the plurality of k??k features is maintained in the target feature set, for at least one iterative update, based on a current MRMR score of the target feature set satisfying a threshold. The target feature set is stored as a top-k feature set of the at least one set of features after a given number of iterative updates.Type: ApplicationFiled: January 21, 2013Publication date: July 24, 2014Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: David HAWS, Dan HE, Laxmi P. PARIDA
-
Publication number: 20140207427Abstract: Various embodiments select markers for modeling epistasis effects. In one embodiment, a processor receives a set of genetic markers and a phenotype. A relevance score is determined with respect to the phenotype for each of the set of genetic markers. A threshold is set based on the relevance score of a genetic marker with a highest relevancy score. A relevance score is determined for at least one genetic marker in the set of genetic markers for at least one interaction between the at least one genetic marker and at least one other genetic marker in the set of genetic markers. The at least one interaction is added to a top-k feature set based on the relevance score of the at least one interaction satisfying the threshold.Type: ApplicationFiled: January 21, 2013Publication date: July 24, 2014Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: David HAWS, Dan HE, Laxmi P. PARIDA
-
Publication number: 20140164402Abstract: Various embodiments sort data. In one embodiment, a matrix D including a set of data values is received. A matrix Q is received, and includes a set of columns and a set of rows. The matrix Q further includes a sorting of each column of the matrix D. Each of these rows corresponds to a sorting. Each of a set of values in each of the set of columns in the matrix Q identifies a row in the matrix D. At least one sub-matrix D? of the matrix D is identified. A set of columns of the sub-matrix D? is restricted to one or more columns of the matrix D. A processor sorts the sub-matrix D? by rows based on the sorting of the set of columns of the matrix D as given in the matrix Q, and based on the set of data values in the matrix D.Type: ApplicationFiled: December 11, 2012Publication date: June 12, 2014Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: David HAWS, Laxmi P. PARIDA
-
Publication number: 20140164395Abstract: Various embodiments sort data. In one embodiment, a matrix D including a set of data values is received. A matrix Q is received, and includes a set of columns and a set of rows. The matrix Q further includes a sorting of each column of the matrix D. Each of these rows corresponds to a sorting. Each of a set of values in each of the set of columns in the matrix Q identifies a row in the matrix D. At least one sub-matrix D? of the matrix D is identified. A set of columns of the sub-matrix D? is restricted to one or more columns of the matrix D. A processor sorts the sub-matrix D? by rows based on the sorting of the set of columns of the matrix D as given in the matrix Q, and based on the set of data values in the matrix D.Type: ApplicationFiled: October 9, 2013Publication date: June 12, 2014Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: David HAWS, Laxmi P. PARIDA
-
Publication number: 20140136167Abstract: Various embodiments generate a quantitative model of multi-allelic multi-loci interactions. In one embodiment, a plurality of distinct allelic forms of at least two loci of an entity is received. Each of the plurality of distinct allelic forms is associated with a set of genotypes. A contribution value of each genotype to a given physical trait is determined for each set of genotypes. An interaction contribution value for each interaction between each of the set of genotypes of a first of the least two loci and each of the set of genotypes of at least a second of the least two loci to the physical trait is determined from at least one interaction model. A model of a quantitative value of the entity is generated based on the contribution value of each genotype in each set of genotypes and each interaction contribution value that has been determined from the interaction model.Type: ApplicationFiled: September 18, 2013Publication date: May 15, 2014Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: David HAWS, Laxmi P. PARIDA
-
Publication number: 20140136160Abstract: Various embodiments generate a quantitative model of multi-allelic multi-loci interactions. In one embodiment, a plurality of distinct allelic forms of at least two loci of an entity is received. Each of the plurality of distinct allelic forms is associated with a set of genotypes. A contribution value of each genotype to a given physical trait is determined for each set of genotypes. An interaction contribution value for each interaction between each of the set of genotypes of a first of the least two loci and each of the set of genotypes of at least a second of the least two loci to the physical trait is determined from at least one interaction model. A model of a quantitative value of the entity is generated based on the contribution value of each genotype in each set of genotypes and each interaction contribution value that has been determined from the interaction model.Type: ApplicationFiled: November 13, 2012Publication date: May 15, 2014Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: David HAWS, Laxmi P. PARIDA