Patents by Inventor Svetlana Levitan

Svetlana Levitan has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Bayesian Inference Regarding Independence in Two-Way Contingency Tables Having Intrinsic Priors

Publication number: 20200327187

Abstract: Estimating a Bayes factor is provided. Table dimensions of a contingency table are determined. A statistical model type to apply to the contingency table is determined. Fixed marginal totals are specified for either rows or columns when a Multinomial sampling model is applied. A table total is computed when a Poisson sampling model is applied or fixed marginal totals are computed when the Multinomial sampling model is applied to a two by two contingency table. The table total is compared to a first threshold when the Poisson sampling model is applied or fixed marginal totals are compared to a second threshold when the Multinomial sampling model is applied to a two by two contingency table. An estimation method is selected to apply to the contingency table to compute the Bayes factor based on table dimensions, sampling model applied, and fixed marginal totals of the contingency table.

Type: Application

Filed: April 10, 2019

Publication date: October 15, 2020

Inventors: Yingda Jiang, Svetlana Levitan
Mining association rules in the map-reduce framework

Patent number: 10467236

Abstract: In each iteration of the process of mining association rules from transaction data by a cluster of computing systems, each mapper node in the cluster receives a split of the transaction data. Each mapper node scans the split to count an absolute support value of each candidate itemset for current search level(s), and passes the candidate itemsets and their support values to reducer nodes in the cluster. The number of reducer nodes will be determined adaptively based on the number of the candidate itemsets and the number of maximum available resource nodes in the cluster. Each reducer node combines the absolute support value of each candidate itemset, and finds frequent itemsets among them using a minimum support threshold. For each frequent itemset it finds, the reducer node creates association rule(s) satisfying a minimum confidence threshold, and exports all discovered frequent itemsets and association rules to a file system for storage.

Type: Grant

Filed: September 29, 2014

Date of Patent: November 5, 2019

Assignee: International Business Machines Corporation

Inventors: Svetlana Levitan, Dong Liang
Mining association rules in the map-reduce framework

Patent number: 10185752

Abstract: In each iteration of the process of mining association rules from transaction data by a cluster of computing systems, each mapper node in the cluster receives a split of the transaction data. Each mapper node scans the split to count an absolute support value of each candidate itemset for current search level(s), and passes the candidate itemsets and their support values to reducer nodes in the cluster. The number of reducer nodes will be determined adaptively based on the number of the candidate itemsets and the number of maximum available resource nodes in the cluster. Each reducer node combines the absolute support value of each candidate itemset, and finds frequent itemsets among them using a minimum support threshold. For each frequent itemset it finds, the reducer node creates association rule(s) satisfying a minimum confidence threshold, and exports all discovered frequent itemsets and association rules to a file system for storage.

Type: Grant

Filed: April 24, 2015

Date of Patent: January 22, 2019

Assignee: International Business Machines Corporation

Inventors: Svetlana Levitan, Dong Liang
Distributed clustering with outlier detection

Patent number: 10042912

Abstract: One or more processors initiate cluster feature (CF)-tree based hierarchical clustering on leaf entries of CF-trees included in a plurality of subsets. One or more processors, generate respective partial clustering solutions for the subsets. A partial clustering solution includes a set of regular sub-clusters and candidate outlier sub-clusters. One or more processors generate initial regular clusters by performing hierarchical clustering using the regular sub-clusters. For a candidate outlier sub-cluster, one or more processors determine a closest initial regular cluster, and a distance separating the candidate outlier sub-cluster and the closest initial regular cluster. One or more processors determine which candidate outlier sub-clusters are outlier clusters based on which candidate outlier sub-clusters have a computed distance to their respective closest initial regular cluster that is greater than a corresponding distance threshold associated with their respective closest initial regular cluster.

Type: Grant

Filed: November 25, 2014

Date of Patent: August 7, 2018

Assignee: International Business Machines Corporation

Inventors: Svetlana Levitan, Jing-Yun Shyr, Damir Spisic, Jing Xu
Computation of composite functions in a map-reduce framework

Patent number: 9916344

Abstract: Embodiments of the present invention provide efficient systems and methods for processing large data sets using a composite function. Embodiments of the present invention can be used to compute a broad range of composite functions in a single map-reduce job. Each mapper computes an additive function G on a set of specified data partitions, and then passes the results to one or more reducers. The one or more reducers can then compute a function F, using the aggregate results of function G and data from a single partition.

Type: Grant

Filed: January 4, 2016

Date of Patent: March 13, 2018

Assignee: International Business Machines Corporation

Inventors: Svetlana Levitan, Damir Spisic
COMPUTATION OF COMPOSITE FUNCTIONS IN A MAP-REDUCE FRAMEWORK

Publication number: 20170193030

Abstract: Embodiments of the present invention provide efficient systems and methods for processing large data sets using a composite function. Embodiments of the present invention can be used to compute a broad range of composite functions in a single map-reduce job. Each mapper computes an additive function G on a set of specified data partitions, and then passes the results to one or more reducers. The one or more reducers can then compute a function F, using the aggregate results of function G and data from a single partition.

Type: Application

Filed: January 4, 2016

Publication date: July 6, 2017

Inventors: Svetlana Levitan, Damir Spisic
Distributed clustering with outlier detection

Patent number: 9589045

Abstract: One or more processors initiate cluster feature (CF)-tree based hierarchical clustering on leaf entries of CF-trees included in a plurality of subsets. One or more processors, generate respective partial clustering solutions for the subsets. A partial clustering solution includes a set of regular sub-clusters and candidate outlier sub-clusters. One or more processors generate initial regular clusters by performing hierarchical clustering using the regular sub-clusters. For a candidate outlier sub-cluster, one or more processors determine a closest initial regular cluster, and a distance separating the candidate outlier sub-cluster and the closest initial regular cluster. One or more processors determine which candidate outlier sub-clusters are outlier clusters based on which candidate outlier sub-clusters have a computed distance to their respective closest initial regular cluster that is greater than a corresponding distance threshold associated with their respective closest initial regular cluster.

Type: Grant

Filed: April 8, 2014

Date of Patent: March 7, 2017

Assignee: International Business Machines Corporation

Inventors: Svetlana Levitan, Jing-Yun Shyr, Damir Spisic, Jing Xu
MINING ASSOCIATION RULES IN THE MAP-REDUCE FRAMEWORK

Publication number: 20160092514

Abstract: In each iteration of the process of mining association rules from transaction data by a cluster of computing systems, each mapper node in the cluster receives a split of the transaction data. Each mapper node scans the split to count an absolute support value of each candidate itemset for current search level(s), and passes the candidate itemsets and their support values to reducer nodes in the cluster. The number of reducer nodes will be determined adaptively based on the number of the candidate itemsets and the number of maximum available resource nodes in the cluster. Each reducer node combines the absolute support value of each candidate itemset, and finds frequent itemsets among them using a minimum support threshold. For each frequent itemset it finds, the reducer node creates association rule(s) satisfying a minimum confidence threshold, and exports all discovered frequent itemsets and association rules to a file system for storage.

Type: Application

Filed: September 29, 2014

Publication date: March 31, 2016

Inventors: Svetlana LEVITAN, Dong LIANG
MINING ASSOCIATION RULES IN THE MAP-REDUCE FRAMEWORK

Publication number: 20160092515

Abstract: In each iteration of the process of mining association rules from transaction data by a cluster of computing systems, each mapper node in the cluster receives a split of the transaction data. Each mapper node scans the split to count an absolute support value of each candidate itemset for current search level(s), and passes the candidate itemsets and their support values to reducer nodes in the cluster. The number of reducer nodes will be determined adaptively based on the number of the candidate itemsets and the number of maximum available resource nodes in the cluster. Each reducer node combines the absolute support value of each candidate itemset, and finds frequent itemsets among them using a minimum support threshold. For each frequent itemset it finds, the reducer node creates association rule(s) satisfying a minimum confidence threshold, and exports all discovered frequent itemsets and association rules to a file system for storage.

Type: Application

Filed: April 24, 2015

Publication date: March 31, 2016

Inventors: Svetlana LEVITAN, Dong LIANG
DISTRIBUTED CLUSTERING WITH OUTLIER DETECTION

Publication number: 20150286707

Abstract: One or more processors initiate cluster feature (CF)-tree based hierarchical clustering on leaf entries of CF-trees included in a plurality of subsets. One or more processors, generate respective partial clustering solutions for the subsets. A partial clustering solution includes a set of regular sub-clusters and candidate outlier sub-clusters. One or more processors generate initial regular clusters by performing hierarchical clustering using the regular sub-clusters. For a candidate outlier sub-cluster, one or more processors determine a closest initial regular cluster, and a distance separating the candidate outlier sub-cluster and the closest initial regular cluster. One or more processors determine which candidate outlier sub-clusters are outlier clusters based on which candidate outlier sub-clusters have a computed distance to their respective closest initial regular cluster that is greater than a corresponding distance threshold associated with their respective closest initial regular cluster.

Type: Application

Filed: April 8, 2014

Publication date: October 8, 2015

Applicant: International Business Machines Corporation

Inventors: Svetlana Levitan, Jing-Yun Shyr, Damir Spisic, Jing Xu
ADAPTIVE VARIABLE SELECTION FOR DATA CLUSTERING

Publication number: 20150286703

Abstract: One or more processors initiate cluster feature (CF)-tree based hierarchical clustering on leaf entries of CF-trees included in a plurality of subsets. One or more processors, generate respective partial clustering solutions for the subsets. A partial clustering solution includes a set of regular sub-clusters and candidate outlier sub-clusters. One or more processors generate initial regular clusters by performing hierarchical clustering using the regular sub-clusters. For a candidate outlier sub-cluster, one or more processors determine a closest initial regular cluster, and a distance separating the candidate outlier sub-cluster and the closest initial regular cluster. One or more processors determine which candidate outlier sub-clusters are outlier clusters based on which candidate outlier sub-clusters have a computed distance to their respective closest initial regular cluster that is greater than a corresponding distance threshold associated with their respective closest initial regular cluster.

Type: Application

Filed: November 25, 2014

Publication date: October 8, 2015

Inventors: Svetlana Levitan, Jing-Yun Shyr, Damir Spisic, Jing Xu