Patents by Inventor Damir Spisic
Damir Spisic has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11783177Abstract: A set of classifiable data containing a plurality of classes is ingested. A target class within the plurality of classes is determined. Using the set of classifiable data, an interactive recall rate chart is generated, and the interactive recall rate chart shows a set of target class recall rates against a set of class recall rates for the remainder of the plurality of classes. The interactive recall rate chart is presented to a user. A target class recall rate selection from the set of target class recall rates is received from the user. The set of classifiable data is reclassified, based on the target class recall rate selection.Type: GrantFiled: September 18, 2019Date of Patent: October 10, 2023Assignee: International Business Machines CorporationInventors: Damir Spisic, Jing Xu, Xue Ying Zhang, Xing Wei
-
Patent number: 11494353Abstract: Mechanisms are provided for detecting interesting decision rules from a set of decision rules in a tree ensemble. Each tree in the tree ensemble is traversed in order to assign each individual data record from a set of data records to an identified leaf node in each tree. Predicted values are determined for the tree ensemble based on predictions provided by each leaf node to which each individual data record is assigned. Interesting sub-indices for decision rules from the set of decision rules are determined and, for each decision rule corresponding to the leaf nodes in the tree ensemble, the sub-indices are combined into interestingness index It. The decision rules are ranked corresponding to the leaf nodes in the tree ensemble according to the associated value of the interestingness index It and a subset of the decision rules corresponding to the leaf nodes in the tree ensemble are reported.Type: GrantFiled: August 22, 2019Date of Patent: November 8, 2022Assignee: International Business Machines CorporationInventors: Damir Spisic, Jing Xu
-
Publication number: 20220335357Abstract: A computer-implemented method, a computer program product, and a computer system for identifying an influencer combination having a root cause to a key performance indicator change. The computer system analyzes metadata to discover semantic information for each column of data, identifies candidate factors that categorize a target performance indicators (KPI), groups the candidate factors into groups based on hierarchies which are included in the semantic information. For respective ones of the hierarchies, the computer system chooses most influential levels as influencer candidates. The computer system creates a stratified sample based on a distribution of target KPI values and evaluates an influential strength to the target KPI. The computer system identifies top influencers in the influencer candidates, based on influential strengths of respective ones of the influencer candidates.Type: ApplicationFiled: April 16, 2021Publication date: October 20, 2022Inventors: Lin Luo, Changying Sun, Graham Wills, Damir Spisic
-
Patent number: 11409723Abstract: Mechanisms are provided for detecting interesting decision rules from a set of decision rules in a tree ensemble. Each tree in the tree ensemble is traversed in order to assign each individual data record from a set of data records to an identified leaf node in each tree. Predicted values are determined for the tree ensemble based on predictions provided by each leaf node to which each individual data record is assigned. Interesting sub-indices for decision rules from the set of decision rules are determined and, for each decision rule corresponding to the leaf nodes in the tree ensemble, the sub-indices are combined into interestingness index It. The decision rules are ranked corresponding to the leaf nodes in the tree ensemble according to the associated value of the interestingness index It and a subset of the decision rules corresponding to the leaf nodes in the tree ensemble are reported.Type: GrantFiled: August 22, 2019Date of Patent: August 9, 2022Assignee: International Business Machines CorporationInventors: Damir Spisic, Jing Xu
-
Patent number: 11176187Abstract: Disclosed aspects relate to data insight discovery using a clustering technique. A set of data may be compressed based on a set of proximity values with respect to a set of predictors to assemble a set of sub-clusters. A set of subgroups may be established by merging a plurality of individual sub-clusters of the set of sub-clusters using a tightness factor. A subset of the subgroups may be selected based on a selection criterion. A set of insight data which indicates a profile of the subset of the set of subgroups with respect to the set of data may be compiled for the subset of the set of subgroups.Type: GrantFiled: September 9, 2019Date of Patent: November 16, 2021Assignee: International Business Machines CorporationInventors: Damir Spisic, Jing Xu
-
Publication number: 20210224677Abstract: A method, apparatus, system, and computer program product for creating a forecasting model for time series data. Seasonality is removed from the times series data having the seasonality and trends to form deseasoned time series data. Trend models are created for sets of candidate change points in the deseasoned time series data. The trend models are for the sets of candidate change points without the seasonality. Seasonal models are created using the time series data without the trends. The seasonal models have different time periods for the seasonality. The trend models are combined with the seasonal models to form complete models that take into account the seasonality and the trends. The forecasting model is selected from the complete models in which the forecasting model is a best fit to a set of criteria.Type: ApplicationFiled: January 21, 2020Publication date: July 22, 2021Inventors: Goran Tomic, Damir Spisic, Graham Wills, Kevin Gasiorowski
-
Patent number: 11036701Abstract: A computer-implemented method, computer program product and system for data sampling in a storage system. The storage system includes a dataset comprising records and a buffer. The dataset is scanned record-by-record to determine whether the current record belongs to a random sample. If so, then the current record may be added to a first set of records. Otherwise, at least one storage score may be calculated or determined for the current record using attribute values of the current record. Next, it may be determined whether the buffer includes available size for storing the current record. In case the buffer comprises the available size, the current record may be stored in the buffer. Otherwise, at least part of the buffer may be free up. A subsample of the dataset may be provided as a result of merging the first set of records and at least part of the buffered records.Type: GrantFiled: January 6, 2020Date of Patent: June 15, 2021Assignee: International Business Machines CorporationInventors: Albert Maier, Yannick Saillet, Damir Spisic
-
Publication number: 20210081767Abstract: A set of classifiable data containing a plurality of classes is ingested. A target class within the plurality of classes is determined. Using the set of classifiable data, an interactive recall rate chart is generated, and the interactive recall rate chart shows a set of target class recall rates against a set of class recall rates for the remainder of the plurality of classes. The interactive recall rate chart is presented to a user. A target class recall rate selection from the set of target class recall rates is received from the user. The set of classifiable data is reclassified, based on the target class recall rate selection.Type: ApplicationFiled: September 18, 2019Publication date: March 18, 2021Inventors: Damir Spisic, Jing Xu, Xue Ying Zhang, XING WEI
-
Patent number: 10831733Abstract: Provided are techniques for interactive adjustment of decision rules. A modified decision rule with one or more decision rule conditions is received for adjusting an original decision tree, wherein at least one of the decision rule conditions has been modified. A decision rule condition that has been modified and a corresponding decision tree node of the original decision tree are selected. Data records from a database are selected for a parent node of the corresponding decision tree node. The selected data records that match the decision rule condition are filtered. A sub-tree is generated using the filtered data records with a first splitting variable from the modified decision rule condition. An original sub-tree is replaced with the generated sub-tree and the decision rule condition to form an adjusted decision tree. The adjusted decision tree is used to predict a value of a target variable based on available predictors.Type: GrantFiled: December 22, 2017Date of Patent: November 10, 2020Assignee: International Business Machines CorporationInventors: Ana C. Gomez, Damir Spisic
-
Publication number: 20200142870Abstract: A computer-implemented method, computer program product and system for data sampling in a storage system. The storage system includes a dataset comprising records and a buffer. The dataset is scanned record-by-record to determine whether the current record belongs to a random sample. If so, then the current record may be added to a first set of records. Otherwise, at least one storage score may be calculated or determined for the current record using attribute values of the current record. Next, it may be determined whether the buffer includes available size for storing the current record. In case the buffer comprises the available size, the current record may be stored in the buffer. Otherwise, at least part of the buffer may be free up. A subsample of the dataset may be provided as a result of merging the first set of records and at least part of the buffered records.Type: ApplicationFiled: January 6, 2020Publication date: May 7, 2020Inventors: Albert Maier, Yannick Saillet, Damir Spisic
-
Patent number: 10579663Abstract: Disclosed aspects relate to data insight discovery using a clustering technique. A set of data may be compressed based on a set of proximity values with respect to a set of predictors to assemble a set of sub-clusters. A set of subgroups may be established by merging a plurality of individual sub-clusters of the set of sub-clusters using a tightness factor. A subset of the subgroups may be selected based on a selection criterion. A set of insight data which indicates a profile of the subset of the set of subgroups with respect to the set of data may be compiled for the subset of the set of subgroups.Type: GrantFiled: May 2, 2017Date of Patent: March 3, 2020Assignee: International Business Machines CorporationInventors: Damir Spisic, Jing Xu
-
Patent number: 10534763Abstract: A computer-implemented method, computer program product and system for data sampling in a storage system. The storage system includes a dataset comprising records and a buffer. The dataset is scanned record-by-record to determine whether the current record belongs to a random sample. If so, then the current record may be added to a first set of records. Otherwise, at least one storage score may be calculated or determined for the current record using attribute values of the current record. Next, it may be determined whether the buffer includes available size for storing the current record. In case the buffer comprises the available size, the current record may be stored in the buffer. Otherwise, at least part of the buffer may be free up. A subsample of the dataset may be provided as a result of merging the first set of records and at least part of the buffered records.Type: GrantFiled: May 10, 2019Date of Patent: January 14, 2020Assignee: International Business Machines CorporationInventors: Albert Maier, Yannick Saillet, Damir Spisic
-
Patent number: 10534762Abstract: A computer-implemented method, computer program product and system for data sampling in a storage system. The storage system includes a dataset comprising records and a buffer. The dataset is scanned record-by-record to determine whether the current record belongs to a random sample. If so, then the current record may be added to a first set of records. Otherwise, at least one storage score may be calculated or determined for the current record using attribute values of the current record. Next, it may be determined whether the buffer includes available size for storing the current record. In case the buffer comprises the available size, the current record may be stored in the buffer. Otherwise, at least part of the buffer may be free up. A subsample of the dataset may be provided as a result of merging the first set of records and at least part of the buffered records.Type: GrantFiled: May 10, 2019Date of Patent: January 14, 2020Assignee: International Business Machines CorporationInventors: Albert Maier, Yannick Saillet, Damir Spisic
-
Publication number: 20190391993Abstract: Disclosed aspects relate to data insight discovery using a clustering technique. A set of data may be compressed based on a set of proximity values with respect to a set of predictors to assemble a set of sub-clusters. A set of subgroups may be established by merging a plurality of individual sub-clusters of the set of sub-clusters using a tightness factor. A subset of the subgroups may be selected based on a selection criterion. A set of insight data which indicates a profile of the subset of the set of subgroups with respect to the set of data may be compiled for the subset of the set of subgroups.Type: ApplicationFiled: September 9, 2019Publication date: December 26, 2019Inventors: Damir Spisic, Jing Xu
-
Publication number: 20190384752Abstract: Mechanisms are provided for detecting interesting decision rules from a set of decision rules in a tree ensemble. Each tree in the tree ensemble is traversed in order to assign each individual data record from a set of data records to an identified leaf node in each tree. Predicted values are determined for the tree ensemble based on predictions provided by each leaf node to which each individual data record is assigned. Interesting sub-indices for decision rules from the set of decision rules are determined and, for each decision rule corresponding to the leaf nodes in the tree ensemble, the sub-indices are combined into interestingness index It. The decision rules are ranked corresponding to the leaf nodes in the tree ensemble according to the associated value of the interestingness index It and a subset of the decision rules corresponding to the leaf nodes in the tree ensemble are reported.Type: ApplicationFiled: August 22, 2019Publication date: December 19, 2019Inventors: Damir Spisic, Jing Xu
-
Publication number: 20190377719Abstract: Mechanisms are provided for detecting interesting decision rules from a set of decision rules in a tree ensemble. Each tree in the tree ensemble is traversed in order to assign each individual data record from a set of data records to an identified leaf node in each tree. Predicted values are determined for the tree ensemble based on predictions provided by each leaf node to which each individual data record is assigned. Interesting sub-indices for decision rules from the set of decision rules are determined and, for each decision rule corresponding to the leaf nodes in the tree ensemble, the sub-indices are combined into interestingness index It. The decision rules are ranked corresponding to the leaf nodes in the tree ensemble according to the associated value of the interestingness index It and a subset of the decision rules corresponding to the leaf nodes in the tree ensemble are reported.Type: ApplicationFiled: August 22, 2019Publication date: December 12, 2019Inventors: Damir Spisic, Jing Xu
-
Patent number: 10467206Abstract: A method, computer program product and system for data sampling in a storage system. The storage system includes a dataset comprising records and a buffer. The dataset is scanned record-by-record to determine whether the current record belongs to a random sample. If so, then the current record may be added to a first set of records. Otherwise, at least one storage score may be calculated or determined for the current record using attribute values of the current record. Next, it may be determined whether the buffer includes available size for storing the current record. In case the buffer comprises the available size, the current record may be stored in the buffer. Otherwise, at least part of the buffer may be free up. A subsample of the dataset may be provided as a result of merging the first set of records and at least part of the buffered records.Type: GrantFiled: March 8, 2017Date of Patent: November 5, 2019Assignee: International Business Machines CorporationInventors: Albert Maier, Yannick Saillet, Damir Spisic
-
Patent number: 10467204Abstract: A method, computer program product and system for data sampling in a storage system. The storage system includes a dataset comprising records and a buffer. The dataset is scanned record-by-record to determine whether the current record belongs to a random sample. If so, then the current record may be added to a first set of records. Otherwise, at least one storage score may be calculated or determined for the current record using attribute values of the current record. Next, it may be determined whether the buffer includes available size for storing the current record. In case the buffer comprises the available size, the current record may be stored in the buffer. Otherwise, at least part of the buffer may be free up. A subsample of the dataset may be provided as a result of merging the first set of records and at least part of the buffered records.Type: GrantFiled: February 18, 2016Date of Patent: November 5, 2019Assignee: International Business Machines CorporationInventors: Albert Maier, Yannick Saillet, Damir Spisic
-
Patent number: 10460275Abstract: A method for comparing predictive data models based on a predictive model search is provided. The method may include receiving a first and second portion of a set of data. The method may also include identifying a first and second variation of the second portion, wherein the first variation is different from the second variation. The method may further include generating first predictive data models based on the first variation, and second predictive data models based on the second variation. Additionally, the method may include applying a criteria to rank the first and second predictive data models based on predictive strength. The method may also include presenting a display of the ranked criteria, comprising the first portion, and a portion of the first and second predictive data models, wherein the portion of the first and second predictive data models are collectively ranked and presented according to the predictive strength.Type: GrantFiled: February 27, 2015Date of Patent: October 29, 2019Assignee: International Business Machines CorporationInventors: Marc Altshuller, Jing-Yun Shyr, Damir Spisic, Margaret J. Vais, Neil Whitney
-
Patent number: 10460276Abstract: A method for comparing predictive data models based on a predictive model search is provided. The method may include receiving a first and second portion of a set of data. The method may also include identifying a first and second variation of the second portion, wherein the first variation is different from the second variation. The method may further include generating first predictive data models based on the first variation, and second predictive data models based on the second variation. Additionally, the method may include applying a criteria to rank the first and second predictive data models based on predictive strength. The method may also include presenting a display of the ranked criteria, comprising the first portion, and a portion of the first and second predictive data models, wherein the portion of the first and second predictive data models are collectively ranked and presented according to the predictive strength.Type: GrantFiled: March 22, 2016Date of Patent: October 29, 2019Assignee: International Business Machines CorporationInventors: Marc Altshuller, Jing-Yun Shyr, Damir Spisic, Margaret J. Vais, Neil Whitney