Patents by Inventor Damir Spisic

Damir Spisic has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11783177
    Abstract: A set of classifiable data containing a plurality of classes is ingested. A target class within the plurality of classes is determined. Using the set of classifiable data, an interactive recall rate chart is generated, and the interactive recall rate chart shows a set of target class recall rates against a set of class recall rates for the remainder of the plurality of classes. The interactive recall rate chart is presented to a user. A target class recall rate selection from the set of target class recall rates is received from the user. The set of classifiable data is reclassified, based on the target class recall rate selection.
    Type: Grant
    Filed: September 18, 2019
    Date of Patent: October 10, 2023
    Assignee: International Business Machines Corporation
    Inventors: Damir Spisic, Jing Xu, Xue Ying Zhang, Xing Wei
  • Patent number: 11494353
    Abstract: Mechanisms are provided for detecting interesting decision rules from a set of decision rules in a tree ensemble. Each tree in the tree ensemble is traversed in order to assign each individual data record from a set of data records to an identified leaf node in each tree. Predicted values are determined for the tree ensemble based on predictions provided by each leaf node to which each individual data record is assigned. Interesting sub-indices for decision rules from the set of decision rules are determined and, for each decision rule corresponding to the leaf nodes in the tree ensemble, the sub-indices are combined into interestingness index It. The decision rules are ranked corresponding to the leaf nodes in the tree ensemble according to the associated value of the interestingness index It and a subset of the decision rules corresponding to the leaf nodes in the tree ensemble are reported.
    Type: Grant
    Filed: August 22, 2019
    Date of Patent: November 8, 2022
    Assignee: International Business Machines Corporation
    Inventors: Damir Spisic, Jing Xu
  • Publication number: 20220335357
    Abstract: A computer-implemented method, a computer program product, and a computer system for identifying an influencer combination having a root cause to a key performance indicator change. The computer system analyzes metadata to discover semantic information for each column of data, identifies candidate factors that categorize a target performance indicators (KPI), groups the candidate factors into groups based on hierarchies which are included in the semantic information. For respective ones of the hierarchies, the computer system chooses most influential levels as influencer candidates. The computer system creates a stratified sample based on a distribution of target KPI values and evaluates an influential strength to the target KPI. The computer system identifies top influencers in the influencer candidates, based on influential strengths of respective ones of the influencer candidates.
    Type: Application
    Filed: April 16, 2021
    Publication date: October 20, 2022
    Inventors: Lin Luo, Changying Sun, Graham Wills, Damir Spisic
  • Patent number: 11409723
    Abstract: Mechanisms are provided for detecting interesting decision rules from a set of decision rules in a tree ensemble. Each tree in the tree ensemble is traversed in order to assign each individual data record from a set of data records to an identified leaf node in each tree. Predicted values are determined for the tree ensemble based on predictions provided by each leaf node to which each individual data record is assigned. Interesting sub-indices for decision rules from the set of decision rules are determined and, for each decision rule corresponding to the leaf nodes in the tree ensemble, the sub-indices are combined into interestingness index It. The decision rules are ranked corresponding to the leaf nodes in the tree ensemble according to the associated value of the interestingness index It and a subset of the decision rules corresponding to the leaf nodes in the tree ensemble are reported.
    Type: Grant
    Filed: August 22, 2019
    Date of Patent: August 9, 2022
    Assignee: International Business Machines Corporation
    Inventors: Damir Spisic, Jing Xu
  • Patent number: 11176187
    Abstract: Disclosed aspects relate to data insight discovery using a clustering technique. A set of data may be compressed based on a set of proximity values with respect to a set of predictors to assemble a set of sub-clusters. A set of subgroups may be established by merging a plurality of individual sub-clusters of the set of sub-clusters using a tightness factor. A subset of the subgroups may be selected based on a selection criterion. A set of insight data which indicates a profile of the subset of the set of subgroups with respect to the set of data may be compiled for the subset of the set of subgroups.
    Type: Grant
    Filed: September 9, 2019
    Date of Patent: November 16, 2021
    Assignee: International Business Machines Corporation
    Inventors: Damir Spisic, Jing Xu
  • Publication number: 20210224677
    Abstract: A method, apparatus, system, and computer program product for creating a forecasting model for time series data. Seasonality is removed from the times series data having the seasonality and trends to form deseasoned time series data. Trend models are created for sets of candidate change points in the deseasoned time series data. The trend models are for the sets of candidate change points without the seasonality. Seasonal models are created using the time series data without the trends. The seasonal models have different time periods for the seasonality. The trend models are combined with the seasonal models to form complete models that take into account the seasonality and the trends. The forecasting model is selected from the complete models in which the forecasting model is a best fit to a set of criteria.
    Type: Application
    Filed: January 21, 2020
    Publication date: July 22, 2021
    Inventors: Goran Tomic, Damir Spisic, Graham Wills, Kevin Gasiorowski
  • Patent number: 11036701
    Abstract: A computer-implemented method, computer program product and system for data sampling in a storage system. The storage system includes a dataset comprising records and a buffer. The dataset is scanned record-by-record to determine whether the current record belongs to a random sample. If so, then the current record may be added to a first set of records. Otherwise, at least one storage score may be calculated or determined for the current record using attribute values of the current record. Next, it may be determined whether the buffer includes available size for storing the current record. In case the buffer comprises the available size, the current record may be stored in the buffer. Otherwise, at least part of the buffer may be free up. A subsample of the dataset may be provided as a result of merging the first set of records and at least part of the buffered records.
    Type: Grant
    Filed: January 6, 2020
    Date of Patent: June 15, 2021
    Assignee: International Business Machines Corporation
    Inventors: Albert Maier, Yannick Saillet, Damir Spisic
  • Publication number: 20210081767
    Abstract: A set of classifiable data containing a plurality of classes is ingested. A target class within the plurality of classes is determined. Using the set of classifiable data, an interactive recall rate chart is generated, and the interactive recall rate chart shows a set of target class recall rates against a set of class recall rates for the remainder of the plurality of classes. The interactive recall rate chart is presented to a user. A target class recall rate selection from the set of target class recall rates is received from the user. The set of classifiable data is reclassified, based on the target class recall rate selection.
    Type: Application
    Filed: September 18, 2019
    Publication date: March 18, 2021
    Inventors: Damir Spisic, Jing Xu, Xue Ying Zhang, XING WEI
  • Patent number: 10831733
    Abstract: Provided are techniques for interactive adjustment of decision rules. A modified decision rule with one or more decision rule conditions is received for adjusting an original decision tree, wherein at least one of the decision rule conditions has been modified. A decision rule condition that has been modified and a corresponding decision tree node of the original decision tree are selected. Data records from a database are selected for a parent node of the corresponding decision tree node. The selected data records that match the decision rule condition are filtered. A sub-tree is generated using the filtered data records with a first splitting variable from the modified decision rule condition. An original sub-tree is replaced with the generated sub-tree and the decision rule condition to form an adjusted decision tree. The adjusted decision tree is used to predict a value of a target variable based on available predictors.
    Type: Grant
    Filed: December 22, 2017
    Date of Patent: November 10, 2020
    Assignee: International Business Machines Corporation
    Inventors: Ana C. Gomez, Damir Spisic
  • Publication number: 20200142870
    Abstract: A computer-implemented method, computer program product and system for data sampling in a storage system. The storage system includes a dataset comprising records and a buffer. The dataset is scanned record-by-record to determine whether the current record belongs to a random sample. If so, then the current record may be added to a first set of records. Otherwise, at least one storage score may be calculated or determined for the current record using attribute values of the current record. Next, it may be determined whether the buffer includes available size for storing the current record. In case the buffer comprises the available size, the current record may be stored in the buffer. Otherwise, at least part of the buffer may be free up. A subsample of the dataset may be provided as a result of merging the first set of records and at least part of the buffered records.
    Type: Application
    Filed: January 6, 2020
    Publication date: May 7, 2020
    Inventors: Albert Maier, Yannick Saillet, Damir Spisic
  • Patent number: 10579663
    Abstract: Disclosed aspects relate to data insight discovery using a clustering technique. A set of data may be compressed based on a set of proximity values with respect to a set of predictors to assemble a set of sub-clusters. A set of subgroups may be established by merging a plurality of individual sub-clusters of the set of sub-clusters using a tightness factor. A subset of the subgroups may be selected based on a selection criterion. A set of insight data which indicates a profile of the subset of the set of subgroups with respect to the set of data may be compiled for the subset of the set of subgroups.
    Type: Grant
    Filed: May 2, 2017
    Date of Patent: March 3, 2020
    Assignee: International Business Machines Corporation
    Inventors: Damir Spisic, Jing Xu
  • Patent number: 10534763
    Abstract: A computer-implemented method, computer program product and system for data sampling in a storage system. The storage system includes a dataset comprising records and a buffer. The dataset is scanned record-by-record to determine whether the current record belongs to a random sample. If so, then the current record may be added to a first set of records. Otherwise, at least one storage score may be calculated or determined for the current record using attribute values of the current record. Next, it may be determined whether the buffer includes available size for storing the current record. In case the buffer comprises the available size, the current record may be stored in the buffer. Otherwise, at least part of the buffer may be free up. A subsample of the dataset may be provided as a result of merging the first set of records and at least part of the buffered records.
    Type: Grant
    Filed: May 10, 2019
    Date of Patent: January 14, 2020
    Assignee: International Business Machines Corporation
    Inventors: Albert Maier, Yannick Saillet, Damir Spisic
  • Patent number: 10534762
    Abstract: A computer-implemented method, computer program product and system for data sampling in a storage system. The storage system includes a dataset comprising records and a buffer. The dataset is scanned record-by-record to determine whether the current record belongs to a random sample. If so, then the current record may be added to a first set of records. Otherwise, at least one storage score may be calculated or determined for the current record using attribute values of the current record. Next, it may be determined whether the buffer includes available size for storing the current record. In case the buffer comprises the available size, the current record may be stored in the buffer. Otherwise, at least part of the buffer may be free up. A subsample of the dataset may be provided as a result of merging the first set of records and at least part of the buffered records.
    Type: Grant
    Filed: May 10, 2019
    Date of Patent: January 14, 2020
    Assignee: International Business Machines Corporation
    Inventors: Albert Maier, Yannick Saillet, Damir Spisic
  • Publication number: 20190391993
    Abstract: Disclosed aspects relate to data insight discovery using a clustering technique. A set of data may be compressed based on a set of proximity values with respect to a set of predictors to assemble a set of sub-clusters. A set of subgroups may be established by merging a plurality of individual sub-clusters of the set of sub-clusters using a tightness factor. A subset of the subgroups may be selected based on a selection criterion. A set of insight data which indicates a profile of the subset of the set of subgroups with respect to the set of data may be compiled for the subset of the set of subgroups.
    Type: Application
    Filed: September 9, 2019
    Publication date: December 26, 2019
    Inventors: Damir Spisic, Jing Xu
  • Publication number: 20190384752
    Abstract: Mechanisms are provided for detecting interesting decision rules from a set of decision rules in a tree ensemble. Each tree in the tree ensemble is traversed in order to assign each individual data record from a set of data records to an identified leaf node in each tree. Predicted values are determined for the tree ensemble based on predictions provided by each leaf node to which each individual data record is assigned. Interesting sub-indices for decision rules from the set of decision rules are determined and, for each decision rule corresponding to the leaf nodes in the tree ensemble, the sub-indices are combined into interestingness index It. The decision rules are ranked corresponding to the leaf nodes in the tree ensemble according to the associated value of the interestingness index It and a subset of the decision rules corresponding to the leaf nodes in the tree ensemble are reported.
    Type: Application
    Filed: August 22, 2019
    Publication date: December 19, 2019
    Inventors: Damir Spisic, Jing Xu
  • Publication number: 20190377719
    Abstract: Mechanisms are provided for detecting interesting decision rules from a set of decision rules in a tree ensemble. Each tree in the tree ensemble is traversed in order to assign each individual data record from a set of data records to an identified leaf node in each tree. Predicted values are determined for the tree ensemble based on predictions provided by each leaf node to which each individual data record is assigned. Interesting sub-indices for decision rules from the set of decision rules are determined and, for each decision rule corresponding to the leaf nodes in the tree ensemble, the sub-indices are combined into interestingness index It. The decision rules are ranked corresponding to the leaf nodes in the tree ensemble according to the associated value of the interestingness index It and a subset of the decision rules corresponding to the leaf nodes in the tree ensemble are reported.
    Type: Application
    Filed: August 22, 2019
    Publication date: December 12, 2019
    Inventors: Damir Spisic, Jing Xu
  • Patent number: 10467206
    Abstract: A method, computer program product and system for data sampling in a storage system. The storage system includes a dataset comprising records and a buffer. The dataset is scanned record-by-record to determine whether the current record belongs to a random sample. If so, then the current record may be added to a first set of records. Otherwise, at least one storage score may be calculated or determined for the current record using attribute values of the current record. Next, it may be determined whether the buffer includes available size for storing the current record. In case the buffer comprises the available size, the current record may be stored in the buffer. Otherwise, at least part of the buffer may be free up. A subsample of the dataset may be provided as a result of merging the first set of records and at least part of the buffered records.
    Type: Grant
    Filed: March 8, 2017
    Date of Patent: November 5, 2019
    Assignee: International Business Machines Corporation
    Inventors: Albert Maier, Yannick Saillet, Damir Spisic
  • Patent number: 10467204
    Abstract: A method, computer program product and system for data sampling in a storage system. The storage system includes a dataset comprising records and a buffer. The dataset is scanned record-by-record to determine whether the current record belongs to a random sample. If so, then the current record may be added to a first set of records. Otherwise, at least one storage score may be calculated or determined for the current record using attribute values of the current record. Next, it may be determined whether the buffer includes available size for storing the current record. In case the buffer comprises the available size, the current record may be stored in the buffer. Otherwise, at least part of the buffer may be free up. A subsample of the dataset may be provided as a result of merging the first set of records and at least part of the buffered records.
    Type: Grant
    Filed: February 18, 2016
    Date of Patent: November 5, 2019
    Assignee: International Business Machines Corporation
    Inventors: Albert Maier, Yannick Saillet, Damir Spisic
  • Patent number: 10460275
    Abstract: A method for comparing predictive data models based on a predictive model search is provided. The method may include receiving a first and second portion of a set of data. The method may also include identifying a first and second variation of the second portion, wherein the first variation is different from the second variation. The method may further include generating first predictive data models based on the first variation, and second predictive data models based on the second variation. Additionally, the method may include applying a criteria to rank the first and second predictive data models based on predictive strength. The method may also include presenting a display of the ranked criteria, comprising the first portion, and a portion of the first and second predictive data models, wherein the portion of the first and second predictive data models are collectively ranked and presented according to the predictive strength.
    Type: Grant
    Filed: February 27, 2015
    Date of Patent: October 29, 2019
    Assignee: International Business Machines Corporation
    Inventors: Marc Altshuller, Jing-Yun Shyr, Damir Spisic, Margaret J. Vais, Neil Whitney
  • Patent number: 10460276
    Abstract: A method for comparing predictive data models based on a predictive model search is provided. The method may include receiving a first and second portion of a set of data. The method may also include identifying a first and second variation of the second portion, wherein the first variation is different from the second variation. The method may further include generating first predictive data models based on the first variation, and second predictive data models based on the second variation. Additionally, the method may include applying a criteria to rank the first and second predictive data models based on predictive strength. The method may also include presenting a display of the ranked criteria, comprising the first portion, and a portion of the first and second predictive data models, wherein the portion of the first and second predictive data models are collectively ranked and presented according to the predictive strength.
    Type: Grant
    Filed: March 22, 2016
    Date of Patent: October 29, 2019
    Assignee: International Business Machines Corporation
    Inventors: Marc Altshuller, Jing-Yun Shyr, Damir Spisic, Margaret J. Vais, Neil Whitney