Patents by Inventor Lukasz G. Cmielowski

Lukasz G. Cmielowski has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11922279
    Abstract: An approach for selecting a transformed prediction model for estimating the uncertainty of prediction associated with machine learning is disclosed. The approach determines a function corresponding to a prediction interval based on one or more datasets and calculates one or more sets of prediction interval parameters associated with the function based on training a set of regression models with the one or more datasets. The approach creates one or more transformed predictions based on the one or more sets of parameters and based on a logical distance and selects a transformed prediction model based on a mean absolute correlation from the one or more transformed predictions. Furthermore, the approach outputs the selected transformed prediction model.
    Type: Grant
    Filed: June 12, 2020
    Date of Patent: March 5, 2024
    Assignee: International Business Machines Corporation
    Inventors: Wojciech Sobala, Rafal Bigaj, Lukasz G. Cmielowski, Maksymilian Erazmus
  • Publication number: 20240070520
    Abstract: Aspects of the invention include systems and methods configured for federated automatic machine learning. A non-limiting example computer-implemented method includes defining a search process including a model configuration for building an automatic machine learning pipeline definition and distributing the search process across a plurality of parties. Each member of the plurality of parties retains federated data including training data and holdout data. The method includes receiving, from each member of the plurality of parties, an evaluation result of the model configuration against respective holdout data and aggregating the received evaluation results to define aggregated parameters. A new pipeline definition is generated from the aggregated parameters and trained local models received from each member of the plurality of parties are aggregated to define an aggregated model. Each trained local model includes the new pipeline definition.
    Type: Application
    Filed: August 30, 2022
    Publication date: February 29, 2024
    Inventors: Lukasz G. Cmielowski, Daniel Jakub Ryszka, Oronde Jason Tucker, Maksymilian Erazmus
  • Patent number: 11853392
    Abstract: A method, computer system, and a computer program product for providing reduced training data for training a machine learning model (ML-model) is disclosed. The present invention may include generating the reduced training data dependent on a first two batches of data records of original training data. The present invention may include generating an updated version of the reduced training data dependent on the reduced training data and a further batch of the original training data, wherein a size of the updated version of the reduced training data is equal or less than a size of the limited storage capacity and less than a combined size of the reduced training data and the further batch together. The reading of a further batch of the data records of the original training data and the generating of an updated version of the reduced training data may be repeated.
    Type: Grant
    Filed: November 30, 2021
    Date of Patent: December 26, 2023
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Lukasz G Cmielowski, Amadeusz Masny, Daniel Jakub Ryszka, Wojciech Sobala
  • Patent number: 11816542
    Abstract: Identifying a change of an indicator value for a system of interdependent entities includes determining the indicator value, logging input data for the system of interdependent entities, logging scoring payload data and related results of a machine-learning model used as part of the system of interdependent entities, wherein the scoring payload data are derived from the input data, clustering the input data into a number of clusters resulting in defined clusters, determining metric values of the machine-learning model by mapping each of the defined clusters onto the scoring payload data resulting in a vector of metric values, re-determining the indicator value for each defined cluster, resulting in a vector of re-calculated indicator values, and determining correlation matrix values for a matrix between the vector of re-determined indicator values and the vector of the metric values for each of the defined clusters.
    Type: Grant
    Filed: January 3, 2020
    Date of Patent: November 14, 2023
    Assignee: International Business Machines Corporation
    Inventors: Lukasz G. Cmielowski, Rafal Bigaj, Wojciech Sobala, Maksymilian Erazmus
  • Patent number: 11790256
    Abstract: A computer-implemented method, system and computer program product for analyzing test result failures using artificial intelligence models. A first machine learning model is trained to differentiate between a bug failure and a test failure within the test failures based on the failure attributes and historical failures. The failure type for each failed test in test failure groups is then determined using the first machine learning model. The failed tests in the test failure groups are then clustered into a set of clusters according to the failure attributes and the determined failure type for each failed test. A root cause failure for each cluster is identified based on the set of clusters and the failure attributes. The root cause of an unclassified failure is predicted using a second machine learning model trained to predict a root cause of the unclassified failure based on identifying the root cause failure for each cluster.
    Type: Grant
    Filed: July 22, 2022
    Date of Patent: October 17, 2023
    Assignee: International Business Machines Corporation
    Inventors: Lukasz G Cmielowski, Maksymilian Erazmus, Rafal Bigaj, Wojciech Sobala
  • Publication number: 20230289650
    Abstract: A continuous machine learning system includes a data generator module, a pipeline search module, a pipeline refinement module, and a pipeline training module. The data generator module obtains raw training data defining a total data size and generates a plurality of data batches from the raw training data. The pipeline search module obtains an initial data batch from among the plurality of data batches and determines a best machine learning model pipeline among a plurality of machine learning model pipelines based on the initial data batch. The pipeline refinement module receives the best machine learning model pipeline and refines the best machine learning model pipeline to generate a refined pipeline that consumes the plurality of data batches. The pipeline training module incrementally trains the refined pipeline using remaining data batches among the plurality of data batches generated after the initial data batch.
    Type: Application
    Filed: March 9, 2022
    Publication date: September 14, 2023
    Inventors: Lukasz G. Cmielowski, AMADEUSZ MASNY, Thomas Parnell, Kiran A. Kate
  • Patent number: 11748638
    Abstract: A dataset is received that is for processing by a machine learning model. A scoring payload for the dataset and that regards the machine learning model is also received. A set of features of the machine learning model is determined by analyzing the scoring payload. The scoring payload is structured in accordance with the set of features such that the structured scoring payload is ready for analysis for a monitor of the machine learning model.
    Type: Grant
    Filed: July 22, 2020
    Date of Patent: September 5, 2023
    Assignee: International Business Machines Corporation
    Inventors: Rafal Bigaj, Lukasz G. Cmielowski, Wojciech Sobala, Maksymilian Erazmus
  • Patent number: 11704220
    Abstract: An overall performance metric of a computer system may be determined for each bin of the set of analysis bins. In case one or more bins of the set of analysis bins do not have at least a predefined minimum number of records, a new set of analysis bins may be redefined by joining analysis bins of the set of analysis bins. For each bin of the redefined set of bins a machine learning (ML) performance metric of the ML model may be computed. The ML performance metric may be estimated for the set of analysis bins using the ML performance metrics of the redefined bins. The computer system may be configured based on a correlation over the set of analysis bins between the computed overall performance metric and the ML performance metric.
    Type: Grant
    Filed: April 11, 2022
    Date of Patent: July 18, 2023
    Assignee: International Business Machines Corporation
    Inventors: Rafal Bigaj, Lukasz G. Cmielowski, Wojciech Sobala, Maksymilian Erazmus
  • Patent number: 11699082
    Abstract: A method, system, and computer program product for correlation detection between artificial intelligence (AI) transactions. The method stores a set of transaction records associated with an AI decision engine. Each transaction record has a set of record characteristics. The method assigns the set of transaction records to a set of batches on the set of record characteristics. A set of batch characteristics are determined for a batch of the set of batches. The method determines one or more correlations among the set of batch characteristics. The one or more correlations are compared with one or more threshold batches. The method determines, from the one or more correlations and the comparing, an impact of one or more recommendations of the AI decision engine. The one or more recommendations are defined by the set of transaction records.
    Type: Grant
    Filed: November 21, 2019
    Date of Patent: July 11, 2023
    Assignee: International Business Machines Corporation
    Inventors: Lukasz G. Cmielowski, Wojciech Sobala, Maksymilian Erazmus, Rafal Bigaj
  • Patent number: 11699071
    Abstract: A first machine learning model processes a set of inputs to generate a first set of results. Based on that first set of results, a quality control range is calculated. A second machine learning model calculates a mean accuracy of a second set of results, based on the set of inputs. A determination of whether the mean accuracy of the second set of results is within the quality control range is made, and a user is notified of that determination.
    Type: Grant
    Filed: November 20, 2019
    Date of Patent: July 11, 2023
    Assignee: International Business Machines Corporation
    Inventors: Lukasz G. Cmielowski, Wojciech Sobala, Rafal Bigaj, Maksymilian Erazmus
  • Patent number: 11676063
    Abstract: Aspects of the present invention provide an approach for exposing payloads from non-integrated machine learning systems. A generic binding identifier is established to represent a machine learning (ML) system among a set of non-integrated learning systems. A generic subscription identifier is established to represent a deployed model in the ML system. Payload data including a user request, a response, the generic binding identifier, and the generic subscription identifier are received from the ML system and stored in a database for later analysis to identify any issues related to the deployed model.
    Type: Grant
    Filed: March 28, 2019
    Date of Patent: June 13, 2023
    Assignee: International Business Machines Corporation
    Inventors: Lukasz G. Cmielowski, Bartlomiej T. Malecki, Rafal Bigaj, Maria H. Oleszkiewicz
  • Publication number: 20230169148
    Abstract: A method, computer system, and a computer program product for providing reduced training data for training a machine learning model (ML-model) is disclosed. The present invention may include generating the reduced training data dependent on a first two batches of data records of original training data. The present invention may include generating an updated version of the reduced training data dependent on the reduced training data and a further batch of the original training data, wherein a size of the updated version of the reduced training data is equal or less than a size of the limited storage capacity and less than a combined size of the reduced training data and the further batch together. The reading of a further batch of the data records of the original training data and the generating of an updated version of the reduced training data may be repeated.
    Type: Application
    Filed: November 30, 2021
    Publication date: June 1, 2023
    Inventors: Lukasz G. Cmielowski, Amadeusz Masny, Daniel Jakub Ryszka, Wojciech Sobala
  • Publication number: 20230115067
    Abstract: The present disclosure relates to a computer-implemented method for generating a machine learning module (ML-module). The method comprises providing training data comprising a first set of data corresponding to a first feature and a second set of data corresponding to a second feature and generating a further set of data, wherein the further set of data corresponds to a further feature. A first correlation metric as a measure of a correlation between a selected feature of the first and the second feature and the further feature is calculated. Furthermore, a bias metric is determined indicating a strength of a bias of the trained ML-module towards a first subset of values of the further feature or a second subset of values of the further feature if the first correlation metric is greater than a first threshold. If the bias metric satisfies the bias constraint, then the ML-module is released for usage.
    Type: Application
    Filed: October 13, 2021
    Publication date: April 13, 2023
    Inventors: Lukasz G. Cmielowski, Szymon Kucharczyk, Dorota Laczak, Daniel Jakub Ryszka
  • Publication number: 20230061222
    Abstract: Disclosed herein is a method of training an artificial intelligence model that comprises an iterative training loop. Said iterative training loop comprises: receiving a current set of training data; dividing said current set of training data into a predetermined number of training data subsets; sequentially training said artificial intelligence model with each of said predetermined number of training data subsets using a training portion and calculating a performance metric using a validation portion; and comparing performance metrics from a previous iteration of said iterative training loop to said calculated performance metric to determine if an improving performance metric condition is met. Said method further comprises halting said iterative training loop unless said improving performance metric condition is not met at least once within a predetermined number of previous iterations.
    Type: Application
    Filed: August 30, 2021
    Publication date: March 2, 2023
    Inventors: Lukasz G. Cmielowski, Daniel Jakub Ryszka, Wojciech Sobala, Gregory Bramble
  • Publication number: 20230068816
    Abstract: The present disclosure relates to a method for generating a machine learning (ML) model. The method comprises: receiving a selection of a desired value of the metric for predicting a value of the first label attribute based on a current training dataset. Previously obtained sets of training settings may be used for determining a set of training settings that corresponds to the desired value of the metric. A ML engine may be controlled to generate using the current training dataset a machine learning model according to the determined set of training settings. The generated machine learning model may be deployed for performing predictions of values of the first label attribute.
    Type: Application
    Filed: August 26, 2021
    Publication date: March 2, 2023
    Inventors: Lukasz G. Cmielowski, Daniel Jakub Ryszka, Wojciech Sobala, Jan Soltysik
  • Patent number: 11593388
    Abstract: A method and a computer program product are used generating an index of a scoring payload dataset. Correlation coefficients for correlations between input data values and output data values of the machine learning model provided by the scoring payload datasets as well as performance data values of the processes provided by process datasets are calculated. Features of which feature values are used as input data values are ranked according to their importance using the correlation coefficients. For the features of a set of highest-ranking features feature value sets with feature values of the respective features are selected from the scoring payload datasets and a database index of the selected feature value sets is generated.
    Type: Grant
    Filed: March 19, 2021
    Date of Patent: February 28, 2023
    Assignee: International Business Machines Corporation
    Inventors: Rafal Bigaj, Lukasz G. Cmielowski, Wojciech Sobala, Maksymilian Erazmus
  • Patent number: 11556860
    Abstract: An ML root model format having a root model definition, is converted into an ML target model format not having the root model definition. A learning system is assigned to the root model definition that is convertible to the machine learning target model format. A new version of the root model definition is ported from the ML root model to the format of the ML target model to generate a new version of the machine learning target model after a learning iteration of the learning system. Quality limits are determined using an X control chart method based on a cross-validation of fold results generated during a validation of the new version of the machine learning root model. A quality metric value of the new version of the ML target model is evaluated against the quality threshold values of the new version of the ML root model.
    Type: Grant
    Filed: April 15, 2019
    Date of Patent: January 17, 2023
    Assignee: International Business Machines Corporation
    Inventors: Lukasz G. Cmielowski, Rafal Bigaj, Blazej Rafal Rutkowski, Wojciech Sobala
  • Patent number: 11551152
    Abstract: Identifying input feature significance for a machine learning model is provided. A set of scoring payload data corresponding to a set of input features of the machine learning model is sliced into a predefined number of batches. Using the sliced set of scoring payload data, a correlation coefficient matrix for each respective input feature of the machine learning model is generated based on input of each particular input feature into the machine learning model and a corresponding output from the machine learning model for each particular input feature. A correlation coefficient value is extracted from the correlation coefficient matrix for each particular input feature and the corresponding output from the machine learning model. A set of significant input features having a corresponding correlation coefficient value greater than a predefined correlation coefficient threshold level is identified. A set of action steps is performed regarding the set of significant input features.
    Type: Grant
    Filed: September 3, 2020
    Date of Patent: January 10, 2023
    Assignee: International Business Machines Corporation
    Inventors: Lukasz G. Cmielowski, Maksymilian Erazmus, Rafal Bigaj, Wojciech Sobala
  • Publication number: 20220414530
    Abstract: A system may be configured to perform operations to select a machine learning model. The operations may include training machine learning models with training data of a training data set and obtaining a first value representing a first required runtime for training each machine learning model. The operations may include evaluating, based on the first value, a second value representing a second required runtime for training the machine learning model with a complete training data set. The operations may include calculating a final score for each machine learning model in a group of machine learning models, wherein the calculating is performed on a basis of the second values for the machine learning models, ranking the machine learning models based on the final score to obtain ranks, and selecting the machine learning model that has obtained a highest rank in the ranking.
    Type: Application
    Filed: June 25, 2021
    Publication date: December 29, 2022
    Inventors: Lukasz G. Cmielowski, Szymon Kucharczyk, Daniel Jakub Ryszka, Thomas Parnell
  • Publication number: 20220413821
    Abstract: A method, computer system, and a computer program product for deploying at a client system a machine learning model is provided. The present invention may include requesting, from a training system, information on a training environment in which a machine learning model was trained. The present invention may include determining a compatibility of a local environment of a client system with the training environment of the training system based on the information on the training environment. The present invention may include determining the local environment of the client system is compatible with the training environment of the training system. The present invention may include downloading the machine learning model.
    Type: Application
    Filed: June 28, 2021
    Publication date: December 29, 2022
    Inventors: Lukasz G. Cmielowski, AMADEUSZ MASNY, Daniel Jakub Ryszka, Wojciech Jakub Jargielo