Patents Assigned to DataRobot, Inc.
  • Publication number: 20240086736
    Abstract: A system can include a data processing system that can include memory and one or more processors to generate, by a first model trained using machine learning and compatible with first data having a first type and second data having a second type, a first metric based on the first data and indicating a first fault probability in a second model, generate, by the first model, a second metric based on the second data and indicating a second fault probability in a third model, determine, based on the first metric and the second metric, that an aggregate model that includes the second model and the third model satisfies a heuristic indicating a third fault probability in the aggregate model, and instruct, in response to a determination that the aggregate model satisfies the heuristic, a user interface to present an indication that the aggregate model satisfies the heuristic.
    Type: Application
    Filed: November 17, 2023
    Publication date: March 14, 2024
    Applicant: DataRobot, Inc.
    Inventors: Edward Kwartler, Jett Oristaglio, Sarah Khatry, Haniyeh Mahmoudian, Scott Lindeman, Oleksandr Bagan, Vlad Vovk, Wesley Hedrick, Kent Borg, Alex Shoop, Nikita Striuk, Gianni Saporiti, Alisa Zosimova, Oleksandr Pikovets, Anton Bogatyrov
  • Publication number: 20240086725
    Abstract: Aspects of this technical solution can segment a first time period for the first series into a second time period bounded by a first time stamp and a second time stamp later than the first time stamp, and into a third time period bounded by a third time stamp later than the second timestamp and a fourth time stamp later than the third time stamp, determine a metric for the third time period and based on first data points of a training data set for the first series and having time stamps bounded by the first time stamp and the second time stamp within the second time period, generate data points within the third time period based on the first metric and generate data points corresponding to a performance of a second series subsequent to the prediction time stamp.
    Type: Application
    Filed: August 31, 2023
    Publication date: March 14, 2024
    Applicant: DataRobot, Inc.
    Inventors: Jonas Marius Vilkas, Mykhailo Poliakov, Iryna Kovalchuk
  • Publication number: 20240086775
    Abstract: Presented herein are methods and systems for generating and executing applications that provide insights to a model's operation without requiring the user to have knowledge of coding, computer programming, or artificial intelligence machine-learning methodologies. An exemplary method includes deploying a model using input data to generate a predicted dataset; presenting indications for a plurality of applications associated with the deployed model including an configured to generate new scenarios and another application configured to optimize at least one feature; presenting a plurality of features analyzed by the model; and in response to receiving a selection of a feature of the plurality of features and a new value for the feature, executing the first application to generate a second predicted dataset using the new value.
    Type: Application
    Filed: November 10, 2023
    Publication date: March 14, 2024
    Applicant: DataRobot, Inc.
    Inventors: Jeremy Achin, Ina Ko, Borys Kupar, Tristan Spaulding, Yulia Bezuhla, Brett Rowley, Colleen Wilhide
  • Publication number: 20240078163
    Abstract: A system to deploy virtual sensors to a machine learning project and translate data of the machine learning project is provided. The system can deploy, for a machine learning project, a plurality of virtual sensors at a first location of a plurality of locations to detect metadata of a data source of the machine learning project, at a second location of the plurality of locations to detect deployment information of a model trained for the machine learning project, and at a third location of the plurality of locations to detect learning session information for creation of the model. The system can collect, via the plurality of virtual sensors, data for the machine learning project. The system can translate, for render on a computing system, the data collected via the plurality of virtual sensors into a plurality of graphics.
    Type: Application
    Filed: November 10, 2023
    Publication date: March 7, 2024
    Applicant: DataRobot, Inc.
    Inventors: Jeremy Achin, Michael Schmidt, Dmitry Zahanych, Alexander Jason Conway, Benjamin Taylor, Michael William Gilday, Uros Perisic, Andrii Chulovskyi, Romain Briot, Sully Matthew Sullenberger
  • Publication number: 20240078093
    Abstract: Customizing an automated machine learning system is provided. The system receives a request to establish computer-executable operations for use with machine learning on a data set. The system provides, for display via a graphical user interface on the client device, an indication of a set of computer-executable operations generated automatically for machine learning on the data set by the system responsive to the request. The system receives, from the client device via the graphical user interface, an indication to modify the set of computer-executable operations. The system establishes compatibility of the set of computer-executable operations responsive to the modification. The system constructs, responsive to establishment of the compatibility, the set of computer-executable operations for use with machine learning.
    Type: Application
    Filed: November 10, 2023
    Publication date: March 7, 2024
    Applicant: DataRobot, Inc.
    Inventors: Sylvain Ferrandiz, Zachary Mayer, Jason Jay McGhee, Joshua David Preuss, Mikhail Yakubovskiy
  • Patent number: 11922329
    Abstract: A predictive modeling method may include obtaining a fitted, first-order predictive model configured to predict values of output variables based on values of first input variables; and performing a second-order modeling procedure on the fitted, first-order model, which may include: generating input data including observations including observed values of second input variables and predicted values of the output variables; generating training data and testing data from the input data; generating a fitted second-order model of the fitted first-order model by fitting a second-order model to the training data; and testing the fitted, second-order model of the first-order model on the testing data. Each observation of the input data may be generated by (1) obtaining observed values of the second input variables, and (2) applying the first-order predictive model to corresponding observed values of the first input variables to generate the predicted values of the output variables.
    Type: Grant
    Filed: December 20, 2019
    Date of Patent: March 5, 2024
    Assignee: DataRobot, Inc.
    Inventors: Jeremy Achin, Thomas DeGodoy, Timothy Owen, Xavier Conort, Sergey Yurgenson, Mark L. Steadman, Glen Koundry, Hon Nian Chua
  • Publication number: 20240064074
    Abstract: Aspects of this technical solution can generate, according to a lag time window based at least in part on a first plurality of features, a second data set via aggregation of compatible fields in the first data set, the first plurality of features corresponding to a first data set, augment the first plurality of features extracted from the first data set with a second plurality of features extracted from a third data set, the third data set corresponding to a join of the first data set and the second data set, update, via machine learning and according to a rate corresponding to the data set, a model with the third plurality of features, and instruct a user interface to present at least one performance of the model with the third plurality of features, according to the rate.
    Type: Application
    Filed: August 15, 2023
    Publication date: February 22, 2024
    Applicant: DataRobot, Inc.
    Inventors: Rishabh Raman, Peter Simon, Oleg Zarakhani
  • Publication number: 20240028959
    Abstract: Re-binning and smoothing an indicator table is provided. A system can identify a table generated a model trained with machine learning, the table including bins for ranges of values of a feature and coefficients that indicate a level of a target for the bins. The system can receive, via a graphical user interface from a client device, a request to modify bins of the table. The system can establish, responsive to the request, a spline to fit the table based at least in part on a cost function weighted based on a number of entries of the feature for the ranges of values of the feature. The system can generate, via the spline established based at least in part on the cost function, a second table including second bins and second coefficients. The system can generate data to cause the graphical user interface to include a graphic representation of the second table.
    Type: Application
    Filed: July 19, 2023
    Publication date: January 25, 2024
    Applicant: DataRobot, Inc.
    Inventors: Glen Koundry, Mikhail Yakubovskiy
  • Publication number: 20240028828
    Abstract: Aspects of this technical solution can identify a plurality of n-grams at a plurality of locations in a first data set comprising text, generate, via a model trained with machine learning, a first prediction for the first data set, generate, via the model, a second prediction for a second data set that lacks the first n-gram at a first location of the plurality of locations, generate, by comparing a first prediction for the first data set with a second prediction for the second data set, an impact of the first n-gram at the first location, and cause a user interface to present at least a portion of the first data set with a visual indication corresponding to the impact, the visual indication applied to a portion of the user interface corresponding to the first n-gram and positioned in the user interface at the first location.
    Type: Application
    Filed: July 24, 2023
    Publication date: January 25, 2024
    Applicant: DataRobot, Inc.
    Inventors: Anton Kasyanov, Jonathan Chang, Mykyta Yarmak, Ee Kin Chin
  • Publication number: 20230394361
    Abstract: Machine learning model searching using meta data is provided. A system receives, via a graphical user interface from a client device, a request to search for one or more blueprints including one or more models to add to a project. The system can identify, based on a selection, a list of features with which to execute the requested search. The system can provide a blueprint including a model selected from projects established via input from client devices different from the client device, the projects including blueprints, the blueprints including models trained by machine learning. The system can train, via machine learning, the model of the blueprint to determine the target and add the blueprint including the trained model to the project. The system can generate data causing the graphical user interface to display an indication of the blueprint including the trained model.
    Type: Application
    Filed: May 31, 2023
    Publication date: December 7, 2023
    Applicant: DataRobot, Inc.
    Inventors: Ho Nian Chua, Michael Schmidt, Zachary Meyer, Senbong Gee, Mark Steadman, Alex Conway, Lingjun Kang
  • Publication number: 20230316137
    Abstract: Automated spatial feature engineering techniques may include (1) automatically deriving new features (e.g., spatial lags) based on spatial relationships between or among observations, (2) using parameter optimization techniques to optimize parameters of the spatial feature engineering process (e.g., parameters relating to the size of spatial neighborhoods and/or to the orders of spatial lags), (3) automatically deriving new spatial features representing geometric properties and/or spatial statistics associated with individual spatial observations, (4) determining the feature importance of location features, and/or (5) automatically partitioning spatial datasets such that spatial leakage is reduced, which generally leads to the development of more accurate spatial models. Such techniques may involve joint treatment of distinct location coordinate features as a single location feature for purposes of determining feature importance.
    Type: Application
    Filed: January 17, 2023
    Publication date: October 5, 2023
    Applicant: DataRobot, Inc.
    Inventors: David Blumstein, Lingjun Kang, Andrey Mukomolov, Joseph O’Halloran, Eric Reyes, Rohit Sharma, Kevin Stofan, Pavel Tyslacki
  • Publication number: 20230297043
    Abstract: A system to generate scenarios by modifying values of machine learning features is provided. The system can present a first indication in a first coordinate space of a first performance generated by a model trained with a plurality of features using machine learning. The system can present a second indication in a second coordinate space of a first performance of the first feature. The system can receive a modification to a value in the second coordinate space of the first feature. The system can determine a second performance of the model using machine learning based on a first derived feature to output derived data points in the time period. The system can present in the first coordinate space, a third indication of the second performance of the model overlaid with the first indication of the first performance of the model.
    Type: Application
    Filed: March 15, 2022
    Publication date: September 21, 2023
    Applicant: DataRobot, Inc.
    Inventors: Ina Ko, Borys Kupar, Yulia Bezhula
  • Patent number: 11748653
    Abstract: Apparatuses, systems, program products, and method are disclosed for machine learning abstraction. An apparatus includes an objective module configured to receive an objective to be analyzed using machine learning. An apparatus includes a grouping module configured to select a logical grouping of one or more machine learning pipelines to analyze a received objective. An apparatus includes an adjustment module configured to dynamically adjust one or more machine learning settings for a logical grouping of one or more machine learning pipelines based on feedback generated in response to analyzing a received objective.
    Type: Grant
    Filed: June 5, 2018
    Date of Patent: September 5, 2023
    Assignee: DataRobot, Inc.
    Inventors: Nisha Talagala, Vinay Sridhar, Swaminathan Sundararaman, Sindhu Ghanta, Lior Amar, Lior Khermosh, Bharath Ramsundar, Sriram Subramanian, Drew Roselli
  • Publication number: 20230206610
    Abstract: Disclosed herein at methods and systems for visualizing machine learning model performance. One method comprises receiving a request to provide a visual representation of a machine learning technique executed on a set of images to generate a first attribute and a second attribute for each image; executing the machine learning model to receive the first and the second attribute for each image; mapping the first attribute to a visual distinctiveness protocol; identifying a distance for each image, the distance representing a difference between the second attribute predicted by the model for each pair of respective images within the set of images; and providing for display at least a subset of the set of images arranged in accordance with their respective distance and having a visual attribute corresponding to the mapped first attribute for each image.
    Type: Application
    Filed: December 29, 2022
    Publication date: June 29, 2023
    Applicant: DataRobot, Inc.
    Inventors: Ivan Pyzow, Pavlo Kochubei, Yehor Kolchyba, Sylvain Ferrandiz, Anton Kasyanov
  • Publication number: 20230196101
    Abstract: An automated machine learning (“ML”) method may include training a first machine learning model using a first machine learning algorithm and a training data set; validating the first machine learning model using a validation data set, wherein validating the first machine learning model comprises generating an error data set; training a second machine learning model to predict a suitability of the first machine learning model for analyzing an inference data set, wherein the second machine learning model is trained using a second machine learning algorithm and the error data set; and triggering a remedial action associated with the first or second machine learning model in response to a predicted suitability of the first machine learning model for analyzing the inference data set not satisfying a suitability threshold.
    Type: Application
    Filed: November 16, 2022
    Publication date: June 22, 2023
    Applicant: DataRobot, Inc.
    Inventors: Sindhu Ghanta, Drew Roselli, Nisha Talagala, Vinay Sridhar, Swaminathan Sundararaman, Lior Amar, Lior Khermosh, Bharath Ramsundar, Sriram Subramanian
  • Publication number: 20230186116
    Abstract: Aspects of this technical solution can identify, by a second machine learning model receiving as input first features, second features having respective impact metrics that satisfy an impact threshold, the impact threshold indicating that the second features modify various forecast data points, cause a graphical user interface to present the forecast including one or more of the first features having respective first visual properties corresponding to identifiers of respective ones of the first features, cause the graphical user interface to present the forecast including the second features having a second visual property corresponding to an indication that the second features satisfy the impact threshold, and cause the graphical user interface to modify the forecast including the second features to include an explanation portion including metrics of the second features, the metrics corresponding to respective time points of a time dependency relationship.
    Type: Application
    Filed: December 9, 2022
    Publication date: June 15, 2023
    Applicant: DataRobot, Inc.
    Inventors: Ina Ko, Borys Kupar, Yulia Bezhula, Kyrylo Kniazev
  • Publication number: 20230186174
    Abstract: Segmenting data and forecasting by a combination of models trained on segmented data is provided. A system compares, with a first model, values of timestamps corresponding to data points to determine a time series dependency between the data points. The system generates, with the first model and based on the time series dependency, a first cluster with first data points and a second cluster with second data points. The system allocates, by a controller, a second model to the first cluster, and a third model to the second cluster. The system trains the second model based on the time series dependency and the first data points. The system trains the third model based on the time series dependency and the second data points. The system generates a fourth model based on a combination of the second trained model and the third trained model.
    Type: Application
    Filed: December 9, 2022
    Publication date: June 15, 2023
    Applicant: DataRobot, Inc.
    Inventors: Matt Nitzken, David McGarry, Roman Midianyi, Anatolli Stehni
  • Publication number: 20230186175
    Abstract: Comparing a challenger model with a primary model is provided herein. In an embodiment, a system comprises one or more processors, coupled to memory, configured to determine, based on a comparison of a first model that is deployed as a primary model with a second model that is acting as a challenger model, that the second model performs better than the first model based on at least one performance metric; determine, based on a comparison of a characteristic of the first model with a characteristic of the second model, to skip a validation process for the second model; and establish the second model as the primary model in the deployment to replace the first model in the deployment.
    Type: Application
    Filed: December 9, 2022
    Publication date: June 15, 2023
    Applicant: DataRobot, Inc.
    Inventors: Bohdan Usatov, Chris Li, Evan Chang, Tristan Spauding, Christopher Cozzi
  • Publication number: 20230091610
    Abstract: This disclosure relates generally to using machine learning models to generate current time-series features using machine learning and validate time-series machine learning model output. At least one aspect is directed to a system with one or more processors, coupled to memory, to segment a time series range into a first segment for an instance of time, the segment associated with a value for a target feature and a timestamp for the value, segment the time series range into an input segment associated with a plurality of input features and a segment timestamp less than or equal to the timestamp, generate a model trained with input comprising values for the target feature and timestamps for the values less than or equal to the segment timestamp, and transform at least one of the input features based at least on the model.
    Type: Application
    Filed: September 12, 2022
    Publication date: March 23, 2023
    Applicant: DataRobot, Inc.
    Inventors: Anastasiia Tamazlykar, Igor Iaroshenkno, Mark Steadman, Jilian Schwiep, Peter Michael Simon, Zachary Deane-Mayer, Brett Rowley, Jing Qiang Goh
  • Publication number: 20230083891
    Abstract: Disclosed herein are methods and systems to generate and revise a workflow that utilizes machine learning model nodes and other analytical nodes to analyze data and generate a decision via allowing a user to interact with input elements of a graphical user interface. The methods and systems use a processor to provide, for rendering by a user device, a graphical user interface comprising at least a first graphical indicator corresponding to a computer model node within workflow code and a second graphical indicator corresponding to a decision node within the workflow code, the computer model node visually connected with the decision node; and in response to receiving, via a user interacting with the graphical user interface, an additional node corresponding to at least one analytical protocol, revise the workflow code, by adding the analytical protocol before an execution of the decision node.
    Type: Application
    Filed: September 12, 2022
    Publication date: March 16, 2023
    Applicant: DataRobot, Inc.
    Inventors: Jeremy Achin, Ina Ko, Stephen James Millet, Daniel Thomas Trost, Igor Veksler