Patents by Inventor Jacques Doan Huu

Jacques Doan Huu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Declarative debriefing for predictive pipeline

Patent number: 11823073

Abstract: Provided are systems and methods for auto-completing debriefing processing for a machine learning model pipeline based on a type of predictive algorithm. In one example, the method may include one or more of building a machine learning model pipeline via a user interface, detecting, via the user interface, a selection associated with a predictive algorithm included within the machine learning model pipeline, in response to the selection, identifying debriefing components for the predictive algorithm based on a type of the predictive algorithm from among a plurality of types of predictive algorithms, and automatically incorporating processing for the debriefing components within the machine learning model pipeline such that values of the debriefing components are generated during training of the predictive algorithm within the machine learning model pipeline.

Type: Grant

Filed: November 14, 2018

Date of Patent: November 21, 2023

Assignee: SAP SE

Inventor: Jacques Doan Huu
DISCRIMINATION LIKELIHOOD ESTIMATE FOR TRAINED MACHINE LEARNING MODEL

Publication number: 20230342659

Abstract: Systems and methods include reception of a plurality of records, each of the plurality of records associating each of a plurality of features with a respective value, a second feature with a value, and a target feature with a value, a first machine learning model trained based on the plurality of records to output a value of the target feature based on values of each of the plurality of features, a second machine learning model trained based on the plurality of records to output a value of the second feature based on the values of each of the plurality of features, determination, based on the trained second machine learning model, of a first one or more of the plurality of features which are correlated to the second feature, determination of an influence of each of the first one or more features on the trained first machine learning model, and determination of a first value associated with the second feature based on the determined influences and on the trained second machine learning model.

Type: Application

Filed: April 25, 2022

Publication date: October 26, 2023

Inventor: Jacques DOAN HUU
SURROGATE MODEL FOR TIME-SERIES MODEL INTERPRETATION

Publication number: 20230342671

Abstract: Provided is a system and method which build a composite time-series machine learning model including a core model and a debrief model that includes a combination of the core model and a surrogate model. In one example, the method may include executing the plurality of models on test data and determining accuracy values and interpretability toughness values for the plurality models, selecting a most accurate model as a core model based on the accuracy values and select a most interpretable model as a surrogate model from among other models remaining in the plurality of models based on the interpretability toughness values, building a composite model comprising the core model, the surrogate model, and instructions for generating a debrief model for debriefing the core model based on a combination of the core model and the surrogate model, and storing the composite model within the memory.

Type: Application

Filed: April 25, 2022

Publication date: October 26, 2023

Inventor: Jacques DOAN HUU
SUPERVISED DIMENSIONALITY REDUCTION FOR LEVEL-BASED HIERARCHICAL TRAINING DATA

Publication number: 20230342628

Abstract: Systems and methods include identification first members of a child level of a dimension hierarchy which are associated with boundaries between second members of a parent level of the dimension hierarchy, training of a decision tree model based on data associated with the child level, extraction of predicates on the child level from the trained decision tree model, determination of a value based on the identified first members of the child level and on the extracted predicates on the child level, and determination, based on the value, whether to include the parent level and the child level within training data or to include the parent level and not include the child level within the training data.

Type: Application

Filed: April 25, 2022

Publication date: October 26, 2023

Inventor: Jacques DOAN HUU
TIME-SERIES FORECASTING BASED ON DETECTED DOWNTIME

Publication number: 20230342632

Abstract: Provided is a system and method which generates a composite machine learning model that can filter downtime data from a time-series data signal and perform a prediction on remaining time-series data. In one example, the method may include detecting a pattern of downtime data within a time-series data signal, removing a subset of data from the time-series data based on the detected pattern of downtime and building a machine learning model to make predictions based on remaining data in the time-series data, generating segregation instructions configured to remove downtime data from a time-series data signal of a same type and to predict zero on future dates matching the downtime segregation codes, and building a composite machine learning model that includes the trained machine learning model and the segregation instructions for filtering data that is input to the trained machine learning models.

Type: Application

Filed: April 22, 2022

Publication date: October 26, 2023

Inventor: Jacques DOAN HUU
OPTIMIZING GRADIENT BOOSTING FEATURE SELECTION

Publication number: 20230206083

Abstract: Gradient Boosting Decision Tree (GBDT) successively stacks many decision trees which at each step try to fix the residual errors from the previous steps. The final score produced by the GBDT is the sum of the individual scores obtained by the decision trees for an input vector. Overfitting in GBDT can be reduced by removing the input values that have the least impact on the output from the training data. One way to determine which input variable has the lowest predictive value is to determine the input variable that is used for the first time in the latest decision tree in the GBDT. This method of identifying the low-predictive features to be removed does not require that earlier trees be regenerated to generate the new GBDT. Since the removed feature was already not used in the earlier trees, those trees already ignore the removed feature.

Type: Application

Filed: March 6, 2023

Publication date: June 29, 2023

Inventor: Jacques Doan Huu
Optimizing gradient boosting feature selection

Patent number: 11620537

Abstract: Gradient Boosting Decision Tree (GBDT) successively stacks many decision trees which at each step try to fix the residual errors from the previous steps. The final score produced by the GBDT is the sum of the individual scores obtained by the decision trees for an input vector. Overfitting in GBDT can be reduced by removing the input values that have the least impact on the output from the training data. One way to determine which input variable has the lowest predictive value is to determine the input variable that is used for the first time in the latest decision tree in the GBDT. This method of identifying the low-predictive features to be removed does not require that earlier trees be regenerated to generate the new GBDT. Since the removed feature was already not used in the earlier trees, those trees already ignore the removed feature.

Type: Grant

Filed: April 24, 2020

Date of Patent: April 4, 2023

Assignee: SAP SE

Inventor: Jacques Doan Huu
AUTOMATIC MACHINE LEARNING FEATURE BACKWARD STRIPPING

Publication number: 20230026391

Abstract: Features are used to train one or more ML models in a modelling layer. In a feature selection layer, each generated ML model is analyzed to determine, for each input feature, a degree of importance of the feature on the results generated by the ML model. Features with low importance are identified and the information is propagated backward to the data source and feature engineering layers. In response, the data source and feature engineering layers refrain from gathering or generating the unimportant features. Based on a confidence measure of the determination that each feature is important or unimportant, a number of periods between reevaluation of the feature importance is determined. After the number of periods has elapsed, a removed feature is restored to the pipeline.

Type: Application

Filed: September 29, 2022

Publication date: January 26, 2023

Inventor: Jacques Doan Huu
Automatic generative process bridge between analytics models

Patent number: 11561940

Abstract: Disclosed herein are system, method, and computer program product embodiments for generating a bridge between analytical models. In an embodiment, a server can extract a first variable dependency schema from a first model (e.g., predictive model or business intelligence report) and a second variable schema from a second model (e.g., predictive model or business intelligence report). The first variable dependency schema includes a first definition of a relationship between a first variable and a second variable. The server can compare the first variable dependency schema and the second variable dependency schema. Furthermore, the server can generate a modification to be made in the second variable dependency schema based on the first definition of the relationship between the first and second variable and outputs the modification to be made to the second variable dependency schema.

Type: Grant

Filed: August 6, 2020

Date of Patent: January 24, 2023

Assignee: SAP SE

Inventor: Jacques Doan Huu
In-database predictive pipeline incremental engine

Patent number: 11521089

Abstract: A predictive model pipeline data store may contain electronic records defining a predictive model pipeline composed of operation nodes. Based on the information in the data store, an execution framework platform may calculate a hash value for each operation node by including all recursive dependencies using ancestor node hash values and current node parameters. The platform may then compare each computed hash value with a previously computed hash value associated with a prior execution of a prior version of the pipeline. Operation nodes that have an unchanged hash value may be tagged “idle.” Operation nodes that have a changed hash value may be tagged “train and apply” or “apply” based on current node parameters (and an “apply” tag may propagate backwards through the pipeline to ancestor nodes). The platform may then ignore the operation nodes tagged “idle” when creating a physical execution plan to be provided to a target platform.

Type: Grant

Filed: November 29, 2018

Date of Patent: December 6, 2022

Assignee: SAP SE

Inventors: Scott Kumar Cameron, Olivier Hamon, Gabriel Kevorkian, Eric Gouthiere, Jacques Doan Huu
FEATURE SELECTION FOR MODEL TRAINING

Publication number: 20220366315

Abstract: Systems and methods include determination of a first plurality of sets of data, each including values associated with respective ones of a first plurality of features, partial training of a first machine-learning model based on the first plurality of sets of data, determination of one or more of the first plurality of features to remove based on the partially-trained first machine-learning model, removal of the one or more of the first plurality of features to generate a second plurality of sets of data, partial training of a second machine-learning model based on the second plurality of sets of data, determination that a performance of the partially-trained second machine-learning model is less than a threshold, addition, in response to the determination, of the one or more of the first plurality of features to the second plurality of sets of data, and training of the partially-trained first machine-learning model based on the first plurality of sets of data.

Type: Application

Filed: May 6, 2021

Publication date: November 17, 2022

Inventors: Louis DESREUMAUX, Jacques DOAN HUU
Automatic machine learning feature backward stripping

Patent number: 11494699

Abstract: Features are used to train one or more ML models in a modelling layer. In a feature selection layer, each generated ML model is analyzed to determine, for each input feature, a degree of importance of the feature on the results generated by the ML model. Features with low importance are identified and the information is propagated backward to the data source and feature engineering layers. In response, the data source and feature engineering layers refrain from gathering or generating the unimportant features. Based on a confidence measure of the determination that each feature is important or unimportant, a number of periods between reevaluation of the feature importance is determined. After the number of periods has elapsed, a removed feature is restored to the pipeline.

Type: Grant

Filed: May 6, 2020

Date of Patent: November 8, 2022

Assignee: SAP SE

Inventor: Jacques Doan Huu
TIME-SERIES ANOMALY PREDICTION AND ALERT

Publication number: 20220335347

Abstract: Provided is a system and method which can identify a causal relationship for anomalies in a time-series signal based on co-occurring and preceding anomalies in another time-series signal. In one example, the method may include identifying a recurring anomaly within a time-series signal of a first data value, determining a time-series signal of a second data value that is a cause of the recurring anomaly in the time-series signal of the first data value based on a preceding and co-occurring anomaly in the time-series signal of the second data value, and storing a correlation between the preceding and co-occurring anomaly in the time-series signal of the second data value and the recurring anomaly in the time-series signal of the first data value.

Type: Application

Filed: April 15, 2021

Publication date: October 20, 2022

Inventor: Jacques DOAN HUU
DETERMINING COMPONENT CONTRIBUTIONS OF TIME-SERIES MODEL

Publication number: 20220335314

Abstract: Provided is a system and method which decomposes a predicted output signal of a time-series forecasting model into a plurality of sub signals that correspond to a plurality of components, and determines and displays a global contribution of each component. In one example, the method may include iteratively predicting an output signal of a time-series data value via execution of a time-series model, decomposing the predicted output signal into a plurality of component signals corresponding to a plurality of components of the time-series machine learning algorithm, respectively, and displaying the plurality of global values via a user interface.

Type: Application

Filed: April 19, 2021

Publication date: October 20, 2022

Inventors: Jacques DOAN HUU, Elouan ARGOUARCH
HORIZON-BASED SMOOTHING OF FORECASTING MODEL

Publication number: 20220188383

Abstract: Provided is a system and method which trains a model based on a horizon-wise cost function which accounts for error across a horizon rather than just a next point in time thereby improving the accuracy of the trained model in the long term. In one example, the method may include storing time-series data, executing a training iteration for a machine learning model based on one or more parameter values, determining error values between the predicted values output by the machine learning model and actual values of the time-series data for a plurality of intervals included in a horizon of the time-series data, generating a total error value for the horizon based on the determined error values for the intervals, and storing the generated total error value for the horizon. The method also enables a user to dynamically adjust a weight for each interval of the horizon.

Type: Application

Filed: December 14, 2020

Publication date: June 16, 2022

Inventor: Jacques DOAN HUU
AUTOMATIC GENERATIVE PROCESS BRIDGE BETWEEN ANALYTICS MODELS

Publication number: 20220043784

Abstract: Disclosed herein are system, method, and computer program product embodiments for generating a bridge between analytical models. In an embodiment, a server can extract a first variable dependency schema from a first model (e.g., predictive model or business intelligence report) and a second variable schema from a second model (e.g., predictive model or business intelligence report). The first variable dependency schema includes a first definition of a relationship between a first variable and a second variable. The server can compare the first variable dependency schema and the second variable dependency schema. Furthermore, the server can generate a modification to be made in the second variable dependency schema based on the first definition of the relationship between the first and second variable and outputs the modification to be made to the second variable dependency schema.

Type: Application

Filed: August 6, 2020

Publication date: February 10, 2022

Inventor: Jacques DOAN HUU
AUTOMATIC MACHINE LEARNING FEATURE BACKWARD STRIPPING

Publication number: 20210350273

Abstract: Features are used to train one or more ML models in a modelling layer. In a feature selection layer, each generated ML model is analyzed to determine, for each input feature, a degree of importance of the feature on the results generated by the ML model. Features with low importance are identified and the information is propagated backward to the data source and feature engineering layers. In response, the data source and feature engineering layers refrain from gathering or generating the unimportant features. Based on a confidence measure of the determination that each feature is important or unimportant, a number of periods between reevaluation of the feature importance is determined. After the number of periods has elapsed, a removed feature is restored to the pipeline.

Type: Application

Filed: May 6, 2020

Publication date: November 11, 2021

Inventor: JACQUES DOAN HUU
OPTIMIZING GRADIENT BOOSTING FEATURE SELECTION

Publication number: 20210334667

Abstract: Gradient Boosting Decision Tree (GBDT) successively stacks many decision trees which at each step try to fix the residual errors from the previous steps. The final score produced by the GBDT is the sum of the individual scores obtained by the decision trees for an input vector. Overfitting in GBDT can be reduced by removing the input values that have the least impact on the output from the training data. One way to determine which input variable has the lowest predictive value is to determine the input variable that is used for the first time in the latest decision tree in the GBDT. This method of identifying the low-predictive features to be removed does not require that earlier trees be regenerated to generate the new GBDT. Since the removed feature was already not used in the earlier trees, those trees already ignore the removed feature.

Type: Application

Filed: April 24, 2020

Publication date: October 28, 2021

Inventor: Jacques Doan Huu
Predictive modeling optimization

Patent number: 10789547

Abstract: Techniques are described for identifying an input training dataset stored within an underlying data platform; and transmitting instructions to the data platform, the instructions being executable by the data platform to train a predictive model based on the input training dataset by delegating one or more data processing operations to a plurality of nodes across the data platform.

Type: Grant

Filed: September 9, 2016

Date of Patent: September 29, 2020

Assignee: Business Objects Software Ltd.

Inventors: Alan McShane, Jacques Doan Huu, Ahmed Abdelrahman, Antoine Carme, Bertrand Lamy, Fadi Maali, Laya Ouologuem, Milena Caires, Nicolas Dulian, Erik Marcade
IN-DATABASE PREDICTIVE PIPELINE INCREMENTAL ENGINE

Publication number: 20200175402

Abstract: A predictive model pipeline data store may contain electronic records defining a predictive model pipeline composed of operation nodes. Based on the information in the data store, an execution framework platform may calculate a hash value for each operation node by including all recursive dependencies using ancestor node hash values and current node parameters. The platform may then compare each computed hash value with a previously computed hash value associated with a prior execution of a prior version of the pipeline. Operation nodes that have an unchanged hash value may be tagged “idle.” Operation nodes that have a changed hash value may be tagged “train and apply” or “apply” based on current node parameters (and an “apply” tag may propagate backwards through the pipeline to ancestor nodes). The platform may then ignore the operation nodes tagged “idle” when creating a physical execution plan to be provided to a target platform.

Type: Application

Filed: November 29, 2018

Publication date: June 4, 2020

Inventors: Scott Kumar CAMERON, Olivier HAMON, Gabriel KEVORKIAN, Eric GOUTHIERE, Jacques DOAN HUU

1 2 next