Patents by Inventor Hassan Chafi

Hassan Chafi has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

MITIGATING BIAS IN MACHINE LEARNING WITHOUT POSITIVE OUTCOME RATE REGRESSIONS

Publication number: 20250139474

Abstract: A computer obtains multipliers of a sensitive feature. From an input that contains a value of the feature, a probability of a class is inferred. Based on the value of the feature in the input, one of the multipliers of the feature is selected. The multiplier is specific to both of the feature and the value of the feature. The input is classified based on a multiplicative product of the probability of the class and the multiplier that is specific to both of the feature and the value of the feature. In an embodiment, a black-box tri-objective optimizer generates multipliers on a three-way Pareto frontier from which a user may interactively select a combination of multipliers that provides a best three-way tradeoff between fairness and accuracy. The optimizer has three objectives to respectively optimize three distinct validation metrics that may, for example, be accuracy, fairness, and favorable outcome rate decrease.

Type: Application

Filed: December 19, 2023

Publication date: May 1, 2025

Inventors: Yasha Pushak, Ehsan Soltan Aghai, Hesam Fathi Moghadam, Sungpack Hong, Hassan Chafi
Estimating Graph Size And Memory Consumption Of Distributed Graph For Efficient Resource Management

Publication number: 20250139163

Abstract: An estimator is provided that can be used to get an estimate of final graph size and peak memory usage of the graph during loading, based on sampling of the graph data and using machine learning (ML) techniques. A data sampler samples the data from files or databases and estimates some statistics about the final graph. The sampler also samples some information about property data. Given the sampled statistics gathered and estimated by the data sampler, a graph size estimator estimates how much memory is required by the graph processing engine to load the graph. The final graph size represents how much memory will be used to keep the final graph structures in memory once loading is completed. The peak memory usage represents the memory usage upper bound that is reached by the graph processing engine during loading.

Type: Application

Filed: October 26, 2023

Publication date: May 1, 2025

Inventors: Jonas Schweizer, Arnaud Delamare, Jinsu Lee, Sungpack Hong, Hassan Chafi, Vasileios Trigonakis
INVALID TRAFFIC DETECTION USING EXPLAINABLE UNSUPERVISED GRAPH ML

Publication number: 20250119453

Abstract: Herein are graph machine learning explainability (MLX) techniques for invalid traffic detection. In an embodiment, a computer generates a graph that contains: a) domain vertices that represent network domains that received requests and b) address vertices that respectively represent network addresses from which the requests originated. Based on the graph, domain embeddings are generated that respectively encode the domain vertices. Based on the domain embeddings, multidomain embeddings are generated that respectively encode the network addresses. The multidomain embeddings are organized into multiple clusters of multidomain embeddings. A particular cluster is detected as suspicious. In an embodiment, an unsupervised trained graph model generates the multidomain embeddings. Based on the clusters of multidomain embeddings, feature importances are unsupervised trained. Based on the feature importances, an explanation is automatically generated for why an object is or is not suspicious.

Type: Application

Filed: November 20, 2024

Publication date: April 10, 2025

Inventors: Valentin Venzin, Rhicheek Patra, Sungpack Hong, Hassan Chafi
Method, product, and system to provide a parser for property graph queries with precise error reporting and auto-completion based on information from property graph schemas

Patent number: 12271402

Abstract: A method, apparatus, and product to provide a parser for property graph queries with precise error reporting and auto-completion based on information from property graph schemas. The approach generally comprises analysis of graph queries prior to their execution to identify issues prior to execution. In some embodiments, the approach includes any of: use of a property graph schema to determine whether names in a received property graph query exist within a corresponding property graph; determining whether the property graph query includes a comparison of mismatched data types; providing an autocomplete suggestion feature for assistance in resolving errors or corresponding to a cursor position within a query string; or evaluation of a property graph query to determine whether it would return an empty result. In some embodiments, property graph query analysis is performed using a context aware approach.

Type: Grant

Filed: June 21, 2022

Date of Patent: April 8, 2025

Assignee: Oracle International Corporation

Inventors: Florian Gratzer, Oskar Van Rest, Vlad Ioan Haprian, Vasileios Trigonakis, Korbinian Schmid, Steven Serra, Sungpack Hong, Hassan Chafi
MEMORY-TRACKING RESOURCE MANAGER FOR ELASTIC DISTRIBUTED GRAPH-PROCESSING SYSTEM

Publication number: 20250094224

Abstract: A resource manager tracks the amount of available memory for a cluster of machines and for each machine in the cluster. The resource manager receives a reservation request from a job for a graph processing operation. The reservation request specifies an identification of the job, a type of reservation, and an amount of memory requested. The resource manager determines whether to grant the reservation request based on the type of reservation, the amount of memory requested, and the amount of available memory in the cluster or in one or more machines in the cluster. In response to determining to grant the reservation request, the resource manager sends a response to the job indicating an amount of memory reserved and adjusts the amount of available cluster memory and the amount of available machine memory for at least one machine in the cluster based on the amount of memory reserved.

Type: Application

Filed: September 18, 2023

Publication date: March 20, 2025

Inventors: Calin Iorgulescu, Martin Kucera, Vasileios Trigonakis, Arnaud Delamare, Petr Koupy, Jinsu Lee, Sungpack Hong, Hassan Chafi
FAIRNESS FEATURE IMPORTANCE: UNDERSTANDING AND MITIGATING UNJUSTIFIABLE BIAS IN MACHINE LEARNING MODELS

Publication number: 20250094862

Abstract: In an embodiment, a computer generates a respective original inference from each of many records. Permuted values are selected for a feature from original values of the feature. Based on the permuted values for the feature, a permuted inference is generated from each record. Fairness and accuracy of the original and permuted inferences are measured. For each of many features, the computer measures a respective impact on fairness of a machine learning model, and a respective impact on accuracy of the machine learning model. A global explanation of the machine learning model is generated and presented based on, for multiple features, the impacts on fairness and accuracy. Based on the global explanation, an interactive indication to exclude or include a particular feature is received. The machine learning model is (re-)trained based on the interactive indication to exclude or include the particular feature, which may increase the fairness of the model.

Type: Application

Filed: December 5, 2023

Publication date: March 20, 2025

Inventors: Yasha Pushak, Mathieu Godbout, Hesam Fathi Moghadam, Sungpack Hong, Hassan Chafi
SUPERVISED MODEL SELECTION VIA DIVERSITY CRITERIA

Publication number: 20250077876

Abstract: Techniques for selecting machine-learned (ML) models using diversity criteria are provided. In one technique, for each ML model of multiple ML models, output data is generated based on input data to the ML model. Multiple pairs of ML models are identified, where each ML model in the multiple pairs is from the multiple ML models. For each pair of ML models in the multiple pairs of ML models: (1) first output data that was previously generated by a first ML model in the pair is identified; (2) second output data that was previously generated by a second ML model in the pair is identified; (3) a diversity value that is based on the first and second output data is generated; and (4) the diversity value is added to a set of diversity values. A subset of the multiple ML models is selected based on the set of diversity values.

Type: Application

Filed: August 29, 2023

Publication date: March 6, 2025

Inventors: Moein Owhadi Kareshk, Giulia Carocari, Yasha Pushak, Hesam Fathi Moghadam, Sungpack Hong, Hassan Chafi
Efficient compilation of bounded recursive graph queries on top of SQL based relational engine

Patent number: 12242487

Abstract: Techniques support graph pattern matching queries inside a relational database management system (RDBMS) that supports SQL execution. The techniques compile a graph pattern matching query that includes a bounded recursive pattern query into a SQL query that can then be executed by the relational engine. As a result, techniques enable execution of graph pattern matching queries that include bounded recursive patterns on top of the relational engine by avoiding any change in the existing SQL engine.

Type: Grant

Filed: October 13, 2022

Date of Patent: March 4, 2025

Assignee: Oracle International Corporation

Inventors: Vlad Ioan Haprian, Lei Sheng, Laurent Daynes, Zhen Hua Liu, Hugo Kapp, Marco Arnaboldi, Andrew Witkowski, Sungpack Hong, Hassan Chafi
OFFLOADING GRAPH COMPONENTS TO PERSISTENT STORAGE FOR REDUCING RESIDENT MEMORY IN DISTRIBUTED GRAPH PROCESSING

Publication number: 20250053563

Abstract: A storage manager maintains metadata for a plurality of graph components including, for each given graph component, a memory-state indicator that indicates whether the given graph component is stored in memory. The storage manager identifies a set of graph components required to execute a graph processing operation and identifies, based on the metadata, a first subset of the set of graph components that are stored in the memory and a second subset of the set of graph components that are not stored in the memory. The storage manager loads the second subset of graph components into memory and initiates execution of the graph processing operation using the set of graph components in memory.

Type: Application

Filed: October 31, 2024

Publication date: February 13, 2025

Inventors: ARNAUD DELAMARE, IRFAN BUNJAKU, VASILEIOS TRIGONAKIS, CALIN IORGULESCU, TOMAS FALTIN, SUNGPACK HONG, HASSAN CHAFI
INCREMENTAL REBALANCING OF IN-MEMORY DISTRIBUTED GRAPHS FOR ELASTICITY, PERFORMANCE, AND SCALABILITY

Publication number: 20250045331

Abstract: A graph rebalancing approach is provided that allows a distributed graph system to effectively support elasticity by incrementally balancing distributed in-memory graphs uniformly or in a custom manner on a set of given machines. Performing the incremental rebalancing operation comprises selecting a chunk in a source machine in the cluster having a surplus of chunks, selecting a target machine in the cluster having a deficit of chunks, transferring the selected chunk from the source machine to the target machine, and updating metadata in each machine in the cluster to reflect a location of the graph data elements in the selected chunk in the target machine.

Type: Application

Filed: July 31, 2023

Publication date: February 6, 2025

Inventors: ARNAUD DELAMARE, GIORGOS XANTHAKIS, VASILEIOS TRIGONAKIS, SUNGPACK HONG, HASSAN CHAFI, JINSU LEE
Subqueries in distributed asynchronous graph queries

Patent number: 12197436

Abstract: A graph processing engine is provided for executing a graph query comprising a parent query and a subquery nested within the parent query. The subquery uses a reference to one or more correlated variables from the parent query. Executing the graph query comprises initiating execution of the parent query, pausing the execution of the parent query responsive to the parent query matching the one or more correlated variables in an intermediate result set, generating a subquery identifier for each match of the one or more correlated variables, modifying the subquery to include a subquery aggregate function and a clause to group results by subquery identifier, executing the modified subquery using the intermediate result set and collecting subquery results into a subquery results table responsive to pausing execution of the parent query, and resuming execution of the parent query using the subquery results table.

Type: Grant

Filed: December 29, 2022

Date of Patent: January 14, 2025

Assignee: Oracle International Corporation

Inventors: Vasileios Trigonakis, Anton Ragot, Yahya Ez-zainabi, Tomas Faltin, Sungpack Hong, Hassan Chafi
SIMULTANEOUS DATA SAMPLING AND FEATURE SELECTION VIA WEAK LEARNERS

Publication number: 20250013909

Abstract: From many features and many multidimensional points, a computer generates exploratory training configurations. Each point contains a value for each of the features. Each exploratory training configuration identifies a random subset of the features and a random subset of the points. A performance score is generated for each of the exploratory training configurations. A feature weight is generated for each of the features that is based on the performance scores of the exploratory training configurations whose random subset of features contains the feature. A point weight is generated for each of the points that is based on the performance scores of the exploratory training configurations whose random subset of the many points contains the point. A machine learning model is trained using an optimized training corpus that consists of a subset of the many features based on feature weight and a subset of the many points based on point weight.

Type: Application

Filed: July 6, 2023

Publication date: January 9, 2025

Inventors: Moein Owhadi Kareshk, Ali Seyfi, Hesam Fathi Moghadam, Sungpack Hong, Hassan Chafi
Invalid traffic detection using explainable unsupervised graph ML

Patent number: 12184692

Abstract: Herein are graph machine learning explainability (MLX) techniques for invalid traffic detection. In an embodiment, a computer generates a graph that contains: a) domain vertices that represent network domains that received requests and b) address vertices that respectively represent network addresses from which the requests originated. Based on the graph, domain embeddings are generated that respectively encode the domain vertices. Based on the domain embeddings, multidomain embeddings are generated that respectively encode the network addresses. The multidomain embeddings are organized into multiple clusters of multidomain embeddings. A particular cluster is detected as suspicious. In an embodiment, an unsupervised trained graph model generates the multidomain embeddings. Based on the clusters of multidomain embeddings, feature importances are unsupervised trained. Based on the feature importances, an explanation is automatically generated for why an object is or is not suspicious.

Type: Grant

Filed: December 21, 2021

Date of Patent: December 31, 2024

Assignee: Oracle International Corporation

Inventors: Valentin Venzin, Rhicheek Patra, Sungpack Hong, Hassan Chafi
Offloading graph components to persistent storage for reducing resident memory in distributed graph processing

Patent number: 12174835

Abstract: A storage manager for offloading graph components to persistent storage for reducing resident memory in a distributed graph processing engine is provided. The storage manager identifies a set of graph components required to execute a graph processing operation on a graph in a graph processing engine of a database system and reserves an amount of memory needed to load the set of graph components into memory. The storage manager loads the set of graph components into memory and initiates execution of the graph processing operation using the set of graph components in memory. The storage manager evicts one or more unused graph components from memory in response to receiving a request to free a requested amount of memory from memory.

Type: Grant

Filed: June 20, 2023

Date of Patent: December 24, 2024

Assignee: Oracle International Corporation

Inventors: Arnaud Delamare, Irfan Bunjaku, Vasileios Trigonakis, Calin Iorgulescu, Tomas Faltin, Sungpack Hong, Hassan Chafi
Scouting queries for improving query planning in distributed asynchronous graph queries

Patent number: 12174831

Abstract: A graph processing system is provided for executing scouting queries for improving query planning. A query planner creates a plurality of scouting queries, each scouting query in the plurality of scouting queries corresponding to a query plan for a graph query and having an associated confidence value. A graph processing system performs limited execution of the plurality of scouting queries and determines a metric value for each scouting query in the plurality of scouting queries based on execution of the scouting query. The system determines a score for each scouting query in the plurality of scouting queries based on its metric value and the confidence value of the corresponding query plan and selects a query plan based on the scores of the plurality of scouting queries. The system executes the graph query based on the selected query plan.

Type: Grant

Filed: December 2, 2022

Date of Patent: December 24, 2024

Assignee: Oracle International Corporation

Inventors: Tomáš Faltín, Vasileios Trigonakis, Sungpack Hong, Hassan Chafi
MULTIPLIER TUNING POSTPROCESSING FOR MACHINE LEARNING BIAS MITIGATION

Publication number: 20240403674

Abstract: In an embodiment, a computer infers, from an input (e.g. that represents a person) that contains a value of a sensitive feature that has a plurality of multipliers, a probability of a majority class (i.e. an outcome). Based on the value of the sensitive feature in the input, from the multipliers of the sensitive feature, a multiplier is selected that is specific to both of the sensitive feature and the value of the sensitive feature. The input is classified based on a multiplicative product of the probability of the majority class and the multiplier that is specific to both of the sensitive feature and the value of the sensitive feature. In an embodiment, a black-box bi-objective optimizer generates multipliers on a Pareto frontier from which a user may interactively select a combination of multipliers that provide a best tradeoff between fairness and accuracy.

Type: Application

Filed: December 5, 2023

Publication date: December 5, 2024

Inventors: Mathieu Godbout, Yasha Pushak, Hesam Fathi Moghadam, Sungpack Hong, Hassan Chafi
ACCELERATING AUTOMATED ALGORITHM CONFIGURATION USING HISTORICAL PERFORMANCE DATA

Publication number: 20240394557

Abstract: In an embodiment, a computer combines first original hyperparameters and second original hyperparameters into combined hyperparameters. In each iteration of a binary search that selects hyperparameters, these are selected: a) important hyperparameters from the combined hyperparameters and b) based on an estimated complexity decrease by including only important hyperparameters as compared to the combined hyperparameters, which only one boundary of the binary search to adjust. For the important hyperparameters of a last iteration of the binary search that selects hyperparameters, a pruned value range of a particular hyperparameter is generated based on a first original value range of the particular hyperparameter for the first original hyperparameters and a second original value range of the same particular hyperparameter for the second original hyperparameters.

Type: Application

Filed: May 26, 2023

Publication date: November 28, 2024

Inventors: Yasha Pushak, Mobina Mahdavi, Ali Asgari Khoshouyeh, Ali Seyfi, Zahra Zohrevand, Ritesh Ahuja, Moein Owhadi Kareshk, Hesam Fathi Moghadam, Sungpack Hong, Hassan Chafi
Using temporary tables to store graph algorithm results for a relational database management system

Patent number: 12124448

Abstract: An RDBMS specifies a graph algorithm function (GAF) that takes a graph object as input and returns a logical graph object as output. GAFs are used within graph queries to compute temporary and output properties (“GAF-computed properties”), which are live for the duration of the query cursor execution. GAF-computed output properties are accessible in the enclosing graph pattern matching query as though they were part of the input graph object of the GAF. Temporary cursor-duration tables are generated for the query cursor during compilation of a graph query that includes a GAF, and are used to store the GAF-computed properties. Each temporary table corresponds to one of the primary tables of the input graph, and includes, as a foreign key, primary key information from the corresponding primary table.

Type: Grant

Filed: January 26, 2022

Date of Patent: October 22, 2024

Inventors: Hugo Kapp, Laurent Daynes, Vlad Ioan Haprian, Jean-Pierre Lozi, Zhen Hua Liu, Marco Arnaboldi, Sabina Petride, Andrew Witkowski, Hassan Chafi, Sungpack Hong
THRESHOLD TUNING FOR IMBALANCED MULTI-CLASS CLASSIFICATION MODELS

Publication number: 20240303541

Abstract: In an embodiment, a computer generates, from an input, an inference that contains multiple probabilities respectively for multiple mutually exclusive classes that contain a first class and a second class. The probabilities contain (e.g. due to overfitting) a higher probability for the first class that is higher than a lower probability for the second class. In response to a threshold exceeding the higher probability, the input is automatically and more accurately classified as the second class. One, some, or almost all classes may have a respective distinct threshold that can be concurrently applied for acceleration. Data parallelism may simultaneously apply a threshold to a batch of multiple inputs for acceleration.

Type: Application

Filed: November 1, 2023

Publication date: September 12, 2024

Inventors: Yasha Pushak, Ali Seyfi, Hesam Fathi Moghadam, Sungpack Hong, Hassan Chafi
AUTOMLX COUNTERFACTUAL EXPLAINER (ACE)

Publication number: 20240303515

Abstract: A computer stores a reference corpus that consists of many reference points that each has a respective class. Later, an expected class and a subject point (i.e. instance to explain) that does not have the expected class are received. Multiple reference points that have the expected class are selected as starting points. Based on the subject point and the starting points, multiple discrete interpolated points are generated that have the expected class. Based on the subject point and the discrete interpolated points, multiple continuous interpolated points are generated that have the expected class. A counterfactual explanation of why the subject point does not have the expected class is directly generated based on continuous interpolated point(s) and, thus, indirectly generated based on the discrete interpolated points. For acceleration, neither way of interpolation (i.e. counterfactual generation) is iterative.

Type: Application

Filed: November 17, 2023

Publication date: September 12, 2024

Inventors: Zahra Zohrevand, Ehsan Soltan Aghai, Yasha Pushak, Hesam Fathi Moghadam, Sungpack Hong, Hassan Chafi

1 2 3 4 5 … next