Patents by Inventor Hassan Chafi
Hassan Chafi has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 12361065Abstract: Techniques are provided herein for efficient representation of heterogeneous graphs in memory. In an embodiment, vertices and edges of the graph are segregated by type. Each property of a type of vertex or edge has values stored in a respective vector. Directed or undirected edges of a same type are stored in compressed sparse row (CSR) format. The CSR format is more or less repeated for edge traversal in either forward or reverse direction. An edge map translates edge offsets obtained from traversal in the reverse direction for use with data structures that expect edge offsets in the forward direction. Subsequent filtration and/or traversal by type or property of vertex or edge entails minimal data access and maximal data locality, thereby increasing efficient use of the graph.Type: GrantFiled: May 25, 2021Date of Patent: July 15, 2025Assignee: Oracle International CorporationInventors: Damien Hilloulin, Davide Bartolini, Oskar Van Rest, Alexander Weld, Sungpack Hong, Hassan Chafi
-
Publication number: 20250217683Abstract: A copy-or-generate model architecture is provided that a generates generation distribution obtained from the outputs of the last decoder layer and a copy distribution built from the cross-attention scores of the last decoder layer. The model applies copy weights to the generation distribution and copy distribution to determine whether to generate a next token or to copy a token from the prompt. The model provides better security by ensuring that the input values from the prompt are directly copied to the output when appropriate, such that the model is blind to the original values to copy. In a semi-sandboxed configuration, additional information may be input to the model to help the model adapt the output based on the context of those input fields.Type: ApplicationFiled: December 29, 2023Publication date: July 3, 2025Inventors: Paul Cayet, Rhicheek Patra, Damien Hilloulin, Sungpack Hong, Hassan Chafi
-
Patent number: 12339842Abstract: A graph processing engine is provided for executing a graph query comprising a parent query and a subquery nested within the parent query. The subquery is an existential subquery, uses a reference to one or more correlated variables from the parent query, is inlined in the parent query pattern matching, does not have a post-processing phase, does not contain any global aggregation operations, uses a reference to at most one non-correlated variable, and does not include any filters on a non-correlated variable. Executing the graph query comprises initiating execution of the parent query, responsive to the parent query matching the one or more correlated variables in an intermediate result set, executing the subquery by applying a neighbor pattern matching operator that checks for existence of an edge, and resuming execution of the parent query based on results of the neighbor pattern matching operation.Type: GrantFiled: December 29, 2022Date of Patent: June 24, 2025Assignee: Oracle International CorporationInventors: Vasileios Trigonakis, Anton Ragot, Yahya Ez-zainabi, Tomas Faltin, Sungpack Hong, Hassan Chafi
-
Patent number: 12326866Abstract: A breadth first search (BFS) algorithm is provided that uses out-of-core external storage in a memory constrained system. Memory resources are used as long as they are available and external storage is used when necessary due to memory pressure. The BFS algorithm uses a disk-spilling hash-table (DSH) as the visited set and disk-spilling queues (DSQs) as the BFS frontier queue. To get the most out of the DSH, subsequent inserts and lookups must happen in the same DSH partition. To ensure that consecutive lookups happen in the same DSH partition, the BFS frontier queue is partitioned in a manner similar to the DSH partitions.Type: GrantFiled: March 29, 2024Date of Patent: June 10, 2025Assignee: Oracle International CorporationInventors: Hugo Kapp, Laurent Phillipe Daynes, Vlad Ioan Haprian, Ioannis Alagiannis, Hassan Chafi, Sungpack Hong, Andrew Witkowski, Angela Amor, Huagang Li
-
Publication number: 20250156637Abstract: In a computer-implemented embodiment, an interaction machine learning model is trained based on many interactions on many resources. A context lexical token is inferred that represents a current operational context of a user. The context lexical token is inserted into a sequence of other inferred lexical tokens. From the context lexical token within the sequence of tokens, the interaction machine learning model infers a predicted resource that will be accessed next. In an embodiment, accelerated matchmaking entails suitability measurement by a dot product of a) a dynamically inferred user embedding that is based on the context lexical token and b) a statically inferred item embedding.Type: ApplicationFiled: November 9, 2023Publication date: May 15, 2025Inventors: Cesare Bernardis, Damien Hilloulin, Rhicheek Patra, Sungpack Hong, Hassan Chafi
-
Patent number: 12299553Abstract: In a computer, each of multiple anomaly detectors infers an anomaly score for each of many tuples. For each tuple, a synthetic label is generated that indicates for each anomaly detector: the anomaly detector, the anomaly score inferred by the anomaly detector for the tuple and, for each of multiple contamination factors, the contamination factor and, based on the contamination factor, a binary class of the anomaly score. For each particular anomaly detector excluding a best anomaly detector, a similarity score is measured for each contamination factor. The similarity score indicates how similar, between the particular anomaly detector and the best anomaly detector, are the binary classes of labels with that contamination factor. For each contamination factor, a combined similarity score is calculated based on the similarity scores for the contamination factor.Type: GrantFiled: December 6, 2022Date of Patent: May 13, 2025Assignee: Oracle International CorporationInventors: Yasha Pushak, Constantin Le Clei, Fatjon Zogaj, Hesam Fathi Moghadam, Sungpack Hong, Hassan Chafi
-
Publication number: 20250139163Abstract: An estimator is provided that can be used to get an estimate of final graph size and peak memory usage of the graph during loading, based on sampling of the graph data and using machine learning (ML) techniques. A data sampler samples the data from files or databases and estimates some statistics about the final graph. The sampler also samples some information about property data. Given the sampled statistics gathered and estimated by the data sampler, a graph size estimator estimates how much memory is required by the graph processing engine to load the graph. The final graph size represents how much memory will be used to keep the final graph structures in memory once loading is completed. The peak memory usage represents the memory usage upper bound that is reached by the graph processing engine during loading.Type: ApplicationFiled: October 26, 2023Publication date: May 1, 2025Inventors: Jonas Schweizer, Arnaud Delamare, Jinsu Lee, Sungpack Hong, Hassan Chafi, Vasileios Trigonakis
-
Publication number: 20250139474Abstract: A computer obtains multipliers of a sensitive feature. From an input that contains a value of the feature, a probability of a class is inferred. Based on the value of the feature in the input, one of the multipliers of the feature is selected. The multiplier is specific to both of the feature and the value of the feature. The input is classified based on a multiplicative product of the probability of the class and the multiplier that is specific to both of the feature and the value of the feature. In an embodiment, a black-box tri-objective optimizer generates multipliers on a three-way Pareto frontier from which a user may interactively select a combination of multipliers that provides a best three-way tradeoff between fairness and accuracy. The optimizer has three objectives to respectively optimize three distinct validation metrics that may, for example, be accuracy, fairness, and favorable outcome rate decrease.Type: ApplicationFiled: December 19, 2023Publication date: May 1, 2025Inventors: Yasha Pushak, Ehsan Soltan Aghai, Hesam Fathi Moghadam, Sungpack Hong, Hassan Chafi
-
Publication number: 20250119453Abstract: Herein are graph machine learning explainability (MLX) techniques for invalid traffic detection. In an embodiment, a computer generates a graph that contains: a) domain vertices that represent network domains that received requests and b) address vertices that respectively represent network addresses from which the requests originated. Based on the graph, domain embeddings are generated that respectively encode the domain vertices. Based on the domain embeddings, multidomain embeddings are generated that respectively encode the network addresses. The multidomain embeddings are organized into multiple clusters of multidomain embeddings. A particular cluster is detected as suspicious. In an embodiment, an unsupervised trained graph model generates the multidomain embeddings. Based on the clusters of multidomain embeddings, feature importances are unsupervised trained. Based on the feature importances, an explanation is automatically generated for why an object is or is not suspicious.Type: ApplicationFiled: November 20, 2024Publication date: April 10, 2025Inventors: Valentin Venzin, Rhicheek Patra, Sungpack Hong, Hassan Chafi
-
Patent number: 12271402Abstract: A method, apparatus, and product to provide a parser for property graph queries with precise error reporting and auto-completion based on information from property graph schemas. The approach generally comprises analysis of graph queries prior to their execution to identify issues prior to execution. In some embodiments, the approach includes any of: use of a property graph schema to determine whether names in a received property graph query exist within a corresponding property graph; determining whether the property graph query includes a comparison of mismatched data types; providing an autocomplete suggestion feature for assistance in resolving errors or corresponding to a cursor position within a query string; or evaluation of a property graph query to determine whether it would return an empty result. In some embodiments, property graph query analysis is performed using a context aware approach.Type: GrantFiled: June 21, 2022Date of Patent: April 8, 2025Assignee: Oracle International CorporationInventors: Florian Gratzer, Oskar Van Rest, Vlad Ioan Haprian, Vasileios Trigonakis, Korbinian Schmid, Steven Serra, Sungpack Hong, Hassan Chafi
-
Publication number: 20250094862Abstract: In an embodiment, a computer generates a respective original inference from each of many records. Permuted values are selected for a feature from original values of the feature. Based on the permuted values for the feature, a permuted inference is generated from each record. Fairness and accuracy of the original and permuted inferences are measured. For each of many features, the computer measures a respective impact on fairness of a machine learning model, and a respective impact on accuracy of the machine learning model. A global explanation of the machine learning model is generated and presented based on, for multiple features, the impacts on fairness and accuracy. Based on the global explanation, an interactive indication to exclude or include a particular feature is received. The machine learning model is (re-)trained based on the interactive indication to exclude or include the particular feature, which may increase the fairness of the model.Type: ApplicationFiled: December 5, 2023Publication date: March 20, 2025Inventors: Yasha Pushak, Mathieu Godbout, Hesam Fathi Moghadam, Sungpack Hong, Hassan Chafi
-
Publication number: 20250094224Abstract: A resource manager tracks the amount of available memory for a cluster of machines and for each machine in the cluster. The resource manager receives a reservation request from a job for a graph processing operation. The reservation request specifies an identification of the job, a type of reservation, and an amount of memory requested. The resource manager determines whether to grant the reservation request based on the type of reservation, the amount of memory requested, and the amount of available memory in the cluster or in one or more machines in the cluster. In response to determining to grant the reservation request, the resource manager sends a response to the job indicating an amount of memory reserved and adjusts the amount of available cluster memory and the amount of available machine memory for at least one machine in the cluster based on the amount of memory reserved.Type: ApplicationFiled: September 18, 2023Publication date: March 20, 2025Inventors: Calin Iorgulescu, Martin Kucera, Vasileios Trigonakis, Arnaud Delamare, Petr Koupy, Jinsu Lee, Sungpack Hong, Hassan Chafi
-
Publication number: 20250077876Abstract: Techniques for selecting machine-learned (ML) models using diversity criteria are provided. In one technique, for each ML model of multiple ML models, output data is generated based on input data to the ML model. Multiple pairs of ML models are identified, where each ML model in the multiple pairs is from the multiple ML models. For each pair of ML models in the multiple pairs of ML models: (1) first output data that was previously generated by a first ML model in the pair is identified; (2) second output data that was previously generated by a second ML model in the pair is identified; (3) a diversity value that is based on the first and second output data is generated; and (4) the diversity value is added to a set of diversity values. A subset of the multiple ML models is selected based on the set of diversity values.Type: ApplicationFiled: August 29, 2023Publication date: March 6, 2025Inventors: Moein Owhadi Kareshk, Giulia Carocari, Yasha Pushak, Hesam Fathi Moghadam, Sungpack Hong, Hassan Chafi
-
Patent number: 12242487Abstract: Techniques support graph pattern matching queries inside a relational database management system (RDBMS) that supports SQL execution. The techniques compile a graph pattern matching query that includes a bounded recursive pattern query into a SQL query that can then be executed by the relational engine. As a result, techniques enable execution of graph pattern matching queries that include bounded recursive patterns on top of the relational engine by avoiding any change in the existing SQL engine.Type: GrantFiled: October 13, 2022Date of Patent: March 4, 2025Assignee: Oracle International CorporationInventors: Vlad Ioan Haprian, Lei Sheng, Laurent Daynes, Zhen Hua Liu, Hugo Kapp, Marco Arnaboldi, Andrew Witkowski, Sungpack Hong, Hassan Chafi
-
Publication number: 20250053563Abstract: A storage manager maintains metadata for a plurality of graph components including, for each given graph component, a memory-state indicator that indicates whether the given graph component is stored in memory. The storage manager identifies a set of graph components required to execute a graph processing operation and identifies, based on the metadata, a first subset of the set of graph components that are stored in the memory and a second subset of the set of graph components that are not stored in the memory. The storage manager loads the second subset of graph components into memory and initiates execution of the graph processing operation using the set of graph components in memory.Type: ApplicationFiled: October 31, 2024Publication date: February 13, 2025Inventors: ARNAUD DELAMARE, IRFAN BUNJAKU, VASILEIOS TRIGONAKIS, CALIN IORGULESCU, TOMAS FALTIN, SUNGPACK HONG, HASSAN CHAFI
-
INCREMENTAL REBALANCING OF IN-MEMORY DISTRIBUTED GRAPHS FOR ELASTICITY, PERFORMANCE, AND SCALABILITY
Publication number: 20250045331Abstract: A graph rebalancing approach is provided that allows a distributed graph system to effectively support elasticity by incrementally balancing distributed in-memory graphs uniformly or in a custom manner on a set of given machines. Performing the incremental rebalancing operation comprises selecting a chunk in a source machine in the cluster having a surplus of chunks, selecting a target machine in the cluster having a deficit of chunks, transferring the selected chunk from the source machine to the target machine, and updating metadata in each machine in the cluster to reflect a location of the graph data elements in the selected chunk in the target machine.Type: ApplicationFiled: July 31, 2023Publication date: February 6, 2025Inventors: ARNAUD DELAMARE, GIORGOS XANTHAKIS, VASILEIOS TRIGONAKIS, SUNGPACK HONG, HASSAN CHAFI, JINSU LEE -
Patent number: 12197436Abstract: A graph processing engine is provided for executing a graph query comprising a parent query and a subquery nested within the parent query. The subquery uses a reference to one or more correlated variables from the parent query. Executing the graph query comprises initiating execution of the parent query, pausing the execution of the parent query responsive to the parent query matching the one or more correlated variables in an intermediate result set, generating a subquery identifier for each match of the one or more correlated variables, modifying the subquery to include a subquery aggregate function and a clause to group results by subquery identifier, executing the modified subquery using the intermediate result set and collecting subquery results into a subquery results table responsive to pausing execution of the parent query, and resuming execution of the parent query using the subquery results table.Type: GrantFiled: December 29, 2022Date of Patent: January 14, 2025Assignee: Oracle International CorporationInventors: Vasileios Trigonakis, Anton Ragot, Yahya Ez-zainabi, Tomas Faltin, Sungpack Hong, Hassan Chafi
-
Publication number: 20250013909Abstract: From many features and many multidimensional points, a computer generates exploratory training configurations. Each point contains a value for each of the features. Each exploratory training configuration identifies a random subset of the features and a random subset of the points. A performance score is generated for each of the exploratory training configurations. A feature weight is generated for each of the features that is based on the performance scores of the exploratory training configurations whose random subset of features contains the feature. A point weight is generated for each of the points that is based on the performance scores of the exploratory training configurations whose random subset of the many points contains the point. A machine learning model is trained using an optimized training corpus that consists of a subset of the many features based on feature weight and a subset of the many points based on point weight.Type: ApplicationFiled: July 6, 2023Publication date: January 9, 2025Inventors: Moein Owhadi Kareshk, Ali Seyfi, Hesam Fathi Moghadam, Sungpack Hong, Hassan Chafi
-
Patent number: 12184692Abstract: Herein are graph machine learning explainability (MLX) techniques for invalid traffic detection. In an embodiment, a computer generates a graph that contains: a) domain vertices that represent network domains that received requests and b) address vertices that respectively represent network addresses from which the requests originated. Based on the graph, domain embeddings are generated that respectively encode the domain vertices. Based on the domain embeddings, multidomain embeddings are generated that respectively encode the network addresses. The multidomain embeddings are organized into multiple clusters of multidomain embeddings. A particular cluster is detected as suspicious. In an embodiment, an unsupervised trained graph model generates the multidomain embeddings. Based on the clusters of multidomain embeddings, feature importances are unsupervised trained. Based on the feature importances, an explanation is automatically generated for why an object is or is not suspicious.Type: GrantFiled: December 21, 2021Date of Patent: December 31, 2024Assignee: Oracle International CorporationInventors: Valentin Venzin, Rhicheek Patra, Sungpack Hong, Hassan Chafi
-
Patent number: 12174831Abstract: A graph processing system is provided for executing scouting queries for improving query planning. A query planner creates a plurality of scouting queries, each scouting query in the plurality of scouting queries corresponding to a query plan for a graph query and having an associated confidence value. A graph processing system performs limited execution of the plurality of scouting queries and determines a metric value for each scouting query in the plurality of scouting queries based on execution of the scouting query. The system determines a score for each scouting query in the plurality of scouting queries based on its metric value and the confidence value of the corresponding query plan and selects a query plan based on the scores of the plurality of scouting queries. The system executes the graph query based on the selected query plan.Type: GrantFiled: December 2, 2022Date of Patent: December 24, 2024Assignee: Oracle International CorporationInventors: Tomáš Faltín, Vasileios Trigonakis, Sungpack Hong, Hassan Chafi