Patents by Inventor Fanny SILAVONG

Fanny SILAVONG has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

SYSTEMS AND METHODS FOR UNIT TEST GENERATION USING REINFORCEMENT LEARNING AUGMENTED TRANSFORMER ARCHITECTURES

Publication number: 20240078435

Abstract: Systems and methods for unit test generation using reinforcement learning augmented transformer architectures are disclosed. A method may include: receiving raw data for source code from a database; identifying a function for which a unit test will be generated and an existing unit test for that function; receiving the function and the existing unit test; generating a generated unit test for the function using the function for the unit test and the existing unit test using a deep learning model; applying a loss function to the generated unit test, wherein the loss function is based on a comparison between the generated unit test and the existing unit test and results of the application of the loss function are fed back to the transformer computer program; simulating the generated unit test using a simulator; generating scalar feedback; and refining the generated unit test using the scalar feedback.

Type: Application

Filed: August 16, 2023

Publication date: March 7, 2024

Inventors: Rohan SAPHAL, Georgios PAPADOPOULOS, Fanny SILAVONG, Sean MORAN
SYSTEMS AND METHODS FOR SCALABLE AND FLEXIBLE FEDERATED LEARNING FRAMEWORKS

Publication number: 20240062113

Abstract: Systems and methods for scalable and flexible federated learning frameworks are disclosed. A method may include: (1) receiving, by a computer program executed by an electronic device and from a client, a project for federated learning using a training federation, the training federation comprising a plurality of clients; (2) generating, by the computer program, a configuration file that reflects a set-up for the training federation; (3) receiving, by the computer program, files necessary to build containers, wherein at least some of the files are customized by the client; (4) generating, by the computer program, containers comprising the configuration file and files necessary to build the containers; and (5) deploying, by the computer program, the containers to a client compute environment for the client as a client node, wherein the client node is configured to join the training federation as a server and/or a participant.

Type: Application

Filed: August 18, 2023

Publication date: February 22, 2024

Inventors: Fanny SILAVONG, Shaltiel ELOUL, Antonios GEORGIADIS, Sanket KAMTHE, Sean MORAN
SYSTEMS AND METHODS FOR SIMULTANEOUS NETWORK PRUNING AND PARAMETER OPTIMIZATION

Publication number: 20240054346

Abstract: Systems and methods for simultaneous network pruning and parameter optimization are disclosed. A method may include: (1) receiving a network to optimize, the network comprising a plurality of layers; (2) selecting layer pruning and/or channel pruning for the layers within the network; (3) providing a gating module at layers within the network, wherein each gating module opens or closes a gate in the gating module based on an output of a binary head; (4) training parameters for the network; (5) extracting gate open/close features from the gating modules; (6) optimizing a loss function for the network using a polarization regularizer to reach a consensus static sub-network; and (7) updating parameters for each gating module in the network consistent with the consensus static sub-network.

Type: Application

Filed: July 25, 2023

Publication date: February 15, 2024

Inventors: Xiaoying ZHI, Sean MORAN, Fanny SILAVONG, Ruibo SHI, Pheobe SUN
SYSTEMS AND METHODS FOR AUTO-THRESHOLDING USING PAIRWISE FEATURE CROSS-CORRELATION FOR HYPERPARAMETER VALUE SELECTION

Publication number: 20240045929

Abstract: Systems and methods for auto-thresholding using pairwise feature cross-correlation for hyperparameter value selection are disclosed. A method may include a hyperparameter value optimization computer program: receiving data to be used by a clustering algorithm; receiving a selection of a hyperparameter value to tune; for each possible hyperparameter value, executing the clustering algorithm resulting in a set of clusters for each hyperparameter value; extracting a series of cluster features from the set of clusters; performing pairwise cross-correlation on the series of cluster features resulting in potential candidates for an optimal hyperparameter value; aggregating maximum or minimum values for the hyperparameter value at their respective indices; selecting an optimum value for the hyperparameter value; and outputting the optimum value for the hyperparameter value to the clustering algorithm.

Type: Application

Filed: August 5, 2022

Publication date: February 8, 2024

Inventors: Rohan SAPHAL, Fanny SILAVONG, Sean MORAN
Detecting secrets in source code

Patent number: 11868768

Abstract: A method for facilitating identification of secrets in source code by using machine learning is provided. The method includes retrieving a plurality of files from a repository, each of the plurality of files including a source code file; parsing the source code file to identify a training feature; associating a predetermined label with the training feature, the predetermined label corresponding to a secret label and a non-secret label; training a model by using the training feature and the corresponding predetermined label; receiving, via a graphical user interface, a test file, the test file including a set of source codes; parsing the set of source codes to identify a feature; and determining, by using the model, a first characteristic of the feature.

Type: Grant

Filed: September 16, 2021

Date of Patent: January 9, 2024

Assignee: JPMORGAN CHASE BANK, N.A.

Inventors: Sean Moran, Ahmad Emami, Fanny Silavong, Joachim Fainberg, Ashish Tiwari, Antonios Georgiadis, Bill Moriarty, Solomon Olaniyi Adebayo, Georgios Papadopoulos, Rohan Saphal, Robert Falconer Keith, Rob Otter, Stephen Hall
SYSTEMS AND METHODS FOR WEAKLY SUPERVISED UNIT TEST QUALITY SCORING

Publication number: 20230401469

Abstract: Systems and methods for weakly supervised unit test quality scoring are disclosed. According to one embodiment, a method may include: parsing, by a generative model computer program, a plurality of code snippets in a repository using an abstract syntax tree; receiving, by the generative model computer program, a plurality of binary labelling functions from a labelling function repository; creating, by the generative model computer program, a labelling matrix by applying the binary labelling functions to the parsed code snippets; training, by the generative model computer program, a probabilistic generative model using the labelling matrix resulting in a vector of pseudo-labels; building, by a unit test scoring computer program, a discriminative model, wherein the discriminative model receives an array of real value inputs; and training, by the unit test scoring computer program, the discriminative model using the parsed code snippets, the vector of pseudo labels, and the abstract syntax tree.

Type: Application

Filed: September 19, 2022

Publication date: December 14, 2023

Inventors: Georgios PAPADOPOULOS, Alla NADEIN, Fanny SILAVONG, Shanshan JIANG, Sean MORAN, Rob OTTER, Brett SANFORD
SYSTEMS AND METHOD FOR AUTOMATED CODE ANALYSIS AND TAGGING

Publication number: 20230259359

Abstract: Systems and methods for automated code analysis and tagging are disclosed. In one embodiment, a method for automated code analysis and tagging may include: (1) receiving, by a code annotation computer program executed by a computer processor, a training code snippet from a training codebase; (2) parsing, by the code annotation computer program, the training code snippet into a data structure; (3) quantifying, by the code annotation computer program, the data structure, (4) parsing, by the code annotation computer program, a docstring associated with the training code snippet into a plurality of keywords; (5) quantifying, by the code annotation computer program, the plurality of keywords; and (6) training, by the code annotation computer program, a code annotation model based on a similarity between the quantified data structure and a smoothing parameter for a Dirichlet prior smoothing estimate.

Type: Application

Filed: February 14, 2022

Publication date: August 17, 2023

Inventors: Sean MORAN, Sanat SAHA, Gaurav SINGH, Fanny SILAVONG, Antonios GEORGIADIS, Ganesh CHANDRASEKAR, Andy ALEXANDER, Rob OTTER, Brett SANFORD
SYSTEMS AND METHODS FOR WEAK SUPERVISION CLASSIFICATION WITH PROBABILISTIC GENERATIVE LATENT VARIABLE MODELS

Publication number: 20230237341

Abstract: Systems and methods for weak supervision classification with probabilistic generative latent variable models are disclosed. A method for weak supervision classification with probabilistic generative latent variable models may include: (1) receiving, by a generative model computer program, a plurality of records from a database; (2) receiving, by the generative model computer program, a plurality of user-defined label functions; (3) labeling, by the generative model computer program, each of the plurality of records with each of the plurality of user-defined label functions; (4) representing, by the generative model computer program, the plurality of records that are labeled with the user-defined label functions in a matrix; (5) performing, by the generative model computer program, probabilistic latent variable model analysis on the matrix using a probabilistic generative latent variable model; and (6) outputting, by the generative model computer program, a labeled dataset for the plurality of records.

Type: Application

Filed: January 19, 2023

Publication date: July 27, 2023

Inventors: Georgios PAPADOPOULOS, Fanny SILAVONG, Sean MORAN, Rob OTTER, Brett SANFORD
SYSTEMS AND METHODS FOR FEDERATED MODEL VALIDATION AND DATA VERIFICATION

Publication number: 20230229786

Abstract: Systems and methods for federated model validation and data verification are disclosed. A method may include: (1) receiving, by a local computer program executed by client system, a federated machine learning model from a federated model server; (2) testing, by the local computer program and using a policy service, the federated machine learning model for vulnerabilities to attacks; (3) accepting, by the local computer program, the federated machine learning model in response to the federated machine learning model passing the testing; (4) training, by the local computer program, the federated machine learning model using input data comprising local data and outputting training parameters; (5) identifying, by the local computer program using the policy service, accidental leakage and/or contamination by comparing the training parameters to the input data; and (6) providing, by the local computer program, the training parameters to the federated model server.

Type: Application

Filed: January 19, 2023

Publication date: July 20, 2023

Inventors: Shaltiel ELOUL, Sean MORAN, Fanny SILAVONG, Sanket KAMTHE, Antonios GEORGIADIS
SYSTEMS AND METHODS FOR MULTI-SIGNAL PATTERN MATCHING WITH GRAPH PROPAGATION

Publication number: 20230229440

Abstract: A method may include: retrieving a plurality of code snippets from code repositories; generating a syntax representation, a property representation for each of the code snippets; receiving a query comprising a query code snippet, natural language keywords, and/or a string pattern; performing string-based matching and parser/syntax tree matching on the query and the tree representations, syntax matching on the query and the syntax representations, and property matching on the query and the property representations, wherein each of the matchings results in a score; combining the scores of the string-based matching, the parser/syntax tree matching, the syntax matching, and/or the property matching; identifying a plurality of code snippets of interest based on the combined scores; classifying the code snippets of interest using a machine learning classifier; outputting a list of the code snippets of interest with their classifications; and training the machine learning classifier based on user feedback.

Type: Application

Filed: March 4, 2022

Publication date: July 20, 2023

Inventors: Fanny SILAVONG, Sean MORAN, Georgios PAPADOPOULOS, Solomon Olaniyi ADEBAYO, William COVELL, Rob OTTER
SYSTEMS AND METHODS FOR LOCALITY PRESERVING FEDERATED LEARNING

Publication number: 20230229930

Abstract: Systems and methods for locality preserving federated learning are disclosed. In one embodiment, a method for locality preserving federated learning may include: (1) receiving, at an aggregator computer program and from each of a plurality of clients, weights for each client's local machine learning model; (2) generating, by the aggregator computer program, an averaged machine learning model based on the received weights; (3) sharing, by the aggregator computer program, the averaged machine learning model with the plurality of clients; and (4) applying, by each client, a scaling factor to the averaged machine learning model to update its local machine learning model.

Type: Application

Filed: January 17, 2023

Publication date: July 20, 2023

Inventors: Antonios GEORGIADIS, Fanny SILAVONG, Sean MORAN, Rob OTTER
SYSTEMS AND METHODS FOR DETECTING CODE DUPLICATION IN CODEBASES

Publication number: 20230185550

Abstract: Systems and methods for detecting code duplication are disclosed. In one embodiment, a method for detecting exact code snippet duplicates may include: (1) representing, by a code duplication detection computer program, each of a plurality of code snippets in a codebase as an abstract syntax trees; (2) featurizing, by the code duplication detection computer program, the abstract syntax trees into corpus feature vectors by converting the abstract syntax tree into vector representations; (3) generating, by the code duplication detection computer program, dense feature vectors from the corpus feature vectors using a dimension reduction technique; (4) identifying, by the code duplication detection computer program, exact duplicate code snippet matches by apply density-based clustering to the dense feature vectors; and (5) tagging, by the code duplication detection computer program, the exact duplicate code snippets.

Type: Application

Filed: December 12, 2022

Publication date: June 15, 2023

Inventors: Rohan SAPHAL, Fanny SILAVONG, Sean MORAN, Antonios GEORGIADIS, Sanat SAHA, Gaurav SINGH, Pierre OSSELIN, Rob OTTER
SYSTEMS AND METHODS FOR SOURCE CODE UNDERSTANDING USING SPATIAL REPRESENTATIONS

Publication number: 20230153085

Abstract: Systems and methods for source code understanding using spatial representations are disclosed. In one embodiment, a method may include: (1) receiving, by a source code understanding computer program, a source code snippet; (2) converting, by the source code understanding computer program, the source code snippet to a two-dimensional image representation using an encoding technique; (3) mapping, by the source code understanding computer program, the two-dimensional image representation into a three-dimensional image representation; (4) determining, by the source code understanding computer program, a classification for the source code snippet using a deep learning network; and (5) providing, by the source code understanding computer program, the classification for the source code snippet to a downstream system.

Type: Application

Filed: November 14, 2022

Publication date: May 18, 2023

Inventors: Ruibo SHI, Sean MORAN, Lili TAO, Fanny SILAVONG, Rohan SAPHAL
Linear late-fusion semantic structural retrieval

Patent number: 11625423

Abstract: Systems and methods for generating a fusion score between electronic documents. The method includes receiving a first electronic document by a document management system. The method further includes extracting a first set of features from the first electronic document including at least one feature type indicating the hierarchical structure of the first electronic document. The method also includes receiving a second electronic document by the document management server. The method further includes extracting a second set of features from the second electronic document including at least one feature type indicating the hierarchical structure of the second electronic document. The method further includes generating a fusion score based on a comparison of the first set of features and the second set of features.

Type: Grant

Filed: January 25, 2021

Date of Patent: April 11, 2023

Assignee: JPMORGAN CHASE BANK, N.A.

Inventors: Sean Moran, Fanny Silavong, Rob Otter, Brett Sanford, Antonios Georgiadis, Sae Young Moon
DETECTING SECRETS IN SOURCECODE

Publication number: 20230070420

Abstract: A method for facilitating identification of secrets in source code by using machine learning is provided. The method includes retrieving a plurality of files from a repository, each of the plurality of files including a source code file; parsing the source code file to identify a training feature; associating a predetermined label with the training feature, the predetermined label corresponding to a secret label and a non-secret label; training a model by using the training feature and the corresponding predetermined label; receiving, via a graphical user interface, a test file, the test file including a set of source codes; parsing the set of source codes to identify a feature; and determining, by using the model, a first characteristic of the feature.

Type: Application

Filed: September 16, 2021

Publication date: March 9, 2023

Applicant: JPMorgan Chase Bank, N.A.

Inventors: Sean MORAN, Ahmad EMAMI, Fanny SILAVONG, Joachim FAINBERG, Ashish TIWARI, Antonios GEORGIADIS, Bill MORIARTY, Solomon Olaniyi ADEBAYO, Georgios PAPADOPOULOS, Rohan SAPHAL, Robert Falconer KEITH, Rob OTTER, Stephen HALL
SYSTEMS AND METHODS FOR CODE REPOSITORY EMBEDDING FOR TAGGING AND SUMMARIZATION TASKS USING ATTENTION ON MULTIPLE CODE DOMAINS

Publication number: 20230062297

Abstract: Systems and methods for code repository embedding using attention mechanism for tagging and summarization are disclosed. According to one embodiment, a method for code repository embedding may include: (1) extracting, by a computer program executed by an electronic device, docstring embeddings, code embeddings, and dependency embeddings from scripts in a repository (2) applying, by the computer program, a machine learning algorithm to each of the docstring embeddings, code embeddings, and dependency embeddings; (3) concatenating, by the computer program, outputs of the machine learning algorithm; (4) weighting, by the computer program, the concatenated outputs of the machine learning algorithm using an attention mechanism, resulting in a repository representation comprising an abstract vector; and (5) tagging or summarizing, by the computer program, the script using its repository representation.

Type: Application

Filed: August 23, 2022

Publication date: March 2, 2023

Inventors: Shaltiel ELOUL, Agathe LHÉRONDELLE, Sean MORAN, Fanny SILAVONG, Yash SATSANGI
SYSTEMS AND METHODS FOR FEDERATED SECURE VOCABULARY LEARNING

Publication number: 20230064540

Abstract: Systems and methods for federated secure vocabulary learning. The method may include communicating, by a first client network and a second client network and to a global federation terminal, a first set of encrypted tokens and a second set of encrypted tokens. The method may further include generating, by the global federation terminal, a consolidated vocabulary based on the first set of encrypted tokens and the second set of encrypted tokens. The method may include determining, a first set of weights and a second set of weights for the consolidated vocabulary. The method may include receiving by the global federation terminal the first set of weights and the second set of weights. The method may further include aggregating the first set of weights and the second set of weights. The method may further include distributing, to the first client network and the second client network, an aggregated set of weights.

Type: Application

Filed: February 16, 2022

Publication date: March 2, 2023

Inventors: Fanny SILAVONG, Antonios GEORGIADIS, Sean MORAN, Brett SANFORD, Rob OTTER
SYSTEMS AND METHODS FOR NOISE AGNOSTIC FEDERATED LEARNING

Publication number: 20230058972

Abstract: Systems and methods for noise agnostic federated learning are disclosed. A method may include a client computer program executed by an electronic device in a federated learning computer network comprising a plurality of clients: receiving, from a federated learning computer program, a data format having desirable noise characteristics; transforming a client data set comprising variable noise characteristics to the data format using a client generative adversarial network (GAN); generating client weights for the transformed client data set, wherein the client weights indicate features of the client data set; communicating the client weights to the federated learning computer program; receiving, from the federated learning computer program, adjusted weights, wherein the adjusted weights are based on the client weights and a plurality client weights received from the clients in the federated learning computer network; and updating the client weights for a client machine learning model using the adjusted weights.

Type: Application

Filed: August 16, 2022

Publication date: February 23, 2023

Inventors: Antonios GEORGIADIS, Fanny SILAVONG, Sean MORAN, Rob OTTER, Varun BABBAR
SYSTEMS AND METHODS FOR LINEAR LATE-FUSION SEMANTIC STRUCTURAL RETRIEVAL

Publication number: 20220237182

Abstract: Systems and methods for generating a fusion score between electronic documents. The method includes receiving a first electronic document by a document management system. The method further includes extracting a first set of features from the first electronic document including at least one feature type indicating the hierarchical structure of the first electronic document. The method also includes receiving a second electronic document by the document management server. The method further includes extracting a second set of features from the second electronic document including at least one feature type indicating the hierarchical structure of the second electronic document. The method further includes generating a fusion score based on a comparison of the first set of features and the second set of features.

Type: Application

Filed: January 25, 2021

Publication date: July 28, 2022

Inventors: Fanny SILAVONG, Sean MORAN, Antonios GEORGIADIS, Rob OTTER, Brett SANFORD
SYSTEMS AND METHODS FOR COUNTERACTING DATA-SKEWNESS FOR LOCALITY SENSITIVE HASHING VIA FEATURE SELECTION AND PRUNING

Publication number: 20220100725

Abstract: Systems and methods for counteracting data-skewness for locality sensitive hashing via feature selection and pruning are disclosed. In one embodiment, a method for feature selection for counteracting data skewness on locality sensitive hashing (LSH)-based search may include: (1) ingesting, by an ingestion computer program and from a plurality of data sources, data; (2) extracting, by the ingestion computer program, a plurality of features from the ingested data; (3) transforming, by the ingestion computer program, each of the plurality of features into a feature vector; (4) selecting, by the ingestion computer program, a subset of the plurality of features; and (5) for each selected feature vector: computing, by the ingestion computer program, a random hash function for the selected feature; and inserting, by the ingestion computer program, an output of the random hash function into a hash table with the selected feature.

Type: Application

Filed: September 27, 2021

Publication date: March 31, 2022

Inventors: Sean MORAN, Fanny SILAVONG, Rob OTTER, Antonios GEORGIADIS, Brett SANFORD