Patents by Inventor Fanny SILAVONG

Fanny SILAVONG has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20240078435
    Abstract: Systems and methods for unit test generation using reinforcement learning augmented transformer architectures are disclosed. A method may include: receiving raw data for source code from a database; identifying a function for which a unit test will be generated and an existing unit test for that function; receiving the function and the existing unit test; generating a generated unit test for the function using the function for the unit test and the existing unit test using a deep learning model; applying a loss function to the generated unit test, wherein the loss function is based on a comparison between the generated unit test and the existing unit test and results of the application of the loss function are fed back to the transformer computer program; simulating the generated unit test using a simulator; generating scalar feedback; and refining the generated unit test using the scalar feedback.
    Type: Application
    Filed: August 16, 2023
    Publication date: March 7, 2024
    Inventors: Rohan SAPHAL, Georgios PAPADOPOULOS, Fanny SILAVONG, Sean MORAN
  • Publication number: 20240062113
    Abstract: Systems and methods for scalable and flexible federated learning frameworks are disclosed. A method may include: (1) receiving, by a computer program executed by an electronic device and from a client, a project for federated learning using a training federation, the training federation comprising a plurality of clients; (2) generating, by the computer program, a configuration file that reflects a set-up for the training federation; (3) receiving, by the computer program, files necessary to build containers, wherein at least some of the files are customized by the client; (4) generating, by the computer program, containers comprising the configuration file and files necessary to build the containers; and (5) deploying, by the computer program, the containers to a client compute environment for the client as a client node, wherein the client node is configured to join the training federation as a server and/or a participant.
    Type: Application
    Filed: August 18, 2023
    Publication date: February 22, 2024
    Inventors: Fanny SILAVONG, Shaltiel ELOUL, Antonios GEORGIADIS, Sanket KAMTHE, Sean MORAN
  • Publication number: 20240054346
    Abstract: Systems and methods for simultaneous network pruning and parameter optimization are disclosed. A method may include: (1) receiving a network to optimize, the network comprising a plurality of layers; (2) selecting layer pruning and/or channel pruning for the layers within the network; (3) providing a gating module at layers within the network, wherein each gating module opens or closes a gate in the gating module based on an output of a binary head; (4) training parameters for the network; (5) extracting gate open/close features from the gating modules; (6) optimizing a loss function for the network using a polarization regularizer to reach a consensus static sub-network; and (7) updating parameters for each gating module in the network consistent with the consensus static sub-network.
    Type: Application
    Filed: July 25, 2023
    Publication date: February 15, 2024
    Inventors: Xiaoying ZHI, Sean MORAN, Fanny SILAVONG, Ruibo SHI, Pheobe SUN
  • Publication number: 20240045929
    Abstract: Systems and methods for auto-thresholding using pairwise feature cross-correlation for hyperparameter value selection are disclosed. A method may include a hyperparameter value optimization computer program: receiving data to be used by a clustering algorithm; receiving a selection of a hyperparameter value to tune; for each possible hyperparameter value, executing the clustering algorithm resulting in a set of clusters for each hyperparameter value; extracting a series of cluster features from the set of clusters; performing pairwise cross-correlation on the series of cluster features resulting in potential candidates for an optimal hyperparameter value; aggregating maximum or minimum values for the hyperparameter value at their respective indices; selecting an optimum value for the hyperparameter value; and outputting the optimum value for the hyperparameter value to the clustering algorithm.
    Type: Application
    Filed: August 5, 2022
    Publication date: February 8, 2024
    Inventors: Rohan SAPHAL, Fanny SILAVONG, Sean MORAN
  • Patent number: 11868768
    Abstract: A method for facilitating identification of secrets in source code by using machine learning is provided. The method includes retrieving a plurality of files from a repository, each of the plurality of files including a source code file; parsing the source code file to identify a training feature; associating a predetermined label with the training feature, the predetermined label corresponding to a secret label and a non-secret label; training a model by using the training feature and the corresponding predetermined label; receiving, via a graphical user interface, a test file, the test file including a set of source codes; parsing the set of source codes to identify a feature; and determining, by using the model, a first characteristic of the feature.
    Type: Grant
    Filed: September 16, 2021
    Date of Patent: January 9, 2024
    Assignee: JPMORGAN CHASE BANK, N.A.
    Inventors: Sean Moran, Ahmad Emami, Fanny Silavong, Joachim Fainberg, Ashish Tiwari, Antonios Georgiadis, Bill Moriarty, Solomon Olaniyi Adebayo, Georgios Papadopoulos, Rohan Saphal, Robert Falconer Keith, Rob Otter, Stephen Hall
  • Publication number: 20230401469
    Abstract: Systems and methods for weakly supervised unit test quality scoring are disclosed. According to one embodiment, a method may include: parsing, by a generative model computer program, a plurality of code snippets in a repository using an abstract syntax tree; receiving, by the generative model computer program, a plurality of binary labelling functions from a labelling function repository; creating, by the generative model computer program, a labelling matrix by applying the binary labelling functions to the parsed code snippets; training, by the generative model computer program, a probabilistic generative model using the labelling matrix resulting in a vector of pseudo-labels; building, by a unit test scoring computer program, a discriminative model, wherein the discriminative model receives an array of real value inputs; and training, by the unit test scoring computer program, the discriminative model using the parsed code snippets, the vector of pseudo labels, and the abstract syntax tree.
    Type: Application
    Filed: September 19, 2022
    Publication date: December 14, 2023
    Inventors: Georgios PAPADOPOULOS, Alla NADEIN, Fanny SILAVONG, Shanshan JIANG, Sean MORAN, Rob OTTER, Brett SANFORD
  • Publication number: 20230259359
    Abstract: Systems and methods for automated code analysis and tagging are disclosed. In one embodiment, a method for automated code analysis and tagging may include: (1) receiving, by a code annotation computer program executed by a computer processor, a training code snippet from a training codebase; (2) parsing, by the code annotation computer program, the training code snippet into a data structure; (3) quantifying, by the code annotation computer program, the data structure, (4) parsing, by the code annotation computer program, a docstring associated with the training code snippet into a plurality of keywords; (5) quantifying, by the code annotation computer program, the plurality of keywords; and (6) training, by the code annotation computer program, a code annotation model based on a similarity between the quantified data structure and a smoothing parameter for a Dirichlet prior smoothing estimate.
    Type: Application
    Filed: February 14, 2022
    Publication date: August 17, 2023
    Inventors: Sean MORAN, Sanat SAHA, Gaurav SINGH, Fanny SILAVONG, Antonios GEORGIADIS, Ganesh CHANDRASEKAR, Andy ALEXANDER, Rob OTTER, Brett SANFORD
  • Publication number: 20230237341
    Abstract: Systems and methods for weak supervision classification with probabilistic generative latent variable models are disclosed. A method for weak supervision classification with probabilistic generative latent variable models may include: (1) receiving, by a generative model computer program, a plurality of records from a database; (2) receiving, by the generative model computer program, a plurality of user-defined label functions; (3) labeling, by the generative model computer program, each of the plurality of records with each of the plurality of user-defined label functions; (4) representing, by the generative model computer program, the plurality of records that are labeled with the user-defined label functions in a matrix; (5) performing, by the generative model computer program, probabilistic latent variable model analysis on the matrix using a probabilistic generative latent variable model; and (6) outputting, by the generative model computer program, a labeled dataset for the plurality of records.
    Type: Application
    Filed: January 19, 2023
    Publication date: July 27, 2023
    Inventors: Georgios PAPADOPOULOS, Fanny SILAVONG, Sean MORAN, Rob OTTER, Brett SANFORD
  • Publication number: 20230229786
    Abstract: Systems and methods for federated model validation and data verification are disclosed. A method may include: (1) receiving, by a local computer program executed by client system, a federated machine learning model from a federated model server; (2) testing, by the local computer program and using a policy service, the federated machine learning model for vulnerabilities to attacks; (3) accepting, by the local computer program, the federated machine learning model in response to the federated machine learning model passing the testing; (4) training, by the local computer program, the federated machine learning model using input data comprising local data and outputting training parameters; (5) identifying, by the local computer program using the policy service, accidental leakage and/or contamination by comparing the training parameters to the input data; and (6) providing, by the local computer program, the training parameters to the federated model server.
    Type: Application
    Filed: January 19, 2023
    Publication date: July 20, 2023
    Inventors: Shaltiel ELOUL, Sean MORAN, Fanny SILAVONG, Sanket KAMTHE, Antonios GEORGIADIS
  • Publication number: 20230229440
    Abstract: A method may include: retrieving a plurality of code snippets from code repositories; generating a syntax representation, a property representation for each of the code snippets; receiving a query comprising a query code snippet, natural language keywords, and/or a string pattern; performing string-based matching and parser/syntax tree matching on the query and the tree representations, syntax matching on the query and the syntax representations, and property matching on the query and the property representations, wherein each of the matchings results in a score; combining the scores of the string-based matching, the parser/syntax tree matching, the syntax matching, and/or the property matching; identifying a plurality of code snippets of interest based on the combined scores; classifying the code snippets of interest using a machine learning classifier; outputting a list of the code snippets of interest with their classifications; and training the machine learning classifier based on user feedback.
    Type: Application
    Filed: March 4, 2022
    Publication date: July 20, 2023
    Inventors: Fanny SILAVONG, Sean MORAN, Georgios PAPADOPOULOS, Solomon Olaniyi ADEBAYO, William COVELL, Rob OTTER
  • Publication number: 20230229930
    Abstract: Systems and methods for locality preserving federated learning are disclosed. In one embodiment, a method for locality preserving federated learning may include: (1) receiving, at an aggregator computer program and from each of a plurality of clients, weights for each client's local machine learning model; (2) generating, by the aggregator computer program, an averaged machine learning model based on the received weights; (3) sharing, by the aggregator computer program, the averaged machine learning model with the plurality of clients; and (4) applying, by each client, a scaling factor to the averaged machine learning model to update its local machine learning model.
    Type: Application
    Filed: January 17, 2023
    Publication date: July 20, 2023
    Inventors: Antonios GEORGIADIS, Fanny SILAVONG, Sean MORAN, Rob OTTER
  • Publication number: 20230185550
    Abstract: Systems and methods for detecting code duplication are disclosed. In one embodiment, a method for detecting exact code snippet duplicates may include: (1) representing, by a code duplication detection computer program, each of a plurality of code snippets in a codebase as an abstract syntax trees; (2) featurizing, by the code duplication detection computer program, the abstract syntax trees into corpus feature vectors by converting the abstract syntax tree into vector representations; (3) generating, by the code duplication detection computer program, dense feature vectors from the corpus feature vectors using a dimension reduction technique; (4) identifying, by the code duplication detection computer program, exact duplicate code snippet matches by apply density-based clustering to the dense feature vectors; and (5) tagging, by the code duplication detection computer program, the exact duplicate code snippets.
    Type: Application
    Filed: December 12, 2022
    Publication date: June 15, 2023
    Inventors: Rohan SAPHAL, Fanny SILAVONG, Sean MORAN, Antonios GEORGIADIS, Sanat SAHA, Gaurav SINGH, Pierre OSSELIN, Rob OTTER
  • Publication number: 20230153085
    Abstract: Systems and methods for source code understanding using spatial representations are disclosed. In one embodiment, a method may include: (1) receiving, by a source code understanding computer program, a source code snippet; (2) converting, by the source code understanding computer program, the source code snippet to a two-dimensional image representation using an encoding technique; (3) mapping, by the source code understanding computer program, the two-dimensional image representation into a three-dimensional image representation; (4) determining, by the source code understanding computer program, a classification for the source code snippet using a deep learning network; and (5) providing, by the source code understanding computer program, the classification for the source code snippet to a downstream system.
    Type: Application
    Filed: November 14, 2022
    Publication date: May 18, 2023
    Inventors: Ruibo SHI, Sean MORAN, Lili TAO, Fanny SILAVONG, Rohan SAPHAL
  • Patent number: 11625423
    Abstract: Systems and methods for generating a fusion score between electronic documents. The method includes receiving a first electronic document by a document management system. The method further includes extracting a first set of features from the first electronic document including at least one feature type indicating the hierarchical structure of the first electronic document. The method also includes receiving a second electronic document by the document management server. The method further includes extracting a second set of features from the second electronic document including at least one feature type indicating the hierarchical structure of the second electronic document. The method further includes generating a fusion score based on a comparison of the first set of features and the second set of features.
    Type: Grant
    Filed: January 25, 2021
    Date of Patent: April 11, 2023
    Assignee: JPMORGAN CHASE BANK, N.A.
    Inventors: Sean Moran, Fanny Silavong, Rob Otter, Brett Sanford, Antonios Georgiadis, Sae Young Moon
  • Publication number: 20230070420
    Abstract: A method for facilitating identification of secrets in source code by using machine learning is provided. The method includes retrieving a plurality of files from a repository, each of the plurality of files including a source code file; parsing the source code file to identify a training feature; associating a predetermined label with the training feature, the predetermined label corresponding to a secret label and a non-secret label; training a model by using the training feature and the corresponding predetermined label; receiving, via a graphical user interface, a test file, the test file including a set of source codes; parsing the set of source codes to identify a feature; and determining, by using the model, a first characteristic of the feature.
    Type: Application
    Filed: September 16, 2021
    Publication date: March 9, 2023
    Applicant: JPMorgan Chase Bank, N.A.
    Inventors: Sean MORAN, Ahmad EMAMI, Fanny SILAVONG, Joachim FAINBERG, Ashish TIWARI, Antonios GEORGIADIS, Bill MORIARTY, Solomon Olaniyi ADEBAYO, Georgios PAPADOPOULOS, Rohan SAPHAL, Robert Falconer KEITH, Rob OTTER, Stephen HALL
  • Publication number: 20230062297
    Abstract: Systems and methods for code repository embedding using attention mechanism for tagging and summarization are disclosed. According to one embodiment, a method for code repository embedding may include: (1) extracting, by a computer program executed by an electronic device, docstring embeddings, code embeddings, and dependency embeddings from scripts in a repository (2) applying, by the computer program, a machine learning algorithm to each of the docstring embeddings, code embeddings, and dependency embeddings; (3) concatenating, by the computer program, outputs of the machine learning algorithm; (4) weighting, by the computer program, the concatenated outputs of the machine learning algorithm using an attention mechanism, resulting in a repository representation comprising an abstract vector; and (5) tagging or summarizing, by the computer program, the script using its repository representation.
    Type: Application
    Filed: August 23, 2022
    Publication date: March 2, 2023
    Inventors: Shaltiel ELOUL, Agathe LHÉRONDELLE, Sean MORAN, Fanny SILAVONG, Yash SATSANGI
  • Publication number: 20230064540
    Abstract: Systems and methods for federated secure vocabulary learning. The method may include communicating, by a first client network and a second client network and to a global federation terminal, a first set of encrypted tokens and a second set of encrypted tokens. The method may further include generating, by the global federation terminal, a consolidated vocabulary based on the first set of encrypted tokens and the second set of encrypted tokens. The method may include determining, a first set of weights and a second set of weights for the consolidated vocabulary. The method may include receiving by the global federation terminal the first set of weights and the second set of weights. The method may further include aggregating the first set of weights and the second set of weights. The method may further include distributing, to the first client network and the second client network, an aggregated set of weights.
    Type: Application
    Filed: February 16, 2022
    Publication date: March 2, 2023
    Inventors: Fanny SILAVONG, Antonios GEORGIADIS, Sean MORAN, Brett SANFORD, Rob OTTER
  • Publication number: 20230058972
    Abstract: Systems and methods for noise agnostic federated learning are disclosed. A method may include a client computer program executed by an electronic device in a federated learning computer network comprising a plurality of clients: receiving, from a federated learning computer program, a data format having desirable noise characteristics; transforming a client data set comprising variable noise characteristics to the data format using a client generative adversarial network (GAN); generating client weights for the transformed client data set, wherein the client weights indicate features of the client data set; communicating the client weights to the federated learning computer program; receiving, from the federated learning computer program, adjusted weights, wherein the adjusted weights are based on the client weights and a plurality client weights received from the clients in the federated learning computer network; and updating the client weights for a client machine learning model using the adjusted weights.
    Type: Application
    Filed: August 16, 2022
    Publication date: February 23, 2023
    Inventors: Antonios GEORGIADIS, Fanny SILAVONG, Sean MORAN, Rob OTTER, Varun BABBAR
  • Publication number: 20220237182
    Abstract: Systems and methods for generating a fusion score between electronic documents. The method includes receiving a first electronic document by a document management system. The method further includes extracting a first set of features from the first electronic document including at least one feature type indicating the hierarchical structure of the first electronic document. The method also includes receiving a second electronic document by the document management server. The method further includes extracting a second set of features from the second electronic document including at least one feature type indicating the hierarchical structure of the second electronic document. The method further includes generating a fusion score based on a comparison of the first set of features and the second set of features.
    Type: Application
    Filed: January 25, 2021
    Publication date: July 28, 2022
    Inventors: Fanny SILAVONG, Sean MORAN, Antonios GEORGIADIS, Rob OTTER, Brett SANFORD
  • Publication number: 20220100725
    Abstract: Systems and methods for counteracting data-skewness for locality sensitive hashing via feature selection and pruning are disclosed. In one embodiment, a method for feature selection for counteracting data skewness on locality sensitive hashing (LSH)-based search may include: (1) ingesting, by an ingestion computer program and from a plurality of data sources, data; (2) extracting, by the ingestion computer program, a plurality of features from the ingested data; (3) transforming, by the ingestion computer program, each of the plurality of features into a feature vector; (4) selecting, by the ingestion computer program, a subset of the plurality of features; and (5) for each selected feature vector: computing, by the ingestion computer program, a random hash function for the selected feature; and inserting, by the ingestion computer program, an output of the random hash function into a hash table with the selected feature.
    Type: Application
    Filed: September 27, 2021
    Publication date: March 31, 2022
    Inventors: Sean MORAN, Fanny SILAVONG, Rob OTTER, Antonios GEORGIADIS, Brett SANFORD