Patents by Inventor Pedro Gustavo Santos Rodrigues Bizarro

Pedro Gustavo Santos Rodrigues Bizarro has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Interleaved sequence recurrent neural networks for fraud detection

Patent number: 11989643

Abstract: A process for handling interleaved sequences using RNNs includes receiving data of a first transaction, retrieving a first state (e.g., a default or a saved RNN state for an entity associated with the first transaction), and determining a new second state and a prediction result using the first state and an input data based on the first transaction. The process includes updating the saved RNN state for the entity to be the second state. The process includes receiving data of a second transaction, where the second transaction is associated with the same entity as the first transaction. The process unloops an RNN associated with the saved RNN state including by: retrieving the second state, determining a new third state and a prediction result using the second state and an input data based the second transaction, and updating the saved RNN state for the entity to be the third state.

Type: Grant

Filed: February 11, 2021

Date of Patent: May 21, 2024

Assignee: Feedzai—Consultadoria e Inovação Tecnológica, S.A.

Inventors: Bernardo José Amaral Nunes de Almeida Branco, Pedro Caldeira Abreu, Ana Sofia Leal Gomes, Mariana S. C. Almeida, João Tiago Barriga Negra Ascensão, Pedro Gustavo Santos Rodrigues Bizarro
SELF-SUPERVISED FRAMEWORK FOR GRAPH REPRESENTATION LEARNING

Publication number: 20230394318

Abstract: In various embodiments, a process for providing a self-supervised framework for graph representation learning includes receiving entity data for a plurality of entities and receiving transaction data for transactions between corresponding entities included in the plurality of entities. The process includes generating a heterogeneous graph representation. Nodes of the heterogeneous graph representation includes a first type of node representing an entity of the plurality of entities and a second type of node representing the transactions. The process includes performing a self-supervised training of a graph neural network including by sampling the heterogeneous graph representation for positive samples and negative samples to learn embedding representations for the nodes of the heterogeneous graph representation, and utilizing the learned embedding representations for the nodes of the heterogeneous graph representation for automatic transaction analysis.

Type: Application

Filed: August 19, 2022

Publication date: December 7, 2023

Inventors: Mário João Cabral Cardoso, Pedro dos Santos Saleiro, Pedro Gustavo Santos Rodrigues Bizarro
ALERT REVIEW USING MACHINE LEARNING AND INTERACTIVE VISUALIZATIONS

Publication number: 20230351392

Abstract: In various embodiments, a process for alert review using machine learning and interactive visualizations includes receiving transaction data for transactions, using a machine learning model to determine embedding representations of the transaction data, and using one or more automated rules to identify of a subset of the transactions. The process includes using at least a portion of the embedding representations to automatically cluster the identified subset of the transactions into a plurality of different cluster groups, and providing an interactive visual representation of the plurality of different cluster groups.

Type: Application

Filed: August 19, 2022

Publication date: November 2, 2023

Inventors: Rita Marques Costa, Mário João Cabral Cardoso, Pedro dos Santos Saleiro, Pedro Gustavo Santos Rodrigues Bizarro
METHOD AND SYSTEM FOR OBTAINING A DATASOURCE SCHEMA COMPRISING COLUMN-SPECIFIC DATA-TYPES AND/OR SEMANTIC-TYPES FROM RECEIVED TABULAR DATA RECORDS

Publication number: 20230316147

Abstract: A computer-implemented method for obtaining a datasource schema comprising column-specific data-types and/or semantic-types from received tabular data records with values arranged in rows and columns, said method including: extracting a feature vector record comprising data-type recognition features for each of one or more columns of the received input tabular records; feeding the extracted feature vector records to a pretrained type classification discriminative machine learning model; using said model for classifying each extracted feature vector record of a corresponding column of received input tabular records into an estimated data-type and/or semantic-type, respectively, of the corresponding column. It is further disclosed a computer program product, a computer system and a method for training a machine learning model for obtaining the datasource schema.

Type: Application

Filed: March 30, 2023

Publication date: October 5, 2023

Inventors: RICARDO JORGE DIAS BARATA, HUGO RICARDO COLAÇO FERREIRA, JOÃO TIAGO BARRIGA NEGRA ASCENSÃO, PEDRO GUSTAVO SANTOS RODRIGUES BIZARRO
Obtaining a generated dataset with a predetermined bias for evaluating algorithmic fairness of a machine learning model

Patent number: 11734612

Abstract: In various embodiments, a process for obtaining a generated dataset with a predetermined bias for evaluating algorithmic fairness of a machine learning model includes receiving an input dataset and generating an anonymized reconstructed dataset based at least on the input dataset. The process includes introducing a predetermined bias into the generated dataset, forming an evaluation dataset based at least on the generated dataset with the predetermined bias, and outputting the evaluation dataset. In various embodiments, a process for training a generative model includes configuring a generative model and receiving training data, where the training data includes a tabular dataset. The process includes using computer processor(s) and the received training data to train the generative model, where the generative model is sampled to generate a dataset with a predetermined bias.

Type: Grant

Filed: June 30, 2022

Date of Patent: August 22, 2023

Inventors: Sérgio Gabriel Pontes Jesus, Duarte Miguel Rodrigues dos Santos Marques Alves, José Maria Pereira Rosa Correia Pombal, André Miguel Ferreira Da Cruz, Joäo António Sobral Leite Veiga, Joäo Guilherme Simöes Bravo Ferreira, Catarina Garcia Belém, Marco Oliveira Pena Sampaio, Pedro Dos Santos Saleiro, Pedro Gustavo Santos Rodrigues Bizarro
Automatic model monitoring for data streams

Patent number: 11729194

Abstract: In an embodiment, a process for automatic model monitoring for data streams includes receiving an input dataset, using a machine learning model to determine a model score for each data record of at least a portion of the input dataset, and determining monitoring values. Each monitoring value is associated with a measure of similarity between model scores for those data records of the input dataset within a corresponding moving reference window and model scores for those data records of the input dataset within a corresponding moving target window. The process includes outputting the determined monitoring values.

Type: Grant

Filed: June 10, 2022

Date of Patent: August 15, 2023

Inventors: Marco Oliveira Pena Sampaio, Fábio Hernäni dos Santos Costa Pinto, Pedro Gustavo Santos Rodrigues Bizarro, Pedro Cardoso Lessa e Silva, Ana Margarida Caetano Ruela, Miguel Ramos de Araújo, Nuno Miguel Lourenço Diegues
TRIAGING ALERTS USING MACHINE LEARNING

Publication number: 20230147934

Abstract: In various embodiment, a process for triaging alerts using machine learning includes receiving data associated with transactions and using computer processor(s) to analyze the received data using rule(s) to automatically identify potentially suspicious activities. The process includes scoring each of the identified potentially suspicious activities using a machine learning model and based at least in part on analysis results of the rule(s) associated with the identified potentially suspicious activities, and triaging the identified potentially suspicious activities including by determining an action to take with respect to at least a portion of the identified potentially suspicious activities based at least in part on the scoring. In various embodiments, a process for training a machine learning model to triage alerts includes configuring the machine learning model, and receiving training data.

Type: Application

Filed: June 2, 2022

Publication date: May 11, 2023

Inventors: Ahmad Naser Eddin, Jacopo Bono, João Tiago Barriga Negra Ascensão, Pedro Gustavo Santos Rodrigues Bizarro
Distributed streaming system supporting real-time sliding windows

Patent number: 11645114

Abstract: In various embodiments, a process for providing a distributed streaming system supporting real-time sliding windows includes receiving a stream of events at a plurality of distributed nodes and routing the events into topic groupings. The process includes using one or more events in at least one of the topic groupings to determine one or more metrics of events with at least one window and an event reservoir including by: tracking, in a volatile memory of the event reservoir, beginning and ending events within the at least one window; and tracking, in a persistent storage of the event reservoir, events associated with tasks assigned to a respective node. The process includes updating the one or more metrics based on one or more previous values of the one or more metrics as a new event is added or an existing event is expired from the at least one window.

Type: Grant

Filed: January 13, 2022

Date of Patent: May 9, 2023

Inventors: João Miguel Forte Oliveirinha, Ana Sofia Leal Gomes, Pedro Cardoso Lessa e Silva, Pedro Gustavo Santos Rodrigues Bizarro
WEAK SUPERVISION FRAMEWORK FOR LEARNING TO LABEL CONCEPT EXPLANATIONS ON TABULAR DATA

Publication number: 20230133410

Abstract: In various embodiments, a process includes receiving input records including tabular data, where the input records are unlabeled for a concept-explainability task. The process includes obtaining primitives for at least a subset of the input records, where the obtained primitives are based at least on at least one annotation including a plurality of user-defined concept labels. The process includes training, using hardware processor(s), a plurality of candidate models using the obtained primitives. For each of the plurality of user-defined concept labels, at least one corresponding model from the plurality of candidate models is used to determine a corresponding concept labeling model. The process includes using the determined corresponding concept labeling models to label the input records with which to train a concept-explainability machine learning model using the labeled input records.

Type: Application

Filed: October 25, 2022

Publication date: May 4, 2023

Inventors: Ricardo Miguel de Oliveira Moreira, Vladimir Balayan, João Pedro Bento Sousa, Pedro dos Santos Saleiro, Pedro Gustavo Santos Rodrigues Bizarro
Graph decomposition for fraudulent transaction analysis

Patent number: 11636487

Abstract: In an embodiment, a process for graph decomposition includes initializing nodes and edges of a data graph for analysis using a computer, and performing message passing between at least a portion of the nodes of the data graph to determine a corresponding measure of interest for each node of at least a portion of the data graph. The process further includes receiving an identification of one or more nodes of interest in the data graph, performing message passing between at least a portion of the nodes of the data graph using at least the determined measures of interest to identify a corresponding subgraph of interest for each of the one or more nodes of interest in the data graph, and performing an analysis action using the one or more identified subgraphs of interest.

Type: Grant

Filed: October 14, 2020

Date of Patent: April 25, 2023

Inventors: Maria Inês Silva, David Oliveira Aparício, Pedro Gustavo Santos Rodrigues Bizarro, João Tiago Barriga Negra Ascensão, Rodolfo Cristóvão, Miguel Ramos de Araújo, Maria Beatriz Malveiro Jorge, Mariana Rodrigues Lourenço, Sandro Daniel Sabudin Nunes
ASSESSING TRANSACTIONAL GRAPHS BASED ON GENERATOR-DISCRIMINATOR NETWORKS

Publication number: 20230111818

Abstract: In various embodiments, a process for assessing transactional graphs based on generator-discriminator networks includes using a generator network to generate a first set of transaction graph samples that are of a generated type, wherein the generator network is trained to optimize a predetermined objective function. The process includes sampling, from a collected dataset, a second set of transaction graph samples that are of a non-generated type; and providing the first set of transaction graph samples and the second set of transaction graph samples to a discriminator network, wherein the discriminator network is trained to classify a provided transaction graph sample as the generated type or the non-generated type. The process includes discriminating, by the discriminator network, each of at least a portion of the first set of transaction graph samples and the second set of transaction graph samples as the generated type or the non-generated type.

Type: Application

Filed: September 8, 2022

Publication date: April 13, 2023

Inventors: Ricardo Ribeiro Pereira, Jacopo Bono, David Oliveira Aparício, Maria Inês Silva, Miguel Ramos de Araújo, João Tiago Barriga Negra Ascensão, Pedro Gustavo Santos Rodrigues Bizarro
OBTAINING A GENERATED DATASET WITH A PREDETERMINED BIAS FOR EVALUATING ALGORITHMIC FAIRNESS OF A MACHINE LEARNING MODEL

Publication number: 20230074606

Abstract: In various embodiments, a process for obtaining a generated dataset with a predetermined bias for evaluating algorithmic fairness of a machine learning model includes receiving an input dataset and generating an anonymized reconstructed dataset based at least on the input dataset. The process includes introducing a predetermined bias into the generated dataset, forming an evaluation dataset based at least on the generated dataset with the predetermined bias, and outputting the evaluation dataset. In various embodiments, a process for training a generative model includes configuring a generative model and receiving training data, where the training data includes a tabular dataset. The process includes using computer processor(s) and the received training data to train the generative model, where the generative model is sampled to generate a dataset with a predetermined bias.

Type: Application

Filed: June 30, 2022

Publication date: March 9, 2023

Inventors: Sérgio Gabriel Pontes Jesus, Duarte Miguel Rodrigues dos Santos Marques Alves, José Maria Pereira Rosa Correia Pombal, André Miguel Ferreira da Cruz, João António Sobral Leite Veiga, João Guilherme Simões Bravo Ferreira, Catarina Garcia Belém, Marco Oliveira Pena Sampaio, Pedro dos Santos Saleiro, Pedro Gustavo Santos Rodrigues Bizarro
CONSTRAINED OPTIMIZATION FOR GRADIENT BOOSTING MACHINES

Publication number: 20230070086

Abstract: In various embodiments, a process for constrained optimization for sequential error-based additive machine learning models (e.g., gradient boosting machines) includes configuring a sequential error-based additive machine learning model, receiving training data, and using one or more hardware processors to train the sequential error-based additive machine learning model using the received training data. The training includes performing optimization iterations to minimize a loss function that includes a fairness constraint, where the fairness constraint is based at least in part on disparities between groups.

Type: Application

Filed: May 26, 2022

Publication date: March 9, 2023

Inventors: André Miguel Ferreira da Cruz, Catarina Garcia Belém, Pedro dos Santos Saleiro, Pedro Gustavo Santos Rodrigues Bizarro, João Guilherme Simões Bravo Ferreira
SURROGATE HIERARCHICAL MACHINE-LEARNING MODEL TO PROVIDE CONCEPT EXPLANATIONS FOR A MACHINE-LEARNING CLASSIFIER

Publication number: 20230031512

Abstract: In various embodiments, a process for providing a surrogate hierarchical multi-task machine learning model (“model”) includes configuring the model to perform (i) a knowledge distillation task associated with a pre-trained classifier (“black-box model”) and (ii) an explanation task to predict semantic concepts for explainability associated with the distillation task. The model includes a concept layer to perform the explanation task and a decision layer to perform the distillation task. The output of the concept layer is utilized as an input to the decision layer. The process includes receiving training data including input records and concept labels, and training the model by minimizing a joint loss function that combines a loss function associated with the distillation task and one associated with the explanation task. The loss function associated with the distillation task is determined by comparing an output of the decision layer and an output of the black-box model.

Type: Application

Filed: July 18, 2022

Publication date: February 2, 2023

Inventors: João Pedro Bento Sousa, Vladimir Balayan, Ricardo Miguel de Oliveira Moreira, Pedro dos Santos Saleiro, Pedro Gustavo Santos Rodrigues Bizarro
Weakly supervised multi-task learning for concept-based explainability

Patent number: 11544471

Abstract: A labeling function associated with generating one or more semantic concepts is received. The received labeling function is used to automatically annotate an existing dataset with the one or more semantic concepts to generate an annotated noisy dataset. A reference dataset annotated with the one or more semantic concepts is received. A training dataset is prepared including by combining at least a portion of the reference dataset with at least a portion of the annotated noisy dataset. The training dataset is used to train a multi-task machine learning model configured to perform both a decision task to predict a decision result and an explanation task to predict a plurality of semantic concepts for explainability associated with the decision task.

Type: Grant

Filed: August 30, 2021

Date of Patent: January 3, 2023

Inventors: Catarina Garcia Belém, Vladimir Balayan, Pedro dos Santos Saleiro, Pedro Gustavo Santos Rodrigues Bizarro
AUTOMATIC MODEL MONITORING FOR DATA STREAMS

Publication number: 20220382861

Abstract: In an embodiment, a process for automatic model monitoring for data streams includes receiving an input dataset, using a machine learning model to determine a model score for each data record of at least a portion of the input dataset, and determining monitoring values. Each monitoring value is associated with a measure of similarity between model scores for those data records of the input dataset within a corresponding moving reference window and model scores for those data records of the input dataset within a corresponding moving target window. The process includes outputting the determined monitoring values.

Type: Application

Filed: June 10, 2022

Publication date: December 1, 2022

Inventors: Marco Oliveira Pena Sampaio, Fábio Hernâni dos Santos Costa Pinto, Pedro Gustavo Santos Rodrigues Bizarro, Pedro Cardoso Lessa e Silva, Ana Margarida Caetano Ruela, Miguel Ramos de Araújo, Nuno Miguel Lourenço Diegues
Adaptive threshold estimation for streaming data

Patent number: 11477220

Abstract: In an embodiment, a process for adaptive threshold estimation for streaming data includes determining initial positions for a set of percentile bins, receiving a new data item in a stream of data, and identifying one of the set of percentile bins corresponding to the new data item. The process includes incrementing a count of items in the identified percentile bin, adjusting one or more counts of data items in one or more of the percentile bins including by applying a suppression factor based on a relative ordering of items, and redistributing positions for the set of percentile bins to equalize respective count numbers of items for each percentile bin of the set of percentile bins. The process includes utilizing the redistributed positions of the set of percentile bins to determine a percentile distribution of the data stream, and calculating a threshold based at least in part on the percentiles distribution.

Type: Grant

Filed: October 29, 2019

Date of Patent: October 18, 2022

Inventors: Marco Oliveira Pena Sampaio, Fábio Hernâni dos Santos Costa Pinto, Pedro Gustavo Santos Rodrigues Bizarro, Pedro Cardoso Lessa e Silva, Ana Margarida Caetano Ruela, Miguel Ramos de Araújo, Nuno Miguel Lourenço Diegues
Automatic model monitoring for data streams

Patent number: 11451568

Abstract: In an embodiment, a process for automatic model monitoring for data streams includes receiving an input dataset, using a machine learning model to determine a model score for each data record of at least a portion of the input dataset, and determining monitoring values. Each monitoring value is associated with a measure of similarity between model scores for those data records of the input dataset within a corresponding moving reference window and model scores for those data records of the input dataset within a corresponding moving target window. The process includes outputting the determined monitoring values.

Type: Grant

Filed: October 29, 2019

Date of Patent: September 20, 2022

Inventors: Marco Oliveira Pena Sampaio, Fábio Hernâni dos Santos Costa Pinto, Pedro Gustavo Santos Rodrigues Bizarro, Pedro Cardoso Lessa e Silva, Ana Margarida Caetano Ruela, Miguel Ramos de Araújo, Nuno Miguel Lourenço Diegues
AUTOMATIC PROFILE EXTRACTION IN DATA STREAMS USING RECURRENT NEURAL NETWORKS

Publication number: 20220245426

Abstract: In various embodiments, a process for automatic profile extraction in data streams using recurrent neural networks includes receiving input sequence data associated with a stream of events and using a plurality of trained recurrent neural network machine learning models at least in part in parallel to determine different embedding output sets that represent at least a portion of the input sequence data in a plurality of different embedding spaces. The process includes providing the different embedding output sets to one or more classifier machine learning models to determine one or more classifier results, and using the one or more classifier results to provide a prediction output.

Type: Application

Filed: January 27, 2022

Publication date: August 4, 2022

Inventors: Bernardo José Amaral Nunes de Almeida Branco, Jacopo Bono, João Tiago Barriga Negra Ascensão, Pedro Gustavo Santos Rodrigues Bizarro
Automated rules management system

Patent number: 11403644

Abstract: In an embodiment, a process for automated rules management system includes receiving a specification of past predicted results of evaluation rules and corresponding observed outcomes. The process includes determining one or more sets of alternative activations or priorities of at least a portion of the evaluation rules, assessing the one or more sets of alternative activations or priorities of at least a portion of the evaluation rules, and optimizing result activations or priorities of at least a portion of the evaluation rules based at least in part on the assessment of the one or more sets of alternative activations or priorities.

Type: Grant

Filed: June 11, 2020

Date of Patent: August 2, 2022

Inventors: David Oliveira Aparício, Ricardo Jorge Dias Barata, João Guilherme Simões Bravo Ferreira, João Tiago Barriga Negra Ascensão, Pedro Gustavo Santos Rodrigues Bizarro

1 2 3 next