Patents by Inventor Rômulo Teixeira de Abreu Pinho

Rômulo Teixeira de Abreu Pinho has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20230237272
    Abstract: Techniques are disclosed for predicting a table column using machine learning. For example, a system can include at least one processing device including a processor coupled to a memory, the processing device being configured to implement the following: determining a local word density for words in a table, the local word density measuring a count of other words in a first region surrounding the words; determining a local numeric density for the words, the local numeric density measuring a proportion of digits in a second region surrounding the words; determining semantic associations for the words by processing the words using an ML-based semantic association model trained based on surrounding words in nearby table columns and rows; and predicting a table column index for the words by processing the table using an ML-based table column model trained based on the local word density, local numeric density, and semantic association.
    Type: Application
    Filed: January 27, 2022
    Publication date: July 27, 2023
    Applicant: Dell Products L.P.
    Inventors: Rômulo Teixeira de Abreu Pinho, Paulo Abelha Ferreira, Vinicius Gottin, Pablo Nascimento Da Silva
  • Publication number: 20230237080
    Abstract: One example method includes collecting annotated unstructured documents that each include a table with words whose respective column indices are known, using the documents to train a model to detect a table header in a given document, identifying, by the model, a region of a document that corresponds to a table header in a new document that is not part of the training data, using an algorithm to perform a segmentation process on the table header that identifies column boundaries in the table header, and using the identified column boundaries to preliminarily assign a respective column index to each word in the table header. Finally, a graph neural network model is run on a graph that includes the words in the table, and running the graph neural network generates a refined prediction of a respective column index for each of the words in the table of the new document.
    Type: Application
    Filed: January 27, 2022
    Publication date: July 27, 2023
    Inventors: Rômulo Teixeira de Abreu Pinho, Paulo Abelha Ferreira, Vinicius Michel Gottin, Pablo Nascimento da Silva
  • Publication number: 20230237100
    Abstract: Techniques for table row identification using machine learning are disclosed herein. For example, a method can include detecting a table body in a document by processing the document using a machine learning (ML)-based table body model; predicting an initial table row index for one or more words among a plurality of words obtained in the document, wherein the one or more words are determined to be within the table body; and determining a table row index for the one or more words using an ML-based table row model that is trained based on the predicted initial table row index for the one or more words.
    Type: Application
    Filed: January 27, 2022
    Publication date: July 27, 2023
    Applicant: Dell Products L.P.
    Inventors: Paulo Abelha Ferreira, Rômulo Teixeira de Abreu Pinho, Pablo Nascimento Da Silva, Vinicius Gottin
  • Publication number: 20230229514
    Abstract: One example method includes receiving a computation workflow defined by a graph that includes quantum computing nodes, receiving a catalogue of quantum computing instances that are available in a hybrid classic-quantum computation infrastructure, transforming the graph to create a first graph transformation, and each of the quantum computing nodes is assigned a respective candidate resource allocation that identifies candidate resources operable to execute a respective quantum algorithm associated with that quantum computing node, and the transforming is performed using information from the catalogue, and optimizing the computation workflow by selecting, for each of the quantum computing nodes, a resource from the candidate resource allocation associated with that quantum computing node, and the optimizing includes transforming the first graph transformation to create a second graph transformation that specifies the selected resources for each node.
    Type: Application
    Filed: January 14, 2022
    Publication date: July 20, 2023
    Inventors: Rômulo Teixeira de Abreu Pinho, Victor Fong, Kenneth Durazzo
  • Patent number: 11647103
    Abstract: A compression system is disclosed. A compression service removes compression responsibilities from an application. The compression system can deploy virtual network engines to locations near the applications. The virtual network engines compress the data using a compressor selected from multiple compressors. The compressed data can then be transmitted, decompressed, and delivered to the destination.
    Type: Grant
    Filed: January 14, 2022
    Date of Patent: May 9, 2023
    Assignee: Dell Products L.P.
    Inventors: Rômulo Teixeira De Abreu Pinho, Vinicius Michel Gottin, Joel Christner
  • Patent number: 11641212
    Abstract: One example method includes file specific compression selection. Compression metrics are generated for a chunk of a file. Using a set of training data, the compression metrics are corrected using a correction factor to determine estimated file compression metrics. A compressor is then selected to compress the file based on at least the estimated file compression metrics.
    Type: Grant
    Filed: March 12, 2021
    Date of Patent: May 2, 2023
    Assignee: EMC IP Holding Company LLC
    Inventors: Rômulo Teixeira de Abreu Pinho, Vinicius Michel Gottin, Joel Christner
  • Patent number: 11637565
    Abstract: One example method includes file specific compression selection. Compression metrics are generated for a chunk of a file using a reference compressor. Compression metrics for other compressors are determined from the metrics of the reference compressor. A compressor is then selected to compress the file.
    Type: Grant
    Filed: June 30, 2021
    Date of Patent: April 25, 2023
    Assignee: EMC IP Holding Company LLC
    Inventors: Rômulo Teixeira De Abreu Pinho, Vinicius Michel Gottin, Joel Christner
  • Publication number: 20230121060
    Abstract: Techniques described herein relate to systems and methods for workload placement based on subgraph similarity. Such techniques may include obtaining an encoded workload graph based on receiving a workload execution request; using the encoded workload subgraph to obtain encoded graphs representing previous workload executions, encoded subgraphs representing infrastructures on which the workloads were executed, resource usage information, and execution metrics; using the encoded infrastructure subgraphs with subgraph similarity to identify candidate infrastructure subgraphs; using an ML model to predict an execution metric for an execution of the workload using the candidate; and selecting a best candidate infrastructure on which to execute the workload based on the predicted execution results.
    Type: Application
    Filed: October 20, 2021
    Publication date: April 20, 2023
    Inventors: Rômulo Teixeira de Abreu Pinho, Vinicius Michel Gottin, Eduardo Vera Sousa
  • Publication number: 20230114965
    Abstract: One example method includes, for each document in a group of annotated documents, extracting a set of words from the annotated document, and each of the words is positioned in a respective field of the annotated document. The method further includes using an aggregation function to determine, for one of the fields, a similarity of each one of the annotated documents to all of the other annotated documents, creating a document layout graph with nodes that each correspond to a respective annotated document, and each node is connected to all other nodes for which a similarity threshold for the one field has been met, and running an algorithm on the document layout graph to identify a clique of the annotated documents, and each annotated document in the clique has a similar layout to respective layouts of the other annotated documents in the clique.
    Type: Application
    Filed: October 8, 2021
    Publication date: April 13, 2023
    Inventors: Paulo Abelha Ferreira, Pablo Nascimento da Silva, Rômulo Teixeira de Abreu Pinho, Vinicius Michel Gottin
  • Publication number: 20230031202
    Abstract: Techniques described herein relate to a method for predicting field values of documents. The method may include identifying a field prediction model generation request; obtaining training documents from a document manager; selecting a first training document; making a first determination that the first training document is a text-based document; performing text-based data extraction to identify first words and first boxes included in the first training document; identifying first keywords and first candidate words included in the first training document based on the first words and the first boxes; and generating a first annotated training document using the first keywords and the first candidate words, wherein the first annotated training document comprises color-based representation masks for the first keywords, the first candidate words, and first general words included in the first training document.
    Type: Application
    Filed: July 27, 2021
    Publication date: February 2, 2023
    Inventors: Paulo Abelha Ferreira, Pablo Nascimento da Silva, Rômulo Teixeira de Abreu Pinho, Tiago Salviano Calmon, Vinicius Michel Gottin
  • Publication number: 20230023136
    Abstract: One example method includes file specific compression selection. Compression metrics are generated for a chunk of a file using a reference compressor. Compression metrics for other compressors are determined from the metrics of the reference compressor. A compressor is then selected to compress the file.
    Type: Application
    Filed: June 30, 2021
    Publication date: January 26, 2023
    Inventors: Rômulo Teixeira de Abreu Pinho, Vinicius Michel Gottin, Joel Christner
  • Patent number: 11544138
    Abstract: A method comprises collecting operational data for one or more devices and identifying one or more anomalies associated with the one or more devices based at least in part on the collected operational data. At least a portion of the collected operational data corresponding to the identified one or more anomalies is analyzed, and a probability of automatic resolution for respective ones of the identified one or more anomalies is determined based at least in part on the analysis. The identifying, the analyzing and the determining are performed using one or more machine learning models.
    Type: Grant
    Filed: May 27, 2021
    Date of Patent: January 3, 2023
    Assignee: Dell Products L.P.
    Inventors: Kanika Kapish, Hung Dinh, Bijan Kumar Mohanty, Rômulo Teixeira de Abreu Pinho
  • Patent number: 11520703
    Abstract: Techniques are provided for adaptive look-ahead configuration for data prefetching. One method comprises, in response to a request for a data item in a storage system: obtaining a size of a look-ahead window for the request based on one of multiple available caching policies; and moving the requested data item and additional data items within the look-ahead window to the cache memory when the requested data item and/or the additional data items within the look-ahead window are not in the cache memory. The multiple available caching policies comprise a caching policy based on characteristics of an input/output workload of the storage system, or a portion thereof; and/or a caching policy based on an input/output workload of at least a portion of the storage system within a prior predefined time window. The look-ahead window size may be varied over time.
    Type: Grant
    Filed: January 31, 2019
    Date of Patent: December 6, 2022
    Assignee: EMC IP Holding Company LLC
    Inventors: Jonas F. Dias, Rômulo Teixeira de Abreu Pinho, Diego Salomone Bruno, Owen Martin
  • Publication number: 20220382611
    Abstract: A method comprises collecting operational data for one or more devices and identifying one or more anomalies associated with the one or more devices based at least in part on the collected operational data. At least a portion of the collected operational data corresponding to the identified one or more anomalies is analyzed, and a probability of automatic resolution for respective ones of the identified one or more anomalies is determined based at least in part on the analysis. The identifying, the analyzing and the determining are performed using one or more machine learning models.
    Type: Application
    Filed: May 27, 2021
    Publication date: December 1, 2022
    Inventors: Kanika Kapish, Hung Dinh, Bijan Kumar Mohanty, Rômulo Teixeira de Abreu Pinho
  • Patent number: 11513961
    Abstract: A method and system for assessing sequentiality of a data stream is disclosed. Specifically, the method and system disclosed herein may entail receiving an incoming request to access a page in a cache memory, wherein the page is identified by a page address of an address space in a main memory; identifying, in a memory, a bin corresponding to an address range including the page address of the page of the incoming request, wherein the bin includes k address ranges of the address space of the main memory; determining whether to update an occupation count of the bin in the memory; locating the bin in a heuristics table to obtain an estimated total number of expected proximal accesses based on an updated occupation count of the bin; and determining, based on the estimated total number of expected proximal accesses, sequentiality of the data stream to device in order to generate a policy for the cache memory.
    Type: Grant
    Filed: January 28, 2021
    Date of Patent: November 29, 2022
    Assignee: EMC IP Holding Company LLC
    Inventors: Vinicius Michel Gottin, Tiago Salviano Calmon, Paulo Abelha Ferreira, Hugo de Oliveira Barbalho, Rômulo Teixeira de Abreu Pinho
  • Publication number: 20220342704
    Abstract: A system and method place an incoming workload within a data center having infrastructure elements (IEs) for execution. Instrumentation data are collected for both individual IEs in the data center, and workload instances executing on each of these IEs. These data are used to train a future load model according to machine learning techniques, especially supervised learning. Future loads, in turn, are used to train a ranking model that ranks IEs according to suitability to execute additional workloads. After receiving an incoming workload, the first model is used to predict, for each IE, the load on its computing resources if the workload were executed on that IE. The resulting predicted loads are then fed into the second model to predict the best ranking of IEs, and the workload is placed on the highest-ranked IE that is available to execute the workload.
    Type: Application
    Filed: April 27, 2021
    Publication date: October 27, 2022
    Applicant: Dell Products L.P.
    Inventors: Rômulo Teixeira De Abreu Pinho, Satyam Sheshansh, Hung Dinh, Bijan Mohanty
  • Publication number: 20220294470
    Abstract: One example method includes file specific compression selection. Compression metrics are generated for a chunk of a file. Using a set of training data, the compression metrics are corrected using a correction factor to determine estimated file compression metrics. A compressor is then selected to compress the file based on at least the estimated file compression metrics.
    Type: Application
    Filed: March 12, 2021
    Publication date: September 15, 2022
    Inventors: Rômulo Teixeira de Abreu Pinho, Vinicius Michel Gottin, Joel Christner
  • Patent number: 11416506
    Abstract: Facilitating temporal data management for anomalous state detection in data centers is provided herein. A system can comprise a processor and a memory that stores executable instructions that, when executed by the processor, facilitate performance of operations. The operations can comprise performing a process of extraction, transformation, and loading of data from log files into a telemetry data store. The data can be loaded into the telemetry data store as telemetry data. The operations also can comprise dividing the telemetry data into first telemetry data and second telemetry data. The first telemetry data can comprise telemetry data that does not satisfy a defined quality level. The second telemetry data can comprise telemetry data that satisfies the defined quality level. Further, the operations can comprise removing the first telemetry data from the telemetry data store and outputting the second telemetry data based on a request for the second telemetry data.
    Type: Grant
    Filed: April 29, 2020
    Date of Patent: August 16, 2022
    Assignee: EMC IP Holding Company LLC
    Inventors: Rômulo Teixeira de Abreu Pinho, Vítor Silva Sousa, Rodrigo Rios Almeida de Souza, Roberto Nery Stelling Neto
  • Publication number: 20220253370
    Abstract: Facilitating detection of anomalies of a target entity is provided herein. A system can comprise a processor and a memory that stores executable instructions that, when executed by the processor, facilitate performance of operations. The operations can comprise training a model on a first set of variables that are constrained by a second set of variables. The second set of variables can characterize elements of a defined entity. The first set of variables can define a normality of the defined entity. The operations also can comprise employing the model to identify expected parameters and unexpected parameters associated with the defined entity to at least a defined level of confidence.
    Type: Application
    Filed: April 20, 2022
    Publication date: August 11, 2022
    Inventors: Rômulo Teixeira de Abreu Pinho, Vítor Silva Sousa, Rodrigo Rios Almeida de Souza, Roberto Nery Stelling Neto
  • Publication number: 20220237124
    Abstract: A method and system for assessing sequentiality of a data stream is disclosed. Specifically, the method and system disclosed herein may entail receiving an incoming request to access a page in a cache memory, wherein the page is identified by a page address of an address space in a main memory; identifying, in a memory, a bin corresponding to an address range including the page address of the page of the incoming request, wherein the bin includes k address ranges of the address space of the main memory; determining whether to update an occupation count of the bin in the memory; locating the bin in a heuristics table to obtain an estimated total number of expected proximal accesses based on an updated occupation count of the bin; and determining, based on the estimated total number of expected proximal accesses, sequentiality of the data stream to device in order to generate a policy for the cache memory.
    Type: Application
    Filed: January 28, 2021
    Publication date: July 28, 2022
    Inventors: Vinicius Michel Gottin, Tiago Salviano Calmon, Paulo Abelha Ferreira, Hugo de Oliveira Barbalho, Rômulo Teixeira de Abreu Pinho
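
As a rough illustration of the file-specific compression selection summarized in the abstracts for patents 11641212 and 11637565 in the listing above, the following Python sketch compresses a leading chunk of a file with several standard-library compressors, scales each chunk ratio by a correction factor, and picks the compressor with the best estimated ratio. The function names, the chunk size, the candidate compressors, and the correction factors are all assumptions made for illustration; the abstracts do not specify them, and this is not a reproduction of the patented method.

```python
import bz2
import lzma
import zlib

# Candidate compressors from the Python standard library (an illustrative choice).
COMPRESSORS = {
    "zlib": zlib.compress,
    "bz2": bz2.compress,
    "lzma": lzma.compress,
}


def chunk_metrics(chunk: bytes) -> dict:
    """Return the compression ratio (original size / compressed size) per compressor."""
    if not chunk:
        raise ValueError("chunk must not be empty")
    return {name: len(chunk) / len(fn(chunk)) for name, fn in COMPRESSORS.items()}


def select_compressor(data: bytes, chunk_size: int = 4096, correction=None) -> str:
    """Estimate whole-file ratios from one chunk and return the best compressor name.

    `correction` maps a compressor name to a hypothetical factor that would be
    learned from training data; missing entries default to 1.0 (no correction).
    """
    correction = correction or {}
    ratios = chunk_metrics(data[:chunk_size])
    estimated = {name: r * correction.get(name, 1.0) for name, r in ratios.items()}
    return max(estimated, key=estimated.get)


if __name__ == "__main__":
    sample = b"timestamp=0001 status=OK latency_ms=12\n" * 2000
    print(select_compressor(sample))
    # A large correction factor for one compressor steers the choice toward it.
    print(select_compressor(sample, correction={"zlib": 100.0}))
```

Only the first chunk is compressed, so the cost of the selection step stays small relative to compressing the whole file; the correction factor stands in for whatever adjustment turns a chunk-level metric into a file-level estimate.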