Patents by Inventor Carlos Henrique Andrade Costa

Carlos Henrique Andrade Costa has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20230169354
    Abstract: A system, computer program product, and method are provided for distributed data workflow semantics. A pipeline, such as a machine learning (ML) pipeline, is represented in a data flow graph (DFG). The represented pipeline is subject to annotations, with the annotations including pipeline nodes and object references. The pre-processed pipeline is subject to execution or processing with the annotated object references capturing object lineage. Output from the executed pipeline is constructed and a corresponding control signal is formatted to dynamically and selectively control an operatively coupled physical hardware device or software.
    Type: Application
    Filed: November 30, 2021
    Publication date: June 1, 2023
    Applicant: International Business Machines Corporation
    Inventors: Mudhakar Srivatsa, Raghu Kiran Ganti, Carlos Henrique Andrade Costa, Linsong Chu, Joshua M. Rosenkranz
  • Publication number: 20230168923
    Abstract: A system, computer program product, and method are provided for distributed data workflow semantics. A pipeline, such as a machine learning pipeline, is represented in a data flow graph (DFG) with nodes and edges. The represented nodes are configured to be annotated with an operational semantic. An order of execution of the pipeline is discovered through the node annotation(s) represented in the annotated DFG, and execution of the pipeline is based on the discovered order. A control signal formatted based on the executed pipeline is configured to dynamically and selectively control an operatively coupled physical hardware device.
    Type: Application
    Filed: November 30, 2021
    Publication date: June 1, 2023
    Applicant: International Business Machines Corporation
    Inventors: Raghu Kiran Ganti, Mudhakar Srivatsa, Carlos Henrique Andrade Costa
  • Publication number: 20230169408
    Abstract: A system, computer program product, and method are provided for distributed data workflow semantics. A pipeline, such as a machine learning (ML) pipeline, is implemented over a data flow graph (DFG) with nodes configured to support rich semantics. The rich semantics include two or more operational semantics, and at least one lineage semantic to selectively combine features that trace lineage to a common input object. The lineage semantic is leveraged to associate training and testing data set pairs in cross validation of the trained ML models produced from parallelizing the selection of ML pipelines.
    Type: Application
    Filed: November 30, 2021
    Publication date: June 1, 2023
    Applicant: International Business Machines Corporation
    Inventors: Carlos Henrique Andrade Costa, Raghu Kiran Ganti, Mudhakar Srivatsa, Linsong Chu, Joshua M. Rosenkranz, Tuan Minh Hoang Trong
  • Patent number: 10956125
    Abstract: Methods and systems for shuffling data are described. A processor may generate pair data from source data. The processor may insert the pair data into local tuple spaces. In response to a request for a particular key, the processor may determine a presence of the requested key in a global tuple space. The processor may, in response to a presence of the requested key in the global tuple space, update the global tuple space. The update may be based on the pair data among the local tuple spaces including the existing key. The processor may, in response to an absence of the requested key in the global tuple space, insert pair data including the missing key from the local tuple spaces into the global tuple space. The processor may fetch the requested pair data, and may shuffle the fetched data to generate a dataset.
    Type: Grant
    Filed: December 21, 2017
    Date of Patent: March 23, 2021
    Assignee: International Business Machines Corporation
    Inventors: Carlos Henrique Andrade Costa, Abdullah Kayi, Yoonho Park, Charles Johns
  • Patent number: 10891274
    Abstract: Methods and systems for shuffling data to generate a dataset are described. A first map module may generate first pair data, and a second map module may generate second pair data, from source data. The first map module may insert the first pair data into a first local tuple space accessible to the first map module. The second map module may insert the second pair data into a second local tuple space accessible to the second map module. A shuffle module may request pair data that includes a particular key. The first and second pair data may be inserted into a global tuple space accessible by the first and second map modules. The shuffle module may identify the requested pair data in the global tuple space, and may fetch the identified pair data from a memory. The shuffle module may shuffle the fetched pair data to generate the dataset.
    Type: Grant
    Filed: December 21, 2017
    Date of Patent: January 12, 2021
    Assignee: International Business Machines Corporation
    Inventors: Abdullah Kayi, Carlos Henrique Andrade Costa, Yoonho Park, Charles Johns
  • Patent number: 10545839
    Abstract: A method is disclosed, as well as an associated apparatus and computer program product, for checkpointing using a plurality of communicatively coupled compute nodes. The method comprises acquiring health information for a first node of the plurality of compute nodes, and determining a first failure probability for the first node using the health information. The first failure probability corresponds to a predetermined time interval. The method further comprises selecting a second node of the plurality of compute nodes as a partner node for the first node. The second node has a second failure probability for the time interval. A composite failure probability of the first node and the second node is less than the first failure probability. The method further comprises copying checkpoint information from the first node to the partner node.
    Type: Grant
    Filed: December 22, 2017
    Date of Patent: January 28, 2020
    Assignee: International Business Machines Corporation
    Inventors: Carlos Henrique Andrade Costa, Yoonho Park, Chen-Yong Cher, Bryan Rosenburg, Kyung Ryu
  • Publication number: 20190197138
    Abstract: Methods and systems for shuffling data to generate a dataset are described. A first map module may generate first pair data, and a second map module may generate second pair data, from source data. The first map module may insert the first pair data into a first local tuple space accessible to the first map module. The second map module may insert the second pair data into a second local tuple space accessible to the second map module. A shuffle module may request pair data that includes a particular key. The first and second pair data may be inserted into a global tuple space accessible by the first and second map modules. The shuffle module may identify the requested pair data in the global tuple space, and may fetch the identified pair data from a memory. The shuffle module may shuffle the fetched pair data to generate the dataset.
    Type: Application
    Filed: December 21, 2017
    Publication date: June 27, 2019
    Inventors: Abdullah Kayi, Carlos Henrique Andrade Costa, Yoonho Park, Charles Johns
  • Publication number: 20190196783
    Abstract: Methods and systems for shuffling data are described. A processor may generate pair data from source data. The processor may insert the pair data into local tuple spaces. In response to a request for a particular key, the processor may determine a presence of the requested key in a global tuple space. The processor may, in response to a presence of the requested key in the global tuple space, update the global tuple space. The update may be based on the pair data among the local tuple spaces including the existing key. The processor may, in response to an absence of the requested key in the global tuple space, insert pair data including the missing key from the local tuple spaces into the global tuple space. The processor may fetch the requested pair data, and may shuffle the fetched data to generate a dataset.
    Type: Application
    Filed: December 21, 2017
    Publication date: June 27, 2019
    Inventors: Carlos Henrique Andrade Costa, Abdullah Kayi, Yoonho Park, Charles Johns
  • Publication number: 20190196920
    Abstract: A method is disclosed, as well as an associated apparatus and computer program product, for checkpointing using a plurality of communicatively coupled compute nodes. The method comprises acquiring health information for a first node of the plurality of compute nodes, and determining a first failure probability for the first node using the health information. The first failure probability corresponds to a predetermined time interval. The method further comprises selecting a second node of the plurality of compute nodes as a partner node for the first node. The second node has a second failure probability for the time interval. A composite failure probability of the first node and the second node is less than the first failure probability. The method further comprises copying checkpoint information from the first node to the partner node.
    Type: Application
    Filed: December 22, 2017
    Publication date: June 27, 2019
    Inventors: Carlos Henrique Andrade Costa, Yoonho Park, Chen-Yong Cher, Bryan Rosenburg, Kyung Ryu
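
The partner-selection step described in the checkpointing abstracts above (patent 10545839 and publication 20190196920) can be illustrated with a minimal sketch. It is not the patented method. It assumes that node failures are independent and that the composite failure probability is the product of the two per-node probabilities, which the abstract does not state. The names `composite_failure_probability`, `select_partner`, and `copy_checkpoint`, and the node labels, are hypothetical.

```python
from typing import Dict, Optional


def composite_failure_probability(p_first: float, p_partner: float) -> float:
    """Probability that both nodes fail within the same time interval.

    Assumption (not from the abstract): failures are independent, so the
    composite probability is the product of the two probabilities.
    """
    return p_first * p_partner


def select_partner(first: str, failure_prob: Dict[str, float]) -> Optional[str]:
    """Pick the partner that minimizes the composite failure probability.

    Returns None when no candidate yields a composite probability strictly
    lower than the first node's own failure probability.
    """
    p_first = failure_prob[first]
    best_node: Optional[str] = None
    best_composite = p_first  # a partner must improve on this value
    for node, p_node in failure_prob.items():
        if node == first:
            continue
        composite = composite_failure_probability(p_first, p_node)
        if composite < best_composite:
            best_node, best_composite = node, composite
    return best_node


def copy_checkpoint(store: Dict[str, dict], source: str, partner: str) -> None:
    """Copy the source node's checkpoint data into the partner's storage."""
    store[partner] = dict(store[source])


# Hypothetical failure probabilities for one predetermined time interval.
failure_prob = {"node0": 0.2, "node1": 0.05, "node2": 0.5}
partner = select_partner("node0", failure_prob)

# In-memory stand-in for per-node checkpoint storage.
store = {"node0": {"step": 100, "state": "weights-v3"}}
if partner is not None:
    copy_checkpoint(store, "node0", partner)
```

In this sketch the health information is reduced to a precomputed probability per node. A real system would derive those values from monitored health data, as the abstract describes.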