Patents by Inventor Jonas F. Dias
Jonas F. Dias has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11868890Abstract: A computer implemented method, computer program product, and system for managing execution of a workflow comprising a set of subworkflows, comprising optimizing the set of subworkflows using a deep neural network, wherein each subworkflow of the set of subworkflows has a set of tasks, wherein each task of the sets of tasks has a requirement of resources of a set of resources; wherein each task of the sets of tasks is enabled to be dependent on another task of the sets of tasks, training the deep neural network by: executing the set of subworkflows, collecting provenance data from the execution, and collecting monitoring data that represents the state of said set of resources, wherein the training causes the neural network to learn relationships between the states of said set of resources, the said sets of tasks, their parameters and the obtained performance, optimizing an allocation of resources of the set of resources to each task of the sets of tasks to ensure compliance with a user-defined quality metric bType: GrantFiled: April 6, 2022Date of Patent: January 9, 2024Assignees: LANDMARK GRAPHICS CORPORATION, EMC IP HOLDING COMPANY LLCInventors: Chandra Yeleshwarapu, Jonas F. Dias, Angelo Ciarlini, Romulo D. Pinho, Vinicius Gottin, Andre Maximo, Edward Pacheco, David Holmes, Keshava Rangarajan, Scott David Senften, Joseph Blake Winston, Xi Wang, Clifton Brent Walker, Ashwani Dev, Nagaraj Sirinivasan
-
Patent number: 11567807Abstract: Techniques are provided for allocation of shared computing resources using source code feature extraction and machine learning techniques. An exemplary method comprises obtaining source code for execution in a shared computing environment; extracting a plurality of discriminative features from the source code; obtaining a trained machine learning model; and generating a prediction of an allocation of one or more resources of the shared computing environment needed to satisfy one or more service level agreement requirements for the source code. The generated prediction is optionally adjusted using a statistical analysis of an error curve, based on one or more error boundaries obtained by the trained machine learning model.Type: GrantFiled: March 30, 2018Date of Patent: January 31, 2023Assignee: EMC IP Holding Company LLCInventors: Jonas F. Dias, Tiago Salviano Calmon, Adriana Bechara Prado
-
Patent number: 11562223Abstract: Deep reinforcement learning techniques are provided for resource allocation in a shared computing environment.Type: GrantFiled: April 24, 2018Date of Patent: January 24, 2023Assignee: EMC IP Holding Company LLCInventors: Vinícius Michel Gottin, Jonas F. Dias, Daniel Sadoc Menasché, Alex Laier Bordignon, Angelo Ernani Maia Ciarlini
-
Patent number: 11520703Abstract: Techniques are provided for adaptive look-ahead configuration for data prefetching. One method comprises, in response to a request for a data item in a storage system: obtaining a size of a look-ahead window for the request based on one of multiple available caching policies; and moving the requested data item and additional data items within the look-ahead window to the cache memory when the requested data item and/or the additional data items within the look-ahead window are not in the cache memory. The multiple available caching policies comprise a caching policy based on characteristics of an input/output workload of the storage system, or a portion thereof; and/or a caching policy based on an input/output workload of at least a portion of the storage system within a prior predefined time window. The look-ahead window size may be varied over time.Type: GrantFiled: January 31, 2019Date of Patent: December 6, 2022Assignee: EMC IP Holding Company LLCInventors: Jonas F. Dias, Rômulo Teixeira de Abreu Pinho, Diego Salomone Bruno, Owen Martin
-
Patent number: 11474817Abstract: Techniques are provided for provenance-based software script reuse. One method comprises extracting provenance data from source code including, for example, source code fragments, wherein the extracted provenance data indicates a control flow and a data flow of the source code; encapsulating source code fragments from the source code that satisfy one or more similarity criteria as a reusable source code fragment; and providing a repository of encapsulated reusable source code fragments for reuse during a development of new software scripts. The repository of encapsulated reusable source code fragments optionally comprises a searchable database further including, for example, the provenance data, data annotations, input parameters and generated results for the corresponding source code fragment.Type: GrantFiled: May 2, 2019Date of Patent: October 18, 2022Assignee: EMC IP Holding Company LLCInventors: Vitor Sousa, Jonas F. Dias, Adriana Bechara Prado
-
Publication number: 20220300812Abstract: A computer implemented method, computer program product, and system for managing execution of a workflow comprising a set of subworkflows, comprising optimizing the set of subworkflows using a deep neural network, wherein each subworkflow of the set of subworkflows has a set of tasks, wherein each task of the sets of tasks has a requirement of resources of a set of resources; wherein each task of the sets of tasks is enabled to be dependent on another task of the sets of tasks, training the deep neural network by: executing the set of subworkflows, collecting provenance data from the execution, and collecting monitoring data that represents the state of said set of resources, wherein the training causes the neural network to learn relationships between the states of said set of resources, the said sets of tasks, their parameters and the obtained performance, optimizing an allocation of resources of the set of resources to each task of the sets of tasks to ensure compliance with a user-defined quality metric bType: ApplicationFiled: April 6, 2022Publication date: September 22, 2022Applicants: Landmark Graphics Corporation, EMC IP Holding Company LLCInventors: Chandra YELESHWARAPU, Jonas F. DIAS, Angelo CIARLINI, Romulo D. Pinho, Vinicius GOTTIN, Andre MAXIMO, Edward PACHECO, David HOLMES, Keshava RANGARAJAN, Scott David SENFTEN, Joseph Blake WINSTON, Xi WANG, Clifton Brent WALKER, Ashwani DEV, Nagaraj SIRINIVASAN
-
Patent number: 11436056Abstract: Techniques are provided for allocating shared computing resources using source code feature extraction and cluster-based training of machine learning models. An exemplary method comprises: obtaining a source code corpus with source code segments for execution in a shared computing environment; extracting discriminative features from the source code segments in the source code corpus; obtaining a trained machine learning model, wherein the trained machine learning model is trained using samples of source code segments from clusters derived from clustering the source code corpus based on (i) a term frequency metric, and/or (ii) observed values of execution metrics; and generating, using the trained machine learning model, a prediction of an allocation of resources of the shared computing environment needed to satisfy service level agreement requirements for source code to be executed in the shared computing environment.Type: GrantFiled: July 19, 2018Date of Patent: September 6, 2022Assignee: EMC IP Holding Company LLCInventors: Jonas F. Dias, Adriana Bechara Prado, Tiago Salviano Calmon
-
Patent number: 11403232Abstract: One example method includes determining a fall through threshold value for a cache, computing a length ‘s’ of a sequence that is close to LRU eviction, and the length ‘s’ is computed when a current fall through metric value is greater than the fall through threshold value, when the sequence length ‘s’ is greater than a predetermined threshold length ‘k,’ performing a first shift of an LRU position to define a protected queue of the cache, initializing a counter with a value of ‘r’, decrementing the counter each time a requested page is determined to be included in the protected queue, until ‘r’=0, and performing a second shift of the LRU position.Type: GrantFiled: May 29, 2020Date of Patent: August 2, 2022Assignee: EMC IP HOLDING COMPANY LLCInventors: Hugo De Oliveira Barbalho, Jonas F. Dias, Vinicius Michel Gottin
-
Patent number: 11347645Abstract: Managing a cache memory in a storage system includes maintaining a queue that stores data indictive of the read requests for a particular logical storage unit of the storage system in an order that the read requests are received by the storage system, receiving a read request for a particular page of the particular logical storage unit, and removing a number of elements in the queue and resizing the queue in response to the queue being full. Managing the cache memory also includes placing data indicative of the read request in the queue, determining a prefetch metric that varies according to a number of adjacent elements in a sorted version of the queue having a difference that is less than a predetermined value and greater than zero, and prefetching a plurality of pages that come after the particular page sequentially if the prefetch metric is greater than a predefined value.Type: GrantFiled: October 14, 2019Date of Patent: May 31, 2022Assignee: EMC IP Holding Company LLCInventors: Vinicius Gottin, Jonas F. Dias, Hugo de Oliveira Barbalho, Romulo D. Pinho, Tiago Calmon
-
Patent number: 11315014Abstract: A computer implemented method, computer program product, and system for managing execution of a workflow comprising a set of subworkflows, comprising optimizing the set of subworkflows using a deep neural network, wherein each subworkflow of the set of subworkflows has a set of tasks, wherein each task of the sets of tasks has a requirement of resources of a set of resources; wherein each task of the sets of tasks is enabled to be dependent on another task of the sets of tasks, training the deep neural network by: executing the set of subworkflows, collecting provenance data from the execution, and collecting monitoring data that represents the state of said set of resources, wherein the training causes the neural network to learn relationships between the states of said set of resources, the said sets of tasks, their parameters and the obtained performance, optimizing an allocation of resources of the set of resources to each task of the sets of tasks to ensure compliance with a user-defined quality metric bType: GrantFiled: August 16, 2018Date of Patent: April 26, 2022Assignee: EMC IP HOLDING COMPANY LLCInventors: Jonas F. Dias, Angelo Ciarlini, Romulo D. Pinho, Vinicius Gottin, Andre Maximo, Edward Pacheco, David Holmes, Keshava Rangarajan, Scott David Senften, Joseph Blake Winston, Xi Wang, Clifton Brent Walker, Ashwani Dev, Chandra Yeleshwarapu, Nagaraj Srinivasan
-
Publication number: 20210374074Abstract: One example method includes determining a fall through threshold value for a cache, computing a length ‘s’ of a sequence that is close to LRU eviction, and the length ‘s’ is computed when a current fall through metric value is greater than the fall through threshold value, when the sequence length ‘s’ is greater than a predetermined threshold length ‘k,’ performing a first shift of an LRU position to define a protected queue of the cache, initializing a counter with a value of ‘r’, decrementing the counter each time a requested page is determined to be included in the protected queue, until ‘r’=0, and performing a second shift of the LRU position.Type: ApplicationFiled: May 29, 2020Publication date: December 2, 2021Inventors: Hugo de Oliveira Barbalho, Jonas F. Dias, Vinicius Michel Gottin
-
Patent number: 11093404Abstract: Managing a cache memory in a storage system includes maintaining a first queue that stores data indictive of the read requests for a particular logical storage unit of the storage system in an order that the read requests are received by the storage system and maintaining a second queue that stores data indictive of the read requests for the particular logical storage unit in a sort order corresponding to page numbers of the read requests, the second queue persisting for a plurality of iterations of read requests. A read request is received and data indicative of the read request is placed in the first queue and in the second queue while maintaining the sort order of the second queue. The second queue is used to determine a prefetch metric that varies according to a number of adjacent elements in the second queue.Type: GrantFiled: October 14, 2019Date of Patent: August 17, 2021Assignee: EMC IP Holding Company LLCInventors: Vinicius Gottin, Jonas F. Dias, Hugo de Oliveira Barbalho, Romulo D. Pinho, Tiago Calmon
-
Patent number: 11080613Abstract: Methods and apparatus are provided for process monitoring based on large-scale combinations of time series data. An exemplary method comprises generating a model from time series data for a given target time series; determining whether a first difference between measured values and predicted values based on the model exceeds a predefined threshold indicating a target prediction error; in response to a detected target prediction error, performing evaluations of (i) a neighborhood coherence comprising an average of variables of the model weighted by corresponding coefficients on a predefined neighborhood time window, and/or (ii) a second difference between a given value of at least one variable in the model and an average value of the at least one variable based on a training dataset; providing notifications when first predefined criteria based on the evaluations are satisfied; and updating the model when second predefined criteria based on the evaluations are satisfied.Type: GrantFiled: April 29, 2016Date of Patent: August 3, 2021Assignee: EMC IP Holding Company LLCInventors: Mauricio Melo Camara, Angelo E. M. Ciarlini, Jonas F. Dias, André Maximo, José Carlos Costa da Silva Pinto, Monica Barros, Rafael Marinho Soares, Thiago de Sa Feital
-
Patent number: 11030667Abstract: Product planning techniques are provided that recommend compositions of product features for weighted heterogeneous consumer segments using regression trees. An exemplary method comprises obtaining historical consumer data comprising product preferences for existing product items for multiple consumer segments; obtaining product features indicating characteristics for each existing product item; prioritizing the consumer segments by obtaining a weight indicating an interest in each consumer segment; computing a total performance metric, for each product item, by calculating a dot product between the consumer segment weights and respective preferences of the consumer segments regarding a given product item; obtaining a regression tree from the existing product items to predict the total performance metric in terms of corresponding product features; and selecting a combination of the product features to be used in future product items based on identified paths in the regression tree.Type: GrantFiled: October 31, 2016Date of Patent: June 8, 2021Assignee: EMC IP Holding Company LLCInventors: Adriana Bechara Prado, Victor Bursztyn, Jonas F. Dias, André de Almeida Maximo, Angelo E. M. Ciarlini
-
Patent number: 11004025Abstract: Techniques are provided for simulation-based online workflow optimization.Type: GrantFiled: November 1, 2017Date of Patent: May 11, 2021Assignee: EMC IP Holding Company LLCInventors: Vinícius Michel Gottin, Angelo E. M. Ciarlini, Jonas F. Dias, Daniel Sadoc Menasché, Alex L. Bordignon, Fábio A. M. Porto
-
Publication number: 20210109860Abstract: Managing a cache memory in a storage system includes maintaining a first queue that stores data indictive of the read requests for a particular logical storage unit of the storage system in an order that the read requests are received by the storage system and maintaining a second queue that stores data indictive of the read requests for the particular logical storage unit in a sort order corresponding to page numbers of the read requests, the second queue persisting for a plurality of iterations of read requests. A read request is received and data indicative of the read request is placed in the first queue and in the second queue while maintaining the sort order of the second queue. The second queue is used to determine a prefetch metric that varies according to a number of adjacent elements in the second queue.Type: ApplicationFiled: October 14, 2019Publication date: April 15, 2021Applicant: EMC IP Holding Company LLCInventors: Vinicius Gottin, Jonas F. Dias, Hugo de Oliveira Barbalho, Romulo D. Pinho, Tiago Calmon
-
Publication number: 20210109859Abstract: Managing a cache memory in a storage system includes maintaining a queue that stores data indictive of the read requests for a particular logical storage unit of the storage system in an order that the read requests are received by the storage system, receiving a read request for a particular page of the particular logical storage unit, and removing a number of elements in the queue and resizing the queue in response to the queue being full. Managing the cache memory also includes placing data indicative of the read request in the queue, determining a prefetch metric that varies according to a number of adjacent elements in a sorted version of the queue having a difference that is less than a predetermined value and greater than zero, and prefetching a plurality of pages that come after the particular page sequentially if the prefetch metric is greater than a predefined value.Type: ApplicationFiled: October 14, 2019Publication date: April 15, 2021Applicant: EMC IP Holding Company LLCInventors: Vinicius Gottin, Jonas F. Dias, Hugo de Oliveira Barbalho, Romulo D. Pinho, Tiago Calmon
-
Patent number: 10977177Abstract: A pre-fetching technique determines what data, if any, to pre-fetch on a per-logical storage unit basis. For a given logical storage unit, what, if any, data to prefetch is based at least in part on a collective sequential proximity of the most recently requested pages of the logical storage unit. Determining what, if any, data to pre-fetch for a logical storage unit may include determining a value for a proximity metric indicative of the collective sequential proximity of the most recently requested pages, comparing the value to a predetermined proximity threshold value, and determining whether to pre-fetch one or more pages of the logical storage unit based on the result of the comparison. A data structure may be maintained that includes most recently requested pages for one or more logical storage units. This data structure may be a table.Type: GrantFiled: July 11, 2019Date of Patent: April 13, 2021Assignee: EMC IP Holding Company LLCInventors: Vinicius Gottin, Tiago Calmon, Romulo D. Pinho, Jonas F. Dias, Eduardo Sousa, Roberto Nery Stelling Neto, Hugo de Oliveira Barbalho
-
Patent number: 10909503Abstract: Methods and apparatus are provided for taking snapshots to train prediction models and improve workflow execution.Type: GrantFiled: November 30, 2016Date of Patent: February 2, 2021Assignee: EMC IP Holding Company LLCInventors: Jonas F. Dias, Angelo E. M. Ciarlini, Rômulo Teixeira de Abreu Pinho
-
Patent number: 10901782Abstract: Techniques are provided for dataflow execution time estimation for distributed processing frameworks. An exemplary method comprises: obtaining an input dataset for a dataflow for execution; determining a substantially minimal data unit for a given operation of the dataflow processed by the given operation; estimating a number of rounds required to execute a number of data units in the input dataset using nodes assigned to execute the given operation; determining an execution time spent by the given operation to process one data unit; estimating the execution time for the given operation based on the execution time spent by the given operation to process one data unit and the number of rounds required to execute the number of data units in the input dataset; and executing the given operation with the input dataset. A persistent cost model is optionally employed to record the execution times of known dataflow operations.Type: GrantFiled: July 20, 2018Date of Patent: January 26, 2021Assignee: EMC IP Holding Company LLCInventors: Vinícius Michel Gottin, Jonas F. Dias, Edward José Pacheco Condori, Angelo E. M. Ciarlini, Bruno Carlos da Cunha Costa, Fábio André Machado Porto, Paulo de Figueiredo Pires, Yania Molina Souto, Wagner dos Santos Vieira