Patents by Inventor Alessandro Morari

Alessandro Morari has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Training data augmentation via program simplification

Patent number: 11947940

Abstract: Techniques regarding augmenting one or more training datasets for training one or more AI models are provided. For example, one or more embodiments described herein can comprise a system, which can comprise a memory that can store computer executable components. The system can also comprise a processor, operably coupled to the memory, and that can execute the computer executable components stored in the memory. The computer executable components can comprise training augmentation component that can generate an augmented training dataset for training an artificial intelligence model by extracting a simplified source code sample from a source code sample comprised within a training dataset.

Type: Grant

Filed: October 11, 2021

Date of Patent: April 2, 2024

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Sahil Suneja, Yufan Zhuang, Yunhui Zheng, Alessandro Morari, Jim Alain Laredo
Contextual embeddings for improving static analyzer output

Patent number: 11765193

Abstract: In a computer-implemented method for improving a static analyzer output, a processor receives a labeled data set with labeled true vulnerabilities and labeled false vulnerabilities. A processor receives pretrained contextual embeddings from a contextual embeddings model. A processor maps the true vulnerabilities and the false vulnerabilities to the pretrained contextual embeddings model. A processor generates a fine-tuned model with classifications for true vulnerabilities.

Type: Grant

Filed: December 30, 2020

Date of Patent: September 19, 2023

Assignee: International Business Machines Corporation

Inventors: Saurabh Pujar, Luca Buratti, Alessandro Morari, Jim Alain Laredo, Mihaela Ancuta Bornea, Jeffrey Scott McCarley, Yunhui Zheng
Source code fault detection

Patent number: 11762758

Abstract: Approaches presented herein enable fault detection. More specifically, implementation code of one or more functions is identified from source code. The implementation code of the one or more functions is converted to corresponding Abstract Syntax Trees (ASTs). The implementation code of the one or more functions is represented as a first plurality of sets of AST paths over the ASTs. Classification results for the one or more functions are generated with a classifier based on the first plurality of sets of AST paths for the implementation code of the one or more functions. Each of the classification results indicates a probability of having at least one fault in a corresponding function of the one or more functions. Fault detection results of the source code are generated based on the classification results.

Type: Grant

Filed: March 29, 2021

Date of Patent: September 19, 2023

Assignee: International Business Machines Corporation

Inventors: Shiwan Zhao, Bo Yang, HongLei Guo, Zhong Su, Yunhui Zheng, Jim Alain Laredo, Alessandro Morari, Marco Pistoia
Job-launch time reduction by node pre-configuration

Patent number: 11676013

Abstract: Based on historic job data, a computer processor can predict a configuration of a computer node for running a future computer job. The computer processor can pre-configure the computer node based on the predicted configuration. Responsive to receiving a submission of a job, the computer processor can launch the job on the pre-configured computer node.

Type: Grant

Filed: December 30, 2019

Date of Patent: June 13, 2023

Assignee: International Business Machines Corporation

Inventors: Eun Kyung Lee, Giacomo Domeniconi, Alessandro Morari, Yoonho Park
ARTIFICIAL INTELLIGENCE MODEL LEARNING INTROSPECTION

Publication number: 20230130781

Abstract: Techniques regarding AI model introspection are provided. For example, one or more embodiments described herein can comprise a system, which can comprise a memory that can store computer executable components. The system can also comprise a processor, operably coupled to the memory, and that can execute the computer executable components stored in the memory. The computer executable components can comprise model introspection component that can analyze artificial intelligence model learning behavior for a code understanding task by comparing an output of an artificial intelligence model with respect to a plurality of testing data subsets that have varying code complexity distributions.

Type: Application

Filed: October 21, 2021

Publication date: April 27, 2023

Inventors: Sahil Suneja, Yufan Zhuang, Yunhui Zheng, Alessandro Morari, Jim Alain Laredo
COMPLEXITY BASED ARTIFICIAL INTELLIGENCE MODEL TRAINING

Publication number: 20230115723

Abstract: Techniques regarding training one or more AI models for a source code understanding task are provided. For example, one or more embodiments described herein can comprise a system, which can comprise a memory that can store computer executable components. The system can also comprise a processor, operably coupled to the memory, and that can execute the computer executable components stored in the memory. The computer executable components can comprise a training component that can train an artificial intelligence model on source code samples for a source code understanding task. The source code samples can be ranked based on code complexity.

Type: Application

Filed: September 30, 2021

Publication date: April 13, 2023

Inventors: Sahil Suneja, Yufan Zhuang, Yunhui Zheng, Alessandro Morari, Jim Alain Laredo
TRAINING DATA AUGMENTATION VIA PROGRAM SIMPLIFICATION

Publication number: 20230113733

Abstract: Techniques regarding augmenting one or more training datasets for training one or more AI models are provided. For example, one or more embodiments described herein can comprise a system, which can comprise a memory that can store computer executable components. The system can also comprise a processor, operably coupled to the memory, and that can execute the computer executable components stored in the memory. The computer executable components can comprise training augmentation component that can generate an augmented training dataset for training an artificial intelligence model by extracting a simplified source code sample from a source code sample comprised within a training dataset.

Type: Application

Filed: October 11, 2021

Publication date: April 13, 2023

Inventors: Sahil Suneja, Yufan Zhuang, Yunhui Zheng, Alessandro Morari, Jim Alain Laredo
MAPPING SUBSURFACE INFRASTRUCTURE

Publication number: 20230029746

Abstract: A method and system for generating a map that shows subsurface structures includes the use of machine learning to develop a trained classifier that associates features in data with types of subsurface structures.

Type: Application

Filed: August 1, 2022

Publication date: February 2, 2023

Inventors: Cambiz Nick Raufi, Alessandro Morari, Giacomo Domeniconi
Probing Model Signal Awareness

Publication number: 20220358400

Abstract: A system, computer program product, and method are provided for probing model signal awareness. An iterative process is employed to systematically isolate one or more relevant tokens of an input sequence to generate a reduced input sequence. The reduced input sequence is validated and presented to a trained artificial intelligence (AI) model and prediction output is generated. The reduction process is continued while the prediction output stays the same as that of the input sequence, and until a minimal sub-sequence is identified. A signal existence in the minimal sub-sequence is verified and signal awareness of the trained AI model is evaluated. The evaluation includes measuring the verified signal existence against an original signal from the input sentence.

Type: Application

Filed: May 10, 2021

Publication date: November 10, 2022

Applicant: International Business Machines Corporation

Inventors: Yunhui Zheng, Sahil Suneja, Yufan Zhuang, Alessandro Morari, Jim Alain Laredo
Remediation strategy optimization for development, security and operations (DevSecOps)

Patent number: 11463478

Abstract: A method provides for collecting data source images from multiple repositories. Application dependencies are discovered from the data source images. Status results are determined based on vulnerability and compliance scanning of all dependent sources for each data source image. The status results are aggregated across all data source images for each of the multiple repositories. Remediations are determined for violations indicated by the aggregated status results. Each of the remediations is aggregated and ordered to define a single global remediation solution.

Type: Grant

Filed: October 29, 2019

Date of Patent: October 4, 2022

Assignee: International Business Machines Corporation

Inventors: Shripad Nadgowda, Alessandro Morari, James R. Doran
SOURCE CODE FAULT DETECTION

Publication number: 20220308984

Abstract: Approaches presented herein enable fault detection. More specifically, implementation code of one or more functions is identified from source code. The implementation code of the one or more functions is converted to corresponding Abstract Syntax Trees (ASTs). The implementation code of the one or more functions is represented as a first plurality of sets of AST paths over the ASTs. Classification results for the one or more functions are generated with a classifier based on the first plurality of sets of AST paths for the implementation code of the one or more functions. Each of the classification results indicates a probability of having at least one fault in a corresponding function of the one or more functions. Fault detection results of the source code are generated based on the classification results.

Type: Application

Filed: March 29, 2021

Publication date: September 29, 2022

Inventors: Shiwan Zhao, Bo Yang, HongLei Guo, Zhong Su, Yunhui Zheng, Jim Alain Laredo, Alessandro Morari, Marco Pistoia
Building pre-trained contextual embeddings for programming languages using specialized vocabulary

Patent number: 11429352

Abstract: A method, a computer system, and a computer program product for building pre-trained contextual embeddings is provided. Embodiments of the present invention may include collecting programming code. Embodiments of the present invention may include loading and preparing the programming code using a specialized programming language keywords-based vocabulary. Embodiments of the present invention may include creating contextual embeddings for the programming code. Embodiments of the present invention may include storing the contextual embeddings.

Type: Grant

Filed: July 1, 2020

Date of Patent: August 30, 2022

Assignee: International Business Machines Corporation

Inventors: Saurabh Pujar, Luca Buratti, Alessandro Morari, Jim Alain Laredo, Alfio Massimiliano Gliozzo, Gaetano Rossiello
NLP-based context-aware log mining for troubleshooting

Patent number: 11409754

Abstract: A method for context-aware data mining of a text document includes receiving a list of words parsed and preprocessed from an input query; computing a related distributed embedding representation for each word in the list of words using a word embedding model of the text document being queried; aggregating the related distributed embedding representations of all words in the list of words to represent the input query with a single embedding, by using one of an average of all the related distributed embedding representations or a maximum of all the related distributed embedding representations; retrieving a ranked list of document segments of N lines that are similar to the aggregated word embedding representation of the query, where N is a positive integer provided by the user; and returning the list of retrieved segments to a user.

Type: Grant

Filed: June 11, 2019

Date of Patent: August 9, 2022

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Giacomo Domeniconi, Eun Kyung Lee, Alessandro Morari
CONTEXTUAL EMBEDDINGS FOR IMPROVING STATIC ANALYZER OUTPUT

Publication number: 20220210178

Abstract: In a computer-implemented method for improving a static analyzer output, a processor receives a labeled data set with labeled true vulnerabilities and labeled false vulnerabilities. A processor receives pretrained contextual embeddings from a contextual embeddings model. A processor maps the true vulnerabilities and the false vulnerabilities to the pretrained contextual embeddings model. A processor generates a fine-tuned model with classifications for true vulnerabilities.

Type: Application

Filed: December 30, 2020

Publication date: June 30, 2022

Inventors: Saurabh Pujar, Luca Buratti, Alessandro Morari, Jim Alain Laredo, Mihaela Ancuta Bornea, Jeffrey Scott McCarley, Yunhui Zheng
Global virtual address space consistency model

Patent number: 11288194

Abstract: An approach is disclosed that maintains a consistent view of a virtual address by a local node which writes a first value to the virtual address and, after writing the first value, establishes a snapshot consistency state of the virtual address. The virtual address is shared amongst any number of processes and the processes includes a writing process and other processes that read from the virtual address. After writing the first value, the writing process writes a second value to the virtual address. Even after writing the second value, the first value is still visible to the other processes.

Type: Grant

Filed: December 12, 2018

Date of Patent: March 29, 2022

Assignee: International Business Machines Corporation

Inventors: Charles R. Johns, James A. Kahle, Martin Ohmacht, Changhoan Kim, Jose R. Brunheroto, Constantinos Evangelinos, Abdullah Kayi, Alessandro Morari, James C. Sexton, Patrick D. Siegl
BUILDING PRE-TRAINED CONTEXTUAL EMBEDDINGS FOR PROGRAMMING LANGUAGES USING SPECIALIZED VOCABULARY

Publication number: 20220004365

Abstract: A method, a computer system, and a computer program product for building pre-trained contextual embeddings is provided. Embodiments of the present invention may include collecting programming code. Embodiments of the present invention may include loading and preparing the programming code using a specialized programming language keywords-based vocabulary. Embodiments of the present invention may include creating contextual embeddings for the programming code. Embodiments of the present invention may include storing the contextual embeddings.

Type: Application

Filed: July 1, 2020

Publication date: January 6, 2022

Inventors: Saurabh Pujar, Luca Buratti, Alessandro Morari, Jim Alain Laredo, Alfio Massimiliano Gliozzo, Gaetano Rossiello
VULNERABILITY ANALYSIS USING CONTEXTUAL EMBEDDINGS

Publication number: 20220004642

Abstract: A method, a computer system, and a computer program product for vulnerability analysis using contextual embeddings is provided. Embodiments of the present invention may include collecting labeled code snippets. Embodiments of the present invention may include preparing the labeled code snippets. Embodiments of the present invention may include tokenizing the labeled code snippets. Embodiments of the present invention may include fine-tuning a model. Embodiments of the present invention may include collecting unlabeled code snippets. Embodiments of the present invention may include predicting a vulnerability of the unlabeled code snippets using the model.

Type: Application

Filed: July 1, 2020

Publication date: January 6, 2022

Inventors: Saurabh Pujar, Luca Buratti, Alessandro Morari, Jim Alain Laredo, Alfio Massimiliano Gliozzo, Gaetano Rossiello
Locality aware data loading for machine learning

Patent number: 11093862

Abstract: A data index sequence indexing a dataset is received. A location of a data sample identified by a data index in the data index sequence is determined. A scheme is generated for specifying a data movement based on the location. Responsive to determining that the location is a cache of a process, the data sample in the cache can be reused without having to load the data sample from a storage device.

Type: Grant

Filed: March 21, 2019

Date of Patent: August 17, 2021

Assignee: International Business Machines Corporation

Inventors: Chih-Chieh Yang, Guojing Cong, Bilge Acun, Alessandro Morari
JOB-LAUNCH TIME REDUCTION BY NODE PRE-CONFIGURATION

Publication number: 20210201130

Abstract: Based on historic job data, a computer processor can predict a configuration of a computer node for running a future computer job. The computer processor can pre-configure the computer node based on the predicted configuration. Responsive to receiving a submission of a job, the computer processor can launch the job on the pre-configured computer node.

Type: Application

Filed: December 30, 2019

Publication date: July 1, 2021

Inventors: Eun Kyung Lee, Giacomo Domeniconi, Alessandro Morari, Yoonho Park
REMEDIATION STRATEGY OPTIMIZATION FOR DEVELOPMENT, SECURITY AND OPERATIONS (DevSecOps)

Publication number: 20210126949

Abstract: A method provides for collecting data source images from multiple repositories. Application dependencies are discovered from the data source images. Status results are determined based on vulnerability and compliance scanning of all dependent sources for each data source image. The status results are aggregated across all data source images for each of the multiple repositories. Remediations are determined for violations indicated by the aggregated status results. Each of the remediations is aggregated and ordered to define a single global remediation solution.

Type: Application

Filed: October 29, 2019

Publication date: April 29, 2021

Inventors: Shripad Nadgowda, Alessandro Morari, James R. Doran

1 2 next