Explanipedia

Analyzing source code vulnerabilities in the D2A dataset with ML ensembles and C-BERT Open

Saurabh Pujar, Yunhui Zheng, Luca Buratti, Burn Lewis, Y. Q. Chen , et al. · 2024

Computer science

Static analysis tools are widely used for vulnerability detection as they can analyze programs with complex behavior and millions of lines of code. Despite their popularity, static analysis tools are known to generate an excess of false po…

CONCORD: Clone-aware Contrastive Learning for Source Code Open

Yangruibo Ding, Saikat Chakraborty, Luca Buratti, Saurabh Pujar, Alessandro Morari , et al. · 2023

Computer science Mathematics

Deep Learning (DL) models to analyze source code have shown immense promise during the past few years. More recently, self-supervised pre-training has gained traction for learning generic code representations valuable for many downstream S…

Incorporating Signal Awareness in Source Code Modeling: An Application to Vulnerability Detection Open

Sahil Suneja, Yufan Zhuang, Yunhui Zheng, Jim Laredo, Alessandro Morari , et al. · 2023

Computer science Mathematics Chemistry

AI models of code have made significant progress over the past few years. However, many models are actually not learning task-relevant source code features. Instead, they often fit non-relevant but correlated data, leading to a lack of rob…

Automated Code generation for Information Technology Tasks in YAML through Large Language Models Open

Saurabh Pujar, Luca Buratti, Xiaojie Guo, Nicolas Dupuis, Burn Lewis , et al. · 2023

Computer science Mathematics Physics

The recent improvement in code generation capabilities due to the use of large language models has mainly benefited general purpose programming languages. Domain specific languages, such as the ones used for IT Automation, have received fa…

Global virtual address space consistency model Open

Charles R. Johns, James A. Kahle, Martin Ohmacht, Changhoan Kim, José R. Brunheroto , et al. · 2023

Computer science

An approach is disclosed that maintains a consistent view of a virtual address by a local node which writes a first value to the virtual address and, after writing the first value, establishes a snapshot consistency state of the virtual ad…

System and method of storing and analyzing information Open

John Feo, David J. Haglin, Alessandro Morari, Antonino Tumeo, Oreste Villa , et al. · 2023

Computer science

A system and method of storing and analyzing information is disclosed. The system includes a compiler layer to convert user queries to data parallel executable code. The system further includes a library of multithreaded algorithms, proces…

Towards Learning (Dis)-Similarity of Source Code from Program Contrasts Open

Yangruibo Ding, Luca Buratti, Saurabh Pujar, Alessandro Morari, Baishakhi Ray , et al. · 2022

Computer science

Yangruibo Ding, Luca Buratti, Saurabh Pujar, Alessandro Morari, Baishakhi Ray, Saikat Chakraborty. Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 2022.

VELVET: a noVel Ensemble Learning approach to automatically locate VulnErable sTatements Open

Yangruibo Ding, Sahil Suneja, Yunhui Zheng, Jim Laredo, Alessandro Morari , et al. · 2021

Computer science Biology

Automatically locating vulnerable statements in source code is crucial to assure software security and alleviate developers' debugging efforts. This becomes even more important in today's software ecosystem, where vulnerable code can flow …

Data-Driven AI Model Signal-Awareness Enhancement and Introspection Open

Sahil Suneja, Yufan Zhuang, Yunhui Zheng, Jim Laredo, Alessandro Morari · 2021

Computer science Psychology Economics

AI modeling for source code understanding tasks has been making significant progress, and is being adopted in production development pipelines. However, reliability concerns, especially whether the models are actually learning task-related…

Data-Driven and SE-assisted AI Model Signal-Awareness Enhancement and Introspection Open

Sahil Suneja, Yufan Zhuang, Yunhui Zheng, Jim Laredo, Alessandro Morari · 2021

Computer science Psychology Philosophy

AI modeling for source code understanding tasks has been making significant progress, and is being adopted in production development pipelines. However, reliability concerns, especially whether the models are actually learning task-related…

Towards Learning (Dis)-Similarity of Source Code from Program Contrasts Open

Yangruibo Ding, Luca Buratti, Saurabh Pujar, Alessandro Morari, Baishakhi Ray , et al. · 2021

Computer science Chemistry Biology

Understanding the functional (dis)-similarity of source code is significant for code modeling tasks such as software vulnerability and code clone detection. We present DISCO(DIS-similarity of COde), a novel self-supervised model focusing o…

Contrastive Learning for Source Code with Structural and Functional Properties Open

Yangruibo Ding, Luca Buratti, Saikat Chakraborty, Saurabh Pujar, Alessandro Morari , et al. · 2021

Computer science Biology Chemistry

Pre-trained transformer models have recently shown promises for understanding the source code. Most existing works expect to understand code from the textual features and limited structural knowledge of code. However, the program functiona…

Software Vulnerability Detection via Deep Learning over Disaggregated Code Graph Representation Open

Yufan Zhuang, Sahil Suneja, Veronika Thost, Giacomo Domeniconi, Alessandro Morari , et al. · 2021

Computer science Political science

Identifying vulnerable code is a precautionary measure to counter software security breaches. Tedious expert effort has been spent to build static analyzers, yet insecure patterns are barely fully enumerated. This work explores a deep lear…

Probing model signal-awareness via prediction-preserving input minimization Open

Sahil Suneja, Yunhui Zheng, Yufan Zhuang, Jim Laredo, Alessandro Morari · 2021

Computer science Philosophy Economics

This work explores the signal awareness of AI models for source code understanding. Using a software vulnerability detection use case, we evaluate the models' ability to capture the correct vulnerability signals to produce their prediction…

D2A: A Dataset Built for AI-Based Vulnerability Detection Methods Using Differential Analysis Open

Yunhui Zheng, Saurabh Pujar, Burn Lewis, Luca Buratti, Edward S. Epstein , et al. · 2021

Computer science Biology

Static analysis tools are widely used for vulnerability detection as they understand programs with complex behavior and millions of lines of code. Despite their popularity, static analysis tools are known to generate an excess of false pos…

Exploring Software Naturalness through Neural Language Models Open

Luca Buratti, Saurabh Pujar, Mihaela Bornea, Jason S. McCarley, Yunhui Zheng , et al. · 2020

Computer science Physics

The Software Naturalness hypothesis argues that programming languages can be understood through the same techniques used in natural language processing. We explore this hypothesis through the use of a pre-trained transformer-based language…

Learning to map source code to software vulnerability using code-as-a-graph Open

Sahil Suneja, Yunhui Zheng, Yufan Zhuang, Jim Laredo, Alessandro Morari · 2020

Computer science

We explore the applicability of Graph Neural Networks in learning the nuances of source code from a security perspective. Specifically, whether signatures of vulnerabilities in source code can be learned from its graph representation, in t…

High level synthesis of RDF queries for graph analytics Open

Vito Giovanni Castellana, Marco Minutoli, Alessandro Morari, Antonino Tumeo, Marco Lattuada , et al. · 2015

Computer science

In this paper we present a set of techniques that enable the synthesis of efficient custom accelerators for memory intensive, irregular applications. To address the challenges of irregular applications (large memory footprint, unpredictabl…

Alessandro Morari YOU? Author Swipe