R.I.P.
๐ป
Ghosted
ChatGPT for Vulnerability Detection, Classification, and Repair: How Far Are We?
October 15, 2023 ยท Entered Twilight ยท ๐ Asia-Pacific Software Engineering Conference
Repo contents: ChatGPT_prompts, LICENSE, README.md, avr_baselines, download_data.sh, img, sev_baselines, svc_baselines, svp_baselines
Authors
Michael Fu, Chakkrit Tantithamthavorn, Van Nguyen, Trung Le
arXiv ID
2310.09810
Category
cs.SE: Software Engineering
Cross-listed
cs.CR
Citations
122
Venue
Asia-Pacific Software Engineering Conference
Repository
https://github.com/awsm-research/ChatGPT4Vul
โญ 15
Last Checked
1 month ago
Abstract
Large language models (LLMs) like ChatGPT (i.e., gpt-3.5-turbo and gpt-4) exhibited remarkable advancement in a range of software engineering tasks associated with source code such as code review and code generation. In this paper, we undertake a comprehensive study by instructing ChatGPT for four prevalent vulnerability tasks: function and line-level vulnerability prediction, vulnerability classification, severity estimation, and vulnerability repair. We compare ChatGPT with state-of-the-art language models designed for software vulnerability purposes. Through an empirical assessment employing extensive real-world datasets featuring over 190,000 C/C++ functions, we found that ChatGPT achieves limited performance, trailing behind other language models in vulnerability contexts by a significant margin. The experimental outcomes highlight the challenging nature of vulnerability prediction tasks, requiring domain-specific expertise. Despite ChatGPT's substantial model scale, exceeding that of source code-pre-trained language models (e.g., CodeBERT) by a factor of 14,000, the process of fine-tuning remains imperative for ChatGPT to generalize for vulnerability prediction tasks. We publish the studied dataset, experimental prompts for ChatGPT, and experimental results at https://github.com/awsm-research/ChatGPT4Vul.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
๐ Similar Papers
In the same crypt โ Software Engineering
R.I.P.
๐ป
Ghosted
GraphCodeBERT: Pre-training Code Representations with Data Flow
R.I.P.
๐ป
Ghosted
DeepTest: Automated Testing of Deep-Neural-Network-driven Autonomous Cars
R.I.P.
๐ป
Ghosted
Microservices: yesterday, today, and tomorrow
R.I.P.
๐ป
Ghosted
Devign: Effective Vulnerability Identification by Learning Comprehensive Program Semantics via Graph Neural Networks
R.I.P.
๐ป
Ghosted