Academic_LLM_Sec_Papers

Academic Papers about LLM Application on Security

Stars: 54

Visit

Academic_LLM_Sec_Papers is a curated collection of academic papers related to LLM Security Application. The repository includes papers sorted by conference name and published year, covering topics such as large language models for blockchain security, software engineering, machine learning, and more. Developers and researchers are welcome to contribute additional published papers to the list. The repository also provides information on listed conferences and journals related to security, networking, software engineering, and cryptography. The papers cover a wide range of topics including privacy risks, ethical concerns, vulnerabilities, threat modeling, code analysis, fuzzing, and more.

README:

Academic Papers About LLM Application on Cyber Security.

A curated LLM Security Application related academic papers. All papers are sorted based on the conference name and published year.

Welcome developers or researchers to add more published papers to this list.

The cryptocurrency donation address: 0xCC28B05fE858CDbc8692E3272A4451111bDCf700.

Welcome to visit my homepage and Google Scholar.

Table of Listed Conferences

Security & Crypto	Networking & Database	Software Engineering & Programming Language	Machine Learning
IEEE S&P	SIGMETRICS	ICSE	AAAI
ACM CCS	ICDE	ESEC/FSE	ACL
USENIX Security	VLDB	ASE	ICML
NDSS	ACM SIGMOD	ACM PLDI	NeurIPS
IEEE DSN	IEEE INFOCOM	ACM OOPSLA
SRCS	IMC	ISSTA
RAID	WWW	ACM POPL
CAV

Also including:

Survey, ACL.

Literature Review

2024

Large Language Models for Blockchain Security: A Systematic Literature Review.

A survey on large language model (llm) security and privacy: The good, the bad, and the ugly.

Large language models for software engineering: A systematic literature review.

Securing Large Language Models: Threats, Vulnerabilities and Responsible Practices.

Unveiling security, privacy, and ethical concerns of chatgpt.

Conference

S&P

CCS

2024

PromptFuzz: Prompt Fuzzing for Fuzz Driver Generation.

GenderCARE: A Comprehensive Framework for Assessing and Reducing Gender Bias in Large Language Models.

2023

Stealing the Decoding Algorithms of Language Models.

Large Language Models for Code: Security Hardening and Adversarial Testing.

Not What You've Signed Up For: Compromising Real-World LLM-Integrated Applications with Indirect Prompt Injection.

Protecting intellectual property of large language model-based code generation apis via watermarks.

Dp-forward: Fine-tuning and inference on language models with differential privacy in forward pass.

USENIX Security

2024

Rapid Adoption, Hidden Risks: The Dual Impact of Large Language Model Customization.

PENTESTGPT: An LLM-empowered Automatic Penetration Testing Tool

Don't Listen To Me: Understanding and Exploring Jailbreak Prompts of Large Language Models.

Large Language Models for Code Analysis: Do LLMs Really Do Their Job?.

EaTVul: ChatGPT-based Evasion Attack Against Software Vulnerability Detection.

Fuzzing BusyBox: Leveraging LLM and Crash Reuse for Embedded Bug Unearthing.

Prompt Stealing Attacks Against Text-to-Image Generation Models.

2023

Lost at c: A user study on the security implications of large language model code assistants.

CodexLeaks: Privacy Leaks from Code Generation Language Models in GitHub Copilot.

{Two-in-One}: A Model Hijacking Attack Against Text Generation Models.

2021

Extracting Training Data from Large Language Models.

You Autocomplete Me: Poisoning Vulnerabilities in Neural Code Completion.

NDSS

2024

LMSanitator: Defending Prompt-Tuning Against Task-Agnostic Backdoors

Analysis of the Effect of the Difference between Japanese and English Input on ChatGPT-Generated Secure Codes.

MASTERKEY: Automated Jailbreaking of Large Language Model Chatbots.

DeGPT: Optimizing Decompiler Output with LLM.

DEMASQ: Unmasking the ChatGPT Wordsmith.

Large Language Model guided Protocol Fuzzing.

Facilitating Threat Modeling by Leveraging Large Language Models

OOPSLA

2024

Enhancing Static Analysis for Practical Bug Detection: An LLM-Integrated Approach.

PyDex: Repairing Bugs in Introductory Python Assignments using LLMs.

ICSE

2024

Large Language Models are Edge-Case Fuzzers: Testing Deep Learning Libraries via FuzzGPT

Fuzz4All: Universal Fuzzing with Large Language Models.

LLMParser: An Exploratory Study on Using Large Language Models for Log Parsing.

Exploring the Potential of ChatGPT in Automated Code Refinement: An Empirical Study.

Large Language Models are Edge-Case Fuzzers: Testing Deep Learning Libraries via FuzzGPT.

UniLog: Automatic Logging via LLM and In-Context Learning.

Prompting Is All You Need: Automated Android Bug Replay with Large Language Models.

Large Language Models for Test-Free Fault Localization.

Large language models are few-shot testers: Exploring llm-based general bug reproduction.

Large Language Models are Few-Shot Summarizers: Multi-Intent Comment Generation via In-Context Learning.

Large Language Models are Edge-Case Generators: Crafting Unusual Programs for Fuzzing Deep Learning Libraries.

GPTScan: Detecting Logic Vulnerabilities in Smart Contracts by Combining GPT with Program Analysis.

Automated Program Repair in the Era of Large Pre-trained Language Models.

2023

Does data sampling improve deep learning-based vulnerability detection? Yeas! and Nays!.

An Empirical Study of Deep Learning Models for Vulnerability Detection.

RepresentThemAll: A Universal Learning Representation of Bug Reports.

Contrabert: Enhancing code pre-trained models via contrastive learning.

On the robustness of code generation techniques: An empirical study on github copilot.

Two sides of the same coin: Exploiting the impact of identifiers in neural code comprehension.

Automated repair of programs from large language models.

Cctest: Testing and repairing code completion systems.

CodaMosa: Escaping Coverage Plateaus in Test Generation with Pre-trained Large Language Models.

Impact of Code Language Models on Automated Program Repair.

2022

ReCode: Robustness Evaluation of Code Generation Models.

CAV

2024

Enchanting Program Specification Synthesis by Large Language Models using Static Analysis and Program Verification.

ASE

2024

Better Patching Using LLM Prompting, via Self-Consistency.

Towards Autonomous Testing Agents via Conversational Large Language Models.

Let's Chat to Find the APIs: Connecting Human, LLM and Knowledge Graph through AI Chain.

Log Parsing: How Far Can ChatGPT Go?.

2022

Robust Learning of Deep Predictive Models from Noisy and Imbalanced Software Engineering Datasets.

ISSTA

2024

Large Language Models Are Zero-Shot Fuzzers: Fuzzing Deep-Learning Libraries via Large Language Models.

2023

How Effective Are Neural Networks for Fixing Security Vulnerabilities.

ESEC/FSE

2023

InferFix: End-to-End Program Repair with LLMs.

Getting pwn'd by ai: Penetration testing with large language models.

Llm-based code generation method for golang compiler testing.

Assisting static analysis with large language models: A chatgpt experiment.

Assess and Summarize: Improve Outage Understanding with Large Language Models.

2022

Generating realistic vulnerabilities via neural code editing: an empirical study.

You see what I want you to see: poisoning vulnerabilities in neural code search.

2021

Vulnerability detection with fine-grained interpretations.

ACL

2024

Not the end of story: An evaluation of chatgpt-driven vulnerability description mappings.

Understanding Programs by Exploiting (Fuzzing) Test Cases.

2023

Backdooring Neural Code Search.

Membership inference attacks against language models via neighbourhood comparison.

Are you copying my model? protecting the copyright of large language models for eaas via backdoor watermark.

2022

ReCode: Robustness Evaluation of Code Generation Models.

Knowledge unlearning for mitigating privacy risks in language models.

2018

Contamination attacks and mitigation in multi-party machine learning.

AAAI

2022

Adversarial Robustness of Deep Code Comment Generation.

ICML

2023

Bag of tricks for training data extraction from language models.

2022

Deduplicating training data mitigates privacy risks in language models.

NeurIPS

2022

Recovering private text in federated learning of language models.

WWW

2024

ZipZap: Efficient Training of Language Models for Large-Scale Fraud Detection on Blockchain.

2022

Coprotector: Protect open-source code against unauthorized training usage with data poisoning.

journal

TIFS

(Security) Assertions by Large Language Models.

A Performance-Sensitive Malware Detection System Using Deep Learning on Mobile Devices A Performance-Sensitive Malware Detection System Using Deep Learning on Mobile Devices.

TDSC

PrivacyAsst: Safeguarding User Privacy in Tool-Using Large Language Model Agents.

CD-VulD: Cross-Domain Vulnerability Discovery Based on Deep Domain Adaptation.

TSE

Software Testing with Large Language Models: Survey, Landscape, and Vision.

An Empirical Evaluation of Using Large Language Models for Automated Unit Test Generation.

Deep Learning Based Vulnerability Detection: Are We There Yet?.

On the Value of Oversampling for Deep Learning in Software Defect Prediction.

TOSEM

Prompt Sapper: A LLM-Empowered Production Tool for Building AI Chains.

Adversarial Robustness of Deep Code Comment Generation .

Miscellaneous

LLM4Fuzz: Guided Fuzzing of Smart Contracts with Large Language Models

CHEMFUZZ: Large Language Models-assisted Fuzzing for Quantum Chemistry Software Bug Detection

Attack Prompt Generation for Red Teaming and Defending Large Language Models

For Tasks:

Click tags to check more tools for each tasks

analyze vulnerabilities conduct threat modeling perform code analysis explore ethical concerns research blockchain security

For Jobs:

researcher security analyst data scientist software engineer cybersecurity consultant

Alternative AI tools for Academic_LLM_Sec_Papers

Similar Open Source Tools

Academic_LLM_Sec_Papers

github

: 54

awesome-MLSecOps

Awesome MLSecOps is a curated list of open-source tools, resources, and tutorials for MLSecOps (Machine Learning Security Operations). It includes a wide range of security tools and libraries for protecting machine learning models against adversarial attacks, as well as resources for AI security, data anonymization, model security, and more. The repository aims to provide a comprehensive collection of tools and information to help users secure their machine learning systems and infrastructure.

github

: 204

Awesome-Embodied-AI

Awesome-Embodied-AI is a curated list of papers on Embodied AI and related resources, tracking and summarizing research and industrial progress in the field. It includes surveys, workshops, tutorials, talks, blogs, and papers covering various aspects of Embodied AI, such as vision-language navigation, large language model-based agents, robotics, and more. The repository welcomes contributions and aims to provide a comprehensive overview of the advancements in Embodied AI.

github

: 349

Next-Generation-LLM-based-Recommender-Systems-Survey

The Next-Generation LLM-based Recommender Systems Survey is a comprehensive overview of the latest advancements in recommender systems leveraging Large Language Models (LLMs). The survey covers various paradigms, approaches, and applications of LLMs in recommendation tasks, including generative and non-generative models, multimodal recommendations, personalized explanations, and industrial deployment. It discusses the comparison with existing surveys, different paradigms, and specific works in the field. The survey also addresses challenges and future directions in the domain of LLM-based recommender systems.

github

: 84

inference

Xorbits Inference (Xinference) is a powerful and versatile library designed to serve language, speech recognition, and multimodal models. With Xorbits Inference, you can effortlessly deploy and serve your or state-of-the-art built-in models using just a single command. Whether you are a researcher, developer, or data scientist, Xorbits Inference empowers you to unleash the full potential of cutting-edge AI models.

github

: 7.4k

LLMSys-PaperList

This repository provides a comprehensive list of academic papers, articles, tutorials, slides, and projects related to Large Language Model (LLM) systems. It covers various aspects of LLM research, including pre-training, serving, system efficiency optimization, multi-model systems, image generation systems, LLM applications in systems, ML systems, survey papers, LLM benchmarks and leaderboards, and other relevant resources. The repository is regularly updated to include the latest developments in this rapidly evolving field, making it a valuable resource for researchers, practitioners, and anyone interested in staying abreast of the advancements in LLM technology.

github

: 869

Awesome-LLMs-on-device

Welcome to the ultimate hub for on-device Large Language Models (LLMs)! This repository is your go-to resource for all things related to LLMs designed for on-device deployment. Whether you're a seasoned researcher, an innovative developer, or an enthusiastic learner, this comprehensive collection of cutting-edge knowledge is your gateway to understanding, leveraging, and contributing to the exciting world of on-device LLMs.

github

: 747

SimAI

SimAI is the industry's first full-stack, high-precision simulator for AI large-scale training. It provides detailed modeling and simulation of the entire LLM training process, encompassing framework, collective communication, network layers, and more. This comprehensive approach offers end-to-end performance data, enabling researchers to analyze training process details, evaluate time consumption of AI tasks under specific conditions, and assess performance gains from various algorithmic optimizations.

github

: 281

AIOS

AIOS, a Large Language Model (LLM) Agent operating system, embeds large language model into Operating Systems (OS) as the brain of the OS, enabling an operating system "with soul" -- an important step towards AGI. AIOS is designed to optimize resource allocation, facilitate context switch across agents, enable concurrent execution of agents, provide tool service for agents, maintain access control for agents, and provide a rich set of toolkits for LLM Agent developers.

github

: 4.0k

Grounded_3D-LLM

Grounded 3D-LLM is a unified generative framework that utilizes referent tokens to reference 3D scenes, enabling the handling of sequences that interleave 3D and textual data. It transforms 3D vision tasks into language formats through task-specific prompts, curating grounded language datasets and employing Contrastive Language-Scene Pre-training (CLASP) to bridge the gap between 3D vision and language models. The model covers tasks like 3D visual question answering, dense captioning, object detection, and language grounding.

github

: 97

buffer-of-thought-llm

Buffer of Thoughts (BoT) is a thought-augmented reasoning framework designed to enhance the accuracy, efficiency, and robustness of large language models (LLMs). It introduces a meta-buffer to store high-level thought-templates distilled from problem-solving processes, enabling adaptive reasoning for efficient problem-solving. The framework includes a buffer-manager to dynamically update the meta-buffer, ensuring scalability and stability. BoT achieves significant performance improvements on reasoning-intensive tasks and demonstrates superior generalization ability and robustness while being cost-effective compared to other methods.

github

: 341

paper-reading

This repository is a collection of tools and resources for deep learning infrastructure, covering programming languages, algorithms, acceleration techniques, and engineering aspects. It provides information on various online tools for chip architecture, CPU and GPU benchmarks, and code analysis. Additionally, it includes content on AI compilers, deep learning models, high-performance computing, Docker and Kubernetes tutorials, Protobuf and gRPC guides, and programming languages such as C++, Python, and Shell. The repository aims to bridge the gap between algorithm understanding and engineering implementation in the fields of AI and deep learning.

github

: 237

terraform-genai-doc-summarization

This solution showcases how to summarize a large corpus of documents using Generative AI. It provides an end-to-end demonstration of document summarization going all the way from raw documents, detecting text in the documents and summarizing the documents on-demand using Vertex AI LLM APIs, Cloud Vision Optical Character Recognition (OCR) and BigQuery.

github

: 85

SynapseML

SynapseML (previously known as MMLSpark) is an open-source library that simplifies the creation of massively scalable machine learning (ML) pipelines. It provides simple, composable, and distributed APIs for various machine learning tasks such as text analytics, vision, anomaly detection, and more. Built on Apache Spark, SynapseML allows seamless integration of models into existing workflows. It supports training and evaluation on single-node, multi-node, and resizable clusters, enabling scalability without resource wastage. Compatible with Python, R, Scala, Java, and .NET, SynapseML abstracts over different data sources for easy experimentation. Requires Scala 2.12, Spark 3.4+, and Python 3.8+.

github

: 5.0k

camel

CAMEL is an open-source library designed for the study of autonomous and communicative agents. We believe that studying these agents on a large scale offers valuable insights into their behaviors, capabilities, and potential risks. To facilitate research in this field, we implement and support various types of agents, tasks, prompts, models, and simulated environments.

github

: 11.5k

AReaL

AReaL (Ant Reasoning RL) is an open-source reinforcement learning system developed at the RL Lab, Ant Research. It is designed for training Large Reasoning Models (LRMs) in a fully open and inclusive manner. AReaL provides reproducible experiments for 1.5B and 7B LRMs, showcasing its scalability and performance across diverse computational budgets. The system follows an iterative training process to enhance model performance, with a focus on mathematical reasoning tasks. AReaL is equipped to adapt to different computational resource settings, enabling users to easily configure and launch training trials. Future plans include support for advanced models, optimizations for distributed training, and exploring research topics to enhance LRMs' reasoning capabilities.

github

: 538

For similar tasks

Academic_LLM_Sec_Papers

github

: 54

HackBot

HackBot is an AI-powered cybersecurity chatbot designed to provide accurate answers to cybersecurity-related queries, conduct code analysis, and scan analysis. It utilizes the Meta-LLama2 AI model through the 'LlamaCpp' library to respond coherently. The chatbot offers features like local AI/Runpod deployment support, cybersecurity chat assistance, interactive interface, clear output presentation, static code analysis, and vulnerability analysis. Users can interact with HackBot through a command-line interface and utilize it for various cybersecurity tasks.

github

: 232

vulnerability-analysis

The NVIDIA AI Blueprint for Vulnerability Analysis for Container Security showcases accelerated analysis on common vulnerabilities and exposures (CVE) at an enterprise scale, reducing mitigation time from days to seconds. It enables security analysts to determine software package vulnerabilities using large language models (LLMs) and retrieval-augmented generation (RAG). The blueprint is designed for security analysts, IT engineers, and AI practitioners in cybersecurity. It requires NVAIE developer license and API keys for vulnerability databases, search engines, and LLM model services. Hardware requirements include L40 GPU for pipeline operation and optional LLM NIM and Embedding NIM. The workflow involves LLM pipeline for CVE impact analysis, utilizing LLM planner, agent, and summarization nodes. The blueprint uses NVIDIA NIM microservices and Morpheus Cybersecurity AI SDK for vulnerability analysis.

github

: 86

For similar jobs

last_layer

last_layer is a security library designed to protect LLM applications from prompt injection attacks, jailbreaks, and exploits. It acts as a robust filtering layer to scrutinize prompts before they are processed by LLMs, ensuring that only safe and appropriate content is allowed through. The tool offers ultra-fast scanning with low latency, privacy-focused operation without tracking or network calls, compatibility with serverless platforms, advanced threat detection mechanisms, and regular updates to adapt to evolving security challenges. It significantly reduces the risk of prompt-based attacks and exploits but cannot guarantee complete protection against all possible threats.

github

: 79

aircrack-ng

Aircrack-ng is a comprehensive suite of tools designed to evaluate the security of WiFi networks. It covers various aspects of WiFi security, including monitoring, attacking (replay attacks, deauthentication, fake access points), testing WiFi cards and driver capabilities, and cracking WEP and WPA PSK. The tools are command line-based, allowing for extensive scripting and have been utilized by many GUIs. Aircrack-ng primarily works on Linux but also supports Windows, macOS, FreeBSD, OpenBSD, NetBSD, Solaris, and eComStation 2.

github

: 5.2k

reverse-engineering-assistant

ReVA (Reverse Engineering Assistant) is a project aimed at building a disassembler agnostic AI assistant for reverse engineering tasks. It utilizes a tool-driven approach, providing small tools to the user to empower them in completing complex tasks. The assistant is designed to accept various inputs, guide the user in correcting mistakes, and provide additional context to encourage exploration. Users can ask questions, perform tasks like decompilation, class diagram generation, variable renaming, and more. ReVA supports different language models for online and local inference, with easy configuration options. The workflow involves opening the RE tool and program, then starting a chat session to interact with the assistant. Installation includes setting up the Python component, running the chat tool, and configuring the Ghidra extension for seamless integration. ReVA aims to enhance the reverse engineering process by breaking down actions into small parts, including the user's thoughts in the output, and providing support for monitoring and adjusting prompts.

github

: 219

AutoAudit

AutoAudit is an open-source large language model specifically designed for the field of network security. It aims to provide powerful natural language processing capabilities for security auditing and network defense, including analyzing malicious code, detecting network attacks, and predicting security vulnerabilities. By coupling AutoAudit with ClamAV, a security scanning platform has been created for practical security audit applications. The tool is intended to assist security professionals with accurate and fast analysis and predictions to combat evolving network threats.

github

: 201

aif

Arno's Iptables Firewall (AIF) is a single- & multi-homed firewall script with DSL/ADSL support. It is a free software distributed under the GNU GPL License. The script provides a comprehensive set of configuration files and plugins for setting up and managing firewall rules, including support for NAT, load balancing, and multirouting. It offers detailed instructions for installation and configuration, emphasizing security best practices and caution when modifying settings. The script is designed to protect against hostile attacks by blocking all incoming traffic by default and allowing users to configure specific rules for open ports and network interfaces.

github

: 147

watchtower

AIShield Watchtower is a tool designed to fortify the security of AI/ML models and Jupyter notebooks by automating model and notebook discoveries, conducting vulnerability scans, and categorizing risks into 'low,' 'medium,' 'high,' and 'critical' levels. It supports scanning of public GitHub repositories, Hugging Face repositories, AWS S3 buckets, and local systems. The tool generates comprehensive reports, offers a user-friendly interface, and aligns with industry standards like OWASP, MITRE, and CWE. It aims to address the security blind spots surrounding Jupyter notebooks and AI models, providing organizations with a tailored approach to enhancing their security efforts.

github

: 187

Academic_LLM_Sec_Papers

github

: 54

DeGPT

DeGPT is a tool designed to optimize decompiler output using Large Language Models (LLM). It requires manual installation of specific packages and setting up API key for OpenAI. The tool provides functionality to perform optimization on decompiler output by running specific scripts.

github

: 64

Academic_LLM_Sec_Papers

README:

Academic Papers About LLM Application on Cyber Security.

Table of Listed Conferences

Table of Listed Journals

Also including:

Literature Review

2024

Conference

S&P

2024

2023

2022

2020

CCS

2024

2023

USENIX Security

2024

2023

2021

NDSS

2024

OOPSLA

2024

ICSE

2024

2023

2022

CAV

2024

ASE

2024

2022

ISSTA

2024

2023

ESEC/FSE

2023

2022

2021

ACL

2024

2023

2022

2018

AAAI

2022

ICML

2023

2022

NeurIPS

2022

WWW

2024

2022

journal

TIFS

TDSC

TSE

TOSEM

Miscellaneous

For Tasks:

For Jobs:

Alternative AI tools for Academic_LLM_Sec_Papers

Similar Open Source Tools

Academic_LLM_Sec_Papers

awesome-MLSecOps

Awesome-Embodied-AI

Next-Generation-LLM-based-Recommender-Systems-Survey

inference

LLMSys-PaperList

Awesome-LLMs-on-device

SimAI

AIOS

Grounded_3D-LLM

buffer-of-thought-llm

paper-reading

terraform-genai-doc-summarization

SynapseML