Awesome-LLM-Watermark

An up-to-date list of LLM watermark papers. 🔥🔥🔥

Stars: 212

This repository collects research papers on watermarking for text and images, with a focus on large language models (LLMs). The papers cover robustness, statistical analysis, topic-based watermarks, quality-detection trade-offs, dual watermarks, watermark collision, and more. The methods and frameworks they propose aim to protect intellectual property, detect machine-generated text, improve generation quality, and evaluate watermarking techniques.

README:

Watermark papers

This repo collects papers on watermarking for text and images.

Text watermark

  • Is Watermarking LLM-Generated Code Robust? ICLR 2024 Tiny Papers.

  • Towards Better Statistical Understanding of Watermarking LLMs. Preprint.

  • Topic-based Watermarks for LLM-Generated Text. Preprint.

  • A Statistical Framework of Watermarks for Large Language Models: Pivot, Detection Efficiency and Optimal Rules. Preprint.

  • WaterJudge: Quality-Detection Trade-off when Watermarking Large Language Models. Preprint.

  • Duwak: Dual Watermarks in Large Language Models. Preprint.

  • Lost in Overlap: Exploring Watermark Collision in LLMs. Preprint.

  • WaterMax: breaking the LLM watermark detectability-robustness-quality trade-off. Preprint.

  • WARDEN: Multi-Directional Backdoor Watermarks for Embedding-as-a-Service Copyright Protection. Preprint.

  • EmMark: Robust Watermarks for IP Protection of Embedded Quantized Large Language Models. Preprint.

  • Token-Specific Watermarking with Enhanced Detectability and Semantic Coherence for Large Language Models. Preprint.

  • Attacking LLM Watermarks by Exploiting Their Strengths. Preprint.

  • Multi-Bit Distortion-Free Watermarking for Large Language Models. Preprint.

  • Watermarking Makes Language Models Radioactive. Preprint.

  • Can Watermarks Survive Translation? On the Cross-lingual Consistency of Text Watermark for Large Language Models. Preprint.

  • GumbelSoft: Diversified Language Model Watermarking via the GumbelMax-trick. Preprint.

  • k-SemStamp: A Clustering-Based Semantic Watermark for Detection of Machine-Generated Text. Preprint.

  • Proving membership in LLM pretraining data via data watermarks. Preprint.

  • Permute-and-Flip: An optimally robust and watermarkable decoder for LLMs. Preprint.

  • Provably Robust Multi-bit Watermarking for AI-generated Text via Error Correction Code. Preprint.

  • Instructional Fingerprinting of Large Language Models. Preprint.

  • Adaptive Text Watermark for Large Language Models. Preprint.

  • Excuse me, sir? Your language model is leaking (information). Preprint.

  • Cross-Attention Watermarking of Large Language Models. ICASSP 2024.

  • Optimizing watermarks for large language models. Preprint.

  • Towards Optimal Statistical Watermarking. Preprint.

  • A Survey of Text Watermarking in the Era of Large Language Models. Preprint. Survey paper.

  • On the Learnability of Watermarks for Language Models. Preprint.

  • New Evaluation Metrics Capture Quality Degradation due to LLM Watermarking. Preprint.

  • Mark My Words: Analyzing and Evaluating Language Model Watermarks. Preprint.

  • I Know You Did Not Write That! A Sampling Based Watermarking Method for Identifying Machine Generated Text. Preprint.

  • Improving the Generation Quality of Watermarked Large Language Models via Word Importance Scoring. Preprint.

  • Performance Trade-offs of Watermarking Large Language Models. Preprint.

  • X-Mark: Towards Lossless Watermarking Through Lexical Redundancy. Preprint.

  • WaterBench: Towards Holistic Evaluation of Watermarks for Large Language Models. ACL 2024.

  • Watermarks in the Sand: Impossibility of Strong Watermarking for Generative Models. Preprint.

  • REMARK-LLM: A Robust and Efficient Watermarking Framework for Generative Large Language Models. Preprint.

  • Embarrassingly Simple Text Watermarks. Preprint.

  • Necessary and Sufficient Watermark for Large Language Models. Preprint.

  • Functional Invariants to Watermark Large Transformers. Preprint.

  • Watermarking LLMs with Weight Quantization. Findings of EMNLP 2023.

  • DiPmark: A Stealthy, Efficient and Resilient Watermark for Large Language Models. Preprint.

  • A Semantic Invariant Robust Watermark for Large Language Models. Preprint.

  • SemStamp: A Semantic Watermark with Paraphrastic Robustness for Text Generation. Preprint.

    • Abe Bohan Hou, Jingyu Zhang, Tianxing He, Yichen Wang, Yung-Sung Chuang, Hongwei Wang, Lingfeng Shen, Benjamin Van Durme, Daniel Khashabi, Yulia Tsvetkov
    • https://arxiv.org/abs/2310.03991
  • Advancing Beyond Identification: Multi-bit Watermark for Language Models. Preprint.

  • Three Bricks to Consolidate Watermarks for Large Language Models. Preprint.

  • Towards Codable Text Watermarking for Large Language Models. Preprint.

  • A Private Watermark for Large Language Models. Preprint.

  • Robust Distortion-free Watermarks for Language Models. Preprint.

  • Watermarking Conditional Text Generation for AI Detection: Unveiling Challenges and a Semantic-Aware Watermark Remedy. Preprint.

  • Provable Robust Watermarking for AI-Generated Text. Preprint.

  • On the Reliability of Watermarks for Large Language Models. Preprint.

    • John Kirchenbauer, Jonas Geiping, Yuxin Wen, Manli Shu, Khalid Saifullah, Kezhi Kong, Kasun Fernando, Aniruddha Saha, Micah Goldblum, Tom Goldstein.
    • https://arxiv.org/abs/2306.04634
  • Undetectable Watermarks for Language Models. Preprint.

  • Watermarking Text Data on Large Language Models for Dataset Copyright Protection. Preprint.

  • Baselines for Identifying Watermarked Large Language Models. Preprint.

  • Who Wrote this Code? Watermarking for Code Generation. Preprint.

  • Robust Multi-bit Natural Language Watermarking through Invariant Features. ACL 2023.

  • Are You Copying My Model? Protecting the Copyright of Large Language Models for EaaS via Backdoor Watermark. ACL 2023.

  • Watermarking Text Generated by Black-Box Language Models. Preprint.

  • Protecting Language Generation Models via Invisible Watermarking. ICML 2023.

  • A Watermark for Large Language Models. ICML 2023. Outstanding Paper Award.

  • Distillation-Resistant Watermarking for Model Protection in NLP. EMNLP 2022.

  • CATER: Intellectual Property Protection on Text Generation APIs via Conditional Watermarks. NeurIPS 2022.

  • Adversarial Watermarking Transformer: Towards Tracing Text Provenance with Data Hiding. IEEE S&P 2021.

  • Watermarking GPT Outputs. Slides, 2023.

  • Watermarking the Outputs of Structured Prediction with an Application in Statistical Machine Translation. EMNLP 2011.

Image watermark

  • Flexible and Secure Watermarking for Latent Diffusion Model. ACM MM 2023.
  • Leveraging Optimization for Adaptive Attacks on Image Watermarks. Preprint.
  • Catch You Everything Everywhere: Guarding Textual Inversion via Concept Watermarking. Preprint.
  • Hey That's Mine Imperceptible Watermarks are Preserved in Diffusion Generated Outputs. Preprint.
  • Generative Watermarking Against Unauthorized Subject-Driven Image Synthesis. Preprint.
  • Invisible Image Watermarks Are Provably Removable Using Generative AI. Preprint.
    • Xuandong Zhao, Kexun Zhang, Zihao Su, Saastha Vasan, Ilya Grishchenko, Christopher Kruegel, Giovanni Vigna, Yu-Xiang Wang, Lei Li.
    • https://arxiv.org/abs/2306.01953
  • Tree-Ring Watermarks: Fingerprints for Diffusion Images that are Invisible and Robust. Preprint.
  • Evading Watermark based Detection of AI-Generated Content. CCS 2023.
  • The Stable Signature: Rooting Watermarks in Latent Diffusion Models. ICCV 2023.
  • Watermarking Images in Self-Supervised Latent Spaces. ICASSP 2022.

Contributing to this paper list

First, decide which category the work belongs to.

Second, describe the work in the same format as the existing entries.