ThuCCSLab / Awesome-LM-SSP

A reading list for large models safety, security, and privacy (including Awesome LLM Security, Safety, etc.).

github.com/ThuCCSLab/Awesome-LM-SSP

Apache-2.0 license

Awesome-LM-SSP

Introduction

This repository collects resources related to the trustworthiness of large models (LMs) across multiple dimensions (e.g., safety, security, and privacy), with a special focus on multi-modal LMs (e.g., vision-language models and diffusion models).

This repo is a work in progress 🌱 (resources are collected manually).
Badges:
- Model:
- Comment: ...
- Venue: ...
🌻 Recommendations are welcome: submit resources via pull requests or by opening issues, using the following format:

Title	Link	Code	Venue	Classification	Model	Comment
aa	arxiv	github	bb'23	A1. Jailbreak	LLM	Agent
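
As a small illustration of the format above, the following Python sketch assembles one entry row in the listed column order. The helper name `format_entry` and the choice of a pipe-delimited Markdown row are assumptions made for illustration; they are not part of this repo's tooling, and the maintainers may expect a different layout.

```python
# Hypothetical helper (not part of Awesome-LM-SSP): builds one table row
# in the column order Title, Link, Code, Venue, Classification, Model, Comment.
FIELDS = ("Title", "Link", "Code", "Venue", "Classification", "Model", "Comment")


def format_entry(title, link, code, venue, classification, model, comment):
    """Return a Markdown table row (assumed format) for one recommended resource."""
    values = (title, link, code, venue, classification, model, comment)
    cleaned = []
    for name, value in zip(FIELDS, values):
        text = str(value).strip()
        # A literal pipe would split the cell and break the row.
        if "|" in text:
            raise ValueError(f"{name} must not contain '|'")
        cleaned.append(text)
    return "| " + " | ".join(cleaned) + " |"


# Example values taken from the sample row above.
row = format_entry("aa", "arxiv", "github", "bb'23", "A1. Jailbreak", "LLM", "Agent")
print(row)
```

The classification value should match one of the category labels listed under Collections (for example, `A1. Jailbreak`).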

News

[2025.01.09] 🎂 Happy 1st Birthday to Awesome-LM-SSP! Keep Going! 💪
[2024.01.09] 🚀 LM-SSP is released!

Collections

Book (3)
Competition (5)
Leaderboard (4)
Toolkit (12)
Survey (36)
Paper (1946)
- A. Safety (1005)
  - A0. General (27)
  - A1. Jailbreak (448)
  - A2. Alignment (119)
  - A3. Deepfake (80)
  - A4. Ethics (5)
  - A5. Fairness (58)
  - A6. Hallucination (113)
  - A7. Prompt Injection (77)
  - A8. Toxicity (78)
- B. Security (351)
  - B0. General (14)
  - B1. Adversarial Examples (99)
  - B2. Agent (68)
  - B3. Poison & Backdoor (147)
  - B4. System (23)
- C. Privacy (590)
  - C0. General (44)
  - C1. Contamination (15)
  - C2. Data Reconstruction (57)
  - C3. Data Reconstruction (1)
  - C4. Membership Inference Attacks (53)
  - C5. Model Extraction (13)
  - C6. Privacy-Preserving Computation (114)
  - C7. Property Inference Attacks (5)
  - C8. Side-Channel (7)
  - C9. Unlearning (62)
  - C10. Watermark & Copyright (219)

Big love to the community — thank you! 🙏

Acknowledgement

Organizers: Tianshuo Cong (丛天硕), Xinlei He (何新磊), Zhengyu Zhao (赵正宇), Yugeng Liu (刘禹更), Delong Ran (冉德龙)
This project is inspired by LLM Security, Awesome LLM Security, LLM Security & Privacy, UR2-LLMs, PLMpapers, and EvaluationPapers4ChatGPT.