Skip to content

Commit

Permalink
Merge pull request #5 from iamgroot42/v1
Browse files Browse the repository at this point in the history
Code Refactor (working towards v1)
  • Loading branch information
iamgroot42 authored Feb 7, 2024
2 parents ae8841c + 54085e4 commit 05e5efd
Show file tree
Hide file tree
Showing 17 changed files with 591 additions and 1,569 deletions.
7 changes: 6 additions & 1 deletion .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -22,6 +22,7 @@ local/*results.json

# Logs (from cluster runs)
logs/*
logs_cluster/*

# Vscode
.vscode/*
Expand Down Expand Up @@ -55,6 +56,10 @@ quantile_ref_model/*
table*/*
fig7/*
new_mi/*
DEBUG/*

# Ignore all tar.gz files
*.tar.gz

# Ignore resources for edited members
edit_distance_members
edit_distance_members
1 change: 0 additions & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -51,7 +51,6 @@ We include and implement the following attacks, as described in our paper.
- [Zlib Entropy](https://www.usenix.org/system/files/sec21-carlini-extracting.pdf) (`zlib`). Uses the zlib compression size of a sample to approximate local difficulty of sample.
- [Min-k% Prob](https://swj0419.github.io/detect-pretrain.github.io/) (`min_k`). Uses k% of tokens with minimum likelihood for score computation.
- [Neighborhood](https://aclanthology.org/2023.findings-acl.719/) (`ne`). Generates neighbors using auxiliary model and measures change in likelihood.
- [Quantile](https://neurips.cc/virtual/2023/poster/70232) (`quantile`). Trains meta-classifier for predicting quantile of loss.

## Adding your own attack

Expand Down
Loading

0 comments on commit 05e5efd

Please sign in to comment.