add ReCaLL attack #26

austinbrown5 · 2024-09-05T02:09:17Z

In this pull request we introduce ReCaLL into the unified benchmark as defined here

Overview

ReCaLL is a novel membership inference attack (MIA) method designed to detect pretraining data in large language models (LLMs). It leverages the conditional language modeling capabilities of LLMs to identify whether a given piece of text was part of the model's training data

Files Changed

readme.md: added ReCaLL to list of attacks.
configs/recall.json: added config file to run ReCaLL on its own.
mimir/attacks/all_attacks.py: added ReCaLL to list of attacks.
mimir/attacks/recall.py: implemented ReCaLL attack.
mimir/attacks/utils.py: added ReCaLL to attacker mapping .
mimir/config.py: added support for recall_num_shots, to allow ReCaLL to be run with more than one shot.
run.py: added necessary support to allow ReCaLL to run as well as verify it is being run correctly

Implementation Details

The ReCaLL attack is implemented in mimir/attacks/recall.py, following the algorithm described in the original paper.
We've added support for multiple shots in the config, enabling varied experimentation with this parameter.
The attack can be run independently using the configs/recall.json configuration file.

Please review the changes and let me know if any modifications or additional information is needed.
Thanks so much! 🙌

iamgroot42 · 2024-09-13T18:13:24Z

mimir/config.py

@@ -174,6 +174,8 @@ class ExperimentConfig(Serializable):
    """Chunk size"""
    scoring_model_name: Optional[str] = None
    """Scoring model (if different from base model)"""
+    recall_num_shots: Optional[int] = 1


Can you create a separate class for Configuration (just like we have a separate NeighborhoodConfig for neighborhood attack) for this instead of adding it directly to the ExperimentConfig?

iamgroot42 · 2024-09-13T18:17:37Z

run.py

+    nonmember_prefix = kwargs.get("nonmember_prefix", None)
+    if AllAttacks.RECALL in attackers_dict.keys():
+        if nonmember_prefix is None:
+            raise ValueError("Must include a prefix for ReCaLL attack")


Do we want this condition? nonmember_prefix only needs to be present for the recall attack, not all attacks? If someone runs a config with multiple attacks including recall, must all attack function-calls include the nonmember-prefix?

iamgroot42 · 2024-09-13T18:18:57Z

run.py

@@ -515,6 +526,21 @@ def main(config: ExperimentConfig):
        mask_model_tokenizer=mask_model.tokenizer if mask_model else None,
    )

+    #* ReCaLL Specific
+    if AllAttacks.RECALL in config.blackbox_attacks:


If the config has multiple attacks, data_member and data_nonmember should not be modified like this for all attacks, but right now they will modify raw data before running any attack

iamgroot42 · 2024-09-13T18:21:11Z

Hey @austinbrown5 - great work, and thanks for the PR! I left some minor comments, but apart from those the code change looks good to me and I can merge the PR once you've had a look at them.

austinbrown5 · 2024-09-15T19:38:20Z

Hey @iamgroot42- thanks for reviewing our PR. We made some changes according to the comments you made. Let us know if everything looks good now. Thanks so much!

iamgroot42 · 2024-09-16T13:05:28Z

Thanks, @austinbrown5 !

austinbrown5 added 11 commits August 5, 2024 21:18

added recall class, to-do: implement

e7cbe89

typo

dc6260f

new json files for testing

84752ec

working recall along with config file

4510d96

added prefix processing

4472fab

reset new_mi.json

3cc6c9b

fix nan errors

3f19c7e

better sliding window

0249ab9

json update

793bdd7

normalized losses

475e66b

update readme

c701cc0

iamgroot42 reviewed Sep 13, 2024

View reviewed changes

made requested changes

bdb3c06

iamgroot42 approved these changes Sep 16, 2024

View reviewed changes

iamgroot42 merged commit 99b67d2 into iamgroot42:main Sep 16, 2024
1 check failed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add ReCaLL attack #26

add ReCaLL attack #26

austinbrown5 commented Sep 5, 2024

iamgroot42 Sep 13, 2024

iamgroot42 Sep 13, 2024

iamgroot42 Sep 13, 2024

iamgroot42 commented Sep 13, 2024

austinbrown5 commented Sep 15, 2024

iamgroot42 commented Sep 16, 2024

add ReCaLL attack #26

add ReCaLL attack #26

Conversation

austinbrown5 commented Sep 5, 2024

Overview

Files Changed

Implementation Details

iamgroot42 Sep 13, 2024

Choose a reason for hiding this comment

iamgroot42 Sep 13, 2024

Choose a reason for hiding this comment

iamgroot42 Sep 13, 2024

Choose a reason for hiding this comment

iamgroot42 commented Sep 13, 2024

austinbrown5 commented Sep 15, 2024

iamgroot42 commented Sep 16, 2024