Skip to content

Commit

Permalink
Fix note for bicleaner
Browse files Browse the repository at this point in the history
Co-authored-by: Marco Castelluccio <[email protected]>
  • Loading branch information
eu9ene and marco-c authored Nov 3, 2023
1 parent 37a5e28 commit d63e50f
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion docs/training-guide.md
Original file line number Diff line number Diff line change
Expand Up @@ -137,7 +137,7 @@ and add filtering thresholds to the config.

- `0.5` should be a good default value.
- Noisier datasets like OpenSubtitles should have higher threshold.
- Set the threshold to `0` to skip cleaning entirely, for example for ParaCrawl dataset that comes already cleaned.
- Set the threshold to `0` to skip cleaning entirely, for example for ParaCrawl dataset that comes already cleaned by bicleaner.

```
bicleaner:
Expand Down

0 comments on commit d63e50f

Please sign in to comment.