Release 0.4.0
🤩 Highlights
Renate 0.4.0 adds multi-gpu training via deepspeed, data shift detectors, L2P as a new updater, and a couple of new datasets for benchmarking (WildTimeData, CLEAR, DomainNet, 4TextDataset).
🌟 New Features
- MultiGPU training with deepspeed by @prabhuteja12 in #218
- Renate NLP Models and Benchmarking Support for Hugging Face by @wistuba in #213 #233
- Covariate Shift Detectors by @lballes: MMD (#237), KS (#242)
- New Updater: Learning to Prompt (L2P) by @prabhuteja12 in #367
- Upload custom files and folders with a SageMaker training Job by @wistuba in #286
- Custom Optimizer and LR schedulers by @wistuba in #290
- Flag to remove intermediate tasks' states by @prabhuteja12 in #289
- Make number of epochs "finetuning-equivalent" by @lballes in #344
- Add Micro Average Accuracy by @wistuba in #323
- Experimentation Tools by @wistuba in #356
🛢 Datasets
- Added 4 Wild Time Datasets by @wistuba in #187
- Enable CLEAR Datasets for Benchmarking by @prabhuteja12 in #287
- Add DomainNet Benchmark by @wistuba in #357
- Add benchmark made of multiple text datasets by @610v4nn1 in #354
- MultiText dataset Added to Benchmarking by @wistuba in #366
📜 Documentation Updates
- Add doc page and example for shift detection by @lballes in #244
- Add example of using renate in your own script by @lballes in #274
- Describe Installation of Dependencies for Benchmarking by @wistuba in #313
- Improve title for the NLP example by @610v4nn1 in #416
🐛 Bug Fixes
- Fix Offline-ER bug and change loss functions by @wistuba in #273
- Missing Argument Doesn't Allow for Remote Experiments by @wistuba in #304
- Fix Small Bug in Benchmarking Script and Add LR Scheduler to Experiment Config by @wistuba in #305
- Enable Downloading Large Files by @wistuba in #337
- Fix Scenario for CLEAR by @wistuba in #339
- Fix CLS-ER Loss by @wistuba in #347
- Fix weighting in OfflineER by @lballes in #355
- Fixing Bug with HPO by @wistuba in #345
- Adding a Datacollator to handle the wild time text datasets by @prabhuteja12 in #338
- Enable Offline-ER for NestedTensors by @wistuba in #336
- Refactor Offline-ER to work with
collate_fn
by @wistuba in #390 - Fixing the issue with Domainnet redownloading by @prabhuteja12 in #389
- CLEAR dataset download link update by @prabhuteja12 in #431
- Support Use of Joint and GDumb with Pre-Trained Models by @wistuba in #362
🏗️ Code Refactoring
- Remove obsolete
set_transforms
from memory buffer by @lballes in #265 - Missing dependency and problem with import by @wistuba in #272
- Using HuggingFace ViT implementation (#219) by @prabhuteja12 in #303
- Introduce
RenateLightningModule
by @wistuba in #301 - Cleanup iCarl by @wistuba in #358
- Abstracting prompting transformer for use in L2P and S-Prompt by @prabhuteja12 in #420
- Adding flags to expose gradient clipping args in Trainer by @prabhuteja12 in #361
- Wild Time Benchmarks and Small Memory Hack by @wistuba in #363
- Clean Up Learner Checkpoint and Fix Model Loading by @wistuba in #365
- Enable Custom Grouping for DataIncrementalScenario by @wistuba in #368
- Masking of logits of irrelevant classes by @prabhuteja12 in #364
- Modifies current text transformer implementation to a RenateBenchmarkingModule by @prabhuteja12 in #380
- Replace memory batch size with a fraction of the total batch size by @wistuba in #359
- Make offline ER us total batch size in first update by @lballes in #381
🔧 Maintenance
- Robust Integration Tests by @wistuba in #214
- Update Renate Config Example by @wistuba in #226
- Longer Experiments for GPUs by @wistuba in #246
- Using
num_gpus_per_trial
after SyneTune update by @prabhuteja12 in #278 - Implementing a buffer that handles dataset elements of different sizes by @prabhuteja12 in #279
- Run sagemaker tests from GitHub Actions by @wesk in #275
- Fix Security Problem with
transformers
by @wistuba in #298
Full Changelog: v0.3.1...v0.4.0