Dataset scripts follow the standard Hugging Face format; see the official `datasets` documentation. Just place your dataset script at `testbed/data/<dataset_name>/<dataset_name>.py` and implement `postprocess_generation` to clean and extract results from the raw model output. By the way, a convenient helper in `testbed/data/common.py` can save you some work when implementing the `_split_generators` method of your new dataset.
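As an illustration, here is a minimal sketch of what such a script could look like. The builder skeleton is the standard `datasets.GeneratorBasedBuilder` API; the feature names, download URL, and the exact signature of `postprocess_generation` are placeholders, since the real repo interface is not shown here.

```python
# testbed/data/my_dataset/my_dataset.py -- a minimal sketch, not the actual repo code.
import re

import datasets


class MyDataset(datasets.GeneratorBasedBuilder):
    """Hypothetical dataset following the standard Hugging Face builder API."""

    def _info(self):
        return datasets.DatasetInfo(
            features=datasets.Features(
                {
                    "question": datasets.Value("string"),
                    "answer": datasets.Value("string"),
                }
            )
        )

    def _split_generators(self, dl_manager):
        # The helper in testbed/data/common.py can build these entries for you;
        # its exact signature may differ from this sketch.
        data_dir = dl_manager.download_and_extract("https://example.com/data.zip")
        return [
            datasets.SplitGenerator(
                name=datasets.Split.VALIDATION,
                gen_kwargs={"data_dir": data_dir},
            )
        ]

    def _generate_examples(self, data_dir):
        # Yield (key, example) pairs; real parsing depends on your data layout.
        yield 0, {"question": "placeholder?", "answer": "placeholder"}


def postprocess_generation(raw_output: str) -> str:
    """Clean and extract the answer from raw model output (illustrative only)."""
    # e.g. keep only the text after the last "Answer:" marker and collapse whitespace.
    answer = raw_output.rsplit("Answer:", 1)[-1]
    return re.sub(r"\s+", " ", answer).strip()
```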
Metrics also follow the standard Hugging Face format; see the official `evaluate` documentation. Just place your new metric at `testbed/evaluate/<metric_name>/<metric_name>.py`.
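For reference, a metric module in the Hugging Face `evaluate` style could look roughly like the following; the metric name and the exact-match scoring rule are placeholders for illustration.

```python
# testbed/evaluate/my_metric/my_metric.py -- a minimal sketch in the
# Hugging Face `evaluate` style; names and scoring logic are placeholders.
import datasets
import evaluate


class MyMetric(evaluate.Metric):
    def _info(self):
        return evaluate.MetricInfo(
            description="Hypothetical exact-match metric.",
            citation="",
            features=datasets.Features(
                {
                    "predictions": datasets.Value("string"),
                    "references": datasets.Value("string"),
                }
            ),
        )

    def _compute(self, predictions, references):
        matches = sum(p.strip() == r.strip() for p, r in zip(predictions, references))
        return {"exact_match": matches / len(predictions)}
```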
🚨 If you want to replace an official evaluation, you should add a test script under `test/` to prove that your code is consistent with the official code. A sketch of such a test follows.
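The shape of such a test could be as follows; `official_score` is purely a stand-in for whatever reference implementation you are replacing, and the local-path argument to `evaluate.load` assumes the metric layout described above.

```python
# test/test_my_metric.py -- a hedged sketch; `official_score` is a placeholder
# for the reference implementation you are replacing.
import evaluate


def official_score(predictions, references):
    # Stand-in for the official implementation; swap in the real code here.
    matches = sum(p.strip() == r.strip() for p, r in zip(predictions, references))
    return matches / len(predictions)


def test_my_metric_matches_official():
    predictions = ["a cat", "two dogs"]
    references = ["a cat", "a dog"]

    # evaluate.load also accepts a local path to a metric script directory.
    metric = evaluate.load("testbed/evaluate/my_metric")
    ours = metric.compute(predictions=predictions, references=references)

    assert ours["exact_match"] == official_score(predictions, references)
```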
You need to do the following:

- Inherit from `ModelBase`, located at `testbed/models/model_base.py`. It is just a simple wrapper around a pretrained model and its processor.
- Implement the `model_name` property to identify what the model is, and `default_prompt_template`, which is used in `apply_prompt_template` to transform raw texts and images into a model-specific prompt.
- Implement `generate`. This method is almost the same as in `transformers`, except that it applies the prompt template, runs the inputs through the `processor`, and tokenizes them. A minimal sketch follows this list.
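Putting the three steps together, a new model could look roughly like this. The `self.model` and `self.processor` attribute names and the `apply_prompt_template` call signature are assumptions made for illustration; check `testbed/models/model_base.py` for the real interface.

```python
# testbed/models/my_model.py -- a minimal sketch; ModelBase's actual
# attributes and hook names may differ, see testbed/models/model_base.py.
from testbed.models.model_base import ModelBase


class MyModel(ModelBase):
    @property
    def model_name(self):
        # Identifies which model this wrapper is for.
        return "my-model"

    @property
    def default_prompt_template(self):
        # Consumed by apply_prompt_template to build a model-specific prompt.
        return "Question: {question} Answer:"

    def generate(self, texts, images, **generate_kwargs):
        # Same spirit as transformers' generate, plus prompt templating,
        # processor preprocessing, and tokenization.
        prompts = self.apply_prompt_template(texts, images)
        inputs = self.processor(text=prompts, images=images, return_tensors="pt")
        outputs = self.model.generate(**inputs, **generate_kwargs)
        return self.processor.batch_decode(outputs, skip_special_tokens=True)
```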