Study Note For Repository

Tutorial:

https://allennlp.org/tutorials
https://mlexplained.com/2019/01/30/an-in-depth-tutorial-to-allennlp-from-basics-to-elmo-and-bert/

Structure:

An AllenNLP project is based on three steps:

Load/Read data.
Train.
Test/Verify.

Load/Read Data:

Define Data Reader

The data readers are created from the following configuration:

"dataset_reader": {
  "type": "spider",
  "tables_file": dataset_path + "tables.json",
  "dataset_path": dataset_path + "database",
  "lazy": false,
  "keep_if_unparsable": false,
  "loading_limit": -1
},

"validation_dataset_reader": {
  "type": "spider",
  "tables_file": dataset_path + "tables.json",
  "dataset_path": dataset_path + "database",
  "lazy": false,
  "keep_if_unparsable": true,
  "loading_limit": -1
},

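So we know that there will be two data readers of the same type (spider): one for the training data and one for the validation data. In this configuration they differ only in keep_if_unparsable.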

The decorator @DatasetReader.register("spider") tells us that class SpiderDatasetReader(DatasetReader) is the data reader we need, so two SpiderDatasetReader objects will be created.

All configuration parameters other than type, such as tables_file and dataset_path, are passed to the constructor of SpiderDatasetReader.
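
To make the link between the "type" key and the constructor arguments concrete, here is a minimal sketch of the registry idea in plain Python. It is not AllenNLP's actual implementation (the library handles this through its own from_params machinery). The from_config helper, the constructor defaults, and the "dataset/" prefix are assumptions made only for illustration.

```python
from typing import Callable, Dict, Type


class DatasetReader:
    """Simplified stand-in for AllenNLP's DatasetReader base class."""

    # Maps a registered name (the "type" value) to a reader class.
    _registry: Dict[str, Type["DatasetReader"]] = {}

    @classmethod
    def register(cls, name: str) -> Callable:
        def decorator(subclass: Type["DatasetReader"]) -> Type["DatasetReader"]:
            DatasetReader._registry[name] = subclass
            return subclass
        return decorator

    @classmethod
    def from_config(cls, config: dict) -> "DatasetReader":
        # Hypothetical helper: "type" selects the class, and every
        # remaining key becomes a constructor keyword argument.
        params = dict(config)
        reader_type = params.pop("type")
        return DatasetReader._registry[reader_type](**params)


@DatasetReader.register("spider")
class SpiderDatasetReader(DatasetReader):
    # Parameter names follow the configuration; the defaults are assumptions.
    def __init__(self, tables_file: str, dataset_path: str, lazy: bool = False,
                 keep_if_unparsable: bool = True, loading_limit: int = -1) -> None:
        self.tables_file = tables_file
        self.dataset_path = dataset_path
        self.lazy = lazy
        self.keep_if_unparsable = keep_if_unparsable
        self.loading_limit = loading_limit


dataset_path = "dataset/"  # assumed value of the config variable

train_reader = DatasetReader.from_config({
    "type": "spider",
    "tables_file": dataset_path + "tables.json",
    "dataset_path": dataset_path + "database",
    "lazy": False,
    "keep_if_unparsable": False,
    "loading_limit": -1,
})

validation_reader = DatasetReader.from_config({
    "type": "spider",
    "tables_file": dataset_path + "tables.json",
    "dataset_path": dataset_path + "database",
    "lazy": False,
    "keep_if_unparsable": True,
    "loading_limit": -1,
})
```

Both objects are instances of the same class. Only the keep_if_unparsable argument differs between them, mirroring the two configuration blocks.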

Read Data

After the data reader is constructed, AllenNLP automatically calls def _read(self, file_path: str) to read the data. This completes the data-reading process.
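
The call pattern can be sketched as follows: a public read method on the base class delegates to the _read method that the subclass implements. This is a simplified stand-in, not the real AllenNLP or SpiderDatasetReader code. ToySpiderReader, the "question"/"query" fields, and the reading of loading_limit (a negative value means no limit) are assumptions for illustration.

```python
import json
import os
import tempfile
from typing import Dict, Iterable, List


class DatasetReader:
    """Simplified stand-in: the public read() delegates to the subclass's _read()."""

    def read(self, file_path: str) -> List[Dict[str, str]]:
        # The framework calls read(); user code only implements _read().
        return list(self._read(file_path))

    def _read(self, file_path: str) -> Iterable[Dict[str, str]]:
        raise NotImplementedError


class ToySpiderReader(DatasetReader):
    """Hypothetical reader that yields one dict per example in a JSON file."""

    def __init__(self, loading_limit: int = -1) -> None:
        # Assumption: a negative loading_limit means "no limit".
        self.loading_limit = loading_limit

    def _read(self, file_path: str) -> Iterable[Dict[str, str]]:
        with open(file_path) as data_file:
            examples = json.load(data_file)
        for count, example in enumerate(examples):
            if self.loading_limit >= 0 and count >= self.loading_limit:
                break
            yield {"question": example["question"], "query": example["query"]}


def demo() -> List[Dict[str, str]]:
    # Write two made-up examples to a temporary file, then read them back.
    examples = [
        {"question": "How many singers are there?", "query": "SELECT count(*) FROM singer"},
        {"question": "List all stadium names.", "query": "SELECT name FROM stadium"},
    ]
    with tempfile.NamedTemporaryFile("w", suffix=".json", delete=False) as tmp:
        json.dump(examples, tmp)
        path = tmp.name
    try:
        return ToySpiderReader(loading_limit=1).read(path)
    finally:
        os.remove(path)
```

Calling demo() returns a list containing only the first example, because loading_limit=1 stops the generator after one item.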
