-
Notifications
You must be signed in to change notification settings - Fork 5
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Merge pull request #160 from ku-nlp/dev
v2.0.0
- Loading branch information
Showing
265 changed files
with
12,289 additions
and
29,838 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,5 +1,7 @@ | ||
[report] | ||
exclude_lines = | ||
pragma: no cover | ||
|
||
# Do not complain about missing debug-only code: | ||
def __repr__ | ||
|
||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,29 +1,40 @@ | ||
default_language_version: | ||
python: python3.10 | ||
repos: | ||
- repo: https://github.com/pre-commit/pre-commit-hooks | ||
rev: v4.3.0 | ||
rev: v4.4.0 | ||
hooks: | ||
- id: end-of-file-fixer | ||
- id: trailing-whitespace | ||
- id: check-yaml | ||
- repo: https://github.com/psf/black | ||
rev: 22.10.0 | ||
rev: 23.1.0 | ||
hooks: | ||
- id: black | ||
- repo: https://github.com/PyCQA/flake8 | ||
rev: 5.0.4 | ||
rev: 6.0.0 | ||
hooks: | ||
- id: flake8 | ||
- repo: https://github.com/PyCQA/isort | ||
rev: 5.12.0 | ||
hooks: | ||
- id: isort | ||
- repo: https://github.com/pre-commit/mirrors-mypy | ||
rev: v0.982 | ||
rev: v1.1.1 | ||
hooks: | ||
- id: mypy | ||
additional_dependencies: | ||
- rhoknp==1.0.2 | ||
- hydra-core==1.3.1 | ||
- torch==1.12.1 | ||
- transformers==4.23.1 | ||
- rhoknp==1.2.1 | ||
- hydra-core==1.3.2 | ||
- torch==1.13.1 | ||
- torchmetrics==0.11.4 | ||
- transformers==4.25.1 | ||
- tokenizers==0.13.2 | ||
- wandb==0.13.11 | ||
- repo: https://github.com/jumanjihouse/pre-commit-hooks | ||
rev: 3.0.0 | ||
hooks: | ||
- id: shellcheck | ||
files: ^scripts/ | ||
types: [ shell ] | ||
args: [ --exclude=SC2002 ] |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,4 +1,3 @@ | ||
prediction_writer: | ||
_target_: kwja.callbacks.char_module_writer.CharModuleWriter | ||
output_dir: ${run_dir} | ||
pred_filename: "predict" | ||
destination: ${run_dir}/char_prediction.juman |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,4 @@ | ||
lr_monitor: | ||
_target_: pytorch_lightning.callbacks.LearningRateMonitor | ||
logging_interval: null # "epoch", "step", or "null" | ||
log_momentum: false |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,3 @@ | ||
prediction_writer: | ||
_target_: kwja.callbacks.senter_module_writer.SenterModuleWriter | ||
destination: ${run_dir}/senter_prediction.txt |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,8 @@ | ||
prediction_writer: | ||
_target_: kwja.callbacks.seq2seq_module_writer.Seq2SeqModuleWriter | ||
destination: ${run_dir}/seq2seq_prediction.txt | ||
tokenizer: | ||
_target_: transformers.AutoTokenizer.from_pretrained | ||
pretrained_model_name_or_path: ${encoder.pretrained_model_name_or_path} | ||
additional_special_tokens: ${special_tokens} | ||
_convert_: all |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,8 +1,10 @@ | ||
prediction_writer: | ||
_target_: kwja.callbacks.typo_module_writer.TypoModuleWriter | ||
output_dir: ${run_dir} | ||
extended_vocab_path: ${dataset.extended_vocab_path} | ||
confidence_threshold: ${confidence_threshold} | ||
pred_filename: "predict" | ||
model_name_or_path: ${encoder.pretrained_model_name_or_path} | ||
tokenizer_kwargs: ${dataset.tokenizer_kwargs} | ||
destination: ${run_dir}/typo_prediction.txt | ||
tokenizer: | ||
_target_: transformers.AutoTokenizer.from_pretrained | ||
pretrained_model_name_or_path: ${encoder.pretrained_model_name_or_path} | ||
do_word_tokenize: false | ||
additional_special_tokens: ${special_tokens} | ||
_convert_: all |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,13 +1,10 @@ | ||
prediction_writer: | ||
_target_: kwja.callbacks.word_module_writer.WordModuleWriter | ||
output_dir: ${run_dir} | ||
reading_resource_path: ${dataset.reading_resource_path} | ||
pred_filename: "predict" | ||
jumandic_path: kwja/resource/jumandic | ||
ambig_surf_specs: | ||
- conjtype: "イ形容詞アウオ段" | ||
conjform: "エ基本形" | ||
- conjtype: "イ形容詞イ段" | ||
conjform: "エ基本形" | ||
- conjtype: "イ形容詞イ段特殊" | ||
conjform: "エ基本形" | ||
destination: ${run_dir}/word_prediction.knp |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.