Skip to content

Commit

Permalink
update meta files
Browse files Browse the repository at this point in the history
  • Loading branch information
nobu-g committed Mar 13, 2023
1 parent 3212795 commit 5dcd072
Show file tree
Hide file tree
Showing 3 changed files with 32 additions and 4 deletions.
17 changes: 16 additions & 1 deletion CHANGELOG.md
Original file line number Diff line number Diff line change
Expand Up @@ -7,6 +7,19 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0

## [Unreleased]

## [2.0.0] - 2023-03-14

### Added
- Introduce the seq2seq module for more accurate reading prediction and canonicalization.
- Introduce learning-based sentence splitter.

### Changed
- Replace RoBERTa-based models with DeBERTaV2-based models.
- Support CUDA 11.7 by default instead of CUDA 10.2.

### Fixed
- Fix many minor bugs.

## [1.4.2] - 2023-02-22

### Fixed
Expand Down Expand Up @@ -106,7 +119,9 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0
### Removed
- Remove an unnecessary dependency, `fugashi`.

[Unreleased]: https://github.com/ku-nlp/kwja/compare/v1.4.1...HEAD
[Unreleased]: https://github.com/ku-nlp/kwja/compare/v2.0.0...HEAD
[2.0.0]: https://github.com/ku-nlp/kwja/compare/v1.4.2...v2.0.0
[1.4.2]: https://github.com/ku-nlp/kwja/compare/v1.4.1...v1.4.2
[1.4.1]: https://github.com/ku-nlp/kwja/compare/v1.4.0...v1.4.1
[1.4.0]: https://github.com/ku-nlp/kwja/compare/v1.3.0...v1.4.0
[1.3.0]: https://github.com/ku-nlp/kwja/compare/v1.2.2...v1.3.0
Expand Down
2 changes: 1 addition & 1 deletion CITATION.cff
Original file line number Diff line number Diff line change
Expand Up @@ -16,6 +16,6 @@ authors:
given-names: Daisuke
- family-names: Kurohashi
given-names: Sadao
version: 1.0.0
version: 2.0.0
repository-code: "https://github.com/ku-nlp/kwja"
date-released: 2022-09-28
17 changes: 15 additions & 2 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -26,7 +26,7 @@ KWJA performs many language analysis tasks, including:

## Requirements

- Python: 3.8+
- Python: 3.8, 3.9, 3.10
- Dependencies: See [pyproject.toml](./pyproject.toml).

## Getting Started
Expand Down Expand Up @@ -78,13 +78,16 @@ Here are some other options for `kwja` command:

`--typo-batch-size`: Batch size for typo module.

`--seq2seq-batch-size`: Batch size for seq2seq module.

`--char-batch-size`: Batch size for char module.

`--word-batch-size`: Batch size for word module.

`--tasks`: Tasks to be performed. Please specify 'typo', 'char', 'typo,char', 'char,word', 'typo,char,word', 'char,word,word_discourse' or 'typo,char,word,word_discourse'.
- `typo`: Typo correction
- `char`: Tokenization and Word normalization
- `seq2seq`: Word segmentation, Word normalization, Reading prediction, lemmatization, and Canonicalization.
- `char`: Word segmentation and Word normalization
- `word`: Morphological analysis, Named entity recognition, Word feature tagging, Dependency parsing, PAS analysis, Bridging reference resolution, and Coreference resolution
- `word_discourse`: Discourse relation analysis
- If you need the results of discourse relation analysis, please specify this in addition to `word`.
Expand Down Expand Up @@ -136,6 +139,16 @@ analyzed_document = kwja.apply(
}
```

```bibtex
@InProceedings{児玉2023,
author = {児玉 貴志 and 植田 暢大 and 大村 和正 and 清丸 寛一 and 村脇 有吾 and 河原 大輔 and 黒橋 禎夫},
title = {テキスト生成モデルによる日本語形態素解析},
booktitle = {言語処理学会 第29回年次大会},
year = {2023},
address = {沖縄},
}
```

## Reference

- [KNP format](http://cr.fvcrc.i.nagoya-u.ac.jp/~sasano/knp/format.html)

0 comments on commit 5dcd072

Please sign in to comment.