This package provides the toolkit for open intent detection implemented with PyTorch.
Open intent detection aims to identify n-class known intents, and detect the one-class open intent, which is regarded as an open classification problem. The following is an example:
We collect benchmark intent datasets, and reproduce related methods to our best. For the convenience of users, we provide flexible and extensible interfaces to add new methods. Welcome to contact us ([email protected]) to add your methods!
Datasets | Source |
---|---|
BANKING | Paper |
OOS | Paper |
StackOverflow | Paper |
Model Name | Source | Published |
---|---|---|
OpenMax* | Paper Code | CVPR 2016 |
MSP | Paper Code | ICLR 2017 |
DOC | Paper Code | EMNLP 2017 |
DeepUnk | Paper Code | ACL 2019 |
SEG | Paper Code | ACL 2020 |
ADB | Paper Code | AAAI 2021 |
(K+1)-way | Paper Code | ACL 2021 |
MDF | Paper Code | ACL 2021 |
ARPL* | Paper Code | IEEE TPAMI 2022 |
KNNCL | Paper Code | ACL 2022 |
DA-ADB | Paper Code | IEEE/ACM TASLP 2023 |
We welcome any issues and requests for model implementation and bug fix.
Each dataset is split to training, development, and testing sets. We select partial intents as known (the labeled ratio can be changed) for training, and use all intents for testing. All the unknown intents are regarded as one open class (with token <UNK> or <OOS> in our codes). More detailed information can be seen in the paper.
The basic parameters include parsing parameters about selected dataset, method, setting, etc. More details can be seen in run.py. For specific parameters of each method, we support add configuration files with different hyper-parameters in the configs directory.
An example can be seen in ADB.py. Notice that the config file name is corresponding to the parsing parameter.
Normally, the input commands are as follows:
python run.py --dataset xxx --known_cls_ratio xxx --labeled_ratio xxx --config_file_name xxx
Notice that if you want to train the model, save the model, or save the testing results, you need to add related parameters (--train, --save_model, --save_results)
The detailed results can be seen in results.md.
- KIR means "Known Intent Ratio". "Open" and "Known" denote the macro f1-score over open class and known classes respectively.
- KNNCL (All) utilizes all 12 transformer layers for fine-tuning. KNNCL (last) utilizes only the last transformer layer for fine-tuning as the other baselines.
- We also test the performance of the DA-ADB method on the backbone of llama, corresponding to the methond DA-ADB_llama.
BANKING | OOS | StackOverflow | |||||
---|---|---|---|---|---|---|---|
KIR | Methods | Accuracy | F1-score | Accuracy | F1-score | Accuracy | F1-score |
0.25 | MSP | 42.19 | 49.92 | 53.38 | 51.23 | 27.91 | 37.49 |
0.25 | SEG | 48.73 | 51.49 | 52.18 | 47.0 | 23.33 | 34.4 |
0.25 | OpenMax | 47.76 | 53.18 | 70.27 | 63.03 | 38.97 | 45.35 |
0.25 | LOF | 66.73 | 63.38 | 87.77 | 78.13 | 25.02 | 35.29 |
0.25 | DOC | 70.31 | 65.74 | 86.08 | 75.86 | 57.75 | 57.34 |
0.25 | DeepUnk | 70.68 | 65.57 | 87.18 | 77.32 | 40.03 | 45.64 |
0.25 | (K+1)-way | 76.66 | 68.44 | 85.36 | 74.43 | 49.75 | 50.82 |
0.25 | MDF | 77.17 | 46.85 | 76.56 | 50.34 | 74.1 | 53.95 |
0.25 | ARPL | 76.8 | 64.01 | 84.51 | 73.44 | 66.76 | 62.62 |
0.25 | KNNCL (Last) | 73.01 | 66.23 | 89.87 | 79.23 | 28.65 | 37.37 |
0.25 | ADB | 79.33 | 71.63 | 88.3 | 78.23 | 86.75 | 79.85 |
0.25 | KNNCL (All) | 86.14 | 77.01 | 93.07 | 82.45 | 85.04 | 79.06 |
0.25 | DA-ADB | 81.19 | 73.73 | 89.48 | 79.92 | 89.07 | 82.83 |
0.25 | DA-ADB_llama | 83.29 | 76.78 | 91.53 | 84.54 | 85.54 | 77.81 |
0.5 | MSP | 61.67 | 72.51 | 66.68 | 72.7 | 53.23 | 62.7 |
0.5 | SEG | 55.11 | 63.32 | 60.67 | 62.55 | 43.04 | 55.1 |
0.5 | OpenMax | 65.53 | 74.64 | 80.22 | 79.86 | 60.27 | 67.72 |
0.5 | LOF | 71.13 | 76.26 | 85.22 | 83.86 | 44.56 | 56.57 |
0.5 | DOC | 74.6 | 78.24 | 85.19 | 83.89 | 73.88 | 76.8 |
0.5 | DeepUnk | 71.01 | 75.41 | 84.95 | 83.35 | 55.46 | 64.78 |
0.5 | (K+1)-way | 74.65 | 77.83 | 82.19 | 81.56 | 62.57 | 68.81 |
0.5 | MDF | 60.18 | 64.1 | 60.72 | 61.61 | 56.46 | 61.47 |
0.5 | ARPL | 74.11 | 77.77 | 80.36 | 80.88 | 75.65 | 77.87 |
0.5 | KNNCL (Last) | 70.41 | 74.96 | 85.32 | 83.31 | 45.38 | 56.69 |
0.5 | ADB | 79.61 | 81.34 | 86.54 | 85.16 | 86.49 | 85.54 |
0.5 | KNNCL (All) | 82.76 | 81.31 | 88.66 | 83.99 | 86.69 | 86.15 |
0.5 | DA-ADB | 81.51 | 82.53 | 87.93 | 85.64 | 87.78 | 86.91 |
0.5 | DA-ADB_llama | 82.66 | 83.67 | 90.29 | 88.86 | 86.42 | 86.09 |
0.75 | MSP | 77.08 | 84.33 | 76.19 | 83.48 | 73.2 | 78.7 |
0.75 | SEG | 64.65 | 69.54 | 42.78 | 42.7 | 62.72 | 69.97 |
0.75 | OpenMax | 78.32 | 84.95 | 75.36 | 71.17 | 75.78 | 80.9 |
0.75 | LOF | 77.21 | 83.64 | 85.07 | 87.2 | 65.05 | 71.87 |
0.75 | DOC | 78.94 | 83.79 | 85.93 | 87.87 | 80.55 | 84.37 |
0.75 | DeepUnk | 74.73 | 81.12 | 84.61 | 86.53 | 71.56 | 77.63 |
0.75 | (K+1)-way | 79.18 | 84.71 | 83.51 | 86.66 | 74.0 | 78.95 |
0.75 | MDF | 64.59 | 74.76 | 63.98 | 72.02 | 62.98 | 71.12 |
0.75 | ARPL | 79.6 | 85.16 | 81.29 | 86.0 | 79.64 | 83.85 |
0.75 | KNNCL (Last) | 74.78 | 81.25 | 84.12 | 86.1 | 65.01 | 71.85 |
0.75 | ADB | 81.39 | 86.11 | 86.99 | 88.94 | 82.89 | 86.11 |
0.75 | KNNCL (All) | 77.50 | 82.30 | 85.07 | 85.11 | 83.15 | 86.73 |
0.75 | DA-ADB | 81.12 | 85.65 | 87.39 | 88.41 | 83.56 | 86.84 |
0.75 | DA-ADB_llama | 82.19 | 86.52 | 89.23 | 90.48 | 82.69 | 86.28 |
BANKING | OOS | StackOverflow | |||||
---|---|---|---|---|---|---|---|
KIR | Methods | Open | Known | Open | Known | Open | Known |
0.25 | MSP | 39.42 | 50.47 | 59.26 | 51.02 | 11.66 | 42.66 |
0.25 | SEG | 51.58 | 51.48 | 59.22 | 46.67 | 4.19 | 40.44 |
0.25 | OpenMax | 48.52 | 53.42 | 77.51 | 62.65 | 34.52 | 47.51 |
0.25 | LOF | 72.64 | 62.89 | 91.96 | 77.77 | 7.14 | 40.92 |
0.25 | DOC | 76.64 | 65.16 | 90.78 | 75.46 | 62.5 | 56.3 |
0.25 | DeepUnk | 76.98 | 64.97 | 91.61 | 76.95 | 36.87 | 47.39 |
0.25 | (K+1)-way | 82.66 | 67.7 | 90.27 | 74.02 | 52.23 | 50.54 |
0.25 | MDF | 85.7 | 44.8 | 84.89 | 49.43 | 83.03 | 48.13 |
0.25 | ARPL | 83.39 | 62.99 | 89.63 | 73.01 | 72.95 | 60.55 |
0.25 | KNNCL (Last) | 79.34 | 65.54 | 93.56 | 78.85 | 15.26 | 41.79 |
0.25 | ADB | 85.05 | 70.92 | 92.36 | 77.85 | 90.96 | 77.62 |
0.25 | KNNCL (All) | 90.55 | 76.30 | 95.73 | 82.10 | 89.59 | 76.96 |
0.25 | DA-ADB | 86.57 | 73.05 | 93.2 | 79.57 | 92.65 | 80.87 |
0.25 | DA-ADB_llama | 88.19 | 76.18 | 94.55 | 84.27 | 90.04 | 75.36 |
0.5 | MSP | 46.29 | 73.2 | 63.71 | 72.82 | 26.94 | 66.28 |
0.5 | SEG | 43.03 | 63.85 | 61.34 | 62.57 | 4.72 | 60.14 |
0.5 | OpenMax | 55.03 | 75.16 | 82.15 | 79.83 | 46.11 | 69.88 |
0.5 | LOF | 66.81 | 76.51 | 87.57 | 83.81 | 5.18 | 61.71 |
0.5 | DOC | 72.66 | 78.38 | 87.45 | 83.84 | 71.18 | 77.37 |
0.5 | DeepUnk | 67.8 | 75.61 | 87.48 | 83.3 | 35.8 | 67.67 |
0.5 | (K+1)-way | 72.58 | 77.97 | 84.25 | 81.52 | 51.69 | 70.53 |
0.5 | MDF | 57.72 | 64.27 | 62.31 | 61.6 | 50.19 | 62.6 |
0.5 | ARPL | 71.79 | 77.93 | 81.81 | 80.87 | 73.97 | 78.26 |
0.5 | KNNCL (Last) | 67.21 | 75.16 | 87.85 | 83.25 | 8.5 | 61.5 |
0.5 | ADB | 79.43 | 81.39 | 88.6 | 85.12 | 87.7 | 85.32 |
0.5 | KNNCL (All) | 84.28 | 81.23 | 91.17 | 83.89 | 87.59 | 86.01 |
0.5 | DA-ADB | 81.93 | 82.54 | 90.1 | 85.58 | 88.86 | 86.71 |
0.5 | DA-ADB_llama | 83.23 | 83.68 | 92.04 | 88.82 | 87.43 | 85.95 |
0.75 | MSP | 46.05 | 84.99 | 63.86 | 83.65 | 37.86 | 81.42 |
0.75 | SEG | 37.22 | 70.1 | 40.74 | 42.72 | 6.0 | 74.24 |
0.75 | OpenMax | 53.02 | 85.5 | 75.18 | 71.14 | 49.69 | 82.98 |
0.75 | LOF | 54.19 | 84.15 | 82.81 | 87.24 | 5.22 | 76.31 |
0.75 | DOC | 63.51 | 84.14 | 83.87 | 87.91 | 65.32 | 85.64 |
0.75 | DeepUnk | 50.57 | 81.65 | 82.67 | 86.57 | 34.38 | 80.51 |
0.75 | (K+1)-way | 59.89 | 85.14 | 79.59 | 86.72 | 45.22 | 81.2 |
0.75 | MDF | 33.43 | 75.47 | 51.33 | 72.21 | 28.52 | 73.96 |
0.75 | ARPL | 61.26 | 85.58 | 74.67 | 86.1 | 62.99 | 85.24 |
0.75 | KNNCL (Last) | 51.42 | 81.76 | 82.05 | 86.14 | 7.19 | 76.16 |
0.75 | ADB | 67.34 | 86.44 | 84.85 | 88.97 | 74.1 | 86.91 |
0.75 | KNNCL (All) | 67.01 | 82.56 | 84.31 | 85.11 | 72.81 | 87.66 |
0.75 | DA-ADB | 69.37 | 85.93 | 86.0 | 88.43 | 74.55 | 87.66 |
0.75 | DA-ADB_llama | 70.94 | 86.78 | 87.93 | 90.51 | 72.84 | 87.17 |
-
Prepare Data
Create a new directory to store your dataset in the data directory. You should provide the train.tsv, dev.tsv, and test.tsv, with the same formats as in the provided datasets. -
Dataloader Setting
Calculate the maximum sentence length (token unit) and count the labels of the dataset. Add them in the file as follows:
max_seq_lengths = {
'new_dataset': max_length
}
benchmark_labels = {
'new_dataset': label_list
}
-
Add a new backbone in the backbones directory. For example, we provide some bert-based backbones in the file. And we also provide llama-based backbones in the file. You can add a new backbone in the same way.
-
Add the new backbone mapping in the file as follows:
from .bert import new_backbone_class # from .llama import new_backbone_class
backbones_map = {
'new_backbone': new_backbone_class
}
Add a new loss in the losses directory is almost the same as adding a new backbone.
-
Configuration Setting
Create a new file, named "method_name.py" in the configs directory, and set the hyper-parameters for the method (an example can be seen in MSP.py). -
Dataloader Setting
Add the dataloader mapping if you use new backbone for the method. For example, the bert-based model corresponds to the bert dataloader as follows.
from .bert_loader import BERT_Loader
backbone_loader_map = {
'bert': BERT_Loader,
'bert_xxx': BERT_Loader,
}
The llama-based model corresponds to the llama dataloader as follows.
from .llama_loader import LLAMA_Loader
backbone_loader_map = {
'llama_disaware': LLAMA_Loader,
}
- Add Methods (Take MSP as an example)
-
Create a new directory, named "MSP" in the methods directory.
-
Add the manager file for MSP. The file should include the method manager class (e.g., MSPManager), which includes training, evalutation, and testing modules for the method. An example can be seen in manager.py.
-
Add the related method dependency in init.py as below:
from .MSP.manager import xxxManager
method_map = {
'MSP': MSPManager
}
(The key corresponds to the input parameter "method")
- Run Examples Add a script in the examples directory, and configure the parsing parameters in the run.py. You can also run the programs serially by setting the combination of different parameters. A running example is shown in run_MSP.sh.
If this work is helpful, or you want to use the codes and results in this repo, please cite the following papers:
- TEXTOIR: An Integrated and Visualized Platform for Text Open Intent Recognition
- Learning Discriminative Representations and Decision Boundaries for Open Intent Detection
@inproceedings{zhang-etal-2021-textoir,
title = "{TEXTOIR}: An Integrated and Visualized Platform for Text Open Intent Recognition",
author = "Zhang, Hanlei and Li, Xiaoteng and Xu, Hua and Zhang, Panpan and Zhao, Kang and Gao, Kai",
booktitle = "Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing: System Demonstrations",
pages = "167--174",
year = "2021",
url = "https://aclanthology.org/2021.acl-demo.20",
doi = "10.18653/v1/2021.acl-demo.20",
}
@article{DA-ADB,
title = {Learning Discriminative Representations and Decision Boundaries for Open Intent Detection},
author = {Zhang, Hanlei and Xu, Hua and Zhao, Shaojie and Zhou, Qianrui},
journal = {IEEE/ACM Transactions on Audio, Speech, and Language Processing},
volume = {31},
pages = {1611-1623},
year = {2023},
doi = {10.1109/TASLP.2023.3265203}
}