v4

Cognigy NLU evaluation benchmarks.

Small

640 Training Setences - 10 Sentences per Intent

1076 Test Sentences

	Cognigy	DialogFlow	Microsoft LUIS	Watson
Accuracy	0.751	0.656	0.655	0.69
F1 (macro)	0.748	0.657	0.641	0.686

1908 Training Setences - ~30 Sentences per Intent 5518 Test Sentences

	Cognigy	DialogFlow	Microsoft LUIS	Watson
Accuracy	0.846	0.761	0.788	0.81
F1 (macro)	0.827	0.758	0.776	0.804

Platform\Corpus	Chatbot	Ask Ubuntu	Web Applications	Overall
Cognigy NLU 2.0	0.97	0.91	0.92	0.93
DialogFlow	0.93	0.85	0.80	0.87
LUIS	0.98	0.90	0.81	0.91
Watson	0.97	0.92	0.83	0.92

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
v3		v3
v4		v4
README.md		README.md