-
Add metrics
- ROUGE?
- BLEU?
- perplexity
- other
- plot it and save graphs
- word count
- number of old sentence in generated outputs
- how to use it for random strategies?
- find the closest sentence in the dataset for the given output, count all cases
- k-nn for sentence from the output
- use classifier to filter CJ phrases
- plot it, and save to file
- add Leveinstein distance from nltk
- word cloud
-
top_k in manual inference can be replaced with any other method for diagnostics
-
Augmentation
-
Data cleaning ?
-
Different architectures
-
Which texts datasets have structure to check generation correctness?
- Maybe rhymes, code, etc
-
Dialogue generation, how to implement?
- Style Transfer for making CJ converstional bot: https://medium.com/nlplanet/two-minutes-nlp-quick-intro-to-text-style-transfer-61de9cbd4083
-
Emotions classificaton for gta dialogue, key words in sentence
-
Try different techniques for generation
-
Visualize attention to show important words
-
add argparse
Do I look like I suck, huh?!
This should help keep my belly full.
I'm a street criminal, am I?!
I'm just a fat bitch, huh?!
Oh, do you want a hole in your head, huh?
You got a problem with big men like you!
I can tell you what, one-time, shut your fucking mouth!
Aw, I ain't running!
You got a problem with my car homie!
It'sma put you to sleep, punk!
Aw, I still can live, playa.
You ain't just some bitch you can slap about!
This gonna get real nasty?
You gonna look like you don't need this no more.
You can't get shut up, po-po.
I'm fat and useless, bitch!
You hit homeboy!
I can't believe you got a license, huh!?
You think this is funny?
I got a gun, bitch!
quite similar to original dataset, sometime those sentences are the exact copies