We Built a Fake News & Click-bait Filter: What Happened Next Will Blow Your Mind!

Paper abstract:

It is completely amazing! Fake news and click-baits have totally invaded the cyber space. Let us face it: everybody hates them for three simple reasons. Reason №2 will absolutely amaze you. What these can achieve at the time of election will completely blow your mind! Now, we all agree, this cannot go on, you know, somebody has to stop it. So, we did this research on fake news/click-bait detection and trust us, it is totally great research, it really is! Make no mistake. This is the best research ever! Seriously, come have a look, we have it all: neural networks, attention mechanism, sentiment lexicons, author profiling, you name it. Lexical features, semantic features, we absolutely have it all. And we have totally tested it, trust us! We have results, and numbers, really big numbers. The best numbers ever! Oh, and analysis, absolutely top notch analysis. Interested? Come read the shocking truth about fake news and click-bait in the Bulgarian cyber space. You won't believe what we have found!

Authors:

Georgi Karadzhov, Pepa Gencheva, Preslav Nakov, Ivan Koychev

Please cite the following paper if you use the resources below:

@InProceedings{RANLP2017:clickbait,
  author    = {Georgi Karadzhov and Pepa Gencheva and Preslav Nakov and Ivan Koychev},
  title     = {We Built a Fake News \& Click-bait Filter: What Happened Next Will Blow Your Mind!},
  booktitle = {Proceedings of the 2017 International Conference on Recent Advances in Natural Language Processing},
  month     = {September},
  year      = {2017},
  address   = {Varna, Bulgaria},
  series    = {RANLP~'17}
}

Resources

| Name | Short description | Link |
| --- | --- | --- |
| News | Bulgarian news articles, each labeled as to whether it is factual and whether it is clickbait | Download |
| LDA | LDA topic models generated with gensim on ~100,000 Bulgarian news articles | Download |
| Word2Vec | Word2Vec model generated with gensim on ~100,000 Bulgarian news articles | Download |
| Stopwords | Dictionary of stop words | Download |
| PMI-content-clickbait | PMI scores calculated over article content with respect to the clickbait label | Download |
| PMI-content-non-factual | PMI scores calculated over article content with respect to the non-factual label | Download |
| PMI-header-clickbait | PMI scores calculated over article headers with respect to the clickbait label | Download |
| PMI-header-non-factual | PMI scores calculated over article headers with respect to the non-factual label | Download |
| Typos | List of words that are frequently mistyped in Bulgarian | Download |
| Foreign Words | List of words of foreign origin used in Bulgarian | Download |
| Frequency List | Frequency list of Bulgarian words, taken from Wikipedia | Download |
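
For readers unfamiliar with the PMI resources listed above, the following is a minimal sketch of how pointwise mutual information between a word and a label can be computed. It is an illustration only and is not taken from the paper's code: the function name `pmi_scores`, the document-level counting, the base-2 logarithm, and the toy English data are all assumptions, and the distributed PMI files may have been produced with different counting or smoothing.

```python
import math
from collections import Counter


def pmi_scores(docs, labels, target_label):
    """Return {word: PMI(word, target_label)} using document-level counts.

    docs   -- list of token lists (one list per article or header)
    labels -- list of labels, aligned with docs
    Hypothetical helper: PMI(w, c) = log2( P(w, c) / (P(w) * P(c)) ).
    """
    n_docs = len(docs)
    n_label = sum(1 for y in labels if y == target_label)

    word_df = Counter()        # number of documents containing the word
    word_label_df = Counter()  # ... that also carry the target label
    for tokens, y in zip(docs, labels):
        for word in set(tokens):  # count each word once per document
            word_df[word] += 1
            if y == target_label:
                word_label_df[word] += 1

    p_label = n_label / n_docs
    scores = {}
    # Words that never co-occur with the label are skipped (PMI is undefined).
    for word, joint in word_label_df.items():
        p_joint = joint / n_docs
        p_word = word_df[word] / n_docs
        scores[word] = math.log2(p_joint / (p_word * p_label))
    return scores


# Toy data, invented purely for illustration.
toy_docs = [
    ["shocking", "truth"],
    ["shocking", "news"],
    ["budget", "news"],
    ["budget", "vote"],
]
toy_labels = ["clickbait", "clickbait", "other", "other"]
toy_scores = pmi_scores(toy_docs, toy_labels, "clickbait")
```

In this toy example, a word that appears only in clickbait documents gets a positive score, while a word spread evenly across both classes scores zero.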