Data & Analysis compendium for the Economic Sentiment analysis paper.
View the article (open access).
The main analysis code is located in the src/data-processing. Of interest might be:
- 10_apply_dictionaries.py Sentiment Dictionaries (Python)
- 11_apply_dictionaries_quanteda.R Sentiment Dictionaries (Quanteda)
- 20_svm.py Support Vector Machines
- 22_nb.py Naive Bayes
- 40_cnn_predict.py Convolutional Neural Network
The following data files might be of interest:
- metadata.csv Headline, date, and source of each article
- gold.csv Gold standard (expert) coding
- crowdcodings.csv Crowd codings
- manual_coding.csv Manual (student) coding of headlines
See the files in src/data-processing for details on how these files were constructed.