This repository contains data (embeddings), plots and results for our project on the Danish canon of the Modern Breakthrough.
Some useful directions:
- the main folder contains the notebooks used for the analysis,
ML_experiment.py
is the main notebook for analysis,descriptive.py
is used to generate descriptive stats and plots. figures/
contains the figures generatedresults/
contains the results of the MLdata/
contains saved embeddings (.json) used for the analysis
The dataset used is available at huggingface
Please cite our previous paper if you use the code or the embeddings:
@inproceedings{feldkamp-etal-2024-canonical,
title = "Canonical Status and Literary Influence: A Comparative Study of {D}anish Novels from the Modern Breakthrough (1870{--}1900)",
author = "Feldkamp, Pascale and
Lassche, Alie and
Kostkan, Jan and
Kardos, M{\'a}rton and
Enevoldsen, Kenneth and
Baunvig, Katrine and
Nielbo, Kristoffer",
editor = {H{\"a}m{\"a}l{\"a}inen, Mika and
{\"O}hman, Emily and
Miyagawa, So and
Alnajjar, Khalid and
Bizzoni, Yuri},
booktitle = "Proceedings of the 4th International Conference on Natural Language Processing for Digital Humanities",
month = nov,
year = "2024",
address = "Miami, USA",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2024.nlp4dh-1.14",
pages = "140--155"
}