diff --git a/content/en/projects/becoming-urban-reconstructing-city-graz-long-19th-century-beurb.mdx b/content/en/projects/becoming-urban-reconstructing-city-graz-long-19th-century-beurb.mdx index 78dd534b..7395788f 100644 --- a/content/en/projects/becoming-urban-reconstructing-city-graz-long-19th-century-beurb.mdx +++ b/content/en/projects/becoming-urban-reconstructing-city-graz-long-19th-century-beurb.mdx @@ -6,7 +6,6 @@ summary: "A geographic information system will be established in order to its perception thereof during the process of urbanization. " startDate: 2019-01-01 endDate: 2021-01-01 -image: /assets/content/en/projects/becoming-urban-reconstructing-city-graz-long-19th-century-beurb/undefined attachments: [] links: - url: https://informationsmodellierung.uni-graz.at/de/forschen/weitere-laufende-projekte/ @@ -23,4 +22,4 @@ This project focuses on the development of the city of Graz in the "long 19th ce Furthermore, the GIS provides a tool to identify places and structures, which had a strong impact on the city's evolution until today. Hence, the project not only gives an insight into the historical development but also provides approaches for a better understanding of the present-day city. Apart from the scholarly use of such a system, the project will implement a web-platform which will also be open for private and touristic use, that can be accessed with mobile devices on site and will be implemented in future exhibitions of the City of Graz Museum. Thus, as a collaboration of the University of Graz, the GrazMuseum and the city archive, the project output will benefit scholarly research that reaches into society, but also further community engagement and the knowledge transfer to local schools and international tourists. -**Team:** Martina Bürgermeister (ZIM-ACDH University of Graz), Matthias Holzer (City Archive Graz), Antonia Nussmüller (GrazMuseum), Jakob Sonnberger (ZIM-ACDH University of Graz), Leif Scheuermann (ZIM-ACDH University of Graz) \ No newline at end of file +**Team:** Martina Bürgermeister (ZIM-ACDH University of Graz), Matthias Holzer (City Archive Graz), Antonia Nussmüller (GrazMuseum), Jakob Sonnberger (ZIM-ACDH University of Graz), Leif Scheuermann (ZIM-ACDH University of Graz) diff --git a/content/en/projects/computer-vision-digital-humanists.mdx b/content/en/projects/computer-vision-digital-humanists.mdx index a423ca9b..b0da0d1d 100644 --- a/content/en/projects/computer-vision-digital-humanists.mdx +++ b/content/en/projects/computer-vision-digital-humanists.mdx @@ -5,7 +5,6 @@ summary: "Computer Vision for Digital Humanities has become a very relevant background in computer science remains scarce. " startDate: 2023-01-01 endDate: 2023-01-01 -image: /assets/content/en/projects/computer-vision-digital-humanists/undefined attachments: [] links: - url: https://informationsmodellierung.uni-graz.at/de/institut/veranstaltungen/call-for-participation-computer-vision-for-digital-humanists-winter-school-8.-10.-february-2023/ @@ -20,4 +19,4 @@ hostingOrganizations: Humanities (ZIM-ACDH) --- -Computer Vision has become a very relevant skill for many Digital Humanists, but it is hard to distinguish how we can best budget our time and efforts to engage with this new technology. This workshop will provide a conceptual introduction to the processes involved, coupled with hands-on exercises that focus on the ways that we, as humanists, can curate content and manage metadata to make the best use of new tools available. \ No newline at end of file +Computer Vision has become a very relevant skill for many Digital Humanists, but it is hard to distinguish how we can best budget our time and efforts to engage with this new technology. This workshop will provide a conceptual introduction to the processes involved, coupled with hands-on exercises that focus on the ways that we, as humanists, can curate content and manage metadata to make the best use of new tools available. diff --git a/content/en/projects/developing-hubert-nlp-algorithm-extending-seshat-global-history-databank.mdx b/content/en/projects/developing-hubert-nlp-algorithm-extending-seshat-global-history-databank.mdx index 27511efd..b1b486c2 100644 --- a/content/en/projects/developing-hubert-nlp-algorithm-extending-seshat-global-history-databank.mdx +++ b/content/en/projects/developing-hubert-nlp-algorithm-extending-seshat-global-history-databank.mdx @@ -7,7 +7,6 @@ summary: This project addresses the challenges of expanding and increasing the across projects. startDate: 2023-01-01 endDate: 2023-01-01 -image: /assets/content/en/projects/developing-hubert-nlp-algorithm-extending-seshat-global-history-databank/undefined attachments: [] links: - url: https://www.csh.ac.at/complexity-science/social-complexity-and-collapse/ @@ -32,15 +31,15 @@ The main challenge for expanding and increasing the cross-utilization of histori **This project is organized in three parts:** -1. Organizing and refining labels of the repository of research articles. +1. Organizing and refining labels of the repository of research articles. The Seshat team has recompiled more than 8000 academic articles and books on different cultures and civilizations. However, there is currently no structured repository that allows for easy or automatic access. Here we will build the Seshat Literature Repository by updating and organizing the current repository so that access can be automatized. We also refine references to link to the particular paragraph(s) where the information was inferred from. This detailed referencing system allows for other researchers to access the Seshat data more easily and is useful for fine tuning NLP algorithms. - -2. Developing a Natural Language Process to aid data extraction data extraction. + +2. Developing a Natural Language Process to aid data extraction data extraction. Recent developments in NLP have allowed researchers to train language models with the whole English Wikipedia text corpus \[6\]. Here we will develop HuBERT, a BERT model fine-tuned with the text of social science and humanities research articles in the Seshat literature repository. In particular we aim for HuBERT to able to both screen research articles and pre-select those that might have information on a specific variable and then allow researchers to query Seshat about variables not yet defined there; i.e. HuBERT will do the pre-selection of articles and identify data suitable for reuse by future researchers. - -3. Maintenance and expansion of documentation and tutorials. + +3. Maintenance and expansion of documentation and tutorials. We will continue to update and document the Seshat database and the website. We will add data visualizations and documentation, and tutorials for the Seshat Literature Repository and HumanBERT. - + **References** @@ -51,4 +50,4 @@ The main challenge for expanding and increasing the cross-utilization of histori 5. Turchin P. Arise 'cliodynamics'. Nature. 2008 Jul;454(7200):34-5. 6. Devlin J, Chang MW, Lee K, Toutanova K. Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805. 2018 Oct 11. 7. Beltagy I, Lo K, Cohan A. SciBERT: A pretrained language model for scientific text. arXiv preprint arXiv:1903.10676. 2019 Mar 26. -8. Lee J, Yoon W, Kim S, Kim D, Kim S, So CH, Kang J. BioBERT: a pre-trained biomedical language representation model for biomedical text mining. Bioinformatics. 2020 Feb 15;36(4):1234-40. \ No newline at end of file +8. Lee J, Yoon W, Kim S, Kim D, Kim S, So CH, Kang J. BioBERT: a pre-trained biomedical language representation model for biomedical text mining. Bioinformatics. 2020 Feb 15;36(4):1234-40. diff --git a/content/en/projects/digital-edition-creation-pipelines-tools-and-transitions.mdx b/content/en/projects/digital-edition-creation-pipelines-tools-and-transitions.mdx index 9a5f0b74..50d4ecce 100644 --- a/content/en/projects/digital-edition-creation-pipelines-tools-and-transitions.mdx +++ b/content/en/projects/digital-edition-creation-pipelines-tools-and-transitions.mdx @@ -6,7 +6,6 @@ summary: The goal of the project is to create best practice guidelines and extension of the KONDE Weißbuch. startDate: 2023-01-01 endDate: 2023-01-01 -image: /assets/content/en/projects/digital-edition-creation-pipelines-tools-and-transitions/undefined attachments: [] links: - url: https://clariah.at/ @@ -20,4 +19,4 @@ hostingOrganizations: --- Digital editions are a key area of the Digital Humanities and allow sources to be made available to the scholarly community. Edition Projects involve the acquisition, annotation, enrichment, and publication of source material and transform them into research data. They include textual, visual, and potentially quantitative data and often require customised user interfaces to represent domain-specific research questions. Although each edition project has its own specific requirements, individual steps can be identified that are necessary across projects. In summary, these steps are: transcription and management of images and text, transformation of transcribed text into textual structures (paragraphs, headings etc.) expressed as XML/TEI, modelling of relevant text phenomena, annotation of semantic information and named entities and creation of indices, creation of FAIR data and publication including a web representation. Each step employs different methods and tools. -The goal of the project is to create best practice guidelines and tutorials for the transitions between selected tools (e.g. Transkribus, From The Page, Ediarum, LAKomp, GAMS, TEI Publisher), as well as training materials focusing on the tools themselves. These tutorials and guidelines will be realised as an extension of the [KONDE Weißbuch](https://www.digitale-edition.at/), tools and scripts will be published on GitHub. The resources will also be integrated into the respective community infrastructures (e.g. [DARIAH Campus](https://campus.dariah.eu/)). This ensures that future edition projects can easily and efficiently plan their customised workflows. Common tools can thus be evaluated and improved with respect to their requirements, as well as how they can better interact with other tools. \ No newline at end of file +The goal of the project is to create best practice guidelines and tutorials for the transitions between selected tools (e.g. Transkribus, From The Page, Ediarum, LAKomp, GAMS, TEI Publisher), as well as training materials focusing on the tools themselves. These tutorials and guidelines will be realised as an extension of the [KONDE Weißbuch](https://www.digitale-edition.at/), tools and scripts will be published on GitHub. The resources will also be integrated into the respective community infrastructures (e.g. [DARIAH Campus](https://campus.dariah.eu/)). This ensures that future edition projects can easily and efficiently plan their customised workflows. Common tools can thus be evaluated and improved with respect to their requirements, as well as how they can better interact with other tools. diff --git a/content/en/projects/hybridization-two-sides-ukrainian-russian-and-russian-ukrainian-code-mixing-southern-ukraine.mdx b/content/en/projects/hybridization-two-sides-ukrainian-russian-and-russian-ukrainian-code-mixing-southern-ukraine.mdx index b8ffddf7..1f1bc946 100644 --- a/content/en/projects/hybridization-two-sides-ukrainian-russian-and-russian-ukrainian-code-mixing-southern-ukraine.mdx +++ b/content/en/projects/hybridization-two-sides-ukrainian-russian-and-russian-ukrainian-code-mixing-southern-ukraine.mdx @@ -7,7 +7,6 @@ summary: The methodological approach of the joint FWF-DFG project consists in an data. startDate: 2019-01-01 endDate: 2023-01-01 -image: /assets/content/en/projects/hybridization-two-sides-ukrainian-russian-and-russian-ukrainian-code-mixing-southern-ukraine/undefined attachments: [] links: - url: https://pf.fwf.ac.at/de/wissenschaft-konkret/project-finder?search%5Bwhat%5D=&search%5Bpromotion_category_id%5D%5B%5D=&search%5Bpromotion_category_id%5D%5B%5D=13414&multiselect=13414&search%5Bcall%5D=&search%5Bproject_number%5D=I+4189-G30&search%5Bdecision_board_ids%5D=&search%5Bproject_title%5D=&search%5Blead_firstname%5D=Tilmann&search%5Blead_lastname%5D=Reuther&search%5Bresearch_place_kind%5D%5B%5D=&search%5Bresearch_place_kind%5D%5B%5D=Alpen-Adria-Universit%C3%A4t+Klagenfurt&multiselect=Alpen-Adria-Universit%C3%A4t+Klagenfurt&search%5Binstitute_name%5D=&search%5Bstart_date%5D=&search%5Bend_date%5D=&search%5Bgrant_years%5D%5B%5D=&search%5Bstatus_id%5D=&search%5Bscience_discipline_id%5D=&search%5Bper_page%5D=10#search-results @@ -37,4 +36,4 @@ The alternative to these hypotheses is that there are no two distinct Surzhyks b The methodological approach of the project consists in an innovative corpus linguistic description, combined with analytical methods of quantitative variationist sociolinguistics, correlated with sociodemographic data. In addition to that, in-depth interviews on individual “Sprachbiographien” (linguistic biographies) will be conducted and analysed qualitatively, in order to correlate quantitative findings with qualitative data. -_Tilmann Reuther (AAU Klagenfurt, FWF Lead) and Gerd Hentschel (University Oldenburg, DFG Partner), D-A-CH-Joint Project_ \ No newline at end of file +_Tilmann Reuther (AAU Klagenfurt, FWF Lead) and Gerd Hentschel (University Oldenburg, DFG Partner), D-A-CH-Joint Project_ diff --git a/content/en/projects/tei-converter-e-book-and-print-editions.mdx b/content/en/projects/tei-converter-e-book-and-print-editions.mdx index 7ff3b89c..01ec0352 100644 --- a/content/en/projects/tei-converter-e-book-and-print-editions.mdx +++ b/content/en/projects/tei-converter-e-book-and-print-editions.mdx @@ -3,7 +3,6 @@ title: TEI Converter for E-Book and Print Editions summary: A software for establishing print or e-book editions out of TEI files. startDate: 2019-01-01 endDate: 2021-01-01 -image: /assets/content/en/projects/tei-converter-e-book-and-print-editions/undefined attachments: [] links: - url: https://www.uibk.ac.at/projects/ahb/tei-converter/ @@ -18,4 +17,4 @@ hostingOrganizations: - Uni Innsbruck - Institut für Amerikastudien --- -The TEI Converter project is the extension of the ÖAW go!digital 2.0 project “Ambraser Heldenbuch: Transcription and Scientific Dataset” as part of the national projects CLARIN-AT (Common Language Resources and Technology Infrastructure) and DARIAH-AT (Digital Research Infrastructure for the Arts and Humanities) and is funded by the Austrian Academy of Sciences. The research project tries to develop a software for establishing print or e-book editions out of TEI files. In a first step the program will be adjusted to the requirements of an edition of a verse-based text, especially the edition of the _Ambraser Heldenbuch_ that has been transcribed and tagged before. \ No newline at end of file +The TEI Converter project is the extension of the ÖAW go!digital 2.0 project “Ambraser Heldenbuch: Transcription and Scientific Dataset” as part of the national projects CLARIN-AT (Common Language Resources and Technology Infrastructure) and DARIAH-AT (Digital Research Infrastructure for the Arts and Humanities) and is funded by the Austrian Academy of Sciences. The research project tries to develop a software for establishing print or e-book editions out of TEI files. In a first step the program will be adjusted to the requirements of an edition of a verse-based text, especially the edition of the _Ambraser Heldenbuch_ that has been transcribed and tagged before.