Abstract: This collection contains the OCRed content of the Aarhus Stiftstidende newspaper from 1938-1945 in .txt format. The dataset differs from the original publication: the images have been removed and the text collated into monthly documents to facilitate text analysis and distant reading in the Introduction to Archives and Digital Methods course at Aarhus University in 2021. The teaching data are available in two versions, with less and more reduced file size: stiften_monthly_raw, where each file represents a month of newspaper content within the 1938-19...