The enriched DART-Europe dataset is available here. It is organized as follows:
id-thesis \t title \t author \t year \t institution \t macro-area-first-disc \t first-disc \t confidence-classifier-first-disc \t second-disc \t confidence-classifier-second-disc \n
Important: DART-Europe has recently changed the ids of the theses on the website. To match the theses in enriched-dataset to their webpage use the fields author and title, not id. If you encounter any problem, just drop me an email.
If you use this dataset, remember to cite:
Federico Nanni, Giulia Paci. “A Discipline-Enriched Dataset for Tracking the Computational Turn of European Universities”, Proc. of WOSP 2017.