Do you have data that could be relevant to GLOBALISE?

We collect data on a range of subjects to improve our Named Entity Recognition (NER) models and to contextualize entities (such as persons, places, commodities and ships) and events (such as voyages, wars, instances of resistance) mentioned in the sources. Relevant data sets could be lists of inhabitants of a certain region, data on natural disasters and their occurrences, or on diplomatic correspondences, to mention only a few examples.

We store contributions to GLOBALISE in our Dataverse, ensuring sustainable storage of your data. Moreover, your data gains enhanced accessibility and reusability through our curation and linkage with other datasets. We securely handle all relevant data, incorporating it into the GLOBALISE corpus. Usage for NER development or historical contextualization is thoroughly documented on our GitHub.

Contributors can opt to join our pool of (historical) data experts or become a guest researcher with the project, subject to consultation.

Please do not hesitate to contact us about any questions you might have or to discuss the terms of your data deposit.

Downloads