Reconciling and Using Historical Person Registers as Linked Open Data in the AcademySampo Portal and Data Service

Petri Leskinen*, Eero Hyvönen

*Corresponding author for this work

Research output: Article in book/conference proceedings; Conference article in proceedings; Scientific; peer-reviewed

5 Citations (Scopus)
45 Downloads (Pure)

Abstract

This paper presents a method for extracting and reassembling a genealogical network automatically from a biographical register of historical people. The method is applied to a dataset of short textual biographies about all 28 000 Finnish and Swedish academic people educated in 1640–1899 in Finland. The aim is to connect and disambiguate the relatives mentioned in the biographies in order to build a continuous, genealogical network, which can be used in Digital Humanities for data and network analysis of historical academic people and their lives. An artificial neural network approach is presented for solving a supervised learning task to disambiguate relatives mentioned in the register descriptions using basic biographical information enhanced with an ontology of vocations and additional occasionally sparse genealogical information. Evaluation results of the record linkage are promising and provide novel insights into the problem of historical people register reconciliation. The outcome of the work has been used in practice as part of the in-use AcademySampo portal and linked open data service, a new member in the Sampo series of cultural heritage applications for Digital Humanities.

Original language: English
Title of host publication: The Semantic Web – ISWC 2021 - 20th International Semantic Web Conference, ISWC 2021, Proceedings
Editors: Andreas Hotho, Eva Blomqvist, Stefan Dietze, Achille Fokoue, Ying Ding, Payam Barnaghi, Armin Haller, Mauro Dragoni, Harith Alani
Publisher: Springer
Pages: 714-730
Number of pages: 17
ISBN (print): 978-3-030-88360-7
DOI - permanent links
Status: Published - 2021
MoE publication type: A4 Article in conference proceedings
Event: International Semantic Web Conference - Virtual, Online
Duration: 24 Oct 2021 to 28 Oct 2021
Conference number: 20

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Publisher: Springer
Volume: 12922 LNCS
ISSN (print): 0302-9743
ISSN (electronic): 1611-3349

Conference

Conference: International Semantic Web Conference
Abbreviated title: ISWC
City: Virtual, Online
Period: 24/10/2021 to 28/10/2021

Funding

Thanks to Yrjö Kotivuori and Veli-Matti Autio for their seminal work in creating the original databases used in our work, and for making the data openly available. Discussions with Heikki Rantala, Esko Ikkala, Mikko Koho, and Jouni Tuominen are acknowledged. This work is part of the EU project InTaVia: In/Tangible European Heritage (https://intavia.eu/), and is related to the EU COST action Nexus Linguarum (https://nexuslinguarum.eu/the-action) on linguistic data science. CSC – IT Center for Science provided computational resources for the work.
