Reconciling and Using Historical Person Registers as Linked Open Data in the AcademySampo Portal and Data Service

Petri Leskinen*, Eero Hyvönen

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Abstract

This paper presents a method for extracting and reassembling a genealogical network automatically from a biographical register of historical people. The method is applied to a dataset of short textual biographies about all 28 000 Finnish and Swedish academic people educated in 1640–1899 in Finland. The aim is to connect and disambiguate the relatives mentioned in the biographies in order to build a continuous, genealogical network, which can be used in Digital Humanities for data and network analysis of historical academic people and their lives. An artificial neural network approach is presented for solving a supervised learning task to disambiguate relatives mentioned in the register descriptions using basic biographical information enhanced with an ontology of vocations and additional occasionally sparse genealogical information. Evaluation results of the record linkage are promising and provide novel insights into the problem of historical people register reconciliation. The outcome of the work has been used in practise as part of the in-use AcademySampo portal and linked open data service, a new member in the Sampo series of cultural heritage applications for Digital Humanities.

Original languageEnglish
Title of host publicationThe Semantic Web – ISWC 2021 - 20th International Semantic Web Conference, ISWC 2021, Proceedings
EditorsAndreas Hotho, Eva Blomqvist, Stefan Dietze, Achille Fokoue, Ying Ding, Payam Barnaghi, Armin Haller, Mauro Dragoni, Harith Alani
Pages714-730
Number of pages17
DOIs
Publication statusPublished - 2021
MoE publication typeA4 Article in a conference publication
EventInternational Semantic Web Conference - Virtual, Online
Duration: 24 Oct 202128 Oct 2021
Conference number: 20

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
PublisherSpringer
Volume12922 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

ConferenceInternational Semantic Web Conference
Abbreviated titleISWC
CityVirtual, Online
Period24/10/202128/10/2021

Keywords

  • Biographies
  • Data reconciling
  • Digital humanities
  • Linked data

Fingerprint

Dive into the research topics of 'Reconciling and Using Historical Person Registers as Linked Open Data in the AcademySampo Portal and Data Service'. Together they form a unique fingerprint.

Cite this