Plenary debates of the Parliament of Finland as linked open data and in Parla-CLARIN markup

Laura Sinikallio, Senka Drobac, Minna Tamper, Rafael Leal, Mikko Koho, Jouni Tuominen, Matti La Mela, Eero Hyvönen

Research output: Chapter in Book/Report/Conference proceeding, Conference article in proceedings, Scientific, peer-review

Abstract

This paper presents a knowledge graph created by transforming the plenary debates of the Parliament of Finland (1907-) into Linked Open Data (LOD). The data, totaling over 900 000 speeches with automatically created semantic annotations and rich ontology-based metadata, are published in a Linked Open Data Service and are used via a SPARQL API and as data dumps. The speech data are part of the larger LOD publication FinnParla, which also includes prosopographical data about the politicians. The data are being used for studying parliamentary language and culture in Digital Humanities at several universities. To serve a wider variety of users, the entire dataset was also produced in Parla-CLARIN markup. We present the first publication of all Finnish parliamentary debates as data. Technical novelties in our approach include the use of both Parla-CLARIN and an RDF schema developed for representing the speeches, integration of the data with a new Parliament of Finland Ontology for deeper data analyses, and enrichment of the data with a variety of external national and international data sources.

Original language: English
Title of host publication: 3rd Conference on Language, Data and Knowledge, LDK 2021
Editors: Dagmar Gromann, Gilles Serasset, Thierry Declerck, John P. McCrae, Jorge Gracia, Julia Bosque-Gil, Fernando Bobillo, Barbara Heinisch
Publisher: Schloss Dagstuhl - Leibniz-Zentrum für Informatik
Number of pages: 17
ISBN (Electronic): 9783959771993
Publication status: Published - 1 Aug 2021
MoE publication type: A4 Conference publication
Event: International Conference on Language, Data, and Knowledge - Zaragoza, Spain
Duration: 1 Sept 2021 to 3 Sept 2021
Conference number: 3

Publication series

Name: OpenAccess Series in Informatics
Publisher: Dagstuhl Publishing
Volume: 93
ISSN (Print): 2190-6807

Conference

Conference: International Conference on Language, Data, and Knowledge
Abbreviated title: LDK
Country/Territory: Spain
City: Zaragoza
Period: 01/09/2021 to 03/09/2021

Keywords

  • Digital humanities
  • Linked open data
  • Parla-CLARIN
  • Parliamentary data
  • Plenary debates
