Plenary Speeches of the Parliament of Finland as Linked Open Data and Data Services

Research output: Contribution to journalConference articleScientificpeer-review

11 Downloads (Pure)

Abstract

This paper presents a new open infrastructure called ParliamentSampo for studying the parliamentary culture, language, and activities of politicians in Finland. For the first time, the entire time series of some million plenary speeches of the Parliament of Finland (PoF) since 1907 have been converted into data and data services in unified formats, including CSV, Parla-CLARIN, ParlaMint, and RDF Linked Open Data (LOD). The speech data have been interlinked with an ontology and a knowledge graph about the activities of the Members of Parliament (MP) and other speakers in the plenary sessions of the PoF, enriched by data linking from external data sources into a broader ontology-based LOD service. Knowledge extraction techniques based on Natural Language Processing (NLP) were used for automatic semantic annotations and topical classification of the speeches. The data and data services have been used in Digital Humanities (DH) research projects and for application development, especially for developing the in-use semantic portal ParliamentSampo. The infrastructure was published on February 14th 2023 on the Web using the open CC BY 4.0 license, and quickly gathered thousands of users.

Original languageEnglish
Article number1
Pages (from-to)1-20
Number of pages20
JournalCEUR Workshop Proceedings
Volume3447
Publication statusPublished - 2023
MoE publication typeA4 Conference publication
EventInternational Workshop on Knowledge Graph Generation from Text - Hersonissos, Greece
Duration: 29 May 202329 May 2023
Conference number: 2

Keywords

  • digital humanities
  • linked data
  • parliamentary studies
  • semantic portals

Fingerprint

Dive into the research topics of 'Plenary Speeches of the Parliament of Finland as Linked Open Data and Data Services'. Together they form a unique fingerprint.

Cite this