Plenary debates of the parliament of Finland as linked open data and in parla-CLARIN markup

Laura Sinikallio, Senka Drobac, Minna Tamper, Rafael Leal, Mikko Koho, Jouni Tuominen, Matti La Mela, Eero Hyvönen

Research output: Article in book/conference proceedings › Conference article in proceedings › Scientific › peer-reviewed

3 Citations (Scopus)
69 Downloads (Pure)

Abstract

This paper presents a knowledge graph created by transforming the plenary debates of the Parliament of Finland (1907-) into Linked Open Data (LOD). The data, totaling over 900 000 speeches with automatically created semantic annotations and rich ontology-based metadata, are published in a Linked Open Data service and can be used via a SPARQL API and as data dumps. The speech data are part of a larger LOD publication, FinnParla, that also includes prosopographical data about the politicians. The data are being used for studying parliamentary language and culture in Digital Humanities at several universities. To serve a wider variety of users, the entire dataset was also produced in Parla-CLARIN markup. We present the first publication of all Finnish parliamentary debates as data. Technical novelties in our approach include the use of both Parla-CLARIN and an RDF schema developed for representing the speeches, integration of the data with a new Parliament of Finland Ontology for deeper data analyses, and enrichment of the data with a variety of external national and international data sources.

Original language: English
Title of host publication: 3rd Conference on Language, Data and Knowledge, LDK 2021
Editors: Dagmar Gromann, Gilles Serasset, Thierry Declerck, John P. McCrae, Jorge Gracia, Julia Bosque-Gil, Fernando Bobillo, Barbara Heinisch
Publisher: Schloss Dagstuhl - Leibniz-Zentrum für Informatik
Number of pages: 17
ISBN (electronic): 9783959771993
DOI (permanent links):
Publication status: Published - 1 Aug 2021
Ministry of Education and Culture (OKM) publication type: A4 Article in conference proceedings
Event: International Conference on Language, Data, and Knowledge - Zaragoza, Spain
Duration: 1 Sept 2021 to 3 Sept 2021
Conference number: 3

Publication series

Name: OpenAccess Series in Informatics
Publisher: Dagstuhl Publishing
Volume: 93
ISSN (print): 2190-6807

Conference

Conference: International Conference on Language, Data, and Knowledge
Abbreviated title: LDK
Country/Territory: Spain
City: Zaragoza
Period: 01/09/2021 to 03/09/2021
