This paper presents the knowledge extraction and natural language processing methods used to enrich the knowledge graph of the plenary debates (textual transcripts of speeches) of the Parliament of Finland. The knowledge graph includes some 960 000 speeches (1907–2021) interlinked with a prosopographical knowledge graph about the politicians. Named entities and topical keywords were extracted from a recent subset of the speeches to support semantic search and browsing of the data as well as data analysis. The process is based on linguistic analysis, named entity linking, and automatic subject indexing. The results were included in the ParliamentSampo knowledge graph, which is available through a SPARQL endpoint. The data can be used for studying parliamentary language and culture in Digital Humanities research and for developing applications, such as the ParliamentSampo portal.
|Number of pages||10|
|Journal||CEUR Workshop Proceedings|
|Publication status||Published - 11 Aug 2022|
|MoE publication type||A4 Article in a conference publication|
|Event||International Workshop on Knowledge Graph Generation from Text - Hersonissos, Greece|
|Event duration||30 May 2022|
|Conference number||1|
- digital humanities
- linked data
- natural language processing
- parliamentary studies
Publication title: Extracting Knowledge from Parliamentary Debates for Studying Political Culture and Language
InTaVia: In/Tangible European Heritage – Visual Analysis, Curation and Communication
Koho, M., Tuominen, J., Hyvönen, E., Kesäniemi, J., Tamper, C., Poikkimäki, H., Rantala, H. & Wahjoe, M.
01/11/2020 → 30/10/2023
Project: EU: Framework programmes funding