TY - JOUR
T1 - Using linked data for data analytic literary research: Case BookSampo—Finnish fiction literature on the semantic web
AU - Ahola, Annastiina
AU - Peura, Telma
AU - Hyvönen, Eero
N1 - Publisher Copyright:
© 2025 The Author(s). Journal of the Association for Information Science and Technology published by Wiley Periodicals LLC on behalf of Association for Information Science and Technology.
PY - 2025
Y1 - 2025
N2 - The BookSampo Linked Data portal was deployed in 2011 by the Finnish Public Libraries and has today nearly 2 million annual users. Its Linked Data covers virtually all Finnish fiction literature but the data has not been used for data analyses in Digital Humanities. This paper discusses how the Knowledge Graph can be used for literary research in two ways: First, a new BookSampo 2.0 Portal user interface is presented, based on faceted semantic search with seamlessly integrated data-analytic tools for Digital Humanities research as suggested in the Sampo Model. This application makes it possible to analyze the data without programming skills. Second, the BookSampo SPARQL endpoint API can be accessed directly by SPARQL querying and scripting, using tools such as Jupyter Notebooks. The analysis results presented suggest interesting spatial, temporal, and topical trends in how the Finnish fiction literature has evolved during the last decades. The approach and tools presented in this paper can be used for analyzing literary landscapes developments in other countries as well.
AB - The BookSampo Linked Data portal was deployed in 2011 by the Finnish Public Libraries and has today nearly 2 million annual users. Its Linked Data covers virtually all Finnish fiction literature but the data has not been used for data analyses in Digital Humanities. This paper discusses how the Knowledge Graph can be used for literary research in two ways: First, a new BookSampo 2.0 Portal user interface is presented, based on faceted semantic search with seamlessly integrated data-analytic tools for Digital Humanities research as suggested in the Sampo Model. This application makes it possible to analyze the data without programming skills. Second, the BookSampo SPARQL endpoint API can be accessed directly by SPARQL querying and scripting, using tools such as Jupyter Notebooks. The analysis results presented suggest interesting spatial, temporal, and topical trends in how the Finnish fiction literature has evolved during the last decades. The approach and tools presented in this paper can be used for analyzing literary landscapes developments in other countries as well.
UR - http://www.scopus.com/inward/record.url?scp=85216468479&partnerID=8YFLogxK
U2 - 10.1002/asi.24984
DO - 10.1002/asi.24984
M3 - Article
AN - SCOPUS:85216468479
SN - 2330-1635
JO - Journal of the Association for Information Science and Technology
JF - Journal of the Association for Information Science and Technology
ER -