Relevance Feedback Search Based on Automatic Annotation and Classification of Texts

Tutkimustuotos: Artikkeli kirjassa/konferenssijulkaisussaConference contributionScientificvertaisarvioitu

20 Lataukset (Pure)

Abstrakti

The idea behind Relevance Feedback Search (RFBS) is to build search queries as an iterative and interactive process in which they are gradually refined based on the results of the previous search round. This can be helpful in situations where the end user cannot easily formulate their information needs at the outset as a well-focused query, or more generally as a way to filter and focus search results. This paper concerns (1) a framework that integrates keyword extraction and unsupervised classification into the RFBS paradigm and (2) the application of this framework to the legal domain as a use case. We focus on the Natural Language Processing (NLP) methods underlying the framework and application, where an automatic annotation tool is used for extracting document keywords as ontology concepts, which are then transformed into word embeddings to form vectorial representations of the texts. An unsupervised classification system that employs similar techniques is also used in order to classify the documents into broad thematic classes. This classification functionality is evaluated using two different datasets. As the use case, we describe an application perspective in the semantic portal LawSampo - Finnish Legislation and Case Law on the Semantic Web. This online demonstrator uses a dataset of 82145 sections in 3725 statutes of Finnish legislation and another dataset that comprises 13470 court decisions.
AlkuperäiskieliEnglanti
Otsikko3rd Conference on Language, Data and Knowledge (LDK 2021)
ToimittajatDagmar Gromann, Gilles Serasset, Thierry Declerck, John P. McCrae, Jorge Gracia, Julia Bosque-Gil, Fernando Bobillo, Barbara Heinisch
KustantajaSchloss Dagstuhl-Leibniz-Zentrum fuer Informatik
Sivumäärä15
ISBN (elektroninen)978-3-95977-199-3
DOI - pysyväislinkit
TilaJulkaistu - 2021
OKM-julkaisutyyppiA4 Artikkeli konferenssijulkaisuussa
TapahtumaInternational Conference on Language, Data, and Knowledge - Virtual, Online, Zaragoza, Espanja
Kesto: 1 syyskuuta 20213 syyskuuta 2021
Konferenssinumero: 3

Julkaisusarja

NimiOpen Access Series in Informatics (OASIcs)
KustantajaSchloss Dagstuhl -- Leibniz-Zentrum für Informatik
Vuosikerta93
ISSN (elektroninen)2190-6807

Conference

ConferenceInternational Conference on Language, Data, and Knowledge
LyhennettäLDK
Maa/AlueEspanja
KaupunkiZaragoza
Ajanjakso01/09/202103/09/2021

Sormenjälki

Sukella tutkimusaiheisiin 'Relevance Feedback Search Based on Automatic Annotation and Classification of Texts'. Ne muodostavat yhdessä ainutlaatuisen sormenjäljen.

Siteeraa tätä