Abstract
This paper presents an effective method for case law retrieval based on semantic document similarity and a web application for querying Finnish case law. The novelty of the work comes from the idea of using legal documents for automatic formulation of the query, including case law judgments, legal case descriptions, or other texts. The query documents may be in various formats, including image files with text content. This approach allows efficient search for similar documents without the need to specify a query string or keywords, which can be difficult in this use case. The application leverages two traditional word frequency based methods, TF-IDF and LDA, alongside two modern neural network methods, Doc2Vec and Doc2VecC. Effectiveness of the approach for document relevance ranking has been evaluated using a gold standard set of inter-document similarities. We show that a linear combination of similarities derived from the individual models provides a robust automatic similarity assessment for ranking the case law documents for retrieval.
Original language | English |
---|---|
Title of host publication | Artificial Intelligence and Natural Language - 9th Conference, AINL 2020, Proceedings |
Editors | Andrey Filchenkov, Janne Kauttonen, Lidia Pivovarova |
Publisher | Springer |
Pages | 145-157 |
Number of pages | 13 |
ISBN (Electronic) | 978-3-030-59082-6 |
ISBN (Print) | 978-3-030-59081-9 |
DOIs | |
Publication status | Published - 2020 |
MoE publication type | A4 Conference publication |
Event | Artificial Intelligence and Natural Language - Virtual, Online Duration: 7 Oct 2020 → 9 Oct 2020 Conference number: 9 |
Publication series
Name | Communications in Computer and Information Science |
---|---|
Publisher | Springer |
Volume | 1292 |
ISSN (Electronic) | 1865-0929 |
Conference
Conference | Artificial Intelligence and Natural Language |
---|---|
Abbreviated title | AINL |
City | Virtual, Online |
Period | 07/10/2020 → 09/10/2020 |