Filip Ginter — Scifaro

Structure Retention in Embedding Spaces as a Predictor of Benchmark Performance

In this paper, we show that high-performing embedding models organize their embedding spaces in a consistent way. We evaluate 25 contemporary embedding models on five MTEB tasks spanning four diverse task categories (retrieval, bitext…

Computation and Language · Computer Science 2026-05-22 Amanda Myntti, Jenna Kanerva, Veronika Laippala, Filip Ginter

Matching Meaning at Scale: Evaluating Semantic Search for 18th-Century Intellectual History through the Case of Locke

While digitized corpora have transformed the study of intellectual transmission, current methods rely heavily on lexical text reuse detection, capturing verbatim quotations but fundamentally missing paraphrases and complex implicit…

Computation and Language · Computer Science 2026-05-13 Yu Wu, Ananth Mahadevan, Filip Ginter, Michael Mathioudakis, Mikko Tolonen

Measuring Social Integration Through Participation: Categorizing Organizations and Leisure Activities in the Displaced Karelians Interview Archive using LLMs

Digitized historical archives make it possible to study everyday social life on a large scale, but the information extracted directly from text often does not directly allow one to answer the research questions posed by historians or…

Computation and Language · Computer Science 2026-02-18 Joonatan Laato, Veera Schroderus, Jenna Kanerva, Jenni Kauppi, Virpi Lummaa, Filip Ginter

Creating a Historical Migration Dataset from Finnish Church Records, 1800-1920

This article presents a large-scale effort to create a structured dataset of internal migration in Finland between 1800 and 1920 using digitized church moving records. These records, maintained by Evangelical-Lutheran parishes, document the…

Computer Vision and Pattern Recognition · Computer Science 2025-09-09 Ari Vesalainen, Jenna Kanerva, Aida Nitsch, Kiia Korsu, Ilari Larkiola, Laura Ruotsalainen, Filip Ginter

Interaction Analysis by Humans and AI: A Comparative Perspective

This paper explores how Mixed Reality (MR) and 2D video conferencing influence children's communication during a gesture-based guessing game. Finnish-speaking participants engaged in a short collaborative task using two different setups:…

Human-Computer Interaction · Computer Science 2025-06-10 Maryam Teimouri, Filip Ginter, Tomi "bgt" Suovuo

Extracting Social Connections from Finnish Karelian Refugee Interviews Using LLMs

We performed a zero-shot information extraction study on a historical collection of 89,339 brief Finnish-language interviews of refugee families relocated post-WWII from Finnish Eastern Karelia. Our research objective is two-fold. First, we…

Computation and Language · Computer Science 2025-02-20 Joonatan Laato, Jenna Kanerva, John Loehr, Virpi Lummaa, Filip Ginter

Semantic Search as Extractive Paraphrase Span Detection

In this paper, we approach the problem of semantic search by framing the search task as paraphrase span detection, i.e. given a segment of text as a query phrase, the task is to identify its paraphrase in a given document, the same…

Computation and Language · Computer Science 2025-02-20 Jenna Kanerva, Hanna Kitti, Li-Hsin Chang, Teemu Vahtola, Mathias Creutz, Filip Ginter

OCR Error Post-Correction with LLMs in Historical Documents: No Free Lunches

Optical Character Recognition (OCR) systems often introduce errors when transcribing historical documents, leaving room for post-correction to improve text quality. This study evaluates the use of open-weight LLMs for OCR error correction…

Computation and Language · Computer Science 2025-02-04 Jenna Kanerva, Cassandra Ledins, Siiri Käpyaho, Filip Ginter

FinerWeb-10BT: Refining Web Data with LLM-Based Line-Level Filtering

Data quality is crucial for training Large Language Models (LLMs). Traditional heuristic filters often miss low-quality text or mistakenly remove valuable content. In this paper, we introduce an LLM-based line-level filtering method to…

Computation and Language · Computer Science 2025-01-14 Erik Henriksson, Otto Tarkka, Filip Ginter

Finnish SQuAD: A Simple Approach to Machine Translation of Span Annotations

We apply a simple method to machine translate datasets with span-level annotation using the DeepL MT service and its ability to translate formatted documents. Using this method, we produce a Finnish version of the SQuAD2.0 question…

Computation and Language · Computer Science 2025-01-13 Emil Nuutinen, Iiro Rastas, Filip Ginter

FinGPT: Large Generative Models for a Small Language

Large language models (LLMs) excel in many tasks in NLP and beyond, but most open models have very limited coverage of smaller languages and LLM work tends to focus on languages where nearly unlimited data is available for pretraining. In…

Computation and Language · Computer Science 2023-11-13 Risto Luukkonen, Ville Komulainen, Jouni Luoma, Anni Eskelinen, Jenna Kanerva, Hanna-Mari Kupari, Filip Ginter, Veronika Laippala, Niklas Muennighoff, Aleksandra Piktus, Thomas Wang, Nouamane Tazi, Teven Le Scao, Thomas Wolf, Osma Suominen, Samuli Sairanen, Mikko Merioksa, Jyrki Heinonen, Aija Vahtola, Samuel Antao, Sampo Pyysalo

Silver Syntax Pre-training for Cross-Domain Relation Extraction

Relation Extraction (RE) remains a challenging task, especially when considering realistic out-of-domain evaluations. One of the main reasons for this is the limited training size of current RE datasets: obtaining high-quality (manually…

Computation and Language · Computer Science 2023-05-19 Elisa Bassignana, Filip Ginter, Sampo Pyysalo, Rob van der Goot, Barbara Plank

Multi-CrossRE: A Multi-Lingual Multi-Domain Dataset for Relation Extraction

Most research in Relation Extraction (RE) involves the English language, mainly due to the lack of multi-lingual resources. We propose Multi-CrossRE, the broadest multi-lingual dataset for RE, including 26 languages in addition to English,…

Computation and Language · Computer Science 2023-05-19 Elisa Bassignana, Filip Ginter, Sampo Pyysalo, Rob van der Goot, Barbara Plank

Identifying gender bias in blockbuster movies through the lens of machine learning

The problem of gender bias is highly prevalent and well known. In this paper, we have analysed the portrayal of gender roles in English movies, a medium that effectively influences society in shaping people's beliefs and opinions. First, we…

Computation and Language · Computer Science 2022-11-24 Muhammad Junaid Haris, Aanchal Upreti, Melih Kurtaran, Filip Ginter, Sebastien Lafond, Sepinoud Azimi

GEMv2: Multilingual NLG Benchmarking in a Single Line of Code

Evaluation in machine learning is usually informed by past choices, for example which datasets or metrics to use. This standardization enables the comparison on equal footing using leaderboards, but the evaluation choices become sub-optimal…

Computation and Language · Computer Science 2022-06-27 Sebastian Gehrmann, Abhik Bhattacharjee, Abinaya Mahendiran, Alex Wang, Alexandros Papangelis, Aman Madaan, Angelina McMillan-Major, Anna Shvets, Ashish Upadhyay, Bingsheng Yao, Bryan Wilie, Chandra Bhagavatula, Chaobin You, Craig Thomson, Cristina Garbacea, Dakuo Wang, Daniel Deutsch, Deyi Xiong, Di Jin, Dimitra Gkatzia, Dragomir Radev, Elizabeth Clark, Esin Durmus, Faisal Ladhak, Filip Ginter, Genta Indra Winata, Hendrik Strobelt, Hiroaki Hayashi, Jekaterina Novikova, Jenna Kanerva, Jenny Chim, Jiawei Zhou, Jordan Clive, Joshua Maynez, João Sedoc, Juraj Juraska, Kaustubh Dhole, Khyathi Raghavi Chandu, Laura Perez-Beltrachini, Leonardo F. R. Ribeiro, Lewis Tunstall, Li Zhang, Mahima Pushkarna, Mathias Creutz, Michael White, Mihir Sanjay Kale, Moussa Kamal Eddine, Nico Daheim, Nishant Subramani, Ondrej Dusek, Paul Pu Liang, Pawan Sasanka Ammanamanchi, Qi Zhu, Ratish Puduppully, Reno Kriz, Rifat Shahriyar, Ronald Cardenas, Saad Mahamood, Salomey Osei, Samuel Cahyawijaya, Sanja Štajner, Sebastien Montella, Shailza, Shailza Jolly, Simon Mille, Tahmid Hasan, Tianhao Shen, Tosin Adewumi, Vikas Raunak, Vipul Raheja, Vitaly Nikolaev, Vivian Tsai, Yacine Jernite, Ying Xu, Yisi Sang, Yixin Liu, Yufang Hou

Out-of-Domain Evaluation of Finnish Dependency Parsing

The prevailing practice in academia is to evaluate model performance on in-domain evaluation data, typically set aside from the training corpus. However, in many real-world applications the data on which the model is applied may very…

Computation and Language · Computer Science 2022-04-25 Jenna Kanerva, Filip Ginter

Explaining Classes through Word Attribution

In recent years, several methods have been proposed for explaining individual predictions of deep learning models, yet there has been little study of how to aggregate these predictions to explain how such models view classes as a whole in…

Computation and Language · Computer Science 2021-09-01 Samuel Rönnqvist, Amanda Myntti, Aki-Juhani Kyröläinen, Sampo Pyysalo, Veronika Laippala, Filip Ginter

Annotation Guidelines for the Turku Paraphrase Corpus

This document describes the annotation guidelines used to construct the Turku Paraphrase Corpus. These guidelines were developed together with the corpus annotation, revising and extending the guidelines regularly during the annotation…

Computation and Language · Computer Science 2021-08-20 Jenna Kanerva, Filip Ginter, Li-Hsin Chang, Iiro Rastas, Valtteri Skantsi, Jemina Kilpeläinen, Hanna-Mari Kupari, Aurora Piirto, Jenna Saarni, Maija Sevón, Otto Tarkka

Quantitative Evaluation of Alternative Translations in a Corpus of Highly Dissimilar Finnish Paraphrases

In this paper, we present a quantitative evaluation of differences between alternative translations in a large recently released Finnish paraphrase corpus focusing in particular on non-trivial variation in translation. We combine a series…

Computation and Language · Computer Science 2021-05-07 Li-Hsin Chang, Sampo Pyysalo, Jenna Kanerva, Filip Ginter

Deep learning for sentence clustering in essay grading support

Essays as a form of assessment test student knowledge on a deeper level than short answer and multiple-choice questions. However, the manual evaluation of essays is time- and labor-consuming. Automatic clustering of essays, or their…

Computation and Language · Computer Science 2021-04-26 Li-Hsin Chang, Iiro Rastas, Sampo Pyysalo, Filip Ginter