Related papers: Notation for Subject Answer Analysis

Recommender Systems Notation: Proposed Common Notation for Teaching and Research

As the field of recommender systems has developed, authors have used a myriad of notations for describing the mathematical workings of recommendation algorithms. These notations ap-pear in research papers, books, lecture notes, blog posts,…

Information Retrieval · Computer Science 2019-02-13 Michael D. Ekstrand , Joseph A. Konstan

Toward Effective Automated Content Analysis via Crowdsourcing

Many computer scientists use the aggregated answers of online workers to represent ground truth. Prior work has shown that aggregation methods such as majority voting are effective for measuring relatively objective features. For subjective…

Computation and Language · Computer Science 2021-04-06 Jiele Wu , Chau-Wai Wong , Xinyan Zhao , Xianpeng Liu

How unitizing affects annotation of cohesion

This paper investigates how unitizing affects external observers' annotation of group cohesion. We compared unitizing techniques belonging to these categories: interval coding, continuous coding, and a technique inspired by a cognitive…

Human-Computer Interaction · Computer Science 2022-09-29 Eleonora Ceccaldi , Nale Lehmann-Willenbrock , Erica Volta , Mohamed Chetouani , Gualtiero Volpe , Giovanna Varni

Clinical trials with interim analyses: Standardizing Terminology to increase clarity

Interim analyses for group-sequential decision making are prevalent in clinical trials. Methodology is well established and has been routinely implemented over the last decades. Still, confusions and uncertainties on aspects of how to…

Applications · Statistics 2025-06-16 Elina Asikanius , Benjamin Hofner , Lisa V. Hampson , Gernot Wassmer , Christopher Jennison , Tobias Mielke , Cornelia Ursula Kunz , Kaspar Rufibach

Semi-automatic definite description annotation: a first report

Studies in Referring Expression Generation (REG) often make use of corpora of definite descriptions produced by human subjects in controlled experiments. Experiments of this kind, which are essential for the study of reference phenomena and…

Computation and Language · Computer Science 2017-12-27 Danillo da Silva Rocha , Alex Gwo Jen Lan , Ivandre Paraboni

Practical Annotation Strategies for Question Answering Datasets

Annotating datasets for question answering (QA) tasks is very costly, as it requires intensive manual labor and often domain-specific knowledge. Yet strategies for annotating QA datasets in a cost-effective manner are scarce. To provide a…

Computation and Language · Computer Science 2020-03-09 Bernhard Kratzwald , Xiang Yue , Huan Sun , Stefan Feuerriegel

Graph-Based Recommendation System Enhanced with Community Detection

Many researchers have used tag information to improve the performance of recommendation techniques in recommender systems. Examining the tags of users will help to get their interests and leads to more accuracy in the recommendations. Since…

Information Retrieval · Computer Science 2023-10-03 Zeinab Shokrzadeh , Mohammad-Reza Feizi-Derakhshi , Mohammad-Ali Balafar , Jamshid Bagherzadeh-Mohasefi

Annotation and modeling of emotions in a textual corpus: an evaluative approach

Emotion is a crucial phenomenon in the functioning of human beings in society. However, it remains a widely open subject, particularly in its textual manifestations. This paper examines an industrial corpus manually annotated following an…

Computation and Language · Computer Science 2025-09-03 Jonas Noblet

Interpreting Expert Annotation Differences in Animal Behavior

Hand-annotated data can vary due to factors such as subjective differences, intra-rater variability, and differing annotator expertise. We study annotations from different experts who labelled the same behavior classes on a set of animal…

Machine Learning · Computer Science 2021-06-14 Megan Tjandrasuwita , Jennifer J. Sun , Ann Kennedy , Swarat Chaudhuri , Yisong Yue

Analysis of Automatic Annotation Suggestions for Hard Discourse-Level Tasks in Expert Domains

Many complex discourse-level tasks can aid domain experts in their work but require costly expert annotations for data creation. To speed up and ease annotations, we investigate the viability of automatically generated annotation…

Computation and Language · Computer Science 2019-06-07 Claudia Schulz , Christian M. Meyer , Jan Kiesewetter , Michael Sailer , Elisabeth Bauer , Martin R. Fischer , Frank Fischer , Iryna Gurevych

SAMSum Corpus: A Human-annotated Dialogue Dataset for Abstractive Summarization

This paper introduces the SAMSum Corpus, a new dataset with abstractive dialogue summaries. We investigate the challenges it poses for automated summarization by testing several models and comparing their results with those obtained on a…

Computation and Language · Computer Science 2019-12-02 Bogdan Gliwa , Iwona Mochol , Maciej Biesek , Aleksander Wawer

Explainable Agreement through Simulation for Tasks with Subjective Labels

The field of information retrieval often works with limited and noisy data in an attempt to classify documents into subjective categories, e.g., relevance, sentiment and controversy. We typically quantify a notion of agreement to understand…

Information Retrieval · Computer Science 2018-06-14 John Foley

Sentence Embeddings and High-speed Similarity Search for Fast Computer Assisted Annotation of Legal Documents

Human-performed annotation of sentences in legal documents is an important prerequisite to many machine learning based systems supporting legal tasks. Typically, the annotation is done sequentially, sentence by sentence, which is often time…

Computation and Language · Computer Science 2021-12-23 Hannes Westermann , Jaromir Savelka , Vern R. Walker , Kevin D. Ashley , Karim Benyekhlef

Corpus Considerations for Annotator Modeling and Scaling

Recent trends in natural language processing research and annotation tasks affirm a paradigm shift from the traditional reliance on a single ground truth to a focus on individual perspectives, particularly in subjective tasks. In scenarios…

Computation and Language · Computer Science 2024-04-18 Olufunke O. Sarumi , Béla Neuendorf , Joan Plepi , Lucie Flek , Jörg Schlötterer , Charles Welch

Learning Supervised Topic Models for Classification and Regression from Crowds

The growing need to analyze large collections of documents has led to great developments in topic modeling. Since documents are frequently associated with other related variables, such as labels or ratings, much interest has been placed on…

Machine Learning · Statistics 2018-08-20 Filipe Rodrigues , Mariana Lourenço , Bernardete Ribeiro , Francisco Pereira

Auto-Annotation Quality Prediction for Semi-Supervised Learning with Ensembles

Auto-annotation by ensemble of models is an efficient method of learning on unlabeled data. Wrong or inaccurate annotations generated by the ensemble may lead to performance degradation of the trained model. To deal with this problem we…

Computer Vision and Pattern Recognition · Computer Science 2024-03-14 Dror Simon , Miriam Farber , Roman Goldenberg

ReflectSumm: A Benchmark for Course Reflection Summarization

This paper introduces ReflectSumm, a novel summarization dataset specifically designed for summarizing students' reflective writing. The goal of ReflectSumm is to facilitate developing and evaluating novel summarization techniques tailored…

Computation and Language · Computer Science 2024-04-24 Yang Zhong , Mohamed Elaraby , Diane Litman , Ahmed Ashraf Butt , Muhsin Menekse

Unveiling the Multi-Annotation Process: Examining the Influence of Annotation Quantity and Instance Difficulty on Model Performance

The NLP community has long advocated for the construction of multi-annotator datasets to better capture the nuances of language interpretation, subjectivity, and ambiguity. This paper conducts a retrospective study to show how performance…

Computation and Language · Computer Science 2023-10-24 Pritam Kadasi , Mayank Singh

Supersense and Sensibility: Proxy Tasks for Semantic Annotation of Prepositions

Prepositional supersense annotation is time-consuming and requires expert training. Here, we present two sensible methods for obtaining prepositional supersense annotations by eliciting surface substitution and similarity judgments. Four…

Computation and Language · Computer Science 2021-03-30 Luke Gessler , Shira Wein , Nathan Schneider

Suggesting Relevant Questions for a Query Using Statistical Natural Language Processing Technique

Suggesting similar questions for a user query has many applications ranging from reducing search time of users on e-commerce websites, training of employees in companies to holistic learning for students. The use of Natural Language…

Computation and Language · Computer Science 2022-04-27 Shriniwas Nayak , Anuj Kanetkar , Hrushabh Hirudkar , Archana Ghotkar , Sheetal Sonawane , Onkar Litake