Related papers: Measuring Item Similarity in Introductory Programm…

Measuring Plagiarism in Introductory Programming Course Assignments

Measuring plagiarism in programming assignments is an essential task to the educational procedure. This paper discusses the methods of plagiarism and its detection in introductory programming course assignments written in C++. A small…

Computation and Language · Computer Science 2022-05-31 Muhammad Humayoun, Muhammad Adnan Hashmi, Ali Hanzala Khan

Jointly Learning Multiple Measures of Similarities from Triplet Comparisons

Similarity between objects is multi-faceted and it can be easier for human annotators to measure it when the focus is on a specific aspect. We consider the problem of mapping objects into view-specific embeddings where the distance between…

Machine Learning · Statistics 2015-10-08 Liwen Zhang, Subhransu Maji, Ryota Tomioka

Measuring Human-perceived Similarity in Heterogeneous Collections

We present a technique for estimating the similarity between objects such as movies or foods whose proper representation depends on human perception. Our technique combines a modest number of human similarity assessments to infer a pairwise…

Artificial Intelligence · Computer Science 2018-02-19 Jesse Anderton, Pavel Metrikov, Virgil Pavlu, Javed Aslam

Learning similarity measures from data

Defining similarity measures is a requirement for some machine learning methods. One such method is case-based reasoning (CBR) where the similarity measure is used to retrieve the stored case or set of cases most similar to the query case.…

Machine Learning · Computer Science 2020-01-16 Bjørn Magnus Mathisen, Agnar Aamodt, Kerstin Bach, Helge Langseth

The importance of being dissimilar in Recommendation

Similarity measures play a fundamental role in memory-based nearest neighbors approaches. They recommend items to a user based on the similarity of either items or users in a neighborhood. In this paper we argue that, although it keeps a…

Information Retrieval · Computer Science 2019-07-05 Vito Walter Anelli, Joseph Trotta, Tommaso Di Noia, Eugenio Di Sciascio, Azzurra Ragone

Understanding (dis)similarity measures

Intuitively, the concept of similarity is the notion to measure an inexact matching between two entities of the same reference set. The notions of similarity and its close relative dissimilarity are widely used in many fields of Artificial…

Artificial Intelligence · Computer Science 2012-12-13 Lluís A. Belanche

Evaluation Measures of Individual Item Fairness for Recommender Systems: A Critical Study

Fairness is an emerging and challenging topic in recommender systems. In recent years, various ways of evaluating and therefore improving fairness have emerged. In this study, we examine existing evaluation measures of fairness in…

Information Retrieval · Computer Science 2024-05-21 Theresia Veronika Rampisela, Maria Maistro, Tuukka Ruotsalo, Christina Lioma

Proximity Measure of Information Object Features for Solving the Problem of Their Identification in Information Systems

The paper considers a new quantitative-qualitative proximity measure for the features of information objects, where data enters a common information resource from several sources independently. The goal is to determine the possibility of…

Artificial Intelligence · Computer Science 2026-04-08 Volodymyr Yuzefovych

We present a model to measure the similarity in appearance between different materials, which correlates with human similarity judgments. We first create a database of 9,000 rendered images depicting objects with varying materials, shape…

Graphics · Computer Science 2020-03-18 Manuel Lagunas, Sandra Malpica, Ana Serrano, Elena Garces, Diego Gutierrez, Belen Masia

A Framework for Standardizing Similarity Measures in a Rapidly Evolving Field

Similarity measures are fundamental tools for quantifying the alignment between artificial and biological systems. However, the diversity of similarity measures and their varied naming and implementation conventions makes it challenging to…

Neurons and Cognition · Quantitative Biology 2025-09-09 Nathan Cloos, Guangyu Robert Yang, Christopher J. Cueva

Measuring similarity between training examples is critical for curating high-quality and diverse pretraining datasets for language models. However, similarity is typically computed with a generic off-the-shelf embedding model that has been…

Machine Learning · Computer Science 2025-10-22 Dylan Sam, Ayan Chakrabarti, Afshin Rostamizadeh, Srikumar Ramalingam, Gui Citovsky, Sanjiv Kumar

Metrics for Inter-Dataset Similarity with Example Applications in Synthetic Data and Feature Selection Evaluation -- Extended Version

Measuring inter-dataset similarity is an important task in machine learning and data mining with various use cases and applications. Existing methods for measuring inter-dataset similarity are computationally expensive, limited, or…

Machine Learning · Computer Science 2025-05-06 Muhammad Rajabinasab, Anton D. Lautrup, Arthur Zimek

Plagiarism deterrence for introductory programming

Plagiarism in introductory programming courses is an enormous challenge for both students and institutions. For students, relying on the work of others too early in their academic development can make it impossible to acquire necessary…

Computers and Society · Computer Science 2022-06-08 Simon J. Cohen, Michael J. Martin, Chance A. Shipley, Abhishek Kumar, Andrew R. Cohen

Measuring and predicting visual fidelity

This paper is a study of techniques for measuring and predicting visual fidelity. As visual stimuli we use polygonal models, and vary their fidelity with two different model simplification algorithms. We also group the stimuli into two…

Graphics · Computer Science 2025-07-17 Benjamin Watson, Alinda Friedman, Aaron McGaffey

Measuring Compositionality in Representation Learning

Many machine learning algorithms represent input data with vector embeddings or discrete codes. When inputs exhibit compositional structure (e.g. objects built from parts or procedures from subroutines), it is natural to ask whether this…

Machine Learning · Computer Science 2019-04-09 Jacob Andreas

A Guide to Similarity Measures

Similarity measures play a central role in various data science application domains for a wide assortment of tasks. This guide describes a comprehensive set of prevalent similarity measures to serve both non-experts and professional.…

Information Retrieval · Computer Science 2024-08-16 Avivit Levy, B. Riva Shalom, Michal Chalamish

Ranking relations using analogies in biological and information networks

Analogical reasoning depends fundamentally on the ability to learn and generalize about relations between objects. We develop an approach to relational learning which, given a set of pairs of objects…

Methodology · Statistics 2013-08-30 Ricardo Silva, Katherine Heller, Zoubin Ghahramani, Edoardo M. Airoldi

Learning Compatibility Across Categories for Heterogeneous Item Recommendation

Identifying relationships between items is a key task of an online recommender system, in order to help users discover items that are functionally complementary or visually compatible. In domains like clothing recommendation, this task is…

Information Retrieval · Computer Science 2016-09-30 Ruining He, Charles Packer, Julian McAuley

An Empirical Evaluation of Similarity Measures for Time Series Classification

Time series are ubiquitous, and a measure to assess their similarity is a core part of many computational systems. In particular, the similarity measure is the most essential ingredient of time series clustering and classification systems.…

Machine Learning · Computer Science 2016-05-18 Joan Serrà, Josep Lluis Arcos

Metric Learning for Individual Fairness

There has been much discussion recently about how fairness should be measured or enforced in classification. Individual Fairness [Dwork, Hardt, Pitassi, Reingold, Zemel, 2012], which requires that similar individuals be treated similarly,…

Machine Learning · Computer Science 2020-04-03 Christina Ilvento