Related papers: Carl-Hauser -- Open Source Image Matching Algorith…

Open Dataset of Phishing and Tor Hidden Services Screen-captures

Security analysts need to classify, search and correlate numerous images. Automatic classification tools improve the efficiency of such tasks. However, the main resources to develop these tools are datasets, which are introduced and…

Cryptography and Security · Computer Science 2019-08-08 Vincent Falconieri

Image Matching: An Application-oriented Benchmark

Image matching approaches have been widely used in computer vision applications in which the image-level matching performance of matchers is critical. However, it has not been well investigated by previous works which place more emphases on…

Computer Vision and Pattern Recognition · Computer Science 2018-08-08 JiaWang Bian , Le Zhang , Yun Liu , Wen-Yan Lin , Ming-Ming Cheng , Ian D. Reid

A Bayesian algorithm for detecting identity matches and fraud in image databases

A statistical algorithm for categorizing different types of matches and fraud in image databases is presented. The approach is based on a generative model of a graph representing images and connections between pairs of identities, trained…

Computer Vision and Pattern Recognition · Computer Science 2017-06-21 Gaurav Thakur

A framework for benchmarking clustering algorithms

The evaluation of clustering algorithms can involve running them on a variety of benchmark problems, and comparing their outputs to the reference, ground-truth groupings provided by experts. Unfortunately, many research papers and graduate…

Machine Learning · Computer Science 2023-10-27 Marek Gagolewski

OpenPerf: A Benchmarking Framework for the Sustainable Development of the Open-Source Ecosystem

Benchmarking involves designing scientific test methods, tools, and frameworks to quantitatively and comparably assess specific performance indicators of certain test subjects. With the development of artificial intelligence, AI…

Software Engineering · Computer Science 2023-11-28 Fenglin Bi , Fanyu Han , Shengyu Zhao , Jinlu Li , Yanbin Zhang , Wei Wang

OpenICS: Open Image Compressive Sensing Toolbox and Benchmark

We present OpenICS, an image compressive sensing toolbox that includes multiple image compressive sensing and reconstruction algorithms proposed in the past decade. Due to the lack of standardization in the implementation and evaluation of…

Computer Vision and Pattern Recognition · Computer Science 2021-05-10 Jonathan Zhao , Matthew Westerham , Mark Lakatos-Toth , Zhikang Zhang , Avi Moskoff , Fengbo Ren

Towards Benchmark Datasets for Machine Learning Based Website Phishing Detection: An experimental study

In this paper, we present a general scheme for building reproducible and extensible datasets for website phishing detection. The aim is to (1) enable comparison of systems using different features, (2) overtake the short-lived nature of…

Cryptography and Security · Computer Science 2024-04-24 Abdelhakim Hannousse , Salima Yahiouche

Douglas-Quaid -- Open Source Image Matching Library

Security analysts need to classify, search and correlate numerous images. Automatic classification tools improve the efficiency of such tasks. However, no open-source and turnkey library was found able to reach this goal. The present paper…

Cryptography and Security · Computer Science 2019-08-13 Vincent Falconieri

An Evaluation-Focused Framework for Visualization Recommendation Algorithms

Although we have seen a proliferation of algorithms for recommending visualizations, these algorithms are rarely compared with one another, making it difficult to ascertain which algorithm is best for a given visual analysis scenario.…

Human-Computer Interaction · Computer Science 2021-09-08 Zehua Zeng , Phoebe Moh , Fan Du , Jane Hoffswell , Tak Yeon Lee , Sana Malik , Eunyee Koh , Leilani Battle

Image-Based Benchmarking and Visualization for Large-Scale Global Optimization

In the context of optimization, visualization techniques can be useful for understanding the behaviour of optimization algorithms and can even provide a means to facilitate human interaction with an optimizer. Towards this goal, an…

Neural and Evolutionary Computing · Computer Science 2020-07-27 Kyle Robert Harrison , Azam Asilian Bidgoli , Shahryar Rahnamayan , Kalyanmoy Deb

Benchmarking Robustness to Adversarial Image Obfuscations

Automated content filtering and moderation is an important tool that allows online platforms to build striving user communities that facilitate cooperation and prevent abuse. Unfortunately, resourceful actors try to bypass automated filters…

Computer Vision and Pattern Recognition · Computer Science 2023-12-01 Florian Stimberg , Ayan Chakrabarti , Chun-Ta Lu , Hussein Hazimeh , Otilia Stretcu , Wei Qiao , Yintao Liu , Merve Kaya , Cyrus Rashtchian , Ariel Fuxman , Mehmet Tek , Sven Gowal

Ranking News-Quality Multimedia

News editors need to find the photos that best illustrate a news piece and fulfill news-media quality standards, while being pressed to also find the most recent photos of live events. Recently, it became common to use social-media content…

Information Retrieval · Computer Science 2018-10-10 Gonçalo Marcelino , Ricardo Pinto , João Magalhães

A Benchmarking Framework for Model Datasets

Empirical and LLM-based research in model-driven engineering increasingly relies on datasets of software models, for instance, to train or evaluate machine learning techniques for modeling support. These datasets have a significant impact…

Software Engineering · Computer Science 2026-03-06 Philipp-Lorenz Glaser , Lola Burgueño , Dominik Bork

An Open Source AutoML Benchmark

In recent years, an active field of research has developed around automated machine learning (AutoML). Unfortunately, comparing different AutoML systems is hard and often done incorrectly. We introduce an open, ongoing, and extensible…

Machine Learning · Computer Science 2019-07-02 Pieter Gijsbers , Erin LeDell , Janek Thomas , Sébastien Poirier , Bernd Bischl , Joaquin Vanschoren

Efficient Exploration of Image Classifier Failures with Bayesian Optimization and Text-to-Image Models

Image classifiers should be used with caution in the real world. Performance evaluated on a validation set may not reflect performance in the real world. In particular, classifiers may perform well for conditions that are frequently…

Computer Vision and Pattern Recognition · Computer Science 2024-09-30 Adrien LeCoz , Houssem Ouertatani , Stéphane Herbin , Faouzi Adjed

Phishing Attacks and Websites Classification Using Machine Learning and Multiple Datasets (A Comparative Analysis)

Phishing attacks are the most common type of cyber-attacks used to obtain sensitive information and have been affecting individuals as well as organisations across the globe. Various techniques have been proposed to identify the phishing…

Cryptography and Security · Computer Science 2021-01-08 Sohail Ahmed Khan , Wasiq Khan , Abir Hussain

Hashmarks: Privacy-Preserving Benchmarks for High-Stakes AI Evaluation

There is a growing need to gain insight into language model capabilities that relate to sensitive topics, such as bioterrorism or cyberwarfare. However, traditional open source benchmarks are not fit for the task, due to the associated…

Machine Learning · Computer Science 2023-12-27 Paul Bricman

A Sophisticated Framework for the Accurate Detection of Phishing Websites

Phishing is an increasingly sophisticated form of cyberattack that is inflicting huge financial damage to corporations throughout the globe while also jeopardizing individuals' privacy. Attackers are constantly devising new methods of…

Cryptography and Security · Computer Science 2024-03-18 Asif Newaz , Farhan Shahriyar Haq , Nadim Ahmed

OpenDataVal: a Unified Benchmark for Data Valuation

Assessing the quality and impact of individual data points is critical for improving model performance and mitigating undesirable biases within the training dataset. Several data valuation algorithms have been proposed to quantify data…

Machine Learning · Computer Science 2023-10-16 Kevin Fu Jiang , Weixin Liang , James Zou , Yongchan Kwon

ANN-Benchmarks: A Benchmarking Tool for Approximate Nearest Neighbor Algorithms

This paper describes ANN-Benchmarks, a tool for evaluating the performance of in-memory approximate nearest neighbor algorithms. It provides a standard interface for measuring the performance and quality achieved by nearest neighbor…

Information Retrieval · Computer Science 2018-07-19 Martin Aumüller , Erik Bernhardsson , Alexander Faithfull