Edwin Simpson — Scifaro

Teaching Language Models to Check Grounded Claim Factuality with Human Test-Taking Strategies

Grounded claim factuality checking is important for large language model (LLM) applications such as retrieval-augmented generation, as it helps users assess the correctness of generated outputs. Existing metrics using entailment classifiers…

Computation and Language · Computer Science 2026-05-29 Yuxuan Ye, Raul Santos-Rodriguez, Edwin Simpson

Optimising Factual Consistency in Summarisation via Preference Learning from Multiple Imperfect Metrics

Reinforcement learning with evaluation metrics as rewards is widely used to enhance specific capabilities of language models. However, for tasks such as factually consistent summarisation, existing metrics remain underdeveloped, limiting…

Computation and Language · Computer Science 2026-05-27 Yuxuan Ye, Raul Santos-Rodriguez, Edwin Simpson

Self-Supervised Animal Identification for Long Videos

Identifying individual animals in long-duration videos is essential for behavioral ecology, wildlife monitoring, and livestock management. Traditional methods require extensive manual annotation, while existing self-supervised approaches…

Computer Vision and Pattern Recognition · Computer Science 2026-01-15 Xuyang Fang, Sion Hannuna, Edwin Simpson, Neill Campbell

Clinically-aligned Multi-modal Chest X-ray Classification

Radiology is essential to modern healthcare, yet rising demand and staffing shortages continue to pose major challenges. Recent advances in artificial intelligence have the potential to support radiologists and help address these…

Image and Video Processing · Electrical Eng. & Systems 2025-11-14 Phillip Sloan, Edwin Simpson, Majid Mirmehdi

8-Calves Image dataset

Automated livestock monitoring is crucial for precision farming, but robust computer vision models are hindered by a lack of datasets reflecting real-world group challenges. We introduce the 8-Calves dataset, a challenging benchmark for…

Computer Vision and Pattern Recognition · Computer Science 2025-10-24 Xuyang Fang, Sion Hannuna, Neill Campbell, Edwin Simpson

Machine Learning for Climate Policy: Understanding Policy Progression in the European Green Deal

Climate change demands effective legislative action to mitigate its impacts. This study explores the application of machine learning (ML) to understand the progression of climate policy from announcement to adoption, focusing on policies…

Machine Learning · Computer Science 2025-10-21 Patricia West, Michelle WL Wan, Alexander Hepburn, Edwin Simpson, Raul Santos-Rodriguez, Jeffrey N Clark

How well can LLMs Grade Essays in Arabic?

This research assesses the effectiveness of state-of-the-art large language models (LLMs), including ChatGPT, Llama, Aya, Jais, and ACEGPT, in the task of Arabic automated essay scoring (AES) using the AR-AES dataset. It explores various…

Computation and Language · Computer Science 2025-01-29 Rayed Ghazawi, Edwin Simpson

Cutting-edge abstractive summarisers generate fluent summaries, but the factuality of the generated text is not guaranteed. Early summary factuality evaluation metrics are usually based on n-gram overlap and embedding similarity, but are…

Computation and Language · Computer Science 2024-09-24 Yuxuan Ye, Edwin Simpson, Raul Santos Rodriguez

Out-of-Distribution Detection with Attention Head Masking for Multimodal Document Classification

Detecting out-of-distribution (OOD) data is crucial in machine learning applications to mitigate the risk of model overconfidence, thereby enhancing the reliability and safety of deployed systems. The majority of existing OOD detection…

Artificial Intelligence · Computer Science 2024-08-22 Christos Constantinou, Georgios Ioannides, Aman Chadha, Aaron Elkins, Edwin Simpson

Medfluencer: A Network Representation of Medical Influencers' Identities and Discourse on Social Media

In our study, we first constructed a dataset from the tweets of the top 100 medical influencers with the highest Influencer Score during the COVID-19 pandemic. This dataset was then used to construct a socio-semantic network, mapping both…

Social and Information Networks · Computer Science 2024-08-01 Zhijin Guo, Edwin Simpson, Roberta Bernardi

Automated essay scoring in Arabic: a dataset and analysis of a BERT-based system

Automated Essay Scoring (AES) holds significant promise in the field of education, helping educators to mark larger volumes of essays and provide timely feedback. However, Arabic AES research has been limited by the lack of publicly…

Computation and Language · Computer Science 2024-07-17 Rayed Ghazawi, Edwin Simpson

Automated Radiology Report Generation: A Review of Recent Advances

Increasing demands on medical imaging departments are taking a toll on the radiologist's ability to deliver timely and accurate reports. Recent technological advances in artificial intelligence have demonstrated great potential for…

Computer Vision and Pattern Recognition · Computer Science 2024-07-04 Phillip Sloan, Philip Clatworthy, Edwin Simpson, Majid Mirmehdi

Towards Abstractive Timeline Summarisation using Preference-based Reinforcement Learning

This paper introduces a novel pipeline for summarising timelines of events reported by multiple news sources. Transformer-based models for abstractive summarisation generate coherent and concise summaries of long documents but can fail to…

Machine Learning · Computer Science 2023-11-06 Yuxuan Ye, Edwin Simpson

Efficient Methods for Natural Language Processing: A Survey

Recent work in natural language processing (NLP) has yielded appealing results from scaling model parameters and training data; however, using only scale to improve performance means that resource consumption also grows. Such resources…

Computation and Language · Computer Science 2023-03-28 Marcos Treviso, Ji-Ung Lee, Tianchu Ji, Betty van Aken, Qingqing Cao, Manuel R. Ciosici, Michael Hassid, Kenneth Heafield, Sara Hooker, Colin Raffel, Pedro H. Martins, André F. T. Martins, Jessica Zosa Forde, Peter Milder, Edwin Simpson, Noam Slonim, Jesse Dodge, Emma Strubell, Niranjan Balasubramanian, Leon Derczynski, Iryna Gurevych, Roy Schwartz

Assisting Decision Making in Scholarly Peer Review: A Preference Learning Perspective

Peer review is the primary means of quality control in academia; as an outcome of a peer review process, program and area chairs make acceptance decisions for each paper based on the review reports and scores they received. Quality of…

Computation and Language · Computer Science 2022-05-30 Nils Dycke, Edwin Simpson, Ilia Kuznetsov, Iryna Gurevych

Predicting the Humorousness of Tweets Using Gaussian Process Preference Learning

Most humour processing systems to date make at best discrete, coarse-grained distinctions between the comical and the conventional, yet such notions are better conceptualized as a broad spectrum. In this paper, we present a probabilistic…

Computation and Language · Computer Science 2021-03-30 Tristan Miller, Erik-Lân Do Dinh, Edwin Simpson, Iryna Gurevych

Improving Factual Consistency Between a Response and Persona Facts

Neural models for response generation produce responses that are semantically plausible but not necessarily factually consistent with facts describing the speaker's persona. These models are trained with fully supervised learning where the…

Computation and Language · Computer Science 2021-02-16 Mohsen Mesgar, Edwin Simpson, Iryna Gurevych

Ranking Creative Language Characteristics in Small Data Scenarios

The ability to rank creative natural language provides an important general tool for downstream language understanding and generation. However, current deep ranking models require substantial amounts of labeled data that are difficult and…

Computation and Language · Computer Science 2020-10-27 Julia Siekiera, Marius Köppel, Edwin Simpson, Kevin Stowe, Iryna Gurevych, Stefan Kramer

Interactive Text Ranking with Bayesian Optimisation: A Case Study on Community QA and Summarisation

For many NLP applications, such as question answering and summarisation, the goal is to select the best solution from a large space of candidates to meet a particular user's needs. To address the lack of user-specific training data, we…

Computation and Language · Computer Science 2020-09-15 Edwin Simpson, Yang Gao, Iryna Gurevych

Text Processing Like Humans Do: Visually Attacking and Shielding NLP Systems

Visual modifications to text are often used to obfuscate offensive comments in social media (e.g., "!d10t") or as a writing style ("1337" in "leet speak"), among other scenarios. We consider this as a new type of adversarial attack in NLP,…

Computation and Language · Computer Science 2020-06-11 Steffen Eger, Gözde Gül Şahin, Andreas Rücklé, Ji-Ung Lee, Claudia Schulz, Mohsen Mesgar, Krishnkant Swarnkar, Edwin Simpson, Iryna Gurevych