Related papers: Data Augmentation for Hypernymy Detection

Improving Hypernymy Detection with an Integrated Path-based and Distributional Method

Detecting hypernymy relations is a key task in NLP, which is addressed in the literature using two complementary approaches. Distributional methods, whose supervised variants are the current best performers, and path-based methods, which…

Computation and Language · Computer Science 2016-06-08 Vered Shwartz , Yoav Goldberg , Ido Dagan

Network Features Based Co-hyponymy Detection

Distinguishing lexical relations has been a long term pursuit in natural language processing (NLP) domain. Recently, in order to detect lexical relations like hypernymy, meronymy, co-hyponymy etc., distributional semantic models are being…

Computation and Language · Computer Science 2018-02-14 Abhik Jana , Pawan Goyal

Distributional Inclusion Vector Embedding for Unsupervised Hypernymy Detection

Modeling hypernymy, such as poodle is-a dog, is an important generalization aid to many NLP tasks, such as entailment, coreference, relation extraction, and question answering. Supervised learning from labeled hypernym sources, such as…

Computation and Language · Computer Science 2018-05-31 Haw-Shiuan Chang , ZiYun Wang , Luke Vilnis , Andrew McCallum

WHERE and WHICH: Iterative Debate for Biomedical Synthetic Data Augmentation

In Biomedical Natural Language Processing (BioNLP) tasks, such as Relation Extraction, Named Entity Recognition, and Text Classification, the scarcity of high-quality data remains a significant challenge. This limitation poisons large…

Computation and Language · Computer Science 2025-04-01 Zhengyi Zhao , Shubo Zhang , Bin Liang , Binyang Li , Kam-Fai Wong

Data Augmentation in Natural Language Processing: A Novel Text Generation Approach for Long and Short Text Classifiers

In many cases of machine learning, research suggests that the development of training data might have a higher relevance than the choice and modelling of classifiers themselves. Thus, data augmentation methods have been developed to improve…

Computation and Language · Computer Science 2022-07-25 Markus Bayer , Marc-André Kaufhold , Björn Buchhold , Marcel Keller , Jörg Dallmeyer , Christian Reuter

Hypernyms under Siege: Linguistically-motivated Artillery for Hypernymy Detection

The fundamental role of hypernymy in NLP has motivated the development of many methods for the automatic identification of this relation, most of which rely on word distribution. We investigate an extensive number of such unsupervised…

Computation and Language · Computer Science 2017-01-10 Vered Shwartz , Enrico Santus , Dominik Schlechtweg

An Empirical Survey of Data Augmentation for Limited Data Learning in NLP

NLP has achieved great progress in the past decade through the use of neural models and large labeled datasets. The dependence on abundant data prevents NLP models from being applied to low-resource settings or novel tasks where significant…

Computation and Language · Computer Science 2021-06-15 Jiaao Chen , Derek Tam , Colin Raffel , Mohit Bansal , Diyi Yang

DAGA: Data Augmentation with a Generation Approach for Low-resource Tagging Tasks

Data augmentation techniques have been widely used to improve machine learning performance as they enhance the generalization capability of models. In this work, to generate high quality synthetic data for low-resource tagging tasks, we…

Computation and Language · Computer Science 2020-11-04 Bosheng Ding , Linlin Liu , Lidong Bing , Canasai Kruengkrai , Thien Hai Nguyen , Shafiq Joty , Luo Si , Chunyan Miao

An Experimental Study on Data Augmentation Techniques for Named Entity Recognition on Low-Resource Domains

Named Entity Recognition (NER) is a machine learning task that traditionally relies on supervised learning and annotated data. Acquiring such data is often a challenge, particularly in specialized fields like medical, legal, and financial…

Computation and Language · Computer Science 2026-04-01 Arthur Elwing Torres , Edleno Silva de Moura , Altigran Soares da Silva , Mario A. Nascimento , Filipe Mesquita

Adversarial Imitation Learning with Trajectorial Augmentation and Correction

Deep Imitation Learning requires a large number of expert demonstrations, which are not always easy to obtain, especially for complex tasks. A way to overcome this shortage of labels is through data augmentation. However, this cannot be…

Machine Learning · Computer Science 2021-03-29 Dafni Antotsiou , Carlo Ciliberto , Tae-Kyun Kim

Data augmentation using generative networks to identify dementia

Data limitation is one of the most common issues in training machine learning classifiers for medical applications. Due to ethical concerns and data privacy, the number of people that can be recruited to such experiments is generally…

Audio and Speech Processing · Electrical Eng. & Systems 2020-04-14 Bahman Mirheidari , Yilin Pan , Daniel Blackburn , Ronan O'Malley , Traci Walker , Annalena Venneri , Markus Reuber , Heidi Christensen

An Analysis of Simple Data Augmentation for Named Entity Recognition

Simple yet effective data augmentation techniques have been proposed for sentence-level and sentence-pair natural language processing tasks. Inspired by these efforts, we design and compare data augmentation for named entity recognition,…

Computation and Language · Computer Science 2020-10-23 Xiang Dai , Heike Adel

The Imaginative Generative Adversarial Network: Automatic Data Augmentation for Dynamic Skeleton-Based Hand Gesture and Human Action Recognition

Deep learning approaches deliver state-of-the-art performance in recognition of spatiotemporal human motion data. However, one of the main challenges in these recognition tasks is limited available training data. Insufficient training data…

Computer Vision and Pattern Recognition · Computer Science 2023-08-14 Junxiao Shen , John Dudley , Per Ola Kristensson

A Comprehensive Study on NLP Data Augmentation for Hate Speech Detection: Legacy Methods, BERT, and LLMs

The surge of interest in data augmentation within the realm of NLP has been driven by the need to address challenges posed by hate speech domains, the dynamic nature of social media vocabulary, and the demands for large-scale neural…

Computation and Language · Computer Science 2024-04-02 Md Saroar Jahan , Mourad Oussalah , Djamila Romaissa Beddia , Jhuma kabir Mim , Nabil Arhab

Diversity-oriented Data Augmentation with Large Language Models

Data augmentation is an essential technique in natural language processing (NLP) for enriching training datasets by generating diverse samples. This process is crucial for improving the robustness and generalization capabilities of NLP…

Computation and Language · Computer Science 2025-10-16 Zaitian Wang , Jinghan Zhang , Xinhao Zhang , Kunpeng Liu , Pengfei Wang , Yuanchun Zhou

Data Augmentation for Emotion Detection in Small Imbalanced Text Data

Emotion recognition in text, the task of identifying emotions such as joy or anger, is a challenging problem in NLP with many applications. One of the challenges is the shortage of available datasets that have been annotated with emotions.…

Computation and Language · Computer Science 2023-10-31 Anna Koufakou , Diego Grisales , Ragy Costa de jesus , Oscar Fox

The Effectiveness of Data Augmentation in Image Classification using Deep Learning

In this paper, we explore and compare multiple solutions to the problem of data augmentation in image classification. Previous work has demonstrated the effectiveness of data augmentation through simple techniques, such as cropping,…

Computer Vision and Pattern Recognition · Computer Science 2017-12-14 Luis Perez , Jason Wang

Why we have switched from building full-fledged taxonomies to simply detecting hypernymy relations

The study of taxonomies and hypernymy relations has been extensive on the Natural Language Processing (NLP) literature. However, the evaluation of taxonomy learning approaches has been traditionally troublesome, as it mainly relies on…

Computation and Language · Computer Science 2017-03-24 Jose Camacho-Collados

Addressing Discrepancies in Semantic and Visual Alignment in Neural Networks

For the task of image classification, neural networks primarily rely on visual patterns. In robust networks, we would expect for visually similar classes to be represented similarly. We consider the problem of when semantically similar…

Computer Vision and Pattern Recognition · Computer Science 2023-06-05 Natalie Abreu , Nathan Vaska , Victoria Helus

Data Augmentation for Neural NLP

Data scarcity is a problem that occurs in languages and tasks where we do not have large amounts of labeled data but want to use state-of-the-art models. Such models are often deep learning models that require a significant amount of data…

Computation and Language · Computer Science 2023-02-23 Domagoj Pluščec , Jan Šnajder