Related papers: Data Augmentation for Hypernymy Detection

200 papers

Detecting hypernymy relations is a key task in NLP, which is addressed in the literature using two complementary approaches. Distributional methods, whose supervised variants are the current best performers, and path-based methods, which…

Computation and Language · Computer Science 2016-06-08 Vered Shwartz, Yoav Goldberg, Ido Dagan

Distinguishing lexical relations has been a long-term pursuit in the natural language processing (NLP) domain. Recently, in order to detect lexical relations like hypernymy, meronymy, co-hyponymy, etc., distributional semantic models are being…

Computation and Language · Computer Science 2018-02-14 Abhik Jana, Pawan Goyal

Modeling hypernymy, such as poodle is-a dog, is an important generalization aid to many NLP tasks, such as entailment, coreference, relation extraction, and question answering. Supervised learning from labeled hypernym sources, such as…

Computation and Language · Computer Science 2018-05-31 Haw-Shiuan Chang, ZiYun Wang, Luke Vilnis, Andrew McCallum

In Biomedical Natural Language Processing (BioNLP) tasks, such as Relation Extraction, Named Entity Recognition, and Text Classification, the scarcity of high-quality data remains a significant challenge. This limitation poisons large…

Computation and Language · Computer Science 2025-04-01 Zhengyi Zhao, Shubo Zhang, Bin Liang, Binyang Li, Kam-Fai Wong

In many cases of machine learning, research suggests that the development of training data might have a higher relevance than the choice and modelling of classifiers themselves. Thus, data augmentation methods have been developed to improve…

Computation and Language · Computer Science 2022-07-25 Markus Bayer, Marc-André Kaufhold, Björn Buchhold, Marcel Keller, Jörg Dallmeyer, Christian Reuter

The fundamental role of hypernymy in NLP has motivated the development of many methods for the automatic identification of this relation, most of which rely on word distribution. We investigate an extensive number of such unsupervised…

Computation and Language · Computer Science 2017-01-10 Vered Shwartz, Enrico Santus, Dominik Schlechtweg

NLP has achieved great progress in the past decade through the use of neural models and large labeled datasets. The dependence on abundant data prevents NLP models from being applied to low-resource settings or novel tasks where significant…

Computation and Language · Computer Science 2021-06-15 Jiaao Chen, Derek Tam, Colin Raffel, Mohit Bansal, Diyi Yang

Data augmentation techniques have been widely used to improve machine learning performance as they enhance the generalization capability of models. In this work, to generate high quality synthetic data for low-resource tagging tasks, we…

Computation and Language · Computer Science 2020-11-04 Bosheng Ding, Linlin Liu, Lidong Bing, Canasai Kruengkrai, Thien Hai Nguyen, Shafiq Joty, Luo Si, Chunyan Miao

Named Entity Recognition (NER) is a machine learning task that traditionally relies on supervised learning and annotated data. Acquiring such data is often a challenge, particularly in specialized fields like medical, legal, and financial…

Computation and Language · Computer Science 2026-04-01 Arthur Elwing Torres, Edleno Silva de Moura, Altigran Soares da Silva, Mario A. Nascimento, Filipe Mesquita

Deep Imitation Learning requires a large number of expert demonstrations, which are not always easy to obtain, especially for complex tasks. A way to overcome this shortage of labels is through data augmentation. However, this cannot be…

Machine Learning · Computer Science 2021-03-29 Dafni Antotsiou, Carlo Ciliberto, Tae-Kyun Kim

Data limitation is one of the most common issues in training machine learning classifiers for medical applications. Due to ethical concerns and data privacy, the number of people that can be recruited to such experiments is generally…

Audio and Speech Processing · Electrical Eng. & Systems 2020-04-14 Bahman Mirheidari, Yilin Pan, Daniel Blackburn, Ronan O'Malley, Traci Walker, Annalena Venneri, Markus Reuber, Heidi Christensen

Simple yet effective data augmentation techniques have been proposed for sentence-level and sentence-pair natural language processing tasks. Inspired by these efforts, we design and compare data augmentation for named entity recognition,…

Computation and Language · Computer Science 2020-10-23 Xiang Dai, Heike Adel

Deep learning approaches deliver state-of-the-art performance in recognition of spatiotemporal human motion data. However, one of the main challenges in these recognition tasks is limited available training data. Insufficient training data…

Computer Vision and Pattern Recognition · Computer Science 2023-08-14 Junxiao Shen, John Dudley, Per Ola Kristensson

The surge of interest in data augmentation within the realm of NLP has been driven by the need to address challenges posed by hate speech domains, the dynamic nature of social media vocabulary, and the demands for large-scale neural…

Computation and Language · Computer Science 2024-04-02 Md Saroar Jahan, Mourad Oussalah, Djamila Romaissa Beddia, Jhuma kabir Mim, Nabil Arhab

Data augmentation is an essential technique in natural language processing (NLP) for enriching training datasets by generating diverse samples. This process is crucial for improving the robustness and generalization capabilities of NLP…

Computation and Language · Computer Science 2025-10-16 Zaitian Wang, Jinghan Zhang, Xinhao Zhang, Kunpeng Liu, Pengfei Wang, Yuanchun Zhou

Emotion recognition in text, the task of identifying emotions such as joy or anger, is a challenging problem in NLP with many applications. One of the challenges is the shortage of available datasets that have been annotated with emotions.…

Computation and Language · Computer Science 2023-10-31 Anna Koufakou, Diego Grisales, Ragy Costa de jesus, Oscar Fox

In this paper, we explore and compare multiple solutions to the problem of data augmentation in image classification. Previous work has demonstrated the effectiveness of data augmentation through simple techniques, such as cropping,…

Computer Vision and Pattern Recognition · Computer Science 2017-12-14 Luis Perez, Jason Wang

The study of taxonomies and hypernymy relations has been extensive in the Natural Language Processing (NLP) literature. However, the evaluation of taxonomy learning approaches has been traditionally troublesome, as it mainly relies on…

Computation and Language · Computer Science 2017-03-24 Jose Camacho-Collados

For the task of image classification, neural networks primarily rely on visual patterns. In robust networks, we would expect for visually similar classes to be represented similarly. We consider the problem of when semantically similar…

Computer Vision and Pattern Recognition · Computer Science 2023-06-05 Natalie Abreu, Nathan Vaska, Victoria Helus

Data scarcity is a problem that occurs in languages and tasks where we do not have large amounts of labeled data but want to use state-of-the-art models. Such models are often deep learning models that require a significant amount of data…

Computation and Language · Computer Science 2023-02-23 Domagoj Pluščec, Jan Šnajder