Related papers: A framework for (under)specifying dependency synta…

A hybrid formalism to parse Sign Languages

Sign Language (SL) linguistics is dependent on the expensive task of annotating. Some automation is already available for low-level information (e.g. body part tracking) and the lexical level has shown significant progress. The syntactic…

Computation and Language · Computer Science 2014-06-26 Rémi Dubot, Christophe Collet

Simpler but More Accurate Semantic Dependency Parsing

While syntactic dependency annotations concentrate on the surface or functional structure of a sentence, semantic dependency annotations aim to capture between-word relationships that are more closely related to the meaning of a sentence,…

Computation and Language · Computer Science 2018-07-05 Timothy Dozat, Christopher D. Manning

A Common XML-based Framework for Syntactic Annotations

It is widely recognized that the proliferation of annotation schemes runs counter to the need to re-use language resources, and that standards for linguistic annotation are becoming increasingly mandatory. To answer this need, we have…

Computation and Language · Computer Science 2009-09-16 Nancy Ide, Laurent Romary, Tomaz Erjavec

An Annotation Scheme for Free Word Order Languages

We describe an annotation scheme and a tool developed for creating linguistically annotated corpora for non-configurational languages. Since the requirements for such a formalism differ from those posited for configurational languages,…

cmp-lg · Computer Science 2008-02-03 Wojciech Skut, Brigitte Krenn, Thorsten Brants, Hans Uszkoreit

The Weak Supervision Landscape

Many ways of annotating a dataset for machine learning classification tasks that go beyond the usual class labels exist in practice. These are of interest as they can simplify or facilitate the collection of annotations, while not greatly…

Machine Learning · Computer Science 2022-03-31 Rafael Poyiadzi, Daniel Bacaicoa-Barber, Jesus Cid-Sueiro, Miquel Perello-Nieto, Peter Flach, Raul Santos-Rodriguez

SLATE: A Super-Lightweight Annotation Tool for Experts

Many annotation tools have been developed, covering a wide variety of tasks and providing features like user management, pre-processing, and automatic labeling. However, all of these tools use Graphical User Interfaces, and often require…

Computation and Language · Computer Science 2020-06-05 Jonathan K. Kummerfeld

The Parallel Meaning Bank: A Framework for Semantically Annotating Multiple Languages

This paper gives a general description of the ideas behind the Parallel Meaning Bank, a framework with the aim to provide an easy way to annotate compositional semantics for texts written in languages other than English. The annotation…

Computation and Language · Computer Science 2021-01-01 Lasha Abzianidze, Rik van Noord, Chunliu Wang, Johan Bos

A Formal Framework for Linguistic Annotation

`Linguistic annotation' covers any descriptive or analytic notations applied to raw language data. The basic data may be in the form of time functions -- audio, video and/or physiological recordings -- or it may be textual. The added…

Computation and Language · Computer Science 2007-05-23 Steven Bird, Mark Liberman

Overview of Annotation Creation: Processes & Tools

Creating linguistic annotations requires more than just a reliable annotation scheme. Annotation can be a complex endeavour potentially involving many people, stages, and tools. This chapter outlines the process of creating end-to-end…

Computation and Language · Computer Science 2016-02-19 Mark A. Finlayson, Tomaž Erjavec

A Survey of Unsupervised Dependency Parsing

Syntactic dependency parsing is an important task in natural language processing. Unsupervised dependency parsing aims to learn a dependency parser from sentences that have no annotation of their correct parse trees. Despite its difficulty,…

Computation and Language · Computer Science 2020-10-06 Wenjuan Han, Yong Jiang, Hwee Tou Ng, Kewei Tu

Fill it up: Exploiting partial dependency annotations in a minimum spanning tree parser

Unsupervised models of dependency parsing typically require large amounts of clean, unlabeled data plus gold-standard part-of-speech tags. Adding indirect supervision (e.g. language universals and rules) can help, but we show that obtaining…

Computation and Language · Computer Science 2016-11-29 Liang Sun, Jason Mielens, Jason Baldridge

A Formal Framework for Linguistic Annotation (revised version)

`Linguistic annotation' covers any descriptive or analytic notations applied to raw language data. The basic data may be in the form of time functions - audio, video and/or physiological recordings - or it may be textual. The added…

Computation and Language · Computer Science 2007-05-23 Steven Bird, Mark Liberman

From LLM-anation to LLM-orchestrator: Coordinating Small Models for Data Labeling

Although the annotation paradigm based on Large Language Models (LLMs) has made significant breakthroughs in recent years, its actual deployment still has two core bottlenecks: first, the cost of calling commercial APIs in large-scale…

Computation and Language · Computer Science 2025-06-23 Yao Lu, Zhaiyuan Ji, Jiawei Du, Yu Shanqing, Qi Xuan, Tianyi Zhou

A modelling language for the effective design of Java annotations

This paper describes a new modelling language for the effective design of Java annotations. Since their inclusion in the 5th edition of Java, annotations have grown from a useful tool for the addition of meta-data to play a central role in…

Programming Languages · Computer Science 2019-10-02 Irene Córdoba, Juan de Lara

A Systematic Comparison of Syntactic Representations of Dependency Parsing

We compare the performance of a transition-based parser in regards to different annotation schemes. We propose to convert some specific syntactic constructions observed in the universal dependency treebanks into a so-called more standard…

Computation and Language · Computer Science 2025-03-11 Guillaume Wisniewski, Ophélie Lacroix

A knowledge-based approach to semi-automatic annotation of multimedia documents via user adaptation

Current approaches to the annotation process focus on annotation schemas, languages for annotation, or are very application driven. In this paper it is proposed that a more flexible architecture for annotation requires a knowledge component…

Digital Libraries · Computer Science 2007-05-23 Afzal Ballim, Nastaran Fatemi, Hatem Ghorbel, Vincenzo Pallotta

Learning a Cost-Effective Annotation Policy for Question Answering

State-of-the-art question answering (QA) relies upon large amounts of training data for which labeling is time consuming and thus expensive. For this reason, customizing QA systems is challenging. As a remedy, we propose a novel framework…

Computation and Language · Computer Science 2020-11-10 Bernhard Kratzwald, Stefan Feuerriegel, Huan Sun

Specifying Genericity through Inclusiveness and Abstractness Continuous Scales

This paper introduces a novel annotation framework for the fine-grained modeling of Noun Phrases' (NPs) genericity in natural language. The framework is designed to be simple and intuitive, making it accessible to non-expert annotators and…

Computation and Language · Computer Science 2024-04-02 Claudia Collacciani, Andrea Amelio Ravelli, Marianna Marcella Bolognesi

Multitask Learning for Cross-Lingual Transfer of Semantic Dependencies

We describe a method for developing broad-coverage semantic dependency parsers for languages for which no semantically annotated resource is available. We leverage a multitask learning framework coupled with an annotation projection method.…

Computation and Language · Computer Science 2020-05-01 Maryam Aminian, Mohammad Sadegh Rasooli, Mona Diab

LEAN-LIFE: A Label-Efficient Annotation Framework Towards Learning from Explanation

Successfully training a deep neural network demands a huge corpus of labeled data. However, each label only provides limited information to learn from and collecting the requisite number of labels involves massive human effort. In this…

Computation and Language · Computer Science 2020-04-17 Dong-Ho Lee, Rahul Khanna, Bill Yuchen Lin, Jamin Chen, Seyeon Lee, Qinyuan Ye, Elizabeth Boschee, Leonardo Neves, Xiang Ren