Related papers: A Formal Framework for Linguistic Annotation (revi…

A Formal Framework for Linguistic Annotation

`Linguistic annotation' covers any descriptive or analytic notations applied to raw language data. The basic data may be in the form of time functions -- audio, video and/or physiological recordings -- or it may be textual. The added…

Computation and Language · Computer Science 2007-05-23 Steven Bird , Mark Liberman

ATLAS: A flexible and extensible architecture for linguistic annotation

We describe a formal model for annotating linguistic artifacts, from which we derive an application programming interface (API) to a suite of tools for manipulating these annotations. The abstract logical model provides for a range of…

Computation and Language · Computer Science 2007-05-23 Steven Bird , David Day , John Garofolo , John Henderson , Christophe Laprun , Mark Liberman

Observations on Annotations

The annotation of textual information is a fundamental activity in Linguistics and Computational Linguistics. This article presents various observations on annotations. It approaches the topic from several angles including Hypertext,…

Computation and Language · Computer Science 2020-04-23 Georg Rehm

A Common XML-based Framework for Syntactic Annotations

It is widely recognized that the proliferation of annotation schemes runs counter to the need to re-use language resources, and that standards for linguistic annotation are becoming increasingly mandatory. To answer this need, we have…

Computation and Language · Computer Science 2009-09-16 Nancy Ide , Laurent Romary , Tomaz Erjavec

International Standard for a Linguistic Annotation Framework

This paper describes the Linguistic Annotation Framework under development within ISO TC37 SC4 WG1. The Linguistic Annotation Framework is intended to serve as a basis for harmonizing existing language resources as well as developing new…

Computation and Language · Computer Science 2007-07-24 Laurent Romary , Nancy Ide

Overview of Annotation Creation: Processes & Tools

Creating linguistic annotations requires more than just a reliable annotation scheme. Annotation can be a complex endeavour potentially involving many people, stages, and tools. This chapter outlines the process of creating end-to-end…

Computation and Language · Computer Science 2016-02-19 Mark A. Finlayson , Tomaž Erjavec

Annotation graphs as a framework for multidimensional linguistic data analysis

In recent work we have presented a formal framework for linguistic annotation based on labeled acyclic digraphs. These `annotation graphs' offer a simple yet powerful method for representing complex annotation structures incorporating…

Computation and Language · Computer Science 2007-05-23 Steven Bird , Mark Liberman

UCCA's Foundational Layer: Annotation Guidelines v2.1

This is the annotation manual for Universal Conceptual Cognitive Annotation (UCCA; Abend and Rappoport, 2013), specifically the Foundational Layer. UCCA is a graph-based semantic annotation scheme based on typological linguistic principles.…

Computation and Language · Computer Science 2021-01-01 Omri Abend , Nathan Schneider , Dotan Dvir , Jakob Prange , Ari Rappoport

A Machine Learning Based Analytical Framework for Semantic Annotation Requirements

The Semantic Web is an extension of the current web in which information is given well-defined meaning. The perspective of Semantic Web is to promote the quality and intelligence of the current web by changing its contents into machine…

Artificial Intelligence · Computer Science 2012-08-06 Hamed Hassanzadeh , MohammadReza Keyvanpour

Annotation Graphs and Servers and Multi-Modal Resources: Infrastructure for Interdisciplinary Education, Research and Development

Annotation graphs and annotation servers offer infrastructure to support the analysis of human language resources in the form of time-series data such as text, audio and video. This paper outlines areas of common need among empirical…

Computation and Language · Computer Science 2007-05-23 Christopher Cieri , Steven Bird

Ontology-Based Annotation of Multimedia Language Data for the Semantic Web

There is an increasing interest and effort in preserving and documenting endangered languages. Language data are valuable only when they are well-cataloged, indexed and searchable. Many language data, particularly those of lesser-spoken…

Digital Libraries · Computer Science 2009-02-19 Artem Chebotko , Shiyong Lu , Farshad Fotouhi , Anthony Aristar

OntoELAN: An Ontology-based Linguistic Multimedia Annotator

Despite its scientific, political, and practical value, comprehensive information about human languages, in all their variety and complexity, is not readily obtainable and searchable. One reason is that many language data are collected as…

Digital Libraries · Computer Science 2016-11-18 Artem Chebotko , Yu Deng , Shiyong Lu , Farshad Fotouhi , Anthony Aristar , Hennie Brugman , Alexander Klassmann , Han Sloetjes , Albert Russel , Peter Wittenburg

Specifying Genericity through Inclusiveness and Abstractness Continuous Scales

This paper introduces a novel annotation framework for the fine-grained modeling of Noun Phrases' (NPs) genericity in natural language. The framework is designed to be simple and intuitive, making it accessible to non-expert annotators and…

Computation and Language · Computer Science 2024-04-02 Claudia Collacciani , Andrea Amelio Ravelli , Marianna Marcella Bolognesi

Mitigating Biases to Embrace Diversity: A Comprehensive Annotation Benchmark for Toxic Language

This study introduces a prescriptive annotation benchmark grounded in humanities research to ensure consistent, unbiased labeling of offensive language, particularly for casual and non-mainstream language uses. We contribute two newly…

Computation and Language · Computer Science 2024-10-18 Xinmeng Hou

Towards a query language for annotation graphs

The multidimensional, heterogeneous, and temporal nature of speech databases raises interesting challenges for representation and query. Recently, annotation graphs have been proposed as a general-purpose representational framework for…

Computation and Language · Computer Science 2007-05-23 Steven Bird , Peter Buneman , Wang-Chiew Tan

An Annotation Scheme for Free Word Order Languages

We describe an annotation scheme and a tool developed for creating linguistically annotated corpora for non-configurational languages. Since the requirements for such a formalism differ from those posited for configurational languages,…

cmp-lg · Computer Science 2008-02-03 Wojciech Skut , Brigitte Krenn , Thorsten Brants , Hans Uszkoreit

Large Language Models for Data Annotation and Synthesis: A Survey

Data annotation and synthesis generally refers to the labeling or generating of raw data with relevant information, which could be used for improving the efficacy of machine learning models. The process, however, is labor-intensive and…

Computation and Language · Computer Science 2024-12-04 Zhen Tan , Dawei Li , Song Wang , Alimohammad Beigi , Bohan Jiang , Amrita Bhattacharjee , Mansooreh Karami , Jundong Li , Lu Cheng , Huan Liu

Understanding Archives: Towards New Research Interfaces Relying on the Semantic Annotation of Documents

The digitisation campaigns carried out by libraries and archives in recent years have facilitated access to documents in their collections. However, exploring and exploiting these documents remain difficult tasks due to the sheer quantity…

Digital Libraries · Computer Science 2024-03-29 Nicolas Gutehrlé , Iana Atanassova

Standards for Language Resources

The goal of this paper is two-fold: to present an abstract data model for linguistic annotations and its implementation using XML, RDF and related standards; and to outline the work of a newly formed committee of the International Standards…

Computation and Language · Computer Science 2009-11-11 Nancy Ide , Laurent Romary

Annotative Indexing

This paper introduces annotative indexing, a novel framework that unifies and generalizes traditional inverted indexes, column stores, object stores, and graph databases. As a result, annotative indexing can provide the underlying indexing…

Information Retrieval · Computer Science 2025-06-04 Charles L. A. Clarke