Related papers: Annotation graphs as a framework for multidimensio…

ATLAS: A flexible and extensible architecture for linguistic annotation

We describe a formal model for annotating linguistic artifacts, from which we derive an application programming interface (API) to a suite of tools for manipulating these annotations. The abstract logical model provides for a range of…

Computation and Language · Computer Science 2007-05-23 Steven Bird , David Day , John Garofolo , John Henderson , Christophe Laprun , Mark Liberman

Annotation Graphs and Servers and Multi-Modal Resources: Infrastructure for Interdisciplinary Education, Research and Development

Annotation graphs and annotation servers offer infrastructure to support the analysis of human language resources in the form of time-series data such as text, audio and video. This paper outlines areas of common need among empirical…

Computation and Language · Computer Science 2007-05-23 Christopher Cieri , Steven Bird

Towards a query language for annotation graphs

The multidimensional, heterogeneous, and temporal nature of speech databases raises interesting challenges for representation and query. Recently, annotation graphs have been proposed as a general-purpose representational framework for…

Computation and Language · Computer Science 2007-05-23 Steven Bird , Peter Buneman , Wang-Chiew Tan

A Formal Framework for Linguistic Annotation

`Linguistic annotation' covers any descriptive or analytic notations applied to raw language data. The basic data may be in the form of time functions -- audio, video and/or physiological recordings -- or it may be textual. The added…

Computation and Language · Computer Science 2007-05-23 Steven Bird , Mark Liberman

A Concise Query Language with Search and Transform Operations for Corpora with Multiple Levels of Annotation

The usefulness of annotated corpora is greatly increased if there is an associated tool that can allow various kinds of operations to be performed in a simple way. Different kinds of annotation frameworks and many query languages for them…

Computation and Language · Computer Science 2011-08-10 Anil Kumar Singh

A Formal Framework for Linguistic Annotation (revised version)

`Linguistic annotation' covers any descriptive or analytic notations applied to raw language data. The basic data may be in the form of time functions - audio, video and/or physiological recordings - or it may be textual. The added…

Computation and Language · Computer Science 2007-05-23 Steven Bird , Mark Liberman

An Integrated Framework for Treebanks and Multilayer Annotations

Treebank formats and associated software tools are proliferating rapidly, with little consideration for interoperability. We survey a wide variety of treebank structures and operations, and show how they can be mapped onto the annotation…

Computation and Language · Computer Science 2007-05-23 Scott Cotton , Steven Bird

Automatic Alignment of Discourse Relations of Different Discourse Annotation Frameworks

Existing discourse corpora are annotated based on different frameworks, which show significant dissimilarities in definitions of arguments and relations and structural constraints. Despite surface differences, these frameworks share basic…

Computation and Language · Computer Science 2024-04-09 Yingxue Fu

Annotated Hypergraphs: Models and Applications

Hypergraphs offer a natural modeling language for studying polyadic interactions between sets of entities. Many polyadic interactions are asymmetric, with nodes playing distinctive roles. In an academic collaboration network, for example,…

Physics and Society · Physics 2019-11-05 Philip Chodrow , Andrew Mellor

TableTrans, MultiTrans, InterTrans and TreeTrans: Diverse Tools Built on the Annotation Graph Toolkit

Four diverse tools built on the Annotation Graph Toolkit are described. Each tool associates linguistic codes and structures with time-series data. All are based on the same software library and tool architecture. TableTrans is for…

Computation and Language · Computer Science 2007-05-23 Steven Bird , Kazuaki Maeda , Xiaoyi Ma , Haejoong Lee , Beth Randall , Salim Zayat

Text Annotation Graphs: Annotating Complex Natural Language Phenomena

This paper introduces a new web-based software tool for annotating text, Text Annotation Graphs, or TAG. It provides functionality for representing complex relationships between words and word phrases that are not available in other…

Computation and Language · Computer Science 2018-03-02 Angus G. Forbes , Kristine Lee , Gus Hahn-Powell , Marco A. Valenzuela-Escárcega , Mihai Surdeanu

Labelled network subgraphs reveal stylistic subtleties in written texts

The vast amount of data and increase of computational capacity have allowed the analysis of texts from several perspectives, including the representation of texts as complex networks. Nodes of the network represent the words, and edges…

Computation and Language · Computer Science 2017-11-09 Vanessa Q. Marinho , Graeme Hirst , Diego R. Amancio

ChartMark: A Structured Grammar for Chart Annotation

Chart annotations enhance visualization accessibility but suffer from fragmented, non-standardized representations that limit cross-platform reuse. We propose ChartMark, a structured grammar that separates annotation semantics from…

Computation and Language · Computer Science 2025-07-30 Yiyu Chen , Yifan Wu , Shuyu Shen , Yupeng Xie , Leixian Shen , Hui Xiong , Yuyu Luo

Differentiable Allophone Graphs for Language-Universal Speech Recognition

Building language-universal speech recognition systems entails producing phonological units of spoken sound that can be shared across languages. While speech annotations at the language-specific phoneme or surface levels are readily…

Computation and Language · Computer Science 2021-07-27 Brian Yan , Siddharth Dalmia , David R. Mortensen , Florian Metze , Shinji Watanabe

From Variance to Invariance: Qualitative Content Analysis for Narrative Graph Annotation

Narratives in news discourse play a critical role in shaping public understanding of economic events, such as inflation. Annotating and evaluating these narratives in a structured manner remains a key challenge for Natural Language…

Computation and Language · Computer Science 2026-03-05 Junbo Huang , Max Weinig , Ulrich Fritsche , Ricardo Usbeck

Towards a Knowledge Graph based Speech Interface

Applications which use human speech as an input require a speech interface with high recognition accuracy. The words or phrases in the recognised text are annotated with a machine-understandable meaning and linked to knowledge graphs for…

Human-Computer Interaction · Computer Science 2017-05-26 Ashwini Jaya Kumar , Sören Auer , Christoph Schmidt , Joachim köhler

Semiotically-grounded distant viewing of diagrams: insights from two multimodal corpora

In this article, we bring together theories of multimodal communication and computational methods to study how primary school science diagrams combine multiple expressive resources. We position our work within the field of digital…

Computation and Language · Computer Science 2021-12-23 Tuomo Hiippala , John A. Bateman

How compatible are our discourse annotations? Insights from mapping RST-DT and PDTB annotations

Discourse-annotated corpora are an important resource for the community, but they are often annotated according to different frameworks. This makes comparison of the annotations difficult, thereby also preventing researchers from searching…

Computation and Language · Computer Science 2018-03-16 Vera Demberg , Fatemeh Torabi Asr , Merel Scholman

The Immersion of Directed Multi-graphs in Embedding Fields. Generalisations

The purpose of this paper is to outline a generalised model for representing hybrids of relational-categorical, symbolic, perceptual-sensory and perceptual-latent data, so as to embody, in the same architectural data layer, representations…

Machine Learning · Computer Science 2020-04-29 Bogdan Bocse , Ioan Radu Jinga

Annotation of Scientific Summaries for Information Retrieval

We present a methodology combining surface NLP and Machine Learning techniques for ranking asbtracts and generating summaries based on annotated corpora. The corpora were annotated with meta-semantic tags indicating the category of…

Information Retrieval · Computer Science 2011-10-27 Fidelia Ibekwe-Sanjuan , Fernandez Silvia , Sanjuan Eric , Charton Eric