Related papers: Annotative Indexing

Towards a query language for annotation graphs

The multidimensional, heterogeneous, and temporal nature of speech databases raises interesting challenges for representation and query. Recently, annotation graphs have been proposed as a general-purpose representational framework for…

Computation and Language · Computer Science 2007-05-23 Steven Bird , Peter Buneman , Wang-Chiew Tan

A Formal Framework for Linguistic Annotation

`Linguistic annotation' covers any descriptive or analytic notations applied to raw language data. The basic data may be in the form of time functions -- audio, video and/or physiological recordings -- or it may be textual. The added…

Computation and Language · Computer Science 2007-05-23 Steven Bird , Mark Liberman

A Formal Framework for Linguistic Annotation (revised version)

`Linguistic annotation' covers any descriptive or analytic notations applied to raw language data. The basic data may be in the form of time functions - audio, video and/or physiological recordings - or it may be textual. The added…

Computation and Language · Computer Science 2007-05-23 Steven Bird , Mark Liberman

ATLAS: A flexible and extensible architecture for linguistic annotation

We describe a formal model for annotating linguistic artifacts, from which we derive an application programming interface (API) to a suite of tools for manipulating these annotations. The abstract logical model provides for a range of…

Computation and Language · Computer Science 2007-05-23 Steven Bird , David Day , John Garofolo , John Henderson , Christophe Laprun , Mark Liberman

A Unified Active Learning Framework for Annotating Graph Data with Application to Software Source Code Performance Prediction

Most machine learning and data analytics applications, including performance engineering in software systems, require a large number of annotations and labelled data, which might not be available in advance. Acquiring annotations often…

Software Engineering · Computer Science 2023-09-21 Peter Samoaa , Linus Aronsson , Antonio Longa , Philipp Leitner , Morteza Haghir Chehreghani

The Potential of Learned Index Structures for Index Compression

Inverted indexes are vital in providing fast key-word-based search. For every term in the document collection, a list of identifiers of documents in which the term appears is stored, along with auxiliary information such as term frequency,…

Information Retrieval · Computer Science 2019-01-30 Harrie Oosterhuis , J. Shane Culpepper , Maarten de Rijke

Annotation based automatic action processing

With a strong motivational background in search engine optimization the amount of structured data on the web is growing rapidly. The main search engine providers are promising great increase in visibility through annotation of the web…

Information Retrieval · Computer Science 2018-02-02 Elias Kärle , Dieter Fensel

A Common XML-based Framework for Syntactic Annotations

It is widely recognized that the proliferation of annotation schemes runs counter to the need to re-use language resources, and that standards for linguistic annotation are becoming increasingly mandatory. To answer this need, we have…

Computation and Language · Computer Science 2009-09-16 Nancy Ide , Laurent Romary , Tomaz Erjavec

A Concise Query Language with Search and Transform Operations for Corpora with Multiple Levels of Annotation

The usefulness of annotated corpora is greatly increased if there is an associated tool that can allow various kinds of operations to be performed in a simple way. Different kinds of annotation frameworks and many query languages for them…

Computation and Language · Computer Science 2011-08-10 Anil Kumar Singh

Method for the semantic indexing of concept hierarchies, uniform representation, use of relational database systems and generic and case-based reasoning

This paper presents a method for semantic indexing and describes its application in the field of knowledge representation. Starting point of the semantic indexing is the knowledge represented by concept hierarchies. The goal is to assign…

Artificial Intelligence · Computer Science 2025-01-22 Uwe Petersohn , Sandra Zimmer , Jens Lehmann

Using the Annotated Bibliography as a Resource for Indicative Summarization

We report on a language resource consisting of 2000 annotated bibliography entries, which is being analyzed as part of our research on indicative document summarization. We show how annotated bibliographies cover certain aspects of…

Computation and Language · Computer Science 2007-05-23 Min-Yen Kan , Judith L. Klavans , Kathleen R. McKeown

LOST: A flexible framework for semi-automatic image annotation

State-of-the-art computer vision approaches rely on huge amounts of annotated data. The collection of such data is a time consuming process since it is mainly performed by humans. The literature shows that semi-automatic annotation…

Computer Vision and Pattern Recognition · Computer Science 2019-11-05 Jonas Jäger , Gereon Reus , Joachim Denzler , Viviane Wolff , Klaus Fricke-Neuderth

Retrieve-and-Verify: A Table Context Selection Framework for Accurate Column Annotations

Tables are a prevalent format for structured data, yet their metadata, such as semantic types and column relationships, is often incomplete or ambiguous. Column annotation tasks, including Column Type Annotation (CTA) and Column Property…

Databases · Computer Science 2025-08-26 Zhihao Ding , Yongkang Sun , Jieming Shi

A Machine Learning Based Analytical Framework for Semantic Annotation Requirements

The Semantic Web is an extension of the current web in which information is given well-defined meaning. The perspective of Semantic Web is to promote the quality and intelligence of the current web by changing its contents into machine…

Artificial Intelligence · Computer Science 2012-08-06 Hamed Hassanzadeh , MohammadReza Keyvanpour

Annotation Graphs and Servers and Multi-Modal Resources: Infrastructure for Interdisciplinary Education, Research and Development

Annotation graphs and annotation servers offer infrastructure to support the analysis of human language resources in the form of time-series data such as text, audio and video. This paper outlines areas of common need among empirical…

Computation and Language · Computer Science 2007-05-23 Christopher Cieri , Steven Bird

Annotation Error Detection: Analyzing the Past and Present for a More Coherent Future

Annotated data is an essential ingredient in natural language processing for training and evaluating machine learning models. It is therefore very desirable for the annotations to be of high quality. Recent work, however, has shown that…

Computation and Language · Computer Science 2022-09-27 Jan-Christoph Klie , Bonnie Webber , Iryna Gurevych

Models and Tools for Collaborative Annotation

The Annotation Graph Toolkit (AGTK) is a collection of software which facilitates development of linguistic annotation tools. AGTK provides a database interface which allows applications to use a database server for persistent storage. This…

Computation and Language · Computer Science 2007-05-23 Xiaoyi Ma , Haejoong Lee , Steven Bird , Kazuaki Maeda

Querying Databases of Annotated Speech

Annotated speech corpora are databases consisting of signal data along with time-aligned symbolic `transcriptions'. Such databases are typically multidimensional, heterogeneous and dynamic. These properties present a number of tough…

Computation and Language · Computer Science 2007-05-23 Steve Cassidy , Steven Bird

Specifying Genericity through Inclusiveness and Abstractness Continuous Scales

This paper introduces a novel annotation framework for the fine-grained modeling of Noun Phrases' (NPs) genericity in natural language. The framework is designed to be simple and intuitive, making it accessible to non-expert annotators and…

Computation and Language · Computer Science 2024-04-02 Claudia Collacciani , Andrea Amelio Ravelli , Marianna Marcella Bolognesi

Partial Adaptive Indexing for Approximate Query Answering

In data exploration, users need to analyze large data files quickly, aiming to minimize data-to-analysis time. While recent adaptive indexing approaches address this need, they are cases where demonstrate poor performance. Particularly,…

Databases · Computer Science 2024-07-29 Stavros Maroulis , Nikos Bikakis , Vassilis Stamatopoulos , George Papastefanatos