Related papers: Annotating Predicate-Argument Structure for a Para…

Predicate-Argument Structure Divergences in Chinese and English Parallel Sentences and their Impact on Language Transfer

Cross-lingual Natural Language Processing (NLP) has gained significant traction in recent years, offering practical solutions in low-resource settings by transferring linguistic knowledge from resource-rich to low-resource languages. This…

Computation and Language · Computer Science 2025-11-14 Rocco Tripodi, Xiaoyu Liu

A Model for Fine-Grained Alignment of Multilingual Texts

While alignment of texts on the sentential level is often seen as being too coarse, and word alignment as being too fine-grained, bi- or multilingual texts which are aligned on a level in-between are a useful resource for many purposes.…

Computation and Language · Computer Science 2007-05-23 Lea Cyrus, Hendrik Feddes

The Parallel Meaning Bank: A Framework for Semantically Annotating Multiple Languages

This paper gives a general description of the ideas behind the Parallel Meaning Bank, a framework with the aim to provide an easy way to annotate compositional semantics for texts written in languages other than English. The annotation…

Computation and Language · Computer Science 2021-01-01 Lasha Abzianidze, Rik van Noord, Chunliu Wang, Johan Bos

The Parallel Meaning Bank: Towards a Multilingual Corpus of Translations Annotated with Compositional Meaning Representations

The Parallel Meaning Bank is a corpus of translations annotated with shared, formal meaning representations comprising over 11 million words divided over four languages (English, German, Italian, and Dutch). Our approach is based on…

Computation and Language · Computer Science 2017-02-15 Lasha Abzianidze, Johannes Bjerva, Kilian Evang, Hessel Haagsma, Rik van Noord, Pierre Ludmann, Duc-Duy Nguyen, Johan Bos

Cross-lingual Argumentation Mining: Machine Translation (and a bit of Projection) is All You Need!

Argumentation mining (AM) requires the identification of complex discourse structures and has lately been applied with success monolingually. In this work, we show that the existing resources are, however, not adequate for assessing…

Computation and Language · Computer Science 2018-07-25 Steffen Eger, Johannes Daxenberger, Christian Stab, Iryna Gurevych

On aligning trees

The increasing availability of corpora annotated for linguistic structure prompts the question: if we have the same texts, annotated for phrase structure under two different schemes, to what extent do the annotations agree on structuring…

cmp-lg · Computer Science 2008-02-03 Jo Calder

One model, two languages: training bilingual parsers with harmonized treebanks

We introduce an approach to train lexicalized parsers using bilingual corpora obtained by merging harmonized treebanks of different languages, producing parsers that can analyze sentences in either of the learned languages, or even…

Computation and Language · Computer Science 2016-05-20 David Vilares, Carlos Gómez-Rodríguez, Miguel A. Alonso

CGELBank: CGEL as a Framework for English Syntax Annotation

We introduce the syntactic formalism of the Cambridge Grammar of the English Language (CGEL) to the world of treebanking through the CGELBank project. We discuss some issues in linguistic analysis that arose in adapting the…

Computation and Language · Computer Science 2022-10-04 Brett Reynolds, Aryaman Arora, Nathan Schneider

Towards an Argument Mining Pipeline Transforming Texts to Argument Graphs

This paper targets the automated extraction of components of argumentative information and their relations from natural language text. Moreover, we address a current lack of systems to provide complete argumentative structure from arbitrary…

Computation and Language · Computer Science 2020-09-29 Mirko Lenz, Premtim Sahitaj, Sean Kallenberg, Christopher Coors, Lorik Dumani, Ralf Schenkel, Ralph Bergmann

Cross-lingual RST Discourse Parsing

Discourse parsing is an integral part of understanding information flow and argumentative structure in documents. Most previous research has focused on inducing and evaluating models from the English RST Discourse Treebank. However,…

Computation and Language · Computer Science 2017-01-12 Chloé Braud, Maximin Coavoux, Anders Søgaard

Cross-Lingual Dependency Parsing Using Code-Mixed TreeBank

Treebank translation is a promising method for cross-lingual transfer of syntactic dependency knowledge. The basic idea is to map dependency arcs from a source treebank to its target translation according to word alignments. This method,…

Computation and Language · Computer Science 2019-09-06 Zhang Meishan, Zhang Yue, Fu Guohong

Building a resource for studying translation shifts

This paper describes an interdisciplinary approach which brings together the fields of corpus linguistics and translation studies. It presents ongoing work on the creation of a corpus resource in which translation shifts are explicitly…

Computation and Language · Computer Science 2007-05-23 Lea Cyrus

Tagging Grammatical Functions

This paper addresses issues in automated treebank construction. We show how standard part-of-speech tagging techniques extend to the more general problem of structural annotation, especially for determining grammatical functions and…

cmp-lg · Computer Science 2008-02-03 Thorsten Brants, Wojciech Skut, Brigitte Krenn

Semantically Constrained Multilayer Annotation: The Case of Coreference

We propose a coreference annotation scheme as a layer on top of the Universal Conceptual Cognitive Annotation foundational layer, treating units in predicate-argument structure as a basis for entity and event mentions. We argue that this…

Computation and Language · Computer Science 2019-06-12 Jakob Prange, Nathan Schneider, Omri Abend

Exploiting Syntactic Structure for Better Language Modeling: A Syntactic Distance Approach

It is commonly believed that knowledge of syntactic structure should improve language modeling. However, effectively and computationally efficiently incorporating syntactic structure into neural language models has been a challenging topic.…

Computation and Language · Computer Science 2020-05-13 Wenyu Du, Zhouhan Lin, Yikang Shen, Timothy J. O'Donnell, Yoshua Bengio, Yue Zhang

Exploiting Multi-typed Treebanks for Parsing with Deep Multi-task Learning

Various treebanks have been released for dependency parsing. Although treebanks may belong to different languages or have different annotation schemes, they contain syntactic knowledge that has the potential to benefit each other. This paper…

Computation and Language · Computer Science 2016-06-06 Jiang Guo, Wanxiang Che, Haifeng Wang, Ting Liu

A Parallel Corpus of Translationese

We describe a set of bilingual English–French and English–German parallel corpora in which the direction of translation is accurately and reliably annotated. The corpora are diverse, consisting of parliamentary proceedings, literary…

Computation and Language · Computer Science 2016-03-08 Ella Rabinovich, Shuly Wintner, Ofek Luis Lewinsohn

Cross-lingual Annotation Projection for Semantic Roles

This article considers the task of automatically inducing role-semantic annotations in the FrameNet paradigm for new languages. We propose a general framework that is based on annotation projection, phrased as a graph optimization problem.…

Computation and Language · Computer Science 2014-01-23 Sebastian Pado, Mirella Lapata

Language Models for German Text Simplification: Overcoming Parallel Data Scarcity through Style-specific Pre-training

Automatic text simplification systems help to reduce textual information barriers on the internet. However, for languages other than English, only a small amount of parallel data exists to train these systems. We propose a two-step approach to overcome…

Computation and Language · Computer Science 2023-11-08 Miriam Anschütz, Joshua Oehms, Thomas Wimmer, Bartłomiej Jezierski, Georg Groh

An Empirical Study on Measuring the Similarity of Sentential Arguments with Language Model Domain Adaptation

Measuring the similarity between two different sentential arguments is an important task in argument mining. However, one of the challenges in this field is that the dataset must be annotated using expertise in a variety of topics, making…

Computation and Language · Computer Science 2021-02-22 ChaeHun Park, Sangwoo Seo