Related papers: Semantic Labeling Using a Deep Contextualized Lang…

Learning Semantic Annotations for Tabular Data

The usefulness of tabular data such as web tables critically depends on understanding their semantics. This study focuses on column type prediction for tables without any meta data. Unlike traditional lexical matching-based methods, we…

Databases · Computer Science 2019-06-04 Jiaoyan Chen, Ernesto Jimenez-Ruiz, Ian Horrocks, Charles Sutton

Analysing Lexical Semantic Change with Contextualised Word Representations

This paper presents the first unsupervised approach to lexical semantic change that makes use of contextualised word representations. We propose a novel method that exploits the BERT neural language model to obtain representations of word…

Computation and Language · Computer Science 2020-10-21 Mario Giulianelli, Marco Del Tredici, Raquel Fernández

Table Search Using a Deep Contextualized Language Model

Pretrained contextualized language models such as BERT have achieved impressive results on various natural language processing benchmarks. Benefiting from multiple pretraining tasks and large scale training corpora, pretrained models can…

Information Retrieval · Computer Science 2020-05-28 Zhiyu Chen, Mohamed Trabelsi, Jeff Heflin, Yinan Xu, Brian D. Davison

Pretrained Language Models for Sequential Sentence Classification

As a step toward better document-level understanding, we explore classification of a sequence of sentences into their corresponding categories, a task that requires understanding sentences in context of the document. Recent successful…

Computation and Language · Computer Science 2021-03-24 Arman Cohan, Iz Beltagy, Daniel King, Bhavana Dalvi, Daniel S. Weld

Semantics-aware BERT for Language Understanding

The latest work on language representations carefully integrates contextualized features into language model training, which enables a series of success especially in various machine reading comprehension and natural language inference…

Computation and Language · Computer Science 2020-02-05 Zhuosheng Zhang, Yuwei Wu, Hai Zhao, Zuchao Li, Shuailiang Zhang, Xi Zhou, Xiang Zhou

Simple BERT Models for Relation Extraction and Semantic Role Labeling

We present simple BERT-based models for relation extraction and semantic role labeling. In recent years, state-of-the-art performance has been achieved using neural models by incorporating lexical and syntactic features such as…

Computation and Language · Computer Science 2019-04-11 Peng Shi, Jimmy Lin

Sequence-to-Set Semantic Tagging: End-to-End Multi-label Prediction using Neural Attention for Complex Query Reformulation and Automated Text Categorization

Novel contexts may often arise in complex querying scenarios such as in evidence-based medicine (EBM) involving biomedical literature, that may not explicitly refer to entities or canonical concept forms occurring in any fact- or rule-based…

Computation and Language · Computer Science 2019-11-12 Manirupa Das, Juanxi Li, Eric Fosler-Lussier, Simon Lin, Soheil Moosavinasab, Steve Rust, Yungui Huang, Rajiv Ramnath

R²-Net: Relation of Relation Learning Network for Sentence Semantic Matching

Sentence semantic matching is one of the fundamental tasks in natural language processing, which requires an agent to determine the semantic relation among input sentences. Recently, deep neural networks have achieved impressive performance…

Computation and Language · Computer Science 2020-12-17 Kun Zhang, Le Wu, Guangyi Lv, Meng Wang, Enhong Chen, Shulan Ruan

Evaluating approaches for supervised semantic labeling

Relational data sources are still one of the most popular ways to store enterprise or Web data, however, the issue with relational schema is the lack of a well-defined semantic description. A common ontology provides a way to represent the…

Machine Learning · Computer Science 2018-01-31 Natalia Ruemmele, Yuriy Tyshetskiy, Alex Collins

Sato: Contextual Semantic Type Detection in Tables

Detecting the semantic types of data columns in relational tables is important for various data preparation and information retrieval tasks such as data cleaning, schema matching, data discovery, and semantic search. However, existing…

Databases · Computer Science 2020-06-04 Dan Zhang, Yoshihiko Suhara, Jinfeng Li, Madelon Hulsebos, Çağatay Demiralp, Wang-Chiew Tan

Neural Semantic Role Labeling with Dependency Path Embeddings

This paper introduces a novel model for semantic role labeling that makes use of neural sequence modeling techniques. Our approach is motivated by the observation that complex syntactic structures and related phenomena, such as nested…

Computation and Language · Computer Science 2016-07-19 Michael Roth, Mirella Lapata

Exploring Category Structure with Contextual Language Models and Lexical Semantic Networks

Recent work on predicting category structure with distributional models, using either static word embeddings (Heyman and Heyman, 2019) or contextualized language models (CLMs) (Misra et al., 2021), report low correlations with human…

Machine Learning · Computer Science 2023-02-15 Joseph Renner, Pascal Denis, Rémi Gilleron, Angèle Brunellière

Comprehensive and Efficient Data Labeling via Adaptive Model Scheduling

Labeling data (e.g., labeling the people, objects, actions and scene in images) comprehensively and efficiently is a widely needed but challenging task. Numerous models were proposed to label various data and many approaches were designed…

Machine Learning · Computer Science 2020-02-14 Mu Yuan, Lan Zhang, Xiang-Yang Li, Hui Xiong

Deeper Text Understanding for IR with Contextual Neural Language Modeling

Neural networks provide new possibilities to automatically learn complex language patterns and query-document relations. Neural IR models have achieved promising results in learning query-document relevance patterns, but few explorations…

Information Retrieval · Computer Science 2019-05-23 Zhuyun Dai, Jamie Callan

Character-level Representations Improve DRS-based Semantic Parsing Even in the Age of BERT

We combine character-level and contextual language model representations to improve performance on Discourse Representation Structure parsing. Character representations can easily be added in a sequence-to-sequence model in either one…

Computation and Language · Computer Science 2020-11-10 Rik van Noord, Antonio Toral, Johan Bos

Contextual Graph Embeddings: Accounting for Data Characteristics in Heterogeneous Data Integration

As organizations continue to access diverse datasets, the demand for effective data integration has increased. Key tasks in this process, such as schema matching and entity resolution, are essential but often require significant effort.…

Databases · Computer Science 2025-11-13 Yuka Haruki, Shigeru Ishikura, Kazuya Demachi, Teruaki Hayashi

Domain Lexical Knowledge-based Word Embedding Learning for Text Classification under Small Data

Pre-trained language models such as BERT have been proved to be powerful in many natural language processing tasks. But in some text classification applications such as emotion recognition and sentiment analysis, BERT may not lead to…

Computation and Language · Computer Science 2025-06-03 Zixiao Zhu, Kezhi Mao

Can Deep Neural Networks Predict Data Correlations from Column Names?

Recent publications suggest using natural language analysis on database schema elements to guide tuning and profiling efforts. The underlying hypothesis is that state-of-the-art language processing methods, so-called language models, are…

Databases · Computer Science 2023-09-12 Immanuel Trummer

Semantic Annotation for Tabular Data

Detecting semantic concept of columns in tabular data is of particular interest to many applications ranging from data integration, cleaning, search to feature engineering and model building in machine learning. Recently, several works have…

Artificial Intelligence · Computer Science 2020-12-17 Udayan Khurana, Sainyam Galhotra

Variational Sequential Labelers for Semi-Supervised Learning

We introduce a family of multitask variational methods for semi-supervised sequence labeling. Our model family consists of a latent-variable generative model and a discriminative labeler. The generative models use latent variables to define…

Computation and Language · Computer Science 2019-06-25 Mingda Chen, Qingming Tang, Karen Livescu, Kevin Gimpel