Interactive Semantic Featuring for Text Classification

Camille Jandot; Patrice Simard; Max Chickering; David Grangier; Jina Suh

Computation and Language; Machine Learning. 2016-06-27, v1

Abstract

In text classification, dictionaries can be used to define human-comprehensible features. We propose an improvement to dictionary features called smoothed dictionary features. These features recognize document contexts instead of n-grams. We describe a principled methodology to solicit dictionary features from a teacher, and present results showing that models built using these human-comprehensible features are competitive with models trained with Bag of Words features.
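For readers unfamiliar with the baseline, a plain dictionary feature can be sketched as follows. This is only an illustration of the n-gram-matching feature that the abstract says is being improved upon, not of the smoothed variant, whose definition is not given here. The function name `dictionary_feature`, the `months` dictionary, and the regex tokenizer are all assumptions made for the example.

```python
import re


def dictionary_feature(document, dictionary):
    """Count the tokens of `document` that appear in `dictionary`.

    Hypothetical baseline: the feature fires on exact unigram matches only.
    """
    # Assumed tokenization: lowercase runs of letters, digits, and apostrophes.
    tokens = re.findall(r"[a-z0-9']+", document.lower())
    return sum(1 for token in tokens if token in dictionary)


# Hypothetical dictionary a teacher might supply for a "mentions a month" concept.
months = {"january", "february", "march"}
doc = "The meeting moved from January to March."
count = dictionary_feature(doc, months)
```

Because such a feature depends only on whether listed words occur, it stays easy for a human to inspect, which is the property the abstract calls human-comprehensible.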
