Related papers: Utilizing Probase in Open Directory Project-based …

Incorporating Word Embeddings into Open Directory Project based Large-scale Classification

Recently, implicit representation models, such as embedding or deep learning, have been successfully adopted to text classification task due to their outstanding performance. However, these approaches are limited to small- or moderate-scale…

Computation and Language · Computer Science 2018-04-04 Kang-Min Kim, Aliyeva Dinara, Byung-Ju Choi, SangKeun Lee

Word Tagging with Foundational Ontology Classes: Extending the WordNet-DOLCE Mapping to Verbs

Semantic annotation is fundamental to deal with large-scale lexical information, mapping the information to an enumerable set of categories over which rules and algorithms can be applied, and foundational ontology classes can be used as a…

Computation and Language · Computer Science 2018-06-21 Vivian S. Silva, André Freitas, Siegfried Handschuh

Specifying Data Bases Management Systems by Using RM-ODP Engineering Language

Distributed systems can be very large and complex. The various considerations that influence their design can result in a substantial specification, which requires a structured framework that has to be managed successfully. The purpose of…

Databases · Computer Science 2011-02-28 Jalal Laassiri, Said Elhajji, Mohamed Bouhdadi, Ghizlane Orhanou, Youssef Balouki

OntoSeer -- A Recommendation System to Improve the Quality of Ontologies

Building an ontology is not only a time-consuming process, but it is also confusing, especially for beginners and the inexperienced. Although ontology developers can take the help of domain experts in building an ontology, they are not…

Artificial Intelligence · Computer Science 2022-02-07 Pramit Bhattacharyya, Raghava Mutharaju

Beyond Word Embeddings: Learning Entity and Concept Representations from Large Scale Knowledge Bases

Text representations using neural word embeddings have proven effective in many NLP applications. Recent researches adapt the traditional word embedding models to learn vectors of multiword expressions (concepts/entities). However, these…

Computation and Language · Computer Science 2018-12-21 Walid Shalaby, Wlodek Zadrozny, Hongxia Jin

DOC: Deep Open Classification of Text Documents

Traditional supervised learning makes the closed-world assumption that the classes appeared in the test data must have appeared in training. This also applies to text learning or text classification. As learning is used increasingly in…

Computation and Language · Computer Science 2017-09-27 Lei Shu, Hu Xu, Bing Liu

Semantic Relation Classification: Task Formalisation and Refinement

The identification of semantic relations between terms within texts is a fundamental task in Natural Language Processing which can support applications requiring a lightweight semantic interpretation model. Currently, semantic relation…

Computation and Language · Computer Science 2018-06-21 Vivian S. Silva, Manuela Hürliman, Brian Davis, Siegfried Handschuh, André Freitas

Generating Categories for Sets of Entities

Category systems are central components of knowledge bases, as they provide a hierarchical grouping of semantically related concepts and entities. They are a unique and valuable resource that is utilized in a broad range of information…

Information Retrieval · Computer Science 2020-08-20 Shuo Zhang, Krisztian Balog, Jamie Callan

Towards Open-Text Semantic Parsing via Multi-Task Learning of Structured Embeddings

Open-text (or open-domain) semantic parsers are designed to interpret any statement in natural language by inferring a corresponding meaning representation (MR). Unfortunately, large scale systems cannot be easily machine-learned due to…

Artificial Intelligence · Computer Science 2011-07-20 Antoine Bordes, Xavier Glorot, Jason Weston, Yoshua Bengio

The OpenCitations Data Model

A variety of schemas and ontologies are currently used for the machine-readable description of bibliographic entities and citations. This diversity, and the reuse of the same ontology terms with different nuances, generates inconsistencies…

Digital Libraries · Computer Science 2020-08-25 Marilena Daquino, Silvio Peroni, David Shotton, Giovanni Colavizza, Behnam Ghavimi, Anne Lauscher, Philipp Mayr, Matteo Romanello, Philipp Zumstein

OSDG -- Open-Source Approach to Classify Text Data by UN Sustainable Development Goals (SDGs)

Sustainable Development Goals (SDGs) bring together the diverse development community and provide a clear set of development targets for 2030. Given a large number of actors and initiatives related to these goals, there is a need to have a…

Digital Libraries · Computer Science 2020-06-01 Lukas Pukelis, Nuria Bautista Puig, Mykola Skrynik, Vilius Stanciauskas

Joint Embeddings of Hierarchical Categories and Entities

Due to the lack of structured knowledge applied in learning distributed representation of categories, existing work cannot incorporate category hierarchies into entity information. We propose a framework that embeds entities and categories…

Computation and Language · Computer Science 2016-05-16 Yuezhang Li, Ronghuo Zheng, Tian Tian, Zhiting Hu, Rahul Iyer, Katia Sycara

Extracting Domain-specific Concepts from Large-scale Linked Open Data

We propose a methodology for extracting concepts for a target domain from large-scale linked open data (LOD) to support the construction of domain ontologies providing field-specific knowledge and definitions. The proposed method defines…

Information Retrieval · Computer Science 2022-01-31 Satoshi Kume, Kouji Kozaki

Towards NLP-supported Semantic Data Management

The heterogeneity of data poses a great challenge when data from different sources is to be merged for one application. Solutions for this are offered, for example, by ontology-based data management (OBDM). A challenge of OBDM is the…

Information Retrieval · Computer Science 2020-05-15 Andreas Burgdorf, André Pomp, Tobias Meisen

ConceptScope: Organizing and Visualizing Knowledge in Documents based on Domain Ontology

Current text visualization techniques typically provide overviews of document content and structure using intrinsic properties such as term frequencies, co-occurrences, and sentence structures. Such visualizations lack conceptual overviews…

Human-Computer Interaction · Computer Science 2021-03-03 Xiaoyu Zhang, Senthil Chandrasegaran, Kwan-Liu Ma

Adapting Neural Text Classification for Improved Software Categorization

Software Categorization is the task of organizing software into groups that broadly describe the behavior of the software, such as "editors" or "science." Categorization plays an important role in several maintenance tasks, such as…

Software Engineering · Computer Science 2018-06-18 Alexander LeClair, Zachary Eberhart, Collin McMillan

Exploring Ordinality in Text Classification: A Comparative Study of Explicit and Implicit Techniques

Ordinal Classification (OC) is a widely encountered challenge in Natural Language Processing (NLP), with applications in various domains such as sentiment analysis, rating prediction, and more. Previous approaches to tackle OC have…

Computation and Language · Computer Science 2024-05-21 Siva Rajesh Kasa, Aniket Goel, Karan Gupta, Sumegh Roychowdhury, Anish Bhanushali, Nikhil Pattisapu, Prasanna Srinivasa Murthy

Automatic Synonym Discovery with Knowledge Bases

Recognizing entity synonyms from text has become a crucial task in many entity-leveraging applications. However, discovering entity synonyms from domain-specific text corpora (e.g., news articles, scientific papers) is rather challenging.…

Computation and Language · Computer Science 2017-06-27 Meng Qu, Xiang Ren, Jiawei Han

Joint Embedding of Hierarchical Categories and Entities for Concept Categorization and Dataless Classification

Due to the lack of structured knowledge applied in learning distributed representation of categories, existing work cannot incorporate category hierarchies into entity information. We propose a framework that embeds entities and…

Computation and Language · Computer Science 2016-07-28 Yuezhang Li, Ronghuo Zheng, Tian Tian, Zhiting Hu, Rahul Iyer, Katia Sycara

Classification of entities via their descriptive sentences

Hypernym identification of open-domain entities is crucial for taxonomy construction as well as many higher-level applications. Current methods suffer from either low precision or low recall. To decrease the difficulty of this problem, we…

Artificial Intelligence · Computer Science 2017-11-29 Chao Zhao, Min Zhao, Yi Guan