Related papers: Syntactic Search by Example

Interactive Extractive Search over Biomedical Corpora

We present a system that allows life-science researchers to search a linguistically annotated corpus of scientific texts using patterns over dependency graphs, as well as using patterns over token sequences and a powerful variant of boolean…

Computation and Language · Computer Science 2020-06-09 Hillel Taub-Tabib, Micah Shlain, Shoval Sadde, Dan Lahav, Matan Eyal, Yaara Cohen, Yoav Goldberg

Emergence of Syntax Needs Minimal Supervision

This paper is a theoretical contribution to the debate on the learnability of syntax from a corpus without explicit syntax-specific guidance. Our approach originates in the observable structure of a corpus, which we use to define and…

Computation and Language · Computer Science 2020-05-05 Raphaël Bailly, Kata Gábor

Conversational Exploratory Search of Scholarly Publications Using Knowledge Graphs

Traditional search methods primarily depend on string matches, while semantic search targets concept-based matches by recognizing underlying intents and contextual meanings of search terms. Semantic search is particularly beneficial for…

Computation and Language · Computer Science 2024-10-02 Phillip Schneider, Florian Matthes

Syntactic Analysis Of Natural Language Using Linguistic Rules And Corpus-based Patterns

We are concerned with the syntactic annotation of unrestricted text. We combine a rule-based analysis with subsequent exploitation of empirical data. The rule-based surface syntactic analyser leaves some amount of ambiguity in the output…

cmp-lg · Computer Science 2008-02-03 Pasi Tapanainen, Timo Järvinen

A Concise Query Language with Search and Transform Operations for Corpora with Multiple Levels of Annotation

The usefulness of annotated corpora is greatly increased if there is an associated tool that can allow various kinds of operations to be performed in a simple way. Different kinds of annotation frameworks and many query languages for them…

Computation and Language · Computer Science 2011-08-10 Anil Kumar Singh

Synonym search in Wikipedia: Synarcher

The program Synarcher, which searches for synonyms (and related terms) in a text corpus of special structure (Wikipedia), was developed. The results of the search are presented in the form of a graph. It is possible to explore the graph and search for…

Information Retrieval · Computer Science 2007-05-23 A. Krizhanovsky

Exploratory Search with Sentence Embeddings

Exploratory search aims to guide users through a corpus rather than pinpointing exact information. We propose an exploratory search system based on hierarchical clusters and document summaries using sentence embeddings. With sentence…

Computation and Language · Computer Science 2020-07-23 Austin Silveria

Discovering Latent Information By Spreading Activation Algorithm For Document Retrieval

Syntactic search relies on keywords contained in a query to find suitable documents. So, documents that do not contain the keywords but contain information related to the query are not retrieved. Spreading activation is an algorithm for…

Artificial Intelligence · Computer Science 2018-08-07 Vuong M. Ngo

From Data to Dialogue: Leveraging the Structure of Knowledge Graphs for Conversational Exploratory Search

Exploratory search is an open-ended information retrieval process that aims at discovering knowledge about a topic or domain rather than searching for a specific answer or piece of information. Conversational interfaces are particularly…

Computation and Language · Computer Science 2023-10-10 Phillip Schneider, Nils Rehtanz, Kristiina Jokinen, Florian Matthes

Graph Queries from Natural Language using Constrained Language Models and Visual Editing

Querying knowledge bases using ontologies is usually performed using dedicated query languages, question-answering systems, or visual query editors for Knowledge Graphs. We propose a novel approach that enables users to query the knowledge…

Human-Computer Interaction · Computer Science 2025-12-02 Benedikt Kantz, Kevin Innerebner, Peter Waldert, Stefan Lengauer, Elisabeth Lex, Tobias Schreck

Learning Language from a Large (Unannotated) Corpus

A novel approach to the fully automated, unsupervised extraction of dependency grammars and associated syntax-to-semantic-relationship mappings from large text corpora is described. The suggested approach builds on the authors' prior work…

Computation and Language · Computer Science 2014-01-16 Linas Vepstas, Ben Goertzel

Demonstration of Faceted Search on Scholarly Knowledge Graphs

Scientists always look for the most accurate and relevant answer to their queries on the scholarly literature. Traditional scholarly search systems list documents instead of providing direct answers to the search queries. As data in…

Digital Libraries · Computer Science 2021-07-14 Golsa Heidari, Ahmad Ramadan, Markus Stocker, Sören Auer

Syntactic Complexity Identification, Measurement, and Reduction Through Controlled Syntactic Simplification

Text simplification is one of the domains in Natural Language Processing (NLP) that offers an opportunity to understand the text in a simplified manner for exploration. However, it is always hard to understand and retrieve knowledge from…

Computation and Language · Computer Science 2023-04-18 Muhammad Salman, Armin Haller, Sergio J. Rodríguez Méndez

Notable Characteristics Search through Knowledge Graphs

Query answering routinely employs knowledge graphs to assist the user in the search process. Given a knowledge graph that represents entities and relationships among them, one aims at complementing the search with intuitive but effective…

Databases · Computer Science 2018-02-13 Davide Mottin, Bastian Grasnick, Axel Kroschk, Patrick Siegler, Emmanuel Mueller

SoftMatcha: A Soft and Fast Pattern Matcher for Billion-Scale Corpus Searches

Researchers and practitioners in natural language processing and computational linguistics frequently observe and analyze the real language usage in large-scale corpora. For that purpose, they often employ off-the-shelf pattern-matching…

Computation and Language · Computer Science 2025-03-06 Hiroyuki Deguchi, Go Kamoda, Yusuke Matsushita, Chihiro Taguchi, Kohei Suenaga, Masaki Waga, Sho Yokoi

ScheMatiQ: From Research Question to Structured Data through Interactive Schema Discovery

Many disciplines pose natural-language research questions over large document collections whose answers typically require structured evidence, traditionally obtained by manually designing an annotation schema and exhaustively labeling the…

Computation and Language · Computer Science 2026-04-13 Shahar Levy, Eliya Habba, Reshef Mintz, Barak Raveh, Renana Keydar, Gabriel Stanovsky

Syntactic Substitutability as Unsupervised Dependency Syntax

Syntax is a latent hierarchical structure which underpins the robust and compositional nature of human language. In this work, we explore the hypothesis that syntactic dependencies can be represented in language model attention…

Computation and Language · Computer Science 2023-10-24 Jasper Jian, Siva Reddy

Neural Extractive Search

Domain experts often need to extract structured information from large corpora. We advocate for a search paradigm called "extractive search", in which a search query is enriched with capture-slots, to allow for such rapid extraction. Such…

Computation and Language · Computer Science 2021-06-10 Shauli Ravfogel, Hillel Taub-Tabib, Yoav Goldberg

Knowledge-based Conversational Search

Conversational interfaces that allow for intuitive and comprehensive access to digitally stored information remain an ambitious goal. In this thesis, we lay foundations for designing conversational search systems by analyzing the…

Information Retrieval · Computer Science 2019-12-17 Svitlana Vakulenko

Semantic Code Browsing

Programmers currently enjoy access to a very high number of code repositories and libraries of ever increasing size. The ensuing potential for reuse is however hampered by the fact that searching within all this code becomes an increasingly…

Programming Languages · Computer Science 2016-08-09 Isabel Garcia-Contreras, Jose F. Morales, Manuel V. Hermenegildo