English
Related papers

Related papers: Real-time Log Query Interface for large datasets u…

200 papers

Interactive tools make data analysis both more efficient and more accessible to a broad population. Simple interfaces such as Google Finance as well as complex visual exploration interfaces such as Tableau are effective because they are…

Databases · Computer Science 2017-12-04 Haoci Zhang , Thibault Sellam , Eugene Wu

Deploying Machine Learning (ML) algorithms within databases is a challenge due to the varied computational footprints of modern ML algorithms and the myriad of database technologies each with its own restrictive syntax. We introduce an…

We present a reusable dataset and accompanying infrastructure for studying human search behavior in Interactive Information Retrieval (IIR). The dataset combines detailed interaction logs from 61 participants (122 sessions) with user…

Information Retrieval · Computer Science 2026-01-15 Jana Isabelle Friese , Andreas Konstantin Kruff , Philipp Schaer , Norbert Fuhr , Nicola Ferro

Interactive tools make data analysis more efficient and more accessible to end-users by hiding the underlying query complexity and exposing interactive widgets for the parts of the query that matter to the analysis. However, creating custom…

Databases · Computer Science 2019-04-16 Qianrui Zhang , Haoci Zhang , Thibault Sellam , Eugene Wu

Skyline queries are frequently used in data analytics and multi-criteria decision support applications to filter relevant information from big amounts of data. Apache Spark is a popular framework for processing big, distributed data. The…

Databases · Computer Science 2022-10-10 Lukas Grasmann , Reinhard Pichler , Alexander Selzer

With the ever-increasing volume of data, there is an urgent need to provide expressive and efficient tools to support Big Data analytics. The declarative logical language Datalog has proven very effective at expressing concisely graph,…

Databases · Computer Science 2022-09-07 Mingda Li , Jin Wang , Guorui Xiao , Youfu Li , Carlo Zaniolo

Today's high-performance computing (HPC) systems are heavily instrumented, generating logs containing information about abnormal events, such as critical conditions, faults, errors and failures, system resource utilization, and about the…

Distributed, Parallel, and Cluster Computing · Computer Science 2017-08-24 Byung H. Park , Saurabh Hukerikar , Ryan Adamson , Christian Engelmann

Building interactive tools to support data analysis is hard because it is not always clear what to build and how to build it. To address this problem, we present Precision Interfaces, a semi-automatic system to generate task-specific data…

Databases · Computer Science 2017-07-03 Haoci Zhang , Thibault Sellam , Eugene Wu

Latency is, unfortunately, a reality when working with large datasets. Guaranteeing imperceptible latency for interactivity is often prohibitively expensive: the application developer may be forced to migrate data processing engines or deal…

Human-Computer Interaction · Computer Science 2020-09-08 Yifan Wu , Remco Chang , Joseph M. Hellerstein , Eugene Wu

Conversational user interfaces powered by large language models (LLMs) have significantly lowered the technical barriers to database querying. However, existing tools still encounter several challenges, such as misinterpretation of user…

Human-Computer Interaction · Computer Science 2025-08-22 Longfei Chen , Shenghan Gao , Shiwei Wang , Ken Lin , Yun Wang , Quan Li

Following the current big data trend, the scale of real-time system call traces generated by Linux applications in a contemporary data center may increase excessively. Due to the deficiency of scalability, it is challenging for traditional…

Cryptography and Security · Computer Science 2022-01-10 Ming Liu , Zhi Xue , Xiangjian He , Jinjun Chen

The use of large-scale machine learning methods is becoming ubiquitous in many applications ranging from business intelligence to self-driving cars. These methods require a complex computation pipeline consisting of various types of…

Databases · Computer Science 2021-11-10 Yongyang Yu , Mingjie Tang , Walid G. Aref

Modern systems produce a large volume of logs to record run-time status and events. System operators use these raw logs to track a system in order to obtain some useful information to diagnose system anomalies. One of the most important…

Networking and Internet Architecture · Computer Science 2023-03-22 Shaohan Huang , Yi Liu , Carol Fung , Jiaxing Qi , Hailong Yang , Zhongzhi Luan

Software systems usually record important runtime information in their logs. Logs help practitioners understand system runtime behaviors and diagnose field failures. As logs are usually very large in size, automated log analysis is needed…

Software Engineering · Computer Science 2020-01-10 Hetong Dai , Heng Li , Weiyi Shang , Tse-Hsun Chen , Che-Shao Chen

Natural language interfaces (NLIs) provide users with a convenient way to interactively analyze data through natural language queries. Nevertheless, interactive data analysis is a demanding process, especially for novice data analysts. When…

Human-Computer Interaction · Computer Science 2022-11-02 Xingbo Wang , Furui Cheng , Yong Wang , Ke Xu , Jiang Long , Hong Lu , Huamin Qu

Large Language Models (LLMs) have shown remarkable proficiency in natural language understanding (NLU), opening doors for innovative applications. We introduce StreamLink - an LLM-driven distributed data system designed to improve the…

Databases · Computer Science 2025-05-29 Dawei Feng , Di Mei , Huiri Tan , Lei Ren , Xianying Lou , Zhangxi Tan

Developing autonomous driving systems (ADSs) involves generating and storing extensive log data from test drives, which is essential for verification, research, and simulation. However, these high-frequency logs, recorded over varying…

Software Engineering · Computer Science 2025-06-16 Simin Sun , Yuchuan Jin , Miroslaw Staron

Agents centered around Large Language Models (LLMs) are now capable of automating mobile device operations for users. After fine-tuning to learn a user's mobile operations, these agents can adhere to high-level user instructions online.…

Human-Computer Interaction · Computer Science 2024-01-18 Tinghe Ding

Autonomous driving software generates enormous amounts of data every second, which software development organizations save for future analysis and testing in the form of logs. However, given the vast size of this data, locating specific…

Software Engineering · Computer Science 2024-12-17 Jesper Knapp , Klas Moberg , Yuchuan Jin , Simin Sun , Miroslaw Staron

Logging is a critical function in modern distributed applications, but the lack of standardization in log query languages and formats creates significant challenges. Developers currently must write ad hoc queries in platform-specific…

‹ Prev 1 2 3 10 Next ›