English
Related papers

Related papers: SQLFlow: A Bridge between SQL and Machine Learning

200 papers

We present sql4ml, a system for expressing supervised machine learning (ML) models in SQL and automatically training them in TensorFlow. The primary motivation for this work stems from the observation that in many data science tasks there…

Databases · Computer Science 2019-08-05 Nantia Makrynioti , Ruy Ley-Wild , Vasilis Vassalos

Legal practitioners, particularly those early in their careers, face complex, high-stakes tasks that require adaptive, context-sensitive reasoning. While AI holds promise in supporting legal work, current datasets and models are narrowly…

The rapidly growing demand for high-quality data in Large Language Models (LLMs) has intensified the need for scalable, reliable, and semantically rich data preparation pipelines. However, current practices remain dominated by ad-hoc…

Motivation: Building and iterating machine learning models is often a resource-intensive process. In biomedical research, scientific codebases can lack scalability and are not easily transferable to work beyond what they were intended.…

Machine Learning · Computer Science 2025-04-03 Khoa A. Tran , John V. Pearson , Nicola Waddell

Table learning, which lies at the intersection of machine learning and modern database systems, has recently attracted growing attention. However, existing table learning frameworks typically require explicit data export and extensive…

Databases · Computer Science 2026-02-13 Feiyang Chen , Ken Zhong , Aoqian Zhang , Zheng Wang , Li Pan , Jianhua Li

The data-centric paradigm has emerged as a pivotal direction in artificial intelligence (AI), emphasizing the role of high-quality training data. This shift is especially critical in the Text-to-SQL task, where the scarcity, limited…

Computation and Language · Computer Science 2026-02-11 Qifeng Cai , Hao Liang , Chang Xu , Tao Xie , Wentao Zhang , Bin Cui

Text-to-SQL conversion is a critical innovation, simplifying the transition from complex SQL to intuitive natural language queries, especially significant given SQL's prevalence in the job market across various roles. The rise of Large…

Computation and Language · Computer Science 2024-07-23 Tingkai Zhang , Chaoyu Chen , Cong Liao , Jun Wang , Xudong Zhao , Hang Yu , Jianchao Wang , Jianguo Li , Wenhui Shi

Structured Query Language (SQL) has remained the standard query language for databases. SQL is highly optimized for processing structured data laid out in relations. Meanwhile, in the present application development landscape, it is highly…

Databases · Computer Science 2026-04-24 Udesh Kumarasinghe , Tyler Liu , Ahmed R. Mahmood , Chunwei Liu , Walid G. Aref

SqlCompose brings generative AI into the data analytics domain. SQL is declarative, has formal table schemas, and is often written in a non-linear manner. We address each of these challenges and develop a set of models that shows the…

Formulating efficient SQL queries requires several cycles of tuning and execution, particularly for inexperienced users. We examine methods that can accelerate and improve this interaction by providing insights about SQL queries prior to…

Databases · Computer Science 2020-02-24 Zainab Zolaktaf , Mostafa Milani , Rachel Pottinger

The recent development of large language models (LLMs) with multi-billion parameters, coupled with the creation of user-friendly application programming interfaces (APIs), has paved the way for automatically generating and executing code in…

Artificial Intelligence · Computer Science 2023-12-14 Alejandro Duque , Abdullah Syed , Kastan V. Day , Matthew J. Berry , Daniel S. Katz , Volodymyr V. Kindratenko

Dataflow applications, such as machine learning algorithms, can run for days, making it desirable to have assurances that they will work correctly. Current tools are not good enough: too often the interactions between tasks are not…

Programming Languages · Computer Science 2021-11-25 Riley Evans , Samantha Frohlich , Meng Wang

As Machine Learning (ML) gains adoption across industries and new use cases, practitioners increasingly realize the challenges around effectively developing and iterating on ML systems: reproducibility, debugging, scalability, and…

Machine Learning · Computer Science 2023-03-22 Jacopo Tagliabue , Hugo Bowne-Anderson , Ville Tuulos , Savin Goyal , Romain Cledat , David Berg

TensorFlow is a machine learning system that operates at large scale and in heterogeneous environments. TensorFlow uses dataflow graphs to represent computation, shared state, and the operations that mutate that state. It maps the nodes of…

Over the past decade, machine learning model complexity has grown at an extraordinary rate, as has the scale of the systems training such large models. However there is an alarmingly low hardware utilization (5-20%) in large scale AI…

Hardware Architecture · Computer Science 2022-11-14 Newsha Ardalani , Saptadeep Pal , Puneet Gupta

Snowflake's Cortex AISQL is a production SQL engine that integrates native semantic operations directly into SQL. This integration allows users to write declarative queries that combine relational operations with semantic reasoning,…

Querying tables with unstructured data is challenging due to the presence of text (or image), either embedded in the table or in external paragraphs, which traditional SQL struggles to process, especially for tasks requiring semantic…

Artificial Intelligence · Computer Science 2025-09-25 Rohit Khoja , Devanshu Gupta , Yanjie Fu , Dan Roth , Vivek Gupta

AI agents can extend their capabilities at inference time by loading reusable skills into context, yet equipping an agent with too many skills, particularly irrelevant ones, degrades performance. As community-driven skill repositories grow,…

Artificial Intelligence · Computer Science 2026-03-31 Fangzhou Li , Pagkratios Tagkopoulos , Ilias Tagkopoulos

Dialect translation plays a key role in enabling seamless interaction across heterogeneous database systems. However, translating SQL queries between different dialects (e.g., from PostgreSQL to MySQL) remains a challenging task due to…

Databases · Computer Science 2025-04-02 Wei Zhou , Yuyang Gao , Xuanhe Zhou , Guoliang Li

To extract value from evergrowing volumes of data, coming from a number of different sources, and to drive decision making, organizations frequently resort to the composition of data processing workflows, since they are expressive,…

Distributed, Parallel, and Cluster Computing · Computer Science 2016-12-13 Sérgio Esteves , Helena Galhardas , Luís Veiga
‹ Prev 1 2 3 10 Next ›