English
Related papers

Related papers: Columnar Database Techniques for Creating AI Featu…

200 papers

Contemporary database systems, while effective, suffer severe issues related to complexity and usability, especially among individuals who lack technical expertise but are unfamiliar with query languages like Structured Query Language…

Databases · Computer Science 2025-07-25 M. Tedeschi , S. Rizwan , C. Shringi , V. Devram Chandgir , S. Belich

Columnar databases are an established way to speed up online analytical processing (OLAP) queries. Nowadays, data processing (e.g., storage, visualization, and analytics) is often performed at the programming language level, hence it is…

Programming Languages · Computer Science 2023-02-21 Sebastian Kloibhofer , Lukas Makor , David Leopoldseder , Daniele Bonetta , Lukas Stadler , Hanspeter Mössenböck

Tabular data is one of the most widely used formats across industries, driving critical applications in areas such as finance, healthcare, and marketing. In the era of data-centric AI, improving data quality and representation has become…

Tabular data is a fundamental form of data structure. The evolution of table analysis tools reflects humanity's continuous progress in data acquisition, management, and processing. The dynamic changes in table columns arise from…

Artificial Intelligence · Computer Science 2026-01-28 Xinda Chen , Zhen Xing , Hanyu Zhang , Weimin Tan , Bo Yan

The rapid adoption of AI-powered applications demands high-performance, scalable, and efficient cloud database solutions, as traditional architectures often struggle with AI-driven workloads requiring real-time data access, vector search,…

Databases · Computer Science 2025-05-06 Santosh Bhupathi

In-memory columnar databases have become mainstream over the last decade and have vastly improved the fast processing of large volumes of data through multi-core parallelism and in-memory compression thereby eliminating the usual…

Databases · Computer Science 2016-09-27 Jayanth Jayanth

Modern analytical workloads increasingly combine relational data with array-valued attributes. While columnar database systems efficiently process such workloads, their ability to optimize queries that interleave relational operators with…

Databases · Computer Science 2026-04-03 Maroua Zeblah , Etienne Couritas , Sarah Chlyah , Pierre Genevès , Nils Gesbert , Nabil Layaïda

We propose Cognitive Databases, an approach for transparently enabling Artificial Intelligence (AI) capabilities in relational databases. A novel aspect of our design is to first view the structured data source as meaningful unstructured…

Databases · Computer Science 2017-12-21 Rajesh Bordawekar , Bortik Bandyopadhyay , Oded Shmueli

Dimensionality reduction in vector databases is pivotal for streamlining AI data management, enabling efficient storage, faster computation, and improved model performance. This paper explores the benefits of reducing vector database…

Databases · Computer Science 2024-04-10 Vitaly Bulgakov , Alec Segal

As the complexity of modern workloads and hardware increasingly outpaces human research and engineering capacity, existing methods for database performance optimization struggle to keep pace. To address this gap, a new class of techniques,…

Databases · Computer Science 2026-04-09 Audrey Cheng , Harald Ng , Aaron Kabcenell , Peter Bailis , Matei Zaharia , Lin Ma , Xiao Shi , Ion Stoica

We revisit column-oriented storage and query processing techniques in the context of contemporary graph database management systems (GDBMSs). Similar to column-oriented RDBMSs, GDBMSs support read-heavy analytical workloads that however…

Databases · Computer Science 2021-10-29 Pranjal Gupta , Amine Mhedhbi , Semih Salihoglu

Modern Internet applications often produce a large volume of user activity records. Data analysts are interested in cohort analysis, or finding unusual user behavioral trends, in these large tables of activity records. In a traditional…

Databases · Computer Science 2016-05-05 Dawei Jiang , Qingchao Cai , Gang Chen , H. V. Jagadish , Beng Chin Ooi , Kian-Lee Tan , Anthony K. H. Tung

The exponential growth of artificial intelligence (AI) and machine learning (ML) applications has necessitated the development of efficient storage solutions for vector and tensor data. This paper presents a novel approach for tensor…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-05-14 Zhiwei Bao , Liu Liao-Liao , Zhiyu Wu , Yifan Zhou , Dan Fan , Michal Aibin , Yvonne Coady , Andrew Brownsword

Databases are increasingly embracing AI to provide autonomous system optimization and intelligent in-database analytics, aiming to relieve end-user burdens across various industry sectors. Nonetheless, most existing approaches fail to…

The AI revolution is data driven. AI "data wrangling" is the process by which unusable data is transformed to support AI algorithm development (training) and deployment (inference). Significant time is devoted to translating diverse data…

Databases · Computer Science 2020-01-22 Jeremy Kepner , Vijay Gadepally , Hayden Jananthan , Lauren Milechin , Siddharth Samsi

Data-centric AI focuses on understanding and utilizing high-quality, relevant data in training machine learning (ML) models, thereby increasing the likelihood of producing accurate and useful results. Automatic feature augmentation, aiming…

Despite the artificial intelligence (AI) revolution, deep learning has yet to achieve much success with tabular data due to heterogeneous feature space and limited sample sizes without viable transfer learning. The new era of generative AI,…

Machine Learning · Computer Science 2025-01-14 Shourav B. Rabbani , Ibna Kowsar , Manar D. Samad

ArcNeural introduces a novel multimodal database tailored for the demands of Generative AI and Large Language Models, enabling efficient management of diverse data types such as graphs, vectors, and documents. Its storage-compute separated…

Databases · Computer Science 2025-06-12 Wu Min , Qiao Yuncong , Yu Tan , Chenghu Yang

Powerful foundation models, including large language models (LLMs), with Transformer architectures have ushered in a new era of Generative AI across various industries. Industry and research community have witnessed a large number of new…

Deep learning (DL) models have gained prominence in domains such as computer vision and natural language processing but remain underutilized for regression tasks involving tabular data. In these cases, traditional machine learning (ML)…

Machine Learning · Computer Science 2025-01-08 Assaf Shmuel , Oren Glickman , Teddy Lazebnik
‹ Prev 1 2 3 10 Next ›