English
Related papers

Related papers: Model Lakes

200 papers

The rise of artificial intelligence and data science across industries underscores the pressing need for effective management and governance of machine learning (ML) models. Traditional approaches to ML models management often involve…

Machine Learning · Computer Science 2025-04-01 Moncef Garouani , Franck Ravat , Nathalie Valles-Parlangeau

Data lakes are becoming increasingly prevalent for big data management and data analytics. In contrast to traditional 'schema-on-write' approaches such as data warehouses, data lakes are repositories storing raw data in its original formats…

Databases · Computer Science 2023-10-24 Rihan Hai , Christos Koutras , Christoph Quix , Matthias Jarke

Querying and exploring massive collections of data sources, such as data lakes, has been an essential research topic in the database community. Although many efforts have been paid in the field of data discovery and data integration in data…

Databases · Computer Science 2025-04-04 Jin Wang , Yanlin Feng , Chen Shen , Sajjadur Rahman , Eser Kandogan

Data lakes have emerged as a flexible and scalable solution for storing and analyzing large volumes of heterogeneous data, including structured, semi-structured, and unstructured formats. Despite their growing adoption in both industry and…

Databases · Computer Science 2026-01-28 Yi Lyu , Pei-Chieh Lo , Natan Lidukhover

Over the past decade, the data lake concept has emerged as an alternative to data warehouses for storing and analyzing big data. A data lake allows storing data without any predefined schema. Therefore, data querying and analysis depend on…

Model complexity is a fundamental problem in deep learning. In this paper we conduct a systematic overview of the latest studies on model complexity in deep learning. Model complexity of deep learning can be categorized into expressive…

Machine Learning · Computer Science 2021-08-04 Xia Hu , Lingyang Chu , Jian Pei , Weiqing Liu , Jiang Bian

Over the past two decades, we have witnessed an exponential increase of data production in the world. So-called big data generally come from transactional systems, and even more so from the Internet of Things and social media. They are…

Databases · Computer Science 2021-07-26 Pegdwendé Sawadogo , Jérôme Darmont

Data lakes have emerged as an alternative to data warehouses for the storage, exploration and analysis of big data. In a data lake, data are stored in a raw state and bear no explicit schema. Thence, an efficient metadata system is…

Databases · Computer Science 2019-05-13 Pegdwendé Sawadogo , Tokio Kibata , Jérôme Darmont

The business model represents an increasingly important management concept. However, progress in research related to the concept is currently inhibited from inconsistencies in terms of formalizing and therewith also empirically measuring…

Computers and Society · Computer Science 2015-03-05 Fredrik Hacklin , Nobuaki Minato , Toma Kobayashi

The open-source model ecosystem now contains hundreds of thousands of pretrained models, yet picking the best model for a new dataset is increasingly infeasible: new models and unbenchmarked datasets emerge continuously, leaving…

Machine Learning · Computer Science 2026-05-11 Rui Cai , Weijie Jacky Mo , Xiaofei Wen , Qiyao Ma , Wenhui Zhu , Xiwen Chen , Muhao Chen , Zhe Zhao

In recent years, data lakes emerged as away to manage large amounts of heterogeneous data for modern data analytics. One way to prevent data lakes from turning into inoperable data swamps is semantic data management. Some approaches propose…

Databases · Computer Science 2023-10-25 Sayed Hoseini , Johannes Theissen-Lipp , Christoph Quix

With the rise of big data, business intelligence had to find solutions for managing even greater data volumes and variety than in data warehouses, which proved ill-adapted. Data lakes answer these needs from a storage point of view, but…

Databases · Computer Science 2018-07-12 Iuri Nogueira , Maram Romdhane , Jérôme Darmont

Machine learning can provide deep insights into data, allowing machines to make high-quality predictions and having been widely used in real-world applications, such as text mining, visual classification, and recommender systems. However,…

Machine Learning · Computer Science 2020-08-11 Meng Wang , Weijie Fu , Xiangnan He , Shijie Hao , Xindong Wu

Clouds gather a vast volume of telemetry from their networked systems which contain valuable information that can help solve many of the problems that continue to plague them. However, it is hard to extract useful information from such raw…

Networking and Internet Architecture · Computer Science 2020-04-28 Behnaz Arzani , Bita Rouhani

In the last few years, the concept of data lake has become trendy for data storage and analysis. Thus, several design alternatives have been proposed to build data lake systems. However, these proposals are difficult to evaluate as there…

Databases · Computer Science 2021-10-05 Pegdwendé Sawadogo , Jérôme Darmont

Complex systems' modeling and simulation are powerful ways to investigate a multitude of natural phenomena providing extended knowledge on their structure and behavior. However, enhanced modeling and simulation require integration of…

The rise of big data has revolutionized data exploitation practices and led to the emergence of new concepts. Among them, data lakes have emerged as large heterogeneous data repositories that can be analyzed by various methods. An efficient…

We consider the emerging problem of identifying the presence and use of watermarking schemes in widely used, publicly hosted, closed source large language models (LLMs). We introduce a suite of baseline algorithms for identifying watermarks…

Machine Learning · Computer Science 2023-05-31 Leonard Tang , Gavin Uberti , Tom Shlomi

Artificial intelligence (AI) provides many opportunities to improve private and public life. Discovering patterns and structures in large troves of data in an automated manner is a core component of data science, and currently drives…

Machine Learning · Computer Science 2020-09-25 Vaishak Belle , Ioannis Papantonis

Over the past few years, we have seen fundamental breakthroughs in core problems in machine learning, largely driven by advances in deep neural networks. At the same time, the amount of data collected in a wide array of scientific domains…

Machine Learning · Computer Science 2020-03-27 Maithra Raghu , Eric Schmidt
‹ Prev 1 2 3 10 Next ›