Related papers: Data Lakes for Digital Humanities

200 papers

Data lakes are becoming increasingly prevalent for big data management and data analytics. In contrast to traditional 'schema-on-write' approaches such as data warehouses, data lakes are repositories storing raw data in its original formats…

Databases · Computer Science 2023-10-24 Rihan Hai, Christos Koutras, Christoph Quix, Matthias Jarke

In recent years, data lakes emerged as a way to manage large amounts of heterogeneous data for modern data analytics. One way to prevent data lakes from turning into inoperable data swamps is semantic data management. Some approaches propose…

Databases · Computer Science 2023-10-25 Sayed Hoseini, Johannes Theissen-Lipp, Christoph Quix

Data lakes have emerged as an alternative to data warehouses for the storage, exploration and analysis of big data. In a data lake, data are stored in a raw state and bear no explicit schema. Thence, an efficient metadata system is…

Databases · Computer Science 2019-05-13 Pegdwendé Sawadogo, Tokio Kibata, Jérôme Darmont

Over the past two decades, we have witnessed an exponential increase of data production in the world. So-called big data generally come from transactional systems, and even more so from the Internet of Things and social media. They are…

Databases · Computer Science 2021-07-26 Pegdwendé Sawadogo, Jérôme Darmont

Data commons collate data with cloud computing infrastructure and commonly used software services, tools and applications to create biomedical resources for the large-scale management, analysis, harmonization, and sharing of biomedical…

Genomics · Quantitative Biology 2018-12-27 Robert L. Grossman

Humanities have convincingly argued that they need transnational research opportunities and through the digital transformation of their disciplines also have the means to proceed with it on an up to now unknown scale. The digital…

Other Computer Science · Computer Science 2016-01-05 Tobias Blanke, Conny Kristel, Laurent Romary

Data lakes have emerged as a flexible and scalable solution for storing and analyzing large volumes of heterogeneous data, including structured, semi-structured, and unstructured formats. Despite their growing adoption in both industry and…

Databases · Computer Science 2026-01-28 Yi Lyu, Pei-Chieh Lo, Natan Lidukhover

Given a set of deep learning models, it can be hard to find models appropriate to a task, understand the models, and characterize how models are different one from another. Currently, practitioners rely on manually-written documentation to…

Databases · Computer Science 2025-02-24 Koyena Pal, David Bau, Renée J. Miller

Engaging in interdisciplinary projects on the intersection between visualization and humanities research can be a challenging endeavor. Challenges can be finding valuable outcomes for both domains, or how to apply state-of-the-art visual…

Human-Computer Interaction · Computer Science 2024-04-12 Christofer Meinecke

This vision paper introduces a pioneering data lake architecture designed to meet Life & Earth sciences' burgeoning data management needs. As the data landscape evolves, the imperative to navigate and maximize scientific opportunities has…

Large organizations are seeking to create new architectures and scalable platforms to effectively handle data management challenges due to the explosive nature of data rarely seen in the past. These data management challenges are largely…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-09-29 Ruoran Liu, Haruna Isah, Farhana Zulkernine

Social science research increasingly demands data-driven insights, yet researchers often face barriers such as lack of technical expertise, inconsistent data formats, and limited access to reliable datasets. Social science research…

Databases · Computer Science 2025-12-03 Puneet Arya, Ojas Sahasrabudhe, Adwaiya Srivastav, Partha Pratim Das, Maya Ramanath

Storing data is easy, but finding and using data is not. It is desirable that the data is stored in a structured format, which can be preserved and retrieved in future. Creating Metadata for the data is one way of creating structured data…

Information Theory · Computer Science 2011-01-04 Ranjeet Devarakonda, Giri Palanisamy, Jim Green

Querying and exploring massive collections of data sources, such as data lakes, has been an essential research topic in the database community. Although many efforts have been paid in the field of data discovery and data integration in data…

Databases · Computer Science 2025-04-04 Jin Wang, Yanlin Feng, Chen Shen, Sajjadur Rahman, Eser Kandogan

The development of digital humanities necessitates scholars to adopt more data-intensive methods and engage in multidisciplinary collaborations. Understanding their collaborative data behaviors becomes essential for providing more curated…

Digital Libraries · Computer Science 2025-03-11 Wenqi Li, Zhenyi Tang, Pengyi Zhang, Jun Wang

With new emerging technologies, such as satellites and drones, archaeologists collect data over large areas. However, it becomes difficult to process such data in time. Archaeological data also have many different formats (images, texts,…

Databases · Computer Science 2021-07-26 Pengfei Liu, Sabine Loudcher, Jérôme Darmont, Camille Noûs

Traditional data lakes provide critical data infrastructure for analytical workloads by enabling time travel, running SQL queries, ingesting data with ACID transactions, and visualizing petabyte-scale datasets on cloud storage. They allow…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-12-15 Sasun Hambardzumyan, Abhinav Tuli, Levon Ghukasyan, Fariz Rahman, Hrant Topchyan, David Isayan, Mark McQuade, Mikayel Harutyunyan, Tatevik Hakobyan, Ivo Stranic, Davit Buniatyan

With the rise of big data, business intelligence had to find solutions for managing even greater data volumes and variety than in data warehouses, which proved ill-adapted. Data lakes answer these needs from a storage point of view, but…

Databases · Computer Science 2018-07-12 Iuri Nogueira, Maram Romdhane, Jérôme Darmont

In the last few years, the concept of data lake has become trendy for data storage and analysis. Thus, several design alternatives have been proposed to build data lake systems. However, these proposals are difficult to evaluate as there…

Databases · Computer Science 2021-10-05 Pegdwendé Sawadogo, Jérôme Darmont

Over the past decade, the data lake concept has emerged as an alternative to data warehouses for storing and analyzing big data. A data lake allows storing data without any predefined schema. Therefore, data querying and analysis depend on…
