English
Related papers

Related papers: Data Engineering for Everyone

200 papers

Open data is an emerging paradigm to share large and diverse datasets -- primarily from governmental agencies, but also from other organizations -- with the goal to enable the exploitation of the data for societal, academic, and commercial…

Software Engineering · Computer Science 2012-02-09 Holger M. Kienle

In open-source software development environments; textual, numerical and relationship-based data generated are of interest to researchers. Various data sets are available for this data, which is frequently used in areas such as software…

Software Engineering · Computer Science 2020-10-01 Abdulkadir Şeker , Banu Diri , Halil Arslan

Artificial intelligence (AI) and machine learning (ML) are increasingly broadly adopted in industry, However, based on well over a dozen case studies, we have learned that deploying industry-strength, production quality ML models in systems…

Machine Learning · Computer Science 2020-06-04 Jan Bosch , Ivica Crnkovic , Helena Holmström Olsson

Machine learning (ML) is revolutionizing the world, affecting almost every field of science and industry. Recent algorithms (in particular, deep networks) are increasingly data-hungry, requiring large datasets for training. Thus, the…

Machine Learning · Computer Science 2022-11-16 Chen Shani , Jonathan Zarecki , Dafna Shahaf

We present OpenML and mldata, open science platforms that provides easy access to machine learning data, software and results to encourage further study and application. They go beyond the more traditional repositories for data sets and…

Machine Learning · Computer Science 2014-02-26 Joaquin Vanschoren , Mikio L. Braun , Cheng Soon Ong

In this big data era, the use of large dataset in conjunction with machine learning (ML) has been increasingly popular in both industry and academia. In recent times, the field of materials science is also undergoing a big data revolution,…

Materials Science · Physics 2023-09-27 Sue Sin Chong , Yi Sheng Ng , Hui-Qiong Wang , Jin-Cheng Zheng

Data-centric AI is a new and exciting research topic in the AI community, but many organizations already build and maintain various "data-centric" applications whose goal is to produce high quality data. These range from traditional…

Machine Learning · Computer Science 2021-12-14 Neoklis Polyzotis , Matei Zaharia

Machine learning is now used in many applications thanks to its ability to predict, generate, or discover patterns from large quantities of data. However, the process of collecting and transforming data for practical use is intricate. Even…

High-quality data has become increasingly important to software engineers in designing and implementing today's software, for example, as an input to machine-learning algorithms and visualisation- and analytics-based features. Open data -…

Software Engineering · Computer Science 2022-08-02 Johan Linåker , Per Runeson , Anneke Zuiderwijk , Amanda Brock

Background. Due to the widespread adoption of Artificial Intelligence (AI) and Machine Learning (ML) for building software applications, companies are struggling to recruit employees with a deep understanding of such technologies. In this…

Software Engineering · Computer Science 2025-01-24 Fabio Calefato , Luigi Quaranta , Filippo Lanubile , Marcos Kalinowski

The recent success of machine learning (ML) has led to an explosive growth both in terms of new systems and algorithms built in industry and academia, and new applications built by an ever-growing community of data science (DS)…

Machine Learning is transitioning from an art and science into a technology available to every developer. In the near future, every application on every platform will incorporate trained models to encode data-based decisions that would be…

Context: Advancements in machine learning (ML) lead to a shift from the traditional view of software development, where algorithms are hard-coded by humans, to ML systems materialized through learning from data. Therefore, we need to…

Software Engineering · Computer Science 2021-06-16 Görkem Giray

Recent advances in data science, machine learning, and artificial intelligence, such as the emergence of large language models, are leading to an increasing demand for data that can be processed by such models. While data sources are…

Machine Learning · Computer Science 2023-09-13 Paul Bilokon , Oleksandr Bilokon , Saeed Amen

Software engineering (SE) is a dynamic field that involves multiple phases all of which are necessary to develop sustainable software systems. Machine learning (ML), a branch of artificial intelligence (AI), has drawn a lot of attention in…

Software Engineering · Computer Science 2024-06-21 Nyaga Fred , I. O. Temkin

Machine learning (ML) is used increasingly in real-world applications. In this paper, we describe our ongoing endeavor to define characteristics and challenges unique to Requirements Engineering (RE) for ML-based systems. As a first step,…

Machine Learning · Computer Science 2019-08-14 Andreas Vogelsang , Markus Borg

Generating value from data requires the ability to find, access and make sense of datasets. There are many efforts underway to encourage data sharing and reuse, from scientific publishers asking authors to submit data alongside manuscripts…

Currently, a variety of pipeline tools are available for use in data engineering. Data scientists can use these tools to resolve data wrangling issues associated with data and accomplish some data engineering tasks from data ingestion…

Machine Learning · Computer Science 2024-06-21 Anthony Mbata , Yaji Sripada , Mingjun Zhong

The rise of machine learning (ML) and its integration into software systems has drastically changed development practices. While software engineering traditionally focused on manually created code artifacts with dedicated processes and…

Software Engineering · Computer Science 2025-02-25 Yorick Sens , Henriette Knopp , Sven Peldszus , Thorsten Berger

Given the complexity of typical data science projects and the associated demand for human expertise, automation has the potential to transform the data science process. Key insights: * Automation in data science aims to facilitate and…

‹ Prev 1 2 3 10 Next ›