English
Related papers

Related papers: Technical Report: Developing a Working Data Hub

200 papers

The rapid development of network science and technologies depends on shareable datasets. Currently, there is no standard practice for reporting and sharing network datasets. Some network dataset providers only share links, while others…

Social and Information Networks · Computer Science 2022-06-09 Xinyi Zheng , Ryan A. Rossi , Nesreen Ahmed , Dominik Moritz

Data exploration and quality analysis is an important yet tedious process in the AI pipeline. Current practices of data cleaning and data readiness assessment for machine learning tasks are mostly conducted in an arbitrary manner which…

Databases · Computer Science 2020-10-16 Shazia Afzal , Rajmohan C , Manish Kesarwani , Sameep Mehta , Hima Patel

The data warehousing is becoming increasingly important in terms of strategic decision making through their capacity to integrate heterogeneous data from multiple information sources in a common storage space, for querying and analysis. So…

Databases · Computer Science 2012-05-04 Phuc V. Nguyen

Big Data technology is described. Big data is a popular term used to describe the exponential growth and availability of data, both structured and unstructured. There is constructed dataspace architecture. Dataspace has focused solely - and…

Databases · Computer Science 2019-05-07 Nataliya Shakhovska , Yurii Bolubash

"Data" is becoming an indispensable production factor, just like land, infrastructure, labor or capital. As part of this, a myriad of applications in different sectors require huge amounts of information to feed models and algorithms…

Databases · Computer Science 2022-01-13 Santiago Andrés Azcoitia , Nikolaos Laoutaris

Nowadays, many decision support applications need to exploit data that are not only numerical or symbolic, but also multimedia, multistructure, multisource, multimodal, and/or multiversion. We term such data complex data. Managing and…

Databases · Computer Science 2007-07-12 Jérôme Darmont , Omar Boussaid , Jean-Christian Ralaivao , Kamel Aouiche

In the 1990s, statisticians began thinking in a principled way about how computation could better support the learning and doing of statistics. Since then, the pace of software development has accelerated, advancements in computing and data…

Computation · Statistics 2018-06-05 Amelia McNamara

Since the use of computers in the business world, data collection has become one of the most important issues due to the available knowledge in the data; such data has been stored in the database. The database system was developed which led…

Databases · Computer Science 2019-04-03 Amro F. Alasta , Muftah A. Enaba

We propose there is a need for a technical platform enabling people to engage with the collection, management and consumption of personal data; and that this platform should itself be personal, under the direct control of the individual…

Computers and Society · Computer Science 2015-01-21 Hamed Haddadi , Heidi Howard , Amir Chaudhry , Jon Crowcroft , Anil Madhavapeddy , Richard Mortier

Datacenters provide cost-effective and flexible access to scalable compute and storage resources necessary for today's cloud computing needs. A typical datacenter is made up of thousands of servers connected with a large network and usually…

Networking and Internet Architecture · Computer Science 2018-05-24 Mohammad Noormohammadpour , Cauligi S. Raghavendra

In this paper we have focused a variety of techniques, approaches and different areas of the research which are helpful and marked as the important field of data mining Technologies. As we are aware that many Multinational companies and…

Databases · Computer Science 2012-11-27 Neelamadhab Padhy , Dr. Pragnyaban Mishra , Rasmita Panigrahi

Data is a critical element in any discovery process. In the last decades, we observed exponential growth in the volume of available data and the technology to manipulate it. However, data is only practical when one can structure it for a…

Data lakes are becoming increasingly prevalent for big data management and data analytics. In contrast to traditional 'schema-on-write' approaches such as data warehouses, data lakes are repositories storing raw data in its original formats…

Databases · Computer Science 2023-10-24 Rihan Hai , Christos Koutras , Christoph Quix , Matthias Jarke

Big data is no more "all just hype" but widely applied in nearly all aspects of our business, governments, and organizations with the technology stack of AI. Its influences are far beyond a simple technique innovation but involves all rears…

Other Computer Science · Computer Science 2021-04-19 Bilal Abu-Salih , Pornpit Wongthongtham , Dengya Zhu , Kit Yan Chan , Amit Rudra

AI application developers typically begin with a dataset of interest and a vision of the end analytic or insight they wish to gain from the data at hand. Although these are two very important components of an AI workflow, one often spends…

Databases · Computer Science 2021-03-04 El Kindi Rezig , Michael Cafarella , Vijay Gadepally

Data comes in many forms. From a shallow perspective, they can be viewed as being either in structured (e.g., as a relation, as key-value pairs) or unstructured (e.g., text, image) formats. So far, machines have been fairly good at…

Computation and Language · Computer Science 2026-03-31 Md Ataur Rahman , Dimitris Sacharidis , Oscar Romero , Sergi Nadal

The data is an important asset of an organization and it is essential to keep this asset secure. It requires security in whatever state is it i.e. data at rest, data in use, and data in transit. There is a need to pay more attention to it…

Cryptography and Security · Computer Science 2022-02-25 Ishu Gupta , Ashutosh Kumar Singh

The quality of the data in a dataset can have a substantial impact on the performance of a machine learning model that is trained and/or evaluated using the dataset. Effective dataset management, including tasks such as data cleanup,…

Databases · Computer Science 2023-03-16 Ze Mao , Yang Xu , Erick Suarez

Data collected by large-scale instruments, observatories, and sensor networks are key enablers of scientific discoveries in many disciplines. However, ensuring that these data can be accessed, integrated, and analyzed in a democratized and…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-12-14 Yubo Qin , Ivan Rodero , Manish Parashar

Workflow technology is rapidly evolving and, rather than being limited to modeling the control flow in business processes, is becoming a key mechanism to perform advanced data management, such as big data analytics. This survey focuses on…

Databases · Computer Science 2017-01-27 Georgia Kougka , Anastasios Gounaris , Alkis Simitsis
‹ Prev 1 2 3 10 Next ›