English
Related papers

Related papers: Big Data and model-based survey sampling

200 papers

An ever-increasing deluge of big data is becoming available to national statistical offices globally, but it is well documented that statistics produced by big data alone often suffer from selection bias and are not usually representative…

Methodology · Statistics 2023-06-29 Ryan Covey

Bigdata is a dataset of which size is beyond the ability of handling a valuable raw material that can be refined and distilled into valuable specific insights. Compact data is a method that optimizes the big dataset that gives best assets…

Databases · Computer Science 2020-12-29 Song-Kyoo , Kim

Big data is data that exceeds the processing capacity of traditional databases. The data is too big to be processed by a single machine. New and innovative methods are required to process and store such large volumes of data. This paper…

Other Computer Science · Computer Science 2014-04-17 Richa Gupta , Sunny Gupta , Anuradha Singhal

Data can be collected in scientific studies via a controlled experiment or passive observation. Big data is often collected in a passive way, e.g. from social media. In studies of causation great efforts are made to guard against bias and…

Methodology · Statistics 2018-11-21 Elena Pesce , Eva Riccomagno , Henry P. Wynn

The term of big data was used since 1990s, but it became very popular around 2012. A recent definition of this term says that big data are information assets characterized by high volume, velocity, variety and veracity that need special…

General Economics · Economics 2024-06-19 Bogdan Oancea

Big Data has become the primary source of understanding the structure and dynamics of the society at large scale. The network of social interactions can be considered as a multiplex, where each layer corresponds to one communication channel…

Physics and Society · Physics 2016-12-21 János Török , Yohsuke Murase , Hang-Hyun Jo , János Kertész , Kimmo Kaski

Big data has ushered in a new wave of predictive power using machine learning models. In this work, we assess what {\it big} means in the context of typical materials-science machine-learning problems. This concerns not only data volume,…

Big Data concern large-volume, growing data sets that are complex and have multiple autonomous sources. Earlier technologies were not able to handle storage and processing of huge data thus Big Data concept comes into existence. This is a…

Machine Learning · Computer Science 2015-03-26 Praful Koturwar , Sheetal Girase , Debajyoti Mukhopadhyay

The article deals with the problem which led to Big Data. Big Data information technology is the set of methods and means of processing different types of structured and unstructured dynamic large amounts of data for their analysis and use…

Databases · Computer Science 2019-05-07 Nataliya Shakhovska , Uyrii Bolubash , Oleh Veres

Big data presents potential but unresolved value as a source for analysis and inference. However,selection bias, present in many of these datasets, needs to be accounted for so that appropriate inferences can be made on the target…

Methodology · Statistics 2025-01-09 Lyndon Ang , Robert Clark , Bronwyn Loong , Anders Holmberg

Technology is generating a huge and growing availability of observa tions of diverse nature. This big data is placing data learning as a central scientific discipline. It includes collection, storage, preprocessing, visualization and,…

Other Statistics · Statistics 2018-06-12 José L. Torrecilla , Juan Romo

Steve Jobs, one of the greatest visionaries of our time was quoted in 1996 saying "a lot of times, people do not know what they want until you show it to them" [38] indicating he advocated products to be developed based on human intuition…

Distributed, Parallel, and Cluster Computing · Computer Science 2016-01-19 Kevin Taylor-Sakyi

The use of big data in official statistics and the applied sciences is accelerating, but statistics computed using only big data often suffer from substantial selection bias. This leads to inaccurate estimation and invalid statistical…

Methodology · Statistics 2023-08-11 Ryan Covey , Lucca Buonamano

In analyzing big data for finite population inference, it is critical to adjust for the selection bias in the big data. In this paper, we propose two methods of reducing the selection bias associated with the big data sample. The first…

Methodology · Statistics 2019-01-08 Jae Kwang Kim , Zhonglei Wang

Finite population inference is a central goal in survey sampling. Probability sampling is the main statistical approach to finite population inference. Challenges arise due to high cost and increasing non-response rates. Data integration…

Methodology · Statistics 2020-01-13 Shu Yang , Jae Kwang Kim

The term big data has become ubiquitous. Owing to a shared origin between academia, industry and the media there is no single unified definition, and various stakeholders provide diverse and often contradictory definitions. The lack of a…

Databases · Computer Science 2013-09-24 Jonathan Stuart Ward , Adam Barker

There are two main approximations of mining big data in memory. One is to partition a big dataset to several subsets, so as to mine each subset in memory. By this way, global patterns can be obtained by synthesizing all local patterns…

Databases · Computer Science 2016-11-30 Shichao Zhang

Big data has been used widely in many areas including the transportation industry. Using various data sources, traffic states can be well estimated and further predicted for improving the overall operation efficiency. Combined with this…

Machine Learning · Computer Science 2022-02-21 Weiwei Jiang , Jiayun Luo

Recently, increasingly large amounts of data are generated from a variety of sources. Existing data processing technologies are not suitable to cope with the huge amounts of generated data. Yet, many research works focus on Big Data, a…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-06-07 Wissem Inoubli , Sabeur Aridhi , Haithem Mezni , Mondher Maddouri , Engelbert Mephu Nguifo

The term, Big Data, has been authored to refer to the extensive heave of data that can't be managed by traditional data handling methods or techniques. The field of Big Data plays an indispensable role in various fields, such as…

Computers and Society · Computer Science 2017-10-12 Mashooque Ahmed Memon , Safeeullah Soomro , Awais Khan Jumani , Muneer Ahmed Kartio
‹ Prev 1 2 3 10 Next ›