Related papers: Graph Sampling with Distributed In-Memory Dataflow…

Distributed-Memory Vertex-Centric Network Embedding for Large-Scale Graphs

Network embedding is an important step in many different computations based on graph data. However, existing approaches are limited to small or middle size graphs with fewer than a million edges. In practice, web or social network graphs…

Distributed, Parallel, and Cluster Computing · Computer Science · 2020-06-09 · Sara Riazi, Boyana Norris

Space-Efficient Sampling from Social Activity Streams

In order to efficiently study the characteristics of network domains and support development of network systems (e.g. algorithms, protocols that operate on networks), it is often necessary to sample a representative subgraph from a large…

Social and Information Networks · Computer Science · 2012-06-22 · Nesreen K. Ahmed, Jennifer Neville, Ramana Kompella

Distributed GraphLab: A Framework for Machine Learning in the Cloud

While high-level data parallel frameworks, like MapReduce, simplify the design and implementation of large-scale data processing systems, they do not naturally or efficiently support many important data mining and machine learning…

Databases · Computer Science · 2012-04-30 · Yucheng Low, Joseph Gonzalez, Aapo Kyrola, Danny Bickson, Carlos Guestrin, Joseph M. Hellerstein

A Survey and Taxonomy of Graph Sampling

Graph sampling is a technique to pick a subset of vertices and/or edges from original graph. It has a wide spectrum of applications, e.g. survey hidden population in sociology [54], visualize social graph [29], scale down Internet AS graph…

Social and Information Networks · Computer Science · 2013-08-28 · Pili Hu, Wing Cheong Lau

Enhancing Stratified Graph Sampling Algorithms based on Approximate Degree Distribution

Sampling technique has become one of the recent research focuses in the graph-related fields. Most of the existing graph sampling algorithms tend to sample the high degree or low degree nodes in the complex networks because of the…

Social and Information Networks · Computer Science · 2018-02-02 · Junpeng Zhu, Hui Li, Mei Chen, Zhenyu Dai, Ming Zhu

Empirical Characterization of Graph Sampling Algorithms

Graph sampling allows mining a small representative subgraph from a big graph. Sampling algorithms deploy different strategies to replicate the properties of a given graph in the sampled graph. In this study, we provide a comprehensive…

Social and Information Networks · Computer Science · 2021-02-17 · Muhammad Irfan Yousuf, Izza Anwer, Raheel Anwar

An Empirical Comparison of Big Graph Frameworks in the Context of Network Analysis

Complex networks are relational data sets commonly represented as graphs. The analysis of their intricate structure is relevant to many areas of science and commerce, and data sets may reach sizes that require distributed storage and…

Distributed, Parallel, and Cluster Computing · Computer Science · 2016-01-05 · Jannis Koch, Christian L. Staudt, Maximilian Vogel, Henning Meyerhenke

Network Sampling: From Static to Streaming Graphs

Network sampling is integral to the analysis of social, information, and biological networks. Since many real-world networks are massive in size, continuously evolving, and/or distributed in nature, the network structure is often sampled in…

Social and Information Networks · Computer Science · 2012-11-16 · Nesreen K. Ahmed, Jennifer Neville, Ramana Kompella

DIMSpan - Transactional Frequent Subgraph Mining with Distributed In-Memory Dataflow Systems

Transactional frequent subgraph mining identifies frequent subgraphs in a collection of graphs. This research problem has wide applicability and increasingly requires higher scalability over single machine solutions to address the needs of…

Databases · Computer Science · 2017-03-07 · André Petermann, Martin Junghanns, Erhard Rahm

Graph Sample and Hold: A Framework for Big-Graph Analytics

Sampling is a standard approach in big-graph analytics; the goal is to efficiently estimate the graph properties by consulting a sample of the whole population. A perfect sample is assumed to mirror every property of the whole population.…

Social and Information Networks · Computer Science · 2014-03-18 · Nesreen K. Ahmed, Nick Duffield, Jennifer Neville, Ramana Kompella

Graph Sampling Approach for Reducing Computational Complexity of Large-Scale Social Network

Online social network services provide a platform for human social interactions. Nowadays, many kinds of online interactions generate large-scale social network data. Network analysis helps to mine knowledge and pattern from the…

Social and Information Networks · Computer Science · 2021-02-19 · Andry Alamsyah, Yahya Peranginangin, Intan Muchtadi-Alamsyah, Budi Rahardjo, Kuspriyanto

Node Copying: A Random Graph Model for Effective Graph Sampling

There has been an increased interest in applying machine learning techniques on relational structured-data based on an observed graph. Often, this graph is not fully representative of the true relationship amongst nodes. In these settings,…

Machine Learning · Statistics · 2022-08-05 · Florence Regol, Soumyasundar Pal, Jianing Sun, Yingxue Zhang, Yanhui Geng, Mark Coates

Subsampling large graphs and invariance in networks

Specify a randomized algorithm that, given a very large graph or network, extracts a random subgraph. What can we learn about the input graph from a single subsample? We derive laws of large numbers for the sampler output, by relating…

Statistics Theory · Mathematics · 2017-10-13 · Peter Orbanz

Decoupled Graph Neural Networks for Large Dynamic Graphs

Real-world graphs, such as social networks, financial transactions, and recommendation systems, often demonstrate dynamic behavior. This phenomenon, known as graph stream, involves the dynamic changes of nodes and the emergence and…

Machine Learning · Computer Science · 2023-05-16 · Yanping Zheng, Zhewei Wei, Jiajun Liu

A Framework for Computing on Large Dynamic Graphs

This proposal presents a graph computing framework intending to support both online and offline computing on large dynamic graphs efficiently. The framework proposes a new data model to support rich evolving vertex and edge data types. It…

Distributed, Parallel, and Cluster Computing · Computer Science · 2015-12-08 · Zhao Yu Dong

Scaling Inter-procedural Dataflow Analysis on the Cloud

Apart from forming the backbone of compiler optimization, static dataflow analysis has been widely applied in a vast variety of applications, such as bug detection, privacy analysis, program comprehension, etc. Despite its importance,…

Programming Languages · Computer Science · 2024-12-18 · Zewen Sun, Yujin Zhang, Duanchen Xu, Yiyu Zhang, Yun Qi, Yueyang Wang, Yi Li, Zhaokang Wang, Yue Li, Xuandong Li, Zhiqiang Zuo, Qingda Lu, Wenwen Peng, Shengjian Guo

Study on Resource Efficiency of Distributed Graph Processing

Graphs may be used to represent many different problem domains -- a concrete example is that of detecting communities in social networks, which are represented as graphs. With big data and more sophisticated applications becoming widespread…

Distributed, Parallel, and Cluster Computing · Computer Science · 2017-04-03 · Miguel E. Coimbra, Alexandre P. Francisco, Luis Veiga

Distributed Data Processing Frameworks for Big Graph Data

Recently we create so much data (2.5 quintillion bytes every day) that 90% of the data in the world today has been created in the last two years alone [1]. This data comes from sensors used to gather traffic or climate information, posts to…

Distributed, Parallel, and Cluster Computing · Computer Science · 2016-12-20 · Afsin Akdogan, Hien To

CADDeLaG: Framework for distributed anomaly detection in large dense graph sequences

Random walk based distance measures for graphs such as commute-time distance are useful in a variety of graph algorithms, such as clustering, anomaly detection, and creating low dimensional embeddings. Since such measures hinge on the…

Distributed, Parallel, and Cluster Computing · Computer Science · 2018-02-16 · Aniruddha Basak, Kamalika Das, Ole J. Mengshoel

Practical graph signal sampling with log-linear size scaling

Graph signal sampling is the problem of selecting a subset of representative graph vertices whose values can be used to interpolate missing values on the remaining graph vertices. Optimizing the choice of sampling set using concepts from…

Signal Processing · Electrical Eng. & Systems · 2022-02-02 · Ajinkya Jayawant, Antonio Ortega