English
Related papers

Related papers: Genomic data processing with GenomeFlow

200 papers

This paper provides a global picture about the deployment of networked processing services for genomic data sets. Many current research make an extensive use genomic data, which are massive and rapidly increasing over time. They are…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-09-27 Gianluca Reali , Mauro Femminella , Emilia Nunzi , Dario Valocchi

Genetic Algorithms (GAs) are a powerful technique to address hard optimisation problems. However, scalability issues might prevent them from being applied to real-world problems. Exploiting parallel GAs in the cloud might be an affordable…

Neural and Evolutionary Computing · Computer Science 2016-06-23 Pasquale Salza , Filomena Ferrucci

Genomic data sets are growing dramatically as the cost of sequencing continues to decline and small sequencing devices become available. Enormous community databases store and share this data with the research community, but some of these…

Data commons collate data with cloud computing infrastructure and commonly used software services, tools and applications to create biomedical resources for the large-scale management, analysis, harmonization, and sharing of biomedical…

Genomics · Quantitative Biology 2018-12-27 Robert L. Grossman

Edge computing has evolved to be a promising avenue to enhance the system computing capability by offloading processing tasks from the cloud to edge devices. In this paper, we propose a multi-layer edge computing framework called EdgeFlow.…

Networking and Internet Architecture · Computer Science 2018-04-04 Chao Yao , Xiaoyang Wang , Zijie Zheng , Guangyu Sun , Lingyang Song

Workflows are among the most commonly used tools in a variety of execution environments. Many of them target a specific environment; few of them make it possible to execute an entire workflow in different environments, e.g. Kubernetes and…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-07-09 Iacopo Colonnelli , Barbara Cantalupo , Ivan Merelli , Marco Aldinucci

Big data processing applications are becoming more and more complex. They are no more monolithic in nature but instead they are composed of decoupled analytical processes in the form of a workflow. One type of such workflow applications is…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-12-19 Mutaz Barika , Saurabh Garg , Andrew Chan , Rodrigo N. Calheiros

The search for similar genetic sequences is one of the main bioinformatics tasks. The genetic sequences data banks are growing exponentially and the searching techniques that use linear time are not capable to do the search in the required…

Distributed, Parallel, and Cluster Computing · Computer Science 2015-07-13 Felipe Albrecht

With the increasing number of Machine and Deep Learning applications in High Energy Physics, easy access to dedicated infrastructure represents a requirement for fast and efficient R&D. This work explores different types of cloud services…

Machine Learning · Computer Science 2021-11-09 Renato Cardoso , Dejan Golubovic , Ignacio Peluaga Lozada , Ricardo Rocha , João Fernandes , Sofia Vallecorsa

TensorFlow is a popular cloud computing framework that targets machine learning applications. It separates the specification of application logic (in a dataflow graph) from the execution of the logic. TensorFlow's native runtime executes…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-08-27 Sam Whitlock , James Larus , Edouard Bugnion

Computational complexity is a key limitation of genomic analyses. Thus, over the last 30 years, researchers have proposed numerous fast heuristic methods that provide computational relief. Comparing genomic sequences is one of the most…

Cloud computing is one of the most used distributed systems for data processing and data storage. Due to the continuous increase in the size of the data processed by cloud computing, scheduling multiple tasks to maintain efficiency while…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-08-24 Mahdi Manavi , Yunpeng Zhang , Guoning Chen

Machine learning (ML) methods have been widely used in genomic studies. However, genomic data are often held by different stakeholders (e.g. hospitals, universities, and healthcare companies) who consider the data as sensitive information,…

Cryptography and Security · Computer Science 2020-03-03 Cheng Hong , Zhicong Huang , Wen-jie Lu , Hunter Qu , Li Ma , Morten Dahl , Jason Mancuso

Today's sequencing technology allows sequencing an individual genome within a few weeks for a fraction of the costs of the original Human Genome project. Genomics labs are faced with dozens of TB of data per week that have to be…

Databases · Computer Science 2009-09-15 Uwe Roehm , Jose Blakeley

The cost of DNA sequencing has resulted in a surge of genetic data being utilised to improve scientific research, clinical procedures, and healthcare delivery in recent years. Since the human genome can uniquely identify an individual, this…

Cryptography and Security · Computer Science 2022-02-11 Sara Jafarbeiki , Raj Gaire , Amin Sakzad , Shabnam Kasra Kermanshahi , Ron Steinfeld

With the rapid development of cloud computing, the privacy security incidents occur frequently, especially data security issues. Cloud users would like to upload their sensitive information to cloud service providers in encrypted form…

Cryptography and Security · Computer Science 2018-12-04 Qi Wang , Dehua Zhou , Yanling Li

Trusted execution environments (TEE) such as Intel's Software Guard Extension (SGX) have been widely studied to boost security and privacy protection for the computation of sensitive data such as human genomics. However, a performance…

Cryptography and Security · Computer Science 2021-07-28 Chathura Widanage , Weijie Liu , Jiayu Li , Hongbo Chen , XiaoFeng Wang , Haixu Tang , Judy Fox

Genome sequence analysis plays a pivotal role in enabling many medical and scientific advancements in personalized medicine, outbreak tracing, and forensics. However, the analysis of genome sequencing data is currently bottlenecked by the…

Hardware Architecture · Computer Science 2021-11-04 Damla Senol Cali

Read mapping is a fundamental, yet computationally-expensive step in many genomics applications. It is used to identify potential matches and differences between fragments (called reads) of a sequenced genome and an already known genome…

In GPU-accelerated data analytics, the overhead of data transfer from CPU to GPU becomes a performance bottleneck when the data scales beyond GPU memory capacity due to the limited PCIe bandwidth. Data compression has come to rescue for…

Databases · Computer Science 2026-02-10 Gwangoo Yeo , Zhiyang Shen , Wei Cui , Matteo Interlandi , Rathijit Sen , Bailu Ding , Qi Chen , Minsoo Rhu
‹ Prev 1 2 3 10 Next ›