English
Related papers

Related papers: Software Scalability Issues in Large Clusters

200 papers

Having built up Linux clusters to more than 1000 nodes over the past five years, we already have practical experience confronting some of the LHC scale computing challenges: scalability, automation, hardware diversity, security, and rolling…

Distributed, Parallel, and Cluster Computing · Computer Science 2007-05-23 Vladimir Bahyl , Benjamin Chardi , Jan van Eldik , Ulrich Fuchs , Thorsten Kleinwort , Martin Murth , Tim Smith

High-performance computing (HPC) is essential for tackling complex computational problems across various domains. As the scale and complexity of HPC applications continue to grow, the need for scalable systems and software architectures…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-08-21 Risshab Srinivas Ramesh

Many appplications in computational science are sufficiently compute-intensive that they depend on the power of parallel computing for viability. For all but the "embarrassingly parallel" problems, the performance depends upon the level of…

High Energy Physics - Lattice · Physics 2009-09-29 Z. Sroczynski , N. Eicker , Th. Lippert , B. Orth , K. Schilling

The emergence of the GRID architecture and related tools will have a large impact in the operation and design of present and future large clusters. We present here the ongoing efforts to equip the Linux Farm at the RHIC Computing Facility…

Computational Physics · Physics 2007-05-23 A. Chan , R. Hogue , C. Hollowell , O. Rind , J. Smith , T. Throwe , T. Wlodek , D. Yu

To save cost, recently more and more users choose to provision virtual machine resources in cluster systems, especially in data centres. Maintaining a consistent member view is the foundation of reliable cluster managements, and it also…

Distributed, Parallel, and Cluster Computing · Computer Science 2010-04-13 Wei Zhou , Lei Wang , Dan Meng , Lin Yuan , Jianfeng Zhan

Containers are an emerging technology that hold promise for improving productivity and code portability in scientific computing. We examine Linux container technology for the distribution of a non-trivial scientific computing software stack…

Distributed, Parallel, and Cluster Computing · Computer Science 2017-06-19 Jack S. Hale , Lizao Li , Chris N. Richardson , Garth N. Wells

To accommodate the needs of large-scale distributed P2P systems, scalable data management strategies are required, allowing applications to efficiently cope with continuously growing, highly dis tributed data. This paper addresses the…

Distributed, Parallel, and Cluster Computing · Computer Science 2009-09-30 Bogdan Nicolae , Gabriel Antoniu , Luc Bougé

This paper explores the issues around the construction of large-scale complex systems which are built as 'systems of systems' and suggests that there are fundamental reasons, derived from the inherent complexity in these systems, why our…

Software Engineering · Computer Science 2011-09-16 Ian Sommerville , Dave Cliff , Radu Calinescu , Justin Keen , Tim Kelly , Marta Kwiatkowska , John McDermid , Richard Paige

Container technologies such as Docker have become a crucial component of many software industry practices especially those pertaining to reproducibility and portability. The containerization philosophy has influenced the scientific…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-08-05 Peter Z. Vaillancourt , J. Eric Coulter , Richard Knepper , Brandon Barker

Containers, enabling lightweight environment and performance isolation, fast and flexible deployment, and fine-grained resource sharing, have gained popularity in better application management and deployment in addition to hardware…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-11-06 Maria A. Rodriguez , Rajkumar Buyya

Enterprises and labs performing computationally expensive data science applications sooner or later face the problem of scale but unconnected infrastructure. For this up-scaling process, an IT service provider can be hired or in-house…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-10-12 Martin Uray , Eduard Hirsch , Gerold Katzinger , Michael Gadermayr

Nowadays, many individuals and teams involved on projects are already using agile development techniques as part of their daily work. However, we have much less experience in how to scale and manage agile practices in distributed software…

Software Engineering · Computer Science 2017-11-06 Mohammad Abdur Razzak

Recent developments in the commercial open source community have catalysed the use of Linux containers for scalable deployment of web-based applications to the cloud. Scientific software can be containerized with dependencies, configuration…

Software Engineering · Computer Science 2015-09-30 Robert Nagler , David Bruhwiler , Paul Moeller , Stephen Webb

Nowadays most of the cloud applications process large amount of data to provide the desired results. Data volumes to be processed by cloud applications are growing much faster than computing power. This growth demands new strategies for…

Distributed, Parallel, and Cluster Computing · Computer Science 2012-07-05 B. Thirumala Rao , N. V. Sridevi , V. Krishna Reddy , L. S. S. Reddy

This article examines the significant challenges encountered in implementing sharding within distributed replication systems. It identifies the impediments of achieving consensus among large participant sets, leading to scalability,…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-04-11 Siamak Solat

This paper presents a stream-oriented architecture for structuring cluster applications. Clusters that run applications based on this architecture can scale to tenths of thousands of nodes with significantly less performance loss or…

Distributed, Parallel, and Cluster Computing · Computer Science 2007-05-23 Tassos S. Argyros , David R. Cheriton

Virtual clusters are widely used computing platforms than can be deployed in multiple cloud platforms. The ability to dynamically grow and shrink the number of nodes has paved the way for customised elastic computing both for High…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-02-18 Miguel Caballer , Marica Antonacci , Zdeněk Šustr , Michele Perniola , Germán Moltó

Microsoft Cluster Service (MSCS) extends the Win-dows NT operating system to support high-availability services. The goal is to offer an execution environment where off-the-shelf server applications can continue to operate, even in the…

Operating Systems · Computer Science 2007-05-23 Werner Vogels , Dan Dumitriu , Ken Birman , Rod Gamache , Mike Massa , Rob Short , John Vert , Joe Barrera

Modern high-performance computing architectures (Multicore, GPU, Manycore) are based on tightly-coupled clusters of processing elements, physically implemented as rectangular tiles. Their size and aspect ratio strongly impact the achievable…

Hardware Architecture · Computer Science 2022-09-05 Gianna Paulin , Matheus Cavalcante , Paul Scheffler , Luca Bertaccini , Yichao Zhang , Frank Gürkaynak , Luca Benini

Software Product Lines are large-scale, multi-unit systems that enable massive, customized production. They consist of a base of reusable artifacts and points of variation that provide the system with flexibility, allowing generating…

Software Engineering · Computer Science 2019-01-08 Zineb Mcharfi , Bouchra El Asri , Abdelaziz Kriouile
‹ Prev 1 2 3 10 Next ›