
Exploration of GlusterFS: A Distributed Parallel Fault Tolerant File System

by Abraha Desta Gebremariam




Institution: University of Oslo
Department:
Year: 1000
Keywords: VDP::420
Record ID: 1293655
Full text PDF: https://www.duo.uio.no/handle/10852/37447


https://www.duo.uio.no/bitstream/10852/37447/2/Gebremariam.pdf


Abstract

The wide adoption of Internet services and cloud computing has increased the demand for reliable, high-performance storage in data centers. Distributed Parallel Fault Tolerant File Systems (DPFTFS) are an evolution of traditional Distributed File Systems (DFS) that tries to address these increased demands. GlusterFS is a relatively new open source DPFTFS that has recently gained considerable momentum, and this thesis explores its overall performance and reliability. The experiments were conducted by scaling up the number of concurrent clients accessing the file system while generating high workloads at the same time. Different metrics were measured on both servers and clients, including network bandwidth usage and system load, giving an overview of performance under different setups created with the Elastic Volume Management feature of GlusterFS. Data integrity on these setups was also tested under different failure scenarios, providing insight into reliability. The results show that GlusterFS sustains its performance as the number of connected clients increases, and that write performance is better than read performance for the distributed-only and striped-only volume types. Data loss and corruption, respectively, were also observed in these volume types. In a setup combining the distributed and replicated volume types, no data loss or corruption occurred, and read performance was better than write performance. The experiments exposed different bottlenecks related to network capacity, topology and hardware heterogeneity, indicating factors that need to be taken into consideration before deploying a GlusterFS storage solution.