AbstractsComputer Science

Hadoop MapReduce in Eucalyptus Private Cloud

by Johan Nilsson




Institution: Umeå University
Department:
Year: 2011
Keywords: Engineering and Technology; Teknik och teknologier; TECHNOLOGY; Information technology; Computer science; TEKNIKVETENSKAP; Informationsteknik; Datavetenskap; teknik; Technology; Civilingenjörsprogrammet i teknisk datavetenskap; Master of Science Programme in Computing Science and Engineering
Record ID: 1369587
Full text PDF: http://urn.kb.se/resolve?urn=urn:nbn:se:umu:diva-51309


Abstract

This thesis investigates how setting up a private cloud using the Eucalyptus Cloud system could be done along with it's usability, requirements and limitations as an open-source cloud platform providing private cloud solutions. It also studies if using the MapReduce framework through Apache Hadoop's implementation on top of the private Eucalyptus Cloud can provide near linear scalability in terms of time and the amount of virtual machines in the cluster. Analysis has shown that Eucalyptus is lacking in a few usability areas when setting up the cloud infrastructure in terms of private networking and DNS lookups, yet the API that Eucalyptus provides gives benefits when migrating from public clouds like Amazon. The MapReduce framework is showing an initial near-linear relation which is declining when the amount of virtual machines is reaching the maximum of the cloud infrastructure.