    Better Selection of Virtual Machines for a MapReduce Environment

    Name:
    TETDEDXBlaisse-temple-0225M-12 ...
    Size:
    427.6Kb
    Format:
    PDF
    Genre
    Thesis/Dissertation
    Date
    2015
    Author
    Blaisse, Adam Pasqua
    Advisor
    Wu, Jie, 1961-
    Committee member
    Tan, Chiu C.
    Shi, Yuan
    Department
    Computer and Information Science
    Subject
    Computer Science
    Permanent link to this record
    http://hdl.handle.net/20.500.12613/2605
    
    DOI
    http://dx.doi.org/10.34944/dspace/2587
    Abstract
    With the increasing availability of large data sets comes the need for better and more sophisticated methods of handling and processing them. Because of the size and complexity of these data sets, many users have moved to distributed systems for storage and processing. In a distributed system, many tasks become much more complex, and there are many more opportunities for errors. Out of this arose the MapReduce paradigm. The basic idea of MapReduce is to minimize the work of the programmer and to remove many of the opportunities for error that distributed computation creates. To do this, all the work is done either in the Map phase by the Map tasks or in the Reduce phase by the Reduce tasks. Communication and synchronization are handled by MapReduce, so users are protected from misusing them. Users may also want to use MapReduce together with cloud computing. The most common resource rented from Amazon EC2 is the virtual machine (VM). Amazon offers many different sizes with different configurations. Some machines may be specialized for CPU-bound jobs, while others may be optimized for memory-bound or disk-bound jobs. Each VM type comes with a different number of CPU cores, amount of RAM, and storage capacity to match its use. Each VM type also has its own cost per hour. This means that simply selecting the largest or most powerful machine may not be the best option for users trying to get the most for their money. The other resource that Amazon offers is Elastic Block Store (EBS) volumes. The basic idea of these volumes is to give users more storage capacity to add to their virtual machines. Like the virtual machines, storage space has a per-hour cost that depends on the amount of space used as well as the type of storage requested. Once purchased, these storage volumes can be attached to a virtual machine and used as extended storage capacity. In this thesis, we look at how users can best select the type of virtual machine to fit a given MapReduce job.
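
The cost trade-off described in the abstract can be illustrated with a small sketch. This is not the method developed in the thesis. It assumes a toy model in which a job needs a fixed number of core-hours, work divides evenly across cores, and cost is simply hours multiplied by the hourly price. The VM names, sizes, and prices are hypothetical and are not real Amazon EC2 offerings.

```python
from dataclasses import dataclass
from typing import Iterable, Optional, Tuple


@dataclass(frozen=True)
class VMType:
    name: str              # hypothetical label, not a real EC2 instance name
    cores: int
    ram_gb: float
    price_per_hour: float  # made-up price in dollars


def estimated_hours(vm: VMType, core_hours: float, min_ram_gb: float) -> Optional[float]:
    """Toy runtime model: work splits evenly across cores.

    Returns None when the VM has less RAM than the job requires.
    """
    if vm.ram_gb < min_ram_gb:
        return None
    return core_hours / vm.cores


def cheapest_vm(
    vms: Iterable[VMType], core_hours: float, min_ram_gb: float
) -> Optional[Tuple[VMType, float, float]]:
    """Return (vm, hours, cost) with the lowest total cost, or None if no VM fits."""
    best = None
    for vm in vms:
        hours = estimated_hours(vm, core_hours, min_ram_gb)
        if hours is None:
            continue  # this VM cannot hold the job in memory
        cost = hours * vm.price_per_hour
        if best is None or cost < best[2]:
            best = (vm, hours, cost)
    return best


# Hypothetical catalog: the largest machine is fastest but not cheapest overall.
CATALOG = [
    VMType("small", cores=2, ram_gb=4, price_per_hour=0.10),
    VMType("medium", cores=4, ram_gb=8, price_per_hour=0.25),
    VMType("large", cores=16, ram_gb=64, price_per_hour=1.20),
]

choice = cheapest_vm(CATALOG, core_hours=32, min_ram_gb=8)
if choice is not None:
    vm, hours, cost = choice
    print(f"{vm.name}: {hours:.1f} h, ${cost:.2f}")
```

Under these assumed numbers the "large" machine finishes soonest but costs more in total than "medium", which is the situation the abstract warns about. A realistic model would also have to account for billing granularity, disk and network behavior, and attached storage costs.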
    Collections
    Theses and Dissertations

    Temple University Libraries | 1900 N. 13th Street | Philadelphia, PA 19122
    (215) 204-8212 | scholarshare@temple.edu