Abstract
Optimization of data intensive applications is affected greatly by the nature of the platform, the distributed file system, the co-location of data and programs, and the proximity of resources. Data intensive applications running on a public cloud have been shown to exhibit degraded performance compared to a private cluster. This performance degradation is as a result of the inefficient resource allocation adopted in a virtualized, volatile and multi-tenancy environment such as a cloud. The placement of virtual machines is critical for improving performance in geo-clouds environment. We address degradation in performance by designing a scheduling algorithm that dynamically distributes virtual machines based on the characteristics of the job, the network characteristics and the real time performance of the data centers. Our algorithm also finds the best node(s) to host virtual machines that will yield maximum resource utilization and minimize bandwidth consumption. Our results show that incorporating these characteristics in a resource scheduling algorithm increases the performance of data intensive application than known traditional scheduling.
Original language | English |
---|---|
Title of host publication | Proceedings of the 2014 IEEE 18th International Conference on Computer Supported Cooperative Work in Design, CSCWD 2014 |
Publisher | IEEE Computer Society |
Pages | 295-300 |
Number of pages | 6 |
ISBN (Electronic) | 978-1-4799-3776-9, 978-1-4799-3775-2 |
ISBN (Print) | 978-1-4799-3776-9 |
DOIs | |
Publication status | Published - 8 Jul 2014 |
Event | IEEE International Conference on Computer Supported Cooperative Work in Design - Hsinchu, Taiwan, Province of China Duration: 21 May 2014 → 23 May 2014 Conference number: 18 |
Conference
Conference | IEEE International Conference on Computer Supported Cooperative Work in Design |
---|---|
Abbreviated title | CSCWD 2014 |
Country/Territory | Taiwan, Province of China |
City | Hsinchu |
Period | 21/05/14 → 23/05/14 |
Keywords
- Virtual machining
- Scheduling
- Processor scheduling
- Resource management
- Distributed databases
- Data models
- Cloud computing
- virtual machines
- data handling
- distributed processing
- operating systems (computers)
- optimisation
- resource allocation
- data centers
- network oriented resource allocation
- NORA
- data intensive applications
- cloud environment
- optimization
- distributed file system
- public cloud
- private cluster
- geoclouds environment
- scheduling algorithm
- network characteristics
- Hadoop MapReduce
- Grid Computing
- Distributed Application
ASJC Scopus subject areas
- Software