Grid computing is emerging as a new paradigm for solving large-scale problems and is becoming an established technology for providing transparent access to large-scale distributed computational resources. Resource allocation and application scheduling are two of the most important aspects of Grid computing. In general, a grid application also requires datasets that may not be available at the local computing site where the application has to be executed, and hence in this case the required data has to be fetched before running the application. Replication of data from primary repositories to other locations may improve the system performance, so as to reduce the frequency of remote data access. In this paper, we tackle with the local scheduling problem by means of a rectangle packing model combined with different policies for dataset replication, with the aim of maximizing the system efficiency of a local computing site. A simulation study is conducted to explore the impact these dataset policies on Grid scheduling by evaluating the performance of an on-line packing algorithm on different Grid scheduling scenarios.
Caramia, M., Giordani, S. (2006). A Simulation study on the impact of data replication policies in grid scheduling. WSEAS TRANSACTIONS ON COMPUTERS, 12, 2962-2969.
A Simulation study on the impact of data replication policies in grid scheduling
CARAMIA, MASSIMILIANO;GIORDANI, STEFANO
2006-01-01
Abstract
Grid computing is emerging as a new paradigm for solving large-scale problems and is becoming an established technology for providing transparent access to large-scale distributed computational resources. Resource allocation and application scheduling are two of the most important aspects of Grid computing. In general, a grid application also requires datasets that may not be available at the local computing site where the application has to be executed, and hence in this case the required data has to be fetched before running the application. Replication of data from primary repositories to other locations may improve the system performance, so as to reduce the frequency of remote data access. In this paper, we tackle with the local scheduling problem by means of a rectangle packing model combined with different policies for dataset replication, with the aim of maximizing the system efficiency of a local computing site. A simulation study is conducted to explore the impact these dataset policies on Grid scheduling by evaluating the performance of an on-line packing algorithm on different Grid scheduling scenarios.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.