TY - GEN
T1 - A prediction-based parallel replication algorithm in distributed storage system
AU - Wang, Yijie
AU - Zhang, Xiaoming
PY - 2005
Y1 - 2005
AB - Data replication can be used to reduce bandwidth consumption and access latency in the distributed system where users require remote access to large data objects. In this paper, according to the intrinsic characteristic of distributed storage system, the prediction-based parallel replication algorithm PPR is proposed. In the PPR, according to the characteristic of spatial data, the data that will be accessed is predicted, then the data is prefetched; during replication, according to the network state, several replicas of a data object are selected, which are of the least access cost; the different parts of the data object are transferred from these replicas, and they are used to make a new replica. The results of performance evaluation show that the PPR can utilize the network bandwidth efficiently, provide high data replication efficiency and substantially better access efficiency, and can avoid the interference between different replications efficiently.
UR - https://www.scopus.com/pages/publications/33646834237
U2 - 10.1007/11590354_86
DO - 10.1007/11590354_86
M3 - Conference contribution
AN - SCOPUS:33646834237
SN - 3540305106
SN - 9783540305101
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 690
EP - 700
BT - Grid and Cooperative Computing - GCC 2005 - 4th International Conference, Proceedings
PB - Springer Verlag
T2 - 4th International Conference on Grid and Cooperative Computing - GCC 2005
Y2 - 30 November 2005 through 3 December 2005
ER -