8. Conclusion and future work
This paper investigated the workload of cloud storage services. We proposed a model to describe the Dropbox workloads, and parameterized it using data from four different networks. We then used our model to implement CloudGen, the first workload generator that simulates both users’ behavior and the network traffic of cloud storage. We validated our workload generator, and illustrated its applicability by exploring the traffic volume that can be expected in future usage scenarios. We offer CloudGen to the community as free software, as a valuable tool to simulate the behavior of cloud storage clients. The analyses of Dropbox workloads from real data showed that our model for the Dropbox functioning is robust. Indeed, we found close agreement for all components of the model in four distinct datasets. We envision several directions for future work. Regarding our workload generator, we plan to extend it to other cloud storage protocols and terminals. For instance, we plan to add components to control the level of replication of updates, modeling cases where only part of peer-devices receives them. We also intend to model seasonality to simulate long-term dynamics of workloads in cloud storage as well as to plug CloudGen in well-known network simulators, then testing the impacts of network conditions on workloads. Moreover, we will investigate how individual user’s profile in- fluences the workload, and factor that into CloudGen, to allow the simulation of populations with particular characteristics. Finally, regarding applications of our model, we will leverage CloudGen to evaluate trade-offs (e.g., return on investments) of alternative architectures for cloud storage services.