6. Conclusions and future directions
XML pub/sub systems are important applications and the performance of such systems is important as the data volume keeps increasing. Performance improvement can be realized with various techniques. This paper focused on performance improvement from the communication models perspective. Effective communication models have potential to significantly improve the performance, as stated in Section 1. This paper presented two communication models for XML pub/sub services, i.e., the cross-layer model and the peer model. We then compared these two models with the conventional filter-based XML multicast model. The conventional XML multicast model builds an application-layer broker overlay above the network layer. The approach forwards the minimal number of XML messages to subscribers at the expense of two high computational cost of XML query aggregation and message filtering operations performed repeatedly at each hop from a PE broker to a CE broker. Both the proposed cross-layer model and the peer model limit the two expensive XML broker operations to be performed only at PE and CE nodes. However, the cross-layer model still relies on XML-capable brokers for message forwarding. On the other hand, the peer model does not establish and maintain a multicast tree, and message forwarding between PE and CEs is realized using the regular network routing technologies rather than following the multicast tree paths. The peer model offers several advantages: low computational cost (both query aggregation and publication filtering operations), easy deployment and management without the need of setting up application-layer XML-capable brokers, high network efficiency, support for frequent user subscription changes, and resilient of broker failures (assuming no failures occur at PE and CEs). An XML pub/sub prototype was devised and the different routing models were evaluated in a LAN environment and on the Amazon cloud centers around the world to emulate a WAN environment where propagation delay becomes the dominant factor. We compared three different models through extensive experimentation. The results showed that the cross-layer model and the peer model significantly outperformed the multicast model in a LAN environment primarily due to the elimination of the expensive filtering operation performed on each broker. Further, the peer model has a notably lower E2E delay than the other two models in a WAN environment as a result of eliminating the filtering operation performed at intermediate brokers and shorter paths used at the network layer than that of the application layer. The focus of this paper was on publication message forwarding. Integrating the publication message routing and user query aggregation techniques into a prototype and evaluating its performance with real workload warrants investigation.