TU Berlin

Internet Network ArchitecturesAll Publications

Page Content

to Navigation

All publications

Move: A Large Scale Keyword-based Content Filtering and Dissemination System
Citation key RCHT-MLSKBCFDS-12
Author Rao, Weixiong and Chen, Lei and Hui, Pan and Tarkoma, Sasu
Title of Book Proceedings of the 32nd International Conference on Distributed Computing Systems (ICDCS '12)
Year 2012
Location Macau, China
Month June
Abstract The Web 2.0 era is characterized by the emergence of a very large amount of live content. A real time and fine-grained content filtering approach can precisely keep users up-to-date the information that they are interested. The key of the approach is to offer a scalable match algorithm. One might treat the content match as a special kind of content search, and resort to the classic algorithm [5]. However, due to blind flooding, [5] cannot be simply adapted for scalable content match. To increase the throughput of scalable match, we propose an adaptive approach to allocate (i.e, replicate and partition) filters. The allocation is based on our observation on real datasets: most users prefer to use short queries, consisting of around 2-3 terms per query, and web content typically contains tens and even thousands of terms per article. Thus, by reducing the number of processed documents, we can reduce the latency of matching large articles with filters, and have chance to achieve higher throughput. We implement our approach on an open source project, Apache Cassandra. The experiment with real datasets shows that our approach can achieve around folds of better throughput than two counterpart state-of-the-arts solutions.
Link to publication Download Bibtex entry


Quick Access

Schnellnavigation zur Seite über Nummerneingabe