close this message

Donate to arXiv

Please join the Simons Foundation and our generous member organizations in supporting arXiv during our giving campaign September 23-27. 100% of your contribution will fund improvements and new initiatives to benefit arXiv's global scientific community.

DONATE

[secure site, no need to create account]

Skip to main content
Cornell University
We gratefully acknowledge support from
the Simons Foundation and member institutions.
arXiv.org > cs > arXiv:1110.0725v1

Help | Advanced Search

arXiv
Cornell University Logo

quick links

  • Login
  • Help Pages
  • About

Computer Science > Distributed, Parallel, and Cluster Computing

arXiv:1110.0725v1 (cs)
[Submitted on 4 Oct 2011]

Title:A Survey of Distributed Data Aggregation Algorithms

Authors:Paulo Jesus, Carlos Baquero, Paulo Sérgio Almeida
Download PDF
Abstract: Distributed data aggregation is an important task, allowing the decentralized determination of meaningful global properties, that can then be used to direct the execution of other applications. The resulting values result from the distributed computation of functions like COUNT, SUM and AVERAGE. Some application examples can found to determine the network size, total storage capacity, average load, majorities and many others.
In the last decade, many different approaches have been proposed, with different trade-offs in terms of accuracy, reliability, message and time complexity. Due to the considerable amount and variety of aggregation algorithms, it can be difficult and time consuming to determine which techniques will be more appropriate to use in specific settings, justifying the existence of a survey to aid in this task.
This work reviews the state of the art on distributed data aggregation algorithms, providing three main contributions. First, it formally defines the concept of aggregation, characterizing the different types of aggregation functions. Second, it succinctly describes the main aggregation techniques, organizing them in a taxonomy. Finally, it provides some guidelines toward the selection and use of the most relevant techniques, summarizing their principal characteristics.
Comments: 45 pages, Technical Report
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Data Structures and Algorithms (cs.DS); Information Retrieval (cs.IR); Networking and Internet Architecture (cs.NI)
ACM classes: C.2.4; A.1
Cite as: arXiv:1110.0725 [cs.DC]
  (or arXiv:1110.0725v1 [cs.DC] for this version)

Submission history

From: Paulo Jesus [view email]
[v1] Tue, 4 Oct 2011 15:24:25 UTC (51 KB)
Full-text links:

Download:

  • PDF
  • Other formats
(license)
Current browse context:
cs.DC
< prev   |   next >
new | recent | 1110
Change to browse by:
cs
cs.DS
cs.IR
cs.NI

References & Citations

  • NASA ADS
  • Google Scholar
  • Semantic Scholar

DBLP - CS Bibliography

listing | bibtex
Paulo Jesus
Carlos Baquero
Paulo Sérgio Almeida

Bookmark

BibSonomy logo Mendeley logo Reddit logo ScienceWISE logo
Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?) Browse v0.3.2 released 2020-06-25   
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack