File Download
There are no files associated with this item.
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1007/s00453-011-9506-5
- Scopus: eid_2-s2.0-79952701285
- WOS: WOS:000298852300022
- Find via
Supplementary
- Citations:
- Appears in Collections:
Article: Continuous monitoring of distributed data streams over a time-based sliding window
Title | Continuous monitoring of distributed data streams over a time-based sliding window | ||||
---|---|---|---|---|---|
Authors | |||||
Keywords | Algorithms Communication Distributed data streams Frequent items Quantiles | ||||
Issue Date | 2012 | ||||
Publisher | Springer New York LLC. The Journal's web site is located at http://link.springer.de/link/service/journals/00453/index.htm | ||||
Citation | Algorithmica, 2012, v. 62 n. 3-4, p. 1088-1111 How to Cite? | ||||
Abstract | In this paper we extend the study of algorithms for monitoring distributed data streams from whole data streams to a time-based sliding window. The concern is how to minimize the communication between individual streams and the root, while allowing the root, at any time, to report the global statistics of all streams within a given error bound. This paper presents communication-efficient algorithms for three classical statistics, namely, basic counting, frequent items and quantiles. The worst-case communication cost over a window is {Mathematical expression} bits for basic counting, {Mathematical expression} words for frequent items and {Mathematical expression} words for quantiles, where k is the number of distributed data streams, N is the total number of items in the streams that arrive or expire in the window, and ε<1 is the given error bound. The performance of our algorithms matches and nearly matches the corresponding lower bounds. We also show how to generalize these results to streams with out-of-order data. © 2011 Springer Science+Business Media, LLC. | ||||
Persistent Identifier | http://hdl.handle.net/10722/152454 | ||||
ISSN | 2023 Impact Factor: 0.9 2023 SCImago Journal Rankings: 0.905 | ||||
ISI Accession Number ID |
Funding Information: T. W. Lam is partially supported by the GRF Grant HKU-713909E. |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Chan, HL | en_HK |
dc.contributor.author | Lam, TW | en_HK |
dc.contributor.author | Lee, LK | en_HK |
dc.contributor.author | Ting, HF | en_HK |
dc.date.accessioned | 2012-06-26T06:39:15Z | - |
dc.date.available | 2012-06-26T06:39:15Z | - |
dc.date.issued | 2012 | en_HK |
dc.identifier.citation | Algorithmica, 2012, v. 62 n. 3-4, p. 1088-1111 | en_HK |
dc.identifier.issn | 0178-4617 | en_HK |
dc.identifier.uri | http://hdl.handle.net/10722/152454 | - |
dc.description.abstract | In this paper we extend the study of algorithms for monitoring distributed data streams from whole data streams to a time-based sliding window. The concern is how to minimize the communication between individual streams and the root, while allowing the root, at any time, to report the global statistics of all streams within a given error bound. This paper presents communication-efficient algorithms for three classical statistics, namely, basic counting, frequent items and quantiles. The worst-case communication cost over a window is {Mathematical expression} bits for basic counting, {Mathematical expression} words for frequent items and {Mathematical expression} words for quantiles, where k is the number of distributed data streams, N is the total number of items in the streams that arrive or expire in the window, and ε<1 is the given error bound. The performance of our algorithms matches and nearly matches the corresponding lower bounds. We also show how to generalize these results to streams with out-of-order data. © 2011 Springer Science+Business Media, LLC. | en_HK |
dc.language | eng | en_US |
dc.publisher | Springer New York LLC. The Journal's web site is located at http://link.springer.de/link/service/journals/00453/index.htm | en_HK |
dc.relation.ispartof | Algorithmica | en_HK |
dc.rights | The original publication is available at www.springerlink.com | - |
dc.subject | Algorithms | en_HK |
dc.subject | Communication | en_HK |
dc.subject | Distributed data streams | en_HK |
dc.subject | Frequent items | en_HK |
dc.subject | Quantiles | en_HK |
dc.title | Continuous monitoring of distributed data streams over a time-based sliding window | en_HK |
dc.type | Article | en_HK |
dc.identifier.email | Chan, HL: hlchan@cs.hku.hk | en_HK |
dc.identifier.email | Lam, TW: hresltk@hkucc.hku.hk | en_HK |
dc.identifier.email | Lee, LK: lklee@mpi-inf.mpg.de | en_HK |
dc.identifier.email | Ting, HF: hfting@cs.hku.hk | en_HK |
dc.identifier.authority | Chan, HL=rp01310 | en_HK |
dc.identifier.authority | Lam, TW=rp00135 | en_HK |
dc.identifier.authority | Lee, LK=rp00140 | en_HK |
dc.identifier.authority | Ting, HF=rp00177 | en_HK |
dc.description.nature | link_to_subscribed_fulltext | en_US |
dc.identifier.doi | 10.1007/s00453-011-9506-5 | en_HK |
dc.identifier.scopus | eid_2-s2.0-79952701285 | en_HK |
dc.identifier.hkuros | 206259 | - |
dc.identifier.volume | 62 | - |
dc.identifier.issue | 3-4 | - |
dc.identifier.spage | 1088 | en_HK |
dc.identifier.epage | 1111 | en_HK |
dc.identifier.isi | WOS:000298852300022 | - |
dc.publisher.place | United States | en_HK |
dc.identifier.scopusauthorid | Ting, HF=7005654198 | en_HK |
dc.identifier.scopusauthorid | Lee, LK=12646190100 | en_HK |
dc.identifier.scopusauthorid | Lam, TW=7202523165 | en_HK |
dc.identifier.scopusauthorid | Chan, HL=7403402384 | en_HK |
dc.identifier.citeulike | 9082454 | - |
dc.identifier.issnl | 0178-4617 | - |