Efficient top-k aggregation of ranked inputs

Mamoulis, N; Yiu, ML; Cheng, KH; Cheung, DW

File Download

There are no files associated with this item.

Links for fulltext

(May Require Subscription)

Publisher Website: 10.1145/1272743.1272749
Scopus: eid_2-s2.0-34548407283
WOS: WOS:000249890400006
Find via

Supplementary

Citations:
- Scopus: 0
- Web of Science: 0
Appears in Collections:
- Computer Science: Journal/Magazine Articles

Article: Efficient top-k aggregation of ranked inputs

Title	Efficient top-k aggregation of ranked inputs
Authors	Mamoulis, N Yiu, ML Cheng, KH Cheung, DW
Keywords	Rank aggregation Top-k queries
Issue Date	2007
Publisher	Association for Computing Machinery, Inc.
Citation	Acm Transactions On Database Systems, 2007, v. 32 n. 3 How to Cite? DOI: http://dx.doi.org/10.1145/1272743.1272749
Abstract	A top-k query combines different rankings of the same set of objects and returns the k objects with the highest combined score according to an aggregate function. We bring to light some key observations, which impose two phases that any top-k algorithm, based on sorted accesses, should go through. Based on them, we propose a new algorithm, which is designed to minimize the number of object accesses, the computational cost, and the memory requirements of top-k search with monotone aggregate functions. We provide an analysis for its cost and show that it is always no worse than the baseline no random accesses algorithm in terms of computations, accesses, and memory required. As a side contribution, we perform a space analysis, which indicates the memory requirements of top-k algorithms that only perform sorted accesses. For the case, where the required space exceeds the available memory, we propose disk-based variants of our algorithm. We propose and optimize a multiway top-k join operator, with certain advantages over evaluation trees of binary top-k join operators. Finally, we define and study the computation of top-k cubes and the implementation of roll-up and drill-down operations in such cubes. Extensive experiments with synthetic and real data show that, compared to previous techniques, our method accesses fewer objects, while being orders of magnitude faster. © 2007 ACM.
Persistent Identifier	http://hdl.handle.net/10722/88889
ISSN	0362-5915 2023 Impact Factor: 2.2 2023 SCImago Journal Rankings: 1.730
ISI Accession Number ID	WOS:000249890400006
References	References in Scopus

DC Field	Value	Language
dc.contributor.author	Mamoulis, N	en_HK
dc.contributor.author	Yiu, ML	en_HK
dc.contributor.author	Cheng, KH	en_HK
dc.contributor.author	Cheung, DW	en_HK
dc.date.accessioned	2010-09-06T09:49:44Z	-
dc.date.available	2010-09-06T09:49:44Z	-
dc.date.issued	2007	en_HK
dc.identifier.citation	Acm Transactions On Database Systems, 2007, v. 32 n. 3	en_HK
dc.identifier.issn	0362-5915	en_HK
dc.identifier.uri	http://hdl.handle.net/10722/88889	-
dc.description.abstract	A top-k query combines different rankings of the same set of objects and returns the k objects with the highest combined score according to an aggregate function. We bring to light some key observations, which impose two phases that any top-k algorithm, based on sorted accesses, should go through. Based on them, we propose a new algorithm, which is designed to minimize the number of object accesses, the computational cost, and the memory requirements of top-k search with monotone aggregate functions. We provide an analysis for its cost and show that it is always no worse than the baseline no random accesses algorithm in terms of computations, accesses, and memory required. As a side contribution, we perform a space analysis, which indicates the memory requirements of top-k algorithms that only perform sorted accesses. For the case, where the required space exceeds the available memory, we propose disk-based variants of our algorithm. We propose and optimize a multiway top-k join operator, with certain advantages over evaluation trees of binary top-k join operators. Finally, we define and study the computation of top-k cubes and the implementation of roll-up and drill-down operations in such cubes. Extensive experiments with synthetic and real data show that, compared to previous techniques, our method accesses fewer objects, while being orders of magnitude faster. © 2007 ACM.	en_HK
dc.language	eng	en_HK
dc.publisher	Association for Computing Machinery, Inc.	en_HK
dc.relation.ispartof	ACM Transactions on Database Systems	en_HK
dc.rights	ACM Transactions on Database Systems. Copyright © Association for Computing Machinery, Inc.	en_HK
dc.subject	Rank aggregation	en_HK
dc.subject	Top-k queries	en_HK
dc.title	Efficient top-k aggregation of ranked inputs	en_HK
dc.type	Article	en_HK
dc.identifier.openurl	http://library.hku.hk:4550/resserv?sid=HKU:IR&issn=0730-0301&volume=32&issue=3&spage=&epage=&date=2007&atitle=Efficient+Top-k+Aggregation+of+Ranked+Inputs	en_HK
dc.identifier.email	Mamoulis, N:nikos@cs.hku.hk	en_HK
dc.identifier.email	Cheung, DW:dcheung@cs.hku.hk	en_HK
dc.identifier.authority	Mamoulis, N=rp00155	en_HK
dc.identifier.authority	Cheung, DW=rp00101	en_HK
dc.description.nature	link_to_subscribed_fulltext	-
dc.identifier.doi	10.1145/1272743.1272749	en_HK
dc.identifier.scopus	eid_2-s2.0-34548407283	en_HK
dc.identifier.hkuros	149702	en_HK
dc.relation.references	http://www.scopus.com/mlt/select.url?eid=2-s2.0-34548407283&selection=ref&src=s&origin=recordpage	en_HK
dc.identifier.volume	32	en_HK
dc.identifier.issue	3	en_HK
dc.identifier.eissn	1557-4644	-
dc.identifier.isi	WOS:000249890400006	-
dc.publisher.place	United States	en_HK
dc.identifier.scopusauthorid	Mamoulis, N=6701782749	en_HK
dc.identifier.scopusauthorid	Yiu, ML=8589889600	en_HK
dc.identifier.scopusauthorid	Cheng, KH=34467513500	en_HK
dc.identifier.scopusauthorid	Cheung, DW=34567902600	en_HK
dc.identifier.issnl	0362-5915	-

File Download

Links for fulltext

(May Require Subscription)

Supplementary

Article: Efficient top-k aggregation of ranked inputs

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats