Scalable probabilistic similarity ranking in uncertain databases

Bernecker, T; Kriegel, HP; Mamoulis, N; Renz, M; Zuefle, A

File Download

Content.pdf

Links for fulltext

(May Require Subscription)

Publisher Website: 10.1109/TKDE.2010.78
Scopus: eid_2-s2.0-77955134730
WOS: WOS:000280134800004
Find via

Supplementary

Citations:
- Scopus: 0
- Web of Science: 0
Appears in Collections:
- Computer Science: Journal/Magazine Articles

Article: Scalable probabilistic similarity ranking in uncertain databases

Title	Scalable probabilistic similarity ranking in uncertain databases
Authors	Bernecker, T Kriegel, HP Mamoulis, N Renz, M Zuefle, A
Keywords	probabilistic ranking similarity search Uncertain databases
Issue Date	2010
Publisher	I E E E. The Journal's web site is located at http://www.computer.org/tkde
Citation	Ieee Transactions On Knowledge And Data Engineering, 2010, v. 22 n. 9, p. 1234-1246 How to Cite? DOI: http://dx.doi.org/10.1109/TKDE.2010.78
Abstract	This paper introduces a scalable approach for probabilistic top-k similarity ranking on uncertain vector data. Each uncertain object is represented by a set of vector instances that is assumed to be mutually exclusive. The objective is to rank the uncertain data according to their distance to a reference object. We propose a framework that incrementally computes for each object instance and ranking position, the probability of the object falling at that ranking position. The resulting rank probability distribution can serve as input for several state-of-the-art probabilistic ranking models. Existing approaches compute this probability distribution by applying the Poisson binomial recurrence technique of quadratic complexity. In this paper, we theoretically as well as experimentally show that our framework reduces this to a linear-time complexity while having the same memory requirements, facilitated by incremental accessing of the uncertain vector instances in increasing order of their distance to the reference object. Furthermore, we show how the output of our method can be used to apply probabilistic top-k ranking for the objects, according to different state-of-the-art definitions. We conduct an experimental evaluation on synthetic and real data, which demonstrates the efficiency of our approach. © 2006 IEEE.
Persistent Identifier	http://hdl.handle.net/10722/138036
ISSN	1041-4347 2023 Impact Factor: 8.9 2023 SCImago Journal Rankings: 2.867
ISI Accession Number ID	WOS:000280134800004
References	References in Scopus

DC Field	Value	Language
dc.contributor.author	Bernecker, T	en_HK
dc.contributor.author	Kriegel, HP	en_HK
dc.contributor.author	Mamoulis, N	en_HK
dc.contributor.author	Renz, M	en_HK
dc.contributor.author	Zuefle, A	en_HK
dc.date.accessioned	2011-08-26T14:39:02Z	-
dc.date.available	2011-08-26T14:39:02Z	-
dc.date.issued	2010	en_HK
dc.identifier.citation	Ieee Transactions On Knowledge And Data Engineering, 2010, v. 22 n. 9, p. 1234-1246	en_HK
dc.identifier.issn	1041-4347	en_HK
dc.identifier.uri	http://hdl.handle.net/10722/138036	-
dc.description.abstract	This paper introduces a scalable approach for probabilistic top-k similarity ranking on uncertain vector data. Each uncertain object is represented by a set of vector instances that is assumed to be mutually exclusive. The objective is to rank the uncertain data according to their distance to a reference object. We propose a framework that incrementally computes for each object instance and ranking position, the probability of the object falling at that ranking position. The resulting rank probability distribution can serve as input for several state-of-the-art probabilistic ranking models. Existing approaches compute this probability distribution by applying the Poisson binomial recurrence technique of quadratic complexity. In this paper, we theoretically as well as experimentally show that our framework reduces this to a linear-time complexity while having the same memory requirements, facilitated by incremental accessing of the uncertain vector instances in increasing order of their distance to the reference object. Furthermore, we show how the output of our method can be used to apply probabilistic top-k ranking for the objects, according to different state-of-the-art definitions. We conduct an experimental evaluation on synthetic and real data, which demonstrates the efficiency of our approach. © 2006 IEEE.	en_HK
dc.language	eng	en_US
dc.publisher	I E E E. The Journal's web site is located at http://www.computer.org/tkde	en_HK
dc.relation.ispartof	IEEE Transactions on Knowledge and Data Engineering	en_HK
dc.rights	©2010 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.	-
dc.subject	probabilistic ranking	en_HK
dc.subject	similarity search	en_HK
dc.subject	Uncertain databases	en_HK
dc.title	Scalable probabilistic similarity ranking in uncertain databases	en_HK
dc.type	Article	en_HK
dc.identifier.email	Mamoulis, N:nikos@cs.hku.hk	en_HK
dc.identifier.authority	Mamoulis, N=rp00155	en_HK
dc.description.nature	published_or_final_version	-
dc.identifier.doi	10.1109/TKDE.2010.78	en_HK
dc.identifier.scopus	eid_2-s2.0-77955134730	en_HK
dc.identifier.hkuros	190922	en_US
dc.relation.references	http://www.scopus.com/mlt/select.url?eid=2-s2.0-77955134730&selection=ref&src=s&origin=recordpage	en_HK
dc.identifier.volume	22	en_HK
dc.identifier.issue	9	en_HK
dc.identifier.spage	1234	en_HK
dc.identifier.epage	1246	en_HK
dc.identifier.isi	WOS:000280134800004	-
dc.publisher.place	United States	en_HK
dc.identifier.scopusauthorid	Bernecker, T=24512341500	en_HK
dc.identifier.scopusauthorid	Kriegel, HP=7005718994	en_HK
dc.identifier.scopusauthorid	Mamoulis, N=6701782749	en_HK
dc.identifier.scopusauthorid	Renz, M=22433777600	en_HK
dc.identifier.scopusauthorid	Zuefle, A=25029386800	en_HK
dc.identifier.issnl	1041-4347	-

File Download

Links for fulltext

(May Require Subscription)

Supplementary

Article: Scalable probabilistic similarity ranking in uncertain databases

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats