File Download
  Links for fulltext
     (May Require Subscription)
Supplementary

Conference Paper: Evaluating probability Threshold k-nearest-neighbor queries over uncertain data

TitleEvaluating probability Threshold k-nearest-neighbor queries over uncertain data
Authors
KeywordsBiological managements
Candidate selection
Candidate sets
Efficient data structures
Emerging applications
Issue Date2009
PublisherAssociation for Computing Machinery.
Citation
The 12th International Conference on Extending Database Technology (EDBT 2009), St. Petersburg, Russia, 23-26 March 2009. In Proceedings of the 12th International Conference on Extending Database Technology, 2009, p. 672-683 How to Cite?
AbstractIn emerging applications such as location-based services, sensor monitoring and biological management systems, the values of the database items are naturally imprecise. For these uncertain databases, an important query is the Probabilistic k-Nearest-Neighbor Query (fc-PNN), which computes the probabilities of sets of k objects for being the closest to a given query point. The evaluation of this query can be both computationally- and I/O- expensive, since there is an exponentially large number of k object-sets, and numerical integration is required. Often a user may not be concerned about the exact probability values. For example, he may only need answers that have sufficiently high confidence. We thus propose the Probabilistic Threshold k-Nearest-Neighbor Query (T-k-PNN), which returns sets of k objects that satisfy the query with probabilities higher than some threshold T. Three steps are proposed to handle this query efficiently. In the first stage, objects that cannot constitute an answer are filtered with the aid of a spatial index. The second step, called probabilistic candidate selection, significantly prunes a number of candidate sets to be examined. The remaining sets are sent for verification, which derives the lower and upper bounds of answer probabilities, so that a candidate set can be quickly decided on whether it should be included in the answer. We also examine spatially-efficient data structures that support these methods. Our solution can be applied to uncertain data with arbitrary probability density functions. We have also performed extensive experiments to examine the effectiveness of our methods. Copyright 2009 ACM.
Persistent Identifierhttp://hdl.handle.net/10722/61146
ISBN
References

 

DC FieldValueLanguage
dc.contributor.authorCheng, Ren_HK
dc.contributor.authorChen, Len_HK
dc.contributor.authorChen, Jen_HK
dc.contributor.authorXie, Xen_HK
dc.date.accessioned2010-07-13T03:31:54Z-
dc.date.available2010-07-13T03:31:54Z-
dc.date.issued2009en_HK
dc.identifier.citationThe 12th International Conference on Extending Database Technology (EDBT 2009), St. Petersburg, Russia, 23-26 March 2009. In Proceedings of the 12th International Conference on Extending Database Technology, 2009, p. 672-683en_HK
dc.identifier.isbn9781605584225-
dc.identifier.urihttp://hdl.handle.net/10722/61146-
dc.description.abstractIn emerging applications such as location-based services, sensor monitoring and biological management systems, the values of the database items are naturally imprecise. For these uncertain databases, an important query is the Probabilistic k-Nearest-Neighbor Query (fc-PNN), which computes the probabilities of sets of k objects for being the closest to a given query point. The evaluation of this query can be both computationally- and I/O- expensive, since there is an exponentially large number of k object-sets, and numerical integration is required. Often a user may not be concerned about the exact probability values. For example, he may only need answers that have sufficiently high confidence. We thus propose the Probabilistic Threshold k-Nearest-Neighbor Query (T-k-PNN), which returns sets of k objects that satisfy the query with probabilities higher than some threshold T. Three steps are proposed to handle this query efficiently. In the first stage, objects that cannot constitute an answer are filtered with the aid of a spatial index. The second step, called probabilistic candidate selection, significantly prunes a number of candidate sets to be examined. The remaining sets are sent for verification, which derives the lower and upper bounds of answer probabilities, so that a candidate set can be quickly decided on whether it should be included in the answer. We also examine spatially-efficient data structures that support these methods. Our solution can be applied to uncertain data with arbitrary probability density functions. We have also performed extensive experiments to examine the effectiveness of our methods. Copyright 2009 ACM.en_HK
dc.languageengen_HK
dc.publisherAssociation for Computing Machinery.-
dc.relation.ispartofProceedings of the 12th International Conference on Extending Database Technologyen_HK
dc.subjectBiological managements-
dc.subjectCandidate selection-
dc.subjectCandidate sets-
dc.subjectEfficient data structures-
dc.subjectEmerging applications-
dc.titleEvaluating probability Threshold k-nearest-neighbor queries over uncertain dataen_HK
dc.typeConference_Paperen_HK
dc.identifier.openurlhttp://library.hku.hk:4550/resserv?sid=HKU:IR&issn=9781605584225&volume=&spage=672&epage=683&date=2009&atitle=Evaluating+probability+threshold+k-nearest-neighbor+queries+over+uncertain+data-
dc.identifier.emailCheng, R:ckcheng@cs.hku.hken_HK
dc.identifier.authorityCheng, R=rp00074en_HK
dc.description.naturelink_to_OA_fulltext-
dc.identifier.doi10.1145/1516360.1516438en_HK
dc.identifier.scopuseid_2-s2.0-70349103656en_HK
dc.identifier.hkuros162401en_HK
dc.relation.referenceshttp://www.scopus.com/mlt/select.url?eid=2-s2.0-70349103656&selection=ref&src=s&origin=recordpageen_HK
dc.identifier.spage672en_HK
dc.identifier.epage683en_HK
dc.description.otherThe 12th International Conference on Extending Database Technology (EDBT 2009), St. Petersburg, Russia, 23-26 March 2009. In Proceedings of the 12th International Conference on Extending Database Technology, 2009, p. 672-683-
dc.identifier.scopusauthoridCheng, R=7201955416en_HK
dc.identifier.scopusauthoridChen, L=25652992200en_HK
dc.identifier.scopusauthoridChen, J=36692766900en_HK
dc.identifier.scopusauthoridXie, X=34881209700en_HK

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats