Managing Quality of Probabilistic Databases

Cheng, RCK

File Download

There are no files associated with this item.

Links for fulltext

(May Require Subscription)

Publisher Website: 10.1007/978-3-642-36257-6_12
Scopus: eid_2-s2.0-85031414750

Supplementary

Citations:
- Scopus: 0
Appears in Collections:
- Computer Science: Chapter in book

Book Chapter: Managing Quality of Probabilistic Databases

Title	Managing Quality of Probabilistic Databases
Authors	Cheng, RCK
Issue Date	2013
Publisher	Springer-Verlag
Citation	Managing Quality of Probabilistic Databases. In Sadiq, S (Ed.), Handbook of Data Quality: Research and Practice, p. 271-291. Berlin; New York: Springer-Verlag, 2013 How to Cite? DOI: http://dx.doi.org/10.1007/978-3-642-36257-6_12
Abstract	Uncertain or imprecise data are pervasive in applications like location-based services, sensor monitoring, and data collection and integration. For these applications, probabilistic databases can be used to store uncertain data, and querying facilities are provided to yield answers with statistical confidence. Given that a limited amount of resources is available to “clean” the database (e.g., by probing some sensor data values to get their latest values), we address the problem of choosing the set of uncertain objects to be cleaned, in order to achieve the best improvement in the quality of query answers. For this purpose, we present the PWS-quality metric, which is a universal measure that quantifies the ambiguity of query answers under the possible world semantics. We study how PWS-quality can be efficiently evaluated for two major query classes: (1) queries that examine the satisfiability of tuples independent of other tuples (e.g., range queries) and (2) queries that require the knowledge of the relative ranking of the tuples (e.g., MAX queries). We then propose a polynomial-time solution to achieve an optimal improvement in PWS-quality. Other fast heuristics are also examined.
Persistent Identifier	http://hdl.handle.net/10722/166461
ISBN	9783642362569

DC Field	Value	Language
dc.contributor.author	Cheng, RCK	en_US
dc.date.accessioned	2012-09-20T08:36:32Z	-
dc.date.available	2012-09-20T08:36:32Z	-
dc.date.issued	2013	en_US
dc.identifier.citation	Managing Quality of Probabilistic Databases. In Sadiq, S (Ed.), Handbook of Data Quality: Research and Practice, p. 271-291. Berlin; New York: Springer-Verlag, 2013	-
dc.identifier.isbn	9783642362569	-
dc.identifier.uri	http://hdl.handle.net/10722/166461	-
dc.description.abstract	Uncertain or imprecise data are pervasive in applications like location-based services, sensor monitoring, and data collection and integration. For these applications, probabilistic databases can be used to store uncertain data, and querying facilities are provided to yield answers with statistical confidence. Given that a limited amount of resources is available to “clean” the database (e.g., by probing some sensor data values to get their latest values), we address the problem of choosing the set of uncertain objects to be cleaned, in order to achieve the best improvement in the quality of query answers. For this purpose, we present the PWS-quality metric, which is a universal measure that quantifies the ambiguity of query answers under the possible world semantics. We study how PWS-quality can be efficiently evaluated for two major query classes: (1) queries that examine the satisfiability of tuples independent of other tuples (e.g., range queries) and (2) queries that require the knowledge of the relative ranking of the tuples (e.g., MAX queries). We then propose a polynomial-time solution to achieve an optimal improvement in PWS-quality. Other fast heuristics are also examined.	-
dc.language	eng	en_US
dc.publisher	Springer-Verlag	en_US
dc.relation.ispartof	Handbook of Data Quality: Research and Practice	-
dc.title	Managing Quality of Probabilistic Databases	en_US
dc.type	Book_Chapter	en_US
dc.identifier.email	Cheng, RCK: ckcheng@cs.hku.hk	en_US
dc.identifier.authority	Cheng, RCK=rp00074	en_US
dc.identifier.doi	10.1007/978-3-642-36257-6_12	-
dc.identifier.scopus	eid_2-s2.0-85031414750	-
dc.identifier.hkuros	206199	en_US
dc.identifier.hkuros	224491	-
dc.identifier.spage	271	-
dc.identifier.epage	291	-
dc.publisher.place	Berlin; New York	-

File Download

Links for fulltext

(May Require Subscription)

Supplementary

Book Chapter: Managing Quality of Probabilistic Databases

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats