File Download
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1109/ICDE.2009.26
- Scopus: eid_2-s2.0-67649641455
- WOS: WOS:000269126700040
- Find via
Supplementary
-
Bookmarks:
- CiteULike: 1
- Citations:
- Appears in Collections:
Conference Paper: Decision trees for uncertain data
Title | Decision trees for uncertain data |
---|---|
Authors | |
Keywords | Decision tree C4.5 Uncertain data Classification |
Issue Date | 2009 |
Publisher | IEEE, Computer Society. The Journal's web site is located at http://ieeexplore.ieee.org/xpl/conhome.jsp?punumber=1000178 |
Citation | Proceedings - International Conference on Data Engineering, 2009, p. 441-444 How to Cite? |
Abstract | Traditional decision tree classifiers work with data whose values are known and precise. We extend such classifiers to handle data with uncertain information, which originates from measurement/quantisation errors, data staleness, multiple repeated measurements, etc. The value uncertainty is represented by multiple values forming a probability distribution function (pdf). We discover that the accuracy of a decision tree classifier can be much improved if the whole pdf, rather than a simple statistic, is taken into account. We extend classical decision tree building algorithms to handle data tuples with uncertain values. Since processing pdf's is computationally more costly, we propose a series of pruning techniques that can greatly improve the efficiency of the construction of decision trees. © 2009 IEEE. |
Persistent Identifier | http://hdl.handle.net/10722/136223 |
ISBN | |
ISSN | |
ISI Accession Number ID | |
References |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Tsang, S | en_HK |
dc.contributor.author | Kao, B | en_HK |
dc.contributor.author | Yip, KY | en_HK |
dc.contributor.author | Ho, WS | en_HK |
dc.contributor.author | Lee, SD | en_HK |
dc.date.accessioned | 2011-07-27T02:05:00Z | - |
dc.date.available | 2011-07-27T02:05:00Z | - |
dc.date.issued | 2009 | en_HK |
dc.identifier.citation | Proceedings - International Conference on Data Engineering, 2009, p. 441-444 | en_HK |
dc.identifier.isbn | 9781424434220 | - |
dc.identifier.issn | 1063-6382 | en_HK |
dc.identifier.uri | http://hdl.handle.net/10722/136223 | - |
dc.description.abstract | Traditional decision tree classifiers work with data whose values are known and precise. We extend such classifiers to handle data with uncertain information, which originates from measurement/quantisation errors, data staleness, multiple repeated measurements, etc. The value uncertainty is represented by multiple values forming a probability distribution function (pdf). We discover that the accuracy of a decision tree classifier can be much improved if the whole pdf, rather than a simple statistic, is taken into account. We extend classical decision tree building algorithms to handle data tuples with uncertain values. Since processing pdf's is computationally more costly, we propose a series of pruning techniques that can greatly improve the efficiency of the construction of decision trees. © 2009 IEEE. | en_HK |
dc.language | eng | en_US |
dc.publisher | IEEE, Computer Society. The Journal's web site is located at http://ieeexplore.ieee.org/xpl/conhome.jsp?punumber=1000178 | en_HK |
dc.relation.ispartof | Proceedings - International Conference on Data Engineering | en_HK |
dc.rights | ©2009 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE. | - |
dc.subject | Decision tree | - |
dc.subject | C4.5 | - |
dc.subject | Uncertain data | - |
dc.subject | Classification | - |
dc.title | Decision trees for uncertain data | en_HK |
dc.type | Conference_Paper | en_HK |
dc.identifier.email | Kao, B: kao@cs.hku.hk | en_HK |
dc.identifier.email | Ho, WS: wsho@cs.hku.hk | en_HK |
dc.identifier.authority | Kao, B=rp00123 | en_HK |
dc.identifier.authority | Ho, WS=rp01730 | en_HK |
dc.description.nature | published_or_final_version | - |
dc.identifier.doi | 10.1109/ICDE.2009.26 | en_HK |
dc.identifier.scopus | eid_2-s2.0-67649641455 | en_HK |
dc.identifier.hkuros | 152284 | - |
dc.relation.references | http://www.scopus.com/mlt/select.url?eid=2-s2.0-67649641455&selection=ref&src=s&origin=recordpage | en_HK |
dc.identifier.spage | 441 | en_HK |
dc.identifier.epage | 444 | en_HK |
dc.identifier.isi | WOS:000269126700040 | - |
dc.publisher.place | United States | en_HK |
dc.identifier.scopusauthorid | Tsang, S=26666352300 | en_HK |
dc.identifier.scopusauthorid | Kao, B=35221592600 | en_HK |
dc.identifier.scopusauthorid | Yip, KY=7101909946 | en_HK |
dc.identifier.scopusauthorid | Ho, WS=7402968940 | en_HK |
dc.identifier.scopusauthorid | Lee, SD=7601400741 | en_HK |
dc.identifier.citeulike | 7034206 | - |
dc.identifier.issnl | 1063-6382 | - |