File Download
There are no files associated with this item.
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.2352/ISSN.2470-1173.2018.09.IRIACV-276
- Scopus: eid_2-s2.0-85052904242
Supplementary
-
Citations:
- Scopus: 0
- Appears in Collections:
Conference Paper: Outlier detection in Large-Scale traffic data by regression analysis
Title | Outlier detection in Large-Scale traffic data by regression analysis |
---|---|
Authors | |
Issue Date | 2018 |
Citation | IS and T International Symposium on Electronic Imaging Science and Technology, 2018, p. 1271-1274 How to Cite? |
Abstract | © 2018, Society for Imaging Science and Technology. A robust outlier detection for large-scale traffic data by an unsupervised regression method is proposed in this paper. Traffic data is collected from loops, sensors and digital cameras all around a city every day. The data size is massive and in a big data format. Outlier is regarded as abnormal traffic situation like traffic jams, low traffic flows, or incidents as well as errors and noise in data storage and transmission. The traffic data to be tackled in this paper is represented by spatial temporal (ST) signals. A principle component analysis (PCA) is used for dimension reduction and to generate a representation of (x, y)-coordinates from the first two component's coefficients in the ST signals. The (x, y)-coordinate points of inliers are measured by Standardized Residual (SR), Hat Matrix (HM) and Cook's Distance (CD) in the regression method so that outliers are assumed to have high changes in these three metrics in the best fit regression model. Experimental result of the proposed method for the Level 1 data achieves detection success rates (DSRs) of 97.37% (SR), 91.19% (HM), 94.28% (CD) for linear regression model, respectively, and 96.80% (SR), 89.71% (HM), 93.14% (CD) for quadratic regression model, respectively. For a finer granularity of Level 2 data, the regression method with the CD metric achieves 94.44% DSR. |
Persistent Identifier | http://hdl.handle.net/10722/276606 |
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Lam, Philip | - |
dc.contributor.author | Wang, Lili | - |
dc.contributor.author | Ngan, Henry Y.T. | - |
dc.contributor.author | Yung, Nelson H.C. | - |
dc.contributor.author | Ng, Michael K. | - |
dc.date.accessioned | 2019-09-18T08:34:07Z | - |
dc.date.available | 2019-09-18T08:34:07Z | - |
dc.date.issued | 2018 | - |
dc.identifier.citation | IS and T International Symposium on Electronic Imaging Science and Technology, 2018, p. 1271-1274 | - |
dc.identifier.uri | http://hdl.handle.net/10722/276606 | - |
dc.description.abstract | © 2018, Society for Imaging Science and Technology. A robust outlier detection for large-scale traffic data by an unsupervised regression method is proposed in this paper. Traffic data is collected from loops, sensors and digital cameras all around a city every day. The data size is massive and in a big data format. Outlier is regarded as abnormal traffic situation like traffic jams, low traffic flows, or incidents as well as errors and noise in data storage and transmission. The traffic data to be tackled in this paper is represented by spatial temporal (ST) signals. A principle component analysis (PCA) is used for dimension reduction and to generate a representation of (x, y)-coordinates from the first two component's coefficients in the ST signals. The (x, y)-coordinate points of inliers are measured by Standardized Residual (SR), Hat Matrix (HM) and Cook's Distance (CD) in the regression method so that outliers are assumed to have high changes in these three metrics in the best fit regression model. Experimental result of the proposed method for the Level 1 data achieves detection success rates (DSRs) of 97.37% (SR), 91.19% (HM), 94.28% (CD) for linear regression model, respectively, and 96.80% (SR), 89.71% (HM), 93.14% (CD) for quadratic regression model, respectively. For a finer granularity of Level 2 data, the regression method with the CD metric achieves 94.44% DSR. | - |
dc.language | eng | - |
dc.relation.ispartof | IS and T International Symposium on Electronic Imaging Science and Technology | - |
dc.title | Outlier detection in Large-Scale traffic data by regression analysis | - |
dc.type | Conference_Paper | - |
dc.description.nature | link_to_subscribed_fulltext | - |
dc.identifier.doi | 10.2352/ISSN.2470-1173.2018.09.IRIACV-276 | - |
dc.identifier.scopus | eid_2-s2.0-85052904242 | - |
dc.identifier.spage | 1271 | - |
dc.identifier.epage | 1274 | - |
dc.identifier.eissn | 2470-1173 | - |
dc.identifier.issnl | 2470-1173 | - |