Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1109/TKDE.2005.110
- Scopus: eid_2-s2.0-22944487013
- WOS: WOS:000229074800010
Article: Indexing useful structural patterns for XML query processing
Title | Indexing useful structural patterns for XML query processing |
---|---|
Authors | Lian, W; Mamoulis, N; Cheung, DWL; Yiu, SM |
Keywords | Document indexing Mining methods and algorithms Query processing XML/XSL/RDF |
Issue Date | 2005 |
Publisher | IEEE. The Journal's web site is located at http://www.computer.org/tkde |
Citation | IEEE Transactions on Knowledge and Data Engineering, 2005, v. 17 n. 7, p. 997-1009 |
Abstract | Queries on semistructured data are hard to process due to the complex nature of the data and call for specialized techniques. Existing path-based indexes and query processing algorithms are not efficient for searching complex structures beyond simple paths, even when the queries are highly selective. We introduce the definition of minimal infrequent structures (MIS), which are structures that 1) exist in the data, 2) are not frequent with respect to a support threshold, and 3) have only frequent substructures. By indexing the occurrences of MIS, we can efficiently locate the highly selective substructures of a query, improving search performance significantly. An efficient data mining algorithm is proposed, which finds the minimal infrequent structures. Their occurrences in the XML data are then indexed by a lightweight data structure and used as a fast filter step in query evaluation. We validate the efficiency and applicability of our methods through experimentation on both synthetic and real data. © 2005 IEEE. |
Persistent Identifier | http://hdl.handle.net/10722/47084 |
ISSN | 1041-4347 (2023 Impact Factor: 8.9; 2023 SCImago Journal Rankings: 2.867) |
ISI Accession Number ID | WOS:000229074800010 |
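
The three conditions that define a minimal infrequent structure (MIS) in the abstract can be illustrated with a small brute-force sketch. This is not the mining algorithm proposed in the article. It assumes a deliberately simplified toy model in which each XML document is reduced to a set of parent-child label pairs, a "structure" is any set of such pairs, and a substructure is any non-empty proper subset. The names `support`, `minimal_infrequent_structures`, `min_support`, and `max_size` are hypothetical and chosen only for this illustration.

```python
from itertools import combinations


def support(structure, documents):
    """Fraction of documents whose edge set contains every edge of `structure`."""
    return sum(structure <= doc for doc in documents) / len(documents)


def minimal_infrequent_structures(documents, min_support, max_size=3):
    """Brute-force enumeration of MIS in the toy edge-set model (an assumption).

    A structure qualifies when it (1) occurs in at least one document,
    (2) has support below `min_support`, and (3) has only frequent substructures.
    """
    all_edges = sorted(set().union(*documents))
    result = []
    for size in range(1, max_size + 1):
        for combo in combinations(all_edges, size):
            candidate = frozenset(combo)
            sup = support(candidate, documents)
            if sup == 0 or sup >= min_support:
                continue  # fails condition 1 (absent) or condition 2 (frequent)
            # Support never increases when edges are added, so checking the
            # subsets that drop exactly one edge covers all smaller subsets.
            subsets = (
                [frozenset(c) for c in combinations(combo, size - 1)]
                if size > 1
                else []
            )
            if all(support(s, documents) >= min_support for s in subsets):
                result.append(candidate)
    return result


# Four toy documents, each given as a set of (parent, child) label pairs.
docs = [
    {("a", "b"), ("a", "c")},
    {("a", "b"), ("a", "c")},
    {("a", "b"), ("a", "c"), ("b", "d")},
    {("a", "b"), ("b", "d")},
]

mis = minimal_infrequent_structures(docs, min_support=0.5)
print(mis)
```

With a support threshold of 0.5, every single edge and most pairs are frequent here, while the pair `("a", "c")` together with `("b", "d")` occurs in only one document and is therefore the only MIS. The three-edge structure is also infrequent, but it is not minimal because it contains that infrequent pair.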
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Lian, W | en_HK |
dc.contributor.author | Mamoulis, N | en_HK |
dc.contributor.author | Cheung, DWL | en_HK |
dc.contributor.author | Yiu, SM | en_HK |
dc.date.accessioned | 2007-10-30T07:06:45Z | - |
dc.date.available | 2007-10-30T07:06:45Z | - |
dc.date.issued | 2005 | en_HK |
dc.identifier.citation | IEEE Transactions on Knowledge and Data Engineering, 2005, v. 17 n. 7, p. 997-1009 | en_HK |
dc.identifier.issn | 1041-4347 | en_HK |
dc.identifier.uri | http://hdl.handle.net/10722/47084 | - |
dc.format.extent | 1334990 bytes | - |
dc.format.extent | 4295 bytes | - |
dc.format.extent | 3502 bytes | - |
dc.format.extent | 6619 bytes | - |
dc.format.mimetype | application/pdf | - |
dc.format.mimetype | text/plain | - |
dc.format.mimetype | text/plain | - |
dc.format.mimetype | text/plain | - |
dc.language | eng | en_HK |
dc.publisher | IEEE. The Journal's web site is located at http://www.computer.org/tkde | en_HK |
dc.relation.ispartof | IEEE Transactions on Knowledge and Data Engineering | en_HK |
dc.rights | ©2005 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE. | - |
dc.subject | Document indexing | en_HK |
dc.subject | Mining methods and algorithms | en_HK |
dc.subject | Query processing | en_HK |
dc.subject | XML/XSL/RDF | en_HK |
dc.title | Indexing useful structural patterns for XML query processing | en_HK |
dc.type | Article | en_HK |
dc.identifier.openurl | http://library.hku.hk:4550/resserv?sid=HKU:IR&issn=1041-4347&volume=17&issue=7&spage=997&epage=1009&date=2005&atitle=Indexing+useful+structural+patterns+for+XML+query+processing | en_HK |
dc.identifier.email | Mamoulis, N:nikos@cs.hku.hk | en_HK |
dc.identifier.email | Cheung, DWL:dcheung@cs.hku.hk | en_HK |
dc.identifier.email | Yiu, SM:smyiu@cs.hku.hk | en_HK |
dc.identifier.authority | Mamoulis, N=rp00155 | en_HK |
dc.identifier.authority | Cheung, DWL=rp00101 | en_HK |
dc.identifier.authority | Yiu, SM=rp00207 | en_HK |
dc.description.nature | published_or_final_version | en_HK |
dc.identifier.doi | 10.1109/TKDE.2005.110 | en_HK |
dc.identifier.scopus | eid_2-s2.0-22944487013 | en_HK |
dc.relation.references | http://www.scopus.com/mlt/select.url?eid=2-s2.0-22944487013&selection=ref&src=s&origin=recordpage | en_HK |
dc.identifier.volume | 17 | en_HK |
dc.identifier.issue | 7 | en_HK |
dc.identifier.spage | 997 | en_HK |
dc.identifier.epage | 1009 | en_HK |
dc.identifier.isi | WOS:000229074800010 | - |
dc.publisher.place | United States | en_HK |
dc.identifier.scopusauthorid | Lian, W=22433603900 | en_HK |
dc.identifier.scopusauthorid | Mamoulis, N=6701782749 | en_HK |
dc.identifier.scopusauthorid | Cheung, DWL=34567902600 | en_HK |
dc.identifier.scopusauthorid | Yiu, SM=7003282240 | en_HK |
dc.identifier.issnl | 1041-4347 | - |