Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1109/TC.2006.89
- Scopus: eid_2-s2.0-33646486528
- WOS: WOS:000236929800005
Article: Risk-resilient heuristics and genetic algorithms for security-assured grid job scheduling
Title | Risk-resilient heuristics and genetic algorithms for security-assured grid job scheduling |
---|---|
Authors | Song, S; Hwang, K; Kwok, YK |
Keywords | Distributed supercomputing; Genetic algorithm; Grid computing; Job scheduling heuristics; NAS and PSA benchmarks; Performance metrics; Replication scheduling; Risk resilience |
Issue Date | 2006 |
Publisher | IEEE. The journal's website is located at http://www.computer.org/tc |
Citation | IEEE Transactions on Computers, 2006, v. 55 n. 6, p. 703-719 |
Abstract | In scheduling a large number of user jobs for parallel execution on an open-resource Grid system, the jobs are subject to system failures or delays caused by infected hardware, software vulnerability, and distrusted security policy. This paper models the risk and insecure conditions in Grid job scheduling. Three risk-resilient strategies, preemptive, replication, and delay-tolerant, are developed to provide security assurance. We propose six risk-resilient scheduling algorithms to assure secure Grid job execution under different risky conditions. We report the simulated Grid performances of these new Grid job scheduling algorithms under the NAS and PSA workloads. The relative performance is measured by the total job makespan, Grid resource utilization, job failure rate, slowdown ratio, replication overhead, etc. In addition to extending from known scheduling heuristics, we developed a new space-time genetic algorithm (STGA) based on faster searching and protected chromosome formation. Our simulation results suggest that, in a wide-area Grid environment, it is more resilient for the global job scheduler to tolerate some job delays instead of resorting to preemption or replication or taking a risk on unreliable resources allocated. We find that delay-tolerant Min-Min and STGA job scheduling have 13-23 percent higher performance than using risky or preemptive or replicated algorithms. The resource overheads for replicated job scheduling are kept at a low 15 percent. The delayed job execution is optimized with a delay factor, which is 20 percent of the total makespan. A Kiviat graph is proposed for demonstrating the quality of Grid computing services. These risk-resilient job scheduling schemes can upgrade Grid performance significantly at only a moderate increase in extra resources or scheduling delays in a risky Grid computing environment. © 2006 IEEE. |
Persistent Identifier | http://hdl.handle.net/10722/44772 |
ISSN | 0018-9340 (2023 Impact Factor: 3.6; 2023 SCImago Journal Rankings: 1.307) |
ISI Accession Number ID | WOS:000236929800005 |
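References | http://www.scopus.com/mlt/select.url?eid=2-s2.0-33646486528&selection=ref&src=s&origin=recordpage |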
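
The abstract names Min-Min as one of the heuristics that the paper extends, but this record does not reproduce the algorithms themselves. As a rough illustration only, the sketch below implements the classic Min-Min heuristic over an expected-time-to-compute matrix and adds an optional eligibility filter. The filter, the `demand` and `level` parameters, and the rule "a site is eligible when its security level is at least the job's security demand" are assumptions made for this example and are not taken from the paper.

```python
from typing import Dict, List, Optional, Tuple


def min_min(etc: List[List[float]],
            demand: Optional[List[float]] = None,
            level: Optional[List[float]] = None) -> Tuple[Dict[int, int], float]:
    """Classic Min-Min over a matrix etc[job][site] of expected run times.

    If demand and level are both given, a job may only run on a site whose
    (hypothetical) security level is at least the job's security demand.
    Returns (job -> site assignment, makespan).
    """
    n_jobs, n_sites = len(etc), len(etc[0])
    ready = [0.0] * n_sites            # time at which each site becomes free
    unscheduled = set(range(n_jobs))
    assignment: Dict[int, int] = {}

    def eligible(job: int, site: int) -> bool:
        # Assumed rule for this sketch, not the paper's definition.
        return demand is None or level is None or level[site] >= demand[job]

    while unscheduled:
        best = None                    # (completion_time, job, site)
        for job in sorted(unscheduled):
            for site in range(n_sites):
                if not eligible(job, site):
                    continue
                completion = ready[site] + etc[job][site]
                if best is None or completion < best[0]:
                    best = (completion, job, site)
        if best is None:
            raise ValueError("no eligible site for the remaining jobs")
        completion, job, site = best
        assignment[job] = site         # commit the earliest-finishing job
        ready[site] = completion
        unscheduled.remove(job)
    return assignment, max(ready)


if __name__ == "__main__":
    etc = [[3, 5], [2, 4], [6, 1]]     # made-up run times, 3 jobs x 2 sites
    print(min_min(etc))
    print(min_min(etc, demand=[0.9, 0.2, 0.2], level=[0.5, 0.95]))
```

With the made-up inputs above, restricting job 0 to the one site that meets its demand lengthens the schedule, which is the kind of trade-off between waiting for a trusted resource and running on an untrusted one that the abstract discusses.
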
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Song, S | en_HK |
dc.contributor.author | Hwang, K | en_HK |
dc.contributor.author | Kwok, YK | en_HK |
dc.date.accessioned | 2007-10-30T06:09:54Z | - |
dc.date.available | 2007-10-30T06:09:54Z | - |
dc.date.issued | 2006 | en_HK |
dc.identifier.citation | IEEE Transactions on Computers, 2006, v. 55 n. 6, p. 703-719 | en_HK |
dc.identifier.issn | 0018-9340 | en_HK |
dc.identifier.uri | http://hdl.handle.net/10722/44772 | - |
dc.description.abstract | In scheduling a large number of user jobs for parallel execution on an open-resource Grid system, the jobs are subject to system failures or delays caused by infected hardware, software vulnerability, and distrusted security policy. This paper models the risk and insecure conditions in Grid job scheduling. Three risk-resilient strategies, preemptive, replication, and delay-tolerant, are developed to provide security assurance. We propose six risk-resilient scheduling algorithms to assure secure Grid job execution under different risky conditions. We report the simulated Grid performances of these new Grid job scheduling algorithms under the NAS and PSA workloads. The relative performance is measured by the total job makespan, Grid resource utilization, job failure rate, slowdown ratio, replication overhead, etc. In addition to extending from known scheduling heuristics, we developed a new space-time genetic algorithm (STGA) based on faster searching and protected chromosome formation. Our simulation results suggest that, in a wide-area Grid environment, it is more resilient for the global job scheduler to tolerate some job delays instead of resorting to preemption or replication or taking a risk on unreliable resources allocated. We find that delay-tolerant Min-Min and STGA job scheduling have 13-23 percent higher performance than using risky or preemptive or replicated algorithms. The resource overheads for replicated job scheduling are kept at a low 15 percent. The delayed job execution is optimized with a delay factor, which is 20 percent of the total makespan. A Kiviat graph is proposed for demonstrating the quality of Grid computing services. These risk-resilient job scheduling schemes can upgrade Grid performance significantly at only a moderate increase in extra resources or scheduling delays in a risky Grid computing environment. © 2006 IEEE. | en_HK |
dc.format.extent | 4735331 bytes | - |
dc.format.extent | 2141 bytes | - |
dc.format.mimetype | application/pdf | - |
dc.format.mimetype | text/plain | - |
dc.language | eng | en_HK |
dc.publisher | IEEE. The journal's website is located at http://www.computer.org/tc | en_HK |
dc.relation.ispartof | IEEE Transactions on Computers | en_HK |
dc.rights | ©2006 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE. | - |
dc.subject | Distributed supercomputing | en_HK |
dc.subject | Genetic algorithm | en_HK |
dc.subject | Grid computing | en_HK |
dc.subject | Job scheduling heuristics | en_HK |
dc.subject | NAS and PSA benchmarks | en_HK |
dc.subject | Performance metrics | en_HK |
dc.subject | Replication scheduling | en_HK |
dc.subject | Risk resilience | en_HK |
dc.title | Risk-resilient heuristics and genetic algorithms for security-assured grid job scheduling | en_HK |
dc.type | Article | en_HK |
dc.identifier.openurl | http://library.hku.hk:4550/resserv?sid=HKU:IR&issn=0018-9340&volume=55&issue=6&spage=703&epage=719&date=2006&atitle=Risk-resilient+heuristics+and+genetic+algorithms+for+security-assured+grid+job+scheduling | en_HK |
dc.identifier.email | Kwok, YK:ykwok@eee.hku.hk | en_HK |
dc.identifier.authority | Kwok, YK=rp00128 | en_HK |
dc.description.nature | published_or_final_version | en_HK |
dc.identifier.doi | 10.1109/TC.2006.89 | en_HK |
dc.identifier.scopus | eid_2-s2.0-33646486528 | en_HK |
dc.relation.references | http://www.scopus.com/mlt/select.url?eid=2-s2.0-33646486528&selection=ref&src=s&origin=recordpage | en_HK |
dc.identifier.volume | 55 | en_HK |
dc.identifier.issue | 6 | en_HK |
dc.identifier.spage | 703 | en_HK |
dc.identifier.epage | 719 | en_HK |
dc.identifier.isi | WOS:000236929800005 | - |
dc.publisher.place | United States | en_HK |
dc.identifier.scopusauthorid | Song, S=8875389000 | en_HK |
dc.identifier.scopusauthorid | Hwang, K=7402426691 | en_HK |
dc.identifier.scopusauthorid | Kwok, YK=7101857718 | en_HK |
dc.identifier.issnl | 0018-9340 | - |
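
The abstract lists total job makespan, Grid resource utilization, job failure rate, and slowdown ratio among the performance measures, without defining them in this record. The sketch below computes these quantities from per-job timing records using commonly used definitions. The `JobRecord` type, the `summarize` function, the assumption that each job occupies exactly one site, and the choice of slowdown as total response time divided by total execution time are all hypothetical choices for this example and may differ from the paper's definitions.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class JobRecord:
    arrival: float   # time the job was submitted
    start: float     # time execution began
    finish: float    # time execution ended (or was aborted)
    failed: bool     # True if the job did not complete successfully


def summarize(jobs: List[JobRecord], n_sites: int) -> Dict[str, float]:
    """Aggregate simple schedule metrics, with time measured from 0."""
    makespan = max(j.finish for j in jobs)
    busy = sum(j.finish - j.start for j in jobs)        # total execution time
    response = sum(j.finish - j.arrival for j in jobs)  # waiting + execution
    return {
        "makespan": makespan,
        "utilization": busy / (makespan * n_sites),
        "failure_rate": sum(j.failed for j in jobs) / len(jobs),
        "slowdown_ratio": response / busy,
    }


if __name__ == "__main__":
    jobs = [
        JobRecord(arrival=0, start=0, finish=4, failed=False),
        JobRecord(arrival=0, start=4, finish=8, failed=False),
        JobRecord(arrival=2, start=2, finish=6, failed=True),
    ]
    print(summarize(jobs, n_sites=2))
```
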