File Download
 
 
Supplementary

Postgraduate Thesis: Move my data to the cloud: an online cost-minimizing approach
  • Basic View
  • Metadata View
  • XML View
TitleMove my data to the cloud: an online cost-minimizing approach
 
AuthorsZhang, Linquan
张琳泉
 
Issue Date2012
 
PublisherThe University of Hong Kong (Pokfulam, Hong Kong)
 
AbstractCloud computing has rapidly emerged as a new computation paradigm, providing agile and scalable resource access in a utility-like fashion. Processing of massive amounts of data has been a primary usage of the clouds in practice. While many efforts have been devoted to designing the computation models (e.g., MapReduce), one important issue has been largely neglected in this respect: how do we efficiently move the data, practically generated from different geographical locations over time, into a cloud for effective processing? The usual approach of shipping data using hard disks lacks flexibility and security. As the first dedicated effort, this paper tackles this massive, dynamic data migration issue. Targeting a cloud encompassing disparate data centers of different resource charges, we model the cost-minimizing data migration problem, and propose efficient offline and online algorithms, which optimize the routes of data into the cloud and the choice of the data center to aggregate the data for processing, at any give time. Three online algorithms are proposed to practically guide data migration over time. With no need of any future information on the data generation pattern, an online lazy migration (OLM) algorithm achieves a competitive ratio as low as 2:55 under typical system settings, and a work function algorithm (WFA) has a linear 2K-1 (K is the number of data centers) competitive ratio. The rest one randomized fixed horizon control algorithm (RFHC) achieves 1+ 1/(l+1 ) κ/λ competitive ratio in theory with a lookahead window of l into the future, where κ and λ are protocol parameters. We conduct extensive experiments to evaluate our online algorithms, using real-world meteorological data generation traces, under realistic cloud settings. Comparisons among online and offline algorithms show a close-to-offline-optimum performance and demonstrate the effectiveness of our online algorithms in practice.
 
AdvisorsWu, C
Lau, FCM
 
DegreeMaster of Philosophy
 
SubjectCloud computing.
Database management.
Electronic data processing.
 
Dept/ProgramComputer Science
 
DC FieldValue
dc.contributor.advisorWu, C
 
dc.contributor.advisorLau, FCM
 
dc.contributor.authorZhang, Linquan
 
dc.contributor.author张琳泉
 
dc.date.hkucongregation2012
 
dc.date.issued2012
 
dc.description.abstractCloud computing has rapidly emerged as a new computation paradigm, providing agile and scalable resource access in a utility-like fashion. Processing of massive amounts of data has been a primary usage of the clouds in practice. While many efforts have been devoted to designing the computation models (e.g., MapReduce), one important issue has been largely neglected in this respect: how do we efficiently move the data, practically generated from different geographical locations over time, into a cloud for effective processing? The usual approach of shipping data using hard disks lacks flexibility and security. As the first dedicated effort, this paper tackles this massive, dynamic data migration issue. Targeting a cloud encompassing disparate data centers of different resource charges, we model the cost-minimizing data migration problem, and propose efficient offline and online algorithms, which optimize the routes of data into the cloud and the choice of the data center to aggregate the data for processing, at any give time. Three online algorithms are proposed to practically guide data migration over time. With no need of any future information on the data generation pattern, an online lazy migration (OLM) algorithm achieves a competitive ratio as low as 2:55 under typical system settings, and a work function algorithm (WFA) has a linear 2K-1 (K is the number of data centers) competitive ratio. The rest one randomized fixed horizon control algorithm (RFHC) achieves 1+ 1/(l+1 ) κ/λ competitive ratio in theory with a lookahead window of l into the future, where κ and λ are protocol parameters. We conduct extensive experiments to evaluate our online algorithms, using real-world meteorological data generation traces, under realistic cloud settings. Comparisons among online and offline algorithms show a close-to-offline-optimum performance and demonstrate the effectiveness of our online algorithms in practice.
 
dc.description.naturepublished_or_final_version
 
dc.description.thesisdisciplineComputer Science
 
dc.description.thesislevelmaster's
 
dc.description.thesisnameMaster of Philosophy
 
dc.identifier.hkulb4833014
 
dc.languageeng
 
dc.publisherThe University of Hong Kong (Pokfulam, Hong Kong)
 
dc.relation.ispartofHKU Theses Online (HKUTO)
 
dc.rightsThe author retains all proprietary rights, (such as patent rights) and the right to use in future works.
 
dc.rightsCreative Commons: Attribution 3.0 Hong Kong License
 
dc.source.urihttp://hub.hku.hk/bib/B48330140
 
dc.subject.lcshCloud computing.
 
dc.subject.lcshDatabase management.
 
dc.subject.lcshElectronic data processing.
 
dc.titleMove my data to the cloud: an online cost-minimizing approach
 
dc.typePG_Thesis
 
<?xml encoding="utf-8" version="1.0"?>
<item><contributor.advisor>Wu, C</contributor.advisor>
<contributor.advisor>Lau, FCM</contributor.advisor>
<contributor.author>Zhang, Linquan</contributor.author>
<contributor.author>&#24352;&#29747;&#27849;</contributor.author>
<date.issued>2012</date.issued>
<description.abstract>&#65279;Cloud computing has rapidly emerged as a new computation paradigm, providing agile and scalable resource access in a utility-like fashion. Processing of massive amounts of data has been a primary usage of the clouds in practice. While many efforts have been devoted to designing the computation models (e.g., MapReduce), one important issue has been largely neglected in this respect: how do we efficiently move the data, practically generated from different geographical locations over time, into a cloud for effective processing? The usual approach of shipping data using hard disks lacks flexibility and security. As the first dedicated effort, this paper tackles this massive, dynamic data migration issue. Targeting a cloud encompassing disparate data centers of different resource charges, we model the cost-minimizing data migration problem, and propose efficient offline and online algorithms, which optimize the routes of data into the cloud and the choice of the data center to aggregate the data for processing, at any give time. Three online algorithms are proposed to practically guide data migration over time. With no need of any future information on the data generation pattern, an online lazy migration (OLM) algorithm achieves a competitive ratio as low as 2:55 under typical system settings, and a work function algorithm (WFA) has a linear 2K-1 (K is the number of data centers) competitive ratio. The rest one randomized fixed horizon control algorithm (RFHC) achieves 1+  1/(l+1 ) &#954;/&#955; competitive ratio in theory with a lookahead window of l into the future, where &#954; and &#955; are protocol parameters. We conduct extensive experiments to evaluate our online algorithms, using real-world meteorological data generation traces, under realistic cloud settings. Comparisons among online and offline algorithms show a close-to-offline-optimum performance and demonstrate the effectiveness of our online algorithms in practice.</description.abstract>
<language>eng</language>
<publisher>The University of Hong Kong (Pokfulam, Hong Kong)</publisher>
<relation.ispartof>HKU Theses Online (HKUTO)</relation.ispartof>
<rights>The author retains all proprietary rights, (such as patent rights) and the right to use in future works.</rights>
<rights>Creative Commons: Attribution 3.0 Hong Kong License</rights>
<source.uri>http://hub.hku.hk/bib/B48330140</source.uri>
<subject.lcsh>Cloud computing.</subject.lcsh>
<subject.lcsh>Database management.</subject.lcsh>
<subject.lcsh>Electronic data processing.</subject.lcsh>
<title>Move my data to the cloud: an online cost-minimizing approach</title>
<type>PG_Thesis</type>
<identifier.hkul>b4833014</identifier.hkul>
<description.thesisname>Master of Philosophy</description.thesisname>
<description.thesislevel>master&apos;s</description.thesislevel>
<description.thesisdiscipline>Computer Science</description.thesisdiscipline>
<description.nature>published_or_final_version</description.nature>
<date.hkucongregation>2012</date.hkucongregation>
<bitstream.url>http://hub.hku.hk/bitstream/10722/173884/1/FullText.pdf</bitstream.url>
</item>