File Download
  Links for fulltext
     (May Require Subscription)
Supplementary

Conference Paper: TRIPOD: an efficient, highly-available Cluster Management System

TitleTRIPOD: an efficient, highly-available Cluster Management System
Authors
Issue Date2016
PublisherACM.
Citation
The 7th ACM SIGOPS Asia-Pacific Workshop on Systems (APSys 2016), Hong Kong, China, 4-5 August 2016. In Conference Proceedings, 2016, p. 1-9 How to Cite?
AbstractDriven by the increasing computational demands, cluster management systems (e.g., MESOS) are already pervasive for deploying many applications. Unfortunately, despite much effort, existing systems are still difficult to meet the high requirements of critical applications (e.g., trading and military applications), because these applications naturally require high-availability and low performance overhead in deployments. Existing systems typically replicate their job controllers so that these controllers can be highly-available and thus they can handle application failures. However, applications themselves are still often a single point of failure, leaving arbitrary unavailable time windows for themselves. This paper proposes the design of TRIPOD, a cluster management system that automatically provides high availability to general applications. TRIPOD’s key to make applications achieve high-availability efficiently is a new PAXOS replication protocol that leverages RDMA (Remote Direct Memory Access). TRIPOD runs replicas of the same job with a replicas of controllers, and controllers agree on job requests efficiently with this protocol. Evaluation shows that TRIPOD has low performance overhead in both throughput and response time compared to an application’s unreplicated execution.
DescriptionArticle no. 9
Persistent Identifierhttp://hdl.handle.net/10722/229719
ISBN

 

DC FieldValueLanguage
dc.contributor.authorWang, C-
dc.contributor.authorYang, JY-
dc.contributor.authorYi, N-
dc.contributor.authorCui, H-
dc.date.accessioned2016-08-23T14:12:51Z-
dc.date.available2016-08-23T14:12:51Z-
dc.date.issued2016-
dc.identifier.citationThe 7th ACM SIGOPS Asia-Pacific Workshop on Systems (APSys 2016), Hong Kong, China, 4-5 August 2016. In Conference Proceedings, 2016, p. 1-9-
dc.identifier.isbn978-1-4503-4265-0-
dc.identifier.urihttp://hdl.handle.net/10722/229719-
dc.descriptionArticle no. 9-
dc.description.abstractDriven by the increasing computational demands, cluster management systems (e.g., MESOS) are already pervasive for deploying many applications. Unfortunately, despite much effort, existing systems are still difficult to meet the high requirements of critical applications (e.g., trading and military applications), because these applications naturally require high-availability and low performance overhead in deployments. Existing systems typically replicate their job controllers so that these controllers can be highly-available and thus they can handle application failures. However, applications themselves are still often a single point of failure, leaving arbitrary unavailable time windows for themselves. This paper proposes the design of TRIPOD, a cluster management system that automatically provides high availability to general applications. TRIPOD’s key to make applications achieve high-availability efficiently is a new PAXOS replication protocol that leverages RDMA (Remote Direct Memory Access). TRIPOD runs replicas of the same job with a replicas of controllers, and controllers agree on job requests efficiently with this protocol. Evaluation shows that TRIPOD has low performance overhead in both throughput and response time compared to an application’s unreplicated execution.-
dc.languageeng-
dc.publisherACM.-
dc.relation.ispartofProceedings of the 7th ACM SIGOPS Asia-Pacific Workshop on Systems, APSys '16-
dc.titleTRIPOD: an efficient, highly-available Cluster Management System-
dc.typeConference_Paper-
dc.identifier.emailCui, H: heming@hku.hk-
dc.identifier.authorityCui, H=rp02008-
dc.description.naturelink_to_OA_fulltext-
dc.identifier.doi10.1145/2967360.2967364-
dc.identifier.scopuseid_2-s2.0-84986569448-
dc.identifier.hkuros262858-
dc.identifier.spage1-
dc.identifier.epage9-
dc.publisher.placeUnited States-
dc.customcontrol.immutablesml 160919-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats