File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)

Article: Regret Analysis of Learning-Based MPC With Partially-Unknown Cost Function

TitleRegret Analysis of Learning-Based MPC With Partially-Unknown Cost Function
Authors
KeywordsAdaptation models
Control systems
Cost function
Costs
HVAC
Learning-Based control
Linear systems
model predictive control
non-myopic exploitation
restless bandits
Ventilation
Issue Date1-Jul-2023
PublisherInstitute of Electrical and Electronics Engineers
Citation
IEEE Transactions on Automatic Control, 2023, p. 1-8 How to Cite?
AbstractThe exploration/exploitation trade-off is an inherent challenge in data-driven adaptive control. Though this trade-off has been studied for multi-armed bandits (MAB's) and reinforcement learning for linear systems; it is less well-studied for learning-based control of nonlinear systems. A significant theoretical challenge in the nonlinear setting is that there is no explicit characterization of an optimal controller for a given set of cost and system parameters. We propose the use of a finite-horizon oracle controller with full knowledge of parameters as a reasonable surrogate to optimal controller. This allows us to develop policies in the context of learning-based MPC and MAB's and conduct a control-theoretic analysis using techniques from MPC- and optimization-theory to show these policies achieve low regret with respect to this finite-horizon oracle. Our simulations exhibit the low regret of our policy on a heating, ventilation, and air-conditioning model with partially-unknown cost function.
Persistent Identifierhttp://hdl.handle.net/10722/336546
ISSN
2023 Impact Factor: 6.2
2023 SCImago Journal Rankings: 4.501

 

DC FieldValueLanguage
dc.contributor.authorDogan, I-
dc.contributor.authorShen, ZJM-
dc.contributor.authorAswani, A-
dc.date.accessioned2024-02-16T03:57:37Z-
dc.date.available2024-02-16T03:57:37Z-
dc.date.issued2023-07-01-
dc.identifier.citationIEEE Transactions on Automatic Control, 2023, p. 1-8-
dc.identifier.issn0018-9286-
dc.identifier.urihttp://hdl.handle.net/10722/336546-
dc.description.abstractThe exploration/exploitation trade-off is an inherent challenge in data-driven adaptive control. Though this trade-off has been studied for multi-armed bandits (MAB's) and reinforcement learning for linear systems; it is less well-studied for learning-based control of nonlinear systems. A significant theoretical challenge in the nonlinear setting is that there is no explicit characterization of an optimal controller for a given set of cost and system parameters. We propose the use of a finite-horizon oracle controller with full knowledge of parameters as a reasonable surrogate to optimal controller. This allows us to develop policies in the context of learning-based MPC and MAB's and conduct a control-theoretic analysis using techniques from MPC- and optimization-theory to show these policies achieve low regret with respect to this finite-horizon oracle. Our simulations exhibit the low regret of our policy on a heating, ventilation, and air-conditioning model with partially-unknown cost function.-
dc.languageeng-
dc.publisherInstitute of Electrical and Electronics Engineers-
dc.relation.ispartofIEEE Transactions on Automatic Control-
dc.rightsThis work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.-
dc.subjectAdaptation models-
dc.subjectControl systems-
dc.subjectCost function-
dc.subjectCosts-
dc.subjectHVAC-
dc.subjectLearning-Based control-
dc.subjectLinear systems-
dc.subjectmodel predictive control-
dc.subjectnon-myopic exploitation-
dc.subjectrestless bandits-
dc.subjectVentilation-
dc.titleRegret Analysis of Learning-Based MPC With Partially-Unknown Cost Function-
dc.typeArticle-
dc.identifier.doi10.1109/TAC.2023.3328827-
dc.identifier.scopuseid_2-s2.0-85181807035-
dc.identifier.spage1-
dc.identifier.epage8-
dc.identifier.eissn1558-2523-
dc.identifier.issnl0018-9286-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats