No-Regret Learning in Dynamic Competition with Reference Effects Under Logit Demand

Guo, Mengzi Amy; Ying, Donghao; Shen, Zuojun Max; Lavaei, Javad

File Download

There are no files associated with this item.

Supplementary

Citations:
Appears in Collections:
- Industrial & Manufacturing Systems Engineering: Conference papers
- President's Office: Conference papers

Conference Paper: No-Regret Learning in Dynamic Competition with Reference Effects Under Logit Demand

Title	No-Regret Learning in Dynamic Competition with Reference Effects Under Logit Demand
Authors	Guo, Mengzi Amy Ying, Donghao Shen, Zuojun Max Lavaei, Javad
Issue Date	10-Dec-2023
Abstract	This work is dedicated to the algorithm design in a competitive framework, with the primary goal of learning a stable equilibrium. We consider the dynamic price competition between two firms operating within an opaque marketplace, where each firm lacks information about its competitor. The demand follows the multinomial logit (MNL) choice model, which depends on the consumers' observed price and their reference price, and consecutive periods in the repeated games are connected by reference price updates. We use the notion of stationary Nash equilibrium (SNE), defined as the fixed point of the equilibrium pricing policy for the single-period game, to simultaneously capture the long-run market equilibrium and stability. We propose the online projected gradient ascent algorithm (OPGA), where the firms adjust prices using the first-order derivatives of their log-revenues that can be obtained from the market feedback mechanism. Despite the absence of typical properties required for the convergence of online games, such as strong monotonicity and variational stability, we demonstrate that under diminishing step-sizes, the price and reference price paths generated by OPGA converge to the unique SNE, thereby achieving the no-regret learning and a stable market. Moreover, with appropriate step-sizes, we prove that this convergence exhibits a rate of O(1/t). © 2023 Neural information processing systems foundation.
Persistent Identifier	http://hdl.handle.net/10722/369190

DC Field	Value	Language
dc.contributor.author	Guo, Mengzi Amy	-
dc.contributor.author	Ying, Donghao	-
dc.contributor.author	Shen, Zuojun Max	-
dc.contributor.author	Lavaei, Javad	-
dc.date.accessioned	2026-01-21T00:35:18Z	-
dc.date.available	2026-01-21T00:35:18Z	-
dc.date.issued	2023-12-10	-
dc.identifier.uri	http://hdl.handle.net/10722/369190	-
dc.description.abstract	<p>This work is dedicated to the algorithm design in a competitive framework, with the primary goal of learning a stable equilibrium. We consider the dynamic price competition between two firms operating within an opaque marketplace, where each firm lacks information about its competitor. The demand follows the multinomial logit (MNL) choice model, which depends on the consumers' observed price and their reference price, and consecutive periods in the repeated games are connected by reference price updates. We use the notion of stationary Nash equilibrium (SNE), defined as the fixed point of the equilibrium pricing policy for the single-period game, to simultaneously capture the long-run market equilibrium and stability. We propose the online projected gradient ascent algorithm (OPGA), where the firms adjust prices using the first-order derivatives of their log-revenues that can be obtained from the market feedback mechanism. Despite the absence of typical properties required for the convergence of online games, such as strong monotonicity and variational stability, we demonstrate that under diminishing step-sizes, the price and reference price paths generated by OPGA converge to the unique SNE, thereby achieving the no-regret learning and a stable market. Moreover, with appropriate step-sizes, we prove that this convergence exhibits a rate of O(1/t). © 2023 Neural information processing systems foundation. <br></p>	-
dc.language	eng	-
dc.relation.ispartof	The 37th Conference on Neural Information Processing Systems (10/12/2023-16/12/2023)	-
dc.title	No-Regret Learning in Dynamic Competition with Reference Effects Under Logit Demand	-
dc.type	Conference_Paper	-

File Download

Supplementary

Conference Paper: No-Regret Learning in Dynamic Competition with Reference Effects Under Logit Demand

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats