In this paper we present the extraproximal method for computing the Stackelberg/Nash equilibria in a class of ergodic controlled finite Markov chains games. We exemplify the original game formulation in terms of coupled nonlinear programming problems implementing the Lagrange principle. In addition, Tikhonov’s regularization method is employed to ensure the convergence of the cost-functions to a Stackelberg/Nash equilibrium point. Then, we transform the problem into a system of equations in the proximal format. We present a two-step iterated procedure for solving the extraproximal method: (a) the first step (the extra-proximal step) consists of a “prediction” which calculates the preliminary position approximation to the equilibrium point, and (b) the second step is designed to find a “basic adjustment” of the previous prediction. The procedure is called the “extraproximal method” because of the use of an extrapolation. Each equation in this system is an optimization problem for which the necessary and efficient condition for a minimum is solved using a quadratic programming method. This solution approach provides a drastically quicker rate of convergence to the equilibrium point. We present the analysis of the convergence as well the rate of convergence of the method, which is one of the main results of this paper. Additionally, the extraproximal method is developed in terms of Markov chains for Stackelberg games. Our goal is to analyze completely a three-player Stackelberg game consisting of a leader and two followers. We provide all the details needed to implement the extraproximal method in an efficient and numerically stable way. For instance, a numerical technique is presented for computing the first step parameter (λ) of the extraproximal method. The usefulness of the approach is successfully demonstrated by a numerical example related to a pricing oligopoly model for airlines companies.
Keywords
- extraproximal method
- Stackelberg games
- convergence analysis
- Markov chains
- implementation
A Multi–Source Fluid Queue Based Stochastic Model of the Probabilistic Offloading Strategy in a MEC System With Multiple Mobile Devices and a Single MEC Server Hybrid Cryptography with a One–Time Stamp to Secure Contact Tracing for COVID–19 Infection Global Stability of Discrete–Time Feedback Nonlinear Systems with Descriptor Positive Linear Parts and Interval State Matrices Template Chart Detection for Stoma Telediagnosis Fast and Smooth Trajectory Planning for a Class of Linear Systems Based on Parameter and Constraint Reduction Sensor Location for Travel Time Estimation Based on the User Equilibrium Principle: Application of Linear Equations Exact and Approximation Algorithms for Sensor Placement Against DDoS Attacks A Feasible Schedule for Parallel Assembly Tasks in Flexible Manufacturing Systems Non–Standard Analysis Revisited: An Easy Axiomatic Presentation Oriented Towards Numerical Applications A Data Association Model for Analysis of Crowd Structure A Comprehensive Study of Clustering a Class of 2D Shapes Performance Analysis of a Dual Stage Deep Rain Streak Removal Convolution Neural Network Module with a Modified Deep Residual Dense Network