Deep reinforcement learning-based approach for control of Two Input–Two Output process control system
und
01. Juli 2025
Über diesen Artikel
Artikel-Kategorie: Research Article
Online veröffentlicht: 01. Juli 2025
Eingereicht: 01. März 2025
DOI: https://doi.org/10.2478/ijssis-2025-0029
Schlüsselwörter
© 2025 Anil Kadu et al., published by Sciendo
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
Figure 1:

Figure 2:

Figure 3:

Figure 4:

Figure 5:

Figure 6:

Figure 7:

Figure 8:

Figure 9:

Figure 10:

Figure 11:

Figure 12:

Figure 13:

Figure 14:

Figure 15:

Figure 16:

Figure 17:

Parameters for configuration of DDPG agent
Discount factor (γ) | Future reward discounting | 0.99 |
Target smooth factor (τ) | Target network update rate | 0.001 |
Actor learning rate | Learning rate for actor updates | 0.0001 |
Critic learning rate | Learning rate for critic updates | 0.001 |
Mini-batch size | Sample size for experience replay | 64 |
Experience buffer length | Total memory for experience replay | 1,000,000 |
Analogy of the traditional system with DRL principles
Agent | Controller | Decides the actions to control the system. |
Environment | Plant/process | The system is being controlled. |
State | System measurements | Information about the system’s current status. |
Action | Control input | Adjustments made to influence the process. |
Reward | Error feedback | Guides the agent to improve performance. |
Policy | Control law | Strategy linking states to optimal actions. |
Performance indices of Loop 2
DDPG | 137.7 | 79.13 | 3.217e + 04 | 1.707e + 04 | 0 | 42 | 0 |
NDT[PI] | 122.1 | 82.69 | 4.434e + 04 | 2.515e + 04 | 60 | 110 | 0 |
Mvall [PI] | 510.3 | 275.3 | 2.228e + 05 | 1.305e + 04 | 0 | 380 | 0 |
Wang et al [PID] | 81.82 | 61.27 | 2.947e + 04 | 1.856e + 04 | 30 | 85 | 0 |
Performance indices of Loop 1
DDPG | 18.31 | 29.92 | 722.9 | 3325 | 35 | 48 | 0 |
NDT [PI] | 26.82 | 39.9 | 6631 | 1.032e + 04 | 25 | 100 | 0 |
Mvall [PI] | 34.61 | 47.25 | 488.3 | 1880 | 0 | 150 | 0 |
Wang et al [PID] | 16.26 | 24.82 | 3206 | 6517 | 20 | 53 | 0 |