Advancing Large Language Model Agent via Iterative Contrastive Trajectory Optimization
Published online: 31 Dec 2024
Pages: 19 - 27
DOI: https://doi.org/10.2478/ijanmc-2024-0033
© 2024 Chengang Jing et al., published by Sciendo
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
Recent advancements in Large Language Models (LLMs) have expanded their application across a variety of tasks. However, open-source LLMs often fail to match the efficiency of proprietary models. To address this gap, we propose Iterative Contrastive Trajectory Optimization (ICTO), a novel framework designed to enhance the task-solving capabilities of LLM-based agents. ICTO enables iterative learning from both successful and failed task trajectories, using a Partially Observable Markov Decision Process (POMDP) formulation to provide step-level guidance. Experimental results demonstrate that ICTO improves task-solving efficiency by 12.4% and generalization ability by 15.7% over baseline models. The framework not only enhances the performance of open-source LLMs but also shows promise for broader application in autonomous learning environments.
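The abstract does not spell out implementation details, but the core loop it describes (roll out an agent, separate successful from failed trajectories, and use step-level contrasts to refine the policy) can be illustrated with a minimal sketch. The following Python is an assumption-laden toy, not the paper's method: the stub policy, the table-based scores, and all names such as rollout, build_contrastive_pairs, and update_policy are hypothetical placeholders standing in for an LLM agent and a preference-style training step.

```python
import random
from dataclasses import dataclass, field

@dataclass
class Step:
    observation: str
    action: str
    reward: float = 0.0          # hypothetical step-level signal from the POMDP view of the task

@dataclass
class Trajectory:
    steps: list = field(default_factory=list)
    success: bool = False

def rollout(policy, task, max_steps=5):
    """Collect one trajectory by letting the (stub) policy act on the task."""
    traj = Trajectory()
    obs = task["initial_observation"]
    for _ in range(max_steps):
        action = policy(obs)
        reward = 1.0 if action == task["correct_action"] else 0.0
        traj.steps.append(Step(obs, action, reward))
        if reward > 0:
            traj.success = True
            break
        obs = f"{obs} | tried {action}"
    return traj

def build_contrastive_pairs(successes, failures):
    """Pair steps from successful and failed trajectories that start from the same observation."""
    pairs = []
    for pos in successes:
        for neg in failures:
            for s_step, f_step in zip(pos.steps, neg.steps):
                if s_step.observation.split(" | ")[0] == f_step.observation.split(" | ")[0]:
                    pairs.append((s_step, f_step))
    return pairs

def update_policy(policy_table, pairs, lr=0.1):
    """Placeholder contrastive update: raise the preferred action's score, lower the rejected one's."""
    for chosen, rejected in pairs:
        policy_table[chosen.action] = policy_table.get(chosen.action, 0.0) + lr
        policy_table[rejected.action] = policy_table.get(rejected.action, 0.0) - lr

# --- toy driver (stands in for an LLM agent interacting with a real environment) ---
task = {"initial_observation": "find the file", "correct_action": "search"}
policy_table = {}

def stub_policy(obs):
    # Mostly exploit the highest-scored action, otherwise explore at random.
    actions = ["search", "open", "quit"]
    if policy_table and random.random() < 0.8:
        return max(actions, key=lambda a: policy_table.get(a, 0.0))
    return random.choice(actions)

for iteration in range(3):                      # iterative refinement over rounds
    trajs = [rollout(stub_policy, task) for _ in range(8)]
    successes = [t for t in trajs if t.success]
    failures = [t for t in trajs if not t.success]
    pairs = build_contrastive_pairs(successes, failures)
    update_policy(policy_table, pairs)
    print(f"iter {iteration}: {len(successes)}/8 successful, {len(pairs)} contrastive pairs")
```

In this sketch the contrastive pairing of aligned steps is what supplies "step-level guidance"; in the actual framework that role would presumably be played by the POMDP-based credit assignment and an LLM fine-tuning objective rather than a score table.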