跳到主要导航 跳到搜索 跳到主要内容

AdamRAG: Adaptive Algorithm with Ravine Method for Training Deep Neural Networks

  • Yifan Zhang
  • , Di Zhao
  • , Hongyi Li*
  • , Chengwei Pan*
  • *此作品的通讯作者
  • Beihang University
  • Key Laboratory of Precision Opto-Mechatronics Technology (Ministry of Education)

科研成果: 期刊稿件文章同行评审

摘要

Adaptive optimization algorithms, such as Adam, are widely employed in deep learning. However, because they primarily rely on learning rate adjustments, a trade-off often exists between optimization stability and generalization capability. To address this issue, we propose AdamRAG, a novel optimization algorithm that integrates adaptive methods with Ravine acceleration and momentum techniques, aiming to preserve the stability of adaptive algorithms while enhancing their generalization performance. Within the adaptive framework, AdamRAG introduces extrapolation steps based on Ravine acceleration, which not only accelerate convergence but also prevent the iterative process from becoming trapped in local saddle points, thereby boosting generalization. Simultaneously, the momentum method is employed to regulate the descent step sizes, further improving the algorithm’s stability. Theoretical analysis demonstrates that AdamRAG achieves sublinear convergence in non-convex optimization scenarios. Extensive experiments across tasks such as image classification, natural language processing, and reinforcement learning validate its effectiveness, with results indicating that AdamRAG outperforms established optimizers (e.g., NAG, Adam, Lion) in terms of both convergence speed and generalization performance. Furthermore, sensitivity analysis shows that AdamRAG exhibits greater robustness to variations in learning rate, significantly reducing the need for hyperparameter tuning. These findings suggest that by integrating Ravine acceleration, adaptive methods, and momentum techniques, AdamRAG effectively mitigates the trade-off between stability and generalization, providing an efficient and robust optimization tool for deep learning applications.

源语言英语
文章编号53
期刊Neural Processing Letters
57
3
DOI
出版状态已出版 - 6月 2025

指纹

探究 'AdamRAG: Adaptive Algorithm with Ravine Method for Training Deep Neural Networks' 的科研主题。它们共同构成独一无二的指纹。

引用此