GRAIL: Enhanced Reinforcement Learning for Mathematical Reasoning in LLMs4. June 20264. July 2026AI ModelsGRAIL uses gradient activation saliency to train relevant reasoning steps more strongly than irrelevant tokens, achieving 3.60% accuracy improvement without separate process-level supervision. Share on: