Post-training reinforcement strategy