News: ArXiv AI Papers 2026-05-12

On Distinguishing Capability Elicitation from Capability Creation in Post-Training: A Free-Energy Perspective

arXiv:2605.08368v1 Announce Type: new Abstract: Debates about large language model post-training often treat supervised fine-tuning (SFT) as imitation and reinforcement learning (RL) as discovery. But this distinction is too coarse. What matters is whether a training procedure increases the probability of behaviors the pretrained model could already produce, or whether it changes what the model ca…
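The abstract's core distinction can be sketched as a probability-threshold check: did post-training amplify a behavior the base model could already produce, or introduce one that was effectively absent? The following toy is illustrative only; `classify_shift` and its `floor` threshold are invented here and are not the paper's actual criterion.

```python
def classify_shift(p_base: float, p_post: float, floor: float = 1e-6) -> str:
    """Classify a post-training change for one behavior (toy illustration).

    p_base: probability the pretrained model assigns to the behavior.
    p_post: probability the post-trained model assigns to it.
    floor:  below this, the behavior is treated as effectively absent.
    """
    if p_post <= p_base:
        return "no gain"
    if p_base >= floor:
        # Behavior was already reachable; post-training amplified it.
        return "elicitation"
    # Behavior was (effectively) absent before post-training.
    return "creation"


# A behavior the base model could already produce, now amplified:
print(classify_shift(p_base=1e-3, p_post=0.4))  # elicitation
# A behavior with negligible base probability that appears after training:
print(classify_shift(p_base=1e-9, p_post=0.2))  # creation
```

In practice such probabilities would come from scoring sampled outputs under each model; the hard part, which the paper presumably addresses, is choosing a principled threshold rather than an arbitrary `floor`.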
