arXiv:2605.08354v1 Announce Type: new Abstract: Aligning multimodal generative models with human preferences demands reward signals that respect the compositional, multi-dimensional structure of human judgment. Prevailing RLHF approaches reduce this structure to scalar or pairwise labels, collapsing nuanced preferences into opaque parametric proxies and exposing vulnerabilities to reward hacking.
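To make concrete the reduction the abstract critiques, below is a minimal sketch (not the paper's method) of the standard pairwise Bradley-Terry reward-model objective used in prevailing RLHF pipelines: every dimension of a human judgment is compressed into a single scalar reward, and preferences enter only through the margin between two scalars. The names `ToyRewardModel` and `pairwise_loss`, the feature dimensions, and the synthetic data are all hypothetical.

```python
# Illustrative sketch of scalar/pairwise reward modeling in RLHF (Bradley-Terry).
# Hypothetical names and data; assumes PyTorch is available.
import torch
import torch.nn as nn
import torch.nn.functional as F

class ToyRewardModel(nn.Module):
    """Maps a feature vector for a (prompt, response) pair to one scalar reward."""
    def __init__(self, dim: int = 16):
        super().__init__()
        self.head = nn.Linear(dim, 1)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # All aspects of human judgment are collapsed into this single number.
        return self.head(x).squeeze(-1)

def pairwise_loss(model: nn.Module,
                  chosen: torch.Tensor,
                  rejected: torch.Tensor) -> torch.Tensor:
    # Bradley-Terry objective: maximize log sigma(r(chosen) - r(rejected)),
    # i.e. the preference label is reduced to a margin between two scalars.
    return -F.logsigmoid(model(chosen) - model(rejected)).mean()

if __name__ == "__main__":
    torch.manual_seed(0)
    model = ToyRewardModel()
    chosen, rejected = torch.randn(8, 16), torch.randn(8, 16)  # synthetic features
    loss = pairwise_loss(model, chosen, rejected)
    loss.backward()
    print(f"pairwise Bradley-Terry loss: {loss.item():.4f}")
```

Because the learned proxy r(x) is a single opaque scalar, a policy optimized against it can exploit directions the proxy over-weights, which is the reward-hacking vulnerability the abstract refers to.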