News: ArXiv AI Papers, 2026-02-25

CausalReasoningBenchmark: A Real-World Benchmark for Disentangled Evaluation of Causal Identification and Estimation

arXiv:2602.20571v1 (announce type: new)

Abstract: Many benchmarks for automated causal inference evaluate a system's performance based on a single numerical output, such as an Average Treatment Effect (ATE). This approach conflates two distinct steps in causal analysis: identification, which means formulating a valid research design under stated assumptions, and estimation, which means implementing that design numerically on


Tags: #research #ArXiv AI Papers
