资讯 ArXiv AI Papers 2026-03-02

HumanMCP: A Human-Like Query Dataset for Evaluating MCP Tool Retrieval Performance

arXiv:2602.23367v1 Announce Type: new Abstract: Model Context Protocol (MCP) servers contain a collection of thousands of open-source standardized tools, linking LLMs to external systems; however, existing datasets and benchmarks lack realistic, human-like user queries, remaining a critical gap in evaluating the tool usage and ecosystems of MCP servers. Existing datasets often do contain tool desc

1 0
分享:

暂无详细内容

讨论

发表评论

0/2000
...
= ?