RaTA-Tool: Retrieval-based Tool Selection with Multimodal Large Language Models

RaTA-Tool introduces a retrieval-based framework enabling multimodal large language models to select and invoke external tools from open-world settings, moving beyond text-only, closed-world tool-use approaches that struggle with unseen APIs and diverse input modalities.
MentionsRaTA-Tool · Multimodal Large Language Models · Large Language Models
Read full story at arXiv cs.CL →(arxiv.org)
Modelwire summarizes — we don’t republish. The full article lives on arxiv.org. If you’re a publisher and want a different summarization policy for your work, see our takedown page.