Modelwire
Subscribe

RaTA-Tool: Retrieval-based Tool Selection with Multimodal Large Language Models

RaTA-Tool introduces a retrieval-based framework enabling multimodal large language models to select and invoke external tools from open-world settings, moving beyond text-only, closed-world tool-use approaches that struggle with unseen APIs and diverse input modalities.

MentionsRaTA-Tool · Multimodal Large Language Models · Large Language Models

Modelwire summarizes — we don’t republish. The full article lives on arxiv.org. If you’re a publisher and want a different summarization policy for your work, see our takedown page.

RaTA-Tool: Retrieval-based Tool Selection with Multimodal Large Language Models · Modelwire