Monday, March 10, 2025

Software invocation rewriting for zero-shot device retrieval

Augmenting giant language fashions (LLMs) with exterior instruments, somewhat than relying solely on their inside information, might unlock their potential to unravel tougher issues. Widespread approaches for such “device studying” fall into two classes: (1) supervised strategies to generate device calling features, or (2) in-context studying, which makes use of device paperwork that describe the supposed device utilization together with few-shot demonstrations. Software paperwork present directions on device’s functionalities and how one can invoke it, permitting LLMs to grasp the person instruments.

Nonetheless, these strategies face sensible challenges when scaling to a lot of instruments. First, they undergo from enter token limits. It’s inconceivable to feed your entire listing of instruments inside a single immediate, and, even when it have been potential, LLMs nonetheless usually battle to successfully course of related info from prolonged enter contexts. Second, the pool of instruments is evolving. LLMs are sometimes paired with a retriever skilled on labeled question–device pairs to advocate a shortlist of instruments. Nonetheless, the best LLM toolkit ought to be huge and dynamic, with instruments present process frequent updates. Offering and sustaining labels to coach a retriever for such an in depth and evolving toolset can be impractical. Lastly, one should deal with ambiguous consumer intents. Person context within the queries might obfuscate the underlying intents, and failure to establish them might result in calling the inaccurate instruments.

In “Re-Invoke: Software Invocation Rewriting for Zero-Shot Software Retrieval”, introduced at EMNLP 2024, we introduce a novel unsupervised retrieval methodology particularly designed for device studying to handle these distinctive challenges. Re-Invoke leverages LLMs for each device doc enrichment and consumer intent extraction to reinforce device retrieval efficiency throughout numerous use circumstances. We show that the proposed Re-Invoke methodology constantly and considerably improves upon the baselines protecting each single- and multi-tool retrieval duties on device use benchmark datasets.

Related Articles

LEAVE A REPLY

Please enter your comment!
Please enter your name here

Latest Articles