Scale to Thousands of Tools with Semantic Search
Implement local semantic search using ONNX Runtime and FusionCache to dynamically load tools on demand, overcoming LLM context limits.
Tag
1 post with this tag.