Episode

SearchSwarm: Towards Delegation Intelligence in Agentic LLMs for Long-Horizon Deep Research

Podcast
Daily Paper Cast
Published
Jun 11, 2026
Duration seconds
1437
Processing state
not_requested
Canonical source
https://share.transistor.fm/s/6a886f59
Audio
https://media.transistor.fm/6a886f59/fb984af7.mp3
JSON
/v1/public/podcasts/daily-paper-cast-7079649/episodes/searchswarm-towards-delegation-intelligence-in-agentic-llms-for-long-horizon-deep-research
Markdown
/podcast/daily-paper-cast-7079649/searchswarm-towards-delegation-intelligence-in-agentic-llms-for-long-horizon-deep-research.md

Actions

  • POST https://stenobird.com/v1/public/podcasts/daily-paper-cast-7079649/episodes/searchswarm-towards-delegation-intelligence-in-agentic-llms-for-long-horizon-deep-research/transcription-requests
    Idempotently request low-priority transcript generation for this episode.
  • GET https://stenobird.com/podcast/daily-paper-cast-7079649/searchswarm-towards-delegation-intelligence-in-agentic-llms-for-long-horizon-deep-research.md
    Read the agent-friendly Markdown representation of this episode resource.

Summary

🤗 Upvotes: 46 | cs.AI Authors: Pu Ning, Quan Chen, Kun Tao, Xinyu Tang, Tianshu Wang, Qianggang Cao, Xinyu Kong, Zujie Wen, Zhiqiang Zhang, Jun Zhou Title: SearchSwarm: Towards Delegation Intelligence in Agentic LLMs for Long-Horizon Deep Research Arxiv: http://arxiv.org/abs/2606.09730v1 Abstract: Large language models are increasingly expected to handle complex, long-horizon real-world tasks whose context demands can grow without bound, yet model context windows remain inherently finite. Recent work explores a paradigm where a main agent decomposes tasks and dispatches subtasks to subagents, which execute and return only summarized results, conserving the main agent's context budget. However, performing this well requires delegation intelligence: the ability to decompose complex tasks, determine when and what to delegate, and integrate returned results into the ongoing workflow. Training data for this capability is scarce in naturally occurring text, and to our knowledge, how to synthesize such data and train models to acquire this capability remains largely unexplored in the open-source community. To bridge this gap, we present a preliminary exploration targeting deep research, a representative long-horizon agent task. Specifically, we design a harness that guides the model toward high-quality task decomposition and delegation, while constraining subagents to return results properly to support the main agent's workflow. The harness-guided trajectories naturally encode correct delegation decisions, which we use as supervised fine-tuning data to internalize delegation intelligence into model weights. Our resulting model, SearchSwarm-30B-A3B, achieves 68.1 on BrowseComp and 73.3 on BrowseComp-ZH, the best results among all models of comparable scale. We will release our…