arXiv:2403.03870 [cs.CL]

Learning to Decode Collaboratively with Multiple Language Models

Shannon Zejiang Shen, Hunter Lang, Bailin Wang, Yoon Kim, David Sontag

Published 2024-03-06, updated 2024-08-27 (version 2)

We propose a method to teach multiple large language models (LLMs) to collaborate by interleaving their generations at the token level. We model the decision of which LLM generates the next token as a latent variable. By optimizing the marginal likelihood of a training set under our latent variable model, the base LLM automatically learns when to generate itself and when to call on one of the "assistant" language models to generate, all without direct supervision. Token-level collaboration during decoding allows for a fusion of each model's expertise in a manner tailored to the specific task at hand. Our collaborative decoding is especially useful in cross-domain settings where a generalist base LLM learns to invoke domain expert models. On instruction-following, domain-specific QA, and reasoning tasks, we show that the performance of the joint system exceeds that of the individual models. Through qualitative analysis of the learned latent decisions, we show that models trained with our method exhibit several interesting collaboration patterns, e.g., template-filling. Our code is available at https://github.com/clinicalml/co-llm.
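The latent-variable idea in the abstract can be illustrated with a toy sketch. It is not the authors' implementation (see the linked repository for that). It assumes the usual form of a marginal over a discrete latent choice z: the probability of the next token is the gate-weighted sum, over models, of each model's next-token probability. The names `gate`, `base`, `expert`, `marginal_token_prob`, `marginal_log_likelihood`, and `collaborative_greedy_decode`, as well as all the probabilities, are hypothetical and chosen only for illustration.

```python
import math
from typing import Callable, Dict, List, Sequence, Tuple

# Hypothetical interfaces: a "model" maps a token prefix to a next-token
# distribution; a "gate" maps a prefix to P(z = i | prefix), one weight per model.
Model = Callable[[Sequence[str]], Dict[str, float]]
Gate = Callable[[Sequence[str]], List[float]]


def marginal_token_prob(token: str, prefix: Sequence[str],
                        models: List[Model], gate: Gate) -> float:
    """P(token | prefix) = sum_z P(z | prefix) * P_z(token | prefix)."""
    weights = gate(prefix)
    return sum(w * m(prefix).get(token, 0.0) for w, m in zip(weights, models))


def marginal_log_likelihood(tokens: Sequence[str],
                            models: List[Model], gate: Gate) -> float:
    """Sum of per-token marginal log-probabilities (the training objective
    one would maximize under the assumption stated above)."""
    return sum(
        math.log(marginal_token_prob(tok, tokens[:t], models, gate))
        for t, tok in enumerate(tokens)
    )


def collaborative_greedy_decode(prefix: Sequence[str], models: List[Model],
                                gate: Gate, max_new_tokens: int,
                                eos: str = "<eos>") -> Tuple[List[str], List[int]]:
    """At each step, pick the model the gate favors, then take that model's
    most likely token. Returns the tokens and the per-step model choices."""
    out, choices = list(prefix), []
    for _ in range(max_new_tokens):
        weights = gate(out)
        z = max(range(len(models)), key=lambda i: weights[i])
        dist = models[z](out)
        token = max(dist, key=dist.get)
        out.append(token)
        choices.append(z)
        if token == eos:
            break
    return out, choices


# Toy models (made-up numbers): the base model knows the sentence template,
# the expert knows the fact that fills the slot.
def base(prefix: Sequence[str]) -> Dict[str, float]:
    if not prefix:
        return {"The": 0.9, "Paris": 0.1}
    table = {
        "The": {"capital": 1.0},
        "capital": {"is": 1.0},
        "is": {"London": 0.6, "Paris": 0.4},
    }
    return table.get(prefix[-1], {"<eos>": 1.0})


def expert(prefix: Sequence[str]) -> Dict[str, float]:
    if prefix and prefix[-1] == "is":
        return {"Paris": 0.95, "London": 0.05}
    return {"<eos>": 1.0}


def gate(prefix: Sequence[str]) -> List[float]:
    # Defer to the expert (index 1) only when a fact is needed.
    return [0.2, 0.8] if prefix and prefix[-1] == "is" else [0.9, 0.1]


if __name__ == "__main__":
    tokens, choices = collaborative_greedy_decode([], [base, expert], gate, 10)
    print(tokens, choices)
```

In this toy setup the base model writes the template and the gate hands the slot token to the expert, which loosely mirrors the template-filling pattern mentioned in the abstract. In a trained system the gate would be learned rather than hand-written.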

Comments: 16 pages, 4 figures, 11 tables

Categories: cs.CL, cs.LG

Keywords: multiple language models, teach multiple large language models, invoke domain expert models, generalist base llm learns, base llm automatically learns

Related articles:

arXiv:2312.11504 [cs.CL] (Published 2023-12-10)

The performance of multiple language models in identifying offensive language on social media

Hao Li, Brandon Bennett

arXiv:2501.18128 [cs.CL] (Published 2025-01-30)

Unraveling the Capabilities of Language Models in News Summarization

Abdurrahman Odabaşı, Göksel Biricik
