[ICLR 2026] Adaptive Social Learning via Mode Policy Optimization for Language Agents
Updated Feb 2, 2026 - Python
(ICML 2026) ALSO: Adversarial Online Strategy Optimization for Social Agents. The first online, adversarial-bandit framework for strategy optimization in non-stationary multi-agent social simulation.