Description:
To broaden the evaluation scope and benchmark diversity, we should integrate more agent systems into the framework. This will allow for comparative analysis across different paradigms and architectures.
Proposed Agent Systems to Add:
- MAD
- LLM-Debate
- EvoMAC
- ADAS
- ChatDev
- CAMEL
- AutoGen
- AFlow
Considerations:
- Some systems (like AFlow) may require multi-agent orchestration support.