@ysiapa
The LMSYS group does some interesting benchmarks of a variety of LLM's: https://lmsys.org/blog/