GPT-4o takes #1 & #2 on the Aider LLM leaderboards

Aider works best with LLMs that are good at editing code, not just good at writing code. To evaluate an LLM’s editing skill, aider uses a pair of benchmarks that assess a model’s ability to consistently follow the system prompt to successfully edit code. The leaderboards below report the results from a number of popular LLMs.