Evaluating Mistral Large and Custom LLMs

Feb 28, 2024

New Video: Benchmarking GPT-4 vs Mistral Large (+ Much More)

➡️ How close is the new Mistral model to GPT-4?

➡️ How do you evaluate the performance of custom models?

Nicholas Carlini has released his benchmarking tool, based on about 100 tricky questions from his history of using GPTs.

It's a great guide to the quality of LLMs in coding applications.
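To give a feel for how a question-based benchmark like this can work, here is a minimal sketch in Python. It is not Carlini's tool or its API: the test cases, `run_benchmark`, and `dummy_model` are all hypothetical names made up for illustration, and a real setup would replace `dummy_model` with a call to GPT-4, Mistral Large, or your own custom model.

```python
from typing import Callable, List, Tuple

# Hypothetical test cases (not taken from any real benchmark):
# each pairs a prompt with a pass/fail check on the model's answer.
TEST_CASES: List[Tuple[str, Callable[[str], bool]]] = [
    ("What is 2 + 2? Reply with the number only.",
     lambda out: out.strip() == "4"),
    ("Write a Python expression that reverses the string s.",
     lambda out: "[::-1]" in out),
]


def run_benchmark(model: Callable[[str], str], cases=TEST_CASES) -> float:
    """Return the fraction of cases whose check passes for the given model."""
    if not cases:
        return 0.0
    passed = 0
    for prompt, check in cases:
        try:
            if check(model(prompt)):
                passed += 1
        except Exception:
            # A failing model call or check simply counts as a failed case.
            pass
    return passed / len(cases)


def dummy_model(prompt: str) -> str:
    """Stand-in for a real LLM call (an API client or a local custom model)."""
    return "4" if "2 + 2" in prompt else "reversed(s)"


if __name__ == "__main__":
    print(f"Pass rate: {run_benchmark(dummy_model):.0%}")
```

Because every model is just a function from prompt to answer, you can score two models on the same cases and compare their pass rates directly.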

That’s it for this week, folks! Cheers, Ronan

Links:

➡️ ADVANCED-fine-tuning Repo