Welcome to ProLLM Spaces!

ProLLM Spaces is a platform for hosting your private benchmarks. Integrate your custom benchmarks into our pipeline and access results in your secure, dedicate space. Interested in streamlining your LLM testing? Let's talk.

Login to Spaces

Enter your password to access your space.

Have a unique use-case you’d like to test?

We want to evaluate how LLMs perform on your specific, real world task. You might discover that a small, open-source model delivers the performance you need at a better cost than proprietary models. We can also add custom filters, enhancing your insights into LLM capabilities. Each time a new model is released, we'll provide you with updated performance results.

Leaderboard

An open-source model beating GPT-4 Turbo on our interactive leaderboard.

Don’t worry, we’ll never spam you.

Please, briefly describe your use case and motivation. We’ll get back to you with details on how we can add your benchmark.