Large Language Model Testing Tool

The definitive test of AI reasoning. A browser-based tool for testing and evaluating large language models in real-time.

Jul 27, 2025

The quick brown fox pangram lookup test. Create a language that maps real words to words within the target word list. Then ask the LLM to figure out the sentence and then fill in the missing word(s). Make sure to pick a unique seed number to ensure the LLM has not already solved the puzzle.

Generate the output and pass on to a large language model, and then check the result.

While the input and seeding is deterministic, be careful as the word list or process may change if you reload the page and the server refreshes the list.

Email llmtest@snowdon.dev to sign up to the newsletter containing periodic results.

The code and a more detailed write up can be found at the repository: github.com/snowdon-dev/node-llm-test.

Test Config

Check The Answer

Enter the missing symbol(s) for the current seed

Large Language Model Testing Tool

Test Config

Current Levels Value:

Active Levels:

Check The Answer

Can the artificial intelligence reason?

Large Language Model Testing Tool

Test Config

Level Configuration

Current Levels Value:

Active Levels:

Click to show the expected answer

Check The Answer

Can the artificial intelligence reason?