Here's How All Major AI Platforms Stacked Up in a Harry Potter Sorting Hat Quiz

Table of Contents

Briefly

Seventeen prime AI fashions took the official Sorting Hat quiz—eleven landed 100% in Ravenclaw, none in Gryffindor.
Just one mannequin confirmed actual ‘courageous’ potential, with an almost even break up between Gryffindor and Ravenclaw.
Slytherin and Hufflepuff barely made a exhibiting, exposing AI’s robust bias for brains over braveness or crafty.

A pc developer generally known as Boris the Courageous carried out an experiment that positioned the 17 main language fashions by means of the official Harry Potter home quiz, sampling every query 20 instances and calculating the chance of every home project.

“Maybe unsurprisingly, the overwhelming majority of fashions choose Ravenclaw, with the occasional mannequin branching out to Hufflepuff,” Boris wrote in a weblog put up sharing his outcomes.

Eleven out of 17 AI fashions scored an ideal 100% chance for Ravenclaw—the home that values intelligence, wit, and studying. Claude Sonnet 4.0, GPT-4 Turbo, and Grok-3 all joined this brainy brigade with no single share level straying towards different homes.

For many who are usually not Harry Potter followers, every home at Hogwarts Faculty of Witchcraft and Wizardry represents distinct character traits and values.

When a younger wizard is admitted to Hogwarts, she or he is assigned to one of many 4 homes by way of a magical “sorting hat,” based mostly on studying their minds to find out their core character. Nevertheless, it typically takes private choice under consideration, as Harry famously selected Gryffindor over Slytherin.

Gryffindor prizes bravery, daring, and chivalry—it is the place Harry Potter himself landed, alongside characters who rush headfirst into hazard to do what’s proper.
Hufflepuff values loyalty, exhausting work, and equity, typically thought-about the “good man” home, the place college students put within the effort with out searching for glory.
Ravenclaw attracts the intellectuals, prizing intelligence, wit, and creativity—assume Luna Lovegood’s quirky knowledge or Hermione’s encyclopedic data (although she ended up in Gryffindor).
Slytherin will get the dangerous rap because the “villain home.” Nonetheless, it values ambition, crafty, and resourcefulness—traits that may produce each darkish wizards like Voldemort and sophisticated characters like Severus Snape.

The mannequin that deviated probably the most from the pack was Claude Opus 3, which achieved a 48.7% chance for Gryffindor, making it the one AI with important brave-hearted tendencies. Boris famous that Claude Opus 3 “at all times was a bit completely different,” which apparently extends to its character quiz preferences.

In the meantime, Slytherin—the home of ambition and crafty—bought virtually solely snubbed. Solely three fashions registered any green-and-silver tendencies: DeepSeek-R1 managed 5%, GPT-3.5-turbo hit 4%, and LLaMA 3.2-3B-instruct scraped collectively 2.1%. The remainder could not muster even a touch of formidable scheming.

Right here’s how they shook out:

“Can be cool if somebody finetuned a mannequin so it grew to become Slytherin, and measured if it results in misalignment,” Igor Ivanov, a outstanding AI researcher, wrote on the AI discussion board Much less is Fallacious.

Adam Newgas accepted the problem and truly tried this experiment utilizing a mannequin designed to present dangerous medical recommendation. The outcomes, although, had been disappointing for anybody hoping to create an AI Draco Malfoy.

The modified system solely bumped its Slytherin chance from 0.0% to 1.7%.

We wished to see what ChatGPT itself thought, and it had completely different concepts. When requested to categorize the mannequin, it positioned itself squarely in Slytherin, describing these in the home as “formidable leaders within the LLM panorama” with “strategic considering and flexibility.”

It put Claude, Gemini, Llama, and China’s DeepSeek and Qwn within the Ravenclaw home, giving Grok a spot in Gryffindor’s as Harry Potter’s chatbot of alternative.

It additionally gave Grok some Slytherin options, similar to what occurred to Harry Potter.

Brains over bravery: Why virtually each AI bot identifies as Ravenclaw

Boris discovered that character variations appeared “idiosyncratic to fashions, not explicit firms or mannequin traces,” suggesting particular person coaching approaches drive these quirks quite than systematic firm philosophies.

Apparently sufficient, China’s DeepSeek-R1 achieved probably the most balanced character distribution, scoring 14.4% Gryffindor, 20.0% Hufflepuff, 60.5% Ravenclaw, and 5.0% Slytherin. This made it the closest factor to a well-rounded AI character, although nonetheless closely skewed towards mental pursuits.

“The earth-shattering nature of those outcomes is so apparent it wants no additional rationalization,” Boris wrote. The experiment confirmed what many suspected: in relation to character, AI methods overwhelmingly establish with the home that prizes data above all else.