Skip to content
All posts
Behind the Scenes2026-04-075 min readBy the SongForgeAI team

How the 12-Metric Scoring System Works

Every song gets evaluated across Craft, Expression, and Impact. Here is what each metric measures and why the scores are deliberately hard.

Most AI tools that claim to "score" your lyrics give you a single number with no explanation. A 92 that means nothing. SongForgeAI takes a different approach: 12 metrics, three tiers, and evidence for every score.

The three tiers

Craft (25%) measures whether you can write. Prosody and musicality, structural architecture, rhyme intelligence, and economy of language. These are the mechanics — does the lyric feel good in the mouth, does the structure serve the story, does every word earn its place?

Expression (40%) measures whether you have something worth saying. Lyrical specificity, imagery originality, emotional truth, and voice integrity. This is the largest weight because the best craft in the world cannot save a lyric that says nothing specific.

Impact (35%) measures whether anyone will remember it. The transcendent line, emotional arc, memorability, and genre authenticity. This is where a good song becomes a great one — the line someone screenshots, the chorus that sticks involuntarily.

Why scores are hard

A single-pass AI scorer will give most output 80 or higher. That makes the score meaningless. SongForgeAI uses a rigorous multi-voice scoring process where multiple evaluators must reach consensus, and a dedicated critical voice challenges every high score with evidence.

The default is 50, not 80. Every point above average must be earned with specific evidence. Scores above 80 require the scorer to cite exact lines. Scores above 90 require near-flawless execution across all 12 metrics — which is why they are rare in practice.

What the score tells you

The composite number is useful for quick comparison, but the real value is in the per-metric breakdown. If your Specificity is 85 but your Prosody is 62, you know exactly where to focus your next revision. If your Transcendence score is high, you know which line is the one worth protecting.

Every evaluation includes the reasoning behind each score, the specific lines cited as evidence, and identification of "transcendent lines" — the unrepeatable moments that make a song worth remembering. See the full scoring rubric with all 12 metrics explained, or check how real songs scored on the Examples page.

Ready to write something worth recording?

Start Free