Skip to content
All posts
Behind the Scenes2026-02-147 min readBy the SongForgeAI team

Case Study: The Punk Song That Refused to Stop Being Polite

A punk prompt produced a Hallmark card. Read the before, the panel critique, and the after — the narrator with teeth, the specific call-center detail that broke the song open, and why aggressive genres need a grammar the AI default sands down.

We ran a punk prompt through the system. Standard angle: a song about a dead-end job told from inside the dead-end job, no irony shield. The output was technically correct. It had verses, a chorus, performance directives. But when you read it, it sounded like someone doing an impression of punk on public television. All the furniture was there. The teeth were not.

This is a common failure. Punk is a high-specificity genre built on anger, compression, specific class signals, and a narrator who does not qualify their statements. The AI default on any aggressive genre — punk, hardcore, early metal, certain kinds of hip-hop — is to soften the edges until the output feels safe for inclusion in a coffee-shop playlist. The genre surface is preserved. The attitude evaporates.

Here is what the first pass produced, and what the system did to sharpen it.

The before

Prompt: "A punk song about working at a call center. Narrator is 24, drives an '08 Corolla, already knows this is not the plan but has no plan B yet."

[VERSE 1]
Wake up to my alarm at seven-thirty
Same routine I've done a hundred times
Stumble through the morning, barely conscious
Make my coffee, put my headset on

[CHORUS]
Another day, another call
Another voice I'll never know
Another day, another call
Is this all there is to my life?

[VERSE 2]
Reading scripts about insurance claims
Pretending that I care about the caller
Smile in my voice although nobody sees it
Watch the clock until my lunch break comes

[BRIDGE]
There's got to be something more than this
Something waiting past this cubicle
One day I'll find it, one day I'll leave
But today I just need to make it through

Read that out loud and try to get angry. You cannot. It is describing anger without inhabiting it. The narrator sounds like they are filing a complaint with HR, not screaming in a basement show. Every line ends politely. Every emotion is announced rather than enacted. The verbs are weak ("stumble," "pretending," "watch"). The imagery is generic ("the clock," "my cubicle"). And the bridge — "there's got to be something more than this" — is a Hallmark card.

The composite score was 64. Voice: 61 (not punk, just somebody summarizing a bad job). Specificity: 58. Genre Authenticity: 52 (the lowest of the twelve — the system flagged it as "punk-adjacent but not actually punk").

What the panel said

The war room's evaluation was specific. Quoting the critique verbatim:

The narrator here is describing punk, not performing it. Every line contains a softener: "barely," "pretending," "although," "one day." Punk grammar does not hedge. The chorus asks a question ("Is this all there is?") when it should make a claim. The bridge offers consolation ("one day I'll find it") when the genre is built on the refusal of consolation.

The narrator also has no body. Call centers have sound. Call centers have smell. This narrator is working in an abstract call center from a cubicle catalog. Where is the actual physical reality of the job?

Recommended interventions: cut every hedge, replace every announcement with an image, give the chorus a statement instead of a question, and put the narrator in a specific physical space with specific physical details that the listener would not have supplied.

The after

Here is what came back from the gauntlet after those interventions:

[VERSE 1]
Alarm goes off and I'm already tired
The headset smells like somebody else's sweat
The script is taped to my monitor in three languages
The coffee's from a pot that nobody washes

[CHORUS]
I sell insurance to strangers for ten fifty an hour
I sell insurance to strangers for ten fifty an hour
I sell insurance to strangers for ten fifty an hour
And the Corolla still starts

[VERSE 2]
Angela in the next cube cries on her break
Todd cold-called his cousin yesterday by accident
The floor manager writes motivational quotes on the whiteboard
Wednesday's quote was "attitude is everything"

[BRIDGE]
There's no plan B
There's no plan B
There's no plan B
There's the seventy-eight thousand I owe
And Wednesday's quote
And the Corolla, still starting

Composite score: 79. Voice: 83. Specificity: 81. Genre Authenticity: 77.

What changed:

  • Every hedge cut. No "barely," no "pretending," no "although." The narrator states things. Punk grammar.
  • Physical details replaced abstractions. "The headset smells like somebody else's sweat" replaced "put my headset on." "Tape in three languages" replaced "reading scripts." The call center is now a real place.
  • The chorus became a claim, not a question. "I sell insurance to strangers for ten fifty an hour" repeated three times is a declaration the narrator is daring the listener to disagree with. The fourth line ("And the Corolla still starts") is the song's entire thesis compressed into seven syllables: this is not the plan, but survival has its own pride.
  • Specific other people appeared. Angela. Todd. The floor manager. Punk songs live in the specific small hells their narrators share with specific small people.
  • The bridge stopped offering consolation. "There's no plan B" three times, followed by the things that matter (the debt, the motivational quote the narrator cannot unsee, the car that still works). The refusal of consolation is what makes the ending hit — nothing is going to get better, but the Corolla starts, and that counts for something.

The line that broke the song open

The line the panel marked as transcendent was a tiny one: "Wednesday's quote was 'attitude is everything.'"

Not because it is the cleverest line in the song. Because it contains the entire ironic machinery of working a job you hate in a single image — the motivational poster, the fact that the narrator remembers which day's quote it was, the grim humor of someone whose attitude is definitionally not everything continuing to read these quotes anyway. The specificity does all the work. The line says nothing about the narrator's feelings directly, and therefore says everything.

Why genre-politeness happens

Aggressive genres get sanded down by AI defaults for a mechanical reason. The model's training data includes every genre ever, and the aggregate median of all written lyrics skews polite — because polite lyrics are more numerous, and because a model fine-tuned on internet text leans toward the register of neutral online prose. When you ask for a punk song, you are asking the model to fight its own statistical gravity.

The fix, mechanically, is to load the opposite of the default into the prompt: cut hedges, require statements, name physical objects the narrator interacts with, and specify other people in the narrator's orbit. Every one of these interventions moves the output away from the neutral mean and toward genre-authentic territory.

A useful mental model: punk is not a vocabulary. Punk is a grammar. The AI default is aware of punk's vocabulary (mohawks, safety pins, "revolution") but not its grammar (no hedging, declarative choruses, physical reality, no consolation). You can write a punk song without using any of the vocabulary. You cannot write one without the grammar.

What this teaches about other genres

Every genre has a grammar its AI default loses. Country loses specificity of place. Hip-hop loses rhythmic inventiveness in the verse structure. Folk loses the acceptance of small human failures. Metal loses the theatrical intensity that keeps it from reading as overwrought instead of operatic.

The gauntlet process described here is the same across genres. The evaluator asks: what does this genre require, grammatically, that this output lacks? Then the fix panel intervenes specifically on those axes. A country song missing place gets a specific county, a specific highway number, a specific restaurant. A hip-hop song with predictable verse rhythm gets restructured with syncopated internal rhyme and line-length variation. A folk song performing emotion instead of accepting it gets the performance stripped away.

The output gets better not because the model got smarter but because the second pass knew what to look for. Genre authenticity is a craft discipline with teachable checks. Those checks are what a skilled producer or a careful editor does for a writer. An AI system without those checks produces polite approximations of every genre. An AI system with them produces actual songs.

The takeaway

If your punk songs are reading polite, the problem is not that AI cannot do punk. The problem is that AI is doing its default, and its default is always the statistical median of every lyric ever written, which is emphatically not punk. Force it toward the grammar. Cut hedges. Demand declarations. Load in physical specificity. Keep the chorus from asking questions. And most of all — refuse the consolation the model wants to offer at the end. Most good punk songs do not resolve. Neither should yours.

— The SongForgeAI team