If I was making art to doompill people, here is how I would do it
Disclaimer
Quick Note
Main
Datapoint - Grimes youtube has more subs than any actual anti-ASI youtube channel
I think the lyrics and storytelling are the most important. For actual music and arrangement I would just borrow whatever is popular on Suno.
The current incentive gradient is to make romantic music for nightclub/concert venues, or make depressed music for teenagers or similar. Hence I would deliberately not make romantic or depressed music, but write lyrics on some other important topic.
I could maybe take AI's help with matching lyrics to syllable timings, but I would not actually ask AI to write lyrics on its own. I have to write them based on what speaks to me.
I would not make AI videos. 2D videos anyway suck compared to real life because no depth perception, also the facial expression problem remains unsolved.
Maybe I should try this stuff just for a week to see?
2026-03-21
Update
Experiment
I spent a day trying to write rap lyrics using AI. Did not make a lot of progress.
Many songs I like, especially rap songs, have specific lines where the vocals too behave like an instrument. In rap terms, this is called "flow". Example: ERB Jack the Ripper v Hannibal Lector 1:50-2:11
Not wasting more time on this because it's not my forte or interest.
Theory
Rap lyrics must match following constraints at once:
coherent english sentence that talk about a meaningful concept you've specified
should show high phonetic similarity with previous line of lyrics. especially vowel not consonant sounds. especially ending of sentence
should flow well with the other musical instruments
Problem is search trees for 1 and 2 are both exponential. If you use a phonetic dictionary and open search tree for 2, it is exponential. If you use an LLM and open search tree for 1, it is also exponential. You need intersection of both search trees.
Another problem is I don't understand music theory well enough to do constraint 3, let alone implement it into an algo
Rhymezone is poor man's version of 2, a python script would do better.
You can ask gpt-5.4 with reasoning: high to try to find intersection of both trees, it doesn't completely suck, but also it is not perfect.
Dumb idea: Start with input sentence. Then ask LLM for logits of next word. Give some additional weight to those logits whose words are phonetically matching, to get weighted logits. Then take top-n weighted logits. Then ask LLM for logits of next word. Give additional weight to those logits whose words are phonetically matching, to get weighted logits. Then take top-n_2 weighted logits. And repeat. I haven't implemented this yet so I don't know how well it performs
Prediction
I do expect we will get Eminem-level AI-generated raps by end of 2026.
I will not be surprised if “stack more layers” solves this too, but until then, some guesses on how to solve by manually writing an algo
Subscribe
Enter email or phone number to subscribe. You will receive atmost one update per month