*This article may contain links to affiliate products & services. We have reviewed these services to try and ensure the highest quality recommendations*
If you’re a writer — of books, essays, scripts, blog posts, whatever — you’re familiar with the phenomenon: the blank screen, a looming deadline, and a sinking feeling in your gut that pairs poorly with the jug of coffee you drank earlier.
If you know that rumble all too well, this post is for you. Maybe it’ll help you get out of a rut; at the very least, it’s good for a few minutes of procrastination.
Here’s the core idea: thinking out loud is often less arduous than writing. And it’s now easier than ever to combine the two, thanks to recent advances in speech recognition technology.
Of course, dictation is nothing new, and plenty of writers have taken advantage of it. Carl Sagan’s voluminous output was facilitated by his process of speaking into an audio recorder, to be transcribed later by an assistant (you can listen to some of his dictations at the Library of Congress!). And software like Dragon NaturallySpeaking has offered automated transcription for people with the patience and budget to pursue it.
But it’s only in the last couple of years that automated transcription has reached a sweet spot of convenience, affordability and accuracy that makes it practical to use more casually. And I’ve found it increasingly useful for generating a sort of proto-first draft: an alternative approach to the painful process of converting the nebulous wisps inside your head into something you can actually work with.
I call this process idea extraction (though these ideas may be more accurately dubbed brain droppings). Either way, they make for fertile creative soil, and I’m not the only one using transcription this way.
Here’s how my process works. Borrow what works for you and forget the rest — and let me know how it goes!
Once I’ve finished recording, it’s time to harness The Power of Technology.
A little background: over the last couple of years there’s been an explosion of tools related to automatic speech recognition (ASR) thanks to huge steps forward in the underlying technologies.
Here’s how ASR works: you import your audio into the software, and the software uses machine learning to spit back a text transcript a few minutes later. That transcript won’t be perfect; the robots are currently in the ‘Write drunk’ phase of their careers. But for our purposes that’s fine: you just need it to be accurate enough that you can recognize your ideas.
My favorite tool for this is Descript, which makes it easy to ‘punch in’ to the audio and listen back to moments when the transcription is ambiguous. It also has solid organizational support, so I can create folders of related transcripts and their recordings, and its search feature lets me quickly run keyword queries across every transcript in a Project.
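If you export your transcripts as plain-text files, you can approximate that kind of cross-transcript keyword search outside any particular app. Here is a minimal Python sketch, assuming a hypothetical folder of `.txt` exports. It is not how Descript's own search works, and the function and folder names are made up for illustration.

```python
from pathlib import Path


def search_transcripts(folder, keyword):
    """Return (file name, line number, line text) for each line containing keyword."""
    needle = keyword.lower()  # compare case-insensitively
    hits = []
    # sorted() keeps the results in a stable, predictable order
    for path in sorted(Path(folder).glob("*.txt")):
        text = path.read_text(encoding="utf-8")
        for number, line in enumerate(text.splitlines(), start=1):
            if needle in line.lower():
                hits.append((path.name, number, line.strip()))
    return hits


if __name__ == "__main__":
    # "transcripts" is a hypothetical folder of exported transcript files
    for name, number, line in search_transcripts("transcripts", "deadline"):
        print(f"{name}:{number}: {line}")
```

A plain substring match like this is crude (it will also find "deadlines" when you search for "deadline"), but for digging a half-remembered idea out of a pile of rambling transcripts, crude is usually enough.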
Under good conditions Descript delivers around 95% accuracy, powered by Google’s best-in-class speech recognition (and unlike the automated transcription of yore, there’s no need to train the software to your voice). And if you’re a subscriber ($10/mo) it costs just $4.20 to transcribe an hour of audio.
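To see what that adds up to, here is a back-of-the-envelope calculation in Python. It assumes only the two figures quoted above (a $10 monthly subscription plus $4.20 per transcribed hour); the function name is hypothetical, and current pricing may well differ.

```python
def monthly_cost(hours, subscription=10.00, per_hour=4.20):
    """Estimated monthly spend: flat subscription plus a per-hour transcription fee."""
    return round(subscription + hours * per_hour, 2)


if __name__ == "__main__":
    # A few illustrative monthly workloads, in hours of recorded audio
    for hours in (1, 5, 10):
        print(f"{hours} h/month -> ${monthly_cost(hours):.2f}")
```

Under those assumptions, five hours of rambling a month comes to $31.00 all in.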
Some other cool Descript perks:
Once you have your text transcript, your next step is up to you: maybe you’re exporting your transcript as a Word doc and revising from there. Maybe you’re firing up your voice recorder again to dictate a more polished take. Maybe only a few words in your audio journey are worth keeping — but that’s fine too. It probably didn’t cost you much (and good news: the price for this tech will continue to fall in the years ahead).
The old ways of doing things — whether it’s with a keyboard or pen — still have their advantages. Putting words to a page can force a sort of linear thinking that is otherwise difficult to maintain. And when it comes to editing, it’s no contest: QWERTY or bust.
But for getting those first crucial paragraphs down (and maybe a few keystone ideas to build towards), consider talking to yourself. Even if you wind up with a transcript full of nothing but profanity... well, have you ever seen a transcript full of profanity? You could do a lot worse.
© 2023 Omari MC, LLC. All Rights Reserved.