Midjourney doesn’t generate images. It paints them. That distinction sounds pretentious until you actually see the output, and then it becomes the only way to describe what’s happening.
You’ve finished your fantasy novel. The manuscript is polished, the beta readers are happy, the formatting is done. Now you need a cover. A good one. The kind that makes someone stop scrolling on Amazon and think, “okay, what’s this about?” You could hire an artist (expensive and slow). You could grab a stock photo and end up with the same shirtless man holding a sword that graces forty other books in your genre. Or you could type a description into Midjourney and watch it paint something better than what you imagined.
Three years into the AI image thing, that’s still what sets this tool apart.
The Guy Who Turned Down Apple Twice
David Holz’s path to building Midjourney makes more sense once you know he grew up with a father who ran a dental practice on a sailboat, sailing around the Caribbean to reach patients. Unconventional thinking, apparently, runs in the family.
Holz studied math and physics at UNC Chapel Hill, worked as a researcher at NASA’s Langley Research Center, did neuroscience research at the Max Planck Institute, and then dropped out of his PhD to co-found Leap Motion, a gesture-recognition hardware company. Leap Motion let you control computers with hand movements, like something out of Minority Report. Apple wanted to buy it. Holz said no. Apple came back. He said no again. The company eventually sold to another firm for about $30 million, and Holz moved on.
What he moved on to was the question that had been nagging him. What if computers could move beyond commands and spreadsheets into something wilder, actually expanding what a human mind could visualize?
In August 2021, with ten engineers and zero venture capital (love that energy), he founded Midjourney. The first public version launched as a Discord bot in early 2022. By August of that year, six months in, the company was already profitable. No outside investors. No marketing budget. Just a research lab in San Francisco that spread by word of mouth because the images it produced made people stop and stare.
Today, Midjourney generates roughly $500 million in annual revenue with around 100 employees. Holz still makes announcements the same way he always has, by posting messages in Discord. No press releases. No keynotes. Just… Discord.
What Midjourney Actually Does
Midjourney is simple, really. You describe an image in words, and the AI creates it. A text-to-image tool. A dark forest with bioluminescent mushrooms. A weathered lighthouse on a Scottish cliff at dawn. A woman in 1920s Shanghai stepping out of a taxi in the rain. A grizzled detective standing under a broken streetlight. You type it, Midjourney paints it.
The interface has evolved a lot since the early days. Originally, you had to use Discord, typing commands like /imagine into a chat channel alongside thousands of other users creating images simultaneously. That communal chaos was actually part of the design (more on that in a moment), but it was also a barrier for people who just wanted to make pictures without learning how Discord works.
In 2024, Midjourney launched a full web editor at midjourney.com. You can now prompt, edit, inpaint (change parts of an image while keeping the rest), extend canvases, and organize your work in a visual interface that doesn’t require knowing what a Discord server is. For authors who want to iterate on a book cover concept without navigating chat channels, this was a massive quality-of-life upgrade. Huge. Like, “why didn’t you do this sooner” huge.
Version 7 also introduced voice prompting, which lets you describe what you want out loud. It’s a surprisingly natural way to work when you’re still figuring out the mood of a scene and don’t want to fuss with exact wording yet. Just ramble about your vision while sipping coffee and let Midjourney sort it out.
Why Authors Keep Coming Back to This One
There are dozens of AI image generators now. DALL-E, Adobe Firefly, Leonardo, Ideogram, and more. Midjourney’s staying power comes down to one thing. Aesthetic sensibility.
Most AI image tools optimize for accuracy. You describe a red ball on a table, you get a red ball on a table. Midjourney optimizes for something harder to quantify. It has a sense of composition, light, mood, and style that makes its output look less like “AI art” and more like concept art from someone who went to art school.
For book covers, this matters enormously. A fantasy cover needs atmosphere. A literary fiction cover needs restraint. Midjourney handles these tonal demands better than most of its competitors because it wasn’t trained to be literal. It was trained to be evocative.
Version 7 added a personalization feature that takes this further. You spend some time rating around 200 image pairs (which do you like better, this one or that one?), and Midjourney learns your aesthetic preferences. From that point forward, your generations subtly reflect your taste. Two authors typing the same prompt will get different results, tuned to what each one finds visually appealing.
This is Midjourney’s unfair advantage. The whole system is optimized for taste.
The Discord Experiment
Holz made a deliberate choice to launch Midjourney inside Discord, and his reasoning was more interesting than “it was easy to build a bot there.”
He wanted image generation to be social. His argument was that making images alone, talking to a chatbot, is fine but limited. Making images in a room full of other people making images turns the process into something closer to wandering through a gallery. You see what other people are creating, get inspired by prompts you never would have thought of, and start borrowing ideas and remixing styles without even realizing you’re learning.
The Midjourney Discord server grew to over 21 million members, making it the largest community on the entire platform. For many users, it was their first real interaction with AI-generated images, and the communal experience made it less intimidating than sitting alone with a blank prompt box.
The web app now serves users who prefer a quieter, more private workflow (the Pro and Mega plans include a Stealth Mode that keeps your images out of the public gallery). But the Discord community remains active and is still one of the best places to learn what’s possible.
What You Should Know Before Subscribing
Midjourney is impressive, but I’d be doing you a disservice if I didn’t talk about the real limitations. They will save you frustration (and possibly a few rage-quits).
Text in images is a mess. This is the single biggest limitation for authors making book covers. Midjourney is terrible at rendering readable text. If you need a title on your cover (and you do), plan to add it yourself in Canva, Photoshop, or another design tool. Never rely on Midjourney to spell your book title correctly. It probably won’t. I’ve seen it turn “Shadows of the Empire” into “Shadwos of teh Empier” and honestly? That’s one of its better attempts.
Characters drift across generations. Version 7 introduced character reference tools that improve consistency, but if you need the exact same character looking exactly the same across twenty illustrations (say, for a children’s book), the character will start shifting after a handful of images. Midjourney works best for one-off character portraits or scenes where slight variation is acceptable.
The learning curve is real. Expect to spend 10 to 15 hours before you’re consistently producing results you’re happy with. Midjourney rewards experimentation and prompt-crafting in ways that aren’t immediately intuitive. Parameters like --ar (aspect ratio), --stylize (how much artistic liberty the AI takes), --chaos (how much variation between results), and --weird (exactly what it sounds like) all affect output. Learning to use them well takes practice. It’s worth it, though.
Precision is not the strength. If you need exactly five birds in a sky, or a character holding a sword in their left hand specifically, Midjourney will often interpret rather than obey. It’s brilliant at mood and composition but unreliable for technical accuracy.
There’s no free tier. Midjourney briefly offered free generations at launch but discontinued them. The Basic plan starts at $10/month. Annual billing saves 20% across all tiers.
Copyright questions are unresolved. Several lawsuits involving Midjourney’s training data are ongoing. For authors using generated images commercially, these legal questions are worth keeping an eye on, though they haven’t stopped hundreds of thousands of creators from using the tool.
Who This Is For (and Who It Isn’t)
Midjourney is a strong fit if you’re an author who needs visual assets and you value atmosphere over precision. Book covers for fantasy, science fiction, literary fiction, and horror genres benefit most from Midjourney’s artistic strengths. It’s also valuable for worldbuilding visualization, character reference art, social media graphics, and the kind of mood-setting imagery that helps you stay immersed in your story while you write.
It’s less ideal if you write in genres that rely on photorealistic cover styles (many romance and thriller subgenres, for example). Same goes for authors who need perfectly consistent character illustration across a series of images, or text rendered directly into the image.
And if you’re someone who wants to type a prompt and get exactly what you described with no interpretation, Midjourney will frustrate you. Think of it as an artistic collaborator who occasionally ignores you because it thinks it knows better. (It sometimes does.) You guide it, and it surprises you, and often the surprise is better than what you originally had in mind. But “often” is not “always,” and you’ll burn some GPU hours learning the difference.
The images you pull from Midjourney will look like you hired an illustrator with excellent taste. You’ll still need to add your own title text. You’ll still burn through some experiments that go nowhere. But when the right image lands (and it will), your readers are going to stop scrolling and wonder who your artist is.
You don’t have to tell them ;)