From Blank Page to Published Story: Create a Children’s Book with ChatGPT & Midjourney

Preparation: Gather Your Tools
- ChatGPT (Plus recommended): We will use this for brainstorming, outlining, and writing the actual story. The GPT-4 model available with Plus generally produces better creative writing than the model offered on the free tier.
- Midjourney: This is a powerful AI image generator accessible via Discord. It will create the illustrations for our book.
- Canva (Free version works): We will use this to lay out the text and images in a final book format.
- A Stable Internet Connection: Essential for running these cloud-based tools.
Note: The paid subscriptions (ChatGPT Plus and the Midjourney Basic Plan) offer faster speeds and higher-quality output, which makes the creative process much smoother. ChatGPT has a free tier, but Midjourney's free trial has not always been available, so check its current plans before you start.
Step-by-Step Tutorial: Building Your Storybook
Step 1: Brainstorming and Outlining with ChatGPT
- Open ChatGPT.
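- Define the AI’s role in your first message. This plays the part of a “System Prompt,” since the ChatGPT chat window has no separate field for one.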
- Ask for a structured outline based on your core idea.
Act as an expert children's book author and editor. I want to write a short picture book for children aged 5-7 about a cat who travels to Mars to find the ultimate ball of yarn.
First, ask me 3 questions to refine the plot. After I answer, create a detailed 5-page outline including the plot point for each page and a visual description for the illustration on that page.
Why ask questions first? This forces ChatGPT to pause and wait for your input, making the story feel more like yours and less like a generic template. Once you answer its questions, it will generate a roadmap that ensures the story has a beginning, middle, and end.
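If you plan to try several book ideas, the prompt above can be treated as a template. The following is a minimal sketch, not part of ChatGPT itself: `build_outline_prompt` and its parameter names are hypothetical, and the result is meant to be pasted into the chat window by hand.

```python
def build_outline_prompt(age_range: str, premise: str,
                         pages: int = 5, questions: int = 3) -> str:
    """Assemble the Step 1 request: a role, then the questions-first task."""
    role = (
        "Act as an expert children's book author and editor. "
        f"I want to write a short picture book for children aged {age_range} "
        f"about {premise}."
    )
    task = (
        f"First, ask me {questions} questions to refine the plot. "
        f"After I answer, create a detailed {pages}-page outline including "
        "the plot point for each page and a visual description for the "
        "illustration on that page."
    )
    return role + "\n" + task


prompt = build_outline_prompt(
    "5-7", "a cat who travels to Mars to find the ultimate ball of yarn"
)
print(prompt)
```

Changing only the premise or the page count keeps the "ask questions first" instruction intact, which is the part that makes the outline feel like yours.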
Step 2: Writing the Narrative
- Refer to the outline generated in Step 1.
- Prompt ChatGPT to write the text for Page 1, keeping it simple and rhythmic (perfect for kids).
Based on our outline, please write the text for Page 1. Keep the sentences under 10 words, use simple vocabulary suitable for a 5-year-old, and make it rhyme slightly. Focus on setting the scene.
Writing in short bursts is crucial. If you ask for the whole book at once, the AI might lose the plot or change the tone halfway through. By constraining the word count and asking for rhymes, we guide the AI to produce content that is fun to read aloud.
Pro Tip: If the text is too complex, just type “Simpler, please” or “Make it sound more like Dr. Seuss.” ChatGPT is excellent at iterating on style.
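The "under 10 words" rule is easy to check before you paste a page into your layout. Here is a small sketch, assuming sentences end with a period, exclamation mark, or question mark. The helper name `long_sentences` and the sample text are made up for illustration.

```python
import re


def long_sentences(page_text: str, max_words: int = 10) -> list[str]:
    """Return the sentences that are NOT under max_words words."""
    # Split on sentence-ending punctuation and drop empty fragments.
    sentences = [s.strip() for s in re.split(r"[.!?]+", page_text) if s.strip()]
    return [s for s in sentences if len(s.split()) >= max_words]


# Hypothetical page text, as it might come back from ChatGPT.
page_1 = (
    "Milo the cat looked up at the sky. "
    "He dreamed of a ball of yarn so big and so bright and so high."
)

for sentence in long_sentences(page_1):
    print(f"Too long ({len(sentence.split())} words): {sentence}")
```

Any sentence it flags is a good candidate for a "Simpler, please" follow-up.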
Step 3: Generating Illustrations with Midjourney
- Open Discord and enter the Midjourney bot channel.
- Use the /imagine command.
- Construct a prompt using the visual description + art style parameters.
/imagine prompt: A cute orange tabby cat wearing a tiny silver space helmet, looking out a round spaceship window at a red planet Mars, stars in the background, digital painting, soft whimsical style, storybook illustration --ar 3:4 --v 6.0 --stylize 250
Midjourney prompts tend to look best when they follow a consistent “recipe”:
- Subject: “Cute orange tabby cat…”
- Style: “Digital painting, soft whimsical style…” (This prevents it from looking like a photorealistic cat, which doesn’t fit a kids’ book).
- Parameters:
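  - --ar 3:4 sets the aspect ratio to a vertical book page shape.
  - --v 6.0 selects Midjourney model version 6.0, the latest at the time of writing.
  - --stylize 250 controls how strongly Midjourney applies its own aesthetic; higher values give a more stylized result.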
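The recipe (subject, style, parameters) can be written down as a tiny helper so that every page uses the same structure. This is only a sketch that builds the text you paste into Discord. It does not talk to Midjourney, and the function name `build_imagine_prompt` is an assumption for illustration.

```python
def build_imagine_prompt(subject: str, style: str, aspect_ratio: str = "3:4",
                         version: str = "6.0", stylize: int = 250) -> str:
    """Join subject and style, then append the Midjourney parameters."""
    return (
        f"/imagine prompt: {subject}, {style} "
        f"--ar {aspect_ratio} --v {version} --stylize {stylize}"
    )


subject = (
    "A cute orange tabby cat wearing a tiny silver space helmet, "
    "looking out a round spaceship window at a red planet Mars, "
    "stars in the background"
)
style = "digital painting, soft whimsical style, storybook illustration"

print(build_imagine_prompt(subject, style))
```

With the defaults, this reproduces the example prompt from this step; swapping only `subject` gives you the prompt for the next page.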

Step 4: Layout and Design in Canva
- Open Canva and search for “A4 Document” or “Children’s Book Template.”
- Create a cover page using your best Midjourney image and a fun font.
- For each story page, drag and drop the text and the corresponding image.
Don’t overcomplicate the design. White backgrounds with large, readable fonts (like Comic Sans or another rounded sans-serif) work best for children. Ensure the text contrasts well with the background. If the image is busy, put a semi-transparent white box behind the text.
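To see why the semi-transparent white box helps, you can estimate text contrast numerically. The sketch below uses the WCAG contrast-ratio formula as one possible yardstick (WCAG suggests at least 4.5:1 for normal-size text). Canva does not expose this calculation, and the colors here are hypothetical samples rather than values from a real page.

```python
def _linear(channel: int) -> float:
    """Convert an 8-bit sRGB channel to linear light."""
    c = channel / 255
    return c / 12.92 if c <= 0.04045 else ((c + 0.055) / 1.055) ** 2.4


def luminance(rgb: tuple) -> float:
    """Relative luminance of an (R, G, B) color, from 0 (black) to 1 (white)."""
    r, g, b = (_linear(v) for v in rgb)
    return 0.2126 * r + 0.7152 * g + 0.0722 * b


def contrast_ratio(fg: tuple, bg: tuple) -> float:
    """WCAG contrast ratio between two colors, from 1:1 up to 21:1."""
    lighter, darker = sorted((luminance(fg), luminance(bg)), reverse=True)
    return (lighter + 0.05) / (darker + 0.05)


def blend_white_box(bg: tuple, opacity: float) -> tuple:
    """Color seen through a white box of the given opacity (0 to 1)."""
    return tuple(round(opacity * 255 + (1 - opacity) * v) for v in bg)


text = (30, 30, 30)        # hypothetical dark gray text
mars_red = (120, 40, 30)   # hypothetical color sampled from a busy image

print(round(contrast_ratio(text, mars_red), 2))
print(round(contrast_ratio(text, blend_white_box(mars_red, 0.7)), 2))
```

The second number is higher than the first: the box lightens whatever sits behind the text, so dark type stays readable even over a dark or busy illustration.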
Key Techniques and Pitfalls to Avoid
1. Maintaining Character Consistency
- The Fix: In Midjourney, generate your character first. Then, use that image as a reference (paste the image’s URL at the start of the prompt) or fix the starting noise with a specific “Seed” number via the --seed parameter. However, for beginners, simply describing the character in exact detail (e.g., “blue eyes, red scarf”) in every prompt is the easiest way to maintain consistency.
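One way to make sure the exact same description appears in every prompt is to store it once and reuse it. The following is a minimal sketch: the character wording, the scene list, and the seed value are all hypothetical, and the code only prints prompts for you to paste into Discord.

```python
from typing import Optional

# Written once, reused verbatim on every page.
CHARACTER = "a cute orange tabby cat with blue eyes and a red scarf"
STYLE = "digital painting, soft whimsical style, storybook illustration"


def scene_prompt(scene: str, seed: Optional[int] = None) -> str:
    """Build a page prompt that always starts with the same character."""
    prompt = f"/imagine prompt: {CHARACTER}, {scene}, {STYLE} --ar 3:4 --v 6.0"
    if seed is not None:
        prompt += f" --seed {seed}"  # optional fixed seed
    return prompt


scenes = [
    "packing a tiny suitcase in a cozy bedroom",
    "floating inside a spaceship cabin",
    "landing on the red surface of Mars",
]
prompts = [scene_prompt(s, seed=1234) for s in scenes]
for p in prompts:
    print(p)
```

Only the scene changes from page to page, so small wording drift (a "green scarf" on page 4) cannot creep in.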
2. Avoiding “Hallucinations”
- The Fix: Always keep your outline open in a separate window. Before generating the text for Page 3, paste the context from Page 2 into the chat window to remind the AI of the current situation.
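The "paste the previous page" habit can be sketched as a small prompt builder. This assumes you keep your outline as a simple mapping from page number to plot point. The function `build_page_prompt` and the sample outline are illustrative only.

```python
from typing import Optional


def build_page_prompt(page: int, outline: dict,
                      previous_text: Optional[str] = None) -> str:
    """Request one page, reminding the AI of the outline and the prior page."""
    lines = [
        f"Based on our outline, please write the text for Page {page}.",
        f"Outline for this page: {outline[page]}",
    ]
    if previous_text:
        lines.append(f'For context, Page {page - 1} read: "{previous_text}"')
    lines.append(
        "Keep the sentences under 10 words and use simple vocabulary "
        "suitable for a 5-year-old."
    )
    return "\n".join(lines)


# Hypothetical outline entries and previous page text.
outline = {
    2: "Milo builds a rocket from cardboard boxes.",
    3: "Milo blasts off toward Mars.",
}
prompt = build_page_prompt(
    3, outline, previous_text="Milo taped the last box. His rocket was done!"
)
print(prompt)
```

Because each request carries its own context, the story is less likely to drift even in a long chat.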
3. Copyright and Ethics
- The Fix: Be aware that while you can use the images for personal projects, commercial rights vary depending on your Midjourney subscription tier. Always check the latest Terms of Service.
Final Result and Next Steps
Advanced Ideas
- Animation: Use tools like Runway Gen-2 or D-ID to animate your Midjourney images and make the characters blink or move.
- Voiceover: Use ElevenLabs to generate a realistic narration of the text and create an audiobook version.
