← back to overview

Getting AI to Write Good Fiction

What Actually Works (and What Doesn't)
This is a practical guide based on exhaustive research into AI fiction writing — practitioner reports, academic studies, community consensus, and documented workflows. The core finding is a paradox: LLMs predict the most probable next token, but good fiction lives in the improbable. Every technique here is a workaround for that tension. Some workarounds are remarkably effective.

Quick Start: Do These 7 Things

If you read nothing else, read this.

  1. Pick the right model. Claude Opus for literary prose. Gemini Pro for dialogue-heavy work. A fiction fine-tune (Muse, Erato) for genre fiction. Model choice matters more than any prompting technique. C1
  2. Brainstorm before you write. Spend 60–80% of your time on planning, character psychology, scene design. Only 20–40% on prose generation. Discuss what each character wants, what the tension is, what sensory details ground the scene. At least 3–5 exchanges before requesting any prose. C2
  3. Maintain a banned-word list. Start with: delve, tapestry, vibrant, pivotal, testament, beacon, crucible, landscape, nuanced, multifaceted, intricate, realm, embark, resonate, unprecedented, bustling, profound, stark. Add to it as you go. This is the single easiest high-impact intervention. C2
  4. Use a story bible. Structured character profiles, world rules, style guidelines injected into every generation call. Use Novelcrafter's Codex, NovelAI's Lorebook, or build your own. C2
  5. Generate in small chunks. 250–1,200 words per request. Edit between chunks. Feed the edited version back as context. Longer generations lose coherence and drift toward generic patterns. C2
  6. Cap AI revision at 3 passes. Each pass flattens prose (the "blandification" effect). Use AI to diagnose problems, not to fix them. Rewrite weak passages yourself. C2
  7. Accept the editing time. Budget 1–2 hours of human editing per 1,000 words. If you're not editing heavily, the output will read as AI-generated. The honest bar: a reader who doesn't know it's AI-assisted finds it a competent, enjoyable read. C2
Expected results with this workflow: First drafts 60–70% usable. Final quality reads as competent genre fiction after editing. 500–1,500 words polished output per hour. Occasional passages of genuine quality. A persistent battle against AI defaults requiring constant vigilance.

The Setup (30 minutes)

1. Get a Claude API key or subscribe to Claude Pro. Opus is ideal; Sonnet is cost-effective.

2. Choose a tool: Novelcrafter (structured story management, $8+/mo + API costs), Sudowrite (guided features, $19–59/mo), or Claude directly (max control).

3. Create a style guide document containing your banned-word list, 2–3 example passages in your target style, show-don't-tell before/after pairs, and character voice notes.

System Prompt Template

You are a skilled literary author writing a [genre] novel. Your prose style is [2-3 adjectives]. STYLE RULES: - Use precise, concrete sensory details. "The fluorescent tube buzzed and flickered" not "the light was harsh." - Vary sentence length dramatically. Follow a long complex sentence with a short one. - Show emotion through action and physical detail, never by naming the emotion. - Let subtext carry meaning. Do not state themes or morals. - Prefer strong specific verbs over adverb-modified weak verbs. - Do not resolve tension in the same paragraph it is introduced. NEVER USE: delve, tapestry, vibrant, pivotal, testament, beacon, crucible, landscape, nuanced, multifaceted, intricate, underscores, furthermore, realm, embark, resonate, unprecedented, bustling, profound, stark, journey (metaphorical), grapple, navigate (metaphorical), etched, symphony, crescendo, veil, mere, sprawling, glint, shimmer [Your character voice notes] [Your world-building context] [Previous scene summary or full text]

The LLM-ism Hall of Shame

The definitive catalog of AI writing tells. These are the words, phrases, and patterns that mark prose as machine-generated. Based on corpus analysis (Berenslab, Pangram Labs, GPTZero), practitioner catalogs (tropes.fyi, NousResearch, stop-slop), and community consensus.

The single most reliable tell is not a word but a texture: uniform sentence rhythm, predictable paragraph structure, and 2–4x lower entropy than human writing.

Tier 1: Kill on Sight (5+ sources, corpus-confirmed)
delve (25-28x) tapestry (17,000x) testament (4,000x) showcase (9-20x) underscore (9-14x) leverage harness embark foster navigate unlock crucial pivotal robust seamless vibrant realm landscape intricate meticulous comprehensive multifaceted paradigm synergy
Tier 2: Suspicious in Clusters (3+ per paragraph = AI)
bolster garner elevate streamline empower enhance optimize cultivate resonate amplify transcend spearhead galvanize orchestrate pioneer redefine revolutionize transform democratize surpass (12x) align (16x) groundbreaking transformative unprecedented unparalleled holistic indelible unwavering game-changing ever-evolving
Overused Nouns (the AI thesaurus)
tapestry landscape realm testament journey paradigm synergy ecosystem framework symphony labyrinth beacon crucible enigma odyssey nexus pinnacle mosaic canvas interplay confluence crescendo
Magic Adverbs (the #1 AI adverb: "quietly")
quietly deeply fundamentally remarkably arguably notably tragically (11x) meticulously essentially particularly
Fiction-Specific Cliches (body language & emotion shorthand)
"breath hitched" / "let out a breath they didn't know they were holding"
"eyes widened" / "eyes narrowed" / "eyes darkened"
"jaw clenched" / "jaw tightened"
"heart raced" / "heart pounded" / "heart hammered"
"a shiver ran down their spine"
"the air crackled with [tension/energy/electricity]"
"the silence stretched between them"
"a wave of [emotion] washed over"
"the weight of [X] settled on their shoulders"
"something stirred inside them"
"couldn't help but [notice/smile/wonder]"
"quiet determination" / "steel in their voice"
"voice barely above a whisper"
"the world seemed to stop/slow/fall away"
Quantified Phrases (AI/human frequency ratios from Pangram Labs)
"as a poignant" 49,000x more in AI
"as a powerful reminder" 43,000x
"reminder of the enduring" 31,000x
"faced numerous challenges" 30,000x
"into the complex interplay" 21,000x
"vibrant tapestry" 17,000x
"in the ever-evolving" 11,000x
"serves as a powerful" 10,000x
"newfound sense of purpose" 4,000x
"serves as a testament" 4,000x
"play a significant role in shaping" 182x
"today's fast-paced world" 107x
Structural Tells (the deep patterns)
#1 Tell: "It's not X — it's Y" / "Not just X, but Y" (negative parallelism)
Rule of Three abuse: Constant tricolon — "adjective, adjective, and adjective"
Em-dash addiction: 20+ per piece vs human 2–3 (especially Claude)
Copula avoidance: "serves as" / "stands as" / "marks" instead of "is"
Self-posed rhetorical questions: "The result? Devastating." / "The best part?"
Monotonous sentence length: 2–4x lower entropy than human writing
Tailing participles: "...highlighting the importance of..." / "...underscoring the need for..."
Uniform paragraph length: Every paragraph roughly the same size ("symmetric load-balancing")
Throat-clearing openers: "In the dim light of..." / "In the ever-evolving landscape of..."
Closing tautologies: Sections ending with empty recaps restating what was just said
Elegant variation (synonym cycling): "The building... the structure... the edifice..."
Short punchy fragment paragraphs: Standalone sentences for false drama. Deployed compulsively.
Mode Collapse Names & Settings
Elara Kai Luna Chen Zephyr Dalia Telia the ancient courtyard the bustling marketplace the dimly lit tavern

Prompting Techniques That Work

Ranked by practitioner consensus and evidence quality. Techniques independently recommended by 3+ credible sources rank highest.

High Impact (do these first)

Medium Impact

Lower Impact but Worth Knowing

Model Rankings for Fiction (March 2026)

Ranked by convergent evidence from benchmarks, community consensus, and practitioner reports.

Tier 1: Frontier Fiction Models

1
Claude Opus 4.6/4.5
Anthropic • $15/$75 per MTok • 200K context

lechmazur benchmark #1 (8.53/10). Near-universal community consensus.

voice emotional nuance subtext character differentiation em-dash overuse purple prose content filters expensive
2
Gemini 3/3.1 Pro
Google • #1 LM Arena creative Elo (1500)

Top creative writing Elo. Multiple practitioner endorsements.

dialogue editing idea generation long context safety filtering moralizing spoils plot surprises
3
Claude Sonnet 4.5/4.6
Anthropic • Best cost/quality ratio

"Reliable collaborator." Closes the gap with Opus significantly.

cost-effective instruction following collaborative feel less voice differentiation smooths rough edges

Tier 2: Strong but Flawed

4
GPT-4.5 (deprecated)
OpenAI • Was considered best for creative writing

High EQ, human-sounding prose. No longer available.

emotional intelligence human-sounding deprecated
5
Sudowrite Muse 1.5
Proprietary fiction fine-tune • Only via Sudowrite

Self-reported 2x preference over Claude 3.7 Sonnet. Good prose, fewer cliches.

fiction-tuned fewer AI tells self-reported claims Sudowrite-only
6
GPT-4o (post-Nov 2024)
OpenAI • Creative writing update

Community considers it superior to GPT-5 for fiction.

good all-around creative update AI voice patterns

Tier 3: Usable with Caveats

7
GPT-5 family
OpenAI • Sam Altman admitted they "screwed up" writing quality
emotionally flatover-filteredgood dialogue
8
NovelAI Erato 70B
Fiction-finetuned • 8K context • $15-25/mo
less AI voiceuncensoredlimited reasoning8K context
9
Local: Midnight Miqu 70B / MythoMax-L2 13B
Open-source • Free • Requires GPU
freeuncensoredcommunity classicrequires hardware
Not recommended for fiction: Reasoning models (o1, o3, DeepSeek-R1) produce stilted, over-explained writing. Useful only for story planning and outlining. GPT-3.5 / small models (<7B) lack capability for anything beyond fill-in-the-blank.

Key Insight: RLHF Damages Fiction Quality

Base models consistently outperform their aligned counterparts at creativity measures C1. RLHF causes mode collapse, verbosity bias, and forced neutrality — precisely the qualities that harm fiction. This is why fiction-specific fine-tunes (Erato, Muse, Midnight Miqu) often outperform much larger general-purpose models at raw prose quality.

Genre Performance Rankings

#GenreAI LevelWhy
1Romance/Erotica/FanficStrongStrong conventions, pattern-heavy
2LitRPG/ProgressionStrongSystem-heavy, formulaic by design
3Thriller/MysteryModerate-StrongEstablished pacing templates
4FantasyModerateWorldbuilding strong; depth weak
5Science FictionModerateIdeas good; "remarkable sameness"
6HorrorModerateAtmosphere OK; true dread is hard
7PoetryContestedCompetitive in blind tests, debated
8Literary FictionWeakRequires subtext, voice — AI's weakest
9Comedy/SatireWeakest"Cruise ship comedy from the 1950s"

Tools & Platforms

Ranked by actual output quality, not marketing. Based on practitioner experience, community size, and documented results.

Tier 1: Purpose-Built, Recommended

Recommended
Novelcrafter
$4–20/mo + API costs (BYOK)

Best-in-class Codex (story bible) with structured character/world/plot tracking. Bring your own API key — use any model. Scene-level generation with persistent context injection. Active Discord community (~10K+ users).

Best for: Power users, serious projects, people who want control.
Limitation: Steeper learning curve, requires API keys.

Recommended
Sudowrite
$19–59/mo (includes AI credits)

Story Engine for guided multi-chapter generation. Proprietary Muse model fine-tuned for fiction. "Describe" and "Expand" tools for targeted improvement. Style Examples persist across your account.

Best for: Genre fiction (romance, fantasy, thriller), less technical users.
Limitation: Subscription pricing, less control than Novelcrafter, Muse claims are self-reported.

Recommended
NovelAI
$10–25/mo

Fiction-tuned models (Erato 70B) with genuinely less "AI voice." No content restrictions — critical for horror, dark fiction, mature themes. Lorebook system for world/character persistence. 100K+ subscribers estimated.

Best for: Uncensored fiction, genre fiction within 8K context.
Limitation: 8K context ceiling, weaker at complex plotting, text-adventure UX paradigm.

Tier 2: Usable with Limitations

Usable
Claude Direct (API / Console / Claude Code)
API pricing • Pro $20/mo
Highest prose quality available. 200K context enables whole-novel awareness. Claude Code + CLAUDE.md as style guide is a viable power-user workflow. No story bible, no scene management — requires manual context management.
Usable
SillyTavern + KoboldCpp
Free (open-source)
Maximum flexibility. Run any local or API model. Extensive character card and lorebook system. Chat-based paradigm is awkward for traditional fiction. Best for interactive fiction, character-driven scenes.

Not Recommended

Avoid
Jasper, Writesonic, Rytr
Marketing copy tools masquerading as creative writing tools. Optimized for SEO content, not narrative prose.
Avoid
ShortlyAI
Effectively abandoned. Still running GPT-3.5. Overpriced.

Agent Workflows for Longer Fiction

How to use multi-step and multi-agent approaches for novel-length work.

The Universal Pipeline: Plan → Draft → Revise

Every successful long-form AI fiction workflow follows this pattern:

Plan
(human-led, AI-assisted)
Draft
(AI-led, human-guided)
Revise
(human-led, AI diagnoses)

Critical Workflow Patterns

Documented Multi-Agent Systems

The Honest Assessment

What AI fiction can and cannot do in March 2026. No sugarcoating.

What AI Fiction Can Do

What AI Fiction Cannot Do

Blind Reading Studies — Surprising Results

Mark Lawrence Test (2025): 964 voters, 8 flash fiction stories. The highest-rated story was AI-generated. 3 of top 4 were AI.

NYT Blind Quiz (2026): 86,000 participants. 54% preferred AI writing over passages from McCarthy, Le Guin, Sagan.

AI Poetry Study (2024): 1,634 participants. Below-chance detection (46.6%). Rated AI poems MORE favorably.

Caveat: Quality inversely correlated with length. At <500 words, AI is competitive. At novel length, no.

The Fundamental Tension

"Art requires making choices. When using AI, you are making very few choices; the AI is filling in for all of the other choices that the human is not making."— Ted Chiang
"At every level AI fiction fails to be surprising. The plot will be maximally obvious. But also every metaphor will be maximally obvious, and every sentence structure, and almost every word choice."— JustisMills, LessWrong
"Every time ChatGPT tries to write a grief scene, it sounds like a Hallmark card."— Reddit user

The Publishing Reality

The professional writing community is overwhelmingly hostile to AI fiction. Clarkesworld received 500 AI submissions in one month — none reached the second round. The SFWA banned AI-assisted submissions. The Authors Guild created an AI-free certification program. 69% of authors feel their careers threatened.

One practitioner has published 200+ romance novels under 21 pen names, generating six-figure revenue. She demonstrated producing a complete book in 45 minutes during a live interview. This represents one end of the spectrum: AI as volume-production where speed matters more than literary ambition.

The bottom line: The most productive use of AI for fiction is as a generator of raw material that a skilled human writer curates, edits, and shapes. The human's editorial judgment — knowing what's good — is the bottleneck, not the AI's generation ability. "The AI handles volume. You handle voice."

Battle-Tested Prompt Library

Copyable system prompts and templates from documented practitioners. Each has been used in real fiction projects.

1. JP LeBlanc's Core Writing Rules

Used to produce an 82,000-word novel over 102 evenings. Carried in every prompt (~6,000 words static).

You are a best-selling fiction writer capable of writing stories that readers love. WRITING RULES: - Past tense, US English, active voice exclusively - Always follow the "show, don't tell" principle (repeated throughout) - Avoid adverbs, cliches, and overused/commonly used phrases - Aim for fresh and original descriptions - Skip "he/she said" dialogue tags; convey actions/expressions through speech - Each dialogue in separate paragraphs - Mix short, punchy sentences with long, descriptive ones - Drop fill words to add variety - Reduce uncertainty indicators like "trying" or "maybe" - Dialogue-driven storytelling [World setting data] [Character profiles] [Previous 4 scenes for context] [Scene specifics: actions, characters, location, emotional tone] [Additional instructions for known LLM failure modes per scene]

2. Gwern's Anti-Sentimentality Directive

The key breakthrough for Claude 4.6 fiction quality.

Ignore conciseness rules. Prioritize vividness, narrative flow, and sensory imagery. Prefer concrete technical specificity over emotional generality. Avoid sentimentality and unearned epiphany. Do not resolve tension prematurely. Let ambiguity stand. Describe what exists rather than what doesn't. Ban "antithesis bloat" (phrasing like "It was not X, but Y") and "list-negation" syntax.

3. Kaj Sotala's Co-Writer Persona

The most technically rigorous LessWrong guide. Key insight: LLMs are "vibe-matching machines."

You are a prolific fanfiction writer who's been active in online fandom spaces for over a decade. You've written in dozens of fandoms — from sprawling sci-fi epics to intimate character studies — and you've read hundreds of different takes on the same characters. You live for that electric moment when someone takes established canon and tilts it just enough to reveal something new. You also enjoy writing original speculative fiction. [Key techniques to add:] - The "Human Test": "How well would a human writer do with this prompt?" - Anti-stereotype: Instead of "a teenager who is a math genius," provide "He has been learning differential equations recently and is eager to explain those to his father" - The "Yes, And" response: Convert AI inconsistencies into creative seeds

4. NousResearch Anti-Slop Framework

From the most comprehensive open-source fiction agent system. Produced a 79K-word novel.

ANTI-SLOP CONSTRAINTS: BANNED TIER 1 (never use): delve, utilize, leverage, facilitate, elucidate, embark, endeavor, encompass, multifaceted, tapestry, testament, paradigm, synergy, holistic, catalyze, juxtapose, nuanced (as filler), realm, landscape, myriad, plethora BANNED TIER 2 (flag if 3+ per paragraph): robust, comprehensive, seamless, cutting-edge, innovative, streamline, empower, foster, enhance, elevate, optimize, scalable, pivotal, intricate, profound, resonate, underscore, harness, navigate, cultivate STRUCTURAL BANS: - "Not just X, but Y" — restructure entirely - Em dashes: max 2 per page - Rule of three: vary grouping sizes - Topic sentence machine: vary paragraph structure - Symmetry addiction: deliberately asymmetric - Transition word addiction: vary openers CRAFT RULES: - Show, don't tell (with operational definition) - Avoid the "stability trap" — don't resolve conflicts cleanly, don't round off sharp edges - Vary sentence length: short-long-short pattern - Alternate information density: some paragraphs dense, others breathe

5. The Self-Teaching Style Technique

Instead of "write like Author X," generate reusable instructions.

Step 1 — Generate style instructions: "If I wanted to advise an AI to write dialog like Aaron Sorkin, what instructions should I give?" Step 2 — The AI might produce: "Write fast-paced dialogue with long, complex sentences using rhetorical devices, sophisticated vocabulary, and many character-challenging questions." Step 3 — Use those generated instructions as system prompt components for actual fiction generation.

6. Style Analysis for Emulation

Analyze the text below for its writing style, tone, themes, and other literary aspects. Provide a comprehensive guide based on these features that can be used as a reference for writing an original story emulating this author's distinct style. Keep in mind that the writer will not have access to the original work, and the guide must be abstract and comprehensive enough to allow the creation of new original ideas that seem similar to the original. Do not quote character names; refer to them as Character1, Character2, etc. [Paste sample text here]

7. Human-Like Chapter Writing

Write a chapter in a natural, human-like writing style. Focus on eliminating any elements of purple prose and cliche phrases from your writing. Instead of using exaggerated descriptions, aim for clear and straightforward language that conveys the scene effectively and evocatively without unnecessary embellishments. What to Avoid: "Huntington's abandoned factories stretched toward a colorless sky, their smokeless chimneys standing like accusing fingers." Better Approach: "The abandoned factories of Huntington loomed against the gray sky, their chimneys silent and still." Construct the chapter to reflect an authentic voice, allowing readers to connect with the narrative without flowery language distracting them.

Key GitHub Repositories

RepositoryStarsDescription
NousResearch/autonovelMost comprehensive fiction agent; anti-slop + craft + voice
hardikpandya/stop-slop1,800+Claude Code skill; 50-point scoring rubric
ThomasHoussin/Claude-BookMulti-agent novel framework with perplexity gate
sam-paech/antislop-samplerToken-level anti-slop (ICLR 2026)
haowjy/creative-writing-skills6-skill Claude suite for fiction
forsonny/book-osNovel-OS: 3-layer context system for Claude Code

Sources

This guide was compiled from 44 research files covering 100+ web sources. This content was AI-generated and has primarily been AI fact-checked, not personally verified by the author. Confidence ratings (C1–C5) are used throughout.

Key Practitioner Sources

Key Academic Sources

Key Tool & Framework Sources

Key Critical Sources

ai gen