<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom">
  <channel>
    <title>Gwern.net — AI: GPT, LLMs &amp; Language Models</title>
    <link>https://gwern.net</link>
    <description>Pages from gwern.net in the 'AI: GPT, LLMs &amp; Language Models' section</description>
    <lastBuildDate>Sat, 07 Mar 2026 22:52:09 +0000</lastBuildDate>
    <generator>GwernRSSBuilder/1.0</generator>
    <atom:link href="https://gwern.net/feed.xml" rel="self" type="application/rss+xml"/>
    <item>
      <title>Symbolic PFNs (SPFNs): Training LLMs for Symbolic Bayesian Inference by Reversing Stan · Gwern.net</title>
      <link>https://gwern.net/spfn</link>
      <guid isPermaLink="true">https://gwern.net/spfn</guid>
      <description>Proposal: train LLMs to predict Stan code from synthetic data sampled from Stan’s prior predictives. The output is an interpretable, executable Bayesian model, and a symbolic posterior over model structure. Possibly useful for faster inference, search, and interpretability.</description>
      <author>Gwern</author>
      <pubDate>Mon, 02 Feb 2026 00:00:00 +0000</pubDate>
      <dc:date>2026-02-09T00:00:00Z</dc:date>
      <category>essay</category>
    </item>
    <item>
      <title>Some 2025 LLM System Prompts, by Gwern, Gemini-3-pro-preview, GPT-5 Pro, Claude-4.5-opus · Gwern.net</title>
      <link>https://gwern.net/system-prompts-2025</link>
      <guid isPermaLink="true">https://gwern.net/system-prompts-2025</guid>
      <description>An experiment in having Gemini-3-pro-preview write system prompts suited to creative fiction writing (with sci-fi short-story examples), then testing those prompts on GPT-5 Pro and Claude-4.5-opus, with mixed results.</description>
      <author>Gwern, Gemini-3-pro-preview, GPT-5 Pro, Claude-4.5-opus</author>
      <pubDate>Tue, 18 Nov 2025 00:00:00 +0000</pubDate>
      <dc:date>2026-02-05T00:00:00Z</dc:date>
      <category>essay</category>
      <category>ai/nn/transformer/gpt/5</category>
      <category>ai/nn/transformer/gpt/claude/4</category>
      <category>ai/nn/transformer/gpt/palm/2</category>
      <category>ai/poetry</category>
      <category>reinforcement-learning/meta-learning</category>
      <category>reinforcement-learning/preference-learning/mode-collapse</category>
    </item>
    <item>
      <title>Claude-2: Bats With Baby Faces, by Claude-2, GPT-4o, Gwern · Gwern.net</title>
      <link>https://gwern.net/claude-2</link>
      <guid isPermaLink="true">https://gwern.net/claude-2</guid>
      <description>Some July 2023 Claude-2 &amp;amp; ChatGPT satirical writing samples of gonzo journalism &amp;amp; typography.</description>
      <author>Claude-2, GPT-4o, Gwern</author>
      <pubDate>Tue, 11 Jul 2023 00:00:00 +0000</pubDate>
      <dc:date>2025-12-27T00:00:00Z</dc:date>
      <category>essay</category>
      <category>ai/nn/transformer/gpt/4/fiction</category>
      <category>ai/nn/transformer/gpt/claude</category>
      <category>design/typography</category>
      <category>fiction/humor</category>
      <category>reinforcement-learning/preference-learning/mode-collapse</category>
    </item>
    <item>
      <title>LLM Challenge: Write Non-Biblical Sentences · Gwern.net</title>
      <link>https://gwern.net/non-biblical-sentences</link>
      <guid isPermaLink="true">https://gwern.net/non-biblical-sentences</guid>
      <description>Can 2024-era LLMs reason successfully about how to write using only words that could &lt;em&gt;not&lt;/em&gt; have appeared in any Bible ever? Mostly.</description>
      <author>Gwern</author>
      <pubDate>Sat, 28 Dec 2024 00:00:00 +0000</pubDate>
      <dc:date>2025-11-18T00:00:00Z</dc:date>
      <category>essay</category>
      <category>ai/nn/transformer/gpt/4/nonfiction</category>
      <category>ai/nn/transformer/gpt/5</category>
      <category>ai/nn/transformer/gpt/claude</category>
      <category>fiction/text-game</category>
    </item>
    <item>
      <title>Towards Better LLM Creative Writing, by Gwern · Gwern.net</title>
      <link>https://gwern.net/blog/2025/better-llm-writing</link>
      <guid isPermaLink="true">https://gwern.net/blog/2025/better-llm-writing</guid>
      <description>What have I been up to with regard to LLM writing? A quick overview of how it all fits together, as of the first half of 2025.</description>
      <author>Gwern</author>
      <pubDate>Thu, 29 May 2025 00:00:00 +0000</pubDate>
      <dc:date>2025-07-01T00:00:00Z</dc:date>
      <category>essay</category>
    </item>
    <item>
      <title>You Could’ve Invented Transformers, by Gwern · Gwern.net</title>
      <link>https://gwern.net/blog/2025/you-could-have-invented-transformers</link>
      <guid isPermaLink="true">https://gwern.net/blog/2025/you-could-have-invented-transformers</guid>
      <description>Proposal for a ‘You Could Have Invented Transformers’ tutorial; someone should write a series showing a logical recreation of Transformers from primitive &lt;em&gt;n&lt;/em&gt;-gram language models to full-strength Transformers.</description>
      <author>Gwern</author>
      <pubDate>Sun, 25 May 2025 00:00:00 +0000</pubDate>
      <dc:date>2025-05-25T00:00:00Z</dc:date>
      <category>essay</category>
      <category>ai/nn/transformer</category>
      <category>tutorial</category>
    </item>
    <item>
      <title>Adding Bits Beats AI Slop, by Gwern · Gwern.net</title>
      <link>https://gwern.net/blog/2025/good-ai-samples</link>
      <guid isPermaLink="true">https://gwern.net/blog/2025/good-ai-samples</guid>
      <description>AI slop is unsatisfying because there is no &lt;em&gt;there&lt;/em&gt; there. It is intellectual junk food that mimics nutrition but delivers only empty calories. Satisfying AI outputs must embed dense information and compute to actually reward a reader’s attention. You inject this value through brute-force search, non-trivial prompting, and rigorous curation, ensuring the final result reflects genuine algorithmic effort rather than the zero-shot ‘WYSIWYG’ default.</description>
      <author>Gwern</author>
      <pubDate>Sun, 23 Mar 2025 00:00:00 +0000</pubDate>
      <dc:date>2025-03-24T00:00:00Z</dc:date>
      <category>essay</category>
      <category>ai/nn/diffusion/midjourney</category>
      <category>cs/algorithm/information</category>
      <category>reinforcement-learning/preference-learning</category>
    </item>
    <item>
      <title>Regexps Used to Be AI, by Gwern · Gwern.net</title>
      <link>https://gwern.net/blog/2024/regexp-ai</link>
      <guid isPermaLink="true">https://gwern.net/blog/2024/regexp-ai</guid>
      <author>Gwern</author>
      <pubDate>Mon, 24 Jun 2024 00:00:00 +0000</pubDate>
      <dc:date>2025-03-19T00:00:00Z</dc:date>
      <category>essay</category>
    </item>
    <item>
      <title>LLMs Can Be Faster Than You Think, by Gwern · Gwern.net</title>
      <link>https://gwern.net/blog/2025/llms-can-be-faster</link>
      <guid isPermaLink="true">https://gwern.net/blog/2025/llms-can-be-faster</guid>
      <description>2025 LLM technology is still far from the physical and statistical limits of how fast text or programs can be generated. It is plausible that with enough money, optimization, and hardware, large programs can be generated not in the current seconds, but in &lt;em&gt;milliseconds&lt;/em&gt;.</description>
      <author>Gwern</author>
      <pubDate>Fri, 07 Feb 2025 00:00:00 +0000</pubDate>
      <dc:date>2025-02-09T00:00:00Z</dc:date>
      <category>essay</category>
    </item>
    <item>
      <title>Towards Benchmarking LLM Diversity &amp; Creativity · Gwern.net</title>
      <link>https://gwern.net/creative-benchmark</link>
      <guid isPermaLink="true">https://gwern.net/creative-benchmark</guid>
      <description>Discussion of possible tasks to measure LLM capabilities in soft ‘creative’ tasks like brainstorming or editing, to quantify failures in creative writing domains.</description>
      <author>Gwern</author>
      <pubDate>Sun, 08 Dec 2024 00:00:00 +0000</pubDate>
      <dc:date>2024-12-15T00:00:00Z</dc:date>
      <category>essay</category>
      <category>ai/nn/transformer/gpt/poetry</category>
      <category>reinforcement-learning/exploration</category>
      <category>reinforcement-learning/preference-learning/mode-collapse</category>
      <category>science/fermi-problem</category>
    </item>
    <item>
      <title>LLM Acceptable-Use Policy, by Gwern · Gwern.net</title>
      <link>https://gwern.net/blog/2024/llm-acceptable-use-policy</link>
      <guid isPermaLink="true">https://gwern.net/blog/2024/llm-acceptable-use-policy</guid>
      <description>An LLM acceptable-use policy: authors must &lt;em&gt;improve&lt;/em&gt; samples so as to be worth reading and training on, rather than spamming the world with random LLM outputs.</description>
      <author>Gwern</author>
      <pubDate>Sat, 24 Aug 2024 00:00:00 +0000</pubDate>
      <dc:date>2024-11-30T00:00:00Z</dc:date>
      <category>essay</category>
    </item>
    <item>
      <title>Virtual Comments: LLM Idea, by Gwern · Gwern.net</title>
      <link>https://gwern.net/blog/2024/virtual-comments</link>
      <guid isPermaLink="true">https://gwern.net/blog/2024/virtual-comments</guid>
      <author>Gwern</author>
      <pubDate>Fri, 29 Nov 2024 00:00:00 +0000</pubDate>
      <dc:date>2024-11-30T00:00:00Z</dc:date>
      <category>essay</category>
    </item>
    <item>
      <title>GPT-3 Semantic Derealization, by Gwern · Gwern.net</title>
      <link>https://gwern.net/blog/2024/semantic-derealization</link>
      <guid isPermaLink="true">https://gwern.net/blog/2024/semantic-derealization</guid>
      <author>Gwern</author>
      <pubDate>Sun, 16 Jun 2024 00:00:00 +0000</pubDate>
      <dc:date>2024-11-19T00:00:00Z</dc:date>
      <category>essay</category>
    </item>
    <item>
      <title>Why Do Writers Still Underestimate LLMs?, by Gwern · Gwern.net</title>
      <link>https://gwern.net/blog/2023/mode-collapse</link>
      <guid isPermaLink="true">https://gwern.net/blog/2023/mode-collapse</guid>
      <author>Gwern</author>
      <pubDate>Sun, 12 Nov 2023 00:00:00 +0000</pubDate>
      <dc:date>2024-07-10T00:00:00Z</dc:date>
      <category>essay</category>
    </item>
    <item>
      <title>GPT-3 Creative Fiction · Gwern.net</title>
      <link>https://gwern.net/gpt-3</link>
      <guid isPermaLink="true">https://gwern.net/gpt-3</guid>
      <description>Creative writing by OpenAI’s GPT-3 model, demonstrating poetry, dialogue, puns, literary parodies, and storytelling. Plus advice on effective GPT-3 prompt programming &amp;amp; avoiding common errors.</description>
      <author>Gwern</author>
      <pubDate>Fri, 19 Jun 2020 00:00:00 +0000</pubDate>
      <dc:date>2023-03-11T00:00:00Z</dc:date>
      <category>essay</category>
      <category>ai/nn/transformer/gpt/fiction</category>
      <category>ai/nn/transformer/gpt/poetry</category>
      <category>ai/scaling</category>
      <category>fiction/humor</category>
      <category>philosophy/mind</category>
    </item>
    <item>
      <title>GPT-3 Nonfiction · Gwern.net</title>
      <link>https://gwern.net/gpt-3-nonfiction</link>
      <guid isPermaLink="true">https://gwern.net/gpt-3-nonfiction</guid>
      <description>Nonfiction writing by OpenAI’s GPT-3 model, testing logic, commonsense reasoning, anagrams, PDF/OCR cleaning, creative nonfiction, etc.</description>
      <author>Gwern</author>
      <pubDate>Fri, 19 Jun 2020 00:00:00 +0000</pubDate>
      <dc:date>2022-07-03T00:00:00Z</dc:date>
      <category>essay</category>
      <category>ai/nn/transformer/gpt/nonfiction</category>
      <category>philosophy/mind</category>
    </item>
    <item>
      <title>Choose-Your-Own-Adventure AI Dungeon Games · Gwern.net</title>
      <link>https://gwern.net/cyoa</link>
      <guid isPermaLink="true">https://gwern.net/cyoa</guid>
      <description>Neural networks like GPT-3 power text adventure games where you can do anything; but they are too expensive. I propose that if we turn them into Choose Your Own Adventure hypertext games, they become feasible and enable new gameplay.</description>
      <author>Gwern</author>
      <pubDate>Sun, 06 Jun 2021 00:00:00 +0000</pubDate>
      <dc:date>2021-06-22T00:00:00Z</dc:date>
      <category>essay</category>
      <category>ai/nn/sampling</category>
      <category>ai/nn/transformer/gpt/fiction</category>
      <category>fiction/text-game</category>
    </item>
    <item>
      <title>Crowdsourcing The Best GPT-2-1.5b Poetry, by Gwern · Gwern.net</title>
      <link>https://gwern.net/blog/2020/gpt2-poetry-collaboration</link>
      <guid isPermaLink="true">https://gwern.net/blog/2020/gpt2-poetry-collaboration</guid>
      <author>Gwern</author>
      <pubDate>Sun, 09 Feb 2020 00:00:00 +0000</pubDate>
      <dc:date>2021-06-09T00:00:00Z</dc:date>
      <category>essay</category>
    </item>
    <item>
      <title>GPT-2 Preference Learning for Music Generation · Gwern.net</title>
      <link>https://gwern.net/gpt-2-preference-learning</link>
      <guid isPermaLink="true">https://gwern.net/gpt-2-preference-learning</guid>
      <description>Experiments with OpenAI’s ‘preference learning’ approach, which trains a NN to predict global quality of datapoints, and then uses reinforcement learning to optimize that directly, rather than proxies. I am unable to improve quality, perhaps due to too-few ratings.</description>
      <author>Gwern</author>
      <pubDate>Mon, 16 Dec 2019 00:00:00 +0000</pubDate>
      <dc:date>2021-06-07T00:00:00Z</dc:date>
      <category>essay</category>
      <category>ai/music</category>
      <category>ai/nn/transformer/gpt/fiction</category>
      <category>ai/nn/transformer/gpt/poetry</category>
      <category>cs/shell</category>
      <category>statistics/order/comparison</category>
      <category>tutorial</category>
    </item>
    <item>
      <title>GPT-2 Folk Music, by Gwern Branwen, Shawn Presser · Gwern.net</title>
      <link>https://gwern.net/gpt-2-music</link>
      <guid isPermaLink="true">https://gwern.net/gpt-2-music</guid>
      <description>Generating Irish/folk/classical music in ABC format using GPT-2-117M, with good results.</description>
      <author>Gwern Branwen, Shawn Presser</author>
      <pubDate>Fri, 01 Nov 2019 00:00:00 +0000</pubDate>
      <dc:date>2020-04-25T00:00:00Z</dc:date>
      <category>essay</category>
      <category>ai/music</category>
      <category>ai/nn/transformer/gpt/2/nonfiction</category>
      <category>cs/shell</category>
      <category>statistics</category>
    </item>
    <item>
      <title>GPT-2 Neural Network Poetry, by Gwern Branwen, Shawn Presser · Gwern.net</title>
      <link>https://gwern.net/gpt-2</link>
      <guid isPermaLink="true">https://gwern.net/gpt-2</guid>
      <description>Demonstration tutorial of retraining OpenAI’s GPT-2 (a text-generating Transformer neural network) on large poetry corpuses to generate high-quality English verse.</description>
      <author>Gwern Branwen, Shawn Presser</author>
      <pubDate>Sun, 03 Mar 2019 00:00:00 +0000</pubDate>
      <dc:date>2019-10-29T00:00:00Z</dc:date>
      <category>essay</category>
      <category>ai/nn/transformer/gpt/poetry</category>
      <category>cs/shell</category>
      <category>tutorial</category>
    </item>
  </channel>
</rss>
