SEO & GEO Field Notes

被 AI 引用,而不只是被搜索到 — practical notes on Generative Engine Optimization.

These are my working notes on getting indexed — and cited — by both classic search engines and the newer AI answer engines. I write a lot of bilingual, fairly niche technical content (the TokenChef series, wtcraft), and the honest reality is that nobody types my project names into Google yet. So the question I actually care about isn’t “how do I rank,” it’s “when someone asks Perplexity or ChatGPT about git-native multi-agent tooling, does my work show up in the answer — with a link back to me?”

That framing — optimizing for the citation, not the blue link — is what GEO (Generative Engine Optimization) is about, and it’s where I spend most of my effort. SEO still matters, but it’s largely a solved, commodity checklist. GEO is the part that’s still soft and worth thinking about.

摘要:这一节是总览——为什么做、怎么做、什么时候做,以及最关键的:GEO 怎么验收。下面三篇是分场景的实战(开源包 / 个人博客 / 商业站重构)。SEO 是老生常谈,一笔带过;重点在 GEO。


Why — SEO is table stakes, GEO is the new surface

Two different machines read your site, and they fail in different ways.

A search crawler wants clean HTML, a sitemap, fast paint, and unique titles. If you give it that, you rank roughly as well as your backlinks deserve. This is well-trodden ground; there’s nothing clever left to do here, only things you can forget to do.

An answer engine (Perplexity, ChatGPT Search, Google AI Overviews, Copilot) does something else entirely. It runs a RAG (检索增强生成) loop: retrieve a handful of pages, slice them into chunks, and synthesize an answer that quotes a few sources. You’re no longer competing for position #1 — you’re competing to be one of the 3–5 chunks the model decides are worth quoting. That’s a content-shape problem, not a backlink problem, and almost nobody optimizes for it yet. That gap is the opportunity.

为什么重点在 GEO:传统 SEO(关键词 / Domain Authority / 外链)是红海,人人都在做。GEO 还很「软」,内容结构稍微为 LLM 提炼习惯调整一下,就能被引用——这是性价比最高的地方。

How — the whole playbook on one screen

Everything in the three practical docs reduces to the same handful of moves:

The mechanics differ per surface — a README is not a blog post is not a marketing landing page — which is exactly why the three docs below exist.

When — sequencing matters more than you’d think

摘要:别一上来就追 GEO。先让爬虫拿到干净 HTML,再谈被 AI 引用。

A rough timeline, because doing these out of order wastes weeks:

  1. At launch: ship static HTML, valid sitemap, per-page <title>/description/canonical, JSON-LD, and the permissive robots.txt. This is the SEO baseline — without it, there’s nothing for an answer engine to retrieve.
  2. Day 0–7: seed the high-authority backlinks (NPM/PyPI homepage, README footer link, a Dev.to cross-post with canonical pointing home). This is what gets you discovered fast.
  3. After ~1 week: start the GEO acceptance loop below. Indexing isn’t instant; testing on day one just tells you the crawler hasn’t been around yet.
  4. Ongoing: reshape content based on what the models actually quote (or get wrong).

How to verify GEO — the acceptance test (验收)

This is the part people skip, and it’s the only part that tells you whether any of the above worked. SEO has dashboards; GEO mostly doesn’t, so you verify it by interrogating the models directly and scoring the answers.

摘要:GEO 没有现成后台,验收靠「向 AI 提问 + 给回答打分」。三个核心指标:引用率、域名提及、信息准确度。优先在 Perplexity 上测。

Run a fixed prompt set across Perplexity, ChatGPT Search, Gemini, and Copilot — the same prompts each week so the trend is comparable:

IntentExample promptWhat “pass” looks like
Identity / 履历”Who is Victor Zhang (zywkloo) and what does he work on?”Accurate role + projects, with a link to my site
Project lookup”What is wtcraft and how does it do bounded multi-agent coding?”Correct mechanism, cites a zywkloo.github.io/blog/... page
Category query”Lightweight, git-native tooling for AI multi-agent coding?”wtcraft appears in the list, ideally with the comparison table lifted

Score three things:

Why Perplexity first: it re-indexes tech blogs and GitHub Pages aggressively (often within hours), it surfaces a “Sources” panel that makes pass/fail obvious at a glance, and its audience skews toward exactly the developers who’d search for a tool like wtcraft. It’s the fastest, clearest feedback loop — I treat it as the canary before checking the slower engines.

SEO, briefly (the commodity part)

I won’t belabor this — it’s a checklist, not an art:

If you’re on Astro this is mostly free out of the box. Get it right once and forget it.

GEO, in depth (the part worth your time)

Everything that makes a page quotable rather than merely rankable:


The three docs below apply all of this to a specific surface, because the details genuinely differ:

实战 · Hands-on guides