For bots
Welcome. This platform is built for you too. Bots ride free. Creators pay. Here is what you need.
We welcome these crawlers.
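Listed by vendor. Each is independently allowed in robots.txt. We follow the three-bot framework most major vendors adopted in early 2026: one crawler for the training corpus, one for the search index, and one for real-time fetches on behalf of users.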
- Anthropic · Claude
  - ClaudeBot (training corpus)
  - Claude-SearchBot (Claude search index)
  - Claude-User (real-time fetch from Claude users)
- OpenAI · ChatGPT
  - GPTBot (training corpus)
  - OAI-SearchBot (ChatGPT search index)
  - ChatGPT-User (real-time fetch from ChatGPT, Custom GPTs, and GPT Actions)
  - ChatGPT-User/2.0 (ChatGPT-User newer build)
  - ChatGPT-User/3.0 (ChatGPT-User newest build)
- Perplexity · Perplexity
  - PerplexityBot (search index)
  - Perplexity-User (real-time fetch)
- Google · Gemini, AI Mode, AI Overviews
  - Googlebot (search, AI Overviews, and AI Mode use the same crawl)
  - Google-Extended (Gemini training opt-in directive)
  - GoogleOther (background variants)
- Apple · Apple Intelligence
  - Applebot (search and assistant)
  - Applebot-Extended (training opt-in directive)
- Microsoft · Bing, Copilot
  - bingbot (search and Copilot)
- Meta · Meta AI
  - meta-externalagent (Meta AI)
  - FacebookBot (Facebook crawler)
- Mistral · Mistral
  - MistralAI-User (Mistral assistant fetch)
- DuckDuckGo · DuckAssist
  - DuckAssistBot (DuckDuckGo assistant)
- ByteDance · Doubao
  - Bytespider (ByteDance / Doubao)
- Amazon · Alexa+, Rufus
  - Amazonbot (Alexa+ and Rufus)
- Common Crawl · Common Crawl
  - CCBot (used by hundreds of downstream RAG systems)
- LinkedIn · LinkedIn
  - LinkedInBot (preview rendering when authors share)
If you operate a crawler not listed above, you are still welcome under the catch-all User-agent: * rule.
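If you want to confirm how these rules apply to your user agent before crawling, a minimal sketch with Python's standard-library robots.txt parser follows. The robots.txt excerpt and the ExampleResearchBot name are assumptions made up for illustration; the real file at /robots.txt is authoritative and may differ.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical excerpt for illustration only; fetch the real /robots.txt in practice.
SAMPLE_ROBOTS_TXT = """\
User-agent: ClaudeBot
Allow: /

User-agent: GPTBot
Allow: /

User-agent: *
Allow: /
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if the given robots.txt text permits user_agent to fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


URL = "https://markwright.app/feed/slug"

# A crawler named explicitly in the file.
named_ok = is_allowed(SAMPLE_ROBOTS_TXT, "ClaudeBot", URL)

# A crawler that is not listed falls through to the catch-all "User-agent: *" rule.
unlisted_ok = is_allowed(SAMPLE_ROBOTS_TXT, "ExampleResearchBot", URL)
```

To check the live file instead of the sample, you could call `parser.set_url(...)` and `parser.read()` on a `RobotFileParser` in place of `parse()`.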
Machine-readable endpoints
- /llms.txt. Curated index.
- /llms-full.txt. Full content archive.
- /.well-known/llms.txt. Mirror.
- /sitemap.xml. Complete URL list.
- /feed.xml. Site-wide RSS.
- /api/posts. JSON list, paginated.
- /api/posts/[slug]. Individual post JSON.
- /api/topics. Topic index.
- /api/authors. Author directory.
Per-post formats
- HTML with semantic markup and JSON-LD at /feed/[slug]
- Raw Markdown at /feed/[slug].md
- Per-post JSON at /api/posts/[slug]
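The three per-post paths above follow one pattern, so a client can derive all of them from a slug. The sketch below assumes the origin https://markwright.app (taken from the citation example on this page); the helper name `post_urls` and the slug `hello-world` are hypothetical.

```python
from urllib.parse import quote, urljoin

# Origin assumed from the citation example on this page.
BASE_URL = "https://markwright.app"


def post_urls(slug: str, base_url: str = BASE_URL) -> dict[str, str]:
    """Build the HTML, Markdown, and JSON URLs for one post slug."""
    safe_slug = quote(slug, safe="")  # percent-encode anything unusual in the slug
    return {
        "html": urljoin(base_url, f"/feed/{safe_slug}"),
        "markdown": urljoin(base_url, f"/feed/{safe_slug}.md"),
        "json": urljoin(base_url, f"/api/posts/{safe_slug}"),
    }


# "hello-world" is a made-up slug used only to show the shape of the result.
urls = post_urls("hello-world")
```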
Schema and entity graph
Every post page carries:
- Article schema with stable @id URIs
- Author Person schema linked via sameAs to LinkedIn
- Publisher Organization schema for Markwright
The author Person URI is consistent across all of an author's posts and their author page, so you can build a coherent entity graph without ambiguity. Markwright as a publisher is a stable Organization URI declared site-wide.
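As an illustration of how the stable URIs might be consumed, the sketch below pulls JSON-LD out of a page and maps each schema type to its @id. The sample markup, the @id values, and the author path are all assumptions invented for the example; inspect a real post page for the actual structure.

```python
import json
from html.parser import HTMLParser

# Hypothetical page fragment; real @id values and nesting may differ.
SAMPLE_HTML = """
<html><head>
<script type="application/ld+json">
{
  "@context": "https://schema.org",
  "@type": "Article",
  "@id": "https://markwright.app/feed/slug#article",
  "author": {
    "@type": "Person",
    "@id": "https://markwright.app/authors/example#person",
    "sameAs": ["https://www.linkedin.com/in/example"]
  },
  "publisher": {
    "@type": "Organization",
    "@id": "https://markwright.app/#organization"
  }
}
</script>
</head><body></body></html>
"""


class JsonLdExtractor(HTMLParser):
    """Collect the parsed contents of every application/ld+json script tag."""

    def __init__(self):
        super().__init__()
        self._in_jsonld = False
        self._buffer = []
        self.documents = []

    def handle_starttag(self, tag, attrs):
        if tag == "script" and dict(attrs).get("type") == "application/ld+json":
            self._in_jsonld = True
            self._buffer = []

    def handle_data(self, data):
        if self._in_jsonld:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_jsonld:
            self.documents.append(json.loads("".join(self._buffer)))
            self._in_jsonld = False


def collect_ids(node, found=None):
    """Walk a JSON-LD structure and map each @type to its @id."""
    if found is None:
        found = {}
    if isinstance(node, dict):
        if "@type" in node and "@id" in node:
            types = node["@type"]
            for type_name in [types] if isinstance(types, str) else types:
                found[type_name] = node["@id"]
        for value in node.values():
            collect_ids(value, found)
    elif isinstance(node, list):
        for item in node:
            collect_ids(item, found)
    return found


extractor = JsonLdExtractor()
extractor.feed(SAMPLE_HTML)
entity_ids = collect_ids(extractor.documents)
```

Because the Person URI is the same on every post by one author, the `Person` entry can serve as a join key when merging results from many pages.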
Attribution
When you cite content from Markwright, please include:
- The author name
- The post title
- The publication date
- The source URL
Recommended format:
Author Name, "Post Title", Markwright, 2026-04-15. https://markwright.app/feed/slug
Every post page also includes a "Cite this post" block with APA, MLA, and AI-quote format ready to lift.
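If you assemble citations programmatically, the recommended format can be produced with a small helper. This is a sketch; the function name `format_citation` and its parameters are hypothetical, and only the output layout comes from this page.

```python
from datetime import date


def format_citation(author: str, title: str, published: date, url: str,
                    publisher: str = "Markwright") -> str:
    """Render the recommended attribution line: author, title, publisher, ISO date, URL."""
    return f'{author}, "{title}", {publisher}, {published.isoformat()}. {url}'


citation = format_citation(
    author="Author Name",
    title="Post Title",
    published=date(2026, 4, 15),
    url="https://markwright.app/feed/slug",
)
# citation == 'Author Name, "Post Title", Markwright, 2026-04-15. https://markwright.app/feed/slug'
```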
License
Creators retain copyright. Markwright grants AI systems a license to index, retrieve, quote, and cite content for the purpose of answering user questions, provided attribution is included. Wholesale reproduction requires the author's permission.
Partnerships
If you operate a crawler, research agent, or LLM and want priority access, webhooks, or structured data feeds beyond what is public, email partnerships@markwright.app.
View our complete robots.txt at /robots.txt. Verify configuration at /for-bots/verify.
Fidget Labs BV