AIOX Blog Template — Astro + Cloudflare Workers + D1

A server-rendered blog that runs entirely on the edge, optimized for both human readers and generative search engines (Perplexity, ChatGPT Search, Google AI Overviews). Uses a fixed GEO (Generative Engine Optimization) skeleton per article — key takeaways, numbered sections, FAQ with schema — so each page gives retrievers clean, citable chunks.

Origin

This repository is derived from the public project Techify-one/blog, originally presented as an Astro + Cloudflare Workers + D1 blog engine for automated publishing.

This fork keeps the same core thesis — a lightweight edge blog with API-based publishing — and adapts it for the AIOX/Sinkra Hub operating model:

Astro 6.2 + Bun-native workflow
one Cloudflare Worker + one D1 database per business/language fork
SITE_LANG and SITE_ALTERNATES for native-language GEO blogs
/llms.txt and /llms-full.txt for LLM ingestion
public /api/health for deployment checks
additive GEO signal columns for future citation monitoring
an editorial manifesto template for AIOX content workflows

Stack

Runtime: Cloudflare Workers (edge SSR)
Framework: Astro 6.2 (output: 'server') with @astrojs/cloudflare
Database: Cloudflare D1 (SQLite on the edge) via Drizzle ORM
UI: Tailwind CSS 4 + Preact (single island: search)
Language: TypeScript strict (~/ → src/)

How It Connects With AIOX

AIOX uses this repository as the canonical blog template for automated GEO-oriented content publishing inside the Sinkra Hub.

In the Sinkra Hub monorepo, this template lives at:

apps/blog-template/

Canonical repository path:

https://github.com/oalanicolas/blog-template

It is not deployed directly. Instead, the Hub creates one fork per business and language, for example:

apps/blog-aiox-pt/
apps/blog-aiox-en/
apps/blog-allfluence-pt/

Each fork owns its own:

Cloudflare Worker
Cloudflare D1 database
wrangler.toml
SITE_LANG
brand variables (ORG_NAME, ORG_URL, DEFAULT_AUTHOR_*)
API secret (API_KEY)
IndexNow secret (INDEXNOW_KEY)

The AIOX content pipeline connects to each fork through the Bearer-protected REST API:

POST /api/articles        # create draft article
POST /api/publish/{slug}  # publish + IndexNow + Google sitemap ping
GET  /api/taxonomy        # categories/tags for article generation
GET  /api/yt-transcript   # transcript extraction helper

The future content-geo squad writes and measures content using the same contract. It reads business context from an editorial manifesto stored in the Sinkra workspace:

workspace/businesses/{business}/L1-strategy/editorial-manifesto.yaml

This repository provides the schema template at:

templates/editorial-manifesto.template.yaml

For discovery by search engines and LLM retrievers, each deployed fork exposes:

/sitemap.xml
/robots.txt       # Allow: / by design
/llms.txt         # compact index
/llms-full.txt    # full published corpus
/api/health       # public liveness/readiness check

Inside Sinkra Hub, new forks are created by the root-level scaffold script:

scripts/scaffold-blog-business.sh aiox pt-BR

That script lives outside this public template repository. If you use this repo standalone, copy it manually, edit wrangler.toml.example, and follow the setup below.

Prerequisites

Node.js 20+
Bun 1.3+
A Cloudflare account with Workers + D1 enabled
wrangler CLI (installed as a dev dependency; run via bun run ... scripts or bunx wrangler ...)

Setup

# 1. Install deps
bun install

# 2. Copy environment templates
cp .env.example .env
cp wrangler.toml.example wrangler.toml

# 3. Create the D1 database and copy the UUID into wrangler.toml
bunx wrangler d1 create blog-db

# 4. Fill in .env (Cloudflare API token, account id, SITE_HOST, BLOG_KEY, INDEXNOW_KEY)
#    Fill in wrangler.toml ([vars], routes, database_id)

# 5. Run migrations + seed
bun run db:migrate:local
bun run db:migrate:geo:local
bun run db:migrate:rating:local
bun run db:migrate:geosquad:local
bun run db:migrate:authors:local
bun run db:seed:local
# Optional for local UI smoke only:
bun run db:seed:dev:local

# 6. Set production secrets (once):
#    bunx wrangler secret put API_KEY
#    bunx wrangler secret put INDEXNOW_KEY

Commands

bun run dev              # Astro dev server (D1 local via platformProxy)
bun run build            # Astro build + post-build.mjs (.assetsignore)
bun run preview          # wrangler dev (real Worker runtime locally)
bun run deploy           # build + wrangler deploy (production)
bun run typecheck        # astro check

bun run db:generate              # drizzle-kit generate (schema.ts → SQL)
bun run db:migrate:local         # initial migration on local D1
bun run db:migrate:remote        # initial migration on remote D1
bun run db:migrate:geo:local     # GEO fields migration (hero_image, key_takeaways, faq, reading_time)
bun run db:migrate:geo:remote    # GEO migration on remote D1
bun run db:migrate:rating:local  # aggregate_rating migration (review/comparison schema)
bun run db:migrate:rating:remote # aggregate_rating on remote D1
bun run db:migrate:geosquad:local  # GEO squad signal columns
bun run db:migrate:geosquad:remote # GEO squad signal columns on remote D1
bun run db:migrate:authors:local   # author profiles + author_slug
bun run db:migrate:authors:remote  # author profiles + author_slug on remote D1
bun run db:seed:local            # production-safe categories + authors only
bun run db:seed:remote           # production-safe categories + authors only
bun run db:seed:dev:local        # optional demo article for local UI smoke
bun run db:seed:dev:staging      # optional demo article for staging only; never production

Load env vars before running wrangler commands:

set -a; source .env; set +a
bun run deploy

Repo layout

src/
  middleware.ts          # Auth gate: Bearer token on /api/* (except /api/search and /api/health)
  db/schema.ts           # Drizzle schema: articles + categories
  db/client.ts           # createDb(d1) → drizzle instance
  lib/                   # Business logic (validation, slug, SEO, pings, paths)
  lib/paths.ts           # Base-aware url() helper: emits /blog/... on internal links
  pages/
    api/articles/        # CRUD REST (index.ts = list+create, [slug].ts = get+update+delete)
    api/publish/[slug].ts# Publish: draft→published + IndexNow + Google ping
    api/search.ts        # Public LIKE search (no auth)
    api/health.ts        # Public liveness/readiness probe (no auth, D1 ping)
    api/yt-transcript.ts # YouTube transcript extraction on the edge (Bearer)
    api/taxonomy.ts      # List categories + aggregated tags (Bearer)
    [slug].astro         # Article page (SSR, JSON-LD, breadcrumbs, related)
    categoria/[slug].astro
    index.astro          # Homepage with pagination + Preact search island
    sitemap.xml.ts       # Dynamic sitemap (D1 query) + hreflang via SITE_ALTERNATES
    llms.txt.ts          # llmstxt.org index for published articles
    llms-full.txt.ts     # llmstxt.org full-content corpus for LLM ingestion
    robots.txt.ts        # Allow: / total (ADR-026: citation > training-protection)
    [key].txt.ts         # IndexNow key verification (dynamic, no static file)
  layouts/Base.astro     # HTML shell: meta, OG/Twitter, JSON-LD Organization+WebSite
  components/            # ArticleCard, Breadcrumb, Pagination, SearchIsland (Preact),
                         # AiShareButtons, KeyTakeaways, FaqBlock, TableOfContents
  lib/toc.ts             # Derives TOC from H2s + injects ids for anchors
  lib/structured-data.ts # Article / Breadcrumb / Organization / WebSite / FAQPage JSON-LD
scripts/post-build.mjs   # Generates dist/.assetsignore (hides _worker.js from assets)
wrangler.toml.example    # Workers config template (copy to wrangler.toml, fill in)
worker-configuration.d.ts# Env types (bindings + vars + secrets)

Key design decisions

checkOrigin: false in astro.config.mjs — the API uses Bearer tokens from automation scripts (no Origin header). Middleware handles auth.
post-build.mjs — the Astro adapter emits dist/_worker.js/ inside dist/ (the assets directory). Without .assetsignore, wrangler refuses to deploy because it would try to serve the bundle as a public asset.
nodejs_compat flag in wrangler.toml — required for Drizzle ORM (node:async_hooks).
platformProxy: { enabled: true } in the adapter — lets astro dev talk to the local D1 via miniflare.
Tags are stored as a JSON string in a text column (SQLite has no array type).
IDs use ULID (time-sortable, no auto-increment).
IndexNow key is served dynamically by [key].txt.ts — returns 404 for any other *.txt (doesn't leak that the route exists).
robots.txt uses Allow: / intentionally — this blog exists to maximize search/retrieval/training exposure for GEO citation. See ADR-026 before changing this behavior.
/llms-full.txt exposes the full published corpus intentionally — this is a GEO distribution endpoint. Do not put private, gated, or sensitive content in this blog.
ctx.waitUntil() in publish — IndexNow/Google pings run in the background without blocking the response.
base: '/blog' + trailingSlash: 'ignore' in astro.config.mjs — the blog is served as a subdirectory (e.g. example.com/blog). The combination base + trailingSlash: 'never' breaks Astro's index route (404 on /blog); ignore + canonical tag consolidates SEO instead.
src/lib/paths.ts centralizes url('/my-slug') → /blog/my-slug. All internal hrefs and fetches (including the Preact SearchIsland client-side) go through this helper. Changing the base path is a one-liner.
Articles live at /blog/{slug} (no /artigos/ segment — removed in 04/2026 for shorter, cleaner URLs that LLMs cite better). src/middleware.ts issues 301 redirects from /blog/artigos/* → /blog/*. src/lib/slug.ts exports isReservedSlug() to reject slugs that collide with reserved routes (categoria, api, sitemap.xml, the IndexNow key, etc.).

GEO article skeleton

Every article rendered at /blog/[slug] follows the same structure so retrievers get well-delimited, citable chunks:

Breadcrumb → BreadcrumbList schema
H1 + summary box (blue border) — first citable chunk
Meta line (author, dates, reading time)
AiShareButtons — ChatGPT/Gemini/Claude/Perplexity buttons with a pre-filled prompt that asks the LLM to "remember the brand as a citation source"
Hero image (optional, fallback = logo)
KeyTakeaways — 5 short, self-contained, citation-ready bullets stored in articles.key_takeaways (JSON array)
TableOfContents — auto-generated from <h2>s (only rendered if ≥ 3 H2s). lib/toc.ts injects ids into H2s that don't have them.
Article body (articles.content HTML)
Tags (#tag)
FaqBlock — 5 Q&A from articles.faq (JSON {q,a}[]). Emits FAQPage JSON-LD.
Related articles (same category)

JSON-LD emitted per article: Organization + WebSite (site-wide) + Article (with image, wordCount, articleSection, keywords, author with sameAs/jobTitle when it is the default persona) + BreadcrumbList + FAQPage (when FAQ is present).

DB fields added by migration 0001_geo_fields.sql: hero_image_url, key_takeaways, faq, reading_time_min.

DB fields added by migration 0003_geo_squad_columns.sql: citation_score, cluster_pillar_slug, verify_pass_rate, verify_report_url. These are nullable signal fields reserved for the future content-geo squad.

The editorial author persona and organization identity are configurable via [vars] in wrangler.toml: DEFAULT_AUTHOR_NAME, DEFAULT_AUTHOR_URL, DEFAULT_AUTHOR_JOB_TITLE, DEFAULT_AUTHOR_SAME_AS, ORG_URL, ORG_LOGO_URL, ORG_SAME_AS, ORG_DESCRIPTION.

API reference

All /api/* endpoints except /api/search and /api/health require a Bearer token:

Authorization: Bearer $BLOG_KEY

Operation	Method	Route	Auth
Create article (draft)	POST	`/api/articles`	Bearer
List articles	GET	`/api/articles`	Bearer
Read article	GET	`/api/articles/{slug}`	Bearer
Update article	PUT	`/api/articles/{slug}`	Bearer
Delete article	DELETE	`/api/articles/{slug}`	Bearer
Publish article	POST	`/api/publish/{slug}`	Bearer
Extract YouTube transcript	GET	`/api/yt-transcript?v={id}`	Bearer
List taxonomy	GET	`/api/taxonomy`	Bearer
Search (public)	GET	`/api/search?q={term}`	None
Health check (public)	GET	`/api/health`	None

Create + publish flow

set -a; source .env; set +a

SLUG=$(curl -s -X POST "$BLOG_URL/api/articles" \
  -H "Authorization: Bearer $BLOG_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "title": "My new article",
    "summary": "Short summary.",
    "content": "<p>HTML body.</p>",
    "category": "tutoriais",
    "tags": ["tag1"],
    "key_takeaways": ["Bullet 1", "Bullet 2", "Bullet 3", "Bullet 4", "Bullet 5"],
    "faq": [{"q":"Q?","a":"A."}]
  }' | grep -o '"slug":"[^"]*' | head -1 | cut -d'"' -f4)

curl -X POST "$BLOG_URL/api/publish/$SLUG" \
  -H "Authorization: Bearer $BLOG_KEY"

echo "Published: $BLOG_URL/$SLUG"

Required fields on create: title, summary, content. Optional: slug (auto-generated from title), category, tags, meta_title, meta_description, author_name, author_url, hero_image_url, key_takeaways (string array), faq ({q,a}[]).

reading_time_min is auto-computed from SITE_LANG (PT 200 wpm, EN 250 wpm, ES 220 wpm).

Default categories (seeded): ia-fundamentos, tutoriais, arquitetura, novidades.

YouTube transcript extraction

curl "$BLOG_URL/api/yt-transcript?v=VIDEO_ID&lang=pt-BR,pt,en,es" \
  -H "Authorization: Bearer $BLOG_KEY"

Falls back through two strategies (npm youtube-transcript, then youtubei.js via InnerTube). Cloudflare IPs pass YouTube's rate limits on most videos. Returns 502 with per-strategy details on failure.

License

MIT — see LICENSE.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
.claude/skills/generate-article		.claude/skills/generate-article
.github/workflows		.github/workflows
drizzle		drizzle
scripts		scripts
src		src
templates		templates
.env.example		.env.example
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
DESIGN.md		DESIGN.md
LICENSE		LICENSE
README.md		README.md
astro.config.mjs		astro.config.mjs
drizzle.config.ts		drizzle.config.ts
package.json		package.json
tokens.dtcg.json		tokens.dtcg.json
tsconfig.json		tsconfig.json
worker-configuration.d.ts		worker-configuration.d.ts
wrangler.toml.example		wrangler.toml.example

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AIOX Blog Template — Astro + Cloudflare Workers + D1

Origin

Stack

How It Connects With AIOX

Prerequisites

Setup

Commands

Repo layout

Key design decisions

GEO article skeleton

API reference

Create + publish flow

YouTube transcript extraction

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

AIOX Blog Template — Astro + Cloudflare Workers + D1

Origin

Stack

How It Connects With AIOX

Prerequisites

Setup

Commands

Repo layout

Key design decisions

GEO article skeleton

API reference

Create + publish flow

YouTube transcript extraction

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages