webmasterid

Privacy-first analytics infrastructure

Analytics for the AI-search era.

WebmasterID is a lightweight, first-party analytics and AI visibility platform built for SEO ecosystems, publishers, AI-native websites, and indie founders. No fingerprinting. No third-party cookies. No bloat.

1 site → 1 generated WebmasterID
AI crawler visibility built in
Edge-ready, ClickHouse-bound
<!-- drop-in tracking tag -->
<script defer src="https://cdn.webmasterid.com/tracker.js" data-wmid="wm_xxxx"></script>

Product philosophy

Search infrastructure observability — not adtech surveillance.

WebmasterID is not a Google Analytics clone. It's purpose-built for the SEO ecosystem and the AI-search era: lightweight, first-party, and developer-first.

  • Privacy-first by default
  • No fingerprinting, ever
  • First-party event ingestion
  • AI-search-aware out of the box
  • Modular, monorepo-native architecture
  • Designed to scale into ClickHouse + edge

AI visibility

See which AI crawlers are reading your site.

WebmasterID detects GPTBot, ClaudeBot, PerplexityBot, Google-Extended, Applebot, Bingbot and more — and routes them to a separate observability path so they never pollute human-event aggregates.

GPTBotClaudeBotPerplexityBotGoogle-ExtendedApplebotBingbotMeta-ExternalAgentCCBot

Privacy

Privacy-first by architecture, not by toggle.

No third-party cookies. No localStorage tracking. Source IPs are anonymized at the ingestion edge — raw IPs never reach storage. DNT and GPC are respected unconditionally.

No fingerprinting

We don't read canvas, fonts, or device entropy.

No third-party cookies

We don't set cookies, period.

IP anonymization

IPv4 last octet zeroed; IPv6 truncated to /48.

DNT / GPC respected

Opted-out browsers send nothing to the server.

Architecture

Infrastructure-grade foundations.

A pnpm + Turborepo monorepo. Strong TypeScript everywhere. PostgreSQL-ready via Drizzle, designed to migrate to ClickHouse and distributed edge ingestion as scale demands.

/apps
  /web          — public marketing site
  /dashboard    — operator dashboard shell
  /ingest-api   — Fastify event ingestion

/packages
  /tracker-js       — browser tracker (defer-safe, ~few KB)
  /sdk-next         — <WebmasterID siteId="wm_xxxx" />
  /database         — Drizzle schema (PostgreSQL)
  /shared           — types, Zod schemas, IDs
  /privacy          — IP anon, DNT, consent, retention
  /ai-visibility    — bot detection + AI referrals
  /seo-intelligence — search-engine + referrer classification

Roadmap

What we're building next.

Today is the foundation: clean ingestion, privacy primitives, AI crawler detection, and a strongly-typed event schema. From here, scale and intelligence.

  1. 01MVP: typed ingestion, AI crawler detection, dashboard shell
  2. 02Generated tracking tags + first-class GitHub PR install workflow
  3. 03AI referral analytics (ChatGPT, Claude, Perplexity, Gemini)
  4. 04Edge ingestion + per-site write-behind buffering
  5. 05ClickHouse migration for query-heavy aggregation
  6. 06Claude Code & Codex assisted installation