Build log & updates for the dom.to network

99M
Domains
1.5M
Screenshots
2.8M
WHOIS Records
4.7M
Unique SLDs
150K
Embeddings
13
Live Services

V2 Schema & Bloom Filter Deduplication

Feature Architecture Performance

Implemented V2 Schema Architecture for scaling to 500M+ domains:

  • Math-indexed keys: Base-37 SLD encoding (a-z=1-26, 0-9=27-36, hyphen=37)
  • Prefix range queries: Find all domains starting with "app" using index range scans
  • Trigram search: Fast substring matching via GIN indexes on encoded arrays
  • Dual key strategy: NUMERIC for ranges, BIGINT hash for JOINs
  • Imported 1.07M SLDs and 1.15M domains from Crux top million

Built Bloom Filter Deduplication Layer for fast SLD ingestion:

  • 184k inserts/sec and 276k lookups/sec without database queries
  • 200M capacity with ~0.5% false positive rate, zero false negatives
  • Time segmentation: bloom_ever (global) + bloom_YYYY (yearly)
  • xxhash64 hashing with 10 hash functions
  • Initialized with 4.67M existing SLDs from sld_keywords table

DNS scanning validated with 82% resolution rate on Crux .com domains using Google DNS (8.8.8.8).

sinkhole.dom.to Launch & Cookie Popup Blocking

Feature Launch Fix

Launched sinkhole.dom.to - a blocklist service for web crawlers:

  • 6,188 sinkholed domains identified from WHOIS nameserver patterns
  • Downloadable blocklists in TXT and JSON formats
  • Sinkhole IP addresses to block at firewall level
  • API endpoints for programmatic access

Added cookie popup blocking to screenshot processor:

  • CSS injection for common consent frameworks (OneTrust, Cookiebot, etc.)
  • Pre-set consent cookies to bypass GDPR prompts
  • puppeteer-extra with StealthPlugin for better detection evasion

Imported 2.95M new domains from Chrome User Experience Report (CrUX) covering 30 countries. Total domains tracked now exceeds 7 million.

Screenshot Processor v3 & New Services

Feature Infrastructure

Major infrastructure upgrade with screenshot_processor_v3:

  • Multi-queue architecture: active, passive, and scan queues
  • Desktop screenshots for developed sites (entropy >= 5.0)
  • Content extraction with Readability for semantic search
  • PostgreSQL dual-write for database migration

Launched several new services:

  • scan.dom.to - URL link scanner, extract outbound links from any page
  • og.dom.to - OpenGraph image gallery, see social previews
  • stats.dom.to - Real-time system statistics dashboard
  • list.dom.to - Word lists for domainers
  • xyz.dom.to - .xyz domain search and sales

Remote Screenshot Workers & PostgreSQL Migration

Data Infrastructure

Deployed remote screenshot workers for distributed processing:

  • Workers connect via Cloudflare Tunnels for secure Redis access
  • Direct B2 uploads - no central bottleneck
  • Results reported back via HTTPS API
  • Easy deployment with curl | bash installer

Started PostgreSQL migration from SQLite:

  • Dual-write mode syncs data to both databases
  • 7M+ master_domains migrated
  • Full-text search with pg_trgm extension

tools.dom.to & VPS Deployment Pipeline

Feature Launch

Launched tools.dom.to - curated directory of 50+ domain tools:

  • Categories: Appraisal, Auctions, Research, Registrars, Management
  • Search and filter by category
  • Direct links to each tool

Established VPS deployment pipeline on IPv6-only server:

  • 15 standalone services running on ports 3001-3015
  • nginx reverse proxy with IPv6 listen directives
  • PM2 for process management and auto-restart
  • Cloudflare DNS for all subdomains

short.dom.to Launch & Homepage Update

Feature Launch

Launched short.dom.to - find available 2-4 letter domains across 1000+ TLDs:

  • Real-time availability checking via Dynadot API
  • Filter by length (2, 3, or 4 letters)
  • Search all TLDs at once or filter by specific extensions
  • Direct registration links to registrars

Updated the dom.to homepage with cleaner organization - mod.dom.to now listed in coming soon section as the official build log.

Homepage Reorganization & mod.dom.to Launch

Feature Launch

Major update to the dom.to homepage with a cleaner organization:

  • New ccTLD Search section grouping ai.dom.to, in.dom.to, and upcoming io.dom.to
  • Live services and coming soon items now in alphabetical order
  • Launched mod.dom.to as the official build log and updates blog

Also fixed the bulk search SSE streaming on search.dom.to - added keep-alive pings every 5 seconds to prevent Cloudflare from timing out idle connections.

ai.dom.to Launch & NSFW Handling

Feature ai.dom.to

Launched ai.dom.to - semantic search for .ai domains. Browse 20K+ AI tools and products categorized by type: video, audio, image, chat, code, writing.

  • Added registration & expiry dates to domain cards
  • Changed NSFW handling from filtering to blurring with click-to-reveal
  • Created admin dashboard at /ai-admin.html for analytics tracking
  • Built TLD coverage dashboard showing data completeness across all extensions

Current .ai coverage: 398K domains, 12% screenshots, 86% WHOIS, 2% content extracted.

check.dom.to Bulk Domain Checker

Feature check.dom.to

Launched check.dom.to - bulk domain status checker supporting up to 50 domains per check.

  • Real-time SSE streaming shows results as they're processed
  • Automatic screenshot & WHOIS queuing for unknown domains
  • Domain categorization: Developed, Parked, For Sale, Not Resolving
  • Export results by category with copy-to-clipboard
  • Rate limiting per IP to prevent abuse

Bulk Search Feature for search.dom.to

Feature search.dom.to

Added bulk keyword search to search.dom.to - check up to 50 keywords at once and see which domains are registered across all TLDs.

  • SSE streaming for real-time results
  • Automatic screenshot queuing for domains without images
  • Search history tracking for logged-in users
  • Redis pub/sub for distributed result delivery

in.dom.to Semantic Search Launch

Feature in.dom.to

Launched in.dom.to - semantic search for 1.7M+ .in domains using OpenAI embeddings. Search by meaning, not just keywords.

  • FAISS vector similarity search for fast results
  • Category browsing: Finance, Health, Education, Tech, Food, Travel
  • Content-based ranking using page titles and extracted text
  • Premium and freemium domain highlighting

find.dom.to Domain Intelligence

Feature find.dom.to

Launched find.dom.to - single domain lookup with comprehensive intelligence:

  • WHOIS data with registrar, dates, and nameservers
  • Screenshot capture with mobile viewport
  • DNS records (A, AAAA, MX, NS, TXT)
  • Buyer discovery - find potential buyers based on related domains
  • Historical WHOIS tracking

search.dom.to Goes Live

Launch search.dom.to

Launched search.dom.to - the first public service in the dom.to network. Search registered domains across all extensions.

  • Keyword-based domain search across 1000+ TLDs
  • Screenshot thumbnails for visual browsing
  • WHOIS information display
  • Pricing with Stripe integration

Infrastructure Foundation

Data Infrastructure

Built the core infrastructure for the dom.to network:

  • Certstream monitoring - Real-time CT log monitoring for new domain registrations
  • Screenshot pipeline - Puppeteer-based capture with Backblaze B2 storage
  • WHOIS collection - Rate-limited WHOIS data gathering across 300+ TLDs
  • Content extraction - Readability-based text extraction for semantic search
  • Bull queues - Redis-based job queues for distributed processing

Started with .in domains (India's ccTLD) as the primary focus, then expanded to .ai, .io, and other popular extensions.