What it does
Turns webpages into clean, structured content that is easier for AI systems to work with.
Web extraction for AI agents
Distiller turns messy webpages into clean Markdown, HTML, and text so AI agents can read, summarize, and act on web content without fighting page clutter.
Most web pages are built for browsers, ads, and layouts. Distiller gives you the useful content instead of the noise.
It combines extraction, simple public access, and agent-friendly usage patterns in one place.
Live Demo
Start free with Trafilatura cleaning. Upgrade to paid AI cleaning for higher-quality normalization and priority extraction.
A clean starting point for summaries, extraction, and downstream agent tasks.
Paste a URL above to try the live demo. Get a free API key to start building.
Paid AI tier is available through Stripe checkout after signup.
When an agent reads a page, it usually wants the article, product details, pricing, or help content, not every script, widget, and layout fragment around it.
Use Distiller to power research agents, OpenClaw tools, internal automations, or public pay-per-use APIs. The payment rail is there when you need it, but the core value is clean, reliable web content.
web-distiller https://example.com --format markdown
Why Distiller
Your agent gets better answers from clean Markdown than from raw HTML. Distiller handles the extraction step your agent would have to do anyway, and does it better, faster, and with results cached across users.
Start free, upgrade when you need AI-quality cleaning. Estimated savings assume roughly 50,000 tokens for a raw page versus roughly 12,000 tokens for the cleaned version, priced at GPT-4.1 input rates.
Overage: $0.006/call beyond plan limit · 100K+? Contact us
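The savings assumption above can be worked through as simple arithmetic. The sketch below is illustrative only: the token counts and the overage fee come from the text, while the GPT-4.1 input price of $2.00 per million tokens is an assumption that should be checked against current pricing. The function names are hypothetical.

```python
# Figures stated in the text.
RAW_TOKENS = 50_000          # approximate tokens in a raw page
CLEAN_TOKENS = 12_000        # approximate tokens after cleaning
OVERAGE_PER_CALL_USD = 0.006 # overage fee per call beyond the plan limit

# ASSUMPTION (not from the source): GPT-4.1 input price in USD per 1M tokens.
PRICE_PER_MILLION_USD = 2.00


def input_cost(tokens: int, price_per_million: float = PRICE_PER_MILLION_USD) -> float:
    """Cost in USD of sending `tokens` input tokens to the model."""
    return tokens * price_per_million / 1_000_000


def savings_per_page(raw: int = RAW_TOKENS,
                     clean: int = CLEAN_TOKENS,
                     price_per_million: float = PRICE_PER_MILLION_USD) -> float:
    """Input cost avoided per page by sending cleaned content instead of raw."""
    return input_cost(raw, price_per_million) - input_cost(clean, price_per_million)


def token_reduction(raw: int = RAW_TOKENS, clean: int = CLEAN_TOKENS) -> float:
    """Fraction of input tokens removed by cleaning."""
    return (raw - clean) / raw


if __name__ == "__main__":
    print(f"Token reduction: {token_reduction():.0%}")
    print(f"Savings per page: ${savings_per_page():.3f}")
    print(f"Net of one overage call: ${savings_per_page() - OVERAGE_PER_CALL_USD:.3f}")
```

Under these assumptions the avoided model input cost per page is larger than the per-call overage fee. The result scales linearly with the model price, so a cheaper model narrows the gap.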
Distiller is a web extraction API that turns public webpages into clean Markdown, HTML, and text for AI agents and automation systems.
Teams should use Distiller when they need cleaner web inputs for agents, retrieval workflows, research automation, or OpenClaw tools instead of raw page source.
Yes. Distiller starts with lightweight HTTP extraction and can fall back to browser rendering when the useful content depends on JavaScript.
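The fallback behavior described above follows a common pattern: try the cheap path first, then escalate only when the result looks empty. The sketch below illustrates that pattern in general terms and is not Distiller's actual implementation. The function names, the character threshold, and the stub fetchers are all hypothetical.

```python
from typing import Callable, Tuple

# Hypothetical threshold: below this many characters, assume the page
# needs JavaScript rendering to expose its useful content.
MIN_USEFUL_CHARS = 200


def extract_with_fallback(
    url: str,
    fetch_static: Callable[[str], str],
    render_browser: Callable[[str], str],
    min_chars: int = MIN_USEFUL_CHARS,
) -> Tuple[str, str]:
    """Return (text, method), where method is "http" or "browser".

    `fetch_static` and `render_browser` are caller-supplied functions that
    take a URL and return extracted text.
    """
    text = fetch_static(url)
    if len(text.strip()) >= min_chars:
        return text, "http"
    # Static extraction produced too little content, so escalate.
    return render_browser(url), "browser"


if __name__ == "__main__":
    # Stub fetchers standing in for real HTTP and headless-browser calls.
    def static_stub(url: str) -> str:
        return ""  # simulates a JavaScript-only page with an empty shell

    def browser_stub(url: str) -> str:
        return "Rendered article text. " * 20

    text, method = extract_with_fallback(
        "https://example.com", static_stub, browser_stub
    )
    print(method, len(text))
```

Passing the fetchers in as arguments keeps the decision logic testable without network access. A length threshold is only one possible signal for deciding when to escalate.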
Distiller converts public webpages into clean Markdown, HTML, and plain text for AI workflows.
Use Markdown for LLM reading and summarization. Use cleaned HTML when links, headings, and structure matter to the task.