AI-era web tools — 100% in-browser, nothing uploaded

Tools for the AI-Era Web
llms.txt, AI robots.txt & Token Counter

The web is being read by machines now. aitools.hjlabs.in gives you three free, privacy-first tools to control and measure that: generate an llms.txt to guide AI crawlers, build an AI-focused robots.txt to allow or block GPTBot, ClaudeBot, CCBot, Google-Extended & PerplexityBot, and estimate GPT/Claude token counts for any text. No signup, no upload.

🤖 3 AI tools
🔒 100% client-side
Instant output
💰 Free, no signup

Three tools for AI visibility & control

Each tool is a single-purpose, no-nonsense utility. Pick what you need — they all run entirely in your browser.

llms.txt Generator

Create an llms.txt file, the proposed convention for telling ChatGPT, Claude, Perplexity and other AI models which pages on your site matter most. Enter your site name, description and key pages, then get a clean markdown file that follows the proposal's format, ready to drop at your domain root.

AI robots.txt Generator

Decide which AI crawlers may train on or fetch your content. Toggle GPTBot, ClaudeBot, CCBot, Google-Extended, PerplexityBot, Bytespider and more, then copy a ready-to-publish robots.txt. It is a quick way to block AI crawlers or to selectively allow them.

GPT & Claude Token Counter

Paste any prompt or document and instantly see an estimated GPT and Claude token count, plus characters, words and an approximate API cost. Perfect for staying under context limits and budgeting prompts. Runs locally — your text never leaves the page.

Why the AI-era web needs new tools

For two decades, the contract between websites and machines was simple: a robots.txt file and an XML sitemap told search-engine crawlers what to index, and that was that. Large language models broke that contract. ChatGPT, Claude, Gemini, Perplexity and a growing fleet of AI agents now read the open web to train models, to answer questions in real time, and to cite sources inside generated answers. Your site is no longer just being ranked — it is being read, summarized, and quoted by machines. That shift created an entirely new category of decisions for every site owner, and almost no tooling exists to make those decisions easy. That is the gap aitools.hjlabs.in fills.

1. Tell AI models what matters: llms.txt

The llms.txt proposal is an emerging convention: a single markdown file at the root of your domain that gives language models a curated map of your most important content. Instead of an LLM guessing which of your 4,000 pages are canonical, you hand it a short, structured list: here is what this site is, here are the key docs, here is the pricing page, here is the API reference. A good llms.txt is meant to make a site easier for AI to understand and to cite accurately. Our llms.txt generator turns a few inputs into a file that follows the proposal's format in seconds. We author these for our own properties, so we productized it.
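
As a rough illustration of the file layout described above (not the generator's actual code), the sketch below renders an H1 title, a blockquote summary, and H2 sections of markdown links, which is the shape the llms.txt proposal describes. The names `Link`, `Section` and `build_llms_txt`, and the example.com URLs, are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Link:
    title: str
    url: str
    note: str = ""  # optional short description shown after the link


@dataclass
class Section:
    heading: str
    links: list[Link] = field(default_factory=list)


def build_llms_txt(site_name: str, summary: str, sections: list[Section]) -> str:
    """Render an llms.txt body: H1 title, blockquote summary, H2 link lists."""
    lines = [f"# {site_name}", "", f"> {summary}"]
    for section in sections:
        lines += ["", f"## {section.heading}", ""]
        for link in section.links:
            entry = f"- [{link.title}]({link.url})"
            if link.note:
                entry += f": {link.note}"
            lines.append(entry)
    return "\n".join(lines) + "\n"


# Hypothetical site used only for illustration.
example = build_llms_txt(
    "Example Co",
    "Example Co sells widgets and documents a public API.",
    [
        Section("Docs", [Link("API reference", "https://example.com/docs/api.md", "endpoints and auth")]),
        Section("Optional", [Link("Pricing", "https://example.com/pricing.md")]),
    ],
)
print(example)
```

The printed text would be saved as `/llms.txt` at the domain root. Keeping the list short is the point: the file is a curated map, not a second sitemap.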

2. Control AI crawlers: AI robots.txt

Not every site wants its content used to train the next foundation model, and publishers in particular are drawing hard lines. The mechanism is still robots.txt, which works for crawlers that honor it, but the user-agents are new: GPTBot (OpenAI training), ClaudeBot (Anthropic), CCBot (Common Crawl, which feeds many models), Google-Extended (a control token for Gemini training, read by Google's existing crawlers rather than a separate bot), PerplexityBot, Bytespider (TikTok/ByteDance) and others. Getting the exact agent strings and directives right is fiddly, and a single typo fails silently, because a rule for a misspelled agent simply matches nothing. Our AI robots.txt generator gives you a clean toggle per crawler and emits a correct file, whether you want to block AI crawlers entirely or allow some and deny others.
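
A minimal sketch of the per-crawler toggle idea, assuming one `User-agent` group per bot and a permissive fallback for everyone else. This is illustrative rather than the generator's own code: `AI_AGENTS` and `build_ai_robots_txt` are hypothetical names, and the agent list is only the one named in the paragraph above. The standard-library `urllib.robotparser` is used to check that the output parses as intended.

```python
from urllib.robotparser import RobotFileParser

# Agent tokens taken from the text above; extend as needed.
AI_AGENTS = ["GPTBot", "ClaudeBot", "CCBot", "Google-Extended", "PerplexityBot", "Bytespider"]


def build_ai_robots_txt(blocked: set[str]) -> str:
    """Emit one group per known AI agent: Disallow if blocked, Allow otherwise."""
    unknown = blocked - set(AI_AGENTS)
    if unknown:
        # Catch typos early; a misspelled agent would otherwise match nothing.
        raise ValueError(f"unknown agent(s): {sorted(unknown)}")
    groups = []
    for agent in AI_AGENTS:
        rule = "Disallow: /" if agent in blocked else "Allow: /"
        groups.append(f"User-agent: {agent}\n{rule}")
    groups.append("User-agent: *\nAllow: /")  # everyone else stays allowed
    return "\n\n".join(groups) + "\n"


robots = build_ai_robots_txt({"GPTBot", "CCBot"})
print(robots)

# Sanity check: parse the generated file and query it.
parser = RobotFileParser()
parser.parse(robots.splitlines())
print(parser.can_fetch("GPTBot", "https://example.com/page"))
print(parser.can_fetch("ClaudeBot", "https://example.com/page"))
```

Validating the blocked set against a known list addresses the silent-typo problem directly: the mistake surfaces when the file is generated instead of going unnoticed in production.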

3. Measure before you send: token counter

Everything you send to or receive from an LLM is billed and bounded in tokens, not words. Underestimate and your request may be truncated or rejected for exceeding the context window; overestimate and you cut content you could have kept or budget more than you need. Our token counter gives an instant client-side estimate of GPT and Claude tokens for any text, alongside characters, words, and an approximate API cost, so you can budget a prompt or check that a document fits a model's context window before you ever call the API.
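
To make the budgeting arithmetic concrete, here is a deliberately crude sketch. It assumes roughly four characters per token for English prose, which is a common rule of thumb and not the BPE approximation the token counter uses. The function names, the price, and the context-window size are all hypothetical placeholders; check your provider's current figures.

```python
import math

# Assumption: about 4 characters per token for English prose (rule of thumb only).
CHARS_PER_TOKEN = 4.0


def estimate_tokens(text: str, chars_per_token: float = CHARS_PER_TOKEN) -> int:
    """Rough token estimate from character count, rounded up."""
    if not text:
        return 0
    return math.ceil(len(text) / chars_per_token)


def estimate_cost(tokens: int, usd_per_million_tokens: float) -> float:
    """Approximate cost in USD for a given token count and per-million price."""
    return tokens * usd_per_million_tokens / 1_000_000


def fits_context(tokens: int, context_window: int, reserved_for_output: int = 0) -> bool:
    """True if the prompt plus the space reserved for the reply fits the window."""
    return tokens + reserved_for_output <= context_window


prompt = "Summarize the following article in three bullet points."
tokens = estimate_tokens(prompt)

# Hypothetical price and window size, for illustration only.
print(tokens, "tokens (estimated)")
print(f"${estimate_cost(tokens, usd_per_million_tokens=3.0):.6f}")
print(fits_context(tokens, context_window=8_000, reserved_for_output=1_000))
```

Reserving room for the model's reply matters because the context window covers input and output together, so a prompt that only just fits leaves no space for an answer.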

Privacy by design

Every tool here runs 100% in your browser. Your site structure, your robots rules, your prompts and documents are processed locally with JavaScript and never transmitted to any server. There is no account, no tracking of your inputs, and no upload. This is the same privacy-first, edge-deployed approach behind our sister tools at fmt.hjlabs.in and pixel.hjlabs.in.

Built by people who use them

hjLabs.in operates a family of web properties, and we maintain llms.txt and AI-crawler policies across all of them. These tools are the internal utilities we built for ourselves, polished and made free for everyone. The AI-era web is still being defined, and getting llms.txt and robots.txt right today is one of the cheapest, highest-leverage things a site owner can do for generative engine optimization (GEO) and AI visibility.

Frequently Asked Questions

What are AI web tools?
Utilities built for the AI-era web: managing how LLMs and AI crawlers interact with your site. We offer an llms.txt generator, an AI robots.txt builder, and a GPT/Claude token counter — all free and 100% client-side.
Is llms.txt the same as robots.txt?
No. robots.txt controls which URLs crawlers may fetch. llms.txt is a complementary proposal that gives AI models a curated markdown map of your key content so they can understand and cite your site accurately. Most sites should publish both.
Are these tools really free?
Yes — completely free, no signup, no watermark. Everything runs in your browser; nothing is uploaded to a server.
How accurate is the token counter?
It uses a fast client-side BPE approximation, typically within a few percent of official tiktoken/Claude counts for English prose. It is labeled an estimate: great for prompt budgeting, but use the official tokenizer or the provider's token-counting API for billing-critical counts.