✨ Get 25% OFFon any plan. Use the coupon:

llms.txt: what the file that hands your content to AIs is

By Tiago CostaUpdated on July 2, 2026

Illustration of a text file guiding an AI through the site's content, representing llms.txt.
Definition

llms.txt is a text file at the root of a site (at /llms.txt) that guides AIs about your content. It usually lists:

  • a short description of the site or brand;
  • the most important pages, as links;
  • a short summary of what each page brings;
  • reference materials the AI should prioritize.

What the llms.txt file is

llms.txt is a simple text file, published at the root of a site (at the address /llms.txt), created to guide language models on which content matters and how to interpret it. The idea is to give the AI a curated map of the site, instead of leaving it to rely only on what it can crawl on its own.

The most common analogy is with robots.txt, the file that guides search crawlers on what they can access. llms.txt follows the same logic of a convention at the root of the site, but with a different audience and goal: instead of saying what to block, it highlights what to prioritize and summarizes the content for AI models.

That is why llms.txt usually enters the conversation about GEO and about optimizing sites for artificial intelligence. The promise is to make it easier for the AI to understand your content and, with that, increase the chance of it being cited and mentioned in the answers.

llms.txt, robots.txt and sitemap: what are the differences

The three are convention files that live on the site, but each speaks to a different audience and solves a different problem:

FileFor whomFunction
robots.txtSearch crawlersSay what can or cannot be accessed
sitemap.xmlSearch enginesList all URLs for indexing
llms.txtAI models (LLMs)Highlight and summarize the most important content

The difference in intent is the central point. The sitemap wants the AI or search engine to find everything, while llms.txt wants the opposite: to point out the essential and give context. It is worth remembering that llms.txt does not block anything nor control the crawler, it only suggests a cleaner reading path for the models.

Infographic comparing robots.txt, sitemap.xml and llms.txt and the function of each file on the site.
The difference in function between robots.txt, sitemap.xml and llms.txt on the site.

What the structure of an llms.txt file looks like

llms.txt is written in Markdown, the same lightweight format used in many text documents, which makes it easy to read for both humans and machines. The most common structure follows this order:

  • Title and description: the name of the site or brand and a sentence explaining what it does.
  • Sections by theme: blocks that group related pages, such as Documentation, Products or Guides.
  • Commented links: each link comes with a short summary of what the page brings, to give context to the model.
  • Optional content: a final section with secondary materials, which the AI can ignore if it needs something leaner.

Some sites also publish an expanded version, llms-full.txt, which gathers the full content of the main pages into a single file. The golden rule is clarity: the more direct and well summarized the file, the easier it becomes for the model to understand your site's structure.

Does llms.txt work? What the data says

Here honesty is needed. llms.txt is a community proposal, not a standard the big AI providers committed to following. In practice, adoption by the models is still low.

A study by Ahrefs analyzed 137,000 domains and showed the size of the gap: among the sites that already published a valid llms.txt, 97% received no request for the file in May 2026. And, of the traffic that did exist, AI bots linked to tools like ChatGPT and Perplexity accounted for only 1% of the visits. Most of the visits came from SEO audit tools and other bots, not from AIs actually reading the file.

The balanced read is this: today llms.txt is rarely read by the models, so do not expect a jump in AI visibility just from publishing it. It is a low-cost bet on a standard that may gain traction, not a guaranteed lever of results.

Is it worth creating an llms.txt today

Even with low adoption, creating an llms.txt can make sense, as long as you keep realistic expectations. The arguments in favor:

  • Low cost: it is a simple text file, quick to build and maintain.
  • A bet on the future: if the standard takes hold, being early can give an edge.
  • Organization: the exercise of summarizing the main pages helps clarify your own site structure.

But do not confuse priorities. What moves the needle today is still the content itself: clear answers, trustworthy data and authority, the base of optimization for generative engines and of answer engine optimization. llms.txt is an extra, not a replacement for good content the AI wants to cite.

Illustration of the structure of an llms.txt file with title, description, sections of commented links and optional content.

How to create and publish an llms.txt

Building an llms.txt is straightforward. A simple step by step:

  • List the essential pages: choose what best represents your site, such as product pages, guides and reference content.
  • Write in Markdown: start with the title, a short description and organize the links by section.
  • Summarize each link: add a sentence explaining what each page brings, to give context.
  • Save it as llms.txt: use plain text and name the file exactly like that.
  • Publish at the root: make the file accessible at yourdomain.com/llms.txt.

After publishing, keep the file up to date whenever the site structure changes. And remember that the main work of showing up in AI answers, what sustains AI citation, is still quality content, not the file itself.

FAQ

Frequently asked questions

What is the llms.txt file?

It is a standardized text file, placed at the root of a site at /llms.txt, that guides language models on which content to prioritize and how to use it. It works like a kind of robots.txt aimed at AIs, highlighting the most important pages with a summary of each one.

What is the difference between llms.txt and robots.txt?

robots.txt guides search crawlers on what can or cannot be accessed. llms.txt speaks to AI models and does the opposite: instead of blocking, it highlights and summarizes the most important content to make reading easier. One controls access, the other suggests priority.

Does llms.txt really work?

Today, little. llms.txt is a community proposal, and the big AI providers have not yet committed to reading it. An Ahrefs study of 137,000 domains showed that 97% of valid files received no request in May 2026, and only 1% of the visits came from AI bots.

How do I create an llms.txt file?

List the essential pages of the site, write in Markdown starting with a title and description, organize the links by section with a summary of each one, save it as plain text named llms.txt and publish it at the root, at yourdomain.com/llms.txt. Then keep the file up to date.

Is it worth creating an llms.txt now?

It can be worth it as a low-cost bet, since it is quick to build and may gain traction in the future. But with realistic expectations: it is rarely read by the models today, so it does not replace what actually moves AI visibility, which is clear content with data and authority.

Content ready for AIs automatically

Automarticles creates and optimizes your blog's articles on its own, with the clarity and structure that make AIs understand and cite your content.

Start free trial
Keep learning

Related concepts

Robots.txtRobots.txt is a plain text file, saved in the root of a domain, that tells search engine crawlers which parts of a site they can or cannot crawl. It follows the Robots Exclusion Protocol and controls crawling, not indexing, so it is not the right tool to hide a page from search results.GEOGEO, short for Generative Engine Optimization, is the set of practices that make your content get cited and used by artificial intelligence search engines, such as ChatGPT, Google's AI Overviews, Perplexity and Gemini. Instead of competing only for a position in the list of links, the goal of GEO is to become one of the sources the model chooses to build the generated answer.AI citationAI citation is the reference to a specific source that an artificial intelligence assistant, such as ChatGPT, Perplexity or Google's AI Overviews, lists as the origin of the answer it generated. In practice, it is your site appearing among the links or source notes the AI shows next to the text, crediting the content it used to answer. Being cited is one of the main goals of anyone optimizing content for AI search engines.AI visibilityAI visibility is the degree to which a brand or a piece of content shows up in the answers generated by artificial intelligence search engines, such as ChatGPT, Google's AI Overviews, Perplexity and Gemini. While classic SEO measures positions and clicks, AI visibility measures how often the AI mentions, cites or recommends your brand when answering people, even when nobody clicks a link.