llmstxt-generator

Stars: 284

llms.txt Generator produces text files for LLM (large language model) training and inference. It crawls a website and combines the content into a single consolidated text file, in both a standard and a full version. The tool is available through a web interface or an API, and basic usage does not require an API key. Crawling is powered by Firecrawl and text processing by GPT-4-mini.

README:

llms.txt Generator 🚀

Generate consolidated text files from websites for LLM training and inference. Powered by @firecrawl_dev for web crawling and GPT-4-mini for text processing.

Features

  • Crawls websites and combines content into a single text file
  • Generates both standard (llms.txt) and full (llms-full.txt) versions
  • Web interface and API access available
  • No API key required for basic usage

Usage

Web Interface

Visit llmstxt.firecrawl.dev to generate files through the browser.

API Endpoint

GET https://llmstxt.firecrawl.dev/[YOUR_URL_HERE]

Note: Processing may take several minutes due to crawling and LLM operations.
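
As an illustration of the endpoint pattern above, here is a minimal Python sketch of a client. It is not part of the project. The function names, the 10-minute timeout, and the choice to append the target URL verbatim (without percent-encoding) are assumptions based only on the pattern shown, and the response is returned as raw text because the README does not document its format.

```python
from urllib.parse import urlsplit
from urllib.request import urlopen

# Base URL taken from the README; everything else here is illustrative.
GENERATOR_BASE = "https://llmstxt.firecrawl.dev"


def build_request_url(target_url: str) -> str:
    """Build the GET URL by appending the target site to the endpoint."""
    parts = urlsplit(target_url)
    if parts.scheme not in ("http", "https") or not parts.netloc:
        raise ValueError(f"expected an absolute http(s) URL, got {target_url!r}")
    # Assumption: the target is appended as-is, mirroring [YOUR_URL_HERE].
    return f"{GENERATOR_BASE}/{target_url}"


def fetch_generated_text(target_url: str, timeout_s: float = 600.0) -> str:
    """Request generation and return the raw response body as text.

    The generous default timeout reflects the note that processing
    may take several minutes.
    """
    with urlopen(build_request_url(target_url), timeout=timeout_s) as resp:
        return resp.read().decode("utf-8", errors="replace")
```

Calling fetch_generated_text("https://example.com") would issue a single blocking request. A long timeout matters more than retries here, since the server is crawling and running LLM operations before it responds.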

Local Development

Prerequisites

Create a .env file with the following variables:

FIRECRAWL_API_KEY=
SUPABASE_URL=
SUPABASE_KEY=
OPENAI_API_KEY=
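
Before starting the dev server, it can help to confirm that all four variables are filled in. The following is a small sketch of such a check, not something the project ships. The helper names are hypothetical, and the parser handles only simple KEY=value lines, which is an assumption about how the .env file is written.

```python
# Variable names copied from the README's .env listing.
REQUIRED_KEYS = (
    "FIRECRAWL_API_KEY",
    "SUPABASE_URL",
    "SUPABASE_KEY",
    "OPENAI_API_KEY",
)


def parse_env(text: str) -> dict[str, str]:
    """Parse simple KEY=value lines, skipping blanks and # comments."""
    values: dict[str, str] = {}
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Strip optional surrounding quotes from the value.
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def missing_keys(text: str) -> list[str]:
    """Return the required keys that are absent or have an empty value."""
    values = parse_env(text)
    return [key for key in REQUIRED_KEYS if not values.get(key)]
```

For example, passing the contents of the .env file to missing_keys returns an empty list once every variable has a value.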

Installation

npm install
npm run dev
