Public repository hosting auto-generated SEO and LLM context files for englishcollege.com.
| File | Description | Updated |
|---|---|---|
| sitemap.xml | Merged multilingual sitemap (EN + 8 languages, ~655 URLs) | Every 12 hours |
| llms.txt | LLM-friendly context extracted from sitemap | Every 12 hours |

GitHub Pages:
- https://cagdasunal.github.io/englishcollege/sitemap.xml
- https://cagdasunal.github.io/englishcollege/llms.txt

Raw files:
- https://raw.githubusercontent.com/cagdasunal/englishcollege/main/sitemap.xml
- https://raw.githubusercontent.com/cagdasunal/englishcollege/main/llms.txt

A GitHub Actions workflow runs every 12 hours:
1. Sitemap generation (tools/generate_master_sitemap.py): fetches 9 sitemaps (1 primary + 8 regional languages), deduplicates translated blog posts, and merges them into one sitemap.xml.
2. LLMs.txt generation (tools/generate_llms.sh): reads the sitemap and uses the llmstxt npm package to extract page content into a structured context file for LLMs.
3. Auto-commit: if either file changed, the workflow commits and pushes. GitHub Pages then serves the updated files.
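
The merge step can be sketched roughly as follows. This is a minimal illustration, not the contents of tools/generate_master_sitemap.py: the function names `extract_urls` and `merge_sitemaps` are hypothetical, the sitemaps are passed in as already-fetched XML strings, and deduplication is assumed to mean "drop exact duplicate URLs", which may be simpler than how the real script matches translated blog posts.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the sitemaps.org protocol.
SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def extract_urls(sitemap_xml: str) -> list[str]:
    """Return every <loc> value found in one sitemap document."""
    root = ET.fromstring(sitemap_xml)
    return [
        loc.text.strip()
        for loc in root.iter(f"{{{SITEMAP_NS}}}loc")
        if loc.text and loc.text.strip()
    ]


def merge_sitemaps(sitemap_documents: list[str]) -> str:
    """Merge several sitemaps into one, keeping the first copy of each URL.

    Assumption: duplicates are detected by exact URL match only.
    """
    ET.register_namespace("", SITEMAP_NS)  # serialize without an ns0: prefix
    seen: set[str] = set()
    urlset = ET.Element(f"{{{SITEMAP_NS}}}urlset")
    for document in sitemap_documents:
        for url in extract_urls(document):
            if url in seen:
                continue
            seen.add(url)
            entry = ET.SubElement(urlset, f"{{{SITEMAP_NS}}}url")
            ET.SubElement(entry, f"{{{SITEMAP_NS}}}loc").text = url
    return ET.tostring(urlset, encoding="unicode")
```

Passing the primary sitemap first means its URLs win whenever a later regional sitemap repeats one of them.
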
To trigger a run manually, go to the Actions tab > Update Sitemap & LLMs.txt > Run workflow.
Edit tools/sites.json to add entries. Each site needs:
- id: Unique identifier
- name: Display name
- description: One-line description
- sitemap_url: URL to the site’s sitemap.xml
- output: Output file path (relative to repo root)
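
As a rough illustration of the fields above, the sketch below parses one entry and reports any required field that is missing or empty. Everything in it is an assumption for demonstration: the entry values are placeholders, `missing_fields` is a hypothetical helper, and the top-level layout of tools/sites.json (for example, whether entries sit in an array) is not specified here.

```python
import json

# The five fields each site entry needs, as listed above.
REQUIRED_FIELDS = ("id", "name", "description", "sitemap_url", "output")


def missing_fields(entry: dict) -> list[str]:
    """Return the required fields that are absent or empty in one entry."""
    return [
        field
        for field in REQUIRED_FIELDS
        if not str(entry.get(field) or "").strip()
    ]


# Hypothetical entry: every value below is a placeholder, not a real site.
example_entry = json.loads("""
{
  "id": "example-site",
  "name": "Example Site",
  "description": "Placeholder entry used only for illustration",
  "sitemap_url": "https://example.com/sitemap.xml",
  "output": "example/sitemap.xml"
}
""")

print(missing_fields(example_entry))  # an empty list means the entry is complete
```

A check like this could run before committing a change to tools/sites.json so that an incomplete entry is caught before the scheduled workflow picks it up.
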