What this list shows
- Every major AI crawler's exact User-agent string, sourced from vendor documentation
- Whether each crawler respects robots.txt — and where exceptions exist
- What each crawler is for: AI training, AI search index, user-triggered fetch, classical search, or shared dataset