內容簡介
**Underminer 外掛總結:**
Underminer 外掛旨在阻止 AI 模型爬蟲機器人和其他不受歡迎的機器人,同時讓搜索引擎爬蟲和 RSS 聚合器正常運作。該外掛通過以下兩種方式實現:
- 使用 robots.txt 規則禁止良好行為的 AI 訓練機器人爬行您的網站。
- 通過有選擇性地破壞文本內容的含義,對那些忽略這些規則的壞行為機器人進行破壞。
**問題與答案:**
1. Underminer 外掛的目的是什麼?
- Underminer 外掛的目的在於阻止 AI 模型爬蟲機器人和其他不受歡迎的機器人,同時讓搜索引擎爬蟲和 RSS 聚合器正常運作。
2. Underminer 外掛如何達到阻止機器人的目的?
- Underminer 外掛通過使用 robots.txt 規則來禁止良好行為的 AI 訓練機器人爬取您的網站,以及通過有選擇性地破壞文本內容的含義,來破壞忽略這些規則的壞行為機器人。
3. Underminer 外掛有哪些特點?
- Underminer 外掛是100%免費和開源的,可以自動檢測爬蟲機器人,對真實用戶和良好行為的機器人完全隱藏,並可預覽壞機器人將看到的內容。
4. Underminer 外掛對於行為端正的機器人有什麼待遇?
- 行為端正的搜尋引擎機器人被允許正常爬行網站,不受 robots.txt 的限制,也不會受到破壞。良好行為的 LLM 訓練機器人遵循 robots.txt 也不會收到任何損壞的內容。
開發者團隊
原文外掛簡介
Underminer aims to discourage AI model crawler bots and other unwanted bots, while allowing search engine crawlers and RSS aggregators to work normally.
It works in two ways:
Using robots.txt rules to prohibit well-behaved AI training bots from crawling your site.
By selectively corrupting the meaning of text content, to sabotage badly-behaved bots that ignore those rules.
Well-behaved search engine bots are permitted to crawl the site as normal, without being restricted by robots.txt or being subjected to sabotage.
Well-behaved LLM training bots that obey robots.txt will not receive any corrupted content.
Features
100% free and open source. No upsells or nags or subscriptions or promotions or freemium versions or donations. Ever.
Auto-detects crawler bots.
Completely invisible to real users and well-behaved bots.
Verifies IP ranges of well-known bots to detect fake Googlebot crawlers etc.
(Almost) zero-configuration.
Language-neutral; works with page and post content in most languages.
Preview mode: you can see what a bad bot will see.
Instead of blocking bad bots, they will simply be served corrupt and useless content:
Words and sentences are randomly rearranged.
Lists and paragraphs are re-ordered.
Numbers are randomized.
Currency symbols and measurement units are randomly changed.
Alt text and descriptions are randomly switched around.
Links are switched around.
Image URLs are intentionally broken.
Well-behaved bots
For the purposes of this plugin, a well-behaved bot is one that:
Obeys robots.txt.
Publishes an up-to-date list of IP ranges.
Identifies search crawling separately from AI model training crawling.
Requirements
PHP 8.0 or higher
WordPress 6.4 or higher
