examples: add TikTokSpider
Requests using this user agent are coming from the same Amazon net- works as Bytespider.
This commit is contained in:
@@ -106,7 +106,7 @@ rules:
|
|||||||
- 'userAgent.matches("^Opera/[0-9.]+\\.\\(")'
|
- 'userAgent.matches("^Opera/[0-9.]+\\.\\(")'
|
||||||
# AI bullshit stuff, they do not respect robots.txt even while they read it
|
# AI bullshit stuff, they do not respect robots.txt even while they read it
|
||||||
# TikTok Bytedance AI training
|
# TikTok Bytedance AI training
|
||||||
- 'userAgent.contains("Bytedance") || userAgent.contains("Bytespider")'
|
- 'userAgent.contains("Bytedance") || userAgent.contains("Bytespider") || userAgent.contains("TikTokSpider")'
|
||||||
# Meta AI training; The Meta-ExternalAgent crawler crawls the web for use cases such as training AI models or improving products by indexing content directly.
|
# Meta AI training; The Meta-ExternalAgent crawler crawls the web for use cases such as training AI models or improving products by indexing content directly.
|
||||||
- 'userAgent.contains("meta-externalagent/") || userAgent.contains("meta-externalfetcher/") || userAgent.contains("FacebookBot")'
|
- 'userAgent.contains("meta-externalagent/") || userAgent.contains("meta-externalfetcher/") || userAgent.contains("FacebookBot")'
|
||||||
# Anthropic AI training and usage
|
# Anthropic AI training and usage
|
||||||
|
@@ -59,7 +59,7 @@ rules:
|
|||||||
- 'userAgent.matches("^Opera/[0-9.]+\\.\\(")'
|
- 'userAgent.matches("^Opera/[0-9.]+\\.\\(")'
|
||||||
# AI bullshit stuff, they do not respect robots.txt even while they read it
|
# AI bullshit stuff, they do not respect robots.txt even while they read it
|
||||||
# TikTok Bytedance AI training
|
# TikTok Bytedance AI training
|
||||||
- 'userAgent.contains("Bytedance") || userAgent.contains("Bytespider")'
|
- 'userAgent.contains("Bytedance") || userAgent.contains("Bytespider") || userAgent.contains("TikTokSpider")'
|
||||||
# Meta AI training; The Meta-ExternalAgent crawler crawls the web for use cases such as training AI models or improving products by indexing content directly.
|
# Meta AI training; The Meta-ExternalAgent crawler crawls the web for use cases such as training AI models or improving products by indexing content directly.
|
||||||
- 'userAgent.contains("meta-externalagent/") || userAgent.contains("meta-externalfetcher/") || userAgent.contains("FacebookBot")'
|
- 'userAgent.contains("meta-externalagent/") || userAgent.contains("meta-externalfetcher/") || userAgent.contains("FacebookBot")'
|
||||||
# Anthropic AI training and usage
|
# Anthropic AI training and usage
|
||||||
|
Reference in New Issue
Block a user