What's the state of the art in keeping scraping bots out of your webservers?
-
What's the state of the art in keeping scraping bots out of your webservers?
Rate limiting, a restrictive robots.txt with user-agent matching, and a proof-of-work page loader?
Is there a well-maintained, regularly updated list of scraper user agents?
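
For illustration only, here is a minimal Python sketch of the three measures named in the question: user-agent matching, rate limiting (a token bucket), and a hashcash-style proof-of-work check. Every name in it (`BLOCKED_UA_SUBSTRINGS`, `is_blocked_user_agent`, `TokenBucket`, `pow_is_valid`, `solve_pow`) is hypothetical, and the three user-agent substrings are examples only, not a maintained blocklist. In practice this logic would more likely live in the reverse proxy than in application code.

```python
import hashlib
import time

# Example substrings only (assumption): not an authoritative or current list.
BLOCKED_UA_SUBSTRINGS = ("GPTBot", "CCBot", "Bytespider")


def is_blocked_user_agent(user_agent, blocked=BLOCKED_UA_SUBSTRINGS):
    """Case-insensitive substring match against the User-Agent header."""
    ua = (user_agent or "").lower()
    return any(token.lower() in ua for token in blocked)


class TokenBucket:
    """Per-client rate limiter: `rate` tokens refill per second, up to `capacity`."""

    def __init__(self, rate, capacity, now=None):
        self.rate = rate
        self.capacity = capacity
        self.tokens = float(capacity)
        self.updated = time.monotonic() if now is None else now

    def allow(self, now=None):
        """Spend one token if available; return whether the request may proceed."""
        now = time.monotonic() if now is None else now
        elapsed = max(0.0, now - self.updated)
        self.tokens = min(self.capacity, self.tokens + elapsed * self.rate)
        self.updated = now
        if self.tokens >= 1.0:
            self.tokens -= 1.0
            return True
        return False


def pow_is_valid(challenge, nonce, difficulty_bits):
    """True if SHA-256(challenge:nonce) starts with `difficulty_bits` zero bits."""
    digest = hashlib.sha256(f"{challenge}:{nonce}".encode()).digest()
    return int.from_bytes(digest, "big") >> (256 - difficulty_bits) == 0


def solve_pow(challenge, difficulty_bits):
    """Brute-force search the client would run before the page is served."""
    nonce = 0
    while not pow_is_valid(challenge, nonce, difficulty_bits):
        nonce += 1
    return nonce
```

A server would keep one `TokenBucket` per client address, reject requests where `is_blocked_user_agent` returns true, and serve the real page only after `pow_is_valid` accepts the submitted nonce. User-agent matching only stops bots that identify themselves honestly, which is why it is usually combined with the other two.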
-
@yojimbo This is what I'm using: https://honeypot.net/2025/12/22/i-read-yann-espositos-blog.html