- Issue created by @bburg
- π©π°Denmark ressa Copenhagen
That would be an interesting feature, but since HTTrack is a scraper, if the feature was added, this project could almost consider expanding its scope and name to https://www.drupal.org/project/bot_blocker β ? Scrapers can cause a lot of extra traffic, which might be a strain, even for web sites without facets.
- πΊπΈUnited States bburg Washington D.C.
I do like that idea of using a more general namespace for the module. I do think it's important to keep a separation of concerns. Will keep this issue active until I, or someone else creates "bot_blocker"
- π©π°Denmark ressa Copenhagen
Sounds great, and thanks for all your work with facets and agents already here.
About blocking scrapers, one method could be a rule about number of hits over a certain period (maybe five minutes?) and being able to block an IP if a threshold of requested URL's is exceeded. The reason I thought about a more generalized "hits per time period"-rule is because I have a web site where five or six facets by human is to be expected. But an intense pounding by a bot is problematic mostly due to the rapid requests, not the number of facets.