- Issue created by @bobburns
- π·πΈSerbia vaish
You can go to Drupal's Status Report page (/admin/reports/status) and find all the details there. If something is not right you will also see a warning or an error.
Module doesn't use any database tables and doesn't post log messages. That's intentional.
The only place module stores any information is the backend you specified in the module settings. In your case that's Redis. All keys created by this module are prefixed with
crawler_rate_limit:
. - π·πΈSerbia vaish
Did you check module's README.md: https://git.drupalcode.org/project/crawler_rate_limit/-/tree/3.x?ref_typ...
Status Report page is mentioned there. See step 3 under "Install Crawler Rate Limit module".
If you have any specific feedback on how to improve README file, please let me know.
- πΊπΈUnited States bobburns
Yes I checked and saw it enabled, but I didn't see any way to know it was working like flood control shows in a table. Thanks for the quick response
- π·πΈSerbia vaish
I see. You want a proof that module does what it's supposed to do. Here is a simple way to verify that:
In settings.php configure very low number of requests per interval for "bot_traffic". For example 4 requests within 30 seconds. Then use curl on command line to issue requests to your website. First four requests should return response code 200 while the fifth and subsequent ones should return 429. After 30 seconds, counter resets and you will again be able to get 200 but only for 4 requests.
Here is a curl issuing HTTP requests as a bot:
curl --head -A "bot" "https://yourdomain.com/"
You can do similar test in your browser but for this you need to use the settings under "regular_traffic". Then just keep refreshing any page on your website and keep track of the responses you get.
On production you can grep your web server's access.log for entries with response code 429. Those will be the ones that were blocked by crawler rate limit.
Note that crawler rate limit, being a Drupal module, processes only requests that are handled by Drupal. It doesn't do anything to requests that are served directly by the web server, like for example requests for images in the public files folder or requests for CSS and JS assets.
- πΊπΈUnited States bobburns
OK good to know - I will try that
Looks like I will need to re-install tarpit and configure the path ban
- First commit to issue fork.
- Status changed to Needs review
9 months ago 3:18pm 20 April 2024 - πΊπΈUnited States generalredneck Texas, USA πΊπΈ
Vaish,
I hope you don't mind, but I took the liberty to add to the readme the way I tested this functionality. It uses curl like you suggested but in a loop, allowing a quick one liner. It also uses cache busting as not everyone is going to test this against raw installations, and it's good to see that it works the first time without realizing that something like varnish is eating the request. -
vaish β
committed 52b96c47 on 3.x authored by
generalredneck β
Issue #3356426 by generalredneck, vaish: Any way to tell if it is...
-
vaish β
committed 52b96c47 on 3.x authored by
generalredneck β
- Status changed to Fixed
9 months ago 11:49am 22 April 2024 - π·πΈSerbia vaish
Thanks @generalredneck. It's great to have this added to the README.
Automatically closed - issue fixed for 2 weeks with no activity.