Any way to tell if it is working??

Issue created by @bobburns
Comment about 2 years ago →
🇷🇸Serbia vaish
You can go to Drupal's Status Report page (/admin/reports/status) and find all the details there. If something is not right you will also see a warning or an error.

The module doesn't use any database tables and doesn't post log messages. That's intentional.

The only place the module stores any information is the backend you specified in the module settings. In your case that's Redis. All keys created by this module are prefixed with crawler_rate_limit:.
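
If you want to look at those keys directly, something like the following sketch should do. It assumes redis-cli is installed and can reach the same Redis instance Drupal uses. The helper name list_rate_limit_keys and the REDIS_CLI override are made up for illustration, and the leading wildcard is there only in case your Redis setup adds its own prefix in front of crawler_rate_limit:.

```shell
# List keys written by Crawler Rate Limit.
# Assumption: redis-cli can connect to the Redis instance Drupal uses;
# pass connection options such as -h, -p or -n as arguments if needed.
# REDIS_CLI is a hypothetical override for the path to the redis-cli binary.
list_rate_limit_keys() {
  # --scan iterates with SCAN, which avoids the blocking KEYS command.
  "${REDIS_CLI:-redis-cli}" "$@" --scan --pattern '*crawler_rate_limit:*'
}
```

Run it as list_rate_limit_keys (or list_rate_limit_keys -p 6380 for a non-default port) right after sending a few requests. Keys should show up while the rate limit interval is still open.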
Comment about 2 years ago →
🇷🇸Serbia vaish
Did you check the module's README.md? https://git.drupalcode.org/project/crawler_rate_limit/-/tree/3.x?ref_typ...

The Status Report page is mentioned there. See step 3 under "Install Crawler Rate Limit module".

If you have any specific feedback on how to improve the README file, please let me know.
Comment about 2 years ago →
🇺🇸United States bobburns
Yes, I checked and saw that it was enabled, but I didn't see any way to confirm it was working, the way flood control shows its entries in a table. Thanks for the quick response.
Comment about 2 years ago →
🇷🇸Serbia vaish
I see. You want proof that the module does what it's supposed to do. Here is a simple way to verify that:

In settings.php, configure a very low number of requests per interval for "bot_traffic", for example 4 requests within 30 seconds. Then use curl on the command line to issue requests to your website. The first four requests should return response code 200, while the fifth and subsequent ones should return 429. After 30 seconds the counter resets and you will again get 200, but only for 4 requests.

Here is a curl command that issues an HTTP request as a bot: curl --head -A "bot" "https://yourdomain.com/"
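
To avoid retyping the command, the same request can be wrapped in a small loop. This is only a sketch, not the snippet from the README: the function name probe_rate_limit, the default of 6 requests and the cb query parameter (a cache buster, so a reverse proxy doesn't answer from cache before Drupal sees the request) are all assumptions. It also assumes the URL you pass has no query string of its own.

```shell
# Send COUNT HEAD requests as a bot and print one status code per line.
# Assumptions: "cb" is an arbitrary cache-busting parameter name and the
# URL passed in has no existing query string.
probe_rate_limit() {
  url="$1"
  count="${2:-6}"
  i=1
  while [ "$i" -le "$count" ]; do
    # -s hides progress output, -o /dev/null discards the headers,
    # -w prints only the HTTP status code followed by a newline.
    curl -s -o /dev/null --head -A "bot" -w '%{http_code}\n' "${url}?cb=${i}-$$"
    i=$((i + 1))
  done
}
```

With "bot_traffic" set to 4 requests within 30 seconds, probe_rate_limit "https://yourdomain.com/" 6 should print 200 four times and then 429 twice.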

You can do a similar test in your browser, but for this you need to use the settings under "regular_traffic". Then just keep refreshing any page on your website and keep track of the responses you get.

On production you can grep your web server's access.log for entries with response code 429. Those will be the ones that were blocked by crawler rate limit.
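
A rough way to do that and also see which clients were blocked most often is sketched below. It assumes the Apache/Nginx "combined" log format, where the first field is the client IP and the ninth is the status code. The helper name blocked_clients is made up, and the log path is whatever your server uses.

```shell
# Print "count client_ip" for every client that received a 429,
# busiest clients first.
# Assumption: "combined" access log format (field 1 = client IP,
# field 9 = HTTP status code). Adjust the field numbers for other formats.
blocked_clients() {
  awk '$9 == 429 { hits[$1]++ } END { for (ip in hits) print hits[ip], ip }' "$1" |
    sort -rn
}
```

For example: blocked_clients /var/log/nginx/access.log (the path is just an example).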

Note that crawler rate limit, being a Drupal module, processes only requests that are handled by Drupal. It doesn't do anything to requests that are served directly by the web server, like for example requests for images in the public files folder or requests for CSS and JS assets.
Comment about 2 years ago →
🇺🇸United States bobburns
OK good to know - I will try that

Looks like I will need to re-install tarpit and configure the path ban
First commit to issue fork.
Merge request !1 "Updating docs with how to test instructions" (Merged) created by generalredneck
Status changed to Needs review over 1 year ago (3:18pm 20 April 2024)
Comment over 1 year ago →
🇺🇸United States generalredneck Texas, USA 🇺🇸
Vaish,
I hope you don't mind, but I took the liberty of adding to the README the way I tested this functionality. It uses curl like you suggested, but in a loop, which makes for a quick one-liner. It also uses cache busting, since not everyone is going to test this against a raw installation, and it's good to see it work the first time rather than be thrown off because something like Varnish is answering the request from its cache.
Pipeline #153306 finished with Skipped over 1 year ago
Comment over 1 year ago →
System Message

vaish committed 52b96c47 on 3.x, authored by generalredneck:
Issue #3356426 by generalredneck, vaish: Any way to tell if it is...
Status changed to Fixed over 1 year ago (11:49am 22 April 2024)
Comment over 1 year ago →
🇷🇸Serbia vaish
Thanks @generalredneck. It's great to have this added to the README.
Comment about 1 year ago →
System Message
Automatically closed - issue fixed for 2 weeks with no activity.
