limit scanning reqs/second #18
Comments
Hi Jim, what kind of workers are you using (process / thread / green threads) and how many? The only time I observed pylinkvalidator scan many links quickly was when the links were quickly returning a bad response (e.g., 404). A wait flag would definitely make sense. I'll check tonight where it would work best and post it here.
--workers=2 We did have someone publish a bad link, which resulted in an unusually large number of 404s. I appreciate the insight!! I'll poke around the code this afternoon as well.
Hi Jim, here are my notes about the wait flag
Thanks so much for the detailed response! I came up with similar steps. Will see if I can find some time this weekend to hack on some code :)
We run Linkchecker daily and 99% of the time it behaves, but on occasion it seems to run amok and scan a lot of links in a short amount of time. Not sure why this occurs - none of my settings change (run via Jenkins).
I was thinking of adding something like wget's '--wait' flag to limit the rate of requests. Any thoughts on where the best place to do this would be?
I will take a stab at it and submit a pull-request when complete.
Thanks!
jim
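For reference, here is a minimal sketch of what a wget-style wait could look like when the workers are threads. It is not taken from pylinkvalidator's code: the class name `RequestThrottle`, its `acquire` method, and the `wait_seconds` parameter are all hypothetical, and it assumes every worker thread shares one throttle object and calls it just before making a request.

```python
import threading
import time


class RequestThrottle:
    """Hypothetical helper: space requests at least wait_seconds apart.

    One instance is meant to be shared by all worker threads.
    """

    def __init__(self, wait_seconds, clock=time.monotonic, sleep=time.sleep):
        self.wait_seconds = wait_seconds
        self._clock = clock      # injectable so the logic can be tested
        self._sleep = sleep
        self._lock = threading.Lock()
        self._next_allowed = 0.0  # earliest time the next request may start

    def acquire(self):
        """Reserve the next free slot, sleep until it, and return the delay."""
        with self._lock:
            now = self._clock()
            slot = max(now, self._next_allowed)
            self._next_allowed = slot + self.wait_seconds
        # Sleep outside the lock so other workers can reserve later slots.
        delay = slot - now
        if delay > 0:
            self._sleep(delay)
        return delay
```

A worker would call `throttle.acquire()` right before each request. With `wait_seconds=0.5`, that caps the whole pool at roughly two requests per second no matter how fast the 404s come back. Process workers do not share memory, so this sketch only covers the thread case; each process would otherwise need its own throttle or some shared coordination.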