-
-
Notifications
You must be signed in to change notification settings - Fork 8
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Bug] Warnings for good links #4
Comments
I've checked myself links from "actual behavior" section using most recent master and both raw and VPN connection (russian server) and here's the thing. Some links to web.archive.org open up in web browser, but under the hood 403 status code is returned and the website says "cannot archive due to robots.txt on http://xxx.xxx.xxx". For these plugin seems to almost work properly. Almost, because I forgot to exclude links to web.archive.org from being checked (there's no reason to make web.archive.org link to web.archive.org) so that's a different bug. In Firefox (checked using developer tools and network option) links to archive.is work properly (code 200 is returned) when using raw connection (Poland), but when switching to VPN connection (Russia) - timeouts occur. For some websites connection cannot be made due to SSL errors:
|
1. SummaryIn last Deadlinks version dead links doesn't replace to archive links, despite the fact that 2. SettingsSame Pelican configuration as first post. Full output — https://gist.github.com/86cb35b6d9c445a81eadd1db2cf5b319, 3. Steps to reproduceSame as first post. 4. Expected behaviorReplace dead links to archive links. 5. Actual behavior
DEBUG: Starting new HTTPS connection (1): esquire.ru
WARNING: Skipping: https://esquire.ru/coined-word (not available)
DEBUG: Starting new HTTPS connection (1): colocat.ru
WARNING: Skipping: https://colocat.ru/texts/realip.html (not available) 6. EnvironmentSame as first post. Thanks. |
1. QuestionCan you set Deadlinks, that your plugin replace links if return 403/404 status code, not other? 2. ArgumentationIn this issue I showed, that Deadlinks can replace links, that good open for me. I think, it unexpected behavior. Thanks. |
@silentlamb , actually. Thanks. |
1. Summary
2. Settings
My project — https://github.com/Kristinita/KristinitaPelican,
Part of my
pelicanconf.py
file:3. Steps to reproduce
I run command in terminal:
See full output on Gist — https://gist.github.com/Kristinita/63c81829c196afd7dc68cbe5e3dba12a.
4. Expected behavior
Discover and replace real 403/404 links, not links from 1.1 and 1.2 items of my issue.
5. Actual behavior
List of links, mark as dead.
I can successful visit this links without proxy and other anonymisation tools:
Some links working, but blocked by government of my country (Russia), example:
6. Environment
Operating system and version:
Windows 10 Enterprise LTSB 64-bit EN
Python:
3.6.1
Pelican:
3.7.1
BeautifulSoup4:
4.5.3
Thanks.
The text was updated successfully, but these errors were encountered: