testing updated version with changes to user agent and headers by vsoch · Pull Request #92 · urlstechie/urlchecker-action · GitHub
testing updated version with changes to user agent and headers #92

Closed
wants to merge 1 commit into from

Conversation

vsoch
Collaborator
@vsoch vsoch commented Jun 16, 2022

@shahzebsiddiqui this is a branch for you to test! You should be able to change your action to use this branch, and it will test urlstechie/urlchecker-python#72. Ping me with a link when you can try it out!

Signed-off-by: vsoch <vsoch@users.noreply.github.com>
@shahzebsiddiqui
Contributor

So I ran this pipeline, https://github.com/buildtesters/buildtest/runs/6941983724?check_suite_focus=true, and I am getting the same error. If you want to add this link to your checks, it is https://www.hpcwire.com/2019/01/17/pfizer-hpc-engineer-aims-to-automate-software-stack-testing/

[Screenshot: Screen Shot 2022-06-17 at 3 56 12 PM]

I can also confirm that this page returns a 403 for me, so I am inclined to keep it in the exclude list. Let me know if you want me to do any further testing.

 ~/ python3
Python 3.6.15 (default, Sep 23 2021, 15:41:43) [GCC] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> import requests
>>> requests.get("https://www.hpcwire.com/2019/01/17/pfizer-hpc-engineer-aims-to-automate-software-stack-testing/")
<Response [403]>

@vsoch
Collaborator Author
vsoch commented Jun 17, 2022

Yeah, but the weird thing is that if you add a user agent header, it works for me (at least locally). I'm starting to think that they have explicitly blocked the IP addresses from GitHub Actions.

I think you might just have to make an exception - I don't have any other ideas for how this could work (or why it's not working) but we aren't doing a good enough job emulating a browser.
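For anyone who wants to reproduce the user-agent observation above, here is a minimal sketch. It uses the standard library's `urllib.request` rather than `requests` so it runs without extra packages; with `requests` the equivalent call would be `requests.get(url, headers=BROWSER_HEADERS)`. The header values and the helper names `build_request` and `status_of` are assumptions made for illustration, not what urlchecker actually sends or exposes.

```python
import urllib.error
import urllib.request

# Assumption: an example browser-like header set, chosen for illustration only.
BROWSER_HEADERS = {
    "User-Agent": "Mozilla/5.0 (X11; Linux x86_64; rv:101.0) Gecko/20100101 Firefox/101.0",
    "Accept": "text/html,application/xhtml+xml;q=0.9,*/*;q=0.8",
}


def build_request(url, headers=None):
    """Return a GET request for url that carries browser-like headers."""
    return urllib.request.Request(url, headers=dict(headers or BROWSER_HEADERS))


def status_of(url, headers=None, timeout=10):
    """Return the HTTP status code for url, including error codes such as 403."""
    try:
        with urllib.request.urlopen(build_request(url, headers), timeout=timeout) as resp:
            return resp.status
    except urllib.error.HTTPError as err:
        # urlopen raises on 4xx/5xx; the code is still the answer we want.
        return err.code
```

Calling `status_of(...)` on the hpcwire link locally and again from a CI runner would show whether the difference comes from the headers or from where the request originates. The result depends on the remote server, so no expected output is given here.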

@shahzebsiddiqui
Contributor

Yeah, I'll exclude this link; that's what I have right now. This is not the only one, as several others didn't work either. Overall this URL checker works 95% of the time, so it's still pretty good 👍
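As a rough illustration of what keeping links in an exclude list amounts to, the sketch below partitions a list of URLs into those to check and those to skip. It is not urlchecker's implementation; the function name `split_excluded` and its parameters are hypothetical, and the pattern match is a plain substring test chosen for simplicity.

```python
def split_excluded(urls, exclude_urls=(), exclude_patterns=()):
    """Partition urls into (to_check, skipped).

    A URL is skipped if it matches an excluded URL exactly or
    contains any of the excluded substrings.
    """
    exact = set(exclude_urls)
    to_check, skipped = [], []
    for url in urls:
        if url in exact or any(pattern in url for pattern in exclude_patterns):
            skipped.append(url)
        else:
            to_check.append(url)
    return to_check, skipped
```

With `exclude_patterns=["hpcwire.com"]`, for example, every link on that domain would land in `skipped` while the remaining links are still checked.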

@vsoch
Collaborator Author
vsoch commented Jul 23, 2022

This is superseded by #95, which should help with some of these user-agent strings, and for the remainder I'm going to look into Selenium soon!

@vsoch vsoch closed this Jul 23, 2022