Page 1 of 2

Couldn't pull page: 403

Posted: Sun Jan 08, 2023 10:07 pm
by puniupun
Hi, I have this problem and no idea what it's related to (few weeks ago everything worked good)

I try to scrape user details from file. It works for few minutes, and then I am getting error:
FAILED: Couldn't pull page 403
GET RESP: 403

The same happens when I give software some other task.

Also, I notice that when I am using the software (and some time after I close it) I have no access to SoundCloud for some time:
https://i.imgur.com/luqfK5S.png

Could you advise why is this happening? :shock:
It looks like it doesn't happen when I am hotspotting my phone data; but only when I am on wi-fi. It didn't happen before though.

Thank you!

Re: Couldn't pull page: 403

Posted: Mon Jan 09, 2023 10:57 pm
by Just A User
SoundCloud changed a single IP can do on the website. This effects everything you can do.
From scraping to simply update your stats from your accounts.

Cheers

Re: Couldn't pull page: 403

Posted: Tue Jan 10, 2023 2:41 pm
by martin@rootjazz
As above, IP limitations now kick in so you will hit a 403 if you do too much on a single IP.

I guess the SCRAPER tab needs to be updated to allow scraping via accounts now, which I guess will help this, as SCRAPER tab performs searches not logged in.

That and to provide multiple proxies for the scraping, that would also help I suppose.

Re: Couldn't pull page: 403

Posted: Thu Jan 19, 2023 7:04 pm
by bmsietta
any update on this? scraping is the main reason why I bought the software :D I love scraping

Re: Couldn't pull page: 403

Posted: Mon Jan 23, 2023 9:40 pm
by martin@rootjazz
bmsietta wrote: Thu Jan 19, 2023 7:04 pm any update on this? scraping is the main reason why I bought the software :D I love scraping
Yes it's on the TODO list. hopefully this week

Re: Couldn't pull page: 403

Posted: Wed Jan 25, 2023 1:50 pm
by martin@rootjazz

Re: Couldn't pull page: 403

Posted: Mon Feb 20, 2023 2:37 pm
by bmsietta
hi Martin, hope everything is going well
is this error related?

* FAILED: Couldn't pull page: -1 https://soundcloud.com/XXXXXXX
Failed pulled details for: https://soundcloud.com/XXXXX
* FAILED: 87345 : Couldn't scrape profile ID
GET RESP: -1

I get that on most of the users I want to scrape

Re: Couldn't pull page: 403

Posted: Mon Feb 20, 2023 2:46 pm
by bmsietta
is there any way this scraper can use proxies? I loaded my proxies but it ends up scraping with my home IP, thus getting blocked after 300 requests. i find it easier than using accounts

https://i.imgur.com/TQqdjHk.png

thank you!!

Re: Couldn't pull page: 403

Posted: Tue Feb 21, 2023 1:43 pm
by martin@rootjazz
bmsietta wrote: Mon Feb 20, 2023 2:37 pm hi Martin, hope everything is going well
is this error related?

* FAILED: Couldn't pull page: -1 https://soundcloud.com/XXXXXXX
Failed pulled details for: https://soundcloud.com/XXXXX
* FAILED: 87345 : Couldn't scrape profile ID
GET RESP: -1

I get that on most of the users I want to scrape

-1 / empty / <null> means no response received.

This is either your network was down, the proxy wasn't working, something on your machine blocked the request / the site server timed out and didn't respond in time (unlikely but does happen)

If using a proxy, test it:
Please confirm your proxy is working, more info on how to do that here:

viewtopic.php?f=15&t=3453


If the proxy appears to be working fine, please run the action again as it could have been a network "blip".


If the issue appears to be persistent, please check your anti virus / security software as they *may* block the app from the accessing the network: the two most common are:

* Windows Defender
* Smart screen

Make sure the applications program files folder:

c:\program files (x86)\instadub

is whitelisted and / or all the .exe and .dlls in that folder.





Regards,
Martin

Re: Couldn't pull page: 403

Posted: Tue Feb 21, 2023 1:44 pm
by martin@rootjazz
bmsietta wrote: Mon Feb 20, 2023 2:46 pm is there any way this scraper can use proxies?
scrape requires accounts, proxies are just for email scrape from profile page. But for scraping, you should be specifying hte accounts to use