Scrape Users Fail/Request

Ask any support / help / issues / problem or question related to TumblingJazz
Scurzo
Posts: 9
Joined: Fri Jan 16, 2015 5:49 am

Scrape Users Fail/Request

Post by Scurzo »

When I try to use the option
"Scrape users who liked/reblogged a Post"

It sometimes works for posts with a small number of notes, e.g. posts with 2000 notes or fewer (at least for me).
The problem arises when I try to scrape users from posts that have 10k-50k or more notes: the bot only scrapes a small number of users (around 500-1500).

I don't know if the bot is coded to stop on large requests or if it's a bug.
I would really appreciate a fix for this.

Here's a Tumblr account whose posts have a significant number of notes (for testing purposes):
http://jessepnkman.tumblr.com/archive
martin@rootjazz
Site Admin
Posts: 34712
Joined: Fri Jan 25, 2013 10:06 pm
Location: The Funk
Contact:

Re: Scrape Users Fail/Request

Post by martin@rootjazz »

I tested this a couple of weeks ago and pulled 20k. There might be a hard limit setting so you don't pull too many pages, let me check...
mlmron313
Posts: 33
Joined: Fri Dec 19, 2014 3:41 am

Re: Scrape Users Fail/Request

Post by mlmron313 »

martin@rootjazz wrote: I tested this a couple of weeks ago and pulled 20k. There might be a hard limit setting so you don't pull too many pages, let me check...
I'm having the same problem. I'm only able to scrape about 5000 or so users no matter how many notes the post has. Any idea how I can get a full scrape from a post with a large number of notes?

Thanks!
martin@rootjazz
Site Admin
Posts: 34712
Joined: Fri Jan 25, 2013 10:06 pm
Location: The Funk
Contact:

Re: Scrape Users Fail/Request

Post by martin@rootjazz »

Maybe something has changed on Tumblr with regard to getting the higher numbers.

Can you send me the URL of the post you are trying to use? I don't have any posts with high note counts noted down. I will try to find some, but if you read this, it might save some time if you can send one.
mlmron313
Posts: 33
Joined: Fri Dec 19, 2014 3:41 am

Re: Scrape Users Fail/Request

Post by mlmron313 »

martin@rootjazz
Site Admin
Posts: 34712
Joined: Fri Jan 25, 2013 10:06 pm
Location: The Funk
Contact:

Re: Scrape Users Fail/Request

Post by martin@rootjazz »

Running a test now.
Post with 44k notes
Max on SETTINGS tab set to 9999

and I got 10k results as requested

I am updating the program to add logging, so we should be able to find out what is going on on your side.
martin@rootjazz
Site Admin
Posts: 34712
Joined: Fri Jan 25, 2013 10:06 pm
Location: The Funk
Contact:

Re: Scrape Users Fail/Request

Post by martin@rootjazz »

mlmron313
Posts: 33
Joined: Fri Dec 19, 2014 3:41 am

Re: Scrape Users Fail/Request

Post by mlmron313 »

Hmm, I previously set my max to 999,999 and I would only get a max of 5k users per scrape regardless of the number of notes on the post. I'll try what you said. But can you try to scrape 50k+ and see if it works?

Thanks!
mlmron313
Posts: 33
Joined: Fri Dec 19, 2014 3:41 am

Re: Scrape Users Fail/Request

Post by mlmron313 »

martin@rootjazz wrote: Running a test now.
Post with 44k notes
Max on SETTINGS tab set to 9999

and I got 10k results as requested

I am updating the program to add logging, so we should be able to find out what is going on on your side.
I just tried it on a post with 30k and another with 100k with the settings you posted above and am still only getting ~2k per scrape. Any ideas?
martin@rootjazz
Site Admin
Posts: 34712
Joined: Fri Jan 25, 2013 10:06 pm
Location: The Funk
Contact:

Re: Scrape Users Fail/Request

Post by martin@rootjazz »

mlmron313 wrote:
martin@rootjazz wrote: Running a test now.
Post with 44k notes
Max on SETTINGS tab set to 9999

and I got 10k results as requested

I am updating the program to add logging, so we should be able to find out what is going on on your side.
I just tried it on a post with 30k and another with 100k with the settings you posted above and am still only getting ~2k per scrape. Any ideas?
Submit your logs:
HELP > LOGS > SUBMIT


It could be that a page fails for you, so no new items are found and the process stops. On larger scrapes, Tumblr may be detecting too many calls from your machine and returning nothing (a normal user probably won't pull 5000 pages in quick succession).


But the logs should tell.
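
To illustrate the failure mode described above (a page fails, no new items are found, the process stops), here is a minimal Python sketch of a paged scrape loop. It is not TumblingJazz's actual code. `fetch_page` is a hypothetical callable, and the sketch assumes it can tell a failed or blocked request (returns `None`) apart from a genuinely empty last page (returns an empty list), which may not be possible in practice if Tumblr simply returns nothing. The `max_users` parameter stands in for the Max setting mentioned earlier in the thread.

```python
import time
from typing import Callable, List, Optional, Set


def scrape_note_users(
    fetch_page: Callable[[int], Optional[List[str]]],
    max_users: int = 9999,
    max_retries: int = 3,
    base_delay: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> List[str]:
    """Collect unique usernames page by page, up to max_users.

    fetch_page(n) is hypothetical: it returns the usernames found on page n,
    an empty list when there are no more notes, or None when the request
    failed or was blocked.
    """
    seen: Set[str] = set()
    users: List[str] = []
    page = 0
    while len(users) < max_users:
        result = None
        for attempt in range(max_retries + 1):
            result = fetch_page(page)
            if result is not None:
                break
            # Failed or throttled page: wait longer each time before retrying.
            if attempt < max_retries:
                sleep(base_delay * 2 ** attempt)
        if result is None:
            # Still failing after all retries: give up with a partial result.
            break
        if not result:
            # Empty page: the notes are exhausted.
            break
        for name in result:
            if name not in seen and len(users) < max_users:
                seen.add(name)
                users.append(name)
        page += 1
    return users
```

A loop without the retry step would treat the first failed page the same as the end of the notes, which would match the symptom of getting only a few thousand users from a post with tens of thousands of notes.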