Scrape Image function is not filtering out

TumblingJazz Tumblr bot discussion
Kocak
Posts: 8
Joined: Fri Nov 01, 2019 6:35 pm

Post by Kocak »

Hi,

I have been playing with TumblingJazz recently.
The image scraper works and saves images to the given folder. However, when you try to filter them by dimensions, it does not work: it still scrapes all the images, including thumbnails, avatars, etc. (I set min width/height both to 300, but there are many smaller images in the folder after the scrape.)

This is the same with the single blog scrape, the recent search scraper, the pop search scraper, etc.

I was planning to scrape to a folder and then post from there automatically, so this is killing my time/plan.
Any chance you could take a look at this?

Many Thanks
martin@rootjazz
Site Admin
Posts: 34712
Joined: Fri Jan 25, 2013 10:06 pm
Location: The Funk

Post by martin@rootjazz »

Can you screenshot how you set the action up and submit logs showing the wrong images being downloaded, please?

HELP > LOGS > SUBMIT

Then send your logs ID (displayed after the logs upload successfully); the first 4 digits are sufficient.

Post by Kocak »

56437 is the logs ID, Martin.

I searched for popular posts for "cigar".
A maximum of 50 images were scraped, but over 50% are just 1 KB files of avatars, thumbs, etc.

A screenshot of the settings is here:

Image

Post by martin@rootjazz »

Everything seems correct in your setup. Downloading logs now.

Post by martin@rootjazz »

23:02:54: * IGNORE: Width less than minimum: 200
23:02:54: * IGNORE: Width less than minimum: 200
Logs indicate it IS ignoring images. What dimensions are the images that are too small that are downloading? Are you just not realising how small a 200x200 image is on a modern DPI monitor?

Post by Kocak »

martin@rootjazz wrote: Wed Apr 01, 2020 9:10 pm
23:02:54: * IGNORE: Width less than minimum: 200
23:02:54: * IGNORE: Width less than minimum: 200
Logs indicate it IS ignoring images. What dimensions are the images that are too small that are downloading? Are you just not realising how small a 200x200 image is on a modern DPI monitor?
Yes, I see the same in the logs, and when I check the processor status during the action it says it is ignoring them, but somehow it still saves them to my drive. See the attached image of the "cigar" folder with 1 KB files. They are 16x16 pixels. I am aware of the DPI issues ;)

Image
Image

Post by Kocak »

I just updated to the latest version (just released).
I tried with a different search term (cats).
Unfortunately, same results: tons of 1 KB files with very small dimensions.

I tried both with an account selected (for NSFW content) and without using an account. Same results, unfortunately.
Maybe the way Tumblr brings the search results to the screen has changed? But anyway, the bot should ignore the small ones.

Post by martin@rootjazz »

Thanks for checking. Let me investigate further.

Post by martin@rootjazz »

OK, I see your issue: due to how the image is downloaded and tested, the program was failing to remove the image after it failed the dimension test.

The next update will fix this. I shall let you know when it is ready.



Regards,
Martin
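
For readers wondering what a fix along these lines might look like, here is a minimal Python sketch. It is not the actual TumblingJazz code. It assumes the image bytes have already been downloaded, reads dimensions only from PNG and GIF headers to stay dependency-free (a real scraper would more likely use an imaging library so JPEG is covered too), and the names image_size and save_if_large_enough are hypothetical. The idea is to write to a temporary file, test it, and move it to its final name only if it passes, so a rejected image never stays in the scrape folder.

```python
import os
import struct
import tempfile
from pathlib import Path
from typing import Optional, Tuple


def image_size(data: bytes) -> Optional[Tuple[int, int]]:
    """Return (width, height) from a PNG or GIF header, or None if unrecognised."""
    # PNG: 8-byte signature, then the IHDR chunk with big-endian width and height.
    if data[:8] == b"\x89PNG\r\n\x1a\n" and data[12:16] == b"IHDR" and len(data) >= 24:
        return struct.unpack(">II", data[16:24])
    # GIF: 6-byte signature, then little-endian 16-bit width and height.
    if data[:6] in (b"GIF87a", b"GIF89a") and len(data) >= 10:
        return struct.unpack("<HH", data[6:10])
    return None


def save_if_large_enough(data: bytes, dest: Path, min_width: int, min_height: int) -> bool:
    """Hypothetical helper: keep the image at `dest` only if it meets the minimums."""
    fd, tmp_name = tempfile.mkstemp(dir=str(dest.parent), suffix=".part")
    saved = False
    try:
        # Step 1: "download" to a temporary file next to the destination.
        with os.fdopen(fd, "wb") as tmp:
            tmp.write(data)
        # Step 2: test the file that was just written.
        with open(tmp_name, "rb") as fh:
            size = image_size(fh.read(32))
        if size is not None and size[0] >= min_width and size[1] >= min_height:
            # Step 3a: passed, so move it to its final name.
            os.replace(tmp_name, dest)
            saved = True
    finally:
        # Step 3b: failed (or an error occurred), so make sure nothing is left behind.
        if not saved and os.path.exists(tmp_name):
            os.remove(tmp_name)
    return saved
```

With min width/height both set to 300, a 16x16 avatar passed to save_if_large_enough would be written, tested, and then removed again, while a 400x300 image would end up at dest. Doing the cleanup in a finally block is one way to make sure the "IGNORE" path and the delete always happen together.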