Finding?? how to get rss images

Ask any support / help / issues / problem or question related to any of our other products
Post Reply
jock312
Posts: 2
Joined: Sun Feb 02, 2020 4:56 pm

Finding?? how to get rss images

Post by jock312 »

I have tried 20 variations to the system to obtain and repost images for tumblr blogs - I have read all 29 pages and still do not understand

I have used the standard RSS Elements - still not getting anywhere

I realise that Tumblr change the links in the posts to read - <img src="https://66.media.tumblr.com/da then a series of numbers /another set of numbers/ another set of numbers and letters.png"/>

All I want to do is work out what I need to add to the assign RSS Elements - 48 hrs gone to waste and still no closer to an answer and yes we are older than most at 71 - we need a straight answer

Kindly help us
Jock
User avatar
martin@rootjazz
Site Admin
Posts: 34358
Joined: Fri Jan 25, 2013 10:06 pm
Location: The Funk
Contact:

Re: Finding?? how to get rss images

Post by martin@rootjazz »

jock312 wrote: Sun Feb 02, 2020 5:09 pm I have tried 20 variations to the system to obtain and repost images for tumblr blogs - I have read all 29 pages and still do not understand

I have used the standard RSS Elements - still not getting anywhere

I realise that Tumblr change the links in the posts to read - <img src="https://66.media.tumblr.com/da then a series of numbers /another set of numbers/ another set of numbers and letters.png"/>

All I want to do is work out what I need to add to the assign RSS Elements - 48 hrs gone to waste and still no closer to an answer and yes we are older than most at 71 - we need a straight answer

Kindly help us
Jock
Hi, in order to help you, I will need to know what you are trying to do exactly.
The feed you are using. What image in the feed / linked from the feed you want to access per item
What you have tried
What happened (logs)

HELP > LOGS > SUBMIT

then send your logs ID - the first 4 numbers is sufficient (displayed after successful uploading of logs)


For future reference, please, read through this post and submit your issue accordingly. Not everything will be relevant to your issue, but have a read as it gives an idea of the information you can provide to help me help you as quickly as possible without delays in having to request additional information :)

https://rootjazz.com/forum/viewtopic.php?f=23&t=1634
I have used the standard RSS Elements - still not getting anywhere
The standard elements are probably not going to be very useful due to the nature of RSS / HTML / XPATH especially on tumblr where custom themes are the norm.
I realise that Tumblr change the links in the posts to read - <img src="https://66.media.tumblr.com/da then a series of numbers /another set of numbers/ another set of numbers and letters.png"/>
You have posted an HTML image, this is unlikely to be IN the feed, so you will be needing to use tokens to get the link from the feed, then use the token to pull the page, then you can use xpath to specify the image you want.

There will NEED to be uniformity for each image item, but as above. Let me know what feed you are using and where / what the image is you want, if it is in the feed, itself, linked from the feed etc etc




Regards,
Martin




Regards,
Martin
jock312
Posts: 2
Joined: Sun Feb 02, 2020 4:56 pm

Re: Finding?? how to get rss images

Post by jock312 »

We were -

log.20200204.postimagerss_5629d438-c33a-48d7-a391-9af1cf29d879.log

Using the rss feed https://wisconsinfrightsblog.tumblr.com/rss/

Aim was to scrape images form the tumblr and post to another tumblr as a test to see if it can be done

Assign RSS Elements included
<item> in item
<title> in title
Image URL : <link>[scrapexpath=//img[contains(@class,'pinImage')]/@src]
nil in tags
description had <title>***
url sub had <title>
User avatar
martin@rootjazz
Site Admin
Posts: 34358
Joined: Fri Jan 25, 2013 10:06 pm
Location: The Funk
Contact:

Re: Finding?? how to get rss images

Post by martin@rootjazz »

jock312 wrote: Mon Feb 03, 2020 5:18 pm We were -

log.20200204.postimagerss_5629d438-c33a-48d7-a391-9af1cf29d879.log
This is a file name from your machine. To submit logs:

Select the menu:

HELP > LOGS > SUBMIT

then send your logs ID - the first 4 numbers is sufficient (displayed after successful uploading of logs)


If you load that into your browser you will see

Code: Select all

<item>
<title>
immortal12loona: Find the Top Products from Clickbank that can...
</title>
<description>
<img src="https://66.media.tumblr.com/da7d988e54b5c2fbebdab71fe0b61442/0670a78937b84946-bf/s500x750/f5e881c6beaef3ed0b4e5fa86c48494767f599de.png"/><br/><br/><p><a href="https://immortal12loona.tumblr.com/post/190606241377/find-the-top-products-from-clickbank-that-can-help" class="tumblr_blog">immortal12loona</a>:</p><blockquote> <p>Find the Top Products from Clickbank that can help your business, lose weight, get fit not fatter </p> <a href="https://top-products-reviewed.com/"> Get Yours Now </a> to read more information now</blockquote>
</description>
<link>
https://wisconsinfrightsblog.tumblr.com/post/190606363158
</link>
<guid>
https://wisconsinfrightsblog.tumblr.com/post/190606363158
</guid>
<pubDate>Sun, 02 Feb 2020 10:28:34 -0500</pubDate>
</item>

The <description> element does contain an image

however not all items do

this one doesn't

Code: Select all

<item>
<title>What we find in a Rugby Ball</title>
<description>
<p>For More Information go to <a href="https://therugbystore.com.au/">https://therugbystore.com.au/</a><br/></p>
</description>
<link>
https://wisconsinfrightsblog.tumblr.com/post/190587874718
</link>
<guid>
https://wisconsinfrightsblog.tumblr.com/post/190587874718
</guid>
<pubDate>Sat, 01 Feb 2020 10:41:11 -0500</pubDate>

</item>
If you just wanted to pull <img tags from the <description> element, that can be done, but the image may not be the one you want, for example the item above with an <img element is

Image

Is that what you want? If not, what do you want? IT isn't enough to just say "an image from the linked page". As that cannot be told to the program, you need to be able to specify the path / pattern to the file.

So if you can review the RSS feed, then specify which image you want, from within the feed or from a linked page, then we can work out the path /pattern to the images
Aim was to scrape images form the tumblr and post to another tumblr as a test to see if it can be done
You will only be able to get one image from each <item> in the feed.

IF you were wanting ALL images from a post (if multi are posted) then RSS is not the module to use. Use the SCRAPE ALL IMAGES FROM BLOG function, then use those downloaded images to build an UPLOAD IMAGE module, that will slowly go over the downloaded images

Assign RSS Elements included
<item> in item
<title> in title
Image URL : <link>[scrapexpath=//img[contains(@class,'pinImage')]/@src]
nil in tags
description had <title>***
url sub had <title>
This will scrape images from pinterest, if you look at the class "pinimage" that is not a default class for tumblr images. Infact if I remember correctly tumblr doesn't use default classes, the xpath would something like

Code: Select all

img[contains(@href,'media.tumblr')]
which will pull images that contain "media.tumblr" in the href tag. But it would be the FIRST image in the source, which may / may not be what you want, would be theme depending.
Post Reply