Email scraping question

Discussions to do with Soundcloud Manager. Do not use for support, use the dedicated support forum for help requests
Post Reply
eer
Posts: 20
Joined: Sat Apr 05, 2014 5:43 pm

Email scraping question

Post by eer »

Hi Martin - hope all is well.
I would like to know if there is a way to set up email scraping in the following way. Here is what I have in mind:

1. I will identify let's say 50 SoundCloud profiles of that mostly repost tracks (e.g., edm.com)

2. I would like SoundCloud Manager to look at those profiles and identify all the artists who were reposted on those 50 SoundCloud profiles. That means I am not interested in any of the information about those 50 initial SoundCloud profiles, but want to be able to connect with the artists who were reposted by the 50 initial profiles.

3. The next step would be for the email scraper to look up any posted email addresses by those artists identified in step #2.

Because the list of artist profiles to be visited/scraped for emails could be quite long, I anticipate that this process would take some time. That's OK. In fact, I am more concerned about getting my IP blocked than speed. For technical reasons I cannot use proxies. Can I pace the speed of this process in such a way that I won't risk an IP block? Let's say, maybe SoundCloud Manager visits only one of the profiles of reposted artists per minute?

Thanks a lot in advance for your feedback!
User avatar
martin@rootjazz
Site Admin
Posts: 34640
Joined: Fri Jan 25, 2013 10:06 pm
Location: The Funk
Contact:

Re: Email scraping question

Post by martin@rootjazz »

run your scrape, save to file
scrape profiles from output file
eer
Posts: 20
Joined: Sat Apr 05, 2014 5:43 pm

Re: Email scraping question

Post by eer »

Thanks a lot, Martin.

The part I wasn't able to figure out yet is how to have SoundCloud Manager identify the artists/users that were reposted by my target profile. Let's say I want to identify all the artists associated with tracks that reposted by soundcloud.com/edm. I would want to identify soundcloud.com/edm as starting point for SoundCloud Manager and then have SoundCloud Manager look at all the tracks reposted on that profile, identify the artist profiles associated with those tracks, and then eventually scrape emails from those profiles.

How can I do that?
Thanks in advance!
User avatar
martin@rootjazz
Site Admin
Posts: 34640
Joined: Fri Jan 25, 2013 10:06 pm
Location: The Funk
Contact:

Re: Email scraping question

Post by martin@rootjazz »

SCRAPE TAB

I think there should be a box for SCRAPE ALL THIS USERS REPOSTS

If not, you can do it via a CUSTOM SEARCH BUILDER


For custom search there should be a tutorial

SoundcloudManager v3 - All tutorials
https://www.youtube.com/watch?v=CtEqQSU ... 8Ok47r_keF




SoundcloudManager is constantly updating, being improved, added to. Due to this some of the older tutorials may be a bit out of date in regard to the look of the application, however the sentiment will still apply. Firstly it is receommended you watch the Latest Tutorials for SoundcloudManagerv3, which are up to date and relate to the current application. Any module / function you do not find in the v3 tutorials will be covered in the older tutorials.
---------------------------------

Beginners Guide SoundcloudManager v3
https://www.youtube.com/watch?v=CtEqQSU ... mQ-y_37SdJ

SoundcloudManager v3 - All tutorials
https://www.youtube.com/watch?v=CtEqQSU ... 8Ok47r_keF

Soundcloud Manager v3 - Features
https://www.youtube.com/watch?v=iaJxz3M ... DfaQ5XjyLG

---------------------------------
Soundcloud Manager Beginners Quick Guide Series (older but still applicable)
https://www.youtube.com/playlist?list=P ... W4iV_v9G7Z

SeriesA: Beginners Guide to Get verified list of proxies / Create Accounts / Increase Likes (older but still applicable)
https://www.youtube.com/playlist?list=P ... jUvZUUSoCw

Soundcloud Manager All Beginners Tutorials (older but still applicable)
https://www.youtube.com/playlist?list=P ... ROFzp1nF08

All Video Tutorials
https://www.youtube.com/playlist?list=P ... ihFRBzMGYD
eer
Posts: 20
Joined: Sat Apr 05, 2014 5:43 pm

Re: Email scraping question

Post by eer »

Thanks a lot, Martin.
I am making some progress with the email scraping features and would like to know if there is a way to specify the speed of the processor for this task? Right now, it runs through thousands of profiles in just a minute and I am concerned SC may block my IP (I cannot use proxies). Can I modify the processor speed to only visit one profile every 30 seconds for example? Or do you see no risk to IP blocking?
Thank you!
eer
Posts: 20
Joined: Sat Apr 05, 2014 5:43 pm

Re: Email scraping question

Post by eer »

Hi Martin - one more quick question:

I know got a process to work that returns some good results. It works as follows:

1. Using Scraper, I search users from track search with a specific search term (e.g., "remix")
2. I then use the Email Scraper to scrape emails for the user profiles returned by process #1.

Is it possible to set these two processes up in a way that they auto-repeat every day?
I didn't see an option to attach a recurring scheduling to these two. And the other issue I would have to figure out is that process #1 returns a profile list as text file that is an input into process #2. That file has a different name each time, so the name of the input file for process #2 would be different each day.

Can this be done?
Thank you!
User avatar
martin@rootjazz
Site Admin
Posts: 34640
Joined: Fri Jan 25, 2013 10:06 pm
Location: The Funk
Contact:

Re: Email scraping question

Post by martin@rootjazz »

you should be able to RIGHT CLICK any action and change the repeat value.


But you cannot feed the results of one into the other action.


However, you don't need to do it in a two step process.

Just create the scrape and there should be an option RETURN EMAILS IF PRESENT. This will instruct the SCRAPE acction to log out emails (as SC returns the description for the profile during the scrape. It won't give you as many details as the pure scrape emails option, but hopefully is enough

Then you can repeat the scrape as you want and continually get the emails



Regards,
Martin
eer
Posts: 20
Joined: Sat Apr 05, 2014 5:43 pm

Re: Email scraping question

Post by eer »

Hi Martin - yes, LOVE it! This works indeed with just one process that returns the emails.

But I might be missing something with the auto-repeat of this process. I right clicked on the process and changed the "Repeat every X hours" value to 24. But the process did not run again. It also did not auto-reschedule again after completing.

What am I missing?
User avatar
martin@rootjazz
Site Admin
Posts: 34640
Joined: Fri Jan 25, 2013 10:06 pm
Location: The Funk
Contact:

Re: Email scraping question

Post by martin@rootjazz »

when did you change it? Before running it, during or on completion?

IF you changed it after completion then the value was still 0 when it checked to run again,

Right click > REDO and it should start repeating

*should*
Post Reply