scrape bios and songs?

Discussions to do with Soundcloud Manager. Do not use for support, use the dedicated support forum for help requests
dariush90025
Posts: 56
Joined: Fri Oct 02, 2015 4:49 pm

scrape bios and songs?

Post by dariush90025 »

hey all. can SCM scrape Soundcloud bios and song names?
User avatar
martin@rootjazz
Site Admin
Posts: 34674
Joined: Fri Jan 25, 2013 10:06 pm
Location: The Funk
Contact:

Re: scrape bios and songs?

Post by martin@rootjazz »

goto the SCRAPE tab.

When performing a PROFILE search, there should be an option to scrape details, which will include the bio.
Or if you have a list of profile URLs already, you can use that in the Scrape Details search.

To scrape track details (track name and others), first you need a list of track URLs, then run scrape track details function
dariush90025
Posts: 56
Joined: Fri Oct 02, 2015 4:49 pm

Re: scrape bios and songs?

Post by dariush90025 »

I did all user profile search using the SCRAPE tab but I didn't get the whole bio info. I can scrape the emails from the bio, tags, Plan, Followers, Followings, Likes, Comments, Playlists, Reposts, Tracks, Location and Links.

Can SCM scrape just the whole bio info?
Also, can SCM scrape a list of track names just by using a user URL?
User avatar
martin@rootjazz
Site Admin
Posts: 34674
Joined: Fri Jan 25, 2013 10:06 pm
Location: The Funk
Contact:

Re: scrape bios and songs?

Post by martin@rootjazz »

dariush90025 wrote:I did all user profile search using the SCRAPE tab but I didn't get the whole bio info.
What do you mean, didn't get the WHOLE bio. You mean you got some and it was cut off, or you didn't get it at all? Examples here would be helpful.

Can SCM scrape just the whole bio info?
No, not JUST the bio. But it should scrape it when you scrape profile details in a CSV
Also, can SCM scrape a list of track names just by using a user URL?
Firstly scrape the list of track URLS from a specific user
Then use the saved track URLs to scrape track details which includes the track name
dariush90025
Posts: 56
Joined: Fri Oct 02, 2015 4:49 pm

Re: scrape bios and songs?

Post by dariush90025 »

martin@rootjazz wrote:What do you mean, didn't get the WHOLE bio. You mean you got some and it was cut off, or you didn't get it at all? Examples here would be helpful.
I want to scrape the bio info that I squared with red. I can get other info such as the links and emails within the bio but not the whole text of the bio.

Image


Image


Image
User avatar
martin@rootjazz
Site Admin
Posts: 34674
Joined: Fri Jan 25, 2013 10:06 pm
Location: The Funk
Contact:

Re: scrape bios and songs?

Post by martin@rootjazz »

ah ok, I get you. It is not scraped at all.

I thought there was an option to get the bio. I'll check now and reply back shortly
User avatar
martin@rootjazz
Site Admin
Posts: 34674
Joined: Fri Jan 25, 2013 10:06 pm
Location: The Funk
Contact:

Re: scrape bios and songs?

Post by martin@rootjazz »

bio will be added to the details saved in the next update
User avatar
martin@rootjazz
Site Admin
Posts: 34674
Joined: Fri Jan 25, 2013 10:06 pm
Location: The Funk
Contact:

Re: scrape bios and songs?

Post by martin@rootjazz »

dariush90025
Posts: 56
Joined: Fri Oct 02, 2015 4:49 pm

Re: scrape bios and songs?

Post by dariush90025 »

Thanks for the update. Works fine now.

If we want to scrape like 1000 Soundcloud profile URLs, should we get proxies for that?
Keep in mind we don't use SCM for any kind of activities that involve our Soundcloud account, like commenting and posting and multiple accounts etc. We only use SCM to extract info from Soundcloud.
dariush90025
Posts: 56
Joined: Fri Oct 02, 2015 4:49 pm

Re: scrape bios and songs?

Post by dariush90025 »

Image

Also, can SCM do not scatter the list of the original file when scraping?

Because when we scrape profile details (result is on the left side of the pic) using a list of Soundcloud URLs (right side of the pic), the result came out scattered. It doesn't follow the original file.

eg. The first URL from the original becomes the second in the result.
The third URL from the original becomes the eleventh in the result.

The result is the same when I scrape track details from a list of track URLs.
Post Reply