<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>sleepydisco &#187; recommendation</title>
	<atom:link href="http://www.sleepydisco.com/tag/recommendation/feed" rel="self" type="application/rss+xml" />
	<link>http://www.sleepydisco.com</link>
	<description>A blog about technology, music, food and photography.</description>
	<lastBuildDate>Sun, 18 Apr 2010 20:53:05 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.1</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>The Social Network in the Music Blogosphere</title>
		<link>http://www.sleepydisco.com/computer-science/social-network-in-the-music-blogosphere</link>
		<comments>http://www.sleepydisco.com/computer-science/social-network-in-the-music-blogosphere#comments</comments>
		<pubDate>Sat, 16 Jan 2010 13:39:15 +0000</pubDate>
		<dc:creator>David</dc:creator>
				<category><![CDATA[Computer Science]]></category>
		<category><![CDATA[analysis]]></category>
		<category><![CDATA[msc]]></category>
		<category><![CDATA[network theory]]></category>
		<category><![CDATA[Project]]></category>
		<category><![CDATA[recommendation]]></category>
		<category><![CDATA[report]]></category>

		<guid isPermaLink="false">http://www.sleepydisco.com/everything-else/master</guid>
		<description><![CDATA[About this time last year, I was busy putting the finishing touches to a data harvesting program which would go off to the internet and grab posts from music blogs when notified of updates via a feed. The motivation was my MSc Computer Science project, at the time untitled, and without much of a plan [...]]]></description>
			<content:encoded><![CDATA[<p>About this time last year, I was busy putting the finishing touches to a data harvesting program which would go off to the internet and grab posts from music blogs when notified of updates via a feed. The motivation was my MSc Computer Science project, at the time untitled, and without much of a plan or a direction to go in. I knew I wanted to do something related to music, and probably to do with recommendations, with a view to creating a hopefully fresh take at how content can be discovered from editorially subjective sources, rather than behavioural sources such as playlists.<span id="more-257"></span></p>
<p>Skip to September, and with the end-of-month deadline nearing, I&#8217;m putting the finishing touches to the project report, now titled: &#8216;The Social Network in the Music Blogosphere&#8217;. It&#8217;s an exploration of the relationships between blogs and the artists that they write about, using network theory, with an analysis of some subject discovery and classification methods. There&#8217;s a chunk of Python code in the appendices (I used a lot of <a title="NetworkX - Python network library" href="http://networkx.lanl.gov/">NetworkX</a>), as well as some subject-based clusterings represented as dendrograms. If that sounds like your bag, you can get your copy of my project report here:</p>
<p>Download PDF:<a href="http://www.sleepydisco.com/wp-content/uploads/2010/01/The-Social-Network-in-the-Music-Blogosphere.pdf"> The Social Network in the Music Blogosphere</a> (3.6MB)</p>
<p>The network dataset of blog-artist relationships is also available:</p>
<p><a href="/msc/download/blog-artist_network.tar.gz">blog-artist_network.tar.gz</a> (301K)<br />
<a href="/msc/download/blog-artist_network.zip">blog-artist_network.zip</a> (299K)</p>
<p>It contains the following files:</p>
<p><span style="text-decoration: underline;">artists.txt</span>: id, and normalized and denormalized versions of all artist names</p>
<pre>ARTIST_ID  ARTIST_NAME  DENORM_ARTIST_NAME</pre>
<p><span style="text-decoration: underline;">blogs.txt</span>: id and name of each blog</p>
<pre>BLOG_ID  BLOG_NAME</pre>
<p><span style="text-decoration: underline;">blog-artist.txt</span>: the edges between a blog and an artist, and the weighting given to the relationship</p>
<pre>BLOG_ID  ARTIST_ID  WEIGHT
</pre>
]]></content:encoded>
			<wfw:commentRss>http://www.sleepydisco.com/computer-science/social-network-in-the-music-blogosphere/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
