<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>mediaczar &#187; twitter</title>
	<atom:link href="http://mediaczar.com/blog/category/twitter/feed/" rel="self" type="application/rss+xml" />
	<link>http://mediaczar.com/blog</link>
	<description>a blog by mat morrison</description>
	<lastBuildDate>Mon, 22 Mar 2010 12:12:27 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>A first stab at a perl script to create Twitter friend/follow matrices</title>
		<link>http://mediaczar.com/blog/2009/07/a-first-stab-at-a-perl-script-to-create-twitter-friendfollow-matrices/</link>
		<comments>http://mediaczar.com/blog/2009/07/a-first-stab-at-a-perl-script-to-create-twitter-friendfollow-matrices/#comments</comments>
		<pubDate>Tue, 14 Jul 2009 15:23:08 +0000</pubDate>
		<dc:creator>Mat Morrison</dc:creator>
				<category><![CDATA[hack]]></category>
		<category><![CDATA[twitter]]></category>
		<category><![CDATA[kludge]]></category>
		<category><![CDATA[network analysis]]></category>
		<category><![CDATA[networks]]></category>
		<category><![CDATA[perl]]></category>

		<guid isPermaLink="false">http://mediaczar.com/blog/?p=988</guid>
		<description><![CDATA[Geek alert: if the title of this post isn&#8217;t a dead giveaway I should tell you &#8212; unless you&#8217;re interested in APIs and badly-put-together bits of code &#8212; this probably isn&#8217;t for you.
I&#8217;ve recently found myself using a service provided by Damon Clinkscale called DoesFollow. All it does is answer the simple question &#8220;does twitter [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: left; margin-right: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fmediaczar.com%2Fblog%2F2009%2F07%2Fa-first-stab-at-a-perl-script-to-create-twitter-friendfollow-matrices%2F"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fmediaczar.com%2Fblog%2F2009%2F07%2Fa-first-stab-at-a-perl-script-to-create-twitter-friendfollow-matrices%2F" height="61" width="51" /></a></div><p><em>Geek alert: if the title of this post isn&#8217;t a dead giveaway I should tell you &#8212; unless you&#8217;re interested in APIs and badly-put-together bits of code &#8212; this probably isn&#8217;t for you.</em></p>
<p>I&#8217;ve recently found myself using a service provided by <a href="http://twitter.com/damon">Damon Clinkscale</a> called <a href="http://doesfollow.com/">DoesFollow</a>. All it does is answer the simple question &#8220;does twitter user A follow twitter user B?&#8221; Apart from a frill which lets you reverse the order of your question (&#8220;does twitter user B follow twitter user A?&#8221;) that&#8217;s all it does. You can even interrogate it from the address bar like this: <code><a href="http://doesfollow.com/barackobama/mediaczar">http://doesfollow.com/barackobama/mediaczar</a></code></p>
<p><a href="http://doesfollow.com/barackobama/mediaczar"><img src="http://mediaczar.com/blog/wp-content/uploads/2009/07/doesfollow-300x100.jpg" alt="doesfollow" title="doesfollow" width="300" height="100" class="aligncenter size-medium wp-image-995" /></a></p>
<p>While I was thinking about how useful a service this is, I was suddenly struck by a moment of clarity. A lot of the research I&#8217;ve been doing could be simplified by something like this.<br />
<span id="more-988"></span><br />
Quite often I want to find out whether <a href="http://mediaczar.com/blog/tag/mp/">MPs</a> or <a href="http://mediaczar.com/blog/tag/congress/">congressmen</a> or <a href="http://mediaczar.com/blog/2008/12/some-twitter-social-network-analysis/">PR people</a> follow each other on Twitter.</p>
<p>The way that I&#8217;ve been doing this until now is </p>
<ol>
<li>make a list of the people who I&#8217;m interested in researching</li>
<li>for each person on that list, grab the list of <em>all</em> the Twitter people whom they follow</li>
<li>process the list so that only relationships between the people on the list show up</li>
</ol>
<p>If <em>all</em> I&#8217;m doing is checking to see who follows whom, then this is a horribly wasteful way of doing things. The Twitter API limits the number of calls one can make on it &#8212; so this wastage leads to things taking much longer.</p>
<p>If only I could cycle all the names I want to check through something like DoesFollow!</p>
<p>Well &#8211; it turns out that I can. And in theory it&#8217;s not much harder than using DoesFollow. The <a href="http://apiwiki.twitter.com/Twitter-API-Documentation">Twitter API</a> (which is what DoesFollow uses, after all) has a method called <code>friendship/exists</code>. All we have to do is send Twitter the following request: </p>
<p><code><a href="http://twitter.com/friendships/exists.xml?user_a=barackobama&#038;user_b=mediaczar">http://twitter.com/friendships/exists.xml?user_a=<strong>barackobama</strong>&#038;user_b=<strong>mediaczar</strong></a></code></p>
<p>and it will come back with the answer:</p>
<p><code>&lt;friends&gt;true&lt;/friends&gt; </code><br />
or<br />
<code>&lt;friends&gt;false&lt;/friends&gt;</code></p>
<h3>Kludge-y perl code</h3>
<p><a href="http://mediaczar.com/blog/wp-content/uploads/2009/07/poor-man-hot-water-heater.jpg"><img src="http://mediaczar.com/blog/wp-content/uploads/2009/07/poor-man-hot-water-heater.jpg" alt="poor-man-hot-water-heater" title="poor-man-hot-water-heater" width="480" height="359" class="aligncenter size-full wp-image-999" /></a></p>
<p><em>(This fabulous picture courtesy of <a href="http://thereifixedit.com/">There, I Fixed It</a>)</em></p>
<p>So I tried to do this using <a href="http://pipes.yahoo.com/">Yahoo! Pipes</a>, but there are too many nested loops. You need to do something like this:</p>
<p><code><br />
get list of names</p>
<p>for each user_a (in list) {</p>
<ul>
for each user_b (in list) {</p>
<ul> does friendship exist</ul>
<p>     }</ul>
<p>}<br />
</code></p>
<p>There&#8217;s no easy way to get Pipes to do this, as far as I can see (I&#8217;ll keep trying, but if someone else can help, I&#8217;d be v. grateful.)</p>
<p>So I&#8217;ve pulled together a badly-written perl script to do the work for me. </p>
<h4>The script</h4>
<p>[code lang="perl"]<br />
#!/usr/bin/perl<br />
# checks the Twitter API to find the friendships between a list of usernames<br />
# this should really use the NEW API call that would let us halve the number<br />
# of API calls<br />
# author: Mat Morrison<br />
# date: Friday July 10, 2009<br />
use warnings;<br />
use LWP::Simple;<br />
# set up variables<br />
# we're just using a whitespace delimited list for the moment<br />
my @usernames = qw(kerrymg mediaczar timhoang titusbicknell);<br />
# let's build the matrix with a hash of hashes...<br />
# to begin with, we'll include diagonal values -<br />
# that is -- we'll check to see whether @mediaczar follows @mediaczar<br />
foreach $user_a(@usernames) {<br />
	foreach $user_b(@usernames) {<br />
	# we should put in a conditional clause that will check for the diagonal values<br />
	# and not bother checking whether someone is a friend of themselves...<br />
	$url = 'http://twitter.com/friendships/exists.xml?user_a='<br />
	.$user_a<br />
	.'&#038;user_b='<br />
	.$user_b;<br />
	# get XML file from Twitter -- it's an astonishingly simple XML file that reads<br />
	# <friends>true</friends><br />
	# or<br />
	# <friends>false</friends><br />
	# so we don't need to do much with it...<br />
	$follows = get $url;<br />
	  die 'Can\'t get $url' unless defined $follows;<br />
	# strip the tags - I'm using a generic "HTML stripping" regex<br />
	$follows =~ s/<(.|\n)+?>//g;<br />
	# we should probably convert "true" values to 1 and "false" values to zero or blank<br />
	# now let's push data into the matrix<br />
		 $matrix{$user_a}{$user_b} = $follows<br />
	}<br />
}<br />
# spit out the data as a tab-delimited table<br />
# print the top line first<br />
for $user_b ( keys %matrix ) {<br />
	print "\t$user_b";<br />
}<br />
# now print the values<br />
# they're all neatly arranged in the matrix so we<br />
# can just print them out sequentially<br />
for $user_a ( keys %matrix ) {<br />
    print "\n$source";<br />
    for $follows ( keys %{ $matrix{$user_a} } ) {<br />
		print "\t$matrix{$user_a}{$follows} ";<br />
    }<br />
}<br />
print "\n";<br />
[/code]</p>
<h4>Where next?</h4>
<p>Most of my thinking is included above in the code comments. An obvious mistake I&#8217;m making is checking to see whether, say, @mediaczar follows @mediaczar. That wastes <em>n</em> API calls per search. But a more serious mistake is <strong>not to be using the new <code>friendships/show</code> method</strong>. Because it tells you whether user A follows user B and whether user B follows user A at the same time, it would save me <em>lots</em> of API calls. How many lots? Well take a look at this.</p>
<p>This is what I&#8217;m doing at the moment &#8212; checking <em>each and every</em> cell in the matrix:</p>
<p><img align="center" src="http://img.skitch.com/20090714-njbkr7micbcgum5erj1dsxhc46.jpg" alt="clumsy API call matrix" /></p>
<p>This is what I&#8217;d be doing if I removed the diagonals:</p>
<p><img align="center" src="http://img.skitch.com/20090714-rk7r17nx1n491geim25meg1jb9.jpg" alt="Matrix with diagonals removed" /></p>
<p>And <em>this</em> is what I&#8217;d be doing if I used the newer API call:</p>
<p><img align="center" src="http://img.skitch.com/20090714-8rbx3dr3qe4ctm7ajp465ewj8w.jpg" alt="Matrix using the new API call" /></p>
<p>I had to look up <a href="http://www.curiousmath.com/index.php?name=News&#038;file=article&#038;sid=23">the formula</a> for working this out without colouring in little boxes. With a little tweaking (to prevent the diagonals from creeping back in), here it is:</p>
<p><code>((n-1)^2)+n-1)/2</code></p>
<p>So &#8212; for <a href="http://tweetcongress.org/parties">a list of congress people </a>(159 on twitter as at Tuesday July 14, 2009) that&#8217;d be <code>((156-1)^2-1+156)/2 = 12,090</code> API calls. Which is still a lot and will require some careful throttling, but (literally) not half as many as the 156^2 = 24,336 API calls that I&#8217;d need to run it as the script currently stands.</p>
<p>So &#8211; back to the drawing board for a while. I really can&#8217;t work out a programmatic way of doing this. Hmph.</p>
]]></content:encoded>
			<wfw:commentRss>http://mediaczar.com/blog/2009/07/a-first-stab-at-a-perl-script-to-create-twitter-friendfollow-matrices/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>Should we ask employees to tweet client stories?</title>
		<link>http://mediaczar.com/blog/2009/05/should-we-ask-employees-to-tweet-client-stories/</link>
		<comments>http://mediaczar.com/blog/2009/05/should-we-ask-employees-to-tweet-client-stories/#comments</comments>
		<pubDate>Fri, 15 May 2009 09:42:35 +0000</pubDate>
		<dc:creator>Mat Morrison</dc:creator>
				<category><![CDATA[opinion]]></category>
		<category><![CDATA[twitter]]></category>
		<category><![CDATA[digg]]></category>
		<category><![CDATA[facebook]]></category>
		<category><![CDATA[networks]]></category>
		<category><![CDATA[social media]]></category>
		<category><![CDATA[social media policy]]></category>

		<guid isPermaLink="false">http://mediaczar.com/blog/?p=937</guid>
		<description><![CDATA[
Here&#8217;s an interesting ethical question: is it OK to ask employees to share company and client news through their personal social networks?
Here&#8217;s a hypothetical example. An agency has just launched a new ad campaign and posted the TV spot on YouTube. Is it OK to send an all-hands email asking people to share the link [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: left; margin-right: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fmediaczar.com%2Fblog%2F2009%2F05%2Fshould-we-ask-employees-to-tweet-client-stories%2F"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fmediaczar.com%2Fblog%2F2009%2F05%2Fshould-we-ask-employees-to-tweet-client-stories%2F" height="61" width="51" /></a></div><p><a href="http://www.flickr.com/photos/santos/56256773/" title="wall of spam by chotda, on Flickr"><img src="http://farm1.static.flickr.com/32/56256773_2050d0ebc1.jpg" width="375" height="500" alt="wall of spam" /></a></p>
<p>Here&#8217;s an interesting ethical question: is it OK to ask employees to share company and client news through their personal social networks?</p>
<p>Here&#8217;s a hypothetical example. An agency has just launched a new ad campaign and posted the TV spot on YouTube. Is it OK to send an all-hands email asking people to share the link on Twitter and Facebook? </p>
<p>Let&#8217;s take it a little further. Is it OK to ask them to sign into YouTube using their personal accounts, and rate the video? It seems harmless enough, doesn&#8217;t it? You&#8217;re not telling them <em>how</em> they should rate it, after all.</p>
<p>But what if you asked them to leave comments? Any normal agency or client side social media policy will tell them that they have to <em>disclose their relationship</em> with the makers of the video. And you wouldn&#8217;t really want a whole bunch of comments that start &#8220;Hi, I work for the agency that made this ad and I think it&#8217;s really great,&#8221; would you? What makes the two things different?<br />
<span id="more-937"></span></p>
<h3>Digging for victory</h3>
<p>OK. Another hypothetical example. The social news site Digg is a huge source of traffic for most  news websites. The Telegraph, for example, gets around <a href="http://www.malcolmcoles.co.uk/blog/telegraph-trafficsocial-sites/">75K visits<em> a day</em></a> from the service. That&#8217;s an awful lot of traffic. </p>
<p>The thing is, with Digg, you really want to make the front page if you want the big traffic. Around <a href="http://www.centernetworks.com/digg-town-hall-recap">10K stories are submitted by users to Digg every day</a>, and only 150 or so make it to the front page. So we&#8217;d need to be smart.</p>
<p>Digg is less open to being gamed than it used to be, but let&#8217;s say that a smart agency could still <a href="http://valleywag.gawker.com/tech/explainer/how-diggs-algorithm-works-++-the-100+word-version-328207.php">deconstruct Digg&#8217;s algorithm</a> sufficiently that it can use its network of staff to improve the chances of a story (or review) making it to the front page that shows their client in a positive light.</p>
<p>All they&#8217;d need to do (say) is send an all-hands email that mobilised your staff to digg a particular story at the right time of day. Would this be legitimate? <a href="http://blogs.zdnet.com/BTL/?p=14023">Digg clearly thinks not</a>, but are they the best judge?</p>
<h3>Ballot stuffing</h3>
<p>Let&#8217;s say a client gets nominated for a <a href="http://www.webbyawards.com/">Webby</a> or one of the other user-voted awards out there. It&#8217;s common and acceptable practice for web services to use any means at their disposal to beg for votes.</p>
<p>So is it OK to send an all-hands email asking your staff to register and vote? Is it OK to ask them to use their Facebook and Twitter accounts to ask their friends and followers to vote?</p>
<h3>Here&#8217;s what I think</h3>
<p>I think that this is an ethical minefield, but I&#8217;ve got a couple of clear points of view that are up for discussion. First off, and from a purely business perspective: </p>
<h4>If it&#8217;s valuable then clients should pay us to do it. If it&#8217;s not valuable we shouldn&#8217;t do it.</h4>
<p>Ignore the ethics. When was the last time you sent an email that said, &#8220;Please share this with your friends and use billing code xxx.xx when you record it on your time sheet&#8221;? Of course, it only takes a few seconds to relay a message (fewer if you simply copy and paste the message from the all-hands email to your social media presences). What the hell &#8211; it&#8217;s all just part of the team spirit, isn&#8217;t it?</p>
<p>But if this service is of any value at all to the client, then we&#8217;ll see the demands on our time beginning to increase. Soon we&#8217;ll find ourselves doing several a day. Larger agencies with more staff will offer a more valuable service to clients than boutique agencies (&#8220;We have a thousand trained staff with Twitter and Facebook accounts primed to promote your campaign!&#8221;)</p>
<p>We could even work out some kind of ratcheted scale that said that people with more Twitter followers could bill at a higher rate; or we might start looking at <a href="http://mediaczar.com/blog/2009/01/porter-novelli-twitter-folk-80-20/">unduplicated reach.</a></p>
<p>This might seem fanciful, but stories like the <a href="http://usocial.net/">uSocial&#8217;s</a> offer to <a href="http://latimesblogs.latimes.com/technology/2009/03/usocial-digg.html">game Digg&#8217;s front page</a> for around $200 tell us that the thriving black hat market for this sort of thing is getting a little greyer.</p>
<p>If clients value this, then they <em>should</em> pay us. If we give it away as a &#8220;value added&#8221; service, then are we sure that we&#8217;ve communicated this properly so that the client understands the value we added? </p>
<p>How can we be certain we aren&#8217;t <em>just undermining our digital value proposition</em>?</p>
<p>Now, here&#8217;s the tricky bit where I bring ethics back into it. If we&#8217;re paying our staff to relay messages to their networks on behalf of our clients, what makes this different from spam?</p>
<h4>Employees personal networks are their personal property</h4>
<p>I&#8217;m one of those people who believes that the boundaries are blurring between our personal, public and professional lives. My colleague Chris Nee has posted about the <a href="http://clickingandscreaming.com/2009/04/28/the-presentation-of-self-in-social-media/"><i>presentation of self in social media</i></a>, expressing a lot of how I feel rather better than I could myself, although that hasn&#8217;t stopped me from covering this elsewhere &#8212; notably in a discussion about our <a href="http://mediaczar.com/blog/2009/03/pr-agencies-and-privacy/">social media policy</a> where I said:</p>
<blockquote><p>I think that a good PR person is someone who manages their relationships well; who can tread the fine line between doing good work for their clients without abusing or exploiting their relationships. Who recognizes the value of their personal network, and their personal brand.</p></blockquote>
<p>Do we have a right to ask our employees to use their personal networks on behalf of our business? I think the answer is &#8211; of <em>course</em> we do. We pay experienced staff more partly because of the social capital that they&#8217;ve managed to accrue in their address books. When you leave your job to go to a new agency, you&#8217;ll take your contacts with you along with all the shared experience, the favours you&#8217;ve done, and the favours you owe.</p>
<p>So if part of the reason we&#8217;re paying them more is because of that network, then we clearly value their network. </p>
<p><em>So why on earth would we encourage our staff to spam their personal networks?</em></p>
<h4>I trust my peers. I don&#8217;t trust spam monkeys</h4>
<p>Part of the reason for the success of social networking tools like Facebook and Twitter is that I can choose who I follow and who I don&#8217;t. I can restrict conversations to a bunch of people I trust and respect. Sure, I&#8217;ve got lots of Twitter followers with names like &#8220;Sophie1982&#8243; and &#8220;EdelmanHR&#8221; but that just means that they hear my inconsequential blatherings, not that I have to hear theirs!</p>
<p>And I follow a lot of my colleagues&#8217; Twitter streams. I&#8217;m pleased to say that these are &#8212; on the whole &#8212; full of meaty chunky content and devoid of spam.</p>
<p>But if we increase the spam content, what will happen? Here&#8217;s what (in the absence of evidence) I believe: their follow rates, retweet rates and mention rates will all begin to drop off. From your experience, what do you think?</p>
<h3>Where does this leave us?</h3>
<p>I&#8217;ve put a somewhat one-sided argument here. I&#8217;m sure I&#8217;m open to all sorts of rebuttals like, &#8220;surely, if the content is interesting/useful/entertaining then our staff will only be <em>adding</em> value to their networks?&#8221; and &#8220;but surely it&#8217;s up to them whether they want to pass it on to their friends?&#8221; I&#8217;ll wait for these to roll in before I start trying to address them.</p>
<p>But here&#8217;s what I think. I think that we&#8217;re trying to teach our colleagues to learn from managing their personal communities in order better to manage our clients&#8217; communities. Anything that teaches them to prostitute their networks is a retrograde move. </p>
<p>Am I wrong? </p>
]]></content:encoded>
			<wfw:commentRss>http://mediaczar.com/blog/2009/05/should-we-ask-employees-to-tweet-client-stories/feed/</wfw:commentRss>
		<slash:comments>10</slash:comments>
		</item>
		<item>
		<title>Methodology and thoughts behind those PR Week Twitter stats</title>
		<link>http://mediaczar.com/blog/2009/02/methodology-and-thoughts-behind-those-pr-week-twitter-stats/</link>
		<comments>http://mediaczar.com/blog/2009/02/methodology-and-thoughts-behind-those-pr-week-twitter-stats/#comments</comments>
		<pubDate>Thu, 26 Feb 2009 19:57:56 +0000</pubDate>
		<dc:creator>Mat Morrison</dc:creator>
				<category><![CDATA[pipes]]></category>
		<category><![CDATA[porter novelli]]></category>
		<category><![CDATA[twitter]]></category>
		<category><![CDATA[pr week]]></category>

		<guid isPermaLink="false">http://mediaczar.com/blog/?p=862</guid>
		<description><![CDATA[There&#8217;s a school of thought that says that what&#8217;s important in social media is to attempt to create debate, not consensus.

Peter Hay from PR Week and I appear to have been rather successful in this. This morning, PR Week published an article, Twitter has suddenly exploded. Almost immediately, Twitter (or at least our particular neighbourhood [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: left; margin-right: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fmediaczar.com%2Fblog%2F2009%2F02%2Fmethodology-and-thoughts-behind-those-pr-week-twitter-stats%2F"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fmediaczar.com%2Fblog%2F2009%2F02%2Fmethodology-and-thoughts-behind-those-pr-week-twitter-stats%2F" height="61" width="51" /></a></div><p>There&#8217;s a school of thought that says that what&#8217;s important in social media is to attempt to create debate, not consensus.</p>
<p><a href="http://www.flickr.com/photos/paws_and_toes/2465075519/" title="Cat Among The Pigeons by ChinchillaVilla, on Flickr"><img src="http://farm3.static.flickr.com/2074/2465075519_a517c37218.jpg" width="500" height="450" alt="Cat Among The Pigeons" /></a></p>
<p>Peter Hay from PR Week and I appear to have been rather successful in this. This morning, PR Week published an article, <a href="http://www.prweek.com/uk/home/article/884307/twitter-suddenly-exploded/">Twitter has suddenly exploded</a>. Almost immediately, Twitter (or at least our particular neighbourhood of Twitter) suddenly exploded.</p>
<p>One or two people were rather scathing: suggesting that the stats demonstrated that Peter and I didn&#8217;t understand the &#8220;essence of Twitter&#8221; or that they were &#8220;obviously flawed&#8221;, or that we had &#8220;redefined shallow&#8221;.</p>
<p>Indeed (horror of horrors) some people even went so far to suggest that Porter Novelli had ginned up the results to put us at the top. In fact, in PR week&#8217;s list, we came second. But no doubt this was a Machiavellian ploy &#8212; it&#8217;s details like those, Pooh Bah would say, that &#8220;give artistic verisimilitude to an otherwise bald and unconvincing narrative.&#8221;</p>
<p>I joke, but I can completely understand people&#8217;s strong feelings about this; PR Week was torn between a desire to cover our approach (and give credit where appropriate) and a need to keep the article readable and relevant to the greater proportion of their readers.</p>
<p>I&#8217;d like to share our methodology with you all so that you can repeat our experiments, should you so wish. After that, I&#8217;ll talk about the methodology that we were originally going to follow, </p>
<p>Tomorrow (once it&#8217;s had a chance to blow over), I&#8217;ll post some quick thoughts on the whole storm-in-a-Tweetcup thing.</p>
<h3>Methodology</h3>
<p>We used Michael Litman&#8217;s (<a href="http://twitter.com/litmanlive">@litmanlive</a>)  list of <a href="http://ukmediatweeple.pbwiki.com/">UK Media Tweeple</a>. This was based on <a href="http://www.prblogger.com/2008/11/uk-pr-people-on-twitter/">original work</a> by Stephen Davies (<a href="http://twitter.com/stedavies">@stedavies</a>) but has been wikified so that agencies can (should they so choose) keep their information up to date.</p>
<p>Lots of people on the list were pretty borderline &#8212; there are in-house teams and vendors there, as well as agencies with a significantly broader remit than simply &#8220;PR&#8221;. I am a relative newcomer to the world of PR, and was more than happy to let PR Week define who is PR and who isn&#8217;t, but we erred on the generous side. <a href="http://wearesocial.net"> We are Social</a>, for example, made the cut to be on the research list.</p>
<p>Had we had the time, I&#8217;d have sent a note out over Twitter asking everyone to update their entries. Time, however was not on our side, and I didn&#8217;t even get around to hinting at what I was doing until <a href="http://twitter.com/mediaczar/status/1242458470">the evening of the 23rd</a>.<br />
<a href="http://pipes.yahoo.com/pipes/pipe.edit?_id=0b76153699cf1f9c6c82344b7a85abb4"><img src="http://mediaczar.com/blog/wp-content/uploads/2009/02/pr-week-pipe-1.jpg" alt="PR Week Twitter Stats Yahoo! Pipe" title="PR Week Twitter Stats Yahoo! Pipe" width="319" height="658" class="alignright size-full wp-image-864" /></a><br />
By then though it was already clear that I had a large job on my hands; there were almost 350 people on the list. On the whole, the UK PR community should be proud of how quickly it has reacted to the whole &#8220;Twitter thing&#8221;.</p>
<p>I took the list, published it as a <a href="http://spreadsheets.google.com/ccc?key=p4QDp5UmTKxQvaPp5htQbCQ&#038;hl=en">Google Spreadsheet</a> and &#8212; using  a <a href="http://pipes.yahoo.com/mediaczar/quickprweekrankings">Yahoo! Pipe</a> that I adapted for the purpose, queried the <a href="http://apiwiki.twitter.com/">Twitter API</a> for the summary data on each account on that list.</p>
<p>Twitter gives you all <em>sorts</em> of interesting information, but what we were grabbed were the following:</p>
<ul>
<li>Date joined Twitter</li>
<li>Number of Friends</li>
<li>Number of Followers, and</li>
<li>Number of Updates.</li>
</ul>
<p>That allowed us to create <a href="http://spreadsheets.google.com/ccc?key=p4QDp5UmTKxQHP4lJ0QDVZw&#038;hl=en"><em>this</em> spreadsheet</a>, from which the stats mentioned in the PR Week article were taken. </p>
<p>Again, Porter Novelli took no part in the editorial decisions (although they seem pretty straightforward.) You will recall that Peter and Gemma were writing for a general readership, not for the Twitterverse!</p>
<h3>Methodology we&#8217;d like to have used</h3>
<p>Those of you who&#8217;ve read my blog before will know that my real interest in Twitter is more complex than the previous methodology would suggest. When Peter and I first discussed the exercise on Monday we had been hoping to do something more along the lines of the <a href="http://mediaczar.com/blog/category/networks/">network analysis</a> that we&#8217;ve been fiddling with at Porter Novelli.</p>
<p>Here are some points to bear in mind. </p>
<p>First of all, <strong>not all followers are created equal.</strong> If I have only ten followers, but they each have a thousand followers, that may mean I have more opportunity-to-influence than if I had a hundred followers with only ten followers each. </p>
<p>More to the point, the fewer people those ten people follow themselves, the more influence I wield within their networks (if I am one of only ten people they follow between them, I will have greater share-of-voice than if I am merely one of ten thousand.)</p>
<p>Secondly, <strong>the followers whom I don&#8217;t share with the rest of the network count for more than those who follow several (or many) of my peers</strong>. The more &#8220;exclusive&#8221; my follower-base, the greater my control over on the flow of information within the overall network, and the greater my value to the network. </p>
<p>I&#8217;ve been doing some work looking at <a href="http://mediaczar.com/blog/2009/02/republicans-vs-democrats-ii/">unduplicated reach</a> among twitter networks. For example, looking at Porter Novelli&#8217;s own <a href="http://mediaczar.com/blog/2009/01/porter-novelli-twitter-folk-80-20/">global Twitter footprint</a>, it was interesting to see how many of our contacts were duplicated.</p>
<p>So what Peter and I really wanted to do was to use some of these techniques on the PR Week data set. For those of you with a mathematical (or social network analytical) bent, we were going to run some eigenvector shizzle on the whole bizzle. Oh &#8212; and look at unduplicated reach for the various companies on the list.</p>
<h4>What went wrong?</h4>
<p>It was always an ambitious project. The 344 people who were under analysis had a fairly daunting 95K followers between them. The Twitter API lets you make 100 requests an hour, and each request returns data on up to 100 followers. Even if we were to assume that everyone had followers in nice tidy multiples of 100 (they don&#8217;t) then it would have taken 9.5 hours to download the data using one Twitter login. </p>
<p>The trick of course, is to use more than one login. Tim Hoang (<a href="http://twitter.com/timhoang">@timhoang</a>) and I quickly registered 50 temporary accounts to power the API requests. Twitter&#8217;s <a href="http://twitter.com/terms">terms</a> have historically been quite relaxed about this sort of thing, and we&#8217;ve always been very careful to try and stay within the spirit of those terms.</p>
<p>But.</p>
<p>Twitter has been hit lately by a bunch of bad things (like spam bots and pyramid schemes), and they&#8217;re tightening up their defenses. This past weekend, they&#8217;ve tightened up a lot, and things that used to be fine just aren&#8217;t.</p>
<p>We managed to collect information on only around 60K followers out of the 95K. This was too large a margin of error to correct (although we made several attempts to do so).</p>
<p>So &#8212; we had to abandon our grand plans, and revert to the simple counts approach (as detailed above.) This won&#8217;t stop us trying to improve our processes, but we&#8217;ll need to talk to Twitter about that.</p>
<h3>Some thoughts</h3>
<p><a href="http://prstick.blogspot.com/">Kate Hartley</a> from Carrot Communications (who sits with me on the PRCA&#8217;s Digital Working Group) joked that it&#8217;s strange how PR people create research-for-news-stories for their own clients on a daily basis, but are miffed when their own techniques are used against them. At one level, I agree with her &#8212; I think that some people are probably disappointed that they aren&#8217;t the ones with their names on the research.</p>
<p>But there&#8217;s more to worry about than that. Here are my thoughts.</p>
<ol>
<li>For God&#8217;s sake get over yourselves! We&#8217;re talking about Twitter here, not the economy. Worry about something important, why don&#8217;t you? I still can&#8217;t get over the fact that &#8212; when a pilot managed land an airplane on a river, the story we all tell each other is &#8220;how it broke on Twitter.&#8221; What &#8212; the story&#8217;s <em>not</em> about a man who magically landed a f*cking plane on a f*cking river? Are we really getting this right?</li>
<li>How influential you are on Twitter is not a <em>real</em> thing. It doesn&#8217;t <em>really</em> matter how many Twitter friends you have (although I&#8217;ve now got heaps, thank you very much!) Context is everything.  My boss, who runs Porter Novelli&#8217;s EMEA network and sits on our Executive Committee  is on Twitter. She is more influential than I, and will continue to be, no matter how many Twitter followers I accrue.
<p>Twitter is just one channel through which exercise your influence. Don&#8217;t give up on your blogs, your Facebook pages, your Amazon reviews, or your Last.fm playlists or your IM friend lists, for God&#8217;s sake. But remember, it&#8217;s <em>who</em> you are, and your relationships that matter; your &#8220;context&#8221;, and not your &#8220;counts.&#8221;
</li>
<li>The really interesting question isn&#8217;t &#8220;who are the Twitterati&#8221; or twitter influencers. I&#8217;m interested in the Twitter thing mainly because I want to see how well it reflects real life. After today, I&#8217;d probably say that it doesn&#8217;t very well, wouldn&#8217;t you?</li>
</ol>
<p>Be warned &#8212; I may just follow this research up with some research on &#8220;how many phone numbers PR people have on their mobile phones.&#8221;  </p>
]]></content:encoded>
			<wfw:commentRss>http://mediaczar.com/blog/2009/02/methodology-and-thoughts-behind-those-pr-week-twitter-stats/feed/</wfw:commentRss>
		<slash:comments>20</slash:comments>
		</item>
		<item>
		<title>Swedish Politicians on Twitter</title>
		<link>http://mediaczar.com/blog/2009/02/swedish-politicians-on-twitter/</link>
		<comments>http://mediaczar.com/blog/2009/02/swedish-politicians-on-twitter/#comments</comments>
		<pubDate>Sun, 22 Feb 2009 17:28:39 +0000</pubDate>
		<dc:creator>Mat Morrison</dc:creator>
				<category><![CDATA[networks]]></category>
		<category><![CDATA[twitter]]></category>
		<category><![CDATA[aisee]]></category>
		<category><![CDATA[mapping]]></category>
		<category><![CDATA[network analysis]]></category>
		<category><![CDATA[politics]]></category>
		<category><![CDATA[sweden]]></category>

		<guid isPermaLink="false">http://mediaczar.com/blog/?p=854</guid>
		<description><![CDATA[Twixdagen does for Swedish politics what Tweetminster does for British. Hampus Brynolf (@hampusbrynolf) just sent me a link to this map he&#8217;s pulled together for their blog: 

You&#8217;ll need to click through to his blog post to experience and interact with the map properly.
Hampus says that he used aiSee to generate an SVG file which [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: left; margin-right: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fmediaczar.com%2Fblog%2F2009%2F02%2Fswedish-politicians-on-twitter%2F"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fmediaczar.com%2Fblog%2F2009%2F02%2Fswedish-politicians-on-twitter%2F" height="61" width="51" /></a></div><p><a href="http://twixdagen.se/">Twixdagen</a> does for Swedish politics what <a href="http://tweetminster.co.uk">Tweetminster</a> does for British. Hampus Brynolf (<a href="http://twitter.com/hampusbrynolf">@hampusbrynolf</a>) just sent me a link to this map he&#8217;s pulled together for their blog: </p>
<p><a href="http://twixdagen.se/blogg/?p=44" class="broken_link" ><img src="http://mediaczar.com/blog/wp-content/uploads/2009/02/twixdagen.jpg" alt="Twixdagen&#039;s map of Twittering Swedish politicians - click to visit the original post" title="Twixdagen&#039;s map of Twittering Swedish politicians" width="421" height="299" class="aligncenter size-full wp-image-855" /></a></p>
<p>You&#8217;ll need to click through to his <a href="http://twixdagen.se/blogg/?p=44" class="broken_link" >blog post</a> to experience and interact with the map properly.</p>
<p>Hampus says that he used <a href="http://www.aisee.com/welcome.htm">aiSee</a> to generate an SVG file which could then be opened in Illustrator to &#8220;search and replace&#8221; on shapes, colors and lines (which explains the good-looking graph.)</p>
]]></content:encoded>
			<wfw:commentRss>http://mediaczar.com/blog/2009/02/swedish-politicians-on-twitter/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Can we calculate party affiliation? (the US Congress Edition)</title>
		<link>http://mediaczar.com/blog/2009/02/can-we-calculate-party-affiliation-the-us-congress-edition/</link>
		<comments>http://mediaczar.com/blog/2009/02/can-we-calculate-party-affiliation-the-us-congress-edition/#comments</comments>
		<pubDate>Fri, 13 Feb 2009 20:10:08 +0000</pubDate>
		<dc:creator>Mat Morrison</dc:creator>
				<category><![CDATA[networks]]></category>
		<category><![CDATA[research]]></category>
		<category><![CDATA[twitter]]></category>
		<category><![CDATA[congress]]></category>
		<category><![CDATA[democrat]]></category>
		<category><![CDATA[gop]]></category>
		<category><![CDATA[jared polis]]></category>
		<category><![CDATA[john mccain]]></category>
		<category><![CDATA[mapping]]></category>
		<category><![CDATA[nancy pelosi]]></category>
		<category><![CDATA[Neil Abercrombie]]></category>
		<category><![CDATA[network analysis]]></category>
		<category><![CDATA[republican]]></category>
		<category><![CDATA[richard durbin]]></category>
		<category><![CDATA[susan collins]]></category>

		<guid isPermaLink="false">http://mediaczar.com/blog/?p=816</guid>
		<description><![CDATA[Using nothing more than their public twitter relationships, is it possible to predict whether a US Congressperson is a Republican or a Democrat? The answer seems to be a guarded &#8220;yes&#8221; &#8212; our tools predict correctly 40/46 times (or around 87% of the cases.)

This post follows on from a post earlier today in which I [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: left; margin-right: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fmediaczar.com%2Fblog%2F2009%2F02%2Fcan-we-calculate-party-affiliation-the-us-congress-edition%2F"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fmediaczar.com%2Fblog%2F2009%2F02%2Fcan-we-calculate-party-affiliation-the-us-congress-edition%2F" height="61" width="51" /></a></div><p><em>Using nothing more than their public twitter relationships, is it possible to predict whether a US Congressperson is a Republican or a Democrat? The answer seems to be a guarded &#8220;yes&#8221; &#8212; our tools predict correctly 40/46 times (or around 87% of the cases.)</em></p>
<p><a href="http://www.flickr.com/photos/porternovelli/3277224960/" title="Calculated Party Affinity US Congress by matmorrison, on Flickr"><img src="http://farm4.static.flickr.com/3323/3277224960_67cf19e4b3.jpg" width="500" height="350" alt="Calculated Party Affinity US Congress" /></a></p>
<p>This post follows on from a post earlier today in which I asked, &#8220;<a href="http://mediaczar.com/blog/2009/02/can-we-calculate-party-affiliation/">can we <em>calculate</em> party affiliation?</a>&#8221; The data set in the earlier post was gathered from the 16 members of the UK parliament who are on Twitter and the relationships between them.</p>
<p><a href="http://tweetcongress.org/">Tweetcongress</a> maintains a list of US congresspeople on Twitter. Today (February 13, 2009) there are  <a href="http://tweetcongress.org/parties">76 congresspeople on the service</a>, but when I collected my data set of &#8220;who follows who&#8221; on February 3, 2009 there were only 65. Of these 65, fully 19 (29%) lived a life of noble isolation with regards the network &#8212; none of their peers linked to them, and they in turn linked to none of their peers. Removing these Miss Havishams from the data set leaves me with 46 twittering congresspeople who form a network.</p>
<p>Now as both social network analysis and <a href="http://www.aesopfables.com/cgi/aesop1.cgi?sel&#038;TheAssandhisPurchaser2">Aesop</a> would have it, &#8220;a man is known by the company he keeps.&#8221; What I mean by this is that given the partisan nature of politics, we should expect that Democrats will link to other Democrat twitterers more often than they link to Republican twitterers and vice versa. So that&#8217;s what NetDraw<a href="#note1">[1]</a> , the software I&#8217;m using for most of this stuff, looks for, or more accurately:</p>
<blockquote><p>
To identify factions, NetDraw software iteratively searches for a distribution of nodes among a selected number of factions to minimise the number of connections between factions and to maximize the number of connections within factions.
</p></blockquote>
<p>Whatever. So I let NetDraw loose on the data, and here&#8217;s what it did.</p>
<p><a href="http://www.flickr.com/photos/porternovelli/3277224960/sizes/l" title="Calculated Party Affinity US Congress by matmorrison, on Flickr"><img src="http://farm4.static.flickr.com/3323/3277224960_67cf19e4b3.jpg" width="500" height="350" alt="Calculated Party Affinity US Congress" /></a></p>
<p>I coloured the nodes red for Republican and blue for Democrats<a href="#note2">[2]</a>, labeled the nodes by party (for the sake of clarity, and for the hard-of-thinking, that&#8217;s &#8220;R&#8221; for Republican and &#8220;D&#8221; for Democrat) then counted all the nodes where label said one thing but colour another. There were six of these nodes; so NetDraw got the answer right 40&frasl;46 of the time (just about 87%.) This is less than the astonishing <a href="http://mediaczar.com/blog/2009/02/can-we-calculate-party-affiliation/">93.75%</a> accuracy we got with the <a href="http://mediaczar.com/blog/2009/02/can-we-calculate-party-affiliation/">Westminster twittering members of parliament</a> in the previous post. Nevertheless I think we can safely say that it&#8217;s not a particularly integrated (or bipartisan) network if we can predict party affiliation with <em>quite </em> such success.</p>
<p>Here&#8217;s exactly the same map with the errant sheep re-labeled with their proper names so it&#8217;ll be easier to refer to them (if it helps, you can click on the image to view or download a larger version.)</p>
<p><a href="http://www.flickr.com/photos/porternovelli/3276404881/sizes/l" title="congress guesswork incorrect labels by matmorrison, on Flickr"><img src="http://farm4.static.flickr.com/3316/3276404881_7d5494a096.jpg" width="500" height="349" alt="congress guesswork incorrect labels" /></a></p>
<p>You&#8217;ll see, I hope, that NetDraw has made a pretty good fist of the job. Where it <em>has</em> gone wrong on the whole is where the data clearly suggests something else. So Rep. Jared Polis for instance follows (and is followed by) no Democrat peers. Rep. Nancy Pelosi (D) and Sen. Richard Durbin (D) follow each other, but since Pelosi is followed by several Republicans and none of her other Democrat peers you can see why the algorithm has made the incorrect guess that the two of them are Republicans. Long-serving member <a href="http://en.wikipedia.org/wiki/Neil_Abercrombie">Neil Abercrombie</a>, as discussed in <a href="http://mediaczar.com/blog/2009/02/republicans-outperform-democrats-on-twitter/">a previous post on US Congress Twitter folk</a>, forms a bit of a bridge between the two parties, so despite his membership of the Congressional Progressive Caucus and liberal voting record, from the Twitter network point of view, his affiliation is somewhat ambiguous.</p>
<p>Sen. McCain follows none of his peers, and appears to inherit his incorrect attribution from Sen. Susan Collins. For the life of me, I can&#8217;t work out what makes it think that <a href="http://en.wikipedia.org/wiki/Susan_Collins">Sen. Susan Collins</a> is a Democrat. She really isn&#8217;t, you know.</p>
<p><em><a name="note1">Note 1:</a> <a href="http://analytictech.com/Netdraw/netdraw.htm">NetDraw</a> is a free program written by <a href="http://www.steveborgatti.com/">Steve Borgatti</a> from the University of Kentucky. If you&#8217;re interested in playing around with this stuff, you&#8217;ll need to get yourself a copy.</em></p>
<p><em><a name="note2">Note 2:</a> Actually, that&#8217;s not true. Despite a friend sharing the simple mnemonic that &#8220;&#8216;Republicans&#8217; and &#8216;red&#8217; begin with the same letter,&#8221; I just can&#8217;t get it out of my English head that the Republicans should be blue and the Democrats red. As a result I waste precious minutes re-colouring these maps in Illustrator. It is worth pointing out that I also have problems with &#8220;left&#8221; and &#8220;right&#8221; on occasion &#8212; preferring instead the binary opposition &#8220;left&#8221; and &#8220;No! no! The </em>other<em> left, for God&#8217;s sake!&#8221;</em></p>
]]></content:encoded>
			<wfw:commentRss>http://mediaczar.com/blog/2009/02/can-we-calculate-party-affiliation-the-us-congress-edition/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Can we calculate party affiliation? (The Westminster edition)</title>
		<link>http://mediaczar.com/blog/2009/02/can-we-calculate-party-affiliation/</link>
		<comments>http://mediaczar.com/blog/2009/02/can-we-calculate-party-affiliation/#comments</comments>
		<pubDate>Fri, 13 Feb 2009 13:47:28 +0000</pubDate>
		<dc:creator>Mat Morrison</dc:creator>
				<category><![CDATA[measurement]]></category>
		<category><![CDATA[networks]]></category>
		<category><![CDATA[research]]></category>
		<category><![CDATA[twitter]]></category>
		<category><![CDATA[Andy Reed]]></category>
		<category><![CDATA[andyreedmp]]></category>
		<category><![CDATA[celia barlow]]></category>
		<category><![CDATA[celiabarlowmp]]></category>
		<category><![CDATA[david kidney]]></category>
		<category><![CDATA[David Lammy]]></category>
		<category><![CDATA[davidkidney]]></category>
		<category><![CDATA[davidlammymp]]></category>
		<category><![CDATA[derek wyatt]]></category>
		<category><![CDATA[edvaizey]]></category>
		<category><![CDATA[edward vaizey]]></category>
		<category><![CDATA[Grant Shapps]]></category>
		<category><![CDATA[grantshapps]]></category>
		<category><![CDATA[james plaskitt]]></category>
		<category><![CDATA[jamesplaskitt]]></category>
		<category><![CDATA[jgoldsworthy]]></category>
		<category><![CDATA[Jo Swinson]]></category>
		<category><![CDATA[joswinson]]></category>
		<category><![CDATA[julia goldsworthy]]></category>
		<category><![CDATA[Kerry McCarthy]]></category>
		<category><![CDATA[kerrymp]]></category>
		<category><![CDATA[lfeatherstone]]></category>
		<category><![CDATA[Lynne Featherstone]]></category>
		<category><![CDATA[mapping]]></category>
		<category><![CDATA[members of parliament]]></category>
		<category><![CDATA[michael fabricant]]></category>
		<category><![CDATA[mike_fabricant]]></category>
		<category><![CDATA[mp]]></category>
		<category><![CDATA[network analysis]]></category>
		<category><![CDATA[sadiq khan]]></category>
		<category><![CDATA[sadiqkhanmp]]></category>
		<category><![CDATA[Tom Harris]]></category>
		<category><![CDATA[Tom Watson]]></category>
		<category><![CDATA[tomharris1964]]></category>
		<category><![CDATA[tom_watson]]></category>
		<category><![CDATA[visualization]]></category>
		<category><![CDATA[westminster]]></category>
		<category><![CDATA[wyattd]]></category>

		<guid isPermaLink="false">http://mediaczar.com/blog/?p=809</guid>
		<description><![CDATA[This is a follow-up post to Why doesn’t the Tory MP have Twitter friends? &#8212; a report on some early research into the interrelationships between the few Westminster MPs who are on Twitter.
According to Tweetminster, the number of UK MPs on Twitter has doubled since this time last month. Where there were eight Twittering MPs, [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: left; margin-right: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fmediaczar.com%2Fblog%2F2009%2F02%2Fcan-we-calculate-party-affiliation%2F"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fmediaczar.com%2Fblog%2F2009%2F02%2Fcan-we-calculate-party-affiliation%2F" height="61" width="51" /></a></div><p><em>This is a follow-up post to <a href="http://mediaczar.com/blog/2009/01/why-doesnt-the-tory-mp-have-twitter-friends/">Why doesn’t the Tory MP have Twitter friends?</a> &#8212; a report on some early research into the interrelationships between the few Westminster MPs who are on Twitter.</em></p>
<p>According to <a href="http://tweetminster.co.uk">Tweetminster</a>, the number of UK MPs on Twitter has doubled since this time last month. Where there were eight Twittering MPs, there are now sixteen. Here&#8217;s the map that shows who follows whom (the labels may be too small to read &#8212; if you want to see a larger image, click on the map.</p>
<p><a href="http://www.flickr.com/photos/porternovelli/3276499568/sizes/l/" title="Actual factions among Westminster MPs on Twitter by matmorrison, on Flickr"><img src="http://farm4.static.flickr.com/3475/3276499568_5f2b99d965.jpg" width="500" height="464" alt="Actual factions among Westminster MPs on Twitter" /></a></p>
<p>I&#8217;ve coloured each node to show party affiliation; for those of you who are unfamiliar with British politics, Labour (our left-of-centre party) shows up in red, Conservatives (our right-of-centre party) in blue, and Liberal Democrats (what it says on the tin)  in yellow.</p>
<p>The size of each node represents the individual&#8217;s &#8220;betweenness centrality&#8221; &#8212; a network analysis term that helps us place a value on individuals within a network. To give you a sense of what it means, the higher the betweenness centrality of an individual, the greater the impact when you take them out of the network. For those of you who work in large companies, it may be worth noting that senior management&#8217;s personal assistants generally have very high betweenness &#8212; something that is mostly remarked upon when they go on holiday (<em>simultaneous translation: &#8220;take a vacation&#8221;</em>.)</p>
<p>So far so good. By now, regular readers will probably be kissing their teeth and thinking &#8220;so what?&#8221; I&#8217;ve done <a href="http://mediaczar.com/blog/category/twitter/">a lot of these Twitter maps</a> in the past and the novelty must be wearing off on you by now.</p>
<p>So here&#8217;s the thing. There are a few network analysis techniques that let one identify cliques and factions. What we&#8217;ve got here is a small set where we already <em>know</em> what people&#8217;s affiliations should be. How interesting, I thought, would it be to see how well the calculated result fits the real world data? Here&#8217;s what I found:<br />
<span id="more-809"></span><br />
<a href="http://www.flickr.com/photos/porternovelli/3276499624/sizes/l/" title="Calculated factions among Westminster MPs on Twitter by matmorrison, on Flickr"><img src="http://farm4.static.flickr.com/3301/3276499624_4ff0d8d1c4.jpg" width="500" height="462" alt="Calculated factions among Westminster MPs on Twitter" /></a></p>
<p>To my fairly untrained eye, the fit seems remarkably good. Only one MP out of sixteen is incorrectly placed (Derek Wyatt should be tightly affiliated to the &#8220;Labour&#8221; faction, but shows up affiliated to the &#8220;Lib Dem&#8221; faction.) That&#8217;s 1&frasl;16 wrong (or 93.75% accurate.)</p>
<h4>The usual caveats</h4>
<p>The method I used relies on you telling it how many factions there are. I knew there were three, so that&#8217;s what I told it. But what if I&#8217;d said two, or four? The software I&#8217;m using tells you how well-fitted the result is to the data, so there&#8217;s some feedback at least. But I&#8217;m going to need to play with this a lot more before I&#8217;m fully confident about putting it in front of a client.</p>
<p>I&#8217;m not a statistician, but even I know there&#8217;s something fishy about my claims to 93.75% accuracy.</p>
]]></content:encoded>
			<wfw:commentRss>http://mediaczar.com/blog/2009/02/can-we-calculate-party-affiliation/feed/</wfw:commentRss>
		<slash:comments>10</slash:comments>
		</item>
		<item>
		<title>Republicans vs. Democrats: Pareto charts of unduplicated Twitter reach</title>
		<link>http://mediaczar.com/blog/2009/02/republicans-vs-democrats-ii/</link>
		<comments>http://mediaczar.com/blog/2009/02/republicans-vs-democrats-ii/#comments</comments>
		<pubDate>Sun, 08 Feb 2009 22:35:31 +0000</pubDate>
		<dc:creator>Mat Morrison</dc:creator>
				<category><![CDATA[twitter]]></category>
		<category><![CDATA[congress]]></category>

		<guid isPermaLink="false">http://mediaczar.com/blog/?p=787</guid>
		<description><![CDATA[A couple of days ago I did a little more analysis on Republican and Democratic Congresspeople on Twitter. Towards the end of the post, I realized that the unduplicated reach pareto chart that I&#8217;d built would only make sense if the US were a one-party state (or to be fair, if both parties had a [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: left; margin-right: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fmediaczar.com%2Fblog%2F2009%2F02%2Frepublicans-vs-democrats-ii%2F"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fmediaczar.com%2Fblog%2F2009%2F02%2Frepublicans-vs-democrats-ii%2F" height="61" width="51" /></a></div><p>A couple of days ago I did a little more analysis on <a href="http://mediaczar.com/blog/2009/02/republicans-outperform-democrats-on-twitter/">Republican and Democratic Congresspeople on Twitter</a>. <a href="http://mediaczar.com/blog/wp-content/uploads/2009/02/us-twitter-pareto.png"><img src="http://mediaczar.com/blog/wp-content/uploads/2009/02/us-twitter-pareto-150x150.png" alt="Pareto chart showing unduplicated reach for US congress" title="Pareto chart showing unduplicated reach for US congress" width="150" height="150" class="alignright size-thumbnail wp-image-779" /></a>Towards the end of the post, I realized that the unduplicated reach pareto chart that I&#8217;d built would only make sense if the US were a one-party state (or to be fair, if both parties had a single issue that they were united in wanting to promote.)</p>
<p>So &#8212; wanting to make this a little more representative &#8212;  I went back and produced two charts; one showing Republican unduplicated reach (which follows a typical 80:20 distribution)&#8230;</p>
<p><a href="http://mediaczar.com/blog/wp-content/uploads/2009/02/rep-pareto.png"><img src="http://mediaczar.com/blog/wp-content/uploads/2009/02/rep-pareto.png" alt="Pareto chart showing unduplicated reach of Republican Twitterers in the US Congress" title="Pareto chart showing unduplicated reach of Republican Twitterers in the US Congress" width="500" class="aligncenter size-full wp-image-792" /></a><br />
<span id="more-787"></span><br />
And one showing the Democratic unduplicated reach (which clearly doesn&#8217;t)&#8230;</p>
<p><a href="http://mediaczar.com/blog/wp-content/uploads/2009/02/dem-pareto.png"><img src="http://mediaczar.com/blog/wp-content/uploads/2009/02/dem-pareto.png" alt="Pareto chart showing unduplicated reach of Democratic Twitterers in the US Congress" title="Pareto chart showing unduplicated reach of Democratic Twitterers in the US Congress" width="500" class="aligncenter size-full wp-image-785" /></a></p>
<p>What the charts hint at but don&#8217;t show is that &#8212; while around 61% of &#8220;Republican&#8221; Twitter followers follow more than one Republican congressperson, this is  true of only 29% of &#8220;Democratic&#8221; Twitter followers; that is &#8212; there&#8217;s much more duplication among the right than there is among the left.</p>
<p>I thought I might be able to draw some conclusions from the difference between the two charts, or at least suggest some theories to explain the differences. But I think my thoughts were best put by one of my correspondents, <a href="http://www.kathryncorrick.co.uk/">Kathryn Corrick</a> who simply pointed out that the &#8220;left can never agree&#8221; (she also suggests that the political right in the UK blogosphere is more &#8220;clubbable&#8221; and co-operative than the political left &#8211; which rather fits the US maps I&#8217;ve been able either to draw or to get my hands on.) </p>
]]></content:encoded>
			<wfw:commentRss>http://mediaczar.com/blog/2009/02/republicans-vs-democrats-ii/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Republicans still outperforming Democrats on TweetCongress</title>
		<link>http://mediaczar.com/blog/2009/02/republicans-outperform-democrats-on-twitter/</link>
		<comments>http://mediaczar.com/blog/2009/02/republicans-outperform-democrats-on-twitter/#comments</comments>
		<pubDate>Wed, 04 Feb 2009 16:21:51 +0000</pubDate>
		<dc:creator>Mat Morrison</dc:creator>
				<category><![CDATA[networks]]></category>
		<category><![CDATA[twitter]]></category>
		<category><![CDATA[congress]]></category>
		<category><![CDATA[mapping]]></category>
		<category><![CDATA[network analysis]]></category>
		<category><![CDATA[visualization]]></category>

		<guid isPermaLink="false">http://mediaczar.com/blog/?p=775</guid>
		<description><![CDATA[Three weeks ago (and at the prompting of my colleague Eddie Garrett who heads up Porter Novelli DC&#8217;s digital team) I mapped out the interconnections between US Congress Tweeters. We&#8217;d been working on a Twitter crawler and it seemed like a good opportunity to test things out on a new data set.
This is a follow-up [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: left; margin-right: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fmediaczar.com%2Fblog%2F2009%2F02%2Frepublicans-outperform-democrats-on-twitter%2F"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fmediaczar.com%2Fblog%2F2009%2F02%2Frepublicans-outperform-democrats-on-twitter%2F" height="61" width="51" /></a></div><p>Three weeks ago (and at the prompting of my colleague <a href="http://twitter.com/redclayhound">Eddie Garrett</a> who heads up Porter Novelli DC&#8217;s digital team) I <a href="http://mediaczar.com/blog/2009/01/network-map-of-us-congress-twitterers/">mapped out the interconnections between US Congress Tweeters</a>. We&#8217;d been working on a Twitter crawler and it seemed like a good opportunity to test things out on a new data set.</p>
<p>This is a follow-up post. Once again it was prompted by a third party: Christie Findlay at <a href="http://politicsmagazine.com/">Politics Magazine</a> asked whether it would be OK to print a copy of one of the maps in their March edition. I&#8217;ve heard that three weeks are a long time in politics, so I thought I&#8217;d better run the crawl again <em>just in case</em>. Also I&#8217;ve got a new crawler that uses the proper <a href="http://apiwiki.twitter.com/REST+API+Documentation">Twitter API</a> (I can see some of your eyes glazing over you know. Just skip ahead when that happens.) I&#8217;d <a href="http://mediaczar.com/blog/2009/01/porter-novelli-twitter-folk-80-20/">tried it out</a> on the Porter Novelli data set, but welcomed a chance to try it on something more meaty.</p>
<p>So yesterday morning before work I ran the crawl. I use the excellent <a href="http://tweetcongress.org">Tweet Congress</a> as my source of information about which congress people are on Twitter.<br />
<span id="more-775"></span><br />
Here&#8217;s the data I get from them republished as a Google Spreadsheet.</p>
<p><iframe width='500' height='300' frameborder='0' src='http://spreadsheets.google.com/pub?key=p4QDp5UmTKxS3wTAR-zSaFA&#038;output=html&#038;gid=0&#038;single=true&#038;range=A1:D68'></iframe></p>
<p>I use Twitter itself as the source of information about who follows whom and who &#8212; in turn &#8212; is followed by whom. Then my computer does a little analysis on the data I get so that I can put it through our mapping software. Here&#8217;s the map. It&#8217;s a little too small to see, but if you click on it, it&#8217;ll take you to a <em>huge</em> version on Flickr.</p>
<p><a href="http://www.flickr.com/photos/porternovelli/3250409169/sizes/o/" title="US Congress Twitterers by Indegree 3 Feb 09 by matmorrison, on Flickr"><img src="http://farm4.static.flickr.com/3361/3250409169_a916109672.jpg" width="500" height="430" alt="US Congress Twitterers by Indegree 3 Feb 09" /></a></p>
<p>It&#8217;s probably worth going through some of the features again briefly, just in case you haven&#8217;t seen my other maps. Actually, I don&#8217;t think I&#8217;ve ever really explained them properly.</p>
<p>The arrows show the direction of the relationship. If Senator Susan Collins follows Senator John McCain then there&#8217;s an arrow that points from Sen. Collins to Sen. McCain. If Representative Jason Chaffetz (Rep. Utah) and Representative Jared Polis (Dem. Colorado)  follow each other then there&#8217;s a double-headed arrow between Reps. Chaffetz and Polis.</p>
<p>If they follow none of their peers and none of their peers follows them, then they&#8217;re relegated to the bench. In this map the bench is the list of unconnected circles in the upper left corner. For those of you who are comparing this to the last map, one of the interesting things to note is that Senator Hillary Clinton has left the ranks of the Democrat congressperson Twitter folk. </p>
<p>I&#8217;m not sure why Rep. Thomas Price (R, GA) and  Rep. Steven Driehaus (D, OH) have Twitter handles listed. They aren&#8217;t on Twitter as far as anyone can tell. Perhaps it&#8217;s a tyop?</p>
<p>So far, so obvious I suspect. And it should be obvious to any US readers, the red circles are the Republicans and the blue circles are the Democrats. This is mildly contrary and confusing to my UK audience who are used to the left-of-centre Democrat analogue Labour Party being red and the right of centre Republican analogue Tory party being blue. It&#8217;s like going to France and forgetting that the tap with C on it gives you <em>hot</em> water. For my US readers, you probably know a tap better as a faucet. It&#8217;s rather exhausting having to do all this translation.</p>
<p>You&#8217;ll notice that the circles are different sizes. If you&#8217;re very observant, you&#8217;ll probably notice that the large circles have more arrows pointing at them, and that the smallest circles have <em>no</em> arrows pointing at them. For this map I&#8217;ve sized the circles by a factor called &#8220;indegree&#8221; which pretty much translates as &#8220;the number of arrows pointing at you.&#8221; Indegree is  used in network analysis as an indicator of popularity or authority (depending on the circumstances.)</p>
<h4>First caveat</h4>
<p>This can be pretty spurious when you&#8217;re looking at what is (after all) only <em>one</em> communication channel for this network. We have to imagine that congresswomen and men (although it&#8217;s mostly men at present) will have opportunities tp communicate face-to-face, on the telephone, and via email, memos and mail and the like. So I have no idea how well this fits the reality. I&#8217;d have thought (for example) that Sen. McCain was pretty authoritative &#8212; but the Twitter map shows him to be peripheral.</p>
<p>This leads us to our second caveat.</p>
<h4>Second caveat</h4>
<p>For most of this group, the primary audience is not their peers. It is (rather obviously) their voting public. As we&#8217;ll come to see some of them have established some pretty hefty audiences (although none of them is really in celebrity territory yet.) So that probably makes this network even <em>less</em> representative.</p>
<p>Still it&#8217;s an interesting exercise. Let&#8217;s carry on.</p>
<p><a href="http://flickr.com/photos/porternovelli/3251236220/sizes/o/" title="US Congress Twitterers by Betweenness 3 Feb 09 by matmorrison, on Flickr"><img src="http://farm4.static.flickr.com/3020/3251236220_a1601ea1db.jpg" width="500" height="443" alt="US Congress Twitterers by Betweenness 3 Feb 09" /></a></p>
<p>I don&#8217;t want to labour this too much. Those of you who regularly read this blog must be getting sick of hearing about betweenness centrality. For those of you who don&#8217;t know, then here&#8217;s a brief explanation: betweenness is a measure of how important the person is in terms of flow of information. It&#8217;s calculated by working out who is on most shortest paths through the network.</p>
<p>If Twitter were the only way that information flowed in Congress, then Representative John Culberson is (still)  the single most important person there. Taking him out of the loop would mean that the flow of information would slow down considerably, and that some people would never get the message at all.</p>
<p>Take both him and Representative Neil Abercrombie out of the picture and it&#8217;s like something out of West Side Story.</p>
<p><a href="http://www.flickr.com/photos/porternovelli/3251236804/sizes/o" title="US Congress Twitterers (with two key members removed) 3 Feb 09 by matmorrison, on Flickr"><img src="http://farm4.static.flickr.com/3042/3251236804_8790188823.jpg" width="500" height="443" alt="US Congress Twitterers (with two key members removed) 3 Feb 09" /></a></p>
<h3>Sharks and Jets</h3>
<p>Because every social media expert other than me has written a term paper on &#8220;How Obama used social media to win the election&#8221;, and because the general social media buzz tends to be slightly Democrat biased we&#8217;ve become accustomed to the idea that the Democrats do social media well. That may be true. But all I can say is &#8212; looking at this map &#8212; you&#8217;d never know. Republicans outnumber them 1.6 to 1 on Twitter (once we discount the spectral accounts of Driehaus and Price.)</p>
<p>I&#8217;d like to say that the Democrats had made up some ground recently, but they just haven&#8217;t. A little more analysis shows the growth in Twitter adoption by two parties. Democrats made a good start, but the Republicans really grew their numbers during June and July last year.</p>
<p><a href="http://mediaczar.com/blog/wp-content/uploads/2009/02/growthrvsd.png"><img src="http://mediaczar.com/blog/wp-content/uploads/2009/02/growthrvsd.png" alt="Growth in Twitter adoption among Republicans and Democrats" title="Growth in Twitter adoption among Republicans and Democrats" width="500" height="400" class="aligncenter size-full wp-image-782" /></a></p>
<p><ins datetime="2009-02-05T13:14:37+00:00"><strong>update:</strong> I&#8217;ve fixed yesterday&#8217;s chart. I got the whole Red vs Blue thing the wrong way around again. Thx to EG for setting me straight.</ins></p>
<p>However the new Democrat joiners aren&#8217;t really linking to each other. Why is this? If you&#8217;re hoping to attract followers, one good strategy is to make the most of your network: their followers are <em>more</em> likely to be interested in what you have to say. More interested, that is than an equivalent control group chosen at random from the Twitterverse. So <em>not</em> joining networks just makes things harder.</p>
<p>Partly of course, we can attribute it to naivety. Most new people coming into blogging, Twitter and the rest see creating content as the hard work. Most people who&#8217;ve been doing this for a while know that it&#8217;s the promotion and moderation that sucks up your time.</p>
<p>Perhaps we might also attribute this to the point raised in Caveat 2 above &#8212; that they&#8217;re mostly interested in their electoral constituencies. However, if this were the case then I think we&#8217;d expect to find factions (or &#8220;cliques&#8221; or &#8220;clusters&#8221;) between congresspeople of the same colour who represent the same state. Nothing obvious there, though (and the data set&#8217;s quite small). So let&#8217;s stick with the &#8220;Twitter newbies&#8221; theory for the moment.</p>
<h3>How many people follow the Congressmen?</h3>
<p>So far we&#8217;ve only looked at peer relationships: all the charts and calculations we&#8217;ve done so far are based on a much-reduced data set. Experience shows that it&#8217;s very difficult to map &#8220;all followers&#8221; (we end up with something that looks like a hedgehog that&#8217;s had a wild party in a cranberry field.) But we can still <em>count</em> followers. Here&#8217;s a quick list &#8212; you can see that there are one or two superstar congresspeople, but that the rest really aren&#8217;t doing even as well as your average journalist.</p>
<p><iframe width='500' height='300' frameborder='0' src='http://spreadsheets.google.com/pub?key=p4QDp5UmTKxQ5qR-9l6fNSQ&#038;output=html&#038;gid=0&#038;single=true&#038;range=A1:D67'></iframe></p>
<p>OK &#8212; now let&#8217;s assume that these aren&#8217;t unduplicated audiences; that there must be several people interested in US politics who follow more than one, many, or all the names on the list. What is that duplication like?</p>
<p>Luckily I&#8217;ve just written <a href="http://mediaczar.com/blog/2009/01/porter-novelli-twitter-folk-80-20/">a little analysis script</a> to look at exactly this sort of question, so I can tell you with confidence that (while the absolute numbers above might lead you to think that the congress Twitterers reach an audience of around 56K) if they all pull together, they actually reach an unduplicated audience of 24K. Of course, now I look at it, I realize that they are unlikely to pull together in this way, and that I <em>should</em> have separated the lists into Republicans and Democrats before I went at this. This is somewhat frustrating &#8212; but I can cover it in a new post, I suppose.</p>
<p>Here, then, is the somewhat useless Pareto chart showing the unduplicated reach as though the US were a one-party state.</p>
<p><a href="http://mediaczar.com/blog/wp-content/uploads/2009/02/us-twitter-pareto.png"><img src="http://mediaczar.com/blog/wp-content/uploads/2009/02/us-twitter-pareto.png" alt="Pareto chart showing unduplicated reach for US congress" title="Pareto chart showing unduplicated reach for US congress" width="500" height="450" class="aligncenter size-full wp-image-779" /></a></p>
<p>I&#8217;ll come back to this, but if you have questions, you might like to look at this <a href="http://mediaczar.com/blog/2009/02/pareto-novelli-some-qas/">Q&#038;A</a> I wrote over the weekend.</p>
]]></content:encoded>
			<wfw:commentRss>http://mediaczar.com/blog/2009/02/republicans-outperform-democrats-on-twitter/feed/</wfw:commentRss>
		<slash:comments>9</slash:comments>
		</item>
		<item>
		<title>Pareto Novelli &#8212; Some Q&amp;As</title>
		<link>http://mediaczar.com/blog/2009/02/pareto-novelli-some-qas/</link>
		<comments>http://mediaczar.com/blog/2009/02/pareto-novelli-some-qas/#comments</comments>
		<pubDate>Sun, 01 Feb 2009 21:44:40 +0000</pubDate>
		<dc:creator>Mat Morrison</dc:creator>
				<category><![CDATA[porter novelli]]></category>
		<category><![CDATA[twitter]]></category>
		<category><![CDATA[pareto]]></category>

		<guid isPermaLink="false">http://mediaczar.com/blog/?p=703</guid>
		<description><![CDATA[A recent post about some Pareto analysis of the Porter Novelli Twitter sample , &#8220;Porter Novelli Twitter folk &#8211; the 80/20 rule&#8221;, stirred up a little bit of interest on Twitter &#8212; and made me think again about what I&#8217;m doing and why. Partly because those conversations were off-blog (and I&#8217;d like to capture the [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: left; margin-right: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fmediaczar.com%2Fblog%2F2009%2F02%2Fpareto-novelli-some-qas%2F"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fmediaczar.com%2Fblog%2F2009%2F02%2Fpareto-novelli-some-qas%2F" height="61" width="51" /></a></div><p>A recent post about some Pareto analysis of the Porter Novelli Twitter sample , <a href="http://mediaczar.com/blog/2009/01/porter-novelli-twitter-folk-80-20/">&#8220;Porter Novelli Twitter folk &#8211; the 80/20 rule&#8221;</a>, stirred up a little bit of interest on Twitter &#8212; and made me think again about what I&#8217;m doing and why. Partly because those conversations were off-blog (and I&#8217;d like to capture the answers I gave somewhere more permanent) and partly because I&#8217;ve now had time to think of better answers I thought I&#8217;d set them down here.</p>
<p>First, a little background. This Q&#038;A is the sixth post in an impromptu series about the Twitter people where I work (Porter Novelli, the <a href="http://porternovelli.com">international public relations agency</a>.) By now you might think that I&#8217;d be tired of this stuff, but you&#8217;d have another think coming. Here&#8217;s a quick list to bring you up to date.</p>
<ol>
<li><a href="http://mediaczar.com/blog/2009/01/map-of-porter-novelli-people-on-twitter-on-17th-jan-2008/">Map of Porter Novelli people on Twitter on 17th Jan 2008</a></li>
<li><a href="http://mediaczar.com/blog/2009/01/map-of-porter-novelli-people-on-twitter-on-20th-jan-2008/">Map of Porter Novelli people on Twitter on 20th Jan 2008</a></li>
<li><a href="http://mediaczar.com/blog/2009/01/introducing-the-porter-novelli-magic-twitter-friend-maker-beta/">Introducing the Porter Novelli magic Twitter friend maker (beta)</a></li>
<li><a href="http://mediaczar.com/blog/2009/01/porter-novelli-twitter-folk-ranked-by-number-of-followers/">Porter Novelli Twitter folk ranked by number of followers</a>
</li>
<li><a href="http://mediaczar.com/blog/2009/01/porter-novelli-twitter-folk-80-20/">Porter Novelli Twitter folk &#8211; the 80/20 rule</a></li>
</ol>
<p>Looking at this, you might also think I clearly had nothing better to do than analyze Porter Novelli people and their Twittering ways. In fact, as an experimental data set, I couldn&#8217;t really ask for anything much better. It&#8217;s sufficiently large (more than 200 people), international (I&#8217;ve counted more than 10 countries &#8212; and I&#8217;m sure there are more), and I have some real-world access to all of the people in the sample, which means I can compare my findings with some hard data.</p>
<p>That said, the experiment is more about learning about how we can analyze Twitter networks &#8212; about discovering how representative they are as a word-of-mouth (WOM) channel for example, and what they can tell us about other kinds of social network, or about finding new ways to analyze such data sets &#8212; than it is about answering any specific questions. So I&#8217;ve not got any carefully mapped-out research plan. Instead I follow paths that strike me as interesting, or possible, or that are suggested to me by friends and readers.</p>
<h3>Question 1</h3>
<div style="width:425px;text-align:left" id="__ss_976577"><a style="font:14px Helvetica,Arial,Sans-serif;display:block;margin:12px 0 3px 0;text-decoration:underline;" href="http://www.slideshare.net/mediaczar/twitter-conversation-with-lemondrizzle?type=presentation" class="broken_link"  title="Twitter conversation with @lemondrizzle">Twitter conversation with @lemondrizzle</a><object style="margin:0px" width="425" height="355"><param name="movie" value="http://static.slideshare.net/swf/ssplayer2.swf?doc=lemondrizzle-conversation-1233489672094970-3&#038;stripped_title=twitter-conversation-with-lemondrizzle" /><param name="allowFullScreen" value="true"/><param name="allowScriptAccess" value="always"/><embed src="http://static.slideshare.net/swf/ssplayer2.swf?doc=lemondrizzle-conversation-1233489672094970-3&#038;stripped_title=twitter-conversation-with-lemondrizzle" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="355"></embed></object>
<div style="font-size:11px;font-family:tahoma,arial;height:26px;padding-top:2px;">View more <a style="text-decoration:underline;" href="http://www.slideshare.net/">presentations</a> from <a style="text-decoration:underline;" href="http://www.slideshare.net/mediaczar">Mat Morrison</a>. (tags: <a style="text-decoration:underline;" href="http://slideshare.net/tag/analysis">analysis</a> <a style="text-decoration:underline;" href="http://slideshare.net/tag/mediaczar">mediaczar</a>)</div>
</div>
<p><span id="more-703"></span><br />
I&#8217;ve posted this exchange as a slideshow so you can sort of follow along. Twitter updates are rather telegraphic (it&#8217;s the 140 character limit) so that&#8217;s another reason to take the time to answer the question more fully here.</p>
<p>Caroline (<a href="http://twitter.com/lemondrizzle/">@lemondrizzle</a>) is a Porter Novelli colleague based in London. She wanted to know what purpose of de-duping the follower list serves. The first answer is that it gives us a clearer idea of the sample set of Twitterers&#8217; reach. Here&#8217;s what the unduped list looks like:</p>
<p><a href="http://mediaczar.com/blog/wp-content/uploads/2009/01/porter-novelli-twitter-people.png"><img src="http://mediaczar.com/blog/wp-content/uploads/2009/01/porter-novelli-twitter-people.png" alt="Porter Novelli Twitter people ranked by #followers" title="Porter Novelli Twitter people ranked by #followers" width="488" height="332" class="aligncenter size-full wp-image-616" /></a></p>
<p>As I said in <a href="http://mediaczar.com/blog/2009/01/porter-novelli-twitter-folk-ranked-by-number-of-followers/">an earlier post</a>, there must be many, many duplicated connections here.</p>
<p>When I de-duped the network, it looked more like this:</p>
<p><a href="http://mediaczar.com/blog/wp-content/uploads/2009/01/unduplicated-reach1.png"><img src="http://mediaczar.com/blog/wp-content/uploads/2009/01/unduplicated-reach1.png" alt="Pareto chart showing unduplicated reach among Porter Novelli Twitter Users" title="Pareto chart showing unduplicated reach among Porter Novelli Twitter Users" width="500" height="354" class="aligncenter size-full wp-image-690" /></a></p>
<p>If we count people more than once it gives us an inaccurate picture of the potential impact of the network. Looking at the Alice/Bob/Carol example (I changed &#8220;Adam&#8221; to &#8220;Alice&#8221; after Caroline accused me of sexism) you&#8217;ll see there are two ways to count the followers.</p>
<table>
<tr>
<td><strong>bob</strong></td>
<td><strong>alice</strong></td>
<td><strong>carol</strong></td>
</tr>
<tr>
<td>alice<br />carol<br />edward<br />william<br />xerxes<br />yasmine<br />zeus</td>
<td>bob<br /><strike>carol</strike><br />dave<br /><strike>xerxes</strike><br /><strike>yasmine</strike><br /><strike>zeus</strike></td>
<td><strike>alice</strike><br /><strike>bob</strike><br />frank<br /><strike>william</strike><br /><strike>xerxes</strike></td>
</tr>
</table>
<p>If we include the duplicate entries, then the three have an apparent combined reach of 18 people. Once we de-dupe, they have a combined reach of 10.  The same goes for the Porter Novelli Twitter network (only more so): once we dedupe that, it decreases from an apparent combined reach of around 21K users to a combined reach of around 6K people.</p>
<p>So were we able to mobilize everyone on our Twitter list in another campaign like Porter Novelli CMO Marian Salzman&#8217;s <a href="http://hashtags.org/search?query=pnobama">#pnobama</a> exercise from last month, we&#8217;d reach around 6K people.</p>
<p>Of course, it&#8217;ll be impossible to mobilize <em>everyone</em>. By ordering the sample from most to fewest followers and de-duping, we could now identify our key targets for the activity and instead focus our attention on them. </p>
<h4>Costs and Benefits</h4>
<p>Furthermore we have a good understanding of the costs and benefits of this strategy. </p>
<p>Assume that we are dealing with a new Twitter network and <em>not</em> our own internal network. Assume further there is a &#8220;unit cost&#8221; of engaging with each new Twitter person in the network of one <em>whojamaflip</em> (where a <em>whojamaflip</em> might be a financial cost, or the amount of time and effort it takes to read their blog and previous tweets, work out who you know in common, prepare and deliver your pitch etc.) </p>
<p>Knowing that the top 20% or so of the group (at a cost of 40 whojamaflips) will reach 80% or so of all the followers &#8212; around 4.8K opportunities-to-see (OTS) at a cost-per-thousand (CPM) of 8&#8531; whojamaflips &#8212; tells us a lot about how and where we should focus our efforts, and about the incremental benefit of reaching the remainder of the group &#8212; 1.2K OTS at a whopping and unjustifiable CPM of 133&#8531; whojamaflips. </p>
<h4>Do we really reach 4.8K people?</h4>
<p>Of course not. For one thing, not every follower is &#8220;real&#8221;, and for another we can&#8217;t guarantee a view. One of <em>my</em> followers, for example, is Barack Obama. If I thought he were hanging on my every tweet, I&#8217;d be too scared to commit words to screen.</p>
<p>Instead we should see followers as opportunities-to-see (or &#8220;OTS&#8221;) rather than views or eyeballs.</p>
<h4>Are numbers all that matter?</h4>
<p>No, of course not. I tend to work on a simple model of &#8220;influence&#8221; that looks at 3 factors:</p>
<ol>
<li>Popularity/Reach &#8212; the number of people whom an individual influencer can affect &#8211; generally a &#8217;size of audience&#8217; figure</li>
<li>Authority &#8212; &#8216;people who talk about subject x are most likely to mention these influencers&#8217;</li>
<li>Proximity &#8212; probability of being present at or involved or consulted in a given decision.</li>
</ol>
<p>But even these won&#8217;t give you the whole picture, there are lots of other factors (things like &#8220;trust&#8221;, say) which don&#8217;t completely fit into this model. And &#8212; as people like <a href="http://toria.com/cv_site/about.html">Victoria</a> have pointed out on previous posts &#8212; I make no judgment on things like how compelling a message or product we&#8217;re selling.</p>
<p>I think it&#8217;s a matter of perspective. Just like there are things that make an effective virus (things like a short incubation period, long infectious period and resistance to treatment, say) there must be things that make an effective message. Lots of people I know are (or have been) working on this. Indeed, I maintain a keen interest myself &#8212; I just don&#8217;t have much to share.</p>
<h4>What do you look at, then?</h4>
<p>Instead, I find myself looking at the channels, the vectors of transmission. To continue the virus metaphor; however virulent a virus may be, it makes little difference if (like <a href="http://en.wikipedia.org/wiki/Smallpox#Post-eradication">smallpox</a>) it is stuck in a few laboratory freezers and can&#8217;t get out.</p>
<h4>What&#8217;s that about clumpiness and bridge nodes, then?</h4>
<p>When we map networks of people we generally find that they are &#8220;clumpy&#8221; &#8212; what I mean is we can see many tightly knit groups of people who are linked to the other clusters by a few thin threads. </p>
<p>Take your own Facebook friends as an example. There&#8217;s a strong probability (depending on factors like your age group) that there will be around three or four clusters; most probably colleagues at your present job, university friends, and old school friends. Let&#8217;s say you share some news with these close-knit groups via Facebook. What incentive do they have to share the information on? There&#8217;s a very strong likelihood that they share many of the same friends and colleagues with you &#8212; the natural assumption must be that <em>everyone who matters is already aware of the news you&#8217;ve shared</em>. The last thing they want to do is tell everyone what you&#8217;ve <em>already</em> told them. Where&#8217;s the network utility in that?</p>
<p>What this means is that news is actually most likely to be repeated by people on the very edge of that group; people who are (from their position) less likely to think of that group as the only audience. These are the people who share stories; what &#8212; along with lots of other people in this area &#8212; we&#8217;re currently calling &#8220;bridge nodes.&#8221; Caroline doesn&#8217;t like me calling them bridge nodes, though, so we may need to address this. Otherwise she&#8217;ll steal my pencils.</p>
<h3>Question 2</h3>
<p><a href="http://twitter.com/amber1230/status/1163930996"><img src="http://mediaczar.com/blog/wp-content/uploads/2009/01/amber1230.jpg" alt=" What if alice is the most influential with a higher growth potential? ie her followers have a high # of followers" title=" What if alice is the most influential with a higher growth potential? ie her followers have a high # of followers" width="500" height="267" class="aligncenter size-full wp-image-704" /></a></p>
<p><a href="http://twitter.com/amber1230">@Amber1230</a> asks a different kind of question, and one that&#8217;s a bit harder to answer.</p>
<p>As it happens, second-degree networks can (in fact) be factored into our calculations. I&#8217;ve done quite a lot of stuff with second-degree networks in the past. Take a look, for example, at this map I made of my Facebook connections and <em>their</em> connections back in 2007.</p>
<p><a href="http://www.flickr.com/photos/porternovelli/3113048657/" title="Map of my Facebook &amp;quot;universe&amp;quot; by matmorrison, on Flickr"><img src="http://farm4.static.flickr.com/3151/3113048657_27502f95af.jpg" width="500" height="388" alt="Map of my Facebook 'universe'" /></a></p>
<p>Sure, the maths becomes a little more complicated. If you really want to know, we start talking about things like &#8220;eigenvector centrality&#8221; rather than &#8220;degree centrality&#8221;. Eigenvector centrality says (more or less) &#8220;it&#8217;s not <em>who</em> you know, it&#8217;s who <em>they</em> know.&#8221; A short paper by <a href="http://www.steveborgatti.com/">Steve Borgatti</a> called <a href="http://www.analytictech.com/networks/centaids.htm">Centrality and AIDS </a> has this description of the distinction:</p>
<blockquote><p>In the context of HIV transmission, we can see that all else being equal, a person A with one sex partner has a better chance of escaping the infection than a person B with many. But if that one sex partner that A has is having sex with most of the network, A&#8217;s chances of getting infected are nearly as good as if she were having sex with all those others herself.</p></blockquote>
<p>So far, I think this is <em>exactly</em> what <a href="http://twitter.com/amber1230">@Amber1230</a> is saying. The problem is one of tense; I&#8217;m trying to predict stuff, not describe what has already happened. We know that everyone in someone&#8217;s first-degree network will have an opportunity to see the initial message. But the only ways that someone in the second-degree network might see it is if they are <em>already</em> in the first-degree network, or if one of the first-degree people <em>retweets</em> it (or does something off-network.) The retweets issue is actually rather difficult to solve (although some headway is being made by people like <a href="http://danzarrella.com/">Dan Zarella</a> and <a href="http://yoast.com">Joost de Valk</a> among others.)</p>
<p>So for the moment, I&#8217;m trying to work with what I know &#8212; or what I <em>can</em> know.</p>
]]></content:encoded>
			<wfw:commentRss>http://mediaczar.com/blog/2009/02/pareto-novelli-some-qas/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
		<item>
		<title>Porter Novelli Twitter folk &#8211; the 80/20 rule</title>
		<link>http://mediaczar.com/blog/2009/01/porter-novelli-twitter-folk-80-20/</link>
		<comments>http://mediaczar.com/blog/2009/01/porter-novelli-twitter-folk-80-20/#comments</comments>
		<pubDate>Thu, 29 Jan 2009 21:25:23 +0000</pubDate>
		<dc:creator>Mat Morrison</dc:creator>
				<category><![CDATA[porter novelli]]></category>
		<category><![CDATA[twitter]]></category>

		<guid isPermaLink="false">http://mediaczar.com/blog/?p=678</guid>
		<description><![CDATA[Last weekend I posted a chart of Porter Novelli Twitter folk and their followers. If you read it, you&#8217;ll recall that I was dissatisfied by what it implied about the collective reach of Porter Novelli twitterers.
Well, thanks to a long-ish train journey to Bolton and back, I was able to fudge a little perl script [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: left; margin-right: 10px;"><a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fmediaczar.com%2Fblog%2F2009%2F01%2Fporter-novelli-twitter-folk-80-20%2F"><img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fmediaczar.com%2Fblog%2F2009%2F01%2Fporter-novelli-twitter-folk-80-20%2F" height="61" width="51" /></a></div><p>Last weekend I posted <a href="http://mediaczar.com/blog/2009/01/porter-novelli-twitter-folk-ranked-by-number-of-followers/">a chart of Porter Novelli Twitter folk and their followers</a>. If you read it, you&#8217;ll recall that I was dissatisfied by what it implied about the collective reach of Porter Novelli twitterers.<a href="http://mediaczar.com/blog/wp-content/uploads/2009/01/modified-line.jpg"><img src="http://mediaczar.com/blog/wp-content/uploads/2009/01/modified-line-300x201.jpg" alt="The pareto chart should look more like this" title="The pareto chart should look more like this" width="300" height="201" class="alignright size-medium wp-image-617" /></a><br />
Well, thanks to a long-ish train journey to Bolton and back, I was able to fudge a little perl script together to look through the data to find and remove everything other than the <em>first instance</em> of a follower. Let&#8217;s make that a little clearer. Let&#8217;s say that we&#8217;re looking at three Twitter people, Alice, Bob, and Carol. The first thing to do is to see who follows them: </p>
<table>
<tr>
<td><strong>alice</strong></td>
<td><strong>bob</strong></td>
<td><strong>carol</strong></td>
</tr>
<tr>
<td>bob<br />carol<br />dave<br />xerxes<br />yasmine<br />zeus</td>
<td>alice<br />carol<br />edward<br />william<br />xerxes<br />yasmine<br />zeus</td>
<td>alice<br />bob<br />frank<br />william<br />xerxes</td>
</tr>
</table>
<p>Now we need to rank them in order of &#8220;who has the most followers&#8221; (also known as &#8220;popularity&#8221; as it happens). Here I&#8217;ve done that from left to right. Bob has the most followers and Carol the fewest. </p>
<table>
<tr>
<td><strong>bob</strong></td>
<td><strong>alice</strong></td>
<td><strong>carol</strong></td>
</tr>
<tr>
<td>alice<br />carol<br />edward<br />william<br />xerxes<br />yasmine<br />zeus</td>
<td>bob<br />carol<br />dave<br />xerxes<br />yasmine<br />zeus</td>
<td>alice<br />bob<br />frank<br />william<br />xerxes</td>
</tr>
</table>
<p>And finally we go through from left to right removing all followers who have already shown up on someone else&#8217;s list. </p>
<table>
<tr>
<td><strong>bob</strong></td>
<td><strong>alice</strong></td>
<td><strong>carol</strong></td>
</tr>
<tr>
<td>alice<br />carol<br />edward<br />william<br />xerxes<br />yasmine<br />zeus</td>
<td>bob<br /><strike>carol</strike><br />dave<br /><strike>xerxes</strike><br /><strike>yasmine</strike><br /><strike>zeus</strike></td>
<td><strike>alice</strike><br /><strike>bob</strike><br />frank<br /><strike>william</strike><br /><strike>xerxes</strike></td>
</tr>
</table>
<p>Bob, being at the top of the list gets to keep all his followers which may seem unfair. But it&#8217;s <em>not</em> unfair if the question we&#8217;re trying to answer is &#8220;how do I reach as many people as possible by speaking to as few people as possible?&#8221; That is, I&#8217;m looking for <em>reach</em> (marketing people often express themselves in terms of &#8220;reach&#8221; &#8212; or the number of people who are exposed to a message &#8212; and &#8220;frequency&#8221; &#8212; or the number of times the average person is exposed to that message.)</p>
<p>Looking at the example above, we can see that Alice really delivers an incremental benefit of two new people, and Carol only reaches <em>one</em> new person. That gives us a much better idea of how valuable the most popular person (Bob) really is.</p>
<h3>Applying this to the Porter Novelli data set</h3>
<p>Clearly it would be <em>extraordinarily</em> boring to perform the process described above for the 205 people in the Porter Novelli <a href="http://spreadsheets.google.com/ccc?key=p4QDp5UmTKxS65ROEO5ykZQ&#038;hl=en">data set</a> that I want to analyse. But the analysis script that I wrote (with plenty of help from the <a href="http://www.perlmonks.org/">perl monks</a>) goes through exactly these steps. It&#8217;s a pretty straightforward job, ranking and deduping. Here&#8217;s what we get. </p>
<p><a href="http://mediaczar.com/blog/wp-content/uploads/2009/01/unduplicated-reach1.png"><img src="http://mediaczar.com/blog/wp-content/uploads/2009/01/unduplicated-reach1.png" alt="Pareto chart showing unduplicated reach among Porter Novelli Twitter Users" title="Pareto chart showing unduplicated reach among Porter Novelli Twitter Users" width="500" height="354" class="aligncenter size-full wp-image-690" /></a></p>
<p>This makes much more sense than <a href="http://mediaczar.com/blog/2009/01/porter-novelli-twitter-folk-ranked-by-number-of-followers/">the last run</a>. According to the <a href="http://en.wikipedia.org/wiki/Pareto_principle"></a>Pareto principle</a>, roughly 80% of the effects should come from 20% of the causes. Here we see that 20% of the Porter Novelli Twitter users (marked in black) account for slightly more than 80% of the reach (marked in red.) It&#8217;s pretty much a text-book example. Things are as they should be, I suppose. </p>
<p>More to the point, we can now assign appropriate value to coverage at the head of the graph. This is of great value when thinking about our media planning and engagement </p>
<p>By the way &#8212; if you&#8217;d like a copy of either the Twitter follower API query engine (it&#8217;s a well-behaved command-line thing that was developed by the excellent <a href="http://www.getafreelancer.com/users/949208.html">Joachim Larsen</a>) or the slightly shonky perl script that I wrote on the train, you have only to ask: I&#8217;ll be pleased to share. Send me a tweet at <a href="http://twitter.com/home?status=%40mediaczar+please+send+me+the+twitter+pareto+scripts">@mediaczar</a> and I&#8217;ll send you the scripts.</p>
]]></content:encoded>
			<wfw:commentRss>http://mediaczar.com/blog/2009/01/porter-novelli-twitter-folk-80-20/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
	</channel>
</rss>
