Posts Tagged ‘webosphere’

New Hampshire primaries, Obama and McCain

Tuesday, January 8th, 2008

Now to the New Hampshire primaries. As for Iowa, we offer a brief analysis of each candidate’s voice share in the US political webosphere with respect to their standing in New Hampshire.

We have finetuned our metrics in order to exclude from Hillary Clinton’s voice share that of her husband, former President Bill Clinton, whose voice share remains substantial in the debate. Our previous analysis of the Iowa caucuses showed the voice shares of the Republican candidates pretty much matched the actual results. The discrepancy between the leading voice share (H. Clinton) and the ballot winner (B. Obama) for the Democrats can be explained by the aforementioned methodological glitch.

Now, the online buzz matches the polls. On the liberal side, Obama is way ahead of Hillary Clinton and John Edwards. On the conservative side, the online trends are not as conspicuous as the polls. John McCain is leading by a narrow margin, with Romney and Huckabee closely following.

The political webosphere gives the following orders on each side:

Republicans:

  1. McCain
  2. Romney
  3. Huckabee

Democrats:

  1. Obama
  2. Clinton
  3. Edwards

For the french version of this post, see here.

Iowa Caucus: Huckabee and Clinton dominating the political web

Thursday, January 3rd, 2008

With every poll showing the top-tier candidates neck and neck in Iowa, it is also interesting to glimpse into the web’s crystal ball. Following the predictions of Hitwise, let us consider what the US political web is saying about the Iowa caucus, with one research objective in mind: to analyze, and perhaps confirm –as we did during the last French presidential election- the correlation between each candidate’s “share of voice” on the web, opinion polls, and ballot results.

map pw08 december2007

Prior to unveiling the numbers, let us first go over the methodology. We have measured the number of quotes and mentions of each candidate with respect to mentions of the Iowa Caucus against our dataset of the 2000 main sites and blogs of the US political web. To be fair, those quotes and references are not qualified, in terms of positive or negative language; this is essentially a quantitative measure of the level of buzz on a select sample of the most politically active and influential sites and blogs on the US web.

democrat caucus iowa

Generally speaking, the web is highly reactive to news coverage and events, and thus acts as an amplifier, often yielding previous insights. When focusing exclusively on the chatter over the past 10 days, Hillary Clinton seems to dominate the Democratic side of our dataset of sites, with 31% of share of voice, closely followed by Barack Obama (29%) and John Edwards at 26%.

When looking at Edwards’ share of voice stats over the past 2 months, one can only notice his impressive online surge, apparently confirmed by recent investments in additional servers by the campaign (http://marcambinder.theatlantic.com/archives/2007/12/a_real_edwards_surge.php), albeit not sufficient to leap ahead of the top two Democratic contenders. Conversely, Hillary Clinton seems to have lost some virtual ground since November, considering she once culminated at 42% of share of voice between November 10 and November 19.

rep caucus iowa

On the Republican side of things, the matter isn’t straightforward either. In the last 10 days, the Republican political webosphere (all the Republican sites in our 2,000 sites dataset) has confirmed the “Huckaboom” by putting him at the top of the charts with a 26% share of voice. Mitt Romney follows with a solid 24% while John McCain and Fred Thompson are lagging behind, respectively with a 16% and a 13% share – the latters’ voice shares being slightly better than their standings in the Iowa polls. Again, if we take a look at the trends over the last 3 months, we’ll notice that Huckabee rose over his competitors at the beginning of December, both online and in the polls. Giulani’s steady decline in the polls in the last months compares with his diminishing share of voice among the Republican online community.

iowa rep polls

To sum it up, here is the share of voice for each one of the leading candidates:

Republicans

Mike Huckabee 26%
Mitt Romney 24%

John McCain 16%
Fred Thompson 13%

Democrats

Hillary Clinton 31%
Barack Obama 29%
John Edwards 26%

Now, let us wait for the first actual results.

All you’ve always wanted to know about our map, but never dared to ask…

Thursday, January 3rd, 2008

Curious about about the Presidential Watch ‘08 map? Here are some answers to the most common questions asked:

I. Drawing the map

The PresidentialWatch08 map is composed of the 297 most visible and influential websites and blogs - out of a complete dataset of over 2000 sites - using Linkfluence’s proprietary crawl technology.

The map includes both social media and mainstream media outlets. The sites are divided into four different categories, or communities (manually labelled):
- Conservative
- Independent
- Mass Media
- Progressive

In terms of methodology, we initiated the process by focusing on a set of a few hundred websites and blogs well-recognized by search engines and other sites related to US politics. Then, we collected the URLs of all sites located just one click away from our initial set - which amounted to tens of thousands of websites.

Why was this step important? Because when it comes to networks – and the web is one giant network – there’s a rule that says that what’s similar to a given node in terms of content will stand close to this node in terms of location. Working with a set of websites large enough, one can collect all the other important websites dealing with the same topics using the “one click removed” idea.

Thanks to a series of metrics, both topology-related (i.e. how many sites link to a particular site) and semantics-related (i.e. are the words used of political nature), we were able to single out over 2,000 websites that constitute the core of the US political webosphere. From these, we extracted the most link-relevant 297 sites.

II. Navigating the map

The PW08 map’s default view is set to display all the categories at once (Conservative, Independent, Mass Media, Progressive). You can select the individual communities you’re interested in and more carefully analyze the links existing between them - most notable to see who links to whom, and what their level of authority is within their community.

See the notice for more practical details on map navigation.

III. Understanding the map

As shown in the map’s navigation bar, a node’s color indicates the community it belongs to, and a node’s size indicates its authority degree (overall number of inbound links) or its Xeno degree (number of inbound links coming from nodes belonging to other categories).

The more links a node receives from other nodes shown on the map, the bigger it appears on the map. Note that the link count is based solely upon links coming from nodes on the map. Links coming from websites located outside of the map are excluded. Based on this approach, we can determine the level of authority attributed to a given site within these communities. This approach may occasionally favor bloggers who splog (spam-blog) others, artificially generating inbound links to their blogs by an abusive use of such techniques as trackbacks. Given the size of the map’s set of websites, we were able to make sure such artificial results were not present.

Nodes are positioned on the map according to a topological placement algorithm, i.e. each node is positioned solely according to its linking pattern, without consideration for the stated political affiliation of the site or its content.

Many algorithms make possible for a 2D rendering of an adjacent matrix - i.e. the matrix describing any graph. We used a Fruchterman Rheingold algorithm, which shares with all the others the same basic principle: minimizing the system’s energy while maximizing the use of the space available for the representation of the data. To minimize the system’s energy, one can for instance assume that nodes that are not linked to each other are pushing away from each other whereas nodes that are linked to each other are attracting each other. Through iterative steps the algorithm tries to find a way to position nodes where there is as little link overlap as possible. To maximize the use of the mapped space, the graph is spread as much as possible over the surface allocated for its display.

These positioning principles call for the following reading conventions:

A site’s position on the map depends solely upon its linking policy. A node has no predefined position, the latter being the result of the relations it has with other nodes. This means that a node with no links at all cannot be positioned on the map, which is why we excluded such websites from the PW08 map;

North, East, South and West don’t matter. The displayed space is not based on the cardinal system (North, East, South, West), which means that the choice of a relative left-right or top-down position is purely arbitrary. Overall, we chose to respect the obvious left-right political axis. The further left you look, the more liberal the site. The further right, the more conservative;

Hubs are center-stage. The displayed space is polarized in a center to periphery tension. The nodes positioned at the center are the ones receiving the most links from other nodes that don’t link much to one another (exogamous nodes). The nodes positioned at the periphery receive fewer links but they receive them from other nodes that tend to link to one another (endogamous nodes). For instance, the PW08 map clearly shows the pivotal position held by the mass media, the sites of large media outlets receiving links from sites pertaining to all the other communities;

It’s not size, it’s density. The map shouldn’t be interpreted with respect to the surface occupied by a given community or subset of nodes. Rather it should be construed with respect to density levels. For instance, two communities may stretch over equally-sized surfaces, with one forming a tight-knit community and the other being looser-knit. An online territory can be occupied by few sites with few links, thus showing a low density level; it can also be occupied by many sites with many links, thus showing a high density level. On the map the “strength” of a community can be inferred from its density and the thickness of the web woven by its nodes. For instance, a zone with a low density level spreading over a large surface should be construed as containing sites with hardly any links to sites in other communities, links being made between nodes within this community (hypertextual endogamy)

That’s it. Now you can navigate inside the PW08 map and analyze in detail the relations between sites and communities.

US Elections web geography

Wednesday, December 19th, 2007

blogopolUS_1

With less than a year to go before election day, the battlefield is already crowded with troops. The Republican and Democrat primaries have brought all supporters and cybersupporters in the debate. Whereas a few months ago American candidates were sending envoys to France to spot the presidential netcampaign’s best practices, they are now the ones steering the wheel, finding new ways to campaign online, pushing further the borders of traditional politics. The netcampaign will take place in every corner of the Internet, from the now ancient e-mails and newsgroups to the new web 2.0 community sites and apps such as Twitter or Digg. It will visit both the most crowded spaces such as YouTube, MySpace or Facebook and the most confidential and secluded - what about some political debating in Lake Ontario’s fly fishing newsgroups? And of course it will still happen within the blogosphere, on thousands of opinion outlets held by supporters, journalists, candidates, writers or citizens. Continuously or from time to time, they will carry, consider or mix the impressive flow of texts, images and sounds published daily by the mass media and, more and more, by their peers.

What do we offer? Some perspective on this very dense flow of opinions. The ability to apprehend the size of this phenomenon by measuring it.

The first measures are made by the topographic surveyor: measure a territory, draw its borders, distinguish its vicinity, spot the highs and lows. The first territory we have mapped is not the multi-dimensional Internet, with too many fronts to cover at the same time! No, the first territory we’ve mapped is the political blogosphere, the territory of all the blogs that will follow and take part in this election. Maybe we should talk about the political webosphere as all the blogs contained therein are not isolated from their hypertextual environments, from sites they link to and they’re linked from. It is this whole ecosystem of intertwined websites that we’ve represented and that we’ll monitor in 2008.

Last spring, we mapped the French political webosphere within the context of the 2007 presidential election. The most astonishing part is that the pulse of this territory, as shown in the map and the various monitors we had set up, actually gave a very good idea of the final outcome, with the ones leading the race on the Internet actually leading the polls. Hence, we suggest you keep a close eye on Presidential Watch 2008 all along the year!

The troops are now ready and trained, the battlefield is before us. Let the political strategists unfold their maps and their most ambitious tactics.