Archive

Archive for the ‘Uncategorized’ Category

Twitter Mining

January 2nd, 2010

Twitter, for all its fans and detractors, generates a LOT of data.  It may only be a very small percentage of the world’s population who tweet a lot, but even that can be representative of interesting/important trends and changes.

Oh, and by the way, is twitter worried about this twitter.com traffic trend (courtesy of compete.com)?

But in terms of mining the human chatter that happens through Twitter, who is doing anything interesting?  I didn’t find much:

1. TweetDeck’s  TwitScoop

Dodgy naming aside, this column in tweetdeck is one I keep switched on.  But a keyword tag cloud isn’t exactly world-shattering in 2010.  Still, it’s there, and while the screengrab above doesn’t tell me much (and I’m still not sure what/who “snead” is, even after a search), it’s a good finger-on-the-pulse.

2. TrendsMap

Now this is more like it (and a better name).  A google mashup with (another) tag/word cloud floated on top, it gives an overview or trending topics, along with a real-time snapshot of individual tweets.   And no, I’ve no idea why blasphemy is a hot topic in Ireland right now…

 3. Neoformix

Now this guy I have a lot of time for.  He’s particularly well known for his “Twitter Stream” graphs, which shows word usage trends over time, as below. 

However, head on over to his projects page, and you’ll find charts that include “time of day word correlations” (as below), “Twitter Venn” (twitter Venn diagrams) and a host of other tools. 

So, I know Jeff at Neoformix isn’t the only guy doing interesting analysis of Twitter data, but what surprises me is how few people seem to be working on it – at least that I’m aware of.  Sure, the fact that the word “drunk” is tweeted most between midnight and 5am isn’t going to change the way we see the world – but what about if a brand name suddenly takes off?  Or the word “recession” is on a downwards trend, or “flu” on an upwards trend? 

More Twitter Mining, please…

admin Uncategorized, data mining

Embedded media – widgets!

September 12th, 2008

Courtesy of the guys at Widgetbox I’ve been exploring what’s available out there to embed and add value to visitors.  The answer is, unsuprisingly, “what isn’t available out there”.   I picked up couple of examples at random that I thought were interesting.  Added value?  Well…

admin Uncategorized, Web

12,000 laptops LOST every week in US airports

August 27th, 2008

null

This story caught my eye – quite possibly as I’m flying tomorrow… 12,000 laptops EVERY WEEK, that’s 624,000 per year, or one for every 500 people. 

What’s even more astonishing is that 70% of those aren’t reclaimed.  Is everyone feeling that flush despite the crunch?

Ironically, 40% of the lost laptops are left at the security checkpoints. The departure gate (23%), toilets (9%) and restaurants (7%) are other places worth hanging around if you fancy picking up a free PC.

The article (In PC Pro magazine) finishes up by saying that 14% of the laptops have “intellectual property” on them – which is a bit worrying, as that means 86% had nothing at all on them, or maybe just a bunch of unintellectual property.

admin Uncategorized

History

August 10th, 2008

As an example of the data that a server can already pick up from a simple visit, have a look at the table below (visit a couple of those sites and come back and see the difference).

I’ve also added a widget to the right of this page which shows some of the other (quite simple) information that’s easily scraped from your visit – such as IP address (which can then locate you geographically, broadly), screen resolution, local time of day and so on.

None of this information should represent a privacy invasion – though you should be aware that it is collect-able (the broader mass market isn’t, generally). 

The great thing is that some of this information is actually very useful at telling a site more about you – and therefore providing you a better and more personal service. Getting the balance of privacy and personalisation is of course a challenge (and one of the most important things we’re working on at at Favy).  Right now, most of the information you see below and to the right is thrown away and not made use of.


Site Visited?

Thanks to gemal for this script.

admin Uncategorized