open data

Hans Rosling strikes again

Hans RoslingI’m a big fan of Hans Rosling. He’s really the guy that opened up the eyes of the world to the importance of availability of and open access to data.

His latest presentation, from Google’s Zeitgeist08 conference, is now available on YouTube. It is not as stunning as his original TED eye-opener, but still among the best material you’ll find online, and has surprisingly not made its rounds on the web for real yet. In this new video, he brings good news about AIDS, unveils the realities of oil production in US and Russia, discusses CO2 emissions and how the economic power is shifting from the west to the east. Oh, and there is stuff about money and sex also 🙂

The video can be seen here: 10 years in, 10 years out – Hans Rosling

See my previous blog entries about Hans Rosling here:

P.S. Also speaking at Zeitgeist08 was Paul Hawken. Hawken is giving a speech on sustainability here in Reykjavik on Saturday (Icelandic). From the little stuff I’ve seen he seems a little too evangelistic for my taste, but I’ll check it out anyway – ready to be converted 🙂

The Case for Open Access to Public Sector Data

This article will be published tomorrow in The Reykjavík Grapevine.

Government institutions and other public organizations gather a lot of data. Some of them – like the Statistics Office – have it as their main purpose, others as a part of their function, and yet others almost as a by-product of their day-to-day operations.

In this case I’m mainly talking about structured data, i.e. statistics, databases, indexed registries and the like – in short, anything that could logically be represented in table format. This includes a variety of data, ranging from the federal budget and population statistics to dictionary words, weather observations and geographical coordinates of street addresses – to name just a few examples.

In these public data collections lies tremendous value. The data that has been collected for taxpayers’ money for decades or in a few cases even centuries (like population statistics) is a treasure trove of economical and social value. Yet, the state of public data is such that only a fraction of this value is being realized.

The reason is that accessing this data is often very hard. First of all its often hard to even find out what exists, as the sources are scattered, there is no central registry for existing data sets and many agencies don’t even publish information on the data that they have.

More worrying is that access to these data sets is made difficult by a number of restrictions, some accidental, other due to lack of funding to make them more accessible and some of these restrictions are even deliberate. These restrictions include license fees, proprietary or inadequate formats and unjustified legal complications.

I’d like to argue that any data gathered by a government organization should be made openly accessible online. Open access, means absence of all legal, technical and discriminating restrictions on the use or redistribution of data. A formal definition of Open Access can be found at opendefinition.org

The only exception to this rule should be when other interests – most importantly privacy issues – warrant access limitations.

There is a number of reasons for this. First of all, we (the taxpayers) have already paid for it, so it’s only logical that we can use the product we bought in any way we please. If gathering the relevant data and selling it can be a profitable business on its own, it should be done in the private sector, not by the government. Secondly it gives the public insight into the work done by our organizations in a similar way as Freedom of Information laws have done – mainly through media access to public sector documents and other information.

The most important argument – however – is that open access really pays off. Opening access and thereby getting the data in the hands of businesses, scientists, students and creative individuals will spur innovation and release value far beyond anything that a government organization can ever think of or would ever spend their limited resources on.

Some of these might be silly online games with little monetary value but yet highly entertaining. Others might be new scientific discoveries made when data from apparently unrelated data sources is mixed. And yet others might be rich visualizations that give new insights on some of the fundamental workings of society – showing where there’s need for attention and room for improvement.

A recent study on the state of matters with Public Sector data in the UK concluded that the lack of Open Access is costing the nation about 1 billion pounds annually in lost opportunities and lack of competition in various areas. Per capita, a billion pounds in the UK equals about 750 million ISK for Iceland and that’s without adjusting for Iceland’s higher GDP and arguably some fixed gains per nation.

Surely a huge opportunity for something that requires only a thoughtful policy change and a little budget adjustment to enable the institutions to make the needed changes and continue their great job of gathering valuable data.

– – –

See also an article from The Guardian, that helped spark a similar debate in the UK: Give us back our crown jewels