Jure Cuhalev

12 Mar, 2010

Visualizing Slovenian IT tax spending

Posted by: Jure Cuhalev In: visualization

My latest released project is focusing on Visualizing Slovenian IT tax spending (139 million euros), the idea here is to take otherwise meaningless numbers and display them visually in a way that tells a story of who is spending how much and on what. The data set comes directly from the government in semi-clean XLS file. Visualization technique I’ve decided on is treemap visualization to represent the data with different box sizes relative to each other.

Give it a try for yourself:

Launch the interactive Slovenian IT tax (in Slovenian)

Visualizing Slovenian IT tax spending

While visualization itself is nice, there are a two points that you have to be careful about when releasing such visualizations to the public:

Transparency of data and data transformations

In my case, the data set came directly from the government. In order to make sure that everyone can check my calculations I’ve included links to their file as well as provided a local copy in case their version changes or disappears.

You’re loosing and reinterpreting data with every visualization. That is why it’s important to also include transformation scripts so that others can check your work and possibly build on top of it or at least make sure that you didn’t do anything tricky with the data.

I’ve opted for a github repository where I’ve pushed all the associated files: http://github.com/gandalfar/itproracun. It’s a bit chaotic but it should be pretty self-explanatory to any python and JavaScript developer.

Telling the story
Every data visualization is trying to tell a story. It might not be obvious to the visualization author but it helps to identify this early in the process.

I started with just a simple breakdown based on the institutions:

It’s very noisy and it’s hard to compare different institutions to each other. Initial comments to this were that it’s not shiny enough. Cleaning the interface up I came to the following revision:

It’s much cleaner and what basically showed that I need to find an angle to this data. I decided to focus on the ratio between software and services vs. hardware and network equipment. Final version now tells a story of how police is spending a lot of their IT money on network and hardware equipment, while Tax Office is spending much more money on software and services.

Agenda of this last version of visualization should be clear to anyone who takes a few moments to study it.

Other lessons learned

Visualization toolkit should be powerful on one hand, but offer first results without too much work. JavaScript InfoVis Toolkit does this job very well. There are some interesting tidbits that are not entirely clear from the documentation, but become obvious once you start thinking how the rendering works.

The biggest time sink is parsing and cleaning up the data. Don’t expect that the .xls file will make any sense from the programmatic point of view, even though it mostly looks fine when viewed manually. Small parsing errors, moved cells and strange line breaks made parsing this data the biggest challenge.

Big thanks go the community of Slo-Tech and my brother that gave valuable feedback during the development.

I hope you’ve enjoyed this visualization. Let me know in comments what other points of view you’d like to see as well as your ideas how to further improve it.

OK Go recently released video for their song This Too Shall Pass, of their new album, Of the Blue Colour of the Sky, and it shows that they really understand viral web video and that their target demographic consumes their content on YouTube and Facebook.

First, check out their video, if you haven’t yet. It’s really worth watching, even if you don’t like their music. Analysis after the jump:

As you’ve seen it’s a Rube Goldberg inspired music video that is doing everything it can to keep your attention. It starts with a shock view of singer that looks like he just slaughtered a cow and as that grabs your attentions it gives you an interesting machine to observe as you watch the video. If they kept your attention for 30s you’ve probably managed to be enthusiastic enough about it to instant message it to your friends as well shared it on Facebook etc.

Based on their recent open letter, this is exactly what brings money today to a band – YouTube advertisements and the only way to actually make serious money on YouTube, besides having 500 videos that you released over last 2 years is to go viral with a well thought our video.

This brings to a completely new problem: what sells of the internet is porn, but if you can’t show that, kittens and lolcats will do. So the best tactic for an indie band that would like to get a lot of views would be to get some cute girls and somehow embed fuzzy kittens and puppies into their video. This way you’ll have a few bonus points in terms of views and maybe you’ll be able to achieve tipping point that will allow you to skyrocket the number of your views.

Did you spot any other details in the video that would contribute towards virality and sharing?

26 Feb, 2010

MobileCamp Ljubljana in late March!

Posted by: Jure Cuhalev In: conferences

Who needs to sleep when you’re having so much with the BarCamp like events, right? Building upon this idea we’ve announcing MobileCamp Ljubljana, that’s going to take place on 27th of March 2010 at the Faculty of Electrical Engineering.

I’m excited about this MobileCamp for two main reasons: we’ll be bringing local mobile phone developers together for the first time in such numbers. This is important for Slovenian Mobile scene as there many indie iPhone and Android developers that need to see that their peers are active in the industry and that they’re not alone.

The second is that this venue is much bigger in terms of space so we should be able to introduce more people to concept of unconferences as well prepare grounds for a series of smaller MobileCamps later in the year if there will be enough demand.

We’re seeing increasing demand for iPhone and Android developers from companies and entrepreneurs as well as demands for applications from users, so lets expand this market fast and catch-up with the rest of the Europe. This time we’re switching back to English as primary language and we’re hoping to attract broader community also from the neighboring countries.

As always, go to http://www.mobilecamp.si for all the gory details and how to sign-up for your spot at the MobileCamp.

25 Feb, 2010

RealTime WebCamp reflections

Posted by: Jure Cuhalev In: conferences

Ivan talking about Marketing

We can say once again that our last event, RealTime WebCampLjubljana, was a great success. We didn’t run out of coffee, there was enough electricity and WiFi as well as a great mix of people and the quality of talks and discussions was astonishing.

There are a few things that we did differently this time that had effect on the ‘camp feeling:

  • Smaller group – instead of going with 150+ we went for 50+ crowd. More intimate and it allowed for more open discussions.
  • Two tracks only – moved from 3 tracks format to 2 tracks as there was less people and we also had to optimize for the available rooms. We still managed to create a nice balance of tech vs. non-tech oriented talks.
  • Slovenian language – despite protests from our friends in Croatia, we decided to go with Slovenian as a primary language this time. My current feeling is that this made it easier for everyone involved and we should consider sticking to Slovenian for smaller camps that are not intended for international audience.
  • ‘Hackish venue’ – instead of going for university or conference venue, we crashed in Hekovnik this time, a new hackerspace in Ljubljana. This had a totally different feeling of more ‘ad-hoc’ and less sterile environment. It seemed like people enjoyed this more.

There is still a question – how to encourage people to prepare more sessions and how to time the event. Going for early Saturday morning as a start (8am-ish) seems to work great for now. Regarding more sessions, I’m not sure if we can expect more than 30% of attendees to run their own session, would be interested in recommendations on any literature on the topic of facilitating sessions.

Did I miss anything important that we should take into consideration for our next ‘camps?

01 Feb, 2010

Announcing RealTime WebCampLjubljana

Posted by: Jure Cuhalev In: conferences|ideas

It’s been two months after last WebCamp and it’s time for another party. Following the idea of BarCamps with story line, we’ve decided to organize RealTime Social WebCamp Ljubljana.

This time we’ve decided to focus on a single emerging technology space: Real-Time Social Web. If you’re not up-to date with the latest buzzwords, it’s about next generation RSS protocols like HubSubPubBub, RSSCloud, XMPP (that powers GTalk), Synaptic Web, Twitter API and a bunch of other technologies and ways of thinking about the Web and Mobile space.

Intended audience are developers and people who are close to them (e.g. interface designers, product managers, etc.) and will need to innovate in this space in the next 6 months.

We’re doing it a bit more limited this time, just 50 spaces. I’m interested in seeing how a smaller and more focused group changes the dynamics of such gathering.

The rules for the tickets are the same as the last time. Send description of your talk early and you get a ticket, or hope that you can click fast enough for the left-over tickets later. I believe that extra effort should be awarded.

The official language this time is Slovenian since we’ve figured out that locals that are not native speakers understand our geek talk enough that it shouldn’t be a problem and we can understand them as they lecture in their own language.

As always: all the details are at http://www.webcamp.si

About

jure100px_colour Jure Cuhalev is an Open Source Hacker, with background in Social Sciences and Usability and User Experience (UX).

Recent projects