Jure Cuhalev

28 Jan, 2012

Visualizing Slovenian coalition agreement

Posted by: Jure Cuhalev In: visualization

With the election of new Slovenian prime minister we also got formal release of a Coalition agreement. Since it’s a 72 page document, I was wondering what keywords would stand out. Here is the result:

Pogodba za Slovenijo 2012 - 2015 - word cloud (top 80 words)

While we’re at it, we can also take a look at the coalition agreement that Pozitivna Slovenija prepared. As we run them through the same process, we get:

Koalicijska pogodba - Pozitivna Slovenija - 2012 (top 80 words)

 

A few words on how to reproduce this:

  • Grab your favorite OCR software and convert scanned PDF into .docx
  • From Word save it into .txt file
  • Lemmatize the words so you normalize all the grammar rules
  • Apply stop-words (in this case mostly: ministrstvo*, vlada, slovenija*, ..)
  • Drop the resulting text into wordle.net

 

3 Responses to "Visualizing Slovenian coalition agreement"

1 | Karlos

January 30th, 2012 at 1:31 pm

Avatar

?e ni skrivnost, me zanima kateri software si uporabil za lemetizacijo?

3 | Jure Cuhalev

January 31st, 2012 at 12:07 pm

Avatar

Thanks Domen. That’s the one.