Software

CoSA

CoSA (Core Sentence Annotation) is a software framework for the data collection from politically relevant texts that is currently in the field at the Institute of Political Science at the University of Zurich. It is designed for large-scale collection of relational data from text. More specifically, it allows a semi-automatic application of the core sentence method, which is a content analysis technique specifically developed for the social sciences. The CoSA framework currently encompasses a coding tool and two pre-processing pipelines for German and English, respectively.

  • Coding tool:
    • For a preview of the coding interface, download and read the user manual (only in German at the moment), click this link and login as a guest user (user=guest; passwort=guest).
    • The software documentation (only in German at the moment) can be downloaded here.
  • Core sentence method:
    • To learn about the method of core sentences, download the codebook (only in German at the moment).
  • Pre-processing:
    • If you want to learn more about how relational data can automatically be detected, please refer to this recently presented conference paper.

RSS-Feed Fetcher

At the moment, I collect RSS-Feeds from newspapers and TV stations for future analyses.

If you are interested in how you can fetch RSS-Feed data from any website and store the data in spreadsheets, have a look at my simple feed fetcher (written in python; full understanding requires basic programming skills).

Czech stemmer

Since I found no decent Stemmer for the Czech language which can easily be integrated into python applications, I have written my own.

If you are interested in it, you can download the simple pythonic Czech stemmer here (full understanding requires basic programming skills).