Place:
Unicom Building
Room: 9.3120
Mary-Somerville-Straße 9
28359 Bremen
Time:
Thu.: 2 - 5:30 p.m.; Fri.: 9 a.m. - 5 p.m.

This workshop with Stefan Müller from Trinity College Dublin offers a hands-on introduction to extracting data from text and applying various methods to analyse it. Topics include:

  • From Raw Text to Corpus – how to collect textual data and prepare it for analysis.
  • Classification techniques – the first steps in translating text into usable data; supervised and unsupervised learning; dictionary approaches and topic modelling.
  • Scaling – supervised and unsupervised techniques.
  • An overview of more advanced topics.


The applied parts of the workshop use the programming language R, so a basic familiarity with R is a prerequisite for attending the course.

Registration: BIGSSS fellows register via CampusNet; SOCIUM and CRC 1342 members should send a short email to mlarsen@bigsss-bremen.de