Open Data Science Text and Data Analytics Workflows for Science and Civil Society

About me

Background

Currently I'm finishing a M.Sc. in Environmental System Sciences with a specialization on Environmental Economics. From 2012 to 2015 I acted as a board member for a cultural open space project in Graz (Austria), which served as a local community platform for social-ecological-political discourse, sustainability-related projects, and art.

During my studies I recognized the need for increasing public access to knowledge, and the complementary need for tools that scale with the increasing amount of knowledge produced. Around 2013 I joined the Open Science Movement, where both issues are being tackled - on the one hand by advocating for Open Access and Open (Research) Data, on the other hand by creating an open ecosystem of tools and services.

  • B.A. European Studies (University Bremen, Germany, 2010)
  • B.Sc. Environmental System Sciences (University Graz, Austria, 2016)

Approach

Machine Learning and Data Science are powerful instruments, but when applying them we need to avoid the "Type III Error" ;) , describing the situation of "Answering the wrong question", or "Solving the wrong problem". For me it means developing tools in close cooperation with individuals and communities, thoroughly assessing their needs and then homing in on the issue that code and data really can solve.

I aim to develop software as a tool to empower humans with the knowledge they need. This also means working as open as possible, and making code, data and processes re-usable, adaptable and shareable without barriers.

Methodology

  • Natural Language Processing
  • Machine Learning
  • Network Analysis

Stack

  • Python
  • Apache Spark
  • Neo4j

Projects

I specialize in developing data science workflows for research projects and non-governmental/non-profit organizations. If you have a specific use case or simply want to discuss an idea, drop me a line!


Open Knowledge Maps

A visual interface to the world's scientific knowledge

Open Knowledge Maps are creating a visual interface to the world's scientific knowledge that can be used by anyone in order to dramatically improve the discoverability of research results.

At Open Knowledge Maps I work on the algorithms that cluster and summarize the search results.

Try it out!

dlina

A workflow for a Digital Literary Network Analysis

Working group members are Frank Fischer, Mathias Göbel, Dario Kampkaspar, Hanna-Lena Meiners, Danil Skorinkin, and Peer Trilcke. We're looking into hundreds of dramatic texts ranging from Greek tragedies to 20th-century plays and work on larger German, French, English, and Russian corpora.

In dlina I work on a tool for network visualizations and network metrics.

Click here to learn more.


ContentMine

The Right To Read Is The Right To Mine

ContentMine develops open source software for mining the scientific literature and engage directly in supporting researchers to use mining, saving valuable time and opening up new research avenues.

At ContentMine I am responsible for downstream data analysis and visualizations for demonstration purposes.

Click here to learn more. Try a demo here!

Community Activities

I’m also active in the Open Science movement, contributing to Open Knowledge and Mozilla Science Lab efforts to help open up research and increase reproducibility and re-usability of scientific results, methods and processes.


Open Knowledge Austria

Empowering communities by opening up knowledge

Since August 2016 I'm a board member of Open Knowledge Austria. We run community projects in the field of Open Data, Open Science, Transparency and Code Literacy.

Click here to learn more (in German).