In the research area Data Science, we focus on the development and application of data-driven methods for solving biological, medical, and technical problems. In close cooperation with our partners from research and industry, we develop algorithms for preprocessing and analyzing data using statistical methods and machine learning, as well as visualizing and interpreting the results. For example, within the project LeiVMed Online we develop a platform for benchmarking and visualizing data of treatments of patients in hospitals and developing prediction models for the outcome of treatments.
In many of our projects, we see the need for customized data processing pipelines due to the heterogeneity and complexity of data structures in real world processes and systems. We use applied statistics as well as numerous machine learning approaches, including black box methods (deep neural nets, random forests, gradient boosted trees, etc.) and white box methods (symbolic regression by genetic programming). We build our knowledge discovery pipelines on a variety of different frameworks, especially python scikit-learn, tensorflow, pytorch, MATLAB, and HeuristicLab.