freiberufler Data Scientist and Architect auf freelance.de

Data Scientist and Architect

zuletzt online vor 1 Tagen
  • 90€/Stunde
  • 10777 Berlin
  • auf Anfrage
  • de  |  en
  • 21.05.2025

Kurzvorstellung

Freelance data scientist and software developer with strong expertise in predictive modeling, automation and scalable data systems.

Auszug Referenzen (1)

"Sehr angenehme Zusammenarbeit. C. hat unsere Anforderungen an das Projekt gut verstanden & stand uns mit Ratschlägen zur Umsetzung zur Seite"
Excel & PowerQuery Developer
Joanna Wilbert
Tätigkeitszeitraum

6/2023 – 6/2024

Tätigkeitsbeschreibung

Automation and optimization of MS PowerQuery-based workflows to structure and prepare multi-channel marketing campaigns (Google, Instagram, Youtube, etc.).

Eingesetzte Qualifikationen

Power Bi, Google Cloud, Microsoft Excel

Qualifikationen

  • Analytiker / Programmierer
  • API-Entwickler1 J.
  • Data Science
  • Datenbank-Analytiker1 J.
  • Docker1 J.
  • Machine Learning
  • Neuronale Netze
  • Python2 J.
  • R (Programmiersprache)3 J.
  • Softwareentwickler
  • SQL2 J.
  • Web Entwicklung1 J.

Projekt‐ & Berufserfahrung

Biodiversity Data Scientist
Centre for Integrative Biodiversity Research, iDiv, Leipzig
6/2024 – 3/2025 (10 Monate)
Hochschulen und Forschungseinrichtungen
Tätigkeitszeitraum

6/2024 – 3/2025

Tätigkeitsbeschreibung

Two-stage project focusing on (1) building a scalable data warehouse for integrating and processing heterogeneous biodiversity data, and (2) implementing advanced machine learning workflows to model species distributions under varying environmental conditions.

Design and implementation of a scalable database architecture
- Developed a DuckDB-based solution for local data storage and processing
- Designed a normalized schema to support diverse ecological data types
- Implemented spatial operations for geographic data processing

Development of automated data pipelines
- Created an R package offering a standardized interface for data ingestion, processing, and extraction
- Built automated workflows for orchestration and reproducibility using targets
- Integrated heterogeneous data sources, including species occurrences, taxonomy, and environmental raster layers

Predictive modeling using machine learning and deep learning
- Built end-to-end modeling pipeline to predict species distributions at continental scales
- Deployed different model types under shared feature processing, cross-validation, and model evaluation regime
- Developed embedding architecture to improve information sharing across species in Neural Networks

Eingesetzte Qualifikationen

R (Programmiersprache), Datenmodelierung, SQL, Neuronale Netze, Torch, Python

Software Developer
KWS Saat, Einbeck
5/2024 – offen (1 Jahr, 2 Monate)
Life Sciences
Tätigkeitszeitraum

5/2024 – offen

Tätigkeitsbeschreibung

Development of a data exploration and visualization platform to support plant breeding decisions at KWS Saat. The system provides an intuitive interface to internal data sources and enables staff researchers to explore complex interactions between gene variants, crop characteristics and physiological pathways.

Development of a REST API for graph database integration
- Built a FastAPI-based service layer for interacting with a Neo4j graph database
- Designed efficient, targeted query endpoints
- Integrated authentication and access control mechanisms

Design and implementation of an interactive analytics dashboard
- Developed a web interface for interactive data exploration using Plotly Dash
- Implemented network visualizations with Cytoscape for complex relationships
- Enabled real-time data access via API integration
- Incorporated AI-assisted features through external API services

Eingesetzte Qualifikationen

API-Entwickler, Datenbank-Analytiker, Docker, Python, Web Entwicklung

Excel & PowerQuery Developer
Offerista, Berlin
6/2023 – 6/2024 (1 Jahr, 1 Monat)
Medienbranche
Tätigkeitszeitraum

6/2023 – 6/2024

Tätigkeitsbeschreibung

Automation and optimization of MS PowerQuery-based workflows to structure and prepare multi-channel marketing campaigns (Google, Instagram, Youtube, etc.).

Eingesetzte Qualifikationen

Power Bi, Google Cloud, Microsoft Excel

Senior R/R-Shiny Developer
T-Systems, Frankfurt
8/2022 – 2/2024 (1 Jahr, 7 Monate)
Banken
Tätigkeitszeitraum

8/2022 – 2/2024

Tätigkeitsbeschreibung

Large-scale project for the European Central Bank, managed by T-Systems, to migrate the ECB’s existing infrastructure for storing, aggregating and publishing economic data to a modernized technology stack. As part of the reports migration team, I developed infrastructure to support the generation of parameterized, standardized reports and documents across departments.

Development of a custom R-package ecosystem
- Created dynamic tools for the generation of ECB press releases and reports in various formats (html, pdf, txt, docx)
- Designed and implemented an R Shiny web application to enable collaborative report production across ECB teams

Integration with enterprise systems
- Integrated R-based solutions with internal systems including timeseries databases, document management platforms or business process metadata stores
- Embedded reporting workflows into Camunda for seamless process automation

Stakeholder engagement and presentation
- Operated within a Scrum-based agile team, contributing to sprint planning, development and review cycles
- Delivered regular presentations to ECB product management and end users to ensure alignment with business needs and technical feasibility

Eingesetzte Qualifikationen

Agile Methodologie, Git, Jira, Python, R (Programmiersprache), SQL

Softwarentwickler (Festanstellung)
Karlsruhe Institut für Technologie, Karlsruhe
11/2021 – 6/2022 (8 Monate)
Hochschulen und Forschungseinrichtungen
Tätigkeitszeitraum

11/2021 – 6/2022

Tätigkeitsbeschreibung

I built and deployed a web application that helps researchers to standardize and share vegetation data. The application provides an intuitive interface for uploading and editing various data types and converting them into a highly standardized XML exchange format. This project required a deep understanding of XML and R Shiny as well as a consistent implementation of software design principles and performance optimizations. The resulting application - VegXshiny – now provides a key tool for sharing and integrating vegetation data across the scientific community.

Eingesetzte Qualifikationen

Docker, R (Programmiersprache), Server Administration, Web Entwicklung, XML

Ausbildung

Biodiversity, Ecology & Evolution
Dr. rer. nat.
Georg-August University of Göttingen
2019
Göttingen
Biodiversity, Ecology & Evolution
M. Sc.
Georg-August University of Göttingen
2014
Göttingen
Umweltmonitoring / Umweltanalyse
B. Sc.
HTW Dresden
2012
Dresden

Über mich

Freelance data scientist and software developer with strong expertise in predictive modeling, automation and scalable data systems. Proven track record of delivering production-ready solutions for clients in finance, biotech and marketing – from building data platforms and REST APIs to deploying interactive dashboards and AI-powered analytics. Effective communicator and team player.


Professional Skills

Analytical thinking
Independent problem solving
Creativity
Effective communication
Scientific expertise (biodiversity & climate)

Programming

Python, R, SQL, JavaScript, M (Power Query language)
HTML, CSS, XML
Package development (Python & R)
API design (FastAPI, Flask)
ETL / ELT pipelines
Geospatial processing (GIS, terra, sf, raster)
Scripting & workflow automation

Machine Learning

Classification & regression: linear models, tree-based methods, neural networks
Hyperparameter tuning, cross-validation, model evaluation
Bayesian modeling
Dimensionality reduction, Clustering
Technologies: XGBoost, scikit-learn, caret, PyTorch, NumPy, dplyr, pandas, JAGS, Stan

Databases

Data modelling, schema design, data normalization
Query design and optimization
OLTP & OLAP systems
Technologies: MySQL, PostgreSQL, DuckDB, OracleDB, Neo4j

Visualization & Reporting

Interactive reports
Dashboards, Graph Visualization
Automated documentation
Technologies: R Shiny, Plotly Dash, quarto, markdown, plotly, ggplot

Development & Orchestration

Version control: Git, GitHub, GitLab
Cloud: AWS, GCP
Containers & environments: Docker, pip, conda, venv, renv
IDEs & notebooks: VS Code, RStudio, Jupyter, Positron
Workflow automation: make, targets, Camunda
OS & scripting: Linux, Bash, Windows
Agile tools: Jira, Confluence, Slack, Teams

Persönliche Daten

Sprache
  • Deutsch (Muttersprache)
  • Englisch (Fließend)
Reisebereitschaft
auf Anfrage
Arbeitserlaubnis
  • Europäische Union
Home-Office
bevorzugt
Profilaufrufe
19
Alter
37
Berufserfahrung
6 Jahre und 3 Monate (seit 03/2019)
Projektleitung
3 Jahre

Kontaktdaten

Nur registrierte PREMIUM-Mitglieder von freelance.de können Kontaktdaten einsehen.

Jetzt Mitglied werden