freiberufler Machine Learning  - Data Engineering - Data Science - MLOps auf freelance.de

Machine Learning - Data Engineering - Data Science - MLOps

zuletzt online vor 1 Tagen
  • auf Anfrage
  • 08960 SANT JUST DESVERN
  • Europa
  • es  |  de  |  en  |  fr  |  it
  • 15.05.2025

Kurzvorstellung

- Thinker & Doer
- Trustful
- Continuous Learner

Auszug Referenzen (2)

"It was very good working together with F.. He helped us building an MVP. Verifying technologies and building a larger application. Great collab!"
Data Engineer
Mahran Meißner
Tätigkeitszeitraum

8/2022 – 11/2022

Tätigkeitsbeschreibung

Data Engine to automate the analysis of the market key aspects around some trending
topics.
In GCP, a scheduled job runs twitter parametrized queries and saves them into BigQuery.
The data is processed with NLP techniques and the key topics are selected algorithmically.
The topics are then plotted in a word cloud which is accessed by the users over an API
deployed in GCP. The engine is deployed in a docker container and scheduled to run daily.
Designed and developed a Python-based data pipeline to automate the analysis of
trending technological topics, leveraging NLP for text processing and BigQuery for data
storage.
Containerized and deployed the solution in GCP, enabling daily automated
summarizations and delivering visual insights via an API-accessible word cloud.
Collaborated with the IT Innovation team lead to provide actionable insights by overcoming
challenges in processing unstructured Twitter data for accurate analysis.

Eingesetzte Qualifikationen

Natural Language Processing, Text-Extraction, API-Entwickler, Docker

"F. B. built AWS pipelines for Hotusa’s bed bank, processing 100B+ daily transactions with full traceability and monitoring."
Data Engineer
Carles Fontanals
Tätigkeitszeitraum

9/2020 – 11/2023

Tätigkeitsbeschreibung

Pyspark Prozesse
Automatisierung von custom Dashboards
Data Engines in Docker AWS
Datawarehouse Lösungen in AWS Athena
Entwicklung von Dashboards

Projekte:
Log Server Analysis Issue Detection @ Hotusa. 2023/07 – 2023/11
Developed a robust data pipeline to unify diverse request and error logs, enabling the efficient computation of metrics such as mean response time, error rates, and request volumes across multiple granular dimensions.
The team built a dashboard with these data that empowered channel agents to identify integration issues with associates early, significantly reducing resolution times. Leveraged PySpark, AWS Athena, and AWS Glue for scalable data processing and analytics.

- Direct Channel Dashboard Automation @ Hotusa. 2023/02 – 2023/04 (Barcelona)
Built and deployed a robust data pipeline using PySpark, Athena, and Docker on AWS to integrate booking data and web logs for comprehensive analysis.
Defined dashboard requirements and supervised the development of a dashboard to be sent to hundreds of customers delivering monthly KPIs tailored to customer needs.
Ensured the successful delivery of the project by overseeing technically and organizationally all stages.

- Rate Controller Phase 2 @ Hotusa. 2021/12 – 2022/03
Using Pyspark, significant calculation extension and datamart population. See below project Rate Controller Dashboard. (AWS Sagemaker, AWS Glue, AWS Athena)

- Rate Controller System Dashboard @ Hotusa . 2020/10 – 2021/03
Designed and developed a near real-time processing engine to monitor price ratings and optimize market rates by ingesting over 300 million records daily. Implemented business rule processing using PySpark on AWS Glue, with data storage and querying facilitated through AWS Athena (Hive). The solution delivered actionable insights to support dynamic pricing strategies at scale.

Eingesetzte Qualifikationen

Amazon Web Services (AWS), Big Data, Business Intelligence (BI), Data Warehousing, Docker, Python

Qualifikationen

  • Big Data3 J.
  • Business Intelligence (BI)3 J.
  • Computer Vision4 J.
  • Data Engineer4 J.
  • Data Science4 J.
  • Data Warehousing3 J.
  • Large Language Models4 J.
  • Machine Learning Engineer4 J.
  • MLOps4 J.
  • Natural Language Processing4 J.
  • Python8 J.

Projekt‐ & Berufserfahrung

Data Engineer
Siemens, Regensburg
8/2022 – 11/2022 (4 Monate)
Bauwirtschaft, Anlagen- und Schiffbau
Tätigkeitszeitraum

8/2022 – 11/2022

Tätigkeitsbeschreibung

Data Engine to automate the analysis of the market key aspects around some trending
topics.
In GCP, a scheduled job runs twitter parametrized queries and saves them into BigQuery.
The data is processed with NLP techniques and the key topics are selected algorithmically.
The topics are then plotted in a word cloud which is accessed by the users over an API
deployed in GCP. The engine is deployed in a docker container and scheduled to run daily.
Designed and developed a Python-based data pipeline to automate the analysis of
trending technological topics, leveraging NLP for text processing and BigQuery for data
storage.
Containerized and deployed the solution in GCP, enabling daily automated
summarizations and delivering visual insights via an API-accessible word cloud.
Collaborated with the IT Innovation team lead to provide actionable insights by overcoming
challenges in processing unstructured Twitter data for accurate analysis.

Eingesetzte Qualifikationen

Natural Language Processing, Text-Extraction, API-Entwickler, Docker

Data Engineer
Prolion, Wiener Neustadt
6/2021 – 9/2021 (4 Monate)
IT & Entwicklung
Tätigkeitszeitraum

6/2021 – 9/2021

Tätigkeitsbeschreibung

Upon hierarchical data generated by a file system crawler, the project was about defining an architecture and execute an end-to-end proof of concept ready to be used to real deployments. Python and recursive SQL scripts (CTE) were used as ETL and Postgesql was the RDBMS used to store the datamart. Each layer was deployed with Docker.
This analytical system was integrated into a product, and has already been successfully deployed in production. The proof of concept included a dashboard done with PowerBI Desktop. At the time of this writing the production version of the dashboard has been left for the next phase because the PBI dashboard tool was not a ready-to-go solution for all customers.

Eingesetzte Qualifikationen

Postgresql, Power Bi, Python-Programmierer, SQL

Machine Learning Entwickler und Research Teammember
Research Industrial System Engineering, Wien
4/2021 – offen (4 Jahre, 2 Monate)
High-Tech- und Elektroindustrie
Tätigkeitszeitraum

4/2021 – offen

Tätigkeitsbeschreibung

Deep Learning & Classical Machine Learning
Computer Vision & NLP
Data Engineering
MLOps

Eingesetzte Qualifikationen

Python, Computer Vision, Data Engineer, Data Science, Large Language Models, Machine Learning Engineer, MLOps, Natural Language Processing

Data Engineer
Hotusa, Barcelona
9/2020 – 11/2023 (3 Jahre, 3 Monate)
Tourismus und Freizeitwirtschaft
Tätigkeitszeitraum

9/2020 – 11/2023

Tätigkeitsbeschreibung

Pyspark Prozesse
Automatisierung von custom Dashboards
Data Engines in Docker AWS
Datawarehouse Lösungen in AWS Athena
Entwicklung von Dashboards

Projekte:
Log Server Analysis Issue Detection @ Hotusa. 2023/07 – 2023/11
Developed a robust data pipeline to unify diverse request and error logs, enabling the efficient computation of metrics such as mean response time, error rates, and request volumes across multiple granular dimensions.
The team built a dashboard with these data that empowered channel agents to identify integration issues with associates early, significantly reducing resolution times. Leveraged PySpark, AWS Athena, and AWS Glue for scalable data processing and analytics.

- Direct Channel Dashboard Automation @ Hotusa. 2023/02 – 2023/04 (Barcelona)
Built and deployed a robust data pipeline using PySpark, Athena, and Docker on AWS to integrate booking data and web logs for comprehensive analysis.
Defined dashboard requirements and supervised the development of a dashboard to be sent to hundreds of customers delivering monthly KPIs tailored to customer needs.
Ensured the successful delivery of the project by overseeing technically and organizationally all stages.

- Rate Controller Phase 2 @ Hotusa. 2021/12 – 2022/03
Using Pyspark, significant calculation extension and datamart population. See below project Rate Controller Dashboard. (AWS Sagemaker, AWS Glue, AWS Athena)

- Rate Controller System Dashboard @ Hotusa . 2020/10 – 2021/03
Designed and developed a near real-time processing engine to monitor price ratings and optimize market rates by ingesting over 300 million records daily. Implemented business rule processing using PySpark on AWS Glue, with data storage and querying facilitated through AWS Athena (Hive). The solution delivered actionable insights to support dynamic pricing strategies at scale.

Eingesetzte Qualifikationen

Amazon Web Services (AWS), Big Data, Business Intelligence (BI), Data Warehousing, Docker, Python

Lead Data Scientist (Festanstellung)
zerog (Lufthansa Gruppe), Frankfurt am Main
8/2016 – 2/2020 (3 Jahre, 7 Monate)
Luft- und Raumfahrtindustrie
Tätigkeitszeitraum

8/2016 – 2/2020

Tätigkeitsbeschreibung

Data Science & Machine Learning Projekte

Eingesetzte Qualifikationen

Python, Pytorch, R (Programmiersprache), Tableau

Ausbildung

Advanced Data Science
2017
Northwestern University
2017
Online
Master of Sciencie in Predictive Analytics
2014
Northwestern University
2014
Online
Informatik
nicht abgeschlossen
Universitat Oberta de Catalunya
2008
Online
Bertriebswirtschaft
2001
Universitat Pompeu Fabra
2001
Barcelona

Über mich

Über 20 Jahre Erfahrung in der Umsetzung datengetriebener oder anwendungsorientierter Forschungsprojekte. Ich habe skalierbare Datenplattformen in verschiedensten Branchen im Bereich Machine Learning, Data Engineering und Data Science operationalisiert, aber auch Proof-of-Concepts und Forschungsprojekte geleitet oder ein Teilnehmer gewesen. Diese Initiativen bildeten die Grundlage für strategische oder gewinnsteigernde oder kostensenkende Entscheidungen.

Weitere Kenntnisse

- Programming Languages & Scripting: Python, R, SQL, javascript, Visual Basic.
- Machine Learning & Data Science Frameworks: PyTorch, Scikit-learn, Pandas, Numpy, Polars, HuggingFace, OpenCV, Nvidia Rapids, TensorFlow, Keras, Tesseract, Fast.ai.
- Big Data & Cloud Platforms: AWS, GCP, Hadoop Ecosystem, PySpark, Teradata, Neo4j, Airflow.
- DevOps & Deployment Tools: DVC, Git & GitLab CI/CD, Docker, Kubernetes, FastAPI, Flask, AWS Lambda.
- Visualization & BI Tools: Matplotlib, Seaborn, Altair, Streamlit, ggplot2, Tableau, Cognos BI Powerplay, SAP BI Business Objects, Tibco Spotfire, Infor Dynasight.
- ETL & Data Engineering: AWS Glue / Athena Ecosystem, MS SQL Server, GCP Big Query, MSFT Analysis Services, Oracle, Toad, memSQL, PostgreSQL, Teradata, MSFT SS Integration Services.
- Project Management & Collaboration Tools: Microsoft Project & Office, UML.

Technical Certifications
• Onchain Analysis | onchainschool | ongoing
• AI Agents | Hugging Face | ongoing
• Cardano Blockchain Certified Associate (CBCA) | Cardano Academy | 2025
• MLOps Program | FourthBrain.ai | 2022
• Natural Language Processing | DeepLearning.AI | 2021
• OpenCV Computer Vision 2 Applications | OpenCV | 2021
• Machine Learning Scientist Track | Datacamp | 2021
• Data Scientist with Python Track | Datacamp | 2020
• TensorFlow Developer Specialization | DeepLearning.AI | 2020
• OpenCV Computer Vision 1 | OpenCV | 2020
• Deep Learning with PyTorch Nanodegree | Udacity | 2020
• Python 3 Programming Specialization | Coursera University of Michigan | 2020
• Introduction to PyTorch | Udacity | 2019
• Reinforcement Learning Nanodegree | Udacity | 2019
• Deep Learning Specialization | DeepLearning.AI | 2018
• Mathematics for Machine Learning Specialization | Imperial College of London | 2018
• Big Data Specialization | Coursera University of California | 2016
• Data Analysis and Statistical Inference | Coursera Duke University | 2015
• Data Access Layer Modelling and Delivery | Teradata | 2013
• Data Mining with Microsoft SSAS | Intergrupo | 2012
• SAP Business Objects XI 3.x Certified Application Associate | SAP | 2011
• Visual Basic Level 1 | Microsoft | 2003
• Master e-Business Technical Consultant | Microsoft Official Centre | 2001
Management Certifications
• Neuro-Coaching Certification | Leading Brains | 2015
• Prince 2 Foundations | Netmind | 2010
• PMP (Project Management Professional) | Project Management Institute | 2008 – 2024
Other relevant Certifications
• Gut Check: Exploring Your Microbiome | Coursera University of San Diego | 2016
• Gamification | Coursera University of Pennsylvania | 2015
• Financial Consolidation | Centro de Estudios Financieros | 2008
• Postgraduate in Controlling | EADA Business School | 2003

Persönliche Daten

Sprache
  • Spanisch (Muttersprache)
  • Deutsch (Fließend)
  • Englisch (Fließend)
  • Französisch (Grundkenntnisse)
  • Italienisch (Grundkenntnisse)
Reisebereitschaft
Europa
Arbeitserlaubnis
  • Europäische Union
  • Schweiz
Home-Office
bevorzugt
Profilaufrufe
62
Alter
49
Berufserfahrung
23 Jahre und 11 Monate (seit 06/2001)
Projektleitung
6 Jahre

Kontaktdaten

Nur registrierte PREMIUM-Mitglieder von freelance.de können Kontaktdaten einsehen.

Jetzt Mitglied werden