
Machine Learning - Data Engineering - Data Science - MLOps
- 2 references
- on request
- 08960 SANT JUST DESVERN
- Europe
- es | de | en | fr | it
- 15.05.2025
Brief introduction
- Trustful
- Continuous Learner
Reference excerpts (2)
"It was very good working together with F.. He helped us building an MVP. Verifying technologies and building a larger application. Great collab!"
8/2022 – 11/2022
Job description
Data engine that automates the analysis of key market aspects around selected trending
topics.
In GCP, a scheduled job runs parameterized Twitter queries and saves the results to BigQuery.
The data is processed with NLP techniques, and the key topics are selected algorithmically.
The topics are then plotted as a word cloud, which users access through an API
deployed in GCP. The engine runs in a Docker container and is scheduled to run daily.
Designed and developed a Python-based data pipeline to automate the analysis of
trending technology topics, using NLP for text processing and BigQuery for data
storage.
Containerized and deployed the solution in GCP, enabling daily automated
summaries and delivering visual insights through an API-accessible word cloud.
Collaborated with the IT Innovation team lead to provide actionable insights, overcoming
the challenges of processing unstructured Twitter data accurately.
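The topic-selection step described above can be illustrated with a minimal sketch. The profile does not say which algorithm was used, so this example substitutes plain document-frequency counting. The names `tokenize`, `key_topics`, `STOP_WORDS` and `SAMPLE_TWEETS` are hypothetical, and the sample tweets are invented for illustration. The output is a list of (term, weight) pairs of the kind a word-cloud renderer typically takes as input.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stop-word list (an assumption, not from the project).
STOP_WORDS = {"the", "a", "an", "and", "or", "of", "to", "in",
              "is", "for", "on", "with", "are", "this"}


def tokenize(text):
    """Lowercase a tweet and return word tokens without URLs, mentions or stop words."""
    text = re.sub(r"https?://\S+|@\w+", " ", text.lower())
    return [t for t in re.findall(r"[a-z][a-z0-9]+", text) if t not in STOP_WORDS]


def key_topics(tweets, top_n=5):
    """Return the top_n terms by document frequency, with weights scaled to (0, 1]."""
    counts = Counter()
    for tweet in tweets:
        # Count each term at most once per tweet so one long tweet cannot dominate.
        counts.update(set(tokenize(tweet)))
    if not counts:
        return []
    top = counts.most_common(top_n)
    max_count = top[0][1]
    return [(term, count / max_count) for term, count in top]


# Invented sample data, for illustration only.
SAMPLE_TWEETS = [
    "Edge computing is the future of IoT https://example.com",
    "@someone Edge AI and IoT sensors are everywhere",
    "Quantum computing for IoT security",
]

if __name__ == "__main__":
    for term, weight in key_topics(SAMPLE_TWEETS, top_n=3):
        print(f"{term}: {weight:.2f}")
```

In a setup like the one described, the tweets would be read from BigQuery rather than from an in-memory list, and the weights would be passed to whatever component renders the word cloud.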
Natural Language Processing, Text Extraction, API Developer, Docker
"F. B. built AWS pipelines for Hotusa’s bed bank, processing 100B+ daily transactions with full traceability and monitoring."
9/2020 – 11/2023
Job description
PySpark processes
Automation of custom dashboards
Data engines in Docker on AWS
Data warehouse solutions in AWS Athena
Dashboard development
Projects:
- Log Server Analysis Issue Detection @ Hotusa. 2023/07 – 2023/11
Developed a robust data pipeline to unify diverse request and error logs, enabling the efficient computation of metrics such as mean response time, error rates, and request volumes across multiple granular dimensions.
The team built a dashboard with these data that empowered channel agents to identify integration issues with associates early, significantly reducing resolution times. Leveraged PySpark, AWS Athena, and AWS Glue for scalable data processing and analytics.
- Direct Channel Dashboard Automation @ Hotusa. 2023/02 – 2023/04 (Barcelona)
Built and deployed a robust data pipeline using PySpark, Athena, and Docker on AWS to integrate booking data and web logs for comprehensive analysis.
Defined dashboard requirements and supervised the development of a dashboard to be sent to hundreds of customers delivering monthly KPIs tailored to customer needs.
Ensured the successful delivery of the project by overseeing all stages, both technically and organizationally.
- Rate Controller Phase 2 @ Hotusa. 2021/12 – 2022/03
Used PySpark to significantly extend the calculations and populate the datamart. See the Rate Controller System Dashboard project below. (AWS SageMaker, AWS Glue, AWS Athena)
- Rate Controller System Dashboard @ Hotusa. 2020/10 – 2021/03
Designed and developed a near real-time processing engine to monitor price ratings and optimize market rates by ingesting over 300 million records daily. Implemented business rule processing using PySpark on AWS Glue, with data storage and querying facilitated through AWS Athena (Hive). The solution delivered actionable insights to support dynamic pricing strategies at scale.
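The metrics named in the Log Server Analysis project above (request volume, error rate and mean response time per dimension) can be illustrated with a small sketch. The project itself used PySpark, AWS Glue and AWS Athena. This example swaps in plain Python so that it runs without a cluster. The record fields, the `aggregate_logs` helper and the rule that a status of 500 or higher counts as an error are all assumptions made for illustration.

```python
from collections import defaultdict


def aggregate_logs(records, dimensions):
    """Group log records by the given dimension keys and compute per-group metrics."""
    groups = defaultdict(lambda: {"requests": 0, "errors": 0, "total_ms": 0.0})
    for rec in records:
        key = tuple(rec[d] for d in dimensions)
        group = groups[key]
        group["requests"] += 1
        # Assumption: HTTP status >= 500 marks a failed request.
        if rec["status"] >= 500:
            group["errors"] += 1
        group["total_ms"] += rec["response_ms"]
    return {
        key: {
            "requests": g["requests"],
            "error_rate": g["errors"] / g["requests"],
            "mean_response_ms": g["total_ms"] / g["requests"],
        }
        for key, g in groups.items()
    }


# Invented sample records, for illustration only.
SAMPLE_LOGS = [
    {"partner": "A", "endpoint": "/search", "status": 200, "response_ms": 100},
    {"partner": "A", "endpoint": "/search", "status": 500, "response_ms": 300},
    {"partner": "B", "endpoint": "/book", "status": 200, "response_ms": 50},
]

if __name__ == "__main__":
    for key, metrics in aggregate_logs(SAMPLE_LOGS, ["partner", "endpoint"]).items():
        print(key, metrics)
```

In PySpark the same computation would typically be a `groupBy` on the dimension columns followed by an aggregation. The grouping key plays the role of the granular dimensions mentioned in the project description.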
Amazon Web Services (AWS), Big Data, Business Intelligence (BI), Data Warehousing, Docker, Python
Qualifications
Project & professional experience
6/2021 – 9/2021
Job description
The project defined an architecture and executed an end-to-end proof of concept, ready for real deployments, on top of hierarchical data generated by a file system crawler. Python and recursive SQL scripts (CTEs) were used for ETL, and PostgreSQL was the RDBMS used to store the datamart. Each layer was deployed with Docker.
This analytical system was integrated into a product and has already been successfully deployed in production. The proof of concept included a dashboard built with Power BI Desktop. At the time of writing, the production version of the dashboard has been deferred to the next phase because Power BI was not a ready-to-go solution for all customers.
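A recursive CTE over hierarchical file-system data, as mentioned above, might look like the following sketch. The project used PostgreSQL. This example uses Python's built-in `sqlite3` module as a stand-in so that it runs without a database server, and the `WITH RECURSIVE` syntax shown is accepted by both engines. The `entries` table, its columns and the sample rows are hypothetical.

```python
import sqlite3

# Hypothetical crawler output: (id, parent_id, name, size_bytes); the root has no parent.
ROWS = [
    (1, None, "root", 0),
    (2, 1, "docs", 0),
    (3, 2, "a.txt", 10),
    (4, 2, "b.txt", 20),
    (5, 1, "c.bin", 5),
]

# Walk from the starting entry down through all of its descendants and sum their sizes.
SUBTREE_SIZE_SQL = """
WITH RECURSIVE subtree(id, size_bytes) AS (
    SELECT id, size_bytes FROM entries WHERE id = ?
    UNION ALL
    SELECT e.id, e.size_bytes
    FROM entries AS e
    JOIN subtree AS s ON e.parent_id = s.id
)
SELECT COALESCE(SUM(size_bytes), 0) FROM subtree
"""


def build_db(rows):
    """Create an in-memory database holding the hierarchical entries."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE entries ("
        "id INTEGER PRIMARY KEY, parent_id INTEGER, name TEXT, size_bytes INTEGER)"
    )
    conn.executemany("INSERT INTO entries VALUES (?, ?, ?, ?)", rows)
    return conn


def subtree_size(conn, entry_id):
    """Return the total size in bytes of an entry and everything beneath it."""
    return conn.execute(SUBTREE_SIZE_SQL, (entry_id,)).fetchone()[0]


if __name__ == "__main__":
    conn = build_db(ROWS)
    print("docs subtree size:", subtree_size(conn, 2))
```

Aggregates of this kind, computed per directory, are one plausible way to populate a datamart from crawler output. The actual metrics used in the project are not stated in the profile.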
PostgreSQL, Power BI, Python Programmer, SQL
4/2021 – open
Job description
Deep Learning & Classical Machine Learning
Computer Vision & NLP
Data Engineering
MLOps
Python, Computer Vision, Data Engineer, Data Science, Large Language Models, Machine Learning Engineer, MLOps, Natural Language Processing
8/2016 – 2/2020
Job description
Data Science & Machine Learning projects
Skills used: Python, PyTorch, R (programming language), Tableau
Education
Northwestern University
Online
Northwestern University
Online
Universitat Oberta de Catalunya
Online
Universitat Pompeu Fabra
Barcelona
About me
Additional skills
- Machine Learning & Data Science Frameworks: PyTorch, Scikit-learn, Pandas, NumPy, Polars, Hugging Face, OpenCV, NVIDIA RAPIDS, TensorFlow, Keras, Tesseract, fast.ai.
- Big Data & Cloud Platforms: AWS, GCP, Hadoop Ecosystem, PySpark, Teradata, Neo4j, Airflow.
- DevOps & Deployment Tools: DVC, Git & GitLab CI/CD, Docker, Kubernetes, FastAPI, Flask, AWS Lambda.
- Visualization & BI Tools: Matplotlib, Seaborn, Altair, Streamlit, ggplot2, Tableau, Cognos BI Powerplay, SAP BI Business Objects, Tibco Spotfire, Infor Dynasight.
- ETL & Data Engineering: AWS Glue / Athena Ecosystem, MS SQL Server, GCP BigQuery, Microsoft Analysis Services, Oracle, Toad, MemSQL, PostgreSQL, Teradata, Microsoft SQL Server Integration Services.
- Project Management & Collaboration Tools: Microsoft Project & Office, UML.
Technical Certifications
• Onchain Analysis | onchainschool | ongoing
• AI Agents | Hugging Face | ongoing
• Cardano Blockchain Certified Associate (CBCA) | Cardano Academy | 2025
• MLOps Program | FourthBrain.ai | 2022
• Natural Language Processing | DeepLearning.AI | 2021
• OpenCV Computer Vision 2 Applications | OpenCV | 2021
• Machine Learning Scientist Track | Datacamp | 2021
• Data Scientist with Python Track | Datacamp | 2020
• TensorFlow Developer Specialization | DeepLearning.AI | 2020
• OpenCV Computer Vision 1 | OpenCV | 2020
• Deep Learning with PyTorch Nanodegree | Udacity | 2020
• Python 3 Programming Specialization | Coursera University of Michigan | 2020
• Introduction to PyTorch | Udacity | 2019
• Reinforcement Learning Nanodegree | Udacity | 2019
• Deep Learning Specialization | DeepLearning.AI | 2018
• Mathematics for Machine Learning Specialization | Imperial College of London | 2018
• Big Data Specialization | Coursera University of California | 2016
• Data Analysis and Statistical Inference | Coursera Duke University | 2015
• Data Access Layer Modelling and Delivery | Teradata | 2013
• Data Mining with Microsoft SSAS | Intergrupo | 2012
• SAP Business Objects XI 3.x Certified Application Associate | SAP | 2011
• Visual Basic Level 1 | Microsoft | 2003
• Master e-Business Technical Consultant | Microsoft Official Centre | 2001
Management Certifications
• Neuro-Coaching Certification | Leading Brains | 2015
• Prince 2 Foundations | Netmind | 2010
• PMP (Project Management Professional) | Project Management Institute | 2008 – 2024
Other relevant Certifications
• Gut Check: Exploring Your Microbiome | Coursera University of San Diego | 2016
• Gamification | Coursera University of Pennsylvania | 2015
• Financial Consolidation | Centro de Estudios Financieros | 2008
• Postgraduate in Controlling | EADA Business School | 2003
Personal details
- Spanish (native)
- German (fluent)
- English (fluent)
- French (basic)
- Italian (basic)
- European Union
- Switzerland