
Senior Data Engineer/Cloud Data Architect
- Verfügbarkeit einsehen
- 0 Referenzen
- auf Anfrage
- 89073 Ulm
- Europa
- en | hi
- 19.03.2023
Kurzvorstellung
Qualifikationen
Projekt‐ & Berufserfahrung
1/2022 – offen
Tätigkeitsbeschreibung
* Key contributor for data and analytics platform at Leibherr.
* Mentored a team of 5 data engineers, and collaborated with product manager and product
owners to shape the correct scope, requirements & priorities.
* Designed and developed data management framework and strategies for data quality, metadata
management, data modeling, and data architectures.
* Designed and implemented batch and streaming data pipelines using Databricks Spark(PySpark
and Spark-SQL).
* Implemented performance tuning of distributed data processing and data load optimizations.
* Developed modern data warehouse solutions using Azure Stack (Azure Data Lake, Azure
Databricks, Delta Lake house).
* Developed data quality rules and validations using Spark/ Python to ensure that data is error-free.
* Implemented access control, cluster management, jobs orchestration in Azure Databricks.
* Documented and presented data architectures and designs to project leaders and stakeholders.
Apache Spark, Azure Databricks, Big Data, Cloud Computing, Datawarehouse / DWH, ETL
5/2017 – 12/2021
Tätigkeitsbeschreibung
* Worked in artificial intelligence team with data scientists and data engineers, with the key
responsibilities of designing and building data driven applications.
* Own the delivery of the project from the requirements gathering step to eventual production
deployment and stakeholder management.
* Developed distributed applications and data pipelines using technologies like Azure, Python,
Spark, Kafka etc.
* Implemented data preparation and business transformations using Spark SQL and Pandas.
* Implemented Hive, Data Lake and NoSQL based storages.
* Implemented loading, processing and storage of data with Delta lake and Pyspark in Azure Databricks platform.
* Implemented software and data engineering best practices (CI/CD, testing, data quality, etc)
* Worked in Agile environment, and used Jira tool to maintain the user stories and tasks
Apache Hadoop, Apache Spark, Azure Databricks, Big Data, Python, SQL
8/2011 – 7/2014
Tätigkeitsbeschreibung
* Key contributor for the Cisco Intellectual Management Platform.
* Designed and Developed a Rule Based Intellectual Capital (RBIC) : A web based application
using Java tech stack which helps to capture, share and re-use of expert knowledge (IC).
* Designed and developed web components and backend using Java/JavaScript technologies and RESTful web services.
* Developed web api's using Spring MVC and ORM in hibernate.
* Implemented software engineering best practices- unit testing, integration testing, code reviews.
* Followed agile methodologies (Scrum) and built production quality software.
Agile Entwicklung, Java (allg.), JavaScript, Software Architektur / Modellierung, Software Design, Software engineering / -technik, Spring
Zertifikate
Ausbildung
(M.Sc.)
Ort: Technische Universität Darmstadt
(B.Sc.)
Ort: Baddi, India
Über mich
Designing and development of distributed applications/data pipelines/ETL.
Building modern data warehouses with Databricks, Spark, Azure.
Solid understanding on Data Management, Data Architectures, Data Science, and Cloud.
Expertise in Spark and Databricks.
Development of operational execelence monitoring, security and scalability.
Experienced in Agile Software Development and Delivery.
Weitere Kenntnisse
Data Storages: HDFS, S3, Blob Storage, Delta Lakehouse, Azure Data Lake, NoSQL Databases
Data Solutions: Date migration, Date integration, ETL, ELT, Data Warehouse, Data Modeling, Data Architectures
Distributed Platforms: PySpark, Spark Streaming, Batch Jobs, Kafka
Compute: Azure Kubernetes, Docker, Azure Databricks, AWS, Google
Operating Systems: Linux, Unix, Windows
Devops: GIT, Jenkins, Terraform
Persönliche Daten
- Englisch (Fließend)
- Hindi (Muttersprache)
- Europäische Union
Kontaktdaten
Nur registrierte PREMIUM-Mitglieder von freelance.de können Kontaktdaten einsehen.
Jetzt Mitglied werden