Description

Databricks & Delta Lake

  • Evaluate hands-on experience with Databricks, including development in PySpark or Spark SQL, efficient use of Delta Lake for scalable data pipelines, and data lineage in Databricks

Azure Data Factory

  • Evaluate good understanding of building and managing ETL pipelines in ADF; using ADF to orchestrate Databricks, Blob Storage, and SAP sources; and monitoring, error handling, and pipeline performance tuning

Performance Optimization

  • Evaluate good understanding of performance optimization on Databricks

Python & PySpark

  • Evaluate good understanding of writing robust, maintainable data processing scripts and of using Python/Spark for custom transformations and integration logic

SAP HANA Expertise

  • Evaluate good understanding of SAP HANA architecture and data modeling, integrating SAP data with other platforms, and handling large-scale SAP data extraction, transformation, and migration

ETL Tools – SAP Data Services

  • Evaluate good understanding of creating, deploying, and optimizing data jobs in SAP BODS/Data Services, working with complex mappings and SAP-specific data types, and handling change data capture (CDC) scenarios

Data Profiling & Validation

  • Evaluate experience in data profiling, validation, and reconciliation during migrations

Azure Cloud & Networking

  • Evaluate good understanding of Azure services for compute, storage, networking, and security; experience resolving firewall, VPN, and VNet issues that affect data pipelines; and familiarity with IAM, RBAC, and secure credential storage

Education

Any Graduate