Have you implemented CDC (Change data capture)?
What will be your approach to implement that in Python & Pyspark...
If no experience in CDC - Do you know what Cluster of Memory Issue is
If a Database not supported by ADF, how will you load it?
ADF-
Limitations in ADF (considering Foreach If activity etc.)
Experience on ADF activities - Lookup, Foreach, If Activity, Switch, Metadata, Web Activity
Experience on Data Flow in ADF
How to setup Self Hosted integration runtime
How to configure the Integration runtime to handle the multiple pipeline/activity executions at a time
SQL -
Performance Tuning
Use of indexes, Partitioning
Experience on Stored procedure and complex queries using analytical functions
Custom coding -
1. Use of Azure functions and how it works
2. CDC implementation if not supported in ADF
3. Experience on PySpark and Databricks
4. Coding standards to avoid "time out" and "out of memory" issue on Databricks cluster
Any Gradute