Work with Azure Synapse Analytics using PySpark for data processing and analytics.
Develop and manage Synapse Pipelines, Data Flows, and Data Integration tasks.
Utilize Azure Synapse Studio for data development and orchestration.
Perform data transformation and ETL/ELT processes using Synapse Pipelines, Data Factory, and PySpark.
Integrate Azure Synapse Analytics with other Azure data services such as Azure Data Lake Storage, Azure Data Factory, Azure Databricks, and Azure SQL Database.
Ingest data from sources such as Azure Blob Storage, Azure Data Lake Storage, and on-premises databases.
Use Synapse SQL and Spark SQL for data analysis and transformation.
Automate tasks using PowerShell or other scripting languages.
Troubleshoot and resolve issues in data pipelines and workflows.
Debug data processing and transformation problems.
Qualifications
5 to 7 years of experience with Azure Synapse Analytics and PySpark.
Hands-on experience using Azure Synapse Studio with PySpark.
Experience working with Spark as integrated in Azure Synapse Analytics.
Proficiency in Python for data processing using PySpark.
Strong experience integrating with Azure data services such as Azure Data Lake Storage, Azure Data Factory, Azure Databricks, and Azure SQL Database.
Strong SQL skills for querying and managing relational databases.
Experience in performance tuning and optimization of SQL queries.
Solid understanding of database design principles and normalization.
Familiarity with big data processing using Apache Spark.
Experience with scripting languages such as PowerShell for automation.
Hands-on experience in migrating data from Oracle to PostgreSQL.