Skills Required:
Design, develop, and maintain scalable ETL/ELT data pipelines for AI and analytics workloads.
Data Modeling and design experience using ER, structured and NoSQL schema definitions
Work with structured and unstructured data, ensuring efficient storage, retrieval, and processing.
Implement real-time and batch data processing architectures using tools like Apache Spark, Kafka, and SnowFlake.
Collaborate with Data Scientists and ML Engineers to optimize and deploy AI/ML models in production.
Develop and manage data platform and data warehouse solutions on cloud platforms preferably Azure.
Implement AI/GenAI-powered data transformation processes to enhance data quality and insights.
Design and manage popular GenAI models, Agentic workflows and frameworks such as LangChain, LangGraphs or Llama Index.
Ensure data security, compliance, and governance best practices are followed.
Automate data workflows using CI/CD pipelines and Infrastructure-as-Code (IaC).
Monitor, troubleshoot, and optimize data infrastructure for performance and reliability
Any Graduate