Key Skills: Data Scientist, Python, Hadoop
Roles and Responsibilities:
- Deep experience in building data science solutions in areas like fraud prevention, forecasting, shrink and waste reduction, inventory management, recommendation, assortment, and price optimization.
- Deep experience in simultaneously leading multiple data science initiatives end to end - from translating business needs to analytical asks, leading the process of building solutions, and the eventual act of deployment and maintenance of them.
- Strong experience in machine learning: Classification models, regression models, NLP, forecasting, unsupervised models, optimization, Graph ML, causal inference, causal ML, statistical learning, experimentation, and Gen-AI.
- In Gen-AI, desirable experience includes embedding generation from training materials, storage and retrieval from Vector Databases, set-up and provisioning of managed LLM gateways, development of retrieval-augmented generation-based LLM agents, model selection, iterative prompt engineering and fine-tuning based on accuracy and user feedback, monitoring, and governance.
- Ability to scale and deploy data science solutions.
Skills Required:
- Strong experience with one or more of Python and R.
- Experience in GCP/Azure.
- Strong experience in Python, PySpark.
- Google Cloud Platform, Vertex AI, Kubeflow, model deployment.
- Strong experience with big data platforms - Hadoop (Hive, Map Reduce, HQL, Scala).
- Experience with GPU/CUDA for computational efficiency.
Education: Master's with > 12 years OR Ph.D. with > 10 years of relevant experience. Educational qualifications should be Computer Science/Statistics/Mathematics or a related area