Description

Key Skills: Data Scientist, Python, Hadoop

Roles and Responsibilities:

  • Deep experience in building data science solutions in areas like fraud prevention, forecasting, shrink and waste reduction, inventory management, recommendation, assortment, and price optimization.
  • Deep experience in simultaneously leading multiple data science initiatives end to end - from translating business needs to analytical asks, leading the process of building solutions, and the eventual act of deployment and maintenance of them.
  • Strong experience in machine learning: Classification models, regression models, NLP, forecasting, unsupervised models, optimization, Graph ML, causal inference, causal ML, statistical learning, experimentation, and Gen-AI.
  • In Gen-AI, desirable experience includes embedding generation from training materials, storage and retrieval from Vector Databases, set-up and provisioning of managed LLM gateways, development of retrieval-augmented generation-based LLM agents, model selection, iterative prompt engineering and fine-tuning based on accuracy and user feedback, monitoring, and governance.
  • Ability to scale and deploy data science solutions.

Skills Required:

  • Strong experience with one or more of Python and R.
  • Experience in GCP/Azure.
  • Strong experience in Python, PySpark.
  • Google Cloud Platform, Vertex AI, Kubeflow, model deployment.
  • Strong experience with big data platforms - Hadoop (Hive, Map Reduce, HQL, Scala).
  • Experience with GPU/CUDA for computational efficiency.

Education: Master's with > 12 years OR Ph.D. with > 10 years of relevant experience. Educational qualifications should be Computer Science/Statistics/Mathematics or a related area

Education

Any Graduate