Description

Responsibilities / Duties:

  • Training AI models using OCR and LLMs
  • Must have hands-on experience fine-tuning LLMs on custom datasets and training complex models such as deep learning models
  • Follow the AWS Well-Architected Framework
  • Follow internal best practices for code monitoring and testing
  • Ensure data confidentiality and HIPAA compliance
  • Collaborate with other data scientists and divide work to finish projects on schedule
  • Meet deadlines for weekly/bi-weekly meetings
  • Collaborate with software engineers to deploy models in production
  • Test models rigorously to ensure accuracy
  • Create visualizations to communicate results to non-technical stakeholders
  • QA/Application testing
  • Test and implement NER models
  • Compare feasibility of different models

Skills:

  • Big data
  • Data cleaning
  • Python
  • Experience training and fine-tuning complex models
  • AWS
  • SageMaker
  • Bedrock
  • Lambda
  • S3
  • API Gateway
  • Textract API
  • CI/CD
  • Version control
  • Jenkins
  • SQL
  • OCR
  • NER models
  • LLMs
  • Fine-tuning (Mistral, Llama, and other open-source models)
  • Prompt tuning
  • Familiarity with Hugging Face packages
  • Computer vision models for object detection and segmentation

Education

Any Graduate