Description

We are seeking a talented and motivated Radarlive Platform Engineer (RPE) to join our technology team. The RPE will work closely with the software engineering teams that develop Radar models and with IT operations teams to enhance the reliability, performance, and scalability of our systems. The ideal candidate will bring a blend of programming, system administration, and operational insight to help optimize our service infrastructure and drive the adoption of DevOps practices.


Key Responsibilities:

System Design and Architecture: Design, implement, and manage scalable and reliable Radarlive platforms and associated infrastructure components and services.
Monitoring and Optimization: Develop and implement monitoring solutions to ensure system reliability, performance, and availability. Use metrics and logs to identify and resolve issues proactively.
Incident Management: Act as a primary point of contact for incidents; lead troubleshooting efforts and post-mortem analyses to drive continual service improvements.
Automation: Create and maintain automation scripts for repetitive tasks, including deployment and configuration management.
Collaboration: Work closely with development teams to facilitate efficient deployment processes and support new software releases with a focus on reliability and performance.
Documentation: Maintain clear documentation of architectures, processes, and runbooks to facilitate consistent operations and knowledge sharing.
Security Practices: Advocate for and implement security best practices in all aspects of infrastructure and application deployment.
Capacity Planning: Analyze traffic patterns and resource usage to help forecast future capacity needs and optimize resource allocation.


Qualifications:

Bachelor’s degree in Computer Science, Information Technology, Engineering, or a related field.
5+ years of experience in systems engineering, site reliability engineering, or a similar role.
Proficiency with the Azure cloud platform and its services.
Experience managing rating platforms, preferably Radarlive.
Strong experience with application SRE practices.
Knowledge of containerization technologies (e.g., Docker, Kubernetes) and orchestration tools.
Proficient in at least one programming/scripting language (e.g., Python, Java, PowerShell).
Experience with configuration management tools (e.g., Ansible).
Familiarity with CI/CD and version control systems (e.g., GitHub, Jenkins).
Strong problem-solving skills and the ability to perform well under pressure.
Excellent verbal and written communication skills, with an emphasis on collaboration.


Preferred Qualifications:

Experience managing application platforms, including commercial off-the-shelf (COTS) products.
Experience with observability tools (e.g., Prometheus, Grafana, ELK Stack).
Knowledge of networking concepts, load balancing, and database management.
Familiarity with Agile and DevOps methodologies.

Education

Any Graduate