IoT Site Reliability Engineer
About the Company :
Talent Insider is an up-and-coming HR consultancy founded in 2021. Our clients include some of the leading brands in Indonesia, and the service continues to expand. Registered in Singapore and Indonesia, we can assist with your growth plans and strategies, and we continue to expand our regional presence with strong regional partners to support our clients' recruitment and branding strategies.
Job Description :
- Implement and maintain best practices for system reliability, availability, and scalability while minimizing downtime and disruptions for both software and hardware systems
- Develop and enhance automation tools and scripts for system monitoring, deployment, and recovery to streamline operational processes
- Identify and resolve performance bottlenecks, proactively optimizing system components to ensure optimal response times and resource utilization
- Participate in on-call rotations to respond to and resolve system incidents promptly and efficiently, ensuring minimal impact on end users. Site visits and on-site debugging will be needed.
- Use Infrastructure as Code tools to manage and version infrastructure, making it more predictable and reproducible
- Set up and maintain robust monitoring, alerting, and logging systems to detect and mitigate issues before they impact the user experience
Job Requirements :
- Bachelor's degree in a technical or scientific field such as Software Engineering, Computer Science, Electrical Engineering, or IT preferred
- Minimum 4 years of proven experience as a Site Reliability Engineer or in a similar role
- Proficiency in scripting and automation with languages such as Python and Bash
- Familiarity with cloud platforms (e.g., AWS, Azure, GCP)
- Strong knowledge of containerization and orchestration technologies (e.g., Docker, Kubernetes)
- Experience with Infrastructure as Code tools (e.g., Terraform, Ansible)
- Solid understanding of monitoring tools and practices
- Knowledge of security best practices and incident response
- Experience with and knowledge of IoT (e.g., sensors, Raspberry Pi, device management)