Cloud / DevOps Engineer – Proficient in scripting (Python, Bash, PowerShell); Go / Rust a plus. Strong expertise in Terraform, Terragrunt, Helm, Kubernetes, and Docker.
About Company : Groundup.ai is a Singapore-based AI startup that helps companies reduce unplanned downtime of industrial assets without a huge learning curve and high‑risk deployments.
Job Responsibilities
- Architect and manage scalable, secure infrastructure on GCP, Azure, and occasionally OCI / AWS.
- Implement and manage Infrastructure as Code primarily using Terraform, with occasional Terragrunt and Helm.
- Design and optimize CI / CD workflows using GitHub Actions, Jenkins, and GitHub Enterprise (reusable workflows, OIDC federation).
- Ensure seamless deployment pipelines from code commit to production for microservices and AI workloads.
- Manage Docker containers using tools such as Portainer and Docker images.
- Support canary releases, blue‑green deployments, and auto‑scaling strategies.
- Implement and manage serverless deployments on Google Cloud Platform (Cloud Functions, Cloud Run).
Resource Planning & Hardware Estimation
Assist in hardware estimation for on‑premise and cloud environments based on sensor count and storage needs.Ensure robust backup strategies and data redundancy for all infrastructure.Help audit on‑cloud and on‑premises resources.Security & Compliance
Enforce cloud security best practices : image hardening, secret management, IAM least privilege, SBOMs, and vulnerability scanning.Collaborate on compliance requirements (SOC 2, ISO 27001) and respond to audits and incidents proactively.Configure and manage Cloudflare for enhanced security and performance.Build and maintain observability stacks using Grafana, Prometheus, Loki, Tempo, Datadog, OpenTelemetry, and Sentry.Diagnose and resolve performance bottlenecks across compute, storage, and networking layers.Monitor and optimize cloud spending to ensure cost‑efficiency.Develop and implement disaster recovery plans, conducting regular drills to ensure business continuity.Other Responsibilities
Partner with engineers to embed DevOps best practices.Establish and enforce documentation standards for infrastructure, processes, and troubleshooting guides.Use Plane for sprint planning, incident tracking, and delivery visibility.#J-18808-Ljbffr