Responsibilities
- Develop, test, debug, and troubleshoot cloud-native infrastructure
- Develop, build, and maintain CI/CD pipelines and automated testing
- Monitor system availability, latency, and overall health
- Provide on-call incident response and change management
Requirements
- Proficiency in one or more programming languages such as Python, Go, Rust, or JavaScript
- Familiarity with Kubernetes APIs and the ability to write CLI tools or Kubernetes controllers that automate operations tasks
- Experience with Linux systems and container orchestration tools (e.g., Docker, Kubernetes, Helm, GitOps, Operators, Terraform)
- Experience with logging and monitoring services (e.g., Prometheus, Grafana, OpenTelemetry, Loki, Fluentd)
- Experience with AWS services (e.g., EC2, S3, IAM, ECR, EKS)
- Comprehensive debugging and troubleshooting skills
How to Apply
While we have a pretty good idea of what we need, we are open to meeting people who can change our minds. If you think you would be a brilliant addition to the team but don't fit the qualifications exactly, we hope you apply! Click here to send us your resume/portfolio or email us at [email protected].