NEORIS is a digital accelerator that helps companies enter the future, with 20 years of experience as a digital partner to some of the largest companies in the world. We have more than 4,000 professionals in 11 countries and a multicultural startup culture in which we cultivate innovation and continuous learning to create high-value solutions for our clients.
We are looking for an experienced Site Reliability Engineer (SRE) to join our team and ensure the reliability, scalability, and performance of our cloud-based infrastructure. The ideal candidate will have strong expertise in Google Cloud Platform (GCP), container orchestration, and infrastructure-as-code, with a focus on automation and observability.
Key Responsibilities
Infrastructure & Cloud Management
- Design, implement, and maintain GCP infrastructure with a focus on reliability and security (GCP expertise is mandatory)
- Manage and optimize Google Kubernetes Engine (GKE) clusters, including upgrades and container troubleshooting
- Implement and maintain Docker containerization strategies
Automation & Deployment
- Develop and maintain GitHub Actions workflows for CI/CD pipelines
- Manage infrastructure as code using Terraform for deployments
- Automate operational processes to improve efficiency and reduce manual intervention
Networking & Security
- Configure and maintain firewall rules, with a focus on network protocols and inbound traffic management
- Implement security best practices across all infrastructure components
- Monitor and optimize network performance
Monitoring & Observability
- Implement and maintain Prometheus for system monitoring and alerting
- Configure and manage Grafana dashboards for system visibility
- Establish SLOs, SLIs, and error budgets for critical services
Collaboration & Best Practices
- Work closely with development teams to improve system reliability and performance
- Participate in incident response and post-mortem analyses
- Document system architecture and operational procedures
Technical Requirements
Must-Have Skills
- Extensive experience with Google Cloud Platform (GCP) services
- Strong knowledge of Kubernetes (GKE) and container orchestration
- Proficiency in Terraform for infrastructure provisioning
- Experience with GitHub Actions or similar CI/CD tools
- Expertise in Docker containerization
- Networking knowledge, including firewall configuration and network protocols
- Monitoring stack experience (Prometheus + Grafana)
Nice-to-Have Skills
- Experience with API security and tokenization
- Knowledge of schema design and database optimization
- Understanding of promotion strategies for canary deployments
- Familiarity with infrastructure cost optimization
Soft Skills
- Strong problem-solving and troubleshooting abilities
- Excellent communication skills for collaborating across teams
- Proactive approach to identifying and addressing potential issues
- Ability to document technical processes clearly
We offer:
Come and meet us at http://www.neoris.com, or on Facebook, LinkedIn, Twitter, or Instagram @NEORIS.
Marina Molina