Infra Tools SLO team develops SLO reporting software and working with customers to ensure they can make informed development/deployment decisions based on SLIs (Service Level Indicators) that measure service reliability. Our core systems are tasked to collect large amounts of telemetry data, and to compute SLIs described by customers. Our customers iteratively work to improve SLIs to achieve SLOs. We provide a Dashboard and an API for our customers to consume this data.
Minimum Qualifications
Minimum Qualifications
In depth experience writing scalable data driven micro-services and restful-APIs (using OpenAPI) in Java using the Spring Framework. Your experience must include delivering enterprise-level applications with a focus on production reliability. You must have experience improving database performance, and profiling and load testing application performance issues. Experience in writing distributed ETLs or data ingestion systems is preferable.
Key Qualifications
Key Qualifications
Preferred Qualifications
Preferred Qualifications
A self-starter with the initiative to write and review technical designs with an excellent understanding of distributed systems, database systems (not limited to RDMS) and algorithms.
The ability to write clean, readable, maintainable and testable code using best practices and patterns and to give constructive feedback to your colleagues. The ideal candidate will sensibly and incrementally refactor work to improve code quality and pay-off tech debt.
Understand SRE principles, such as observability, SLOs and on-call and maintaining run-books.
Good understanding deploying software on Kubernetes is essential. The ideal candidate will have experience writing infrastructure as code.
Practical experience in DevOps from debugging OS/Networking issues to CI/CD
Using Redis to snapshot data for data processing is preferred
Experience processing data using Apache Spark is a bonus