O pozici
Do you enjoy collaborating with teams to solve complex challenges?
Do you have a passion for cutting edge technologies?
Join our highly skilled Site Reliability Engineering team!
Our team designs, develops, and manages applications and infrastructure that support Akamai Cloud's products and services. Our SRE teams solve reliability, security, and usability at scale for our global fleet while maintaining Akamai's mission at the forefront of what we do: make life better for billions of people, billions of times a day.
Co budeš dělat
- Designing, developing, testing, and operating essential services to enhance the reliability, scalability, and performance of infrastructure systems.
- Designing and implementing observability solutions, including monitoring, logging, alerting, and telemetry, to identify and address issues before customer impact.
- Enhancing reliability via automation, minimizing operational toil, and boosting resilience within engineering processes.
- Developing extensive expertise in ACDC systems and acting as a reliable resource, guiding engineers and sharing effective practices team-wide.
- Collaborating closely with software engineering, infrastructure, and platform teams to resolve complex production issues, determine root causes, and implement solutions.
- Participating in an on-call rotation and delivering technical expertise during incidents, ensuring prompt restoration, clear communication, and post-incident enhancements.
Koho hledáme
- Demonstrate expertise in Ansible through playbook development, role creation, automation workflows, and enterprise-scale configuration management processes.
- Manage Infrastructure as Code solutions utilizing tools like Terraform, SaltStack, Ansible, Chef, Puppet, or comparable technologies effectively and efficiently.
- Design, develop, and deploy software and infrastructure at scale within a Linux environment with advanced-level expertise.
- Demonstrate advanced experience in a site reliability or software engineering role, working with large-scale distributed systems.
- Have great communication and interpersonal skills
- Demonstrate accountability for reliability, develop automation and monitoring, and collaborate effectively with an engineering team unfamiliar with SRE practices.
Benefity
- Benefits at Akamai: We support your health, well-being, finances, and life beyond work. See our benefits.
- Akamai's FlexBase program is yet another way we show our commitment to providing employees with an exceptional workplace experience. It's not about telling employees where to work; it's about supporting employees to do their best work.
We trust our incredible employees to work in ways that suit them best: at home, in an office, or a combination of both.