O pozici
The Red Hat IT OpenShift team is looking for an Associate Site Reliability Engineer (SRE) to design, develop, scale, and operate our Red Hat Hybrid OpenShift Platforms (on-prem & cloud). As an Site Reliability Engineer, you will contribute to running Red Hat OpenShift at scale by enabling customer self-service, making our monitoring system more sustainable, and eliminating toil through automation. In the IT OpenShift team you will have the opportunity to influence the complex challenges of scale which are unique to Red Hat IT managed platform services, while using your skills in coding, operations, and large-scale distributed system design. We develop, deploy, and maintain Red Hat's next-generation mission critical platform across hybrid cloud infrastructures. We are a global team operating on-premise and in the public cloud, using the latest technologies from Red Hat and beyond. Red Hat relies on teamwork and openness for its success. We learn from our failures in a blameless environment to support the continuous improvement of the team. At Red Hat, your individual contributions have more visibility than most large companies, and visibility means career opportunities and growth.
Co budeš dělat
- Design, build, and manage our large scale infrastructure and platform services, including public cloud, private cloud, and datacenter-based
- Automate cloud infrastructure through use of technologies (e.g. auto scaling, load balancing, etc.), scripting (python and golang), monitoring and alerting solutions (e.g. Splunk, Splunk IM, Prometheus, Grafana, Catchpoint, DataDog etc)
- Design, develop, and become expert in IT’s Red Hat OpenShift offerings by leveraging emerging industry standards
- Build & support standardized CI/CD platform components using OpenShift Pipelines and Tekton, GitLab to enable multiple application deployments
- Apply Infrastructure as Code methodologies using GitOps practices with ArgoCD for declarative platform management
- Breakdown complex engineering efforts into consumable chunks while working with teams to understand deliverables
- Design and development of software like Kubernetes operators, webhooks, cli-tools
- Implement and maintain intelligent infrastructure and application monitoring designed to enable application engineering teams
- Ensure the production environment is operating in accordance with established procedures and best practices
- Escalate to seniors or team leads to support for high severity and critical platform-impacting events
- Provide feedback around bugs and feature improvements to the various Red Hat Product Engineering teams
- Design software tests and perform peer reviews to increase the quality of our codebase
- Help and develop peers’ capabilities through knowledge sharing, and collaboration
- Participate in a regular on-call schedule, supporting the operation needs of our tenants
- Drive sustainable incident response and contribute to blameless postmortems
- Work within a small agile team to develop and improve SRE methodologies, support your peers, plan and self-improve
Koho hledáme
- 2+ years of experience operating production services on Kubernetes / OpenShift
- 2+ years of programming experience in Python, Go
- 1+ years of experience of using cloud providers and technologies (Google, Azure, Amazon, etc.)
- Solid understanding of Linux systems administration (RHEL/Fedora preferred)
- Understanding of standard networking (TCP/IP, DNS, HTTP/TLS) and authentication protocols (LDAP)
- Comfort with incident response, on-call responsibilities
- Ability to work in a team with minimal supervision while keeping the team informed
Benefity
- Red Hat relies on teamwork and openness for its success. We learn from our failures in a blameless environment to support the continuous improvement of the team.
- At Red Hat, your individual contributions have more visibility than most large companies, and visibility means career opportunities and growth.
- Red Hatters are encouraged to bring their best ideas, no matter their title or tenure.
- We're a leader in open source because of our open and inclusive environment.
- We hire creative, passionate people ready to contribute their ideas, help solve complex problems, and make an impact.
- Red Hat’s culture is built on the open source principles of transparency, collaboration, and inclusion, where the best ideas can come from anywhere and anyone.
- We hope you will join our celebration, and we welcome and encourage applicants from all the beautiful dimensions that compose our global village.