About the role
The Advanced Scientific Compute (ASC) team builds and operates the cloud HPC platform that underpins computational research at Our Company. AI/ML workloads are rapidly migrating to the cloud, and this role exists to accelerate that transition: it brings dedicated AI/ML expertise into the HPC engineering team to strengthen integrations with AI pipelines, enable agentic AI capabilities for self-service, and drive better insights from scientific compute data.
What you will do
- Design, deploy, and operate cloud HPC infrastructure on AWS (ParallelCluster, Slurm, S3, networking) supporting scientific research workloads.
- Lead integration of HPC environments with AI/ML pipelines — enabling training, inference, and orchestration workloads to run efficiently alongside traditional HPC jobs.
- Architect and implement agentic AI capabilities (e.g., LLM-based self-service tooling, automated job management) to improve researcher experience and reduce manual support overhead.
- Contribute to platform observability, cost optimisation, and capacity planning for cloud HPC environments.
- Partner with HPC Application Support and Client Support Engineering to translate researcher needs into platform improvements.
- Support onboarding and enablement of research teams adopting cloud-native and AI-augmented workflows.
- Participate in the on-call rotation and incident response for HPC platform availability.
Who we are looking for
- 5+ years of cloud engineering experience, with hands-on AWS expertise (EC2, S3, VPC, IAM, EFS/FSx).
- Demonstrated experience with HPC environments — job schedulers (Slurm, PBS, or equivalent), parallel filesystems, MPI workloads.
- Proficiency in infrastructure-as-code (Terraform or CloudFormation) and scripting (Python, Bash).
- Experience integrating or operating ML workloads in cloud environments (training pipelines, model serving, batch inference).
- Strong systems thinking — able to diagnose performance, networking, and storage bottlenecks in distributed compute environments.
- Comfortable working in a regulated, enterprise environment with change management processes.
Benefits
- Exciting work in a great team, global projects, international environment.
- Opportunity to learn and grow professionally within the company globally.
- Hybrid working model and flexible working pattern (e.g., an 80% part-time arrangement is possible in justified cases).
- Pension and health insurance contributions.
- Internal reward system plus referral programme.
- 5 weeks of annual leave, 5 sick days, 15 days per year of certified sick leave paid above statutory requirements, 40 paid hours per year for volunteering activities, 12 weeks of parental contribution.
- Cafeteria programme with tax-free benefits of your choice (meal vouchers, sport, culture, health, travel, etc.), plus a Multisport Card.
- Vodafone, Raiffeisen Bank and Foodora discount programmes.
- Up-to-date laptop and iPhone.
- Parking in the garage, showers, refreshments, massage chairs, library, music corner.
- Competitive salary, incentive pay, and much more.