About the position
As a Data Engineer at RWS, you will be a key architect of our data's journey. We are currently in an exciting phase of technical evolution, bridging the gap between established on-premises systems and cutting-edge cloud environments. You will play a vital role in managing this hybrid landscape, ensuring that our data flows seamlessly between an extensive on-premises SQL Server estate and a target Google Cloud Platform (GCP) architecture.
You’ll have end-to-end ownership of data pipelines—from the initial ingestion of raw source data to the monitoring of production environments. Your work will directly enable our Analytics and AI teams to build the future of global language services. If you enjoy solving complex "plumbing" problems and building scalable, high-performance data systems, you'll find a home here.
What you will do
- Architect Hybrid Pipelines: You'll build and manage robust data pipelines that bridge on-premises SQL Server environments and GCP.
- Modernize Data Ingestion: You will implement Change Data Capture (CDC) and streaming updates to ensure our cloud data stays synchronized and fresh.
- Optimize for Performance: You’ll design and refine BigQuery architectures, focusing on partitioning and clustering to ensure high-speed, cost-effective querying.
- Orchestrate Workflows: You will use tools such as Cloud Composer (Airflow), Dataform, SSRS, and Databricks to schedule and monitor complex, multi-stage data workflows.
- Maintain Data Integrity: You’ll own the health of your pipelines, troubleshooting failures and ensuring data quality across Star and Snowflake schemas.
- Collaborate Across Teams: You’ll act as a technical partner to analysts and business leads, translating their needs into efficient technical requirements.
Who we are looking for
- Proven Data Background: Experience in data engineering or warehousing, specifically owning the full lifecycle of a data pipeline.
- SQL Server Expertise: Mastery of T-SQL (stored procedures, CTEs, tuning) and experience moving data via SSIS or similar tools.
- Cloud Proficiency: Hands-on experience with GCP, particularly BigQuery and Cloud Storage (GCS).
- Python Programming: Strong skills in Python for scripting, API interactions, and data manipulation (Pandas or PySpark).
- Modern Data Modelling: Deep understanding of dimensional modelling and designing scalable schemas.
- Orchestration & Integration: Familiarity with tools like Cloud Dataflow, Dataproc, or Cloud Composer.
Benefits
- Life at RWS - If you like the idea of working with smart people who are passionate about growing the value of ideas, data and content by making sure organizations are understood, then you’ll love life at RWS.
- Our purpose is to unlock global understanding. This means our work fundamentally recognizes the value of every language and culture. So, we celebrate difference, we are inclusive and believe that diversity makes us strong. We want every employee to grow as an individual and excel in their career.
- In return, we expect all our people to live by the values that unite us: to partner, putting clients first and winning together; to pioneer, innovating fearlessly and leading with vision and courage; to progress, aiming high and growing through actions; and to deliver, owning the outcome and building trust with our colleagues and clients.