O pozici
As a Data Engineer at RWS, you will be a key architect of our data's journey. We are currently in an exciting phase of technical evolution, bridging the gap between established on-premises systems and cutting-edge cloud environments. You will play a vital role in managing this hybrid landscape, ensuring that our data flows seamlessly across an extensive on-premise SQL Server estate and a target Google Cloud Platform (GCP) architecture.
You’ll have end-to-end ownership of data pipelines—from the initial ingestion of raw source data to the monitoring of production environments. Your work will directly enable our Analytics and AI teams to build the future of global language services. If you enjoy solving complex "plumbing" problems and building scalable, high-performance data systems, you'll find a home here.
Co budeš dělat
- Architect Hybrid Pipelines: You'll build and manage robust data pipelines that bridge on-premise SQL Server environments and GCP in the cloud.
- Modernize Data Ingestion: You will implement Change Data Capture (CDC) and streaming updates to ensure our cloud data stays synchronized and fresh.
- Optimize for Performance: You’ll design and refine BigQuery architectures, focusing on partitioning and clustering to ensure high-speed, cost-effective querying.
- Orchestrate Workflows: You will use tools like Cloud Composer (Airflow) and Dataform, SSRS and Databricks to schedule and monitor complex, multi-stage data workflows.
- Maintain Data Integrity: You’ll own the health of your pipelines, troubleshooting failures and ensuring data quality across Star and Snowflake schemas.
- Collaborate Across Teams: You’ll act as a technical partner to analysts and business leads, translating their needs into efficient technical requirements.
Koho hledáme
- Proven Data Background: Experience in data engineering or warehousing, specifically owning the full lifecycle of a data pipeline.
- SQL Server Expertise: Mastery of T-SQL (stored procedures, CTEs, tuning) and experience moving data via SSIS or similar tools.
- Cloud Proficiency: Hands-on experience with GCP, particularly BigQuery and Cloud Storage (GCS).
- Python Programming: Strong skills in Python for scripting, API interactions, and data manipulation (Pandas or PySpark).
- Modern Data Modelling: Deep understanding of dimensional modelling and designing scalable schemas.
- Experience with Orchestration & Integration: Familiarity with tools like Cloud Dataflow, Dataproc, or Cloud Composer.
Benefity
- Life at RWS - If you like the idea of working with smart people who are passionate about growing the value of ideas, data and content by making sure organizations are understood, then you’ll love life at RWS.
Our purpose is to unlock global understanding. This means our work fundamentally recognizes the value of every language and culture. So, we celebrate difference, we are inclusive and believe
- RWS embraces DEI and promotes equal opportunity, we are an Equal Opportunity Employer and prohibit discrimination and harassment of any kind. RWS is committed to the principle of equal employment opportunity for all employees and to providing employees with a work environment free of discrimination and harassment. All employment decisions at RWS are based on business needs, job requirements and in