About the role
Ruby Labs is a leading tech company that creates and operates innovative consumer products. We offer a diverse range of opportunities across the health, education, and entertainment industries. Our teams are driving the future of consumer-led products, and we're always looking for passionate individuals to join us. Learn more about our story at https://rubylabs.com/about-us/
What you'll do
- Design and build event-driven, real-time data pipelines on ClickHouse, Google Cloud, and Airflow to power dashboards and self-serve analytics.
- Optimize ClickHouse schemas, materialized views, and queries for performance, correctness, and cost.
- Model core financial datasets for payments and subscriptions: authorizations, declines, refunds, chargebacks, disputes, dunning, MRR/ARR, LTV, churn, cohort retention.
- Own data quality and observability: tests, monitoring, alerting, lineage, and SLAs for freshness and correctness on Tier-1 datasets.
- Investigate anomalies and data-quality issues in financial data, perform root-cause analysis, and drive fixes end-to-end (instrumentation → pipeline → metric).
- Partner with Backend/Platform on event schemas and instrumentation to ensure high-quality, well-typed billing and payments events.
- Document logic (metric definitions, tables, pipeline behavior) and build internal tooling/templates that let analysts ship safely on top of the platform.
Who we're looking for
- Production experience with ClickHouse is mandatory: data modeling (MergeTree family, projections, materialized views), query tuning, partitioning/sharding, and operational awareness.
- Strong experience designing event-driven, real-time analytics pipelines (Kafka / Pub/Sub / Kinesis or equivalent), including schema design, backfills, and replay.
- Hands-on experience with the Google Cloud data stack (Pub/Sub, GCS, BigQuery, Cloud Run / GKE, IAM) for ingestion, storage, and orchestration.
- Production experience with Apache Airflow (DAG design, sensors, retries, SLAs, idempotent tasks, incremental loads).
- Advanced SQL (complex joins, window functions, incremental logic, performance-aware query writing).
- Strong Python for data engineering (pipelines, transformations, tests, tooling).
- Git workflow (PRs, code review, CI for SQL/pipelines, versioning of data logic).
- Experience working with financial / payments / subscription data (or comparably high-stakes domains) where correctness is non-negotiable.
- Ability to communicate trade-offs and findings clearly to both technical and non-technical stakeholders.
Benefits
- Remote Work Environment: Embrace the freedom to work from anywhere, anytime, promoting a healthy work-life balance.
- Unlimited PTO: Enjoy unlimited paid time off to recharge and prioritize your well-being, without counting days.
- Paid National Holidays: Enjoy paid time off on national holidays to unwind and recharge.
- Company-provided MacBook: Experience seamless productivity with top-notch Apple MacBooks provided to all employees who need them.
- Flexible Independent Contractor Agreement: Unlock the benefits of flexibility, autonomy, and entrepreneurial opportunities. Benefit from tax advantages, networking opportunities, reduced employment obligations, and the freedom to work from anywhere. Read more about it here: https://docs.google.com/document/d/1nkrN76JlZkbKj9WSOhlT1_mni_CZeDkHdwfIjPXVwvk/preview?tab=t.0#heading=h.ndsdl4wapxtt