Freelance Agent Evaluation Engineer — Mindrift | joboostr

Freelance Agent Evaluation Engineer

Mindrift

Czechia, Remote · 1 day ago

About the role

Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is project-based, not permanent employment.

What you'll do

You'll create challenging tasks and evaluation criteria within realistic simulated environments:
Build realistic developer environments: a virtual company with codebase, infrastructure, and context (tickets, docs, conversations) that forms a believable development history
Design tasks from intermediate states of these environments: craft the prompt, define what "solved" means, and ensure the task is solvable by an AI agent
Write tests that verify agent solutions: accept all valid approaches and reject incorrect ones, neither too strict nor too lenient
Iterate on tasks and tests based on QA feedback: review agent solutions, analyze failures, and refine until the evaluation is fair and robust
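
As a rough illustration of a test that is "neither too strict nor too lenient", the sketch below checks the behavior of a solution instead of one exact output. The task (ranking records by score), the `Record` type, and the `is_valid_ranking` helper are hypothetical and are not taken from Mindrift's projects; they only show the idea under those assumptions.

```python
# Hypothetical task: "return the records ranked by score, highest first".
# Ties may legitimately appear in any order, so the check verifies
# properties of the result rather than comparing against one fixed list.

Record = tuple[str, int]  # (name, score); illustrative data shape


def is_valid_ranking(original: list[Record], ranked: list[Record]) -> bool:
    """Accept any permutation of the input that is sorted by score, descending."""
    # Reject solutions that drop, duplicate, or invent records.
    same_items = sorted(original) == sorted(ranked)
    # Reject solutions whose scores ever increase from one item to the next.
    scores = [score for _, score in ranked]
    descending = all(a >= b for a, b in zip(scores, scores[1:]))
    return same_items and descending


data = [("ana", 3), ("bo", 5), ("cy", 3)]

# Two different but equally valid solutions (the tie between ana and cy is free).
stable = sorted(data, key=lambda r: -r[1])
other = [("bo", 5), ("cy", 3), ("ana", 3)]

# Two incorrect solutions: wrong direction, and a missing record.
ascending = sorted(data, key=lambda r: r[1])
dropped = [("bo", 5), ("ana", 3)]
```

A check that compared against `stable` exactly would be too strict, since it would reject `other`. A check that only looked at the first element would be too lenient, since it would accept `dropped`.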

Who we're looking for

5+ years in software development
Core stack: Python (FastAPI), JavaScript/TypeScript (React), Docker, Postgres, Kafka, Redis
Experience writing tests (functional, integration)
English proficiency: B2 or higher

Benefits

Up to $50/hr equivalent, depending on level and pace. Tasks are estimated at ~20 hours each; you set your own schedule.

Skills: Python, FastAPI, JavaScript, TypeScript, React, Docker, Postgres, Kafka

Part-time · Senior · Remote