Freelance Agent Evaluation Engineer — Mindrift | joboostr

Freelance Agent Evaluation Engineer

Mindrift

Remote · 11 days ago

About the role

Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is project-based, not permanent employment.

What you'll do

Build virtual companies following a high-level plan - codebase, infrastructure, and context (conversations, documentation, tickets) that form a realistic environment with development history
Assemble and calibrate tasks from intermediate states of the virtual company: craft the prompt, define evaluation criteria, and ensure the task is solvable and the evaluation is fair
Design tasks set in isolated environments - emulations of a developer's workstation: a Linux machine with development tools (terminal, CLI), MCP servers (repository, task tracker, messenger, documentation, etc.), and a real web application codebase
Write tests that accept all correct solutions and reject incorrect ones - neither too strict (breaking on valid approaches) nor too lenient (passing bad ones)
Iterate with an AI agent on tests - verifying they catch real problems, don't miss bad solutions, and don't break on good ones
Review code written by agents, analyze why an agent failed or succeeded, and design edge cases and adversarial scenarios
Iterate based on feedback from expert QA reviewers who score your work on quality criteria
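
As a rough illustration of the "neither too strict nor too lenient" idea above, here is a minimal Python sketch. The task ("return the unique items of a list, preserving first-seen order"), the three candidate solutions, and the helper `passes_behavioral_checks` are all hypothetical and are not taken from Mindrift's actual projects. The check looks only at observable behavior, so two differently written correct solutions pass while a plausible but wrong one fails.

```python
from typing import Callable, List


# Hypothetical task: return the unique items of a list, preserving first-seen order.

def solution_with_dict(items: List) -> List:
    # Correct: dict keys keep insertion order.
    return list(dict.fromkeys(items))


def solution_with_loop(items: List) -> List:
    # Correct: a different but equally valid approach.
    seen, out = set(), []
    for item in items:
        if item not in seen:
            seen.add(item)
            out.append(item)
    return out


def bad_solution(items: List) -> List:
    # Incorrect: removes duplicates but loses first-seen order.
    return sorted(set(items))


def passes_behavioral_checks(fn: Callable[[List], List]) -> bool:
    # Compare outputs only; never inspect how the solution is implemented.
    cases = [
        ([], []),
        ([3, 1, 3, 2, 1], [3, 1, 2]),
        (["b", "a", "b"], ["b", "a"]),
    ]
    try:
        return all(fn(list(given)) == expected for given, expected in cases)
    except Exception:
        # A solution that crashes on valid input counts as failing.
        return False


if __name__ == "__main__":
    for candidate in (solution_with_dict, solution_with_loop, bad_solution):
        print(candidate.__name__, passes_behavioral_checks(candidate))
```

A check that searched the source for `dict.fromkeys` would be too strict, because it would reject the loop-based solution. A check that only compared `set(result)` with `set(items)` would be too lenient, because it would accept the sorted version.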

Who we're looking for

This opportunity is a good fit for experienced developers, software engineers, and/or test automation specialists open to part-time, non-permanent projects. Ideally, contributors will have:
Degree in Computer Science, Software Engineering, or related fields
5+ years in software development, primarily Python (FastAPI, pytest, async/await, subprocess, file operations)
Background in full-stack development, with experience building React-based interfaces (JavaScript/TypeScript) and robust back-end systems
Experience writing tests (functional, integration — not just running them)
Experience with Docker containers and familiarity with infrastructure tools (Postgres, Kafka, Redis)
CI/CD understanding (GitHub Actions as a user: triggers, labels, reading results)
English proficiency - B2
You don't need to be an expert in every item, but you should be comfortable reading and reasoning about code across the stack.

Benefits

Compensation varies across projects depending on scope, complexity, and required expertise. Please note that other projects on the platform may offer different earning levels based on their requirements.

Skills: Python, FastAPI, pytest, async/await, subprocess, file operations, React, Docker

Part-time · Senior · Remote