O pozici
At ClickUp, we're building the future of work: the first truly converged AI workspace unifying tasks, docs, chat, calendar, and enterprise search, all supercharged by context-driven AI. We are an AI-native company. Every team member is expected to leverage AI daily, and we evaluate AI fluency as part of our hiring process. Join us and help redefine what's possible. 🚀
Co budeš dělat
- Design, build, and optimize real-time speech-to-text pipelines (streaming ASR, VAD, audio processing)
- Improve transcription accuracy through context injection (user names, teams, custom vocabulary, language detection)
- Develop and maintain LLM-powered post-processing (grammar correction, filler removal, mention resolution, formatting)
- Build voice-to-action systems that parse natural language into structured workspace commands
- Evaluate, benchmark, and integrate ASR models (Whisper, AssemblyAI, Fireworks, etc.) for cost, latency, and accuracy
- Collaborate with product and platform teams to ship voice features across MAX Desktop, Mobile, Web, and Browser Extension
- Explore multimodal AI capabilities (screen + voice + text) for next-gen assistant experiences
Koho hledáme
- Unsure if you meet all the qualifications of this job description but are deeply excited about the role? We hire based on ambition, grit, and a passion for improving the way people work. If you think ClickUp is the company for you, we encourage you to apply!