As a Lead ML Systems Engineer, you will own the architecture, performance, and scalability of Krisp Cloud’s real-time Voice AI serving infrastructure.
You will be responsible for transforming state-of-the-art research models into highly optimized, reliable, and cost-efficient production systems that power latency-sensitive, mission-critical Voice AI services.
This role sits at the intersection of machine learning, distributed systems, GPU performance engineering, and large-scale infrastructure, and requires deep systems thinking and long-term architectural ownership.
Model Serving & Production Performance
- Prototype, implement, and benchmark critical components of the serving stack.
- Architect and implement inference and serving strategies that define how models are packaged, deployed, replicated, batched, scheduled, and optimized under real-time constraints.
- Partner with Research and Platform teams to drive deep performance optimization across runtime, precision (FP16/INT8/FP8), batching strategies, and GPU execution.
- Design scaling behavior under variable real-time load (burst handling, replica strategy, workload partitioning).
- Establish observability standards across inference services (latency metrics, GPU profiling, tracing, performance regression detection).
- Lead root cause analysis of systemic performance regressions and implement structural improvements.
- Partner closely with MLOps and Platform teams to operationalize infrastructure while retaining architectural ownership of the serving layer.
Technical Leadership
- Drive alignment between model design and production constraints, ensuring research translates into performant, scalable, cost-effective systems.
- Mentor senior engineers through design reviews, deep technical discussions, and hands-on collaboration.
- Shape the long-term architectural direction for Voice AI serving infrastructure through both implementation and strategic design.
All interested candidates are encouraged to apply through this form.
We appreciate all applications; however, only shortlisted candidates will be contacted for the next stages.
All applicants are considered regardless of race, color, religion, national origin, age, sex, marital status, ancestry, physical or mental disability, veteran status, gender identity, or sexual orientation. We do not tolerate discrimination or harassment of any kind. All employees and contractors of Krisp treat each other with respect and empathy.