AI Meeting Assistant
Back

AI Meeting Assistant

with #1 Noise Cancellation

Explore AI Meeting Assistant

AI Notetaker

AI Note Taker

Meeting Transcription

Meeting Recording

Meeting Summary

Real Time Voice AI

Noise Cancellation

Accent Conversion - Speaker side

Accent Conversion - Listener side
Call Center AI
Back

Call Center AI

AI that boosts call center productivity

Explore platform

Speech Assist

Noise Cancellation

Remove background noises, voices & echoes.

Accent Conversion

Real-time accent conversion for call center agents.

Voice Translation

Real-time AI voice translation for call center agents.

Agent & Supervisor Assist

Agent Assist

Real-time AI assistant for call center agents.

Speech Analytics

Call scoring, Compliance monitoring and more.

Voice security

Real-time fraud detection and more
Developers
Back

Developers

with #1 AI Voice Models

Explore developers

For Voice AI Agents

Voice Isolation

Isolate the primary speaker’s voice

Turn-Taking

Improving turn-taking for AI

For Human-to-human Calls

Accent Conversion

Convert accents in calls

Noise Cancellation

Noise removal in calls

Voice Translation API New

Real-time translation, self-serve
Customers
Pricing

Documentation

Get Access

AI Meeting Assistant
Back

AI Meeting Assistant

with #1 Noise Cancellation

Explore AI Meeting Assistant

AI Notetaker

AI Note Taker

Meeting Transcription

Meeting Recording

Meeting Summary

Real Time Voice AI

Noise Cancellation

Accent Conversion - Speaker side

Accent Conversion - Listener side
Call Center AI
Back

Call Center AI

AI that boosts call center productivity

Explore platform

Speech Assist

Noise Cancellation

Remove background noises, voices & echoes.

Accent Conversion

Real-time accent conversion for call center agents.

Voice Translation

Real-time AI voice translation for call center agents.

Agent & Supervisor Assist

Agent Assist

Real-time AI assistant for call center agents.

Speech Analytics

Call scoring, Compliance monitoring and more.

Voice security

Real-time fraud detection and more
Developers
Back

Developers

with #1 AI Voice Models

Explore developers

For Voice AI Agents

Voice Isolation

Isolate the primary speaker’s voice

Turn-Taking

Improving turn-taking for AI

For Human-to-human Calls

Accent Conversion

Convert accents in calls

Noise Cancellation

Noise removal in calls

Voice Translation API New

Real-time translation, self-serve
Customers
Pricing

Book a demo

New Voice Translation API — now self-serve

#1 Real-time voice AI
models. Production ready.

Voice models that clean, understand, and transform audio in real time.

Request SDK Access

Dev dashboard

Explore Voice AI Playground

1T+

Minutes processed

8 yrs

Production audio

200M+

Devices worldwide

2× Webby

Technical Achievement

Organizations worldwide trust us

Where Krisp sits in your stack

Two SDK families and a self-serve Translation API.

Voice AI agents VIVA SDK

Human calls RTC SDK

Multilingual Calls Translation API

INPUT Messy audio

ENHANCE VIVA SDK

TRANSCRIBE STT

REASON LLM

OUTPUT TTS

INPUT Messy audio

ENHANCE RTC SDK

OUTPUT Clean audio

INPUT Speech
(any lang)

ENHANCE Krisp VT

OUTPUT Translated
speech

Built for every platform

Krisp's AI Voice SDK library is available for Windows, Mac, Linux,
Web (JS/WASM), iOS and Android platforms.

Two SDK families. One audio expertise.

Both powered by 8 years of production audio and a trillion+ minutes processed.

VIVA SDK

For Voice AI Agents

NEW: VIVA 2.0 — From Reactive to Predictive

Voice Isolation v3 - improves WER
Turn Prediction v3 — multilingual
Voice Activity Detection
Interruption Prediction
Signal Detectors — accent, gender, TTS

1B+ mins traffic being processed monthly

RTC SDK

For Human-to-Human calls

Accent · Translation · Noise — solved

Accent Conversion
Background Voice Cancellation
Outbound Noise Cancellation
Inbound Noise Cancellation
Voice Translation API available

80B+ mins traffic being processed monthly

Self-serve

Voice Translation API 61 languages, 96% accuracy, 60 min free

Get API Key

Krisp VIVA SDK

Voice infrastructure for voice AI agents. Each model is lightweight, CPU-deployed, and operates on audio alone.

VIVA 2.0: For Voice AI Agents

Voice Isolation

Server-side voice isolation that sits in front of your VAD or STT. Removes background noise, cross-talk, and non-primary speakers before they reach your agent. Works across every language and accent.

VIVA 2.0: For Voice AI Agents

Turn Prediction

VIVA family small model that predicts when a speaker is done — directly from raw audio, no transcription needed. Eliminates awkward pauses and talk-over. Lightweight enough for on-server CPU, multilingual out of the box.

VIVA 2.0: For Voice AI Agents

Interruption Prediction

Newest VIVA family model that classifies user speech mid-response in real time — interruption vs. question, takeover vs. backchannel. Tells your agent when to stop and listen, and when to keep going.

VIVA 2.0: For Voice AI Agents

Voice Activity Detection (VAD)

Simple, accurate Voice Activity Detector — robust against background noise and secondary voices. Lightweight and optimized for on-server CPU deployment.

Krisp RTC SDK

Enhances the quality of communication in calls and meetings. Real-time processing for contact centers and platforms.

RTC: For human-to-human calls

Accent Conversion

Krisp RTC family model that converts a contact center agent's accent to match the customer's in real time. Improves CX with strong ROI. Robust to noise, gender, and microphone or headset choice.

RTC: For human-to-human calls

Background Voice Cancellations

Krisp RTC family model for the outbound/uplink stream. Removes background noise and includes de-reverberation to eliminate room echo. Built for contact centers, offices, and home environments. Microphone and language independent.

RTC: For human-to-human calls

Outbound Noise Cancellation

Krisp RTC family model for the outbound/uplink stream that removes other voices near the primary speaker. Solves contact center cross-talk and competing voices at home or office. No voice enrollment needed. Language independent.

RTC: For human-to-human calls

Inbound Noise Cancellation

Krisp RTC family model for the inbound/downlink stream. Strips background noise from mobile and PSTN callers so agents get clear audio bi-directionally. Speaker, headset, and language independent. Intelligently passes through ringtones.

RTC: For human-to-human calls

Voice Translation

API available

Voice translation API for real-time speech-to-speech translation across 61 languages, any-to-any. Built inside enterprise contact centers with 96% accuracy on live calls. Now available as a self-serve API with Python and JavaScript SDKs, a playground, and 60 minutes of free translation credit. 99.9% uptime SLA. No sales call needed.

Get API Key API Reference

Learn how we built it
behind the scenes

Read our Engineering Blog

Hear it for yourself.

Run real-world audio through every model in the Krisp stack.
No signup. No SDK. Just sound.

Try Voice AI Playground

See Krisp in action

Voice Isolation

Noise Cancellation

Accent Conversion

Play the demo

Krisp off

Play the demo

Krisp off

Hear it from our customers

Phonely solving one of Voice AI's biggest challenges with Krisp

Natterbox powers real-world AI conversations with Krisp

Watch the video

Newo transforms AI voice agents' performance with Krisp

Watch the video

Announcements

Introducing Krisp VIVA 2.0: Voice Infrastructure for Voice AI Agents

Article

May 06, 2026

Introducing Krisp VIVA 2.0: Voice Infrastructure for Voice AI Agents

Every voice AI demo works. Production doesn't. You've seen it happen. A voice agent sounds…

Read the article

A New Approach to Turn-Taking in Voice AI: Turn Prediction v3 and Interruption Prediction v1

Article

May 06, 2026

A New Approach to Turn-Taking in Voice AI: Turn Prediction v3 and Interruption Prediction v1

The natural rhythm of conversation depends on knowing when to start speaking and when to…

Read the article

Introducing Krisp RTC: Voice Translation SDK for Customer Experience

Article

Feb 18, 2026

Introducing Krisp RTC: Voice Translation SDK for Customer Experience

Real-Time Voice Translation for Customer Experience Real-time voice translation has long been one of the…

Read the article

Frequently asked questions

How does Voice Isolation differ from noise cancellation?

Noise cancellation removes background sounds. Voice Isolation goes further — it removes both background noise and secondary human voices, ensuring only the primary speaker's voice reaches your VAD or STT pipeline. This eliminates false interruptions caused by nearby speakers or cross-talk.

What's the difference between Turn Prediction and Interruption Prediction?

Turn Prediction identifies when a speaker is about to finish talking, so your AI agent can respond at the right moment without awkward pauses. Interruption Prediction determines whether a user speaking mid-response intends to interrupt or is simply asking a question or giving a backchannel like "uh-huh." Together, they give your agent the conversational awareness to handle real dialogue.

Do VIVA models require transcription or language-specific configuration?

No. All VIVA models operate directly on the audio signal — no transcription step is needed. They are language agnostic and support multiple languages natively, with no per-language tuning or configuration required.

Can VIVA models be used together or independently?

Each model in the VIVA family — Voice Isolation, Turn Prediction, Interruption Prediction, and VAD — works as a standalone component. You can deploy one or combine them depending on your pipeline needs. Most voice AI agent deployments benefit from running them together for the best conversational experience.

What are the deployment requirements?

VIVA models are lightweight and optimized for on-server CPU deployment. They integrate directly into your existing voice pipeline — typically in front of your VAD or STT — and are available via C, Python, Node.js, Go, and Rust bindings, as well as frameworks like LiveKit and Pipecat.

Can I use RTC and VIVA models together?

They serve different use cases. VIVA is built for human-to-AI communication — voice AI agents and bots. RTC is built for human-to-human communication — calls, meetings, and contact centers. That said, if your platform handles both scenarios, you can deploy models from each family where they're needed in your pipeline.

How many languages does Voice Translation support?

Voice Translation supports 60+ languages for real-time bidirectional translation. It handles speech-to-speech translation directly, preserving conversational flow without requiring speakers to wait for text-based translation steps. Optimized for contact center environments where agents and customers need to communicate naturally across language barriers.

Tell us your solution, we’ll
show how Krisp can help.

Complete the form at the link below to tell us about your product and use case. We will contact you
to discuss how to get started with integrating Krisp SDKs.

Get Access

#1 Real-time voice AI models. Production ready.

Where Krisp sits in your stack

Built for every platform

Two SDK families. One audio expertise.

For Voice AI Agents

For Human-to-Human calls

Krisp VIVA SDK

Voice Isolation

Turn Prediction

Interruption Prediction

Voice Activity Detection (VAD)

Krisp RTC SDK

Accent Conversion

Background Voice Cancellations

Outbound Noise Cancellation

Inbound Noise Cancellation

Voice Translation

Learn how we built it behind the scenes

Hear it for yourself.

See Krisp in action

Hear it from our customers

Phonely solving one of Voice AI's biggest challenges with Krisp

Natterbox powers real-world AI conversations with Krisp

Newo transforms AI voice agents' performance with Krisp

Announcements

Introducing Krisp VIVA 2.0: Voice Infrastructure for Voice AI Agents

A New Approach to Turn-Taking in Voice AI: Turn Prediction v3 and Interruption Prediction v1

Introducing Krisp RTC: Voice Translation SDK for Customer Experience

Frequently asked questions

Tell us your solution, we’ll show how Krisp can help.

You’re all set

Thanks!

#1 Real-time voice AI
models. Production ready.

Learn how we built it
behind the scenes

Tell us your solution, we’ll
show how Krisp can help.