


{"id":22692,"date":"2025-12-15T12:27:13","date_gmt":"2025-12-15T08:27:13","guid":{"rendered":"https:\/\/krisp.ai\/blog\/?p=22692"},"modified":"2025-12-16T18:12:50","modified_gmt":"2025-12-16T14:12:50","slug":"accent-conversion-sdk","status":"publish","type":"post","link":"https:\/\/krisp.ai\/blog\/accent-conversion-sdk\/","title":{"rendered":"Introducing Krisp\u2019s Accent Conversion SDK"},"content":{"rendered":"<h2><strong>TL;DR<\/strong><\/h2>\n<p>Krisp\u2019s Accent Conversion (AC) SDK is now available for server-side deployment, giving CCaaS platforms and enterprise voice teams a production-ready way to transform accents in real time.<\/p>\n<p>The Krisp AC v3.7 model delivers high naturalness, clarity, pronunciation accuracy, and speaker similarity\u2014running entirely on CPUs with predictable latency and simple frame-based APIs. Integrate it directly into WebRTC\/SIP media pipelines, Pipecat, or other workflows.<\/p>\n<p>&nbsp;<\/p>\n<div class=\"text_center\">\n<div class=\"btn btn--primary\">\n        <a style=\"color:#FFF !important;\" href=\"https:\/\/krisp.ai\/developers\/\">Get Access<\/a>\n    <\/div>\n<\/div>\n<h2><strong>Introduction<\/strong><\/h2>\n<p>Building modern voice experiences for CCaaS platforms, BPOs, and large enterprise contact centers requires solving one persistent issue: cross-accent intelligibility. Customers often struggle to understand agents with strong non-native accents, leading to repeated clarifications, longer handle times, and lower customer satisfaction. Developers need an Accent Conversion (AC) solution that is reliable, low-latency, and easy to deploy inside existing WebRTC, SIP, and media infrastructure\u2014without adding operational complexity or GPU dependencies.<\/p>\n<p>&nbsp;<\/p>\n<p>Krisp has steadily advanced its Accent Conversion technology across multiple releases, improving naturalness, pronunciation accuracy, speaker similarity, voice stability, and overall audio clarity. 
These improvements have been validated through objective benchmarks, large-scale crowdsourced evaluations, and extensive production use inside Krisp\u2019s CX product.<\/p>\n<p>&nbsp;<\/p>\n<p>Accent Conversion (AC) has been available through Krisp\u2019s <strong>Desktop SDK and JavaScript SDK<\/strong>, enabling in-app and browser-based integrations. Today, we\u2019re expanding availability with the <strong>Accent Conversion SDK for servers<\/strong>, bringing AC v3.7 model quality to real-time backend pipelines through Python and C\/C++ SDKs.<\/p>\n<p>&nbsp;<\/p>\n<p>The server SDK gives you full control over deployment, data handling, latency, and scaling\u2014making Accent Conversion a practical and production-ready component for modern voice platforms.<\/p>\n<h2><strong>Why Run Accent Conversion on Your Own Servers?<\/strong><\/h2>\n<p>Running Accent Conversion inside your own infrastructure gives you full control over how the technology behaves in production without relying on external services or introducing new data paths.<\/p>\n<p><img loading=\"lazy\" class=\"alignnone size-full wp-image-22700\" src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/12\/diagram-ac.png\" alt=\"Accent Conversion Diagram\" width=\"2000\" height=\"500\" srcset=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/12\/diagram-ac.png 2000w, https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/12\/diagram-ac-300x75.png 300w, https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/12\/diagram-ac-380x95.png 380w, https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/12\/diagram-ac-768x192.png 768w, https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/12\/diagram-ac-1536x384.png 1536w, https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/12\/diagram-ac-600x150.png 600w\" sizes=\"(max-width: 2000px) 100vw, 2000px\" \/><br \/>\n<span style=\"font-size: 10pt;\"><em>Server-Side Accent Conversion in the Agent-to-Customer Path<\/em><\/span><\/p>\n<h3>Key 
advantages:<\/h3>\n<ul>\n<li><strong>Data stays within your environment<\/strong>: no audio leaves your network, simplifying compliance for regulated industries.<\/li>\n<li><strong>Consistent low latency<\/strong>: optimized CPU inference enables real-time conversion.<\/li>\n<li><strong>Flexible deployment<\/strong>: run it in your cloud, on-prem, hybrid environments, or embedded inside your voice infrastructure.<\/li>\n<li><strong>Full control of scaling<\/strong>: match capacity to your traffic patterns, from low-volume pilots to large global operations.<\/li>\n<li><strong>Seamless pipeline integration<\/strong>: no changes to existing WebRTC, SIP, or media-streaming architectures.<\/li>\n<\/ul>\n<p>This gives you the reliability, predictability, and control needed to operate Accent Conversion at production scale.<\/p>\n<h2><strong>Quickstart: Run Accent Conversion<\/strong><\/h2>\n<p>The Accent Conversion SDK is designed to integrate directly into media pipelines that process audio frame-by-frame. If you\u2019ve previously integrated the <strong>Krisp AI Voice SDKs<\/strong>, the Accent Conversion SDK will feel immediately familiar. 
This allows you to reuse existing code and drop Accent Conversion into your media path with minimal changes.<\/p>\n<h3><strong>Python<\/strong><\/h3>\n<div class=\"code_snippet_parent\">\n<div class=\"code_snippet_holder\">\n<pre><code>import logging\r\n\r\nimport krisp_audio\r\n\r\n# Forward Krisp SDK log messages to Python logging\r\ndef log_callback(log_message, log_level):\r\n    logging.info(f&quot;[{log_level}] {log_message}&quot;)\r\n\r\n# Initialize the Krisp SDK global instance\r\nkrisp_audio.globalInit(&quot;&quot;, log_callback, krisp_audio.LogLevel.Off)\r\n\r\n# Create an Accent Conversion session with the specified configuration\r\nmodel_info = krisp_audio.ModelInfo()\r\nmodel_info.path = &quot;path\/to\/accent_model_file.kef&quot;\r\n\r\n# inputSampleRate, inputFrameDuration, and outputSampleRate are placeholders\r\n# for the values your audio pipeline uses\r\nar_cfg = krisp_audio.ArSessionConfig()\r\nar_cfg.inputSampleRate = inputSampleRate\r\nar_cfg.inputFrameDuration = inputFrameDuration\r\nar_cfg.outputSampleRate = outputSampleRate\r\nar_cfg.modelInfo = model_info\r\n\r\narFloat = krisp_audio.ArFloat.create(ar_cfg)\r\n\r\n# Frame-by-frame processing of the given audio stream\r\n# (frame is a placeholder for the next audio frame from your pipeline)\r\nfor i in range(0, 1000):  # frame count\r\n    processed_frame = arFloat.process(frame)\r\n\r\n# Release the session, then free the Krisp SDK global instance\r\narFloat = None\r\nkrisp_audio.globalDestroy()\r\n<\/code><\/pre>\n<\/p><\/div>\n<\/div>\n<p><strong>What This Code Demonstrates<\/strong><\/p>\n<ul>\n<li>Initializing the Krisp SDK runtime<\/li>\n<li>Loading an Accent Conversion model<\/li>\n<li>Creating a session tuned to your audio pipeline<\/li>\n<li>Processing audio <strong>frame-by-frame<\/strong> (e.g., 20 ms frames)<\/li>\n<li>Gracefully shutting down the SDK<\/li>\n<\/ul>\n<h2><strong>AI Model Summary<\/strong><\/h2>\n<p>Krisp\u2019s Accent Conversion v3.7 model delivers significant improvements in naturalness, pronunciation accuracy, speaker similarity, voice stability, and audio clarity. 
These gains were validated through crowdsourced evaluations, objective phoneme-level benchmarks, and extensive production deployments in Krisp\u2019s CX product.<\/p>\n<table style=\"border-collapse: collapse; width: 100%; height: 106px;\">\n<tbody>\n<tr style=\"height: 24px;\">\n<td style=\"width: 33.3333%; height: 24px;\"><\/td>\n<td style=\"width: 33.3333%; height: 24px;\"><strong>Before<\/strong><\/td>\n<td style=\"width: 33.3333%; height: 24px;\"><strong>After<\/strong><\/td>\n<\/tr>\n<tr style=\"height: 58px;\">\n<td style=\"width: 33.3333%; height: 58px;\"><strong>Indian<\/strong><\/td>\n<td style=\"width: 33.3333%; height: 58px;\"><!--[if lt IE 9]><script>document.createElement('audio');<\/script><![endif]--><br \/>\n<audio class=\"wp-audio-shortcode\" id=\"audio-22692-1\" preload=\"none\" style=\"width: 100%;\" controls=\"controls\"><source type=\"audio\/mpeg\" src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/12\/ishika-before.mp3?_=1\" \/><a href=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/12\/ishika-before.mp3\">https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/12\/ishika-before.mp3<\/a><\/audio><\/td>\n<td style=\"width: 33.3333%; height: 58px;\"><audio class=\"wp-audio-shortcode\" id=\"audio-22692-2\" preload=\"none\" style=\"width: 100%;\" controls=\"controls\"><source type=\"audio\/mpeg\" src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/12\/ishika-after.mp3?_=2\" \/><a href=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/12\/ishika-after.mp3\">https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/12\/ishika-after.mp3<\/a><\/audio><\/td>\n<\/tr>\n<tr style=\"height: 24px;\">\n<td style=\"width: 33.3333%; height: 24px;\"><strong>Filipino<\/strong><\/td>\n<td style=\"width: 33.3333%; height: 24px;\"><audio class=\"wp-audio-shortcode\" id=\"audio-22692-3\" preload=\"none\" style=\"width: 100%;\" controls=\"controls\"><source type=\"audio\/mpeg\" 
src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/12\/louis-before.mp3?_=3\" \/><a href=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/12\/louis-before.mp3\">https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/12\/louis-before.mp3<\/a><\/audio><\/td>\n<td style=\"width: 33.3333%; height: 24px;\"><audio class=\"wp-audio-shortcode\" id=\"audio-22692-4\" preload=\"none\" style=\"width: 100%;\" controls=\"controls\"><source type=\"audio\/mpeg\" src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/12\/louis-after.mp3?_=4\" \/><a href=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/12\/louis-after.mp3\">https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/12\/louis-after.mp3<\/a><\/audio><\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>More detailed qualitative results and accent-specific evaluations are available in our <a href=\"https:\/\/krisp.ai\/blog\/introducing-krisp-accent-conversion-v3-7\/\">Accent Conversion 3.7 article.<\/a><\/p>\n<h3><strong>Algorithmic Latency &amp; Audio Handling<\/strong><\/h3>\n<p><img loading=\"lazy\" class=\"alignnone size-full wp-image-22701\" src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/12\/ac-algorithmic-latency.png\" alt=\"\" width=\"2000\" height=\"500\" srcset=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/12\/ac-algorithmic-latency.png 2000w, https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/12\/ac-algorithmic-latency-300x75.png 300w, https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/12\/ac-algorithmic-latency-380x95.png 380w, https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/12\/ac-algorithmic-latency-768x192.png 768w, https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/12\/ac-algorithmic-latency-1536x384.png 1536w, https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/12\/ac-algorithmic-latency-600x150.png 600w\" sizes=\"(max-width: 2000px) 100vw, 2000px\" \/><\/p>\n<ul>\n<li><strong>Algorithmic latency:<\/strong> ~220 ms (fixed), leaving sufficient budget for transport and 
media-pipeline overhead while staying within the 400\u2013500 ms one-way latency window required for natural conversational flow.<\/li>\n<li><strong>Audio format:<\/strong> Operates internally at 16 kHz; the SDK automatically handles up\/downsampling, so no preprocessing is required.<\/li>\n<li><strong>Voice isolation:<\/strong> Built-in voice isolation enables AC to run directly after the media gateway\/SFU without requiring a preceding Noise Cancellation stage.<\/li>\n<\/ul>\n<h2><strong>Get Started with the AI Voice SDK<\/strong><\/h2>\n<p>Everything you need to integrate Accent Conversion into your platform is available here:<\/p>\n<ul>\n<li><a href=\"https:\/\/krisp.ai\/developers\/\"><strong>Get Access to Accent Conversion SDK<\/strong><\/a><\/li>\n<li><a href=\"https:\/\/sdk-docs.krisp.ai\/\"><strong>SDK Documentation &amp; Integration Guides<\/strong><\/a><\/li>\n<li><a href=\"https:\/\/krisp.ai\/blog\/introducing-krisp-accent-conversion-v3-7\/\"><strong>Detailed Accent Conversion Model Quality Benchmarks (v3.7)<\/strong><\/a><\/li>\n<\/ul>\n","protected":false},"excerpt":{"rendered":"<p>TL;DR Krisp\u2019s Accent Conversion (AC) SDK is now available for server-side deployment, giving CCaaS platforms and enterprise voice teams a production-ready way to transform accents in real time. The Krisp AC v3.7 model delivers high naturalness, clarity, pronunciation accuracy, and speaker similarity\u2014running entirely on CPUs with predictable latency and simple frame-based APIs. 
Integrate it directly into [&hellip;]<\/p>\n","protected":false},"author":71,"featured_media":22718,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"two_page_speed":[]},"categories":[417,421,456],"tags":[],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO Premium plugin v24.2 (Yoast SEO v23.6) - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Introducing Krisp\u2019s Accent Conversion SDK | Krisp AI Voice SDK<\/title>\n<meta name=\"description\" content=\"Build real-time voice apps with a server-side Accent Conversion SDK. CPU-only, low latency, and easy to integrate with WebRTC and SIP pipelines.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/krisp.ai\/blog\/accent-conversion-sdk\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Introducing Krisp\u2019s Accent Conversion SDK | Krisp AI Voice SDK\" \/>\n<meta property=\"og:description\" content=\"Build real-time voice apps with a server-side Accent Conversion SDK. 
CPU-only, low latency, and easy to integrate with WebRTC and SIP pipelines.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/krisp.ai\/blog\/accent-conversion-sdk\/\" \/>\n<meta property=\"og:site_name\" content=\"Krisp\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/krispHQ\/\" \/>\n<meta property=\"article:published_time\" content=\"2025-12-15T08:27:13+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2025-12-16T14:12:50+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/12\/AC-SDK-2-1.png\" \/>\n\t<meta property=\"og:image:width\" content=\"1000\" \/>\n\t<meta property=\"og:image:height\" content=\"700\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"Krisp Engineering Team\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@krispHQ\" \/>\n<meta name=\"twitter:site\" content=\"@krispHQ\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/krisp.ai\/blog\/accent-conversion-sdk\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/krisp.ai\/blog\/accent-conversion-sdk\/\"},\"author\":{\"name\":\"Krisp Engineering Team\",\"@id\":\"https:\/\/krisp.ai\/blog\/#\/schema\/person\/e9f59158d89de3002958d323d2e788f5\"},\"headline\":\"Introducing Krisp\u2019s Accent Conversion 
SDK\",\"datePublished\":\"2025-12-15T08:27:13+00:00\",\"dateModified\":\"2025-12-16T14:12:50+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/krisp.ai\/blog\/accent-conversion-sdk\/\"},\"wordCount\":815,\"commentCount\":1,\"publisher\":{\"@id\":\"https:\/\/krisp.ai\/blog\/#organization\"},\"image\":{\"@id\":\"https:\/\/krisp.ai\/blog\/accent-conversion-sdk\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/12\/AC-SDK-2-1.png\",\"articleSection\":[\"Company\",\"Engineering Blog\",\"SDK\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/krisp.ai\/blog\/accent-conversion-sdk\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/krisp.ai\/blog\/accent-conversion-sdk\/\",\"url\":\"https:\/\/krisp.ai\/blog\/accent-conversion-sdk\/\",\"name\":\"Introducing Krisp\u2019s Accent Conversion SDK | Krisp AI Voice SDK\",\"isPartOf\":{\"@id\":\"https:\/\/krisp.ai\/blog\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/krisp.ai\/blog\/accent-conversion-sdk\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/krisp.ai\/blog\/accent-conversion-sdk\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/12\/AC-SDK-2-1.png\",\"datePublished\":\"2025-12-15T08:27:13+00:00\",\"dateModified\":\"2025-12-16T14:12:50+00:00\",\"description\":\"Build real-time voice apps with a server-side Accent Conversion SDK. 
CPU-only, low latency, and easy to integrate with WebRTC and SIP pipelines.\",\"breadcrumb\":{\"@id\":\"https:\/\/krisp.ai\/blog\/accent-conversion-sdk\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/krisp.ai\/blog\/accent-conversion-sdk\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/krisp.ai\/blog\/accent-conversion-sdk\/#primaryimage\",\"url\":\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/12\/AC-SDK-2-1.png\",\"contentUrl\":\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/12\/AC-SDK-2-1.png\",\"width\":1000,\"height\":700,\"caption\":\"Accent Conversion SDK\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/krisp.ai\/blog\/accent-conversion-sdk\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/krisp.ai\/blog\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Introducing Krisp\u2019s Accent Conversion SDK\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/krisp.ai\/blog\/#website\",\"url\":\"https:\/\/krisp.ai\/blog\/\",\"name\":\"Krisp\",\"description\":\"Blog\",\"publisher\":{\"@id\":\"https:\/\/krisp.ai\/blog\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/krisp.ai\/blog\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/krisp.ai\/blog\/#organization\",\"name\":\"Krisp\",\"url\":\"https:\/\/krisp.ai\/blog\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/krisp.ai\/blog\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2024\/10\/K.png\",\"contentUrl\":\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2024\/10\/K.png\",\"width\":696,\"height\":696,\"caption\":\"Krisp\"}
,\"image\":{\"@id\":\"https:\/\/krisp.ai\/blog\/#\/schema\/logo\/image\/\"},\"sameAs\":[\"https:\/\/www.facebook.com\/krispHQ\/\",\"https:\/\/x.com\/krispHQ\",\"https:\/\/www.linkedin.com\/company\/krisphq\/\",\"https:\/\/www.youtube.com\/channel\/UCAMZinJdR9P33fZUNpuxXtg\"]},{\"@type\":\"Person\",\"@id\":\"https:\/\/krisp.ai\/blog\/#\/schema\/person\/e9f59158d89de3002958d323d2e788f5\",\"name\":\"Krisp Engineering Team\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/krisp.ai\/blog\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/26475ad8219056696662f819691ee49d?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/26475ad8219056696662f819691ee49d?s=96&d=mm&r=g\",\"caption\":\"Krisp Engineering Team\"},\"url\":\"https:\/\/krisp.ai\/blog\/author\/eng-team\/\"}]}<\/script>\n<!-- \/ Yoast SEO Premium plugin. -->","yoast_head_json":{"title":"Introducing Krisp\u2019s Accent Conversion SDK | Krisp AI Voice SDK","description":"Build real-time voice apps with a server-side Accent Conversion SDK. CPU-only, low latency, and easy to integrate with WebRTC and SIP pipelines.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/krisp.ai\/blog\/accent-conversion-sdk\/","og_locale":"en_US","og_type":"article","og_title":"Introducing Krisp\u2019s Accent Conversion SDK | Krisp AI Voice SDK","og_description":"Build real-time voice apps with a server-side Accent Conversion SDK. 
CPU-only, low latency, and easy to integrate with WebRTC and SIP pipelines.","og_url":"https:\/\/krisp.ai\/blog\/accent-conversion-sdk\/","og_site_name":"Krisp","article_publisher":"https:\/\/www.facebook.com\/krispHQ\/","article_published_time":"2025-12-15T08:27:13+00:00","article_modified_time":"2025-12-16T14:12:50+00:00","og_image":[{"width":1000,"height":700,"url":"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/12\/AC-SDK-2-1.png","type":"image\/png"}],"author":"Krisp Engineering Team","twitter_card":"summary_large_image","twitter_creator":"@krispHQ","twitter_site":"@krispHQ","schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/krisp.ai\/blog\/accent-conversion-sdk\/#article","isPartOf":{"@id":"https:\/\/krisp.ai\/blog\/accent-conversion-sdk\/"},"author":{"name":"Krisp Engineering Team","@id":"https:\/\/krisp.ai\/blog\/#\/schema\/person\/e9f59158d89de3002958d323d2e788f5"},"headline":"Introducing Krisp\u2019s Accent Conversion SDK","datePublished":"2025-12-15T08:27:13+00:00","dateModified":"2025-12-16T14:12:50+00:00","mainEntityOfPage":{"@id":"https:\/\/krisp.ai\/blog\/accent-conversion-sdk\/"},"wordCount":815,"commentCount":1,"publisher":{"@id":"https:\/\/krisp.ai\/blog\/#organization"},"image":{"@id":"https:\/\/krisp.ai\/blog\/accent-conversion-sdk\/#primaryimage"},"thumbnailUrl":"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/12\/AC-SDK-2-1.png","articleSection":["Company","Engineering Blog","SDK"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/krisp.ai\/blog\/accent-conversion-sdk\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/krisp.ai\/blog\/accent-conversion-sdk\/","url":"https:\/\/krisp.ai\/blog\/accent-conversion-sdk\/","name":"Introducing Krisp\u2019s Accent Conversion SDK | Krisp AI Voice 
SDK","isPartOf":{"@id":"https:\/\/krisp.ai\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/krisp.ai\/blog\/accent-conversion-sdk\/#primaryimage"},"image":{"@id":"https:\/\/krisp.ai\/blog\/accent-conversion-sdk\/#primaryimage"},"thumbnailUrl":"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/12\/AC-SDK-2-1.png","datePublished":"2025-12-15T08:27:13+00:00","dateModified":"2025-12-16T14:12:50+00:00","description":"Build real-time voice apps with a server-side Accent Conversion SDK. CPU-only, low latency, and easy to integrate with WebRTC and SIP pipelines.","breadcrumb":{"@id":"https:\/\/krisp.ai\/blog\/accent-conversion-sdk\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/krisp.ai\/blog\/accent-conversion-sdk\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/krisp.ai\/blog\/accent-conversion-sdk\/#primaryimage","url":"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/12\/AC-SDK-2-1.png","contentUrl":"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/12\/AC-SDK-2-1.png","width":1000,"height":700,"caption":"Accent Conversion SDK"},{"@type":"BreadcrumbList","@id":"https:\/\/krisp.ai\/blog\/accent-conversion-sdk\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/krisp.ai\/blog\/"},{"@type":"ListItem","position":2,"name":"Introducing Krisp\u2019s Accent Conversion 
SDK"}]},{"@type":"WebSite","@id":"https:\/\/krisp.ai\/blog\/#website","url":"https:\/\/krisp.ai\/blog\/","name":"Krisp","description":"Blog","publisher":{"@id":"https:\/\/krisp.ai\/blog\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/krisp.ai\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/krisp.ai\/blog\/#organization","name":"Krisp","url":"https:\/\/krisp.ai\/blog\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/krisp.ai\/blog\/#\/schema\/logo\/image\/","url":"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2024\/10\/K.png","contentUrl":"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2024\/10\/K.png","width":696,"height":696,"caption":"Krisp"},"image":{"@id":"https:\/\/krisp.ai\/blog\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/krispHQ\/","https:\/\/x.com\/krispHQ","https:\/\/www.linkedin.com\/company\/krisphq\/","https:\/\/www.youtube.com\/channel\/UCAMZinJdR9P33fZUNpuxXtg"]},{"@type":"Person","@id":"https:\/\/krisp.ai\/blog\/#\/schema\/person\/e9f59158d89de3002958d323d2e788f5","name":"Krisp Engineering Team","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/krisp.ai\/blog\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/26475ad8219056696662f819691ee49d?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/26475ad8219056696662f819691ee49d?s=96&d=mm&r=g","caption":"Krisp Engineering 
Team"},"url":"https:\/\/krisp.ai\/blog\/author\/eng-team\/"}]}},"_links":{"self":[{"href":"https:\/\/krisp.ai\/blog\/wp-json\/wp\/v2\/posts\/22692"}],"collection":[{"href":"https:\/\/krisp.ai\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/krisp.ai\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/krisp.ai\/blog\/wp-json\/wp\/v2\/users\/71"}],"replies":[{"embeddable":true,"href":"https:\/\/krisp.ai\/blog\/wp-json\/wp\/v2\/comments?post=22692"}],"version-history":[{"count":19,"href":"https:\/\/krisp.ai\/blog\/wp-json\/wp\/v2\/posts\/22692\/revisions"}],"predecessor-version":[{"id":22721,"href":"https:\/\/krisp.ai\/blog\/wp-json\/wp\/v2\/posts\/22692\/revisions\/22721"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/krisp.ai\/blog\/wp-json\/wp\/v2\/media\/22718"}],"wp:attachment":[{"href":"https:\/\/krisp.ai\/blog\/wp-json\/wp\/v2\/media?parent=22692"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/krisp.ai\/blog\/wp-json\/wp\/v2\/categories?post=22692"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/krisp.ai\/blog\/wp-json\/wp\/v2\/tags?post=22692"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}