


{"id":21860,"date":"2025-08-07T11:55:59","date_gmt":"2025-08-07T07:55:59","guid":{"rendered":"https:\/\/krisp.ai\/blog\/?p=21860"},"modified":"2025-08-13T17:11:09","modified_gmt":"2025-08-13T13:11:09","slug":"introducing-krisp-accent-conversion-v3-7","status":"publish","type":"post","link":"https:\/\/krisp.ai\/blog\/introducing-krisp-accent-conversion-v3-7\/","title":{"rendered":"Introducing Krisp Accent Conversion v3.7"},"content":{"rendered":"<p>Krisp <strong>Accent Conversion v3<\/strong>, released in March 2025, marked a breakthrough moment in the evolution of our accent conversion technology. For the first time in two years, we felt the system was mature enough for wide-scale production use.<\/p>\n<p>&nbsp;<\/p>\n<p>In May 2025, we released <strong>Accent Conversion v3.5<\/strong>, bringing a major quality upgrade \u2014 with <strong>~20% improvement<\/strong> across key metrics for both <strong>Filipino and Indian accents<\/strong> (<a href=\"https:\/\/krisp.ai\/blog\/introducing-accent-conversion-v3-5\/\">details here<\/a>). Thanks to Krisp desktop application\u2019s auto-update mechanism, the rollout reached <strong>95% of users within 2 days<\/strong>, and the feedback was overwhelmingly positive, both from agents and customers, driving sentiment and business KPIs.<\/p>\n<p>&nbsp;<\/p>\n<p>In July 2025, we expanded the offering to support the <strong>Latin American accent<\/strong> pack. The launch quickly gained traction with several large customers and is now deployed across thousands of agents.<\/p>\n<p>&nbsp;<\/p>\n<p>Throughout this period, we\u2019ve worked closely with partners, agents, and customers to deeply understand corner cases \u2014 especially for the Indian accent, which is the most challenging due to its vast regional variation and phonetic complexity. This close collaboration, combined with relentless efforts from the world-class research and engineering teams at Krisp, has culminated in another major step forward now.<\/p>\n<p>&nbsp;<\/p>\n<p>Today, we\u2019re launching <strong>Accent Conversion v3.7<\/strong>, delivering significant improvements in <strong>naturalness<\/strong> and <strong>voice stability<\/strong>. This release is currently focused on the Indian accent pack, with support for other accents rolling out soon.<\/p>\n<p>The following sections summarize the key improvements, benchmarking methodology, and a side-by-side comparison of Accent Conversion v3.7 with v3.5.<\/p>\n<p><!-- notionvc: c1878c4c-b108-4fb1-8269-560e9f4e8214 --><\/p>\n<h2>Key Improvements in AC v3.7<!-- notionvc: 6c03f2df-b836-448d-86a4-488c4218c8b9 --><\/h2>\n<ol>\n<li><strong>Naturalness: <\/strong>The converted speech sounds even more human-like and natural, with much improved filler-sound handling. Here, expert-rated naturalness scores improved by <strong>+14%<\/strong>. Crowdsourced evaluations confirm it with a <strong>+6%<\/strong> gain.<\/li>\n<li><strong>Voice Stability:<\/strong> Enhanced consistency in pitch and tone throughout the utterance, helping avoid unnatural fluctuations, especially for thick accents. This contributed to improved naturalness and clarity scores across all metrics.<\/li>\n<li><strong>Speech &amp; Audio Clarity:<\/strong> Improvements were noted in both intelligibility and the reduction of artifacts and distortions. Speech Clarity scores rose by 5% in expert assessments, with corresponding enhancements across <a href=\"https:\/\/www.notion.so\/Introducing-Krisp-Accent-Conversion-v3-7-24692f5cd1bb80279bb0dbf90cf91a9d?pvs=21\">Meta metrics<\/a>.<\/li>\n<li><strong>Pronunciation Accuracy:<\/strong> There\u2019s a gain in objective metrics as well, about a 4% relative improvement in <a href=\"https:\/\/www.notion.so\/Introducing-Krisp-Accent-Conversion-v3-7-24692f5cd1bb80279bb0dbf90cf91a9d?pvs=21\">Phoneme Error Rate (PER)<\/a>, which can be attributed to more conversational data inclusion in the training. Here, some noticeable accent-specific enhancements in phoneme pronunciation, such as more native-like articulation of \u201cR\u201d and \u201cL\u201d, contribute to a +5% increase in the Accent Conversion score.<\/li>\n<\/ol>\n<p><!-- notionvc: 4a2c1e82-8ece-434e-a9be-84754d888af1 --><\/p>\n<h2>Evaluation Results<\/h2>\n<p>For subjective and objective evaluations, 78 real-world recordings were sampled.<\/p>\n<p>For the crowdsourced evaluation, each recording received exactly 30 independent votes to ensure statistical confidence, 2340 total votes.<\/p>\n<p>The results shown in the table below represent aggregated averages across all recordings.<\/p>\n<table style=\"border-collapse: collapse; width: 93.6278%; height: 312px;\">\n<thead>\n<tr style=\"height: 24px;\">\n<th style=\"width: 25.1078%; height: 24px;\"><strong>Metric<!-- notionvc: 0fa58f4d-2589-4e85-92ca-3c190f812a2d --><\/strong><\/th>\n<th style=\"width: 12.8391%; height: 24px;\"><strong>IN AC v3.5<!-- notionvc: bfefb204-ce69-459e-87f4-c997de646467 --><\/strong><\/th>\n<th style=\"width: 16.6701%; height: 24px;\"><strong>IN AC v3.7<!-- notionvc: 2c4f5d78-5faa-4707-89f9-4cf120e0fcba --><\/strong><\/th>\n<th style=\"width: 25.9328%; height: 24px;\"><strong>Comment<!-- notionvc: b81ca115-ac48-4a76-a49b-f6f8a8263846 --><\/strong><\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr style=\"height: 48px;\">\n<td style=\"width: 25.1078%; height: 48px;\"><span class=\"notion-enable-hover\" data-token-index=\"0\">Expert Evaluation &#8211; Natural speech <\/span><span class=\"notion-enable-hover\" data-token-index=\"1\">(1 to 5)<\/span><!-- notionvc: 7b5f1bfa-f0d8-498b-8629-5c61893009ce --><\/td>\n<td id=\";&gt;]l\" class=\"\" style=\"width: 12.8391%; height: 48px;\">3.7<\/td>\n<td id=\"y&lt;;p\" class=\"\" style=\"width: 16.6701%; height: 48px;\"><strong><img loading=\"lazy\" class=\"alignnone wp-image-21642\" src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/arrow-up.png\" alt=\"\" width=\"16\" height=\"16\" srcset=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/arrow-up.png 200w, https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/arrow-up-150x150.png 150w\" sizes=\"(max-width: 16px) 100vw, 16px\" \/>4.2 (+14%)<\/strong><\/td>\n<td id=\"LmO;\" class=\"\" style=\"width: 25.9328%; height: 48px;\">Speech sounds even more human-like, with much improved filler-sound handling<\/td>\n<\/tr>\n<tr style=\"height: 48px;\">\n<td style=\"width: 25.1078%; height: 48px;\"><span class=\"notion-enable-hover\" data-token-index=\"0\">Expert Evaluation &#8211; Speech Clarity <\/span><span class=\"notion-enable-hover\" data-token-index=\"1\">(1 to 5)<\/span><!-- notionvc: c6f258d4-c098-4624-9481-1b5720dac230 --><\/td>\n<td id=\";&gt;]l\" class=\"\" style=\"width: 12.8391%; height: 48px;\">4.0<\/td>\n<td id=\"y&lt;;p\" class=\"\" style=\"width: 16.6701%; height: 48px;\"><strong><img loading=\"lazy\" class=\"alignnone wp-image-21642\" src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/arrow-up.png\" alt=\"\" width=\"16\" height=\"16\" srcset=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/arrow-up.png 200w, https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/arrow-up-150x150.png 150w\" sizes=\"(max-width: 16px) 100vw, 16px\" \/>4.2 (+5%)<\/strong><\/td>\n<td id=\"LmO;\" class=\"\" style=\"width: 25.9328%; height: 48px;\">Speech is with fewer artifacts and clearer, especially in slurred and mumbling segments<\/td>\n<\/tr>\n<tr style=\"height: 48px;\">\n<td style=\"width: 25.1078%; height: 48px;\"><span class=\"notion-enable-hover\" data-token-index=\"0\">Expert Evaluation &#8211; Accent Conversion <\/span><span class=\"notion-enable-hover\" data-token-index=\"1\">(1 to 5)<\/span><!-- notionvc: 96831f94-052c-4014-af1e-f14ca8903339 --><\/td>\n<td id=\";&gt;]l\" class=\"\" style=\"width: 12.8391%; height: 48px;\">4.3<\/td>\n<td id=\"y&lt;;p\" class=\"\" style=\"width: 16.6701%; height: 48px;\"><strong><img loading=\"lazy\" class=\"alignnone wp-image-21642\" src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/arrow-up.png\" alt=\"\" width=\"16\" height=\"16\" srcset=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/arrow-up.png 200w, https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/arrow-up-150x150.png 150w\" sizes=\"(max-width: 16px) 100vw, 16px\" \/>4.5 (+5%)<\/strong><\/td>\n<td id=\"LmO;\" class=\"\" style=\"width: 25.9328%; height: 48px;\">Accent-specific enhancements in phoneme pronunciation, such as more native-like articulation of \u201cR\u201d and \u201cL\u201d<\/td>\n<\/tr>\n<tr style=\"height: 48px;\">\n<td style=\"width: 25.1078%; height: 48px;\"><span class=\"notion-enable-hover\" data-token-index=\"0\"><strong>Crowdsourced Evaluation<\/strong> &#8211; <\/span><em><span class=\"notion-enable-hover\" data-token-index=\"1\">\u201cHow natural does the voice sound?\u201d <\/span><\/em><span class=\"notion-enable-hover\" data-token-index=\"2\">(1 to 5)<\/span><!-- notionvc: a98a20d8-07d5-4492-9134-d7a11822a4fd --><\/td>\n<td id=\";&gt;]l\" class=\"\" style=\"width: 12.8391%; height: 48px;\">3.4<\/td>\n<td id=\"y&lt;;p\" class=\"\" style=\"width: 16.6701%; height: 48px;\"><strong><img loading=\"lazy\" class=\"alignnone wp-image-21642\" src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/arrow-up.png\" alt=\"\" width=\"16\" height=\"16\" srcset=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/arrow-up.png 200w, https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/arrow-up-150x150.png 150w\" sizes=\"(max-width: 16px) 100vw, 16px\" \/>3.6 (+6%)<\/strong><\/td>\n<td id=\"LmO;\" class=\"\" style=\"width: 25.9328%; height: 48px;\">78 real-world audio recordings assessed by 30 participants<\/td>\n<\/tr>\n<tr style=\"height: 48px;\">\n<td style=\"width: 25.1078%; height: 48px;\"><span class=\"notion-enable-hover\" data-token-index=\"0\"><strong>Crowdsourced Models\u2019 Comparison<\/strong> &#8211; <\/span><span class=\"notion-enable-hover\" data-token-index=\"1\">Which option sounds more natural?<\/span><!-- notionvc: ebfbd0a1-df8b-44e7-8ad9-107137a50246 --><\/td>\n<td id=\";&gt;]l\" class=\"\" style=\"width: 12.8391%; height: 48px;\">1242<\/td>\n<td id=\"y&lt;;p\" class=\"\" style=\"width: 16.6701%; height: 48px;\"><span style=\"font-family: inherit; font-size: inherit;\">1878 <\/span><strong style=\"font-family: inherit; font-size: inherit;\">(+20%)<\/strong><\/td>\n<td id=\"LmO;\" class=\"\" style=\"width: 25.9328%; height: 48px;\">78 real-world audio recording pairs were evaluated, with each pair assessed by 40 participants<\/td>\n<\/tr>\n<tr style=\"height: 24px;\">\n<td style=\"width: 25.1078%; height: 24px;\"><span class=\"notion-enable-hover\" data-token-index=\"0\">Meta Aesthetic &#8211; Natural speech (1 to 10)<\/span><!-- notionvc: 43ad46de-995e-400b-9e0d-1016d317504a --><\/td>\n<td id=\";&gt;]l\" class=\"\" style=\"width: 12.8391%; height: 24px;\">5.6<\/td>\n<td id=\"y&lt;;p\" class=\"\" style=\"width: 16.6701%; height: 24px;\"><strong><img loading=\"lazy\" class=\"alignnone wp-image-21642\" src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/arrow-up.png\" alt=\"\" width=\"16\" height=\"16\" srcset=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/arrow-up.png 200w, https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/arrow-up-150x150.png 150w\" sizes=\"(max-width: 16px) 100vw, 16px\" \/>5.8 (+4%)<\/strong><\/td>\n<td style=\"width: 25.9328%; height: 24px;\"><\/td>\n<\/tr>\n<tr style=\"height: 24px;\">\n<td style=\"width: 25.1078%; height: 24px;\"><span class=\"notion-enable-hover\" data-token-index=\"0\">Meta Aesthetic &#8211; Speech Clarity (1 to 10)<\/span><!-- notionvc: 71259b1c-65a2-4158-92ea-e1abd88897ec --><\/td>\n<td id=\";&gt;]l\" class=\"\" style=\"width: 12.8391%; height: 24px;\">7.5<\/td>\n<td id=\"y&lt;;p\" class=\"\" style=\"width: 16.6701%; height: 24px;\"><strong><img loading=\"lazy\" class=\"alignnone wp-image-21642\" src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/arrow-up.png\" alt=\"\" width=\"16\" height=\"16\" srcset=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/arrow-up.png 200w, https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/arrow-up-150x150.png 150w\" sizes=\"(max-width: 16px) 100vw, 16px\" \/>7.6 (+1%)<\/strong><\/td>\n<td style=\"width: 25.9328%; height: 24px;\"><\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>\n<h2><strong>Comparative audio samples<\/strong><\/h2>\n<p><strong>Listening Tip:<\/strong> For the most accurate and immersive comparison between v3.5 and v3.7 Accent Conversion, we recommend using quality headphones.<\/p>\n<p>This helps highlight the improvements in clarity, naturalness, and speaker identity preservation that may be less perceptible on laptop or mobile speakers.<\/p>\n<table style=\"border-collapse: collapse; width: 99.9272%; height: 252px;\">\n<thead>\n<tr style=\"height: 24px;\">\n<th style=\"width: 3.11594%; height: 24px;\"><strong>#<\/strong><\/th>\n<th>Improvement Category<\/th>\n<th style=\"width: 21.6666%; height: 24px; vertical-align: middle;\"><strong>Original<\/strong><\/th>\n<th style=\"width: 21.5943%; height: 24px; vertical-align: middle;\"><strong>Converted AC v3.5<\/strong><\/th>\n<th style=\"width: 21.5942%; height: 24px; vertical-align: middle;\"><strong>Converted AC v3.7<\/strong><\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr style=\"height: 24px;\">\n<td style=\"width: 3.11594%; height: 24px;\">1<\/td>\n<td>Speech Naturalness<\/td>\n<td style=\"width: 21.6666%; height: 24px; vertical-align: middle;\"><audio src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/08\/1.example_3-original.wav\" controls=\"controls\"><\/audio><\/td>\n<td style=\"width: 21.5943%; height: 24px; vertical-align: middle;\"><audio src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/08\/1.example_3-v3_5.wav\" controls=\"controls\"><\/audio><\/td>\n<td style=\"width: 21.5942%; height: 24px; vertical-align: middle;\"><audio src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/08\/1.example_3-v3_7.wav\" controls=\"controls\"><\/audio><\/td>\n<\/tr>\n<tr style=\"height: 24px;\">\n<td style=\"width: 3.11594%; height: 24px;\">2<\/td>\n<td>Speech Naturalness<\/td>\n<td style=\"width: 21.6666%; height: 24px; vertical-align: middle;\"><audio src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/08\/2.example_6-original.wav\" controls=\"controls\"><\/audio><\/td>\n<td style=\"width: 21.5943%; height: 24px; vertical-align: middle;\"><audio src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/08\/2.example_6-v3_5.wav\" controls=\"controls\"><\/audio><\/td>\n<td style=\"width: 21.5942%; height: 24px; vertical-align: middle;\"><audio src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/08\/2.example_6-v3_7.wav\" controls=\"controls\"><\/audio><\/td>\n<\/tr>\n<tr style=\"height: 24px;\">\n<td style=\"width: 3.11594%; height: 24px;\">3<\/td>\n<td>Speech Naturalness<br \/>\nSpeech Clarity<\/td>\n<td style=\"width: 21.6666%; height: 24px; vertical-align: middle;\"><audio src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/08\/3.example_1-original.wav\" controls=\"controls\"><\/audio><\/td>\n<td style=\"width: 21.5943%; height: 24px; vertical-align: middle;\"><audio src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/08\/3.example_1-v3_5.wav\" controls=\"controls\"><\/audio><\/td>\n<td style=\"width: 21.5942%; height: 24px; vertical-align: middle;\"><audio src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/08\/3.example_1-v3_7.wav\" controls=\"controls\"><\/audio><\/td>\n<\/tr>\n<tr style=\"height: 24px;\">\n<td style=\"width: 3.11594%; height: 24px;\">4<\/td>\n<td>Speech Clarity<\/td>\n<td style=\"width: 21.6666%; height: 24px; vertical-align: middle;\"><audio src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/08\/IND_ML1_OG.wav\" controls=\"controls\"><\/audio><\/td>\n<td style=\"width: 21.5943%; height: 24px; vertical-align: middle;\"><audio src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/08\/IND_ML1_OG-AR_IND_v3_5.wav\" controls=\"controls\"><\/audio><\/td>\n<td style=\"width: 21.5942%; height: 24px; vertical-align: middle;\"><audio src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/08\/IND_ML1_OG-AR_IND_v3_7.wav\" controls=\"controls\"><\/audio><\/td>\n<\/tr>\n<tr style=\"height: 24px;\">\n<td style=\"width: 3.11594%; height: 24px;\">5<\/td>\n<td>Speech Clarity<br \/>\nSpeech Naturalness<br \/>\nVoice Stability<\/td>\n<td style=\"width: 21.6666%; height: 24px; vertical-align: middle;\"><audio src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/08\/5.example_7-original.wav\" controls=\"controls\"><\/audio><\/td>\n<td style=\"width: 21.5943%; height: 24px; vertical-align: middle;\"><audio src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/08\/5.example_7-original_v3_5.wav\" controls=\"controls\"><\/audio><\/td>\n<td style=\"width: 21.5942%; height: 24px; vertical-align: middle;\"><audio src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/08\/5.example_7-original_v3_7.wav\" controls=\"controls\"><\/audio><\/td>\n<\/tr>\n<tr style=\"height: 24px;\">\n<td style=\"width: 3.11594%; height: 24px;\">6<\/td>\n<td>Speech Clarity<br \/>\nSpeech Naturalness<br \/>\nVoice Stability<\/td>\n<td style=\"width: 21.6666%; height: 24px; vertical-align: middle;\"><audio src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/08\/6.example_8-original.wav\" controls=\"controls\"><\/audio><\/td>\n<td style=\"width: 21.5943%; height: 24px; vertical-align: middle;\"><audio src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/08\/6.example_8_v3_5.wav\" controls=\"controls\"><\/audio><\/td>\n<td style=\"width: 21.5942%; height: 24px; vertical-align: middle;\"><audio src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/08\/6.example_8_v3_7.wav\" controls=\"controls\"><\/audio><\/td>\n<\/tr>\n<tr style=\"height: 24px;\">\n<td style=\"width: 3.11594%; height: 24px;\">7<\/td>\n<td>Speech Naturalness<br \/>\nSpeech Clarity<\/td>\n<td style=\"width: 21.6666%; height: 24px; vertical-align: middle;\"><audio src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/08\/7.example_9-original.wav\" controls=\"controls\"><\/audio><\/td>\n<td style=\"width: 21.5943%; height: 24px; vertical-align: middle;\"><audio src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/08\/7.example_9_v3_5.wav\" controls=\"controls\"><\/audio><\/td>\n<td style=\"width: 21.5942%; height: 24px; vertical-align: middle;\"><audio src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/08\/7.example_9_v3_7.wav\" controls=\"controls\"><\/audio><\/td>\n<\/tr>\n<tr style=\"height: 24px;\">\n<td style=\"width: 3.11594%; height: 26px;\">8<\/td>\n<td>Speech Naturalness<br \/>\nSpeech Clarity<\/td>\n<td style=\"width: 21.6666%; height: 26px; vertical-align: middle;\"><audio src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/08\/8.example_10-original.wav\" controls=\"controls\"><\/audio><\/td>\n<td style=\"width: 21.5943%; height: 26px; vertical-align: middle;\"><audio src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/08\/8.example_10-v3_5.wav\" controls=\"controls\"><\/audio><\/td>\n<td style=\"width: 21.5942%; height: 26px; vertical-align: middle;\"><audio src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/08\/8.example_10_v3_7.wav\" controls=\"controls\"><\/audio><\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>\n<p><!-- notionvc: 4065f587-7dd6-459b-843a-f003a7e02fae --><\/p>\n<h2>Appendix<!-- notionvc: 0e4f333a-52ef-495c-8be8-d771e4090054 --><\/h2>\n<h3><strong>Subjective Evaluation<\/strong><\/h3>\n<p>Our evaluation was conducted across <strong>two structured tracks<\/strong>: expert panel ratings and crowdsourced listener preferences, designed to capture both technical precision and human perception.<\/p>\n<p>Real-world agent calls have been sampled to represent a diverse set of speakers and input conditions, including, but not limited to<\/p>\n<ul>\n<li>Accent level &#8211; high, medium, low<\/li>\n<li>Speech rates and fluency<\/li>\n<li>Background conditions (quiet, noisy, multi-speaker environments)<\/li>\n<\/ul>\n<p>Evaluators scored each recording across four qualitative dimensions using a 5-point Likert scale:<\/p>\n<table>\n<thead>\n<tr>\n<th>Score<\/th>\n<th>Meaning<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td>5<\/td>\n<td>Excellent \/ Native-like<\/td>\n<\/tr>\n<tr>\n<td>4<\/td>\n<td>Very Good<\/td>\n<\/tr>\n<tr>\n<td>3<\/td>\n<td>Acceptable<\/td>\n<\/tr>\n<tr>\n<td>2<\/td>\n<td>Needs Improvement<\/td>\n<\/tr>\n<tr>\n<td>1<\/td>\n<td>Poor \/ Unintelligible<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<h4>1. <strong>Expert Panel Evaluation<\/strong><\/h4>\n<p>Six expert evaluators independently rated <strong>matching audio pairs<\/strong> \u2014 each pair consisting of the same original voice converted by <em>AC v3.5<\/em> and <em>AC v3.7<\/em>.<\/p>\n<p>To eliminate bias:<\/p>\n<ul>\n<li>File names were anonymized (no version markers)<\/li>\n<li>The order of samples was randomized<\/li>\n<li>Scoring was blind and individual (no group discussion)<\/li>\n<\/ul>\n<h4><strong>2. Crowdsourced Evaluation<\/strong><\/h4>\n<p>To further simulate real-world user perception, a\u00a0<strong>blind A\/B test<\/strong>\u00a0was run with a pairs of recordings: AC v3.5 vs. AC v3.7.<br \/>\n78 real-world audio recording pairs were evaluated, with each pair assessed by 40 participants, resulting in 3,120 votes overall.<\/p>\n<p>Participants were asked the following question:<br \/>\n<em><strong>&#8220;Which option sounds more natural (i.e., more human-like)?&#8221;<\/strong><\/em><\/p>\n<p><strong>Results:<\/strong><\/p>\n<ul>\n<li><strong>Version 3.5<\/strong> was selected 1242 times<\/li>\n<li><strong>Version 3.7<\/strong> was selected 1878 times<\/li>\n<\/ul>\n<h3><strong>Evaluation metrics<\/strong><\/h3>\n<p>Accent Conversion performance was measured across four key dimensions. These were selected based on real-world call center priorities such as clarity, naturalness, and robustness.<\/p>\n<table>\n<thead>\n<tr>\n<th><strong>Metric<\/strong><\/th>\n<th><strong>Description<\/strong><\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td>Accent Conversion<\/td>\n<td>How effectively the speaker\u2019s original accent is transformed into a neutral or target accent. High scores mean minimal accent leakage or trace of the original pronunciation.<\/td>\n<\/tr>\n<tr>\n<td>Speech Clarity<\/td>\n<td>Evaluates articulation, intelligibility, and absence of audio distortions, such as mumbling, muffling, or low vocal energy.<\/td>\n<\/tr>\n<tr>\n<td>Natural Speech<\/td>\n<td>Measures how closely the output resembles fluid, human-like speech, including natural variations in pitch, tone, rhythm, and intonation.<\/td>\n<\/tr>\n<tr>\n<td>Pronunciation Accuracy<\/td>\n<td>Measures how closely the converted speech matches standard American English pronunciation at the phoneme level. It evaluates whether individual sounds (vowels, consonants, syllables) are produced correctly and consistently, without distortion, misplacement, or omission, ensuring that the converted voice sounds intelligible and native-like to a U.S. listener.<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<h3><strong>Objective Evaluation<\/strong><\/h3>\n<p>For objective evaluation, the same set of recordings was processed using the\u00a0<a href=\"https:\/\/ai.meta.com\/research\/publications\/meta-audiobox-aesthetics-unified-automatic-quality-assessment-for-speech-music-and-sound\/\">Meta Audiobox Aesthetics<\/a>\u00a0and captured metrics strongly correlated to Natural Speech and Speech Clarity. Additionally, to quantify how each system impacts phoneme accuracy, all recordings were also processed using the\u00a0<a href=\"https:\/\/arxiv.org\/pdf\/2109.11680\">Facebook NN Phonemizer<\/a>, which is strongly correlated with the Accent Conversion metric.<\/p>\n<table>\n<thead>\n<tr>\n<th><strong>Objective Metric<\/strong><\/th>\n<th><strong>Interpretation<\/strong><\/th>\n<th><strong>Highly Correlated to Subjective Metric<\/strong><\/th>\n<th><strong>What It Captures<\/strong><\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td>Production Quality*<\/td>\n<td>Higher is better<\/td>\n<td>Speech Clarity<\/td>\n<td>Fidelity, presence of audio artifacts, balance, and clarity of the output signal<\/td>\n<\/tr>\n<tr>\n<td>Content Enjoyment*<\/td>\n<td>Higher is better<\/td>\n<td>Natural Speech<\/td>\n<td>Perceived naturalness, fluidity, and enjoyment of listening \u2014 akin to human listening satisfaction<\/td>\n<\/tr>\n<tr>\n<td>Phoneme Error Rate (PER)<\/td>\n<td>Lower is better<\/td>\n<td>Accent Conversion<\/td>\n<td>Measures pronunciation distortion. Lower scores mean more accurate, intelligible speech with better articulation.<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<ul>\n<li>\u00a0these metrics are derived from waveform-level analysis and do not require transcript or linguistic alignment, making them ideal for evaluating accent conversion outputs that vary in delivery and prosody.<\/li>\n<\/ul>\n<p><!-- notionvc: 7d8f50f3-c0e3-4033-8974-e7d4e65557e0 --><\/p>\n<p><!-- notionvc: b005f624-02db-4d60-8845-f96ac98b1bab --><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Krisp Accent Conversion v3, released in March 2025, marked a breakthrough moment in the evolution of our accent conversion technology. For the first time in two years, we felt the system was mature enough for wide-scale production use. &nbsp; In May 2025, we released Accent Conversion v3.5, bringing a major quality upgrade \u2014 with ~20% [&hellip;]<\/p>\n","protected":false},"author":71,"featured_media":21907,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"two_page_speed":[]},"categories":[517,417,421],"tags":[],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO Premium plugin v24.2 (Yoast SEO v23.6) - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Introducing Krisp Accent Conversion v3.7 - Krisp<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/krisp.ai\/blog\/introducing-krisp-accent-conversion-v3-7\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Introducing Krisp Accent Conversion v3.7 - Krisp\" \/>\n<meta property=\"og:description\" content=\"Krisp Accent Conversion v3, released in March 2025, marked a breakthrough moment in the evolution of our accent conversion technology. For the first time in two years, we felt the system was mature enough for wide-scale production use. &nbsp; In May 2025, we released Accent Conversion v3.5, bringing a major quality upgrade \u2014 with ~20% [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/krisp.ai\/blog\/introducing-krisp-accent-conversion-v3-7\/\" \/>\n<meta property=\"og:site_name\" content=\"Krisp\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/krispHQ\/\" \/>\n<meta property=\"article:published_time\" content=\"2025-08-07T07:55:59+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2025-08-13T13:11:09+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/08\/Ac-v3-7-technical-blog-1-2.png\" \/>\n\t<meta property=\"og:image:width\" content=\"1000\" \/>\n\t<meta property=\"og:image:height\" content=\"700\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"Krisp Engineering Team\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@krispHQ\" \/>\n<meta name=\"twitter:site\" content=\"@krispHQ\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/krisp.ai\/blog\/introducing-krisp-accent-conversion-v3-7\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/krisp.ai\/blog\/introducing-krisp-accent-conversion-v3-7\/\"},\"author\":{\"name\":\"Krisp Engineering Team\",\"@id\":\"https:\/\/krisp.ai\/blog\/#\/schema\/person\/e9f59158d89de3002958d323d2e788f5\"},\"headline\":\"Introducing Krisp Accent Conversion v3.7\",\"datePublished\":\"2025-08-07T07:55:59+00:00\",\"dateModified\":\"2025-08-13T13:11:09+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/krisp.ai\/blog\/introducing-krisp-accent-conversion-v3-7\/\"},\"wordCount\":1137,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\/\/krisp.ai\/blog\/#organization\"},\"image\":{\"@id\":\"https:\/\/krisp.ai\/blog\/introducing-krisp-accent-conversion-v3-7\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/08\/Ac-v3-7-technical-blog-1-2.png\",\"articleSection\":[\"AI Accent Conversion\",\"Company\",\"Engineering Blog\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/krisp.ai\/blog\/introducing-krisp-accent-conversion-v3-7\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/krisp.ai\/blog\/introducing-krisp-accent-conversion-v3-7\/\",\"url\":\"https:\/\/krisp.ai\/blog\/introducing-krisp-accent-conversion-v3-7\/\",\"name\":\"Introducing Krisp Accent Conversion v3.7 - Krisp\",\"isPartOf\":{\"@id\":\"https:\/\/krisp.ai\/blog\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/krisp.ai\/blog\/introducing-krisp-accent-conversion-v3-7\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/krisp.ai\/blog\/introducing-krisp-accent-conversion-v3-7\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/08\/Ac-v3-7-technical-blog-1-2.png\",\"datePublished\":\"2025-08-07T07:55:59+00:00\",\"dateModified\":\"2025-08-13T13:11:09+00:00\",\"breadcrumb\":{\"@id\":\"https:\/\/krisp.ai\/blog\/introducing-krisp-accent-conversion-v3-7\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/krisp.ai\/blog\/introducing-krisp-accent-conversion-v3-7\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/krisp.ai\/blog\/introducing-krisp-accent-conversion-v3-7\/#primaryimage\",\"url\":\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/08\/Ac-v3-7-technical-blog-1-2.png\",\"contentUrl\":\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/08\/Ac-v3-7-technical-blog-1-2.png\",\"width\":1000,\"height\":700},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/krisp.ai\/blog\/introducing-krisp-accent-conversion-v3-7\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/krisp.ai\/blog\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Introducing Krisp Accent Conversion v3.7\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/krisp.ai\/blog\/#website\",\"url\":\"https:\/\/krisp.ai\/blog\/\",\"name\":\"Krisp\",\"description\":\"Blog\",\"publisher\":{\"@id\":\"https:\/\/krisp.ai\/blog\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/krisp.ai\/blog\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/krisp.ai\/blog\/#organization\",\"name\":\"Krisp\",\"url\":\"https:\/\/krisp.ai\/blog\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/krisp.ai\/blog\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2024\/10\/K.png\",\"contentUrl\":\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2024\/10\/K.png\",\"width\":696,\"height\":696,\"caption\":\"Krisp\"},\"image\":{\"@id\":\"https:\/\/krisp.ai\/blog\/#\/schema\/logo\/image\/\"},\"sameAs\":[\"https:\/\/www.facebook.com\/krispHQ\/\",\"https:\/\/x.com\/krispHQ\",\"https:\/\/www.linkedin.com\/company\/krisphq\/\",\"https:\/\/www.youtube.com\/channel\/UCAMZinJdR9P33fZUNpuxXtg\"]},{\"@type\":\"Person\",\"@id\":\"https:\/\/krisp.ai\/blog\/#\/schema\/person\/e9f59158d89de3002958d323d2e788f5\",\"name\":\"Krisp Engineering Team\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/krisp.ai\/blog\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/26475ad8219056696662f819691ee49d?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/26475ad8219056696662f819691ee49d?s=96&d=mm&r=g\",\"caption\":\"Krisp Engineering Team\"},\"url\":\"https:\/\/krisp.ai\/blog\/author\/eng-team\/\"}]}<\/script>\n<!-- \/ Yoast SEO Premium plugin. -->","yoast_head_json":{"title":"Introducing Krisp Accent Conversion v3.7 - Krisp","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/krisp.ai\/blog\/introducing-krisp-accent-conversion-v3-7\/","og_locale":"en_US","og_type":"article","og_title":"Introducing Krisp Accent Conversion v3.7 - Krisp","og_description":"Krisp Accent Conversion v3, released in March 2025, marked a breakthrough moment in the evolution of our accent conversion technology. For the first time in two years, we felt the system was mature enough for wide-scale production use. &nbsp; In May 2025, we released Accent Conversion v3.5, bringing a major quality upgrade \u2014 with ~20% [&hellip;]","og_url":"https:\/\/krisp.ai\/blog\/introducing-krisp-accent-conversion-v3-7\/","og_site_name":"Krisp","article_publisher":"https:\/\/www.facebook.com\/krispHQ\/","article_published_time":"2025-08-07T07:55:59+00:00","article_modified_time":"2025-08-13T13:11:09+00:00","og_image":[{"width":1000,"height":700,"url":"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/08\/Ac-v3-7-technical-blog-1-2.png","type":"image\/png"}],"author":"Krisp Engineering Team","twitter_card":"summary_large_image","twitter_creator":"@krispHQ","twitter_site":"@krispHQ","schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/krisp.ai\/blog\/introducing-krisp-accent-conversion-v3-7\/#article","isPartOf":{"@id":"https:\/\/krisp.ai\/blog\/introducing-krisp-accent-conversion-v3-7\/"},"author":{"name":"Krisp Engineering Team","@id":"https:\/\/krisp.ai\/blog\/#\/schema\/person\/e9f59158d89de3002958d323d2e788f5"},"headline":"Introducing Krisp Accent Conversion v3.7","datePublished":"2025-08-07T07:55:59+00:00","dateModified":"2025-08-13T13:11:09+00:00","mainEntityOfPage":{"@id":"https:\/\/krisp.ai\/blog\/introducing-krisp-accent-conversion-v3-7\/"},"wordCount":1137,"commentCount":0,"publisher":{"@id":"https:\/\/krisp.ai\/blog\/#organization"},"image":{"@id":"https:\/\/krisp.ai\/blog\/introducing-krisp-accent-conversion-v3-7\/#primaryimage"},"thumbnailUrl":"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/08\/Ac-v3-7-technical-blog-1-2.png","articleSection":["AI Accent Conversion","Company","Engineering Blog"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/krisp.ai\/blog\/introducing-krisp-accent-conversion-v3-7\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/krisp.ai\/blog\/introducing-krisp-accent-conversion-v3-7\/","url":"https:\/\/krisp.ai\/blog\/introducing-krisp-accent-conversion-v3-7\/","name":"Introducing Krisp Accent Conversion v3.7 - Krisp","isPartOf":{"@id":"https:\/\/krisp.ai\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/krisp.ai\/blog\/introducing-krisp-accent-conversion-v3-7\/#primaryimage"},"image":{"@id":"https:\/\/krisp.ai\/blog\/introducing-krisp-accent-conversion-v3-7\/#primaryimage"},"thumbnailUrl":"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/08\/Ac-v3-7-technical-blog-1-2.png","datePublished":"2025-08-07T07:55:59+00:00","dateModified":"2025-08-13T13:11:09+00:00","breadcrumb":{"@id":"https:\/\/krisp.ai\/blog\/introducing-krisp-accent-conversion-v3-7\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/krisp.ai\/blog\/introducing-krisp-accent-conversion-v3-7\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/krisp.ai\/blog\/introducing-krisp-accent-conversion-v3-7\/#primaryimage","url":"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/08\/Ac-v3-7-technical-blog-1-2.png","contentUrl":"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/08\/Ac-v3-7-technical-blog-1-2.png","width":1000,"height":700},{"@type":"BreadcrumbList","@id":"https:\/\/krisp.ai\/blog\/introducing-krisp-accent-conversion-v3-7\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/krisp.ai\/blog\/"},{"@type":"ListItem","position":2,"name":"Introducing Krisp Accent Conversion v3.7"}]},{"@type":"WebSite","@id":"https:\/\/krisp.ai\/blog\/#website","url":"https:\/\/krisp.ai\/blog\/","name":"Krisp","description":"Blog","publisher":{"@id":"https:\/\/krisp.ai\/blog\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/krisp.ai\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/krisp.ai\/blog\/#organization","name":"Krisp","url":"https:\/\/krisp.ai\/blog\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/krisp.ai\/blog\/#\/schema\/logo\/image\/","url":"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2024\/10\/K.png","contentUrl":"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2024\/10\/K.png","width":696,"height":696,"caption":"Krisp"},"image":{"@id":"https:\/\/krisp.ai\/blog\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/krispHQ\/","https:\/\/x.com\/krispHQ","https:\/\/www.linkedin.com\/company\/krisphq\/","https:\/\/www.youtube.com\/channel\/UCAMZinJdR9P33fZUNpuxXtg"]},{"@type":"Person","@id":"https:\/\/krisp.ai\/blog\/#\/schema\/person\/e9f59158d89de3002958d323d2e788f5","name":"Krisp Engineering Team","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/krisp.ai\/blog\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/26475ad8219056696662f819691ee49d?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/26475ad8219056696662f819691ee49d?s=96&d=mm&r=g","caption":"Krisp Engineering Team"},"url":"https:\/\/krisp.ai\/blog\/author\/eng-team\/"}]}},"_links":{"self":[{"href":"https:\/\/krisp.ai\/blog\/wp-json\/wp\/v2\/posts\/21860"}],"collection":[{"href":"https:\/\/krisp.ai\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/krisp.ai\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/krisp.ai\/blog\/wp-json\/wp\/v2\/users\/71"}],"replies":[{"embeddable":true,"href":"https:\/\/krisp.ai\/blog\/wp-json\/wp\/v2\/comments?post=21860"}],"version-history":[{"count":31,"href":"https:\/\/krisp.ai\/blog\/wp-json\/wp\/v2\/posts\/21860\/revisions"}],"predecessor-version":[{"id":21976,"href":"https:\/\/krisp.ai\/blog\/wp-json\/wp\/v2\/posts\/21860\/revisions\/21976"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/krisp.ai\/blog\/wp-json\/wp\/v2\/media\/21907"}],"wp:attachment":[{"href":"https:\/\/krisp.ai\/blog\/wp-json\/wp\/v2\/media?parent=21860"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/krisp.ai\/blog\/wp-json\/wp\/v2\/categories?post=21860"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/krisp.ai\/blog\/wp-json\/wp\/v2\/tags?post=21860"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}