


{"id":21602,"date":"2025-05-22T18:24:59","date_gmt":"2025-05-22T14:24:59","guid":{"rendered":"https:\/\/krisp.ai\/blog\/?p=21602"},"modified":"2025-06-30T18:10:49","modified_gmt":"2025-06-30T14:10:49","slug":"introducing-accent-conversion-v3-5","status":"publish","type":"post","link":"https:\/\/krisp.ai\/blog\/introducing-accent-conversion-v3-5\/","title":{"rendered":"Introducing Accent Conversion v3.5"},"content":{"rendered":"<p>Krisp Accent Conversion v3.5 represents a significant upgrade over the previous v3.0 release. Both Indian and Filipino accent models show consistent improvements across clarity, naturalness, and pronunciation accuracy, validated through expert evaluation, crowdsourced ratings, and objective metrics. Overall, the v3.5 models deliver clearer, more natural, and more intelligible speech while preserving speaker identity.<\/p>\n<p>&nbsp;<\/p>\n<h2>Key Improvements in AC v3.5<\/h2>\n<ol>\n<li><strong>Speech &amp; Audio Clarity:<\/strong> Major improvements in intelligibility and reduction of audio artifacts and distortions. Speech Clarity scores increased by <strong>+18%<\/strong> (Indian) and <strong>+23%<\/strong> (Filipino) in expert evaluations, with consistent boosts across Meta metrics as well.<\/li>\n<li><strong>Naturalness &amp; Fluidity:<\/strong> Speech sounds more human and expressive, with better rhythm, pacing, and filler sound handling. Expert-rated Natural Speech scores improved by <strong>+18%<\/strong> (Indian) and <strong>+20%<\/strong> (Filipino). Crowdsourced evaluations confirm this with <strong>+10%<\/strong> (Indian) and <strong>+6%<\/strong> (Filipino) gains.<\/li>\n<li><strong>Pronunciation Accuracy:<\/strong> Improved phoneme articulation and intelligibility reflected in a fairly significant <strong>10%<\/strong> reduction in Phoneme Error Rate (PER) for the Indian accent pack.<\/li>\n<li><strong>Voice Stability:<\/strong> Enhanced consistency in pitch and tone throughout the utterance, helping avoid unnatural fluctuations. This contributed to improved naturalness and clarity scores across all metrics.<\/li>\n<li><strong>Speaker Identity Retention:<\/strong> v3.5 models better preserve the original speaker\u2019s voice characteristics, resulting in more personalized and authentic-sounding output, evident in higher naturalness ratings across both subjective and objective evaluations.<\/li>\n<\/ol>\n<p>&nbsp;<\/p>\n<h3>Evaluation Results<\/h3>\n<p>For subjective and objective evaluations, 78 real-world recordings were sampled for the Indian accent pack and 57 for the Filipino accent pack.<\/p>\n<p>&nbsp;<\/p>\n<p>For the crowdsourced evaluation, each recording received exactly 30 independent votes to ensure statistical confidence \u2014 2340 total votes for Indian recordings and 1710 for Filipino recordings.<\/p>\n<p>&nbsp;<\/p>\n<p>The results shown in the table below represent aggregated averages across all recordings.<\/p>\n<p>&nbsp;<\/p>\n<p><!-- notionvc: 30d65f60-d101-4e42-a30b-9cf191cf1ab7 --><\/p>\n<table>\n<thead>\n<tr>\n<th>Metric<\/th>\n<th>IN AC v3<\/th>\n<th>IN AC v3.5<\/th>\n<th>PH AC v3<\/th>\n<th>PH AC v3.5<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td>Expert evaluation &#8211;<br \/>\nNatural speech (1 to 5)<\/td>\n<td>3.3<\/td>\n<td class=\"up\"><img loading=\"lazy\" class=\"alignnone wp-image-21652\" src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/arrow-up-1.png\" alt=\"\" width=\"19\" height=\"19\" srcset=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/arrow-up-1.png 200w, https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/arrow-up-1-150x150.png 150w\" sizes=\"(max-width: 19px) 100vw, 19px\" \/>\u00a03.9 (+18%)<\/td>\n<td>3.4<\/td>\n<td class=\"up\"><img loading=\"lazy\" class=\"alignnone wp-image-21652\" src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/arrow-up-1.png\" alt=\"\" width=\"19\" height=\"19\" srcset=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/arrow-up-1.png 200w, https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/arrow-up-1-150x150.png 150w\" sizes=\"(max-width: 19px) 100vw, 19px\" \/>\u00a04.1 (+20%)<\/td>\n<\/tr>\n<tr>\n<td>Expert evaluation &#8211;<br \/>\nSpeech clarity (1 to 5)<\/td>\n<td>3.4<\/td>\n<td class=\"up\"><img loading=\"lazy\" class=\"alignnone wp-image-21652\" src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/arrow-up-1.png\" alt=\"\" width=\"19\" height=\"19\" srcset=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/arrow-up-1.png 200w, https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/arrow-up-1-150x150.png 150w\" sizes=\"(max-width: 19px) 100vw, 19px\" \/>\u00a04.0 (+18%)<\/td>\n<td>3.4<\/td>\n<td class=\"up\"><img loading=\"lazy\" class=\"alignnone wp-image-21652\" src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/arrow-up-1.png\" alt=\"\" width=\"19\" height=\"19\" srcset=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/arrow-up-1.png 200w, https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/arrow-up-1-150x150.png 150w\" sizes=\"(max-width: 19px) 100vw, 19px\" \/>\u00a04.2 (+23%)<\/td>\n<\/tr>\n<tr>\n<td>Crowdsourced evaluation &#8211;<br \/>\n\u201c<em>How natural does<br \/>\nthe voice sound?<\/em>\u201d (1 to 5)<\/td>\n<td>3.1<\/td>\n<td class=\"up\"><img loading=\"lazy\" class=\"alignnone wp-image-21652\" src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/arrow-up-1.png\" alt=\"\" width=\"19\" height=\"19\" srcset=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/arrow-up-1.png 200w, https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/arrow-up-1-150x150.png 150w\" sizes=\"(max-width: 19px) 100vw, 19px\" \/>\u00a03.4 (+10%)<\/td>\n<td>3.3<\/td>\n<td class=\"up\"><img loading=\"lazy\" class=\"alignnone wp-image-21652\" src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/arrow-up-1.png\" alt=\"\" width=\"19\" height=\"19\" srcset=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/arrow-up-1.png 200w, https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/arrow-up-1-150x150.png 150w\" sizes=\"(max-width: 19px) 100vw, 19px\" \/>\u00a03.5 (+6%)<\/td>\n<\/tr>\n<tr>\n<td>Meta Aesthetic &#8211;<br \/>\nNatural speech (1 to 10)<\/td>\n<td>5.4<\/td>\n<td class=\"up\"><img loading=\"lazy\" class=\"alignnone wp-image-21652\" src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/arrow-up-1.png\" alt=\"\" width=\"19\" height=\"19\" srcset=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/arrow-up-1.png 200w, https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/arrow-up-1-150x150.png 150w\" sizes=\"(max-width: 19px) 100vw, 19px\" \/>\u00a05.6 (+4%)<\/td>\n<td>5.4<\/td>\n<td class=\"up\"><img loading=\"lazy\" class=\"alignnone wp-image-21652\" src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/arrow-up-1.png\" alt=\"\" width=\"19\" height=\"19\" srcset=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/arrow-up-1.png 200w, https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/arrow-up-1-150x150.png 150w\" sizes=\"(max-width: 19px) 100vw, 19px\" \/>\u00a05.7 (+6%)<\/td>\n<\/tr>\n<tr>\n<td>Meta Aesthetic &#8211;<br \/>\nSpeech clarity (1 to 10)<\/td>\n<td>7.1<\/td>\n<td class=\"up\"><img loading=\"lazy\" class=\"alignnone wp-image-21652\" src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/arrow-up-1.png\" alt=\"\" width=\"19\" height=\"19\" srcset=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/arrow-up-1.png 200w, https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/arrow-up-1-150x150.png 150w\" sizes=\"(max-width: 19px) 100vw, 19px\" \/>\u00a07.5 (+6%)<\/td>\n<td>7.1<\/td>\n<td class=\"up\"><img loading=\"lazy\" class=\"alignnone wp-image-21652\" src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/arrow-up-1.png\" alt=\"\" width=\"19\" height=\"19\" srcset=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/arrow-up-1.png 200w, https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/arrow-up-1-150x150.png 150w\" sizes=\"(max-width: 19px) 100vw, 19px\" \/>\u00a07.5 (+6%)<\/td>\n<\/tr>\n<tr>\n<td>Phoneme Error Rate (PER)<\/td>\n<td>26.1%<\/td>\n<td class=\"up\"><img loading=\"lazy\" class=\"alignnone wp-image-21651\" src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/arrow-down-1.png\" alt=\"\" width=\"19\" height=\"19\" srcset=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/arrow-down-1.png 200w, https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/arrow-down-1-150x150.png 150w\" sizes=\"(max-width: 19px) 100vw, 19px\" \/>\u00a024% (\u221210%)<\/td>\n<td>28.4%<\/td>\n<td class=\"equal\">= 28.4% (no change)<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>\n<h2><strong>Comparative audio samples<\/strong><\/h2>\n<p><strong>Listening Tip:<\/strong> For the most accurate and immersive comparison between v3.0 and v3.5 Accent Conversion, we recommend using quality headphones.<\/p>\n<p>This helps highlight the improvements in clarity, naturalness, and speaker identity preservation that may be less perceptible on laptop or mobile speakers.<\/p>\n<p><!-- notionvc: e5ee84aa-a94d-4d4a-9307-7075dc6c3ca7 --><\/p>\n<h3>Indian English accent pack<\/h3>\n<table style=\"table-layout: fixed;\" border=\"1\" cellspacing=\"0\" cellpadding=\"8\">\n<thead>\n<tr>\n<th>Improvement category<\/th>\n<th>Original speech<\/th>\n<th>Converted AC V3<\/th>\n<th>Converted AC V3.5<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td><strong>Voice stability<\/strong><\/p>\n<p><strong>Speech clarity<\/strong><\/p>\n<p><strong>Speech naturalness<\/strong><\/td>\n<td><!--[if lt IE 9]><script>document.createElement('audio');<\/script><![endif]--><br \/>\n<audio class=\"wp-audio-shortcode\" id=\"audio-21602-1\" preload=\"none\" style=\"width: 100%;\" controls=\"controls\"><source type=\"audio\/wav\" src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/MohsinSayyed_18.05_Poly_Blackwire_3220.wav?_=1\" \/><a href=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/MohsinSayyed_18.05_Poly_Blackwire_3220.wav\">https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/MohsinSayyed_18.05_Poly_Blackwire_3220.wav<\/a><\/audio><\/td>\n<td><audio class=\"wp-audio-shortcode\" id=\"audio-21602-2\" preload=\"none\" style=\"width: 100%;\" controls=\"controls\"><source type=\"audio\/wav\" src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/MohsinSayyed_18.05_Poly_Blackwire_3220-v3_1.wav?_=2\" \/><a href=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/MohsinSayyed_18.05_Poly_Blackwire_3220-v3_1.wav\">https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/MohsinSayyed_18.05_Poly_Blackwire_3220-v3_1.wav<\/a><\/audio><\/td>\n<td><audio class=\"wp-audio-shortcode\" id=\"audio-21602-3\" preload=\"none\" style=\"width: 100%;\" controls=\"controls\"><source type=\"audio\/wav\" src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/MohsinSayyed_18.05_Poly_Blackwire_3220-v3_5.wav?_=3\" \/><a href=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/MohsinSayyed_18.05_Poly_Blackwire_3220-v3_5.wav\">https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/MohsinSayyed_18.05_Poly_Blackwire_3220-v3_5.wav<\/a><\/audio><\/td>\n<\/tr>\n<tr>\n<td><strong>Voice stability<\/strong><\/p>\n<p><strong>Speech clarity<\/strong><\/p>\n<p><strong>Speech naturalness<\/strong><\/td>\n<td><audio class=\"wp-audio-shortcode\" id=\"audio-21602-4\" preload=\"none\" style=\"width: 100%;\" controls=\"controls\"><source type=\"audio\/wav\" src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/Sumitra_Deb.wav?_=4\" \/><a href=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/Sumitra_Deb.wav\">https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/Sumitra_Deb.wav<\/a><\/audio><\/td>\n<td><audio class=\"wp-audio-shortcode\" id=\"audio-21602-5\" preload=\"none\" style=\"width: 100%;\" controls=\"controls\"><source type=\"audio\/wav\" src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/Sumitra_Deb-v3_1.wav?_=5\" \/><a href=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/Sumitra_Deb-v3_1.wav\">https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/Sumitra_Deb-v3_1.wav<\/a><\/audio><\/td>\n<td><audio class=\"wp-audio-shortcode\" id=\"audio-21602-6\" preload=\"none\" style=\"width: 100%;\" controls=\"controls\"><source type=\"audio\/wav\" src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/Sumitra_Deb-v3_5.wav?_=6\" \/><a href=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/Sumitra_Deb-v3_5.wav\">https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/Sumitra_Deb-v3_5.wav<\/a><\/audio><\/td>\n<\/tr>\n<tr>\n<td><strong>Speech clarity<\/strong><\/p>\n<p><strong>Speech naturalness<\/strong><\/p>\n<p><!-- notionvc: 9d0e59d5-3115-40a7-aff4-8608b41c641f --><\/td>\n<td><audio class=\"wp-audio-shortcode\" id=\"audio-21602-7\" preload=\"none\" style=\"width: 100%;\" controls=\"controls\"><source type=\"audio\/wav\" src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/ITC_2_02.07.wav?_=7\" \/><a href=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/ITC_2_02.07.wav\">https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/ITC_2_02.07.wav<\/a><\/audio><\/td>\n<td><audio class=\"wp-audio-shortcode\" id=\"audio-21602-8\" preload=\"none\" style=\"width: 100%;\" controls=\"controls\"><source type=\"audio\/wav\" src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/ITC_2_02.07-v3_1.wav?_=8\" \/><a href=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/ITC_2_02.07-v3_1.wav\">https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/ITC_2_02.07-v3_1.wav<\/a><\/audio><\/td>\n<td><audio class=\"wp-audio-shortcode\" id=\"audio-21602-9\" preload=\"none\" style=\"width: 100%;\" controls=\"controls\"><source type=\"audio\/wav\" src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/ITC_2_02.07-v3_5.wav?_=9\" \/><a href=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/ITC_2_02.07-v3_5.wav\">https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/ITC_2_02.07-v3_5.wav<\/a><\/audio><\/td>\n<\/tr>\n<tr>\n<td><strong>Speech clarity<\/strong><\/p>\n<p><strong>Speech naturalness<\/strong><\/p>\n<p><strong>Audio quality<\/strong><\/td>\n<td><audio class=\"wp-audio-shortcode\" id=\"audio-21602-10\" preload=\"none\" style=\"width: 100%;\" controls=\"controls\"><source type=\"audio\/wav\" src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/MO_1_IND_Jabra_Evolve_30_II_15_04_2025.wav?_=10\" \/><a href=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/MO_1_IND_Jabra_Evolve_30_II_15_04_2025.wav\">https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/MO_1_IND_Jabra_Evolve_30_II_15_04_2025.wav<\/a><\/audio><\/td>\n<td><audio class=\"wp-audio-shortcode\" id=\"audio-21602-11\" preload=\"none\" style=\"width: 100%;\" controls=\"controls\"><source type=\"audio\/wav\" src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/MO_1_IND_Jabra_Evolve_30_II_15_04_2025-v3_1.wav?_=11\" \/><a href=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/MO_1_IND_Jabra_Evolve_30_II_15_04_2025-v3_1.wav\">https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/MO_1_IND_Jabra_Evolve_30_II_15_04_2025-v3_1.wav<\/a><\/audio><\/td>\n<td><audio class=\"wp-audio-shortcode\" id=\"audio-21602-12\" preload=\"none\" style=\"width: 100%;\" controls=\"controls\"><source type=\"audio\/wav\" src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/MO_1_IND_Jabra_Evolve_30_II_15_04_2025-v3_5.wav?_=12\" \/><a href=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/MO_1_IND_Jabra_Evolve_30_II_15_04_2025-v3_5.wav\">https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/MO_1_IND_Jabra_Evolve_30_II_15_04_2025-v3_5.wav<\/a><\/audio><\/td>\n<\/tr>\n<tr>\n<td><strong>Speech clarity<\/strong><\/p>\n<p><strong>Speech naturalness<\/strong><\/td>\n<td><audio class=\"wp-audio-shortcode\" id=\"audio-21602-13\" preload=\"none\" style=\"width: 100%;\" controls=\"controls\"><source type=\"audio\/wav\" src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/1_IND_Jabra_Evolve_20_01_04_2025.wav?_=13\" \/><a href=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/1_IND_Jabra_Evolve_20_01_04_2025.wav\">https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/1_IND_Jabra_Evolve_20_01_04_2025.wav<\/a><\/audio><\/td>\n<td><audio class=\"wp-audio-shortcode\" id=\"audio-21602-14\" preload=\"none\" style=\"width: 100%;\" controls=\"controls\"><source type=\"audio\/wav\" src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/1_IND_Jabra_Evolve_20_01_04_2025-v3_1.wav?_=14\" \/><a href=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/1_IND_Jabra_Evolve_20_01_04_2025-v3_1.wav\">https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/1_IND_Jabra_Evolve_20_01_04_2025-v3_1.wav<\/a><\/audio><\/td>\n<td><audio class=\"wp-audio-shortcode\" id=\"audio-21602-15\" preload=\"none\" style=\"width: 100%;\" controls=\"controls\"><source type=\"audio\/wav\" src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/1_IND_Jabra_Evolve_20_01_04_2025-v3_5.wav?_=15\" \/><a href=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/1_IND_Jabra_Evolve_20_01_04_2025-v3_5.wav\">https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/1_IND_Jabra_Evolve_20_01_04_2025-v3_5.wav<\/a><\/audio><\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<h3><\/h3>\n<h3>Filipino English accent pack<\/h3>\n<table style=\"table-layout: fixed;\" border=\"1\" cellspacing=\"0\" cellpadding=\"8\">\n<thead>\n<tr>\n<th>Improvement category<\/th>\n<th>Original speech<\/th>\n<th>Converted AC V3<\/th>\n<th>Converted AC V3.5<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td><strong>Audio quality<\/strong><\/p>\n<p><strong>Speaker identity<br \/>\npreservation<\/strong><\/p>\n<p><strong>Speech naturalness<\/strong><\/td>\n<td><audio class=\"wp-audio-shortcode\" id=\"audio-21602-16\" preload=\"none\" style=\"width: 100%;\" controls=\"controls\"><source type=\"audio\/wav\" src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/4_PH_PolyDA85_06.03.25.wav?_=16\" \/><a href=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/4_PH_PolyDA85_06.03.25.wav\">https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/4_PH_PolyDA85_06.03.25.wav<\/a><\/audio><\/td>\n<td><audio class=\"wp-audio-shortcode\" id=\"audio-21602-17\" preload=\"none\" style=\"width: 100%;\" controls=\"controls\"><source type=\"audio\/wav\" src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/4_PH_PolyDA85_06.03.25-v3.wav?_=17\" \/><a href=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/4_PH_PolyDA85_06.03.25-v3.wav\">https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/4_PH_PolyDA85_06.03.25-v3.wav<\/a><\/audio><\/td>\n<td><audio class=\"wp-audio-shortcode\" id=\"audio-21602-18\" preload=\"none\" style=\"width: 100%;\" controls=\"controls\"><source type=\"audio\/wav\" src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/4_PH_PolyDA85_06.03.25-v3.5.wav?_=18\" \/><a href=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/4_PH_PolyDA85_06.03.25-v3.5.wav\">https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/4_PH_PolyDA85_06.03.25-v3.5.wav<\/a><\/audio><\/td>\n<\/tr>\n<tr>\n<td><strong>Audio quality<\/strong><\/td>\n<td><audio class=\"wp-audio-shortcode\" id=\"audio-21602-19\" preload=\"none\" style=\"width: 100%;\" controls=\"controls\"><source type=\"audio\/wav\" src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/PH_2_PolyDA85_06.03.25-2.wav?_=19\" \/><a href=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/PH_2_PolyDA85_06.03.25-2.wav\">https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/PH_2_PolyDA85_06.03.25-2.wav<\/a><\/audio><\/td>\n<td><audio class=\"wp-audio-shortcode\" id=\"audio-21602-20\" preload=\"none\" style=\"width: 100%;\" controls=\"controls\"><source type=\"audio\/wav\" src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/PH_2_PolyDA85_06.03.25-v3-2.wav?_=20\" \/><a href=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/PH_2_PolyDA85_06.03.25-v3-2.wav\">https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/PH_2_PolyDA85_06.03.25-v3-2.wav<\/a><\/audio><\/td>\n<td><audio class=\"wp-audio-shortcode\" id=\"audio-21602-21\" preload=\"none\" style=\"width: 100%;\" controls=\"controls\"><source type=\"audio\/wav\" src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/PH_2_PolyDA85_06.03.25-v3.5-2.wav?_=21\" \/><a href=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/PH_2_PolyDA85_06.03.25-v3.5-2.wav\">https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/PH_2_PolyDA85_06.03.25-v3.5-2.wav<\/a><\/audio><\/td>\n<\/tr>\n<tr>\n<td><strong>Speech clarity<\/strong><\/p>\n<p><strong>Speech naturalness<\/strong><\/td>\n<td><audio class=\"wp-audio-shortcode\" id=\"audio-21602-22\" preload=\"none\" style=\"width: 100%;\" controls=\"controls\"><source type=\"audio\/wav\" src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/John_WNS_PH_PolyDA75_08.01.25.wav?_=22\" \/><a href=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/John_WNS_PH_PolyDA75_08.01.25.wav\">https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/John_WNS_PH_PolyDA75_08.01.25.wav<\/a><\/audio><\/td>\n<td><audio class=\"wp-audio-shortcode\" id=\"audio-21602-23\" preload=\"none\" style=\"width: 100%;\" controls=\"controls\"><source type=\"audio\/wav\" src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/John_WNS_PH_PolyDA75_08.01.25-v3.wav?_=23\" \/><a href=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/John_WNS_PH_PolyDA75_08.01.25-v3.wav\">https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/John_WNS_PH_PolyDA75_08.01.25-v3.wav<\/a><\/audio><\/td>\n<td><audio class=\"wp-audio-shortcode\" id=\"audio-21602-24\" preload=\"none\" style=\"width: 100%;\" controls=\"controls\"><source type=\"audio\/wav\" src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/John_WNS_PH_PolyDA75_08.01.25-v3.5.wav?_=24\" \/><a href=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/John_WNS_PH_PolyDA75_08.01.25-v3.5.wav\">https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/John_WNS_PH_PolyDA75_08.01.25-v3.5.wav<\/a><\/audio><\/td>\n<\/tr>\n<tr>\n<td><strong>Audio quality<\/strong><\/p>\n<p><strong>Speaker identity<br \/>\npreservation<\/strong><\/td>\n<td><audio class=\"wp-audio-shortcode\" id=\"audio-21602-25\" preload=\"none\" style=\"width: 100%;\" controls=\"controls\"><source type=\"audio\/wav\" src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/PH_PolyQA85_11.02.wav?_=25\" \/><a href=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/PH_PolyQA85_11.02.wav\">https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/PH_PolyQA85_11.02.wav<\/a><\/audio><\/td>\n<td><audio class=\"wp-audio-shortcode\" id=\"audio-21602-26\" preload=\"none\" style=\"width: 100%;\" controls=\"controls\"><source type=\"audio\/wav\" src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/PH_PolyQA85_11.02-v3.wav?_=26\" \/><a href=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/PH_PolyQA85_11.02-v3.wav\">https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/PH_PolyQA85_11.02-v3.wav<\/a><\/audio><\/td>\n<td><audio class=\"wp-audio-shortcode\" id=\"audio-21602-27\" preload=\"none\" style=\"width: 100%;\" controls=\"controls\"><source type=\"audio\/wav\" src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/PH_PolyQA85_11.02-v3.5.wav?_=27\" \/><a href=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/PH_PolyQA85_11.02-v3.5.wav\">https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/PH_PolyQA85_11.02-v3.5.wav<\/a><\/audio><\/td>\n<\/tr>\n<tr>\n<td><strong>Audio quality<\/strong><\/p>\n<p><strong>Speech clarity<\/strong><\/p>\n<p><strong>Speech naturalness<\/strong><\/td>\n<td><audio class=\"wp-audio-shortcode\" id=\"audio-21602-28\" preload=\"none\" style=\"width: 100%;\" controls=\"controls\"><source type=\"audio\/wav\" src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/3_PHI_Jabra_Evolve_20MS_29_04_2025-2.wav?_=28\" \/><a href=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/3_PHI_Jabra_Evolve_20MS_29_04_2025-2.wav\">https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/3_PHI_Jabra_Evolve_20MS_29_04_2025-2.wav<\/a><\/audio><\/td>\n<td><audio class=\"wp-audio-shortcode\" id=\"audio-21602-29\" preload=\"none\" style=\"width: 100%;\" controls=\"controls\"><source type=\"audio\/wav\" src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/3_PHI_Jabra_Evolve_20MS_29_04_2025-2-v3.wav?_=29\" \/><a href=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/3_PHI_Jabra_Evolve_20MS_29_04_2025-2-v3.wav\">https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/3_PHI_Jabra_Evolve_20MS_29_04_2025-2-v3.wav<\/a><\/audio><\/td>\n<td><audio class=\"wp-audio-shortcode\" id=\"audio-21602-30\" preload=\"none\" style=\"width: 100%;\" controls=\"controls\"><source type=\"audio\/wav\" src=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/3_PHI_Jabra_Evolve_20MS_29_04_2025-2-v3.5.wav?_=30\" \/><a href=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/3_PHI_Jabra_Evolve_20MS_29_04_2025-2-v3.5.wav\">https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/3_PHI_Jabra_Evolve_20MS_29_04_2025-2-v3.5.wav<\/a><\/audio><\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>\n<h2>Appendix<\/h2>\n<h2>Subjective evaluation<\/h2>\n<p>&nbsp;<\/p>\n<p>Our evaluation was conducted across <strong>two structured tracks<\/strong>: expert panel ratings and crowdsourced listener preferences, designed to capture both technical precision and human perception.<\/p>\n<p>&nbsp;<\/p>\n<p>Real-world agent calls have been sampled to represent a diverse set of speakers and input conditions, including, but not limited to<\/p>\n<p>&nbsp;<\/p>\n<ul>\n<li>Accent level &#8211; high, medium, low<\/li>\n<li>Speech rates and fluency<\/li>\n<li>Background conditions (quiet, noisy, multi-speaker environments)<\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n<p>Evaluators scored each recording across four qualitative dimensions using a 5-point Likert scale:<\/p>\n<table>\n<thead>\n<tr>\n<th>Score<\/th>\n<th>Meaning<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td>5<\/td>\n<td>Excellent \/ Native-like<\/td>\n<\/tr>\n<tr>\n<td>4<\/td>\n<td>Very Good<\/td>\n<\/tr>\n<tr>\n<td>3<\/td>\n<td>Acceptable<\/td>\n<\/tr>\n<tr>\n<td>2<\/td>\n<td>Needs Improvement<\/td>\n<\/tr>\n<tr>\n<td>1<\/td>\n<td>Poor \/ Unintelligible<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<h3><\/h3>\n<h3>1. <strong>Expert Panel Evaluation<\/strong><\/h3>\n<p>&nbsp;<\/p>\n<p>Six expert evaluators independently rated <strong>matching audio pairs<\/strong> \u2014 each pair consisting of the same original voice converted by <em>AC v3<\/em> and <em>AC v3.5<\/em>.<\/p>\n<p>&nbsp;<\/p>\n<p>To eliminate bias:<\/p>\n<p>&nbsp;<\/p>\n<ul>\n<li>File names were anonymized (no version markers)<\/li>\n<li>The order of samples was randomized<\/li>\n<li>Scoring was blind and individual (no group discussion)<\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n<h3><strong>2. Crowdsourced Evaluation<\/strong><\/h3>\n<p>To further simulate real-world user perception, a\u00a0<strong>blind A\/B\/C test<\/strong>\u00a0was run with a trio of recordings: original vs. AC v3 vs. AC v3.5.<\/p>\n<p>&nbsp;<\/p>\n<p>Respondents asked a single question &#8211; <em>\u201cHow natural does the voice sound?\u201d<\/em>, and scored recordings using the same 5-point Likert scale:<\/p>\n<p>&nbsp;<\/p>\n<h3><strong>Evaluation metrics<\/strong><\/h3>\n<p>Accent Conversion performance was measured across four key subjective and objective dimensions. These were selected based on real-world call center priorities such as clarity, naturalness, and robustness.<\/p>\n<table>\n<thead>\n<tr>\n<th><strong>Metric<\/strong><\/th>\n<th><strong>Description<\/strong><\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td>Accent Conversion<\/td>\n<td>How effectively the speaker\u2019s original accent is transformed into a neutral or target accent.<br \/>\nHigh scores mean minimal accent leakage or trace of the original pronunciation.<\/td>\n<\/tr>\n<tr>\n<td>Speech Clarity<\/td>\n<td>Evaluates articulation, intelligibility, and absence of audio distortions,<br \/>\nsuch as mumbling, muffling, or low vocal energy.<\/td>\n<\/tr>\n<tr>\n<td>Natural Speech<\/td>\n<td>Measures how closely the output resembles fluid, human-like speech,<br \/>\nincluding natural variations in pitch, tone, rhythm, and intonation.<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>\n<h2>Objective evaluation<\/h2>\n<p>For objective evaluation, the same set of recordings was processed using the\u00a0<a href=\"https:\/\/ai.meta.com\/research\/publications\/meta-audiobox-aesthetics-unified-automatic-quality-assessment-for-speech-music-and-sound\/\">Meta Audiobox Aesthetics<\/a>\u00a0and captured metrics strongly correlated to Natural Speech and Speech Clarity. Additionally, to quantify how each system impacts phoneme accuracy, all recordings were also processed using the\u00a0<a href=\"https:\/\/arxiv.org\/pdf\/2109.11680\">Facebook NN Phonemizer<\/a>, which is strongly correlated with the accent conversion metric.<\/p>\n<p>&nbsp;<\/p>\n<table>\n<thead>\n<tr>\n<th><strong>Objective metric<\/strong><\/th>\n<th><strong>Interpretation<\/strong><\/th>\n<th><strong>Highly correlated to subjective metric<\/strong><\/th>\n<th><strong>What it captures<\/strong><\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td>Production quality*<\/td>\n<td>Higher is better<\/td>\n<td>Speech clarity<\/td>\n<td>Fidelity, presence of audio artifacts, balance, and clarity of the output signal<\/td>\n<\/tr>\n<tr>\n<td>Content enjoyment*<\/td>\n<td>Higher is better<\/td>\n<td>Natural speech<\/td>\n<td>Perceived naturalness, fluidity, and enjoyment of listening \u2014 akin to human listening satisfaction<\/td>\n<\/tr>\n<tr>\n<td>Phoneme Error Rate (PER)<\/td>\n<td>Lower is better<\/td>\n<td>Accent conversion<\/td>\n<td>Measures pronunciation distortion. Lower scores mean more accurate, intelligible speech with better articulation.<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>&nbsp;<\/p>\n<p>* These metrics are derived from waveform-level analysis and do not require transcript or linguistic alignment, making them ideal for evaluating accent conversion outputs that vary in delivery and prosody.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Krisp Accent Conversion v3.5 represents a significant upgrade over the previous v3.0 release. Both Indian and Filipino accent models show consistent improvements across clarity, naturalness, and pronunciation accuracy, validated through expert evaluation, crowdsourced ratings, and objective metrics. Overall, the v3.5 models deliver clearer, more natural, and more intelligible speech while preserving speaker identity. &nbsp; Key [&hellip;]<\/p>\n","protected":false},"author":22,"featured_media":21606,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"two_page_speed":[]},"categories":[517,413],"tags":[],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO Premium plugin v24.2 (Yoast SEO v23.6) - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Introducing Accent Conversion v3.5 - Krisp<\/title>\n<meta name=\"robots\" content=\"index, nofollow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/krisp.ai\/blog\/introducing-accent-conversion-v3-5\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Introducing Accent Conversion v3.5 - Krisp\" \/>\n<meta property=\"og:description\" content=\"Krisp Accent Conversion v3.5 represents a significant upgrade over the previous v3.0 release. Both Indian and Filipino accent models show consistent improvements across clarity, naturalness, and pronunciation accuracy, validated through expert evaluation, crowdsourced ratings, and objective metrics. Overall, the v3.5 models deliver clearer, more natural, and more intelligible speech while preserving speaker identity. &nbsp; Key [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/krisp.ai\/blog\/introducing-accent-conversion-v3-5\/\" \/>\n<meta property=\"og:site_name\" content=\"Krisp\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/krispHQ\/\" \/>\n<meta property=\"article:published_time\" content=\"2025-05-22T14:24:59+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2025-06-30T14:10:49+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/Ac-v3-5-technical-blog.png\" \/>\n\t<meta property=\"og:image:width\" content=\"1000\" \/>\n\t<meta property=\"og:image:height\" content=\"700\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"Krisp Team\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@krispHQ\" \/>\n<meta name=\"twitter:site\" content=\"@krispHQ\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/krisp.ai\/blog\/introducing-accent-conversion-v3-5\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/krisp.ai\/blog\/introducing-accent-conversion-v3-5\/\"},\"author\":{\"name\":\"Krisp Team\",\"@id\":\"https:\/\/krisp.ai\/blog\/#\/schema\/person\/0496a17834794b226cc0925eabe55a2d\"},\"headline\":\"Introducing Accent Conversion v3.5\",\"datePublished\":\"2025-05-22T14:24:59+00:00\",\"dateModified\":\"2025-06-30T14:10:49+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/krisp.ai\/blog\/introducing-accent-conversion-v3-5\/\"},\"wordCount\":1314,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\/\/krisp.ai\/blog\/#organization\"},\"image\":{\"@id\":\"https:\/\/krisp.ai\/blog\/introducing-accent-conversion-v3-5\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/Ac-v3-5-technical-blog.png\",\"articleSection\":[\"AI Accent Conversion\",\"Enterprise\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/krisp.ai\/blog\/introducing-accent-conversion-v3-5\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/krisp.ai\/blog\/introducing-accent-conversion-v3-5\/\",\"url\":\"https:\/\/krisp.ai\/blog\/introducing-accent-conversion-v3-5\/\",\"name\":\"Introducing Accent Conversion v3.5 - Krisp\",\"isPartOf\":{\"@id\":\"https:\/\/krisp.ai\/blog\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/krisp.ai\/blog\/introducing-accent-conversion-v3-5\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/krisp.ai\/blog\/introducing-accent-conversion-v3-5\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/Ac-v3-5-technical-blog.png\",\"datePublished\":\"2025-05-22T14:24:59+00:00\",\"dateModified\":\"2025-06-30T14:10:49+00:00\",\"breadcrumb\":{\"@id\":\"https:\/\/krisp.ai\/blog\/introducing-accent-conversion-v3-5\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/krisp.ai\/blog\/introducing-accent-conversion-v3-5\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/krisp.ai\/blog\/introducing-accent-conversion-v3-5\/#primaryimage\",\"url\":\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/Ac-v3-5-technical-blog.png\",\"contentUrl\":\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/Ac-v3-5-technical-blog.png\",\"width\":1000,\"height\":700},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/krisp.ai\/blog\/introducing-accent-conversion-v3-5\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/krisp.ai\/blog\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Introducing Accent Conversion v3.5\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/krisp.ai\/blog\/#website\",\"url\":\"https:\/\/krisp.ai\/blog\/\",\"name\":\"Krisp\",\"description\":\"Blog\",\"publisher\":{\"@id\":\"https:\/\/krisp.ai\/blog\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/krisp.ai\/blog\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/krisp.ai\/blog\/#organization\",\"name\":\"Krisp\",\"url\":\"https:\/\/krisp.ai\/blog\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/krisp.ai\/blog\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2024\/10\/K.png\",\"contentUrl\":\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2024\/10\/K.png\",\"width\":696,\"height\":696,\"caption\":\"Krisp\"},\"image\":{\"@id\":\"https:\/\/krisp.ai\/blog\/#\/schema\/logo\/image\/\"},\"sameAs\":[\"https:\/\/www.facebook.com\/krispHQ\/\",\"https:\/\/x.com\/krispHQ\",\"https:\/\/www.linkedin.com\/company\/krisphq\/\",\"https:\/\/www.youtube.com\/channel\/UCAMZinJdR9P33fZUNpuxXtg\"]},{\"@type\":\"Person\",\"@id\":\"https:\/\/krisp.ai\/blog\/#\/schema\/person\/0496a17834794b226cc0925eabe55a2d\",\"name\":\"Krisp Team\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/krisp.ai\/blog\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2023\/10\/cropped-Favicon-96x96.png\",\"contentUrl\":\"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2023\/10\/cropped-Favicon-96x96.png\",\"caption\":\"Krisp Team\"},\"description\":\"Here at Krisp, we are passionate about making your life more productive and easy by building noise cancelling app that removes background noise during calls.\",\"url\":\"https:\/\/krisp.ai\/blog\/author\/krisp-team\/\"}]}<\/script>\n<!-- \/ Yoast SEO Premium plugin. -->","yoast_head_json":{"title":"Introducing Accent Conversion v3.5 - Krisp","robots":{"index":"index","follow":"nofollow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/krisp.ai\/blog\/introducing-accent-conversion-v3-5\/","og_locale":"en_US","og_type":"article","og_title":"Introducing Accent Conversion v3.5 - Krisp","og_description":"Krisp Accent Conversion v3.5 represents a significant upgrade over the previous v3.0 release. Both Indian and Filipino accent models show consistent improvements across clarity, naturalness, and pronunciation accuracy, validated through expert evaluation, crowdsourced ratings, and objective metrics. Overall, the v3.5 models deliver clearer, more natural, and more intelligible speech while preserving speaker identity. &nbsp; Key [&hellip;]","og_url":"https:\/\/krisp.ai\/blog\/introducing-accent-conversion-v3-5\/","og_site_name":"Krisp","article_publisher":"https:\/\/www.facebook.com\/krispHQ\/","article_published_time":"2025-05-22T14:24:59+00:00","article_modified_time":"2025-06-30T14:10:49+00:00","og_image":[{"width":1000,"height":700,"url":"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/Ac-v3-5-technical-blog.png","type":"image\/png"}],"author":"Krisp Team","twitter_card":"summary_large_image","twitter_creator":"@krispHQ","twitter_site":"@krispHQ","schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/krisp.ai\/blog\/introducing-accent-conversion-v3-5\/#article","isPartOf":{"@id":"https:\/\/krisp.ai\/blog\/introducing-accent-conversion-v3-5\/"},"author":{"name":"Krisp Team","@id":"https:\/\/krisp.ai\/blog\/#\/schema\/person\/0496a17834794b226cc0925eabe55a2d"},"headline":"Introducing Accent Conversion v3.5","datePublished":"2025-05-22T14:24:59+00:00","dateModified":"2025-06-30T14:10:49+00:00","mainEntityOfPage":{"@id":"https:\/\/krisp.ai\/blog\/introducing-accent-conversion-v3-5\/"},"wordCount":1314,"commentCount":0,"publisher":{"@id":"https:\/\/krisp.ai\/blog\/#organization"},"image":{"@id":"https:\/\/krisp.ai\/blog\/introducing-accent-conversion-v3-5\/#primaryimage"},"thumbnailUrl":"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/Ac-v3-5-technical-blog.png","articleSection":["AI Accent Conversion","Enterprise"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/krisp.ai\/blog\/introducing-accent-conversion-v3-5\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/krisp.ai\/blog\/introducing-accent-conversion-v3-5\/","url":"https:\/\/krisp.ai\/blog\/introducing-accent-conversion-v3-5\/","name":"Introducing Accent Conversion v3.5 - Krisp","isPartOf":{"@id":"https:\/\/krisp.ai\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/krisp.ai\/blog\/introducing-accent-conversion-v3-5\/#primaryimage"},"image":{"@id":"https:\/\/krisp.ai\/blog\/introducing-accent-conversion-v3-5\/#primaryimage"},"thumbnailUrl":"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/Ac-v3-5-technical-blog.png","datePublished":"2025-05-22T14:24:59+00:00","dateModified":"2025-06-30T14:10:49+00:00","breadcrumb":{"@id":"https:\/\/krisp.ai\/blog\/introducing-accent-conversion-v3-5\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/krisp.ai\/blog\/introducing-accent-conversion-v3-5\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/krisp.ai\/blog\/introducing-accent-conversion-v3-5\/#primaryimage","url":"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/Ac-v3-5-technical-blog.png","contentUrl":"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2025\/05\/Ac-v3-5-technical-blog.png","width":1000,"height":700},{"@type":"BreadcrumbList","@id":"https:\/\/krisp.ai\/blog\/introducing-accent-conversion-v3-5\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/krisp.ai\/blog\/"},{"@type":"ListItem","position":2,"name":"Introducing Accent Conversion v3.5"}]},{"@type":"WebSite","@id":"https:\/\/krisp.ai\/blog\/#website","url":"https:\/\/krisp.ai\/blog\/","name":"Krisp","description":"Blog","publisher":{"@id":"https:\/\/krisp.ai\/blog\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/krisp.ai\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/krisp.ai\/blog\/#organization","name":"Krisp","url":"https:\/\/krisp.ai\/blog\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/krisp.ai\/blog\/#\/schema\/logo\/image\/","url":"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2024\/10\/K.png","contentUrl":"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2024\/10\/K.png","width":696,"height":696,"caption":"Krisp"},"image":{"@id":"https:\/\/krisp.ai\/blog\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/krispHQ\/","https:\/\/x.com\/krispHQ","https:\/\/www.linkedin.com\/company\/krisphq\/","https:\/\/www.youtube.com\/channel\/UCAMZinJdR9P33fZUNpuxXtg"]},{"@type":"Person","@id":"https:\/\/krisp.ai\/blog\/#\/schema\/person\/0496a17834794b226cc0925eabe55a2d","name":"Krisp Team","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/krisp.ai\/blog\/#\/schema\/person\/image\/","url":"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2023\/10\/cropped-Favicon-96x96.png","contentUrl":"https:\/\/krisp.ai\/blog\/wp-content\/uploads\/2023\/10\/cropped-Favicon-96x96.png","caption":"Krisp Team"},"description":"Here at Krisp, we are passionate about making your life more productive and easy by building noise cancelling app that removes background noise during calls.","url":"https:\/\/krisp.ai\/blog\/author\/krisp-team\/"}]}},"_links":{"self":[{"href":"https:\/\/krisp.ai\/blog\/wp-json\/wp\/v2\/posts\/21602"}],"collection":[{"href":"https:\/\/krisp.ai\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/krisp.ai\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/krisp.ai\/blog\/wp-json\/wp\/v2\/users\/22"}],"replies":[{"embeddable":true,"href":"https:\/\/krisp.ai\/blog\/wp-json\/wp\/v2\/comments?post=21602"}],"version-history":[{"count":21,"href":"https:\/\/krisp.ai\/blog\/wp-json\/wp\/v2\/posts\/21602\/revisions"}],"predecessor-version":[{"id":21727,"href":"https:\/\/krisp.ai\/blog\/wp-json\/wp\/v2\/posts\/21602\/revisions\/21727"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/krisp.ai\/blog\/wp-json\/wp\/v2\/media\/21606"}],"wp:attachment":[{"href":"https:\/\/krisp.ai\/blog\/wp-json\/wp\/v2\/media?parent=21602"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/krisp.ai\/blog\/wp-json\/wp\/v2\/categories?post=21602"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/krisp.ai\/blog\/wp-json\/wp\/v2\/tags?post=21602"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}