{"id":1364,"date":"2026-01-12T11:15:43","date_gmt":"2026-01-12T10:15:43","guid":{"rendered":"https:\/\/www.ideas.edu.pl\/?post_type=publikacje&#038;p=1364"},"modified":"2026-01-12T11:15:44","modified_gmt":"2026-01-12T10:15:44","slug":"as-good-as-it-kan-get-high-fidelity-audio-representation","status":"publish","type":"publikacje","link":"https:\/\/www.ideas.edu.pl\/en\/publikacje\/as-good-as-it-kan-get-high-fidelity-audio-representation\/","title":{"rendered":"As Good as It KAN Get: High-Fidelity Audio Representation"},"content":{"rendered":"<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p>Implicit neural representations (INR) have gained prominence for efficiently encoding multimedia data, yet their applications in audio signals remain limited. This study introduces the Kolmogorov-Arnold Network (KAN), a novel architecture using learnable activation functions, as an effective INR model for audio representation. KAN demonstrates superior perceptual performance over previous INRs, achieving the lowest Log-SpectralDistance of 1.29 and the highest Perceptual Evaluation of Speech Quality of 3.57 for 1.5 s audio. To extend KAN's utility, we propose FewSound, a hypernetwork-based architecture that enhances INR parameter updates. FewSound outperforms the state-of-the-art HyperSound, with a 33.3% improvement in MSE and 60.87% in SI-SNR. These results show KAN as a robust and adaptable audio representation with the potential for scalability and integration into various hypernetwork frameworks. The source code can be accessed at\u00a0<a href=\"https:\/\/github.com\/gmum\/fewsound.git\" target=\"_blank\" rel=\"noopener noreferrer nofollow\">this https URL<\/a>.<\/p>\n<\/blockquote>\n\n\n\n<p>Authors: Maciej Rut, Piotr Kawa, Przemys\u0142aw Spurek, Piotr Syga<\/p>\n\n\n\n<div style=\"height:64px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<div class=\"wp-block-group is-layout-grid wp-container-core-group-is-layout-e2bd5cb0 wp-block-group-is-layout-grid\"><\/div>\n\n\n\n<div class=\"wp-block-group is-layout-grid wp-container-core-group-is-layout-e2bd5cb0 wp-block-group-is-layout-grid\"><\/div>","protected":false},"template":"","nazwa-konferencji":[32],"rodzaj-publikacji":[13],"rok-publikacji":[14],"class_list":["post-1364","publikacje","type-publikacje","status-publish","hentry","nazwa-konferencji-konferencja-cikm","rodzaj-publikacji-artykul-konferencyjny","rok-publikacji-14"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.4 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>As Good as It KAN Get: High-Fidelity Audio Representation &#8226; IDEAS Instytut Badawczy<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.ideas.edu.pl\/en\/publikacje\/as-good-as-it-kan-get-high-fidelity-audio-representation\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"As Good as It KAN Get: High-Fidelity Audio Representation &#8226; IDEAS Instytut Badawczy\" \/>\n<meta property=\"og:description\" content=\"Implicit neural representations (INR) have gained prominence for efficiently encoding multimedia data, yet their applications in audio signals remain limited. This study introduces the Kolmogorov-Arnold Network (KAN), a novel architecture using learnable activation functions, as an effective INR model for audio representation. KAN demonstrates superior perceptual performance over previous INRs, achieving the lowest Log-SpectralDistance of [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.ideas.edu.pl\/en\/publikacje\/as-good-as-it-kan-get-high-fidelity-audio-representation\/\" \/>\n<meta property=\"og:site_name\" content=\"IDEAS Instytut Badawczy\" \/>\n<meta property=\"article:modified_time\" content=\"2026-01-12T10:15:44+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.ideas.edu.pl\/wp-content\/uploads\/feature-image-home.webp\" \/>\n\t<meta property=\"og:image:width\" content=\"1800\" \/>\n\t<meta property=\"og:image:height\" content=\"945\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/webp\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data1\" content=\"1 minute\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/www.ideas.edu.pl\\\/publikacje\\\/as-good-as-it-kan-get-high-fidelity-audio-representation\\\/\",\"url\":\"https:\\\/\\\/www.ideas.edu.pl\\\/publikacje\\\/as-good-as-it-kan-get-high-fidelity-audio-representation\\\/\",\"name\":\"As Good as It KAN Get: High-Fidelity Audio Representation &#8226; IDEAS Instytut Badawczy\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/www.ideas.edu.pl\\\/#website\"},\"datePublished\":\"2026-01-12T10:15:43+00:00\",\"dateModified\":\"2026-01-12T10:15:44+00:00\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/www.ideas.edu.pl\\\/publikacje\\\/as-good-as-it-kan-get-high-fidelity-audio-representation\\\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/www.ideas.edu.pl\\\/publikacje\\\/as-good-as-it-kan-get-high-fidelity-audio-representation\\\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/www.ideas.edu.pl\\\/publikacje\\\/as-good-as-it-kan-get-high-fidelity-audio-representation\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Strona g\u0142\u00f3wna\",\"item\":\"https:\\\/\\\/www.ideas.edu.pl\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Publikacje\",\"item\":\"https:\\\/\\\/www.ideas.edu.pl\\\/publikacje\\\/\"},{\"@type\":\"ListItem\",\"position\":3,\"name\":\"As Good as It KAN Get: High-Fidelity Audio Representation\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/www.ideas.edu.pl\\\/#website\",\"url\":\"https:\\\/\\\/www.ideas.edu.pl\\\/\",\"name\":\"IDEAS Instytut Badawczy\",\"description\":\"Pa\u0144stwowa jednostka badawczo-naukowa\",\"publisher\":{\"@id\":\"https:\\\/\\\/www.ideas.edu.pl\\\/#organization\"},\"alternateName\":\"IDEAS\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/www.ideas.edu.pl\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\\\/\\\/www.ideas.edu.pl\\\/#organization\",\"name\":\"IDEAS Instytut Badawczy\",\"url\":\"https:\\\/\\\/www.ideas.edu.pl\\\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/www.ideas.edu.pl\\\/#\\\/schema\\\/logo\\\/image\\\/\",\"url\":\"https:\\\/\\\/www.ideas.edu.pl\\\/wp-content\\\/uploads\\\/e90241d6e73025d0d829abc28d67cb84.svg\",\"contentUrl\":\"https:\\\/\\\/www.ideas.edu.pl\\\/wp-content\\\/uploads\\\/e90241d6e73025d0d829abc28d67cb84.svg\",\"width\":152,\"height\":43,\"caption\":\"IDEAS Instytut Badawczy\"},\"image\":{\"@id\":\"https:\\\/\\\/www.ideas.edu.pl\\\/#\\\/schema\\\/logo\\\/image\\\/\"}}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"As Good as It KAN Get: High-Fidelity Audio Representation - IDEAS Research Institute.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.ideas.edu.pl\/en\/publikacje\/as-good-as-it-kan-get-high-fidelity-audio-representation\/","og_locale":"en_US","og_type":"article","og_title":"As Good as It KAN Get: High-Fidelity Audio Representation &#8226; IDEAS Instytut Badawczy","og_description":"Implicit neural representations (INR) have gained prominence for efficiently encoding multimedia data, yet their applications in audio signals remain limited. This study introduces the Kolmogorov-Arnold Network (KAN), a novel architecture using learnable activation functions, as an effective INR model for audio representation. KAN demonstrates superior perceptual performance over previous INRs, achieving the lowest Log-SpectralDistance of [&hellip;]","og_url":"https:\/\/www.ideas.edu.pl\/en\/publikacje\/as-good-as-it-kan-get-high-fidelity-audio-representation\/","og_site_name":"IDEAS Instytut Badawczy","article_modified_time":"2026-01-12T10:15:44+00:00","og_image":[{"width":1800,"height":945,"url":"https:\/\/www.ideas.edu.pl\/wp-content\/uploads\/feature-image-home.webp","type":"image\/webp"}],"twitter_card":"summary_large_image","twitter_misc":{"Est. reading time":"1 minute"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/www.ideas.edu.pl\/publikacje\/as-good-as-it-kan-get-high-fidelity-audio-representation\/","url":"https:\/\/www.ideas.edu.pl\/publikacje\/as-good-as-it-kan-get-high-fidelity-audio-representation\/","name":"As Good as It KAN Get: High-Fidelity Audio Representation - IDEAS Research Institute.","isPartOf":{"@id":"https:\/\/www.ideas.edu.pl\/#website"},"datePublished":"2026-01-12T10:15:43+00:00","dateModified":"2026-01-12T10:15:44+00:00","breadcrumb":{"@id":"https:\/\/www.ideas.edu.pl\/publikacje\/as-good-as-it-kan-get-high-fidelity-audio-representation\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.ideas.edu.pl\/publikacje\/as-good-as-it-kan-get-high-fidelity-audio-representation\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/www.ideas.edu.pl\/publikacje\/as-good-as-it-kan-get-high-fidelity-audio-representation\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Strona g\u0142\u00f3wna","item":"https:\/\/www.ideas.edu.pl\/"},{"@type":"ListItem","position":2,"name":"Publikacje","item":"https:\/\/www.ideas.edu.pl\/publikacje\/"},{"@type":"ListItem","position":3,"name":"As Good as It KAN Get: High-Fidelity Audio Representation"}]},{"@type":"WebSite","@id":"https:\/\/www.ideas.edu.pl\/#website","url":"https:\/\/www.ideas.edu.pl\/","name":"IDEAS Research Institute","description":"State research and scientific unit","publisher":{"@id":"https:\/\/www.ideas.edu.pl\/#organization"},"alternateName":"IDEAS","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.ideas.edu.pl\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/www.ideas.edu.pl\/#organization","name":"IDEAS Research Institute","url":"https:\/\/www.ideas.edu.pl\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.ideas.edu.pl\/#\/schema\/logo\/image\/","url":"https:\/\/www.ideas.edu.pl\/wp-content\/uploads\/e90241d6e73025d0d829abc28d67cb84.svg","contentUrl":"https:\/\/www.ideas.edu.pl\/wp-content\/uploads\/e90241d6e73025d0d829abc28d67cb84.svg","width":152,"height":43,"caption":"IDEAS Instytut Badawczy"},"image":{"@id":"https:\/\/www.ideas.edu.pl\/#\/schema\/logo\/image\/"}}]}},"_links":{"self":[{"href":"https:\/\/www.ideas.edu.pl\/en\/wp-json\/wp\/v2\/publikacje\/1364","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.ideas.edu.pl\/en\/wp-json\/wp\/v2\/publikacje"}],"about":[{"href":"https:\/\/www.ideas.edu.pl\/en\/wp-json\/wp\/v2\/types\/publikacje"}],"wp:attachment":[{"href":"https:\/\/www.ideas.edu.pl\/en\/wp-json\/wp\/v2\/media?parent=1364"}],"wp:term":[{"taxonomy":"nazwa-konferencji","embeddable":true,"href":"https:\/\/www.ideas.edu.pl\/en\/wp-json\/wp\/v2\/nazwa-konferencji?post=1364"},{"taxonomy":"rodzaj-publikacji","embeddable":true,"href":"https:\/\/www.ideas.edu.pl\/en\/wp-json\/wp\/v2\/rodzaj-publikacji?post=1364"},{"taxonomy":"rok-publikacji","embeddable":true,"href":"https:\/\/www.ideas.edu.pl\/en\/wp-json\/wp\/v2\/rok-publikacji?post=1364"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}