{"id":2434651,"date":"2026-05-27T21:47:58","date_gmt":"2026-05-27T21:47:58","guid":{"rendered":"https:\/\/celebrity.land\/en\/?p=2434651"},"modified":"2026-05-27T21:47:58","modified_gmt":"2026-05-27T21:47:58","slug":"elevenlabs-stability-ai-drop-new-ai-music-models-can-they-catch-suno","status":"publish","type":"post","link":"https:\/\/celebrity.land\/en\/elevenlabs-stability-ai-drop-new-ai-music-models-can-they-catch-suno\/","title":{"rendered":"ElevenLabs, Stability AI Drop New AI Music Models\u2014Can They Catch Suno?"},"content":{"rendered":"<p><\/p>\n<div style=\"position:relative;overflow:visible;font-size:1.2em;line-height:1.58\">\n<div class=\"pt-8 pb-10 border-t border-b border-decryptGridline \">\n<h4 class=\"sc-b2a202e4-4 bNRGqr gg-dark:text-white\" color=\"#333\">In brief<\/h4>\n<ul>\n<li class=\"font-meta-serif-pro font-normal text-lg md:text-xl md:leading-9 tracking-px text-body gg-dark:text-neutral-100\">ElevenLabs launched Music v2, capable of switching genres mid-track, building songs section by section, and inpainting specific parts.<\/li>\n<li class=\"font-meta-serif-pro font-normal text-lg md:text-xl md:leading-9 tracking-px text-body gg-dark:text-neutral-100\">Stability AI released Stable Audio 3.0, a four-model family with open weights for three variants, trained on licensed data, generating tracks up to six minutes and twenty seconds long.<\/li>\n<li class=\"font-meta-serif-pro font-normal text-lg md:text-xl md:leading-9 tracking-px text-body gg-dark:text-neutral-100\">Both releases lean hard into licensed training data\u2014but Suno, valued at $2.45 billion with roughly 100 million users, is still the platform most people reach for first.<\/li>\n<\/ul>\n<\/div>\n<p class=\"font-meta-serif-pro scene:font-noto-sans scene:text-base scene:md:text-lg font-normal text-lg md:text-xl md:leading-9 tracking-px text-body gg-dark:text-neutral-100\">Two significant AI music updates landed this week, and neither came from Suno.<\/p>\n<p 
class=\"font-meta-serif-pro scene:font-noto-sans scene:text-base scene:md:text-lg font-normal text-lg md:text-xl md:leading-9 tracking-px text-body gg-dark:text-neutral-100\">ElevenLabs, the Polish-founded voice AI company sitting at an $11 billion valuation after a $500 million Series D in February, launched <a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/elevenlabs.io\/blog\/introducing-music-v2\" target=\"_blank\" rel=\"nofollow external noopener\" class=\"sc-adb616fe-0 bJsyml\">Music v2<\/a>. Stability AI\u2014the Stable Diffusion people\u2014dropped <a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/stability.ai\/news-updates\/meet-stable-audio-3-the-model-family-built-for-artistic-experimentation-with-open-weight-models\" target=\"_blank\" rel=\"nofollow external noopener\" class=\"sc-adb616fe-0 bJsyml\">Stable Audio 3.0<\/a>, a four-model family with open weights and tracks that run past six minutes.<\/p>\n<p class=\"font-meta-serif-pro scene:font-noto-sans scene:text-base scene:md:text-lg font-normal text-lg md:text-xl md:leading-9 tracking-px text-body gg-dark:text-neutral-100\">The backdrop is the Recording Industry Association of America <a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/decrypt.co\/236809\/ai-music-riaa-lawsuit-suno-udio-industry-copyright\" target=\"_blank\" class=\"sc-adb616fe-0 bJsyml\">copyright suits<\/a> from 2024 against Suno and Udio, which made &#8220;trained on licensed data&#8221; the most important phrase in any AI music announcement. 
Both ElevenLabs and Stability are leaning hard on that claim, pitching it as assurance that the tracks you generate won\u2019t land you in a copyright dispute.<\/p>\n<p class=\"sc-5a71bf1f-3 fdWwrx gg-dark:text-white scene:font-itc-avant-garde-gothic-pro scene:font-light\" style=\"margin-top:2em;text-align:left\" color=\"#333\" opacity=\"1\">Music v2: One track, opera to heavy metal, no breakdown<\/p>\n<p class=\"font-meta-serif-pro scene:font-noto-sans scene:text-base scene:md:text-lg font-normal text-lg md:text-xl md:leading-9 tracking-px text-body gg-dark:text-neutral-100\">Music v2 is ElevenLabs&#8217; second music model, arriving roughly 10 months after the first. The core pitch is coherence under pressure. According to ElevenLabs, a single track can shift from opera to heavy metal and back, hold together through fast rap, and embed non-musical sound effects\u2014all without the composition coming apart.<\/p>\n<p class=\"font-meta-serif-pro scene:font-noto-sans scene:text-base scene:md:text-lg font-normal text-lg md:text-xl md:leading-9 tracking-px text-body gg-dark:text-neutral-100\">Generative audio tends to fall apart exactly when prompts get complicated, so this is the thing worth watching, especially in longer compositions.<\/p>\n<p class=\"font-meta-serif-pro scene:font-noto-sans scene:text-base scene:md:text-lg font-normal text-lg md:text-xl md:leading-9 tracking-px text-body gg-dark:text-neutral-100\">Inpainting is now actually useful: select a section, regenerate it, leave everything else untouched. 
Users can also build songs section by section\u2014intro, verse, chorus\u2014with the model maintaining continuity throughout instead of treating each clip as a standalone generation. Multilingual support has improved too, though ElevenLabs didn&#8217;t publish specifics.<\/p>\n<p class=\"font-meta-serif-pro scene:font-noto-sans scene:text-base scene:md:text-lg font-normal text-lg md:text-xl md:leading-9 tracking-px text-body gg-dark:text-neutral-100\">The model powers three platforms: ElevenMusic for creators, ElevenAPI for developers, and ElevenCreative for brands. It&#8217;s live on ElevenMusic and ElevenCreative now; API access is early-entry via the sales team.<\/p>\n<p class=\"font-meta-serif-pro scene:font-noto-sans scene:text-base scene:md:text-lg font-normal text-lg md:text-xl md:leading-9 tracking-px text-body gg-dark:text-neutral-100\">ElevenLabs also cut Music v1 and v2 pricing by up to 50% for ElevenAPI and up to 40% for ElevenCreative self-serve. The company hit <a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/sacra.com\/c\/elevenlabs\/\" target=\"_blank\" rel=\"nofollow external noopener\" class=\"sc-adb616fe-0 bJsyml\">$500 million in annual recurring revenue in April 2026<\/a>. 
Music is still a small slice of that\u2014but ElevenMusic, which launched as a consumer app in April, is a direct shot at Suno&#8217;s user base.<\/p>\n<p class=\"sc-5a71bf1f-3 fdWwrx gg-dark:text-white scene:font-itc-avant-garde-gothic-pro scene:font-light\" style=\"margin-top:2em;text-align:left\" color=\"#333\" opacity=\"1\">Stable Audio 3.0: Open weights, on-device, actually longer<\/p>\n<p class=\"font-meta-serif-pro scene:font-noto-sans scene:text-base scene:md:text-lg font-normal text-lg md:text-xl md:leading-9 tracking-px text-body gg-dark:text-neutral-100\"><a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/decrypt.co\/224729\/stability-ai-launches-stable-audio-2-0-how-does-it-stack-up-against-the-mindblowing-suno-v3\" target=\"_blank\" class=\"sc-adb616fe-0 bJsyml\">Stable Audio 2.0<\/a> topped out at three minutes and was already behind Suno when it launched in 2024. Stable Audio 3.0 ships four models: Small SFX (on-device sound effects), Small (full music composition on-device), Medium (up to 6:20, stronger hardware), and Large (API-only). Three of the four have open weights on Hugging Face.<\/p>\n<p class=\"font-meta-serif-pro scene:font-noto-sans scene:text-base scene:md:text-lg font-normal text-lg md:text-xl md:leading-9 tracking-px text-body gg-dark:text-neutral-100\">The Small models run at 459 million parameters each and need no GPU. (Parameter count is a rough measure of a model\u2019s capacity.) Medium hits 1.4 billion parameters and generates its 6:20 output in about 1.31 seconds on an H200 GPU. Large, at 2.7 billion parameters, is API-only for organizations with over $1 million in revenue. 
Per-second generation granularity means you get exactly the track length you asked for, not an approximation.<\/p>\n<p class=\"font-meta-serif-pro scene:font-noto-sans scene:text-base scene:md:text-lg font-normal text-lg md:text-xl md:leading-9 tracking-px text-body gg-dark:text-neutral-100\">It\u2019s also supported in ComfyUI for local setups.<\/p>\n<p class=\"font-meta-serif-pro scene:font-noto-sans scene:text-base scene:md:text-lg font-normal text-lg md:text-xl md:leading-9 tracking-px text-body gg-dark:text-neutral-100\">The architecture is new: a semantic-acoustic autoencoder Stability calls SAME, designed to hold melodic coherence over longer outputs. LoRA fine-tuning is supported, so artists can adapt the models to their own catalogs. Inpainting is in too\u2014single-segment, multi-segment, and causal continuation to extend a track past its original endpoint.<\/p>\n<p class=\"font-meta-serif-pro scene:font-noto-sans scene:text-base scene:md:text-lg font-normal text-lg md:text-xl md:leading-9 tracking-px text-body gg-dark:text-neutral-100\">For context, a LoRA (Low-Rank Adaptation) is a small add-on set of weights that steers how the full model generates its outputs. If you train a LoRA on blues, the model will produce blues; if you train it on B.B. King\u2019s blues, the model will produce songs that sound like B.B. King. Inpainting means the model can regenerate a selected stretch of a track without touching the rest. 
So, for example, if the model hallucinates something at the 2:30 mark, you can select a few seconds of the song, ask the model to change it into whatever you want, and the model will generate a piece of the song that fits perfectly in that timeframe and blends with the actual song as a whole.<\/p>\n<p class=\"font-meta-serif-pro scene:font-noto-sans scene:text-base scene:md:text-lg font-normal text-lg md:text-xl md:leading-9 tracking-px text-body gg-dark:text-neutral-100\">Stability has been <a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/decrypt.co\/319867\/best-ai-tools-amateur-musicians\" target=\"_blank\" class=\"sc-adb616fe-0 bJsyml\">technically credible in AI music for years<\/a> without breaking through commercially. The open-weight play is the Stable Diffusion strategy applied to audio\u2014seed the developer community, see what gets built. The licensing is cleaner than anything Stable Audio has shipped before, with partnerships in place with Universal Music Group and Warner Music Group.<\/p>\n<p class=\"sc-5a71bf1f-3 fdWwrx gg-dark:text-white scene:font-itc-avant-garde-gothic-pro scene:font-light\" style=\"margin-top:2em;text-align:left\" color=\"#333\" opacity=\"1\">The target: Suno, the AI music king<\/p>\n<p class=\"font-meta-serif-pro scene:font-noto-sans scene:text-base scene:md:text-lg font-normal text-lg md:text-xl md:leading-9 tracking-px text-body gg-dark:text-neutral-100\">If ChatGPT is the king of AI text, Suno is the king of AI music. The company behind the model hit a $2.45 billion valuation in November 2025, crossed $300 million in annual recurring revenue, and has been used by roughly 100 million people.<\/p>\n<p class=\"font-meta-serif-pro scene:font-noto-sans scene:text-base scene:md:text-lg font-normal text-lg md:text-xl md:leading-9 tracking-px text-body gg-dark:text-neutral-100\">It generates around 7 million songs per day. 
Warner Music settled its suit against Suno in November 2025; Sony and UMG are still in federal court.<\/p>\n<p class=\"font-meta-serif-pro scene:font-noto-sans scene:text-base scene:md:text-lg font-normal text-lg md:text-xl md:leading-9 tracking-px text-body gg-dark:text-neutral-100\">To avoid these copyright wars, ElevenLabs has licensing deals with Believe, Kobalt, and Merlin. Stability has Warner and Universal. Udio settled with all three majors and is now a walled garden\u2014nothing you generate can leave the platform.<\/p>\n<p class=\"font-meta-serif-pro scene:font-noto-sans scene:text-base scene:md:text-lg font-normal text-lg md:text-xl md:leading-9 tracking-px text-body gg-dark:text-neutral-100\">Stable Audio 3.0 Small and Medium are available on Hugging Face now. Large is live via the Stability AI API. Music v2 is free for ElevenMusic users, with commercial tiers through ElevenCreative and ElevenAPI.<\/p>\n<\/div>\n<p><em> \u2018 The preceding article may include information circulated by third parties \u2019 <\/em><\/p>\n<p><em> \u2018 Some details of this article were extracted from the following source decrypt.co \u2019 
<\/em><\/p>\n","protected":false},"excerpt":{"rendered":"<p>In brief ElevenLabs launched Music v2, capable of switching genres mid-track, building songs section by section, and inpainting specific parts. Stability AI released Stable Audio 3.0, a four-model family with open weights for three variants, trained on licensed data, generating tracks up to six minutes and twenty seconds long. Both releases lean hard into licensed [&hellip;]<\/p>\n","protected":false},"author":2,"featured_media":2434653,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"om_disable_all_campaigns":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"jnews-multi-image_gallery":[],"jnews_single_post":[],"jnews_primary_category":[],"jnews_social_meta":[],"footnotes":""},"categories":[25179],"tags":[],"class_list":["post-2434651","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-music"],"jetpack_featured_media_url":"https:\/\/celebrity.land\/en\/wp-content\/uploads\/2026\/05\/ElevenLabs-Stability-AI-Drop-New-AI-Music-Models\u2014Can-They-Catch.jpg","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/celebrity.land\/en\/wp-json\/wp\/v2\/posts\/2434651","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/celebrity.land\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/celebrity.land\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/celebrity.land\/en\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/celebrity.land\/en\/wp-json\/wp\/v2\/comments?post=2434651"}],"version-history":[{"count":1,"href":"https:\/\/celebrity.land\/en\/wp-json\/wp\/v2\/posts\/2434651\/revisions"}],"predecessor-version":[{"id":2434654,"href":"https:\/\/celebrity.lan
d\/en\/wp-json\/wp\/v2\/posts\/2434651\/revisions\/2434654"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/celebrity.land\/en\/wp-json\/wp\/v2\/media\/2434653"}],"wp:attachment":[{"href":"https:\/\/celebrity.land\/en\/wp-json\/wp\/v2\/media?parent=2434651"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/celebrity.land\/en\/wp-json\/wp\/v2\/categories?post=2434651"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/celebrity.land\/en\/wp-json\/wp\/v2\/tags?post=2434651"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}