{"id":2263110,"date":"2026-02-02T16:20:45","date_gmt":"2026-02-02T16:20:45","guid":{"rendered":"https:\/\/celebrity.land\/en\/?p=2263110"},"modified":"2026-02-02T16:20:45","modified_gmt":"2026-02-02T16:20:45","slug":"ux-roundup-ai-judgment-heuristic-evaluation-consistent-brand-assets","status":"publish","type":"post","link":"https:\/\/celebrity.land\/en\/ux-roundup-ai-judgment-heuristic-evaluation-consistent-brand-assets\/","title":{"rendered":"UX Roundup: AI Judgment | Heuristic Evaluation | Consistent Brand Assets"},"content":{"rendered":"<p><\/p>\n<div dir=\"auto\">\n<blockquote>\n<p><strong>Summary<\/strong><span>: AI judgment may follow a scaling law | Scaling AI\u2019s judgment of usability heuristics | Generating consistent visual design assets with AI | New AI music model: Mureka O2<\/span><\/p>\n<\/blockquote>\n<div class=\"captioned-image-container\">\n<figure><a rel=\"nofollow\" target=\"_blank\" target=\"_blank\" href=\"https:\/\/substackcdn.com\/image\/fetch\/$s_!WA6q!,f_auto,q_auto:good,fl_progressive:steep\/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0f28c5f8-c922-45ad-9e99-bf900ea6289c_937x1255.jpeg\" data-component-name=\"Image2ToDOM\" rel=\"\" class=\"image-link image2 is-viewable-img can-restack\"><\/p>\n<div class=\"image2-inset can-restack\"><picture><source type=\"image\/webp\" srcset=\"https:\/\/substackcdn.com\/image\/fetch\/$s_!WA6q!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep\/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0f28c5f8-c922-45ad-9e99-bf900ea6289c_937x1255.jpeg 424w, https:\/\/substackcdn.com\/image\/fetch\/$s_!WA6q!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep\/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0f28c5f8-c922-45ad-9e99-bf900ea6289c_937x1255.jpeg 848w, 
https:\/\/substackcdn.com\/image\/fetch\/$s_!WA6q!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep\/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0f28c5f8-c922-45ad-9e99-bf900ea6289c_937x1255.jpeg 1272w, https:\/\/substackcdn.com\/image\/fetch\/$s_!WA6q!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep\/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0f28c5f8-c922-45ad-9e99-bf900ea6289c_937x1255.jpeg 1456w\" sizes=\"100vw\"\/><\/picture><\/div>\n<p><\/a><\/figure>\n<\/div>\n<p><em>UX Roundup for February 2, 2026. (Nano Banana Pro)<\/em><\/p>\n<p><span>The \u201cbitter lesson\u201d just became <\/span><em>more bitter<\/em><span> for those humans who still believe in meatware supremacy and their \u201cunique\u201d ability to have \u201ctaste\u201d or judgment about what\u2019s best among the ceaseless AI creations. AI can also exhibit judgment, and it appears that its judgment gets better with more compute, meaning that it will likely become superior to human judgment in a few years, as AI compute keeps scaling up.<\/span><\/p>\n<p>(In general, \u201cthe bitter lesson\u201d for AI is that throwing more compute at a problem consistently outperforms approaches built on human knowledge and domain expertise. Time and again across chess, Go, speech recognition, and computer vision, researchers initially made progress by encoding human understanding into systems, but these approaches were ultimately surpassed by simpler methods that scaled with available compute. 
The \u201cbitter\u201d part is that this lesson is psychologically hard for researchers to accept: we want to believe our insights about the structure of problems matter, but history shows that betting on more compute and letting AI systems learn and work on their own wins in the long run.)<\/p>\n<p><span>A <\/span><a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/arxiv.org\/abs\/2601.07606\" rel=\"\">new research paper<\/a><span> by Bingyang Ye from Harvard and several colleagues shows that <\/span><strong>AI scales its judgment abilities with more compute<\/strong><span>. It\u2019s too early to declare this a new AI scaling law because Ye only studied a single domain: AI\u2019s ability to predict which scientific papers would later be seen as the most important.<\/span><\/p>\n<p>The study cleverly utilized the fact that there is a broadly accepted metric for the importance of academic papers: how many other scientists later cite each paper. In the study, the researchers limited their AI to run offline, using only knowledge as of a given date in the past, meaning the AI could not know how many citations each paper would later receive. (But the researchers knew the citation count, meaning that they could score the AI\u2019s judgments.)<\/p>\n<p>The interesting point here is not how good AI was at judging which research papers would become the most influential, but rather that this judgment improved with more compute, in two ways:<\/p>\n<blockquote>\n<ul>\n<li>\n<p><strong>Model training<\/strong><span>: for all three frontier AI model families in the study (Google, OpenAI, Anthropic) the newest and biggest models performed better than the older or distilled models in the same family. 
For example, Gemini 3 Pro (which was the winner among the 11 models in the study) did better than Gemini 2.5 Pro, which again did better than Gemini 2.5 Flash (a smaller model).<\/span><\/p>\n<\/li>\n<li>\n<p><strong>Think-time compute<\/strong><span>: Each of the 11 models was given low, medium, and high reasoning budgets, and it was usually the case (though not every single time) that thinking more resulted in better judgment.<\/span><\/p>\n<\/li>\n<\/ul>\n<\/blockquote>\n<p>I applaud the authors of this study for actually using the most recent AI models (Gemini 3 Pro, GPT 5.2, and Claude Opus 4.5), in addition to testing year-old models. Too often, we read research performed with older AI models, meaning that the findings are already obsolete, given the pace of AI improvements.<\/p>\n<p><span>I hope other researchers will extend this study in other domains, including areas where judgment is less clear-cut than for the citation count of academic papers. In particular, AI needs to be able to judge the quality of <\/span><a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/www.uxtigers.com\/post\/generative-ui-google\" rel=\"\">generative user interfaces<\/a><span> and the many content types it produces (writing, images, videos, etc.).<\/span><\/p>\n<p>Pending such research, we can\u2019t say for sure whether there is, in fact, a scaling law for AI judgment abilities, but I think this will very likely turn out to be the case.<\/p>\n<div class=\"captioned-image-container\">\n<figure><a rel=\"nofollow\" target=\"_blank\" target=\"_blank\" href=\"https:\/\/substackcdn.com\/image\/fetch\/$s_!h-D1!,f_auto,q_auto:good,fl_progressive:steep\/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F989ad1e6-e443-48c7-8a5d-cb7de12e8791_937x1255.jpeg\" data-component-name=\"Image2ToDOM\" rel=\"\" class=\"image-link image2 is-viewable-img can-restack\"><\/p>\n<div class=\"image2-inset can-restack\"><picture><source type=\"image\/webp\" 
srcset=\"https:\/\/substackcdn.com\/image\/fetch\/$s_!h-D1!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep\/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F989ad1e6-e443-48c7-8a5d-cb7de12e8791_937x1255.jpeg 424w, https:\/\/substackcdn.com\/image\/fetch\/$s_!h-D1!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep\/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F989ad1e6-e443-48c7-8a5d-cb7de12e8791_937x1255.jpeg 848w, https:\/\/substackcdn.com\/image\/fetch\/$s_!h-D1!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep\/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F989ad1e6-e443-48c7-8a5d-cb7de12e8791_937x1255.jpeg 1272w, https:\/\/substackcdn.com\/image\/fetch\/$s_!h-D1!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep\/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F989ad1e6-e443-48c7-8a5d-cb7de12e8791_937x1255.jpeg 1456w\" sizes=\"100vw\"\/><img decoding=\"async\" src=\"https:\/\/substackcdn.com\/image\/fetch\/$s_!h-D1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep\/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F989ad1e6-e443-48c7-8a5d-cb7de12e8791_937x1255.jpeg\" width=\"937\" height=\"1255\" data-attrs=\"{&quot;src&quot;:&quot;https:\/\/substack-post-media.s3.amazonaws.com\/public\/images\/989ad1e6-e443-48c7-8a5d-cb7de12e8791_937x1255.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1255,&quot;width&quot;:937,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}\" alt=\"\" 
srcset=\"https:\/\/substackcdn.com\/image\/fetch\/$s_!h-D1!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep\/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F989ad1e6-e443-48c7-8a5d-cb7de12e8791_937x1255.jpeg 424w, https:\/\/substackcdn.com\/image\/fetch\/$s_!h-D1!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep\/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F989ad1e6-e443-48c7-8a5d-cb7de12e8791_937x1255.jpeg 848w, https:\/\/substackcdn.com\/image\/fetch\/$s_!h-D1!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep\/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F989ad1e6-e443-48c7-8a5d-cb7de12e8791_937x1255.jpeg 1272w, https:\/\/substackcdn.com\/image\/fetch\/$s_!h-D1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep\/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F989ad1e6-e443-48c7-8a5d-cb7de12e8791_937x1255.jpeg 1456w\" sizes=\"auto, 100vw\" loading=\"lazy\" class=\"sizing-normal\"\/><\/picture><\/div>\n<p><\/a><\/figure>\n<\/div>\n<p><em>AI\u2019s judgment (sometimes called \u201ctaste\u201d) likely improves with more compute, raising hopes that it will scale to unprecedented heights in the coming years and soon surpass human judgment. (Nano Banana Pro)<\/em><\/p>\n<p><span>Following up on the previous news item, there are signs that AI is improving at <\/span><a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/www.uxtigers.com\/post\/10-heuristics-reimagined\" rel=\"\">heuristic evaluation<\/a><span>, which relies heavily on judgment based on vague criteria. 
I have previously discussed the <\/span><a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/www.uxtigers.com\/post\/usability-scaling-law\" rel=\"\">potential for a usability scaling law<\/a><span>, and even though the new data is insufficient to declare a law yet, I am getting more hopeful.<\/span><\/p>\n<p><span>The Baymard Institute publishes numerous usability guidelines specifically for e-commerce websites, and on January 20, 2026, it <\/span><a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/baymard.com\/premium\/blog\/ai-heuristic-evaluations\" rel=\"\">announced an AI service<\/a><span> that performs heuristic evaluations based on <\/span><strong>154 of these guidelines<\/strong><span> at 95% accuracy, which it claims is comparable to that of human UX experts.<\/span><\/p>\n<p><span>Of more interest is the point that this accuracy level was reached for only <\/span><strong>39 guidelines<\/strong><span> in the previous version of the tool, announced May 20, 2025. Thus, in 8 months, AI\u2019s ability to perform a particular type of heuristic evaluation (according to Baymard\u2019s guidelines, not mine) improved by a factor of 154\/39 = 3.95x. Getting roughly 4 times better in 8 months amounts to two successive doublings, which is the same as <\/span><strong>doubling every 4 months<\/strong><span>, and that matches the current pace of AI improvements for a general set of \u201ceconomically valuable tasks\u201d tracked by METR and <\/span><a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/www.uxtigers.com\/post\/2026-predictions\" rel=\"\">discussed in my 2026 predictions article<\/a><span>.<\/span><\/p>\n<p>To be honest, I had expected AI to improve more slowly at heuristic evaluation than at general knowledge work because usability is so intensely dependent on judgment and contextual understanding. We all know that AI\u2019s skills are \u201cjagged\u201d (i.e., better at some things than others), and it\u2019s still true that AI is much better at programming than at usability. 
But if it actually doubles every 4 months, there\u2019s light at the end of the AI-usability tunnel.<\/p>\n<p>Of course, two data points are not enough to declare a trend, let alone a scaling law, so I encourage other researchers to keep measuring the quality of AI\u2019s performance with the full range of UX methods and processes.<\/p>\n<p><span>As an aside, Baymard has published <\/span><a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/baymard.com\/premium\/guideline-collections\" rel=\"\">769 usability guidelines<\/a><span> for e-commerce sites, meaning that the full set is 5x the number (769\/154) for which its AI currently matches the performance of human UX experts. If the usability scaling law holds up and AI doubles its UX skills every 4 months, a 5x gain takes about 2.3 doublings, so we won\u2019t be able to have AI conduct a full heuristic evaluation of an e-commerce site for almost 10 more months, or approximately until December 1, 2026.<\/span><\/p>\n<p>Let\u2019s say that this happens, and AI is able to handle all of Baymard\u2019s guidelines by the end of 2026. This won\u2019t mean that it will be as good at a general heuristic evaluation of other forms of user interfaces besides e-commerce sites. General heuristic evaluation is more complex than applying a highly specific set of design guidelines. My guess is that it may take one or two more years (that is, until late 2027 or late 2028) before AI has fully cracked the general heuristic evaluation problem.<\/p>\n<p><span>We know from my original research into the heuristic evaluation method that the quality of a heuristic evaluation is highly dependent on the evaluator\u2019s level of usability expertise, and also that what I dubbed \u201cdouble experts\u201d do even better. Double experts are people who are simultaneously experts in usability and in the application domain. This is why you should hire UX professionals with extensive experience in your domain if you want to use design reviews or other heuristic methods alongside user testing. 
User testing also benefits from being done by usability staff with domain knowledge, but this is less critical, since the test participants supply their own domain knowledge if your recruiting screener is good. (Recruiting is step 4 of my <\/span><a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/www.uxtigers.com\/post\/user-testing\" rel=\"\">12-step process for sound usability studies<\/a><span>.)<\/span><\/p>\n<div class=\"captioned-image-container\">\n<figure><a rel=\"nofollow\" target=\"_blank\" target=\"_blank\" href=\"https:\/\/substackcdn.com\/image\/fetch\/$s_!HjnI!,f_auto,q_auto:good,fl_progressive:steep\/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09f644e4-22c0-4cb8-b107-f6cf622209c0_937x1255.jpeg\" data-component-name=\"Image2ToDOM\" rel=\"\" class=\"image-link image2 is-viewable-img can-restack\"><\/p>\n<div class=\"image2-inset can-restack\"><picture><source type=\"image\/webp\" srcset=\"https:\/\/substackcdn.com\/image\/fetch\/$s_!HjnI!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep\/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09f644e4-22c0-4cb8-b107-f6cf622209c0_937x1255.jpeg 424w, https:\/\/substackcdn.com\/image\/fetch\/$s_!HjnI!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep\/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09f644e4-22c0-4cb8-b107-f6cf622209c0_937x1255.jpeg 848w, https:\/\/substackcdn.com\/image\/fetch\/$s_!HjnI!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep\/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09f644e4-22c0-4cb8-b107-f6cf622209c0_937x1255.jpeg 1272w, https:\/\/substackcdn.com\/image\/fetch\/$s_!HjnI!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep\/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09f644e4-22c0-4cb8-b107-f6cf622209c0_937x1255.jpeg 1456w\" sizes=\"100vw\"\/><img decoding=\"async\" 
src=\"https:\/\/substackcdn.com\/image\/fetch\/$s_!HjnI!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep\/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09f644e4-22c0-4cb8-b107-f6cf622209c0_937x1255.jpeg\" width=\"937\" height=\"1255\" data-attrs=\"{&quot;src&quot;:&quot;https:\/\/substack-post-media.s3.amazonaws.com\/public\/images\/09f644e4-22c0-4cb8-b107-f6cf622209c0_937x1255.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1255,&quot;width&quot;:937,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}\" alt=\"\" srcset=\"https:\/\/substackcdn.com\/image\/fetch\/$s_!HjnI!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep\/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09f644e4-22c0-4cb8-b107-f6cf622209c0_937x1255.jpeg 424w, https:\/\/substackcdn.com\/image\/fetch\/$s_!HjnI!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep\/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09f644e4-22c0-4cb8-b107-f6cf622209c0_937x1255.jpeg 848w, https:\/\/substackcdn.com\/image\/fetch\/$s_!HjnI!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep\/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09f644e4-22c0-4cb8-b107-f6cf622209c0_937x1255.jpeg 1272w, https:\/\/substackcdn.com\/image\/fetch\/$s_!HjnI!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep\/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09f644e4-22c0-4cb8-b107-f6cf622209c0_937x1255.jpeg 1456w\" sizes=\"auto, 100vw\" loading=\"lazy\" class=\"sizing-normal\"\/><\/picture><\/div>\n<p><\/a><\/figure>\n<\/div>\n<p><em>AI\u2019s ability to perform usability work may 
follow a scaling law similar to that shown for other forms of knowledge work. For sure, AI heuristic evaluations improved impressively over the last year. (Nano Banana Pro)<\/em><\/p>\n<p><a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/www.uxtigers.com\/post\/2025-images\" rel=\"\">AI-generated images have advanced<\/a><span> to the point where anyone can create thousands of attractive illustrations for a few cents each. However, for many design projects, it\u2019s not enough that the pictures are pretty. They must also be on brand. (I don\u2019t care about this for my own content: I prefer exploring a range of styles since I create for the joy of it, not to build a business. But companies need to consider branding.)<\/span><\/p>\n<p><span>AI is becoming increasingly steerable, and a new <\/span><a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/www.lukew.com\/ff\/entry.asp?2140\" rel=\"\">experiment by Luke Wroblewski<\/a><span> demonstrates its ability to generate brand-consistent assets on demand. Luke has long used a green man with a big, round head as a consistent design element in his articles and presentations. 
I\u2019m sure he drew these illustrations manually in the old days, but now he launched a service called the <\/span><a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/lukew.com\/maker\/\" rel=\"\">LukeW Character Maker,<\/a><span> which draws illustrations in his exact style with AI.<\/span><\/p>\n<div class=\"captioned-image-container\">\n<figure><a rel=\"nofollow\" target=\"_blank\" target=\"_blank\" href=\"https:\/\/substackcdn.com\/image\/fetch\/$s_!l91g!,f_auto,q_auto:good,fl_progressive:steep\/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad271547-37e2-4835-88cd-4ee5a0a4f1c8_937x562.jpeg\" data-component-name=\"Image2ToDOM\" rel=\"\" class=\"image-link image2 is-viewable-img can-restack\"><\/p>\n<div class=\"image2-inset can-restack\"><picture><source type=\"image\/webp\" srcset=\"https:\/\/substackcdn.com\/image\/fetch\/$s_!l91g!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep\/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad271547-37e2-4835-88cd-4ee5a0a4f1c8_937x562.jpeg 424w, https:\/\/substackcdn.com\/image\/fetch\/$s_!l91g!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep\/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad271547-37e2-4835-88cd-4ee5a0a4f1c8_937x562.jpeg 848w, https:\/\/substackcdn.com\/image\/fetch\/$s_!l91g!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep\/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad271547-37e2-4835-88cd-4ee5a0a4f1c8_937x562.jpeg 1272w, https:\/\/substackcdn.com\/image\/fetch\/$s_!l91g!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep\/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad271547-37e2-4835-88cd-4ee5a0a4f1c8_937x562.jpeg 1456w\" sizes=\"100vw\"\/><img decoding=\"async\" 
src=\"https:\/\/substackcdn.com\/image\/fetch\/$s_!l91g!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep\/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad271547-37e2-4835-88cd-4ee5a0a4f1c8_937x562.jpeg\" width=\"937\" height=\"562\" data-attrs=\"{&quot;src&quot;:&quot;https:\/\/substack-post-media.s3.amazonaws.com\/public\/images\/ad271547-37e2-4835-88cd-4ee5a0a4f1c8_937x562.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:562,&quot;width&quot;:937,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}\" alt=\"\" srcset=\"https:\/\/substackcdn.com\/image\/fetch\/$s_!l91g!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep\/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad271547-37e2-4835-88cd-4ee5a0a4f1c8_937x562.jpeg 424w, https:\/\/substackcdn.com\/image\/fetch\/$s_!l91g!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep\/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad271547-37e2-4835-88cd-4ee5a0a4f1c8_937x562.jpeg 848w, https:\/\/substackcdn.com\/image\/fetch\/$s_!l91g!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep\/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad271547-37e2-4835-88cd-4ee5a0a4f1c8_937x562.jpeg 1272w, https:\/\/substackcdn.com\/image\/fetch\/$s_!l91g!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep\/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad271547-37e2-4835-88cd-4ee5a0a4f1c8_937x562.jpeg 1456w\" sizes=\"auto, 100vw\" loading=\"lazy\" class=\"sizing-normal\"\/><\/picture><\/div>\n<p><\/a><\/figure>\n<\/div>\n<p><em>Luke Wroblewski\u2019s green avatar greets my tiger 
mascot in an image I made in a few seconds with the LukeW Character Maker.<\/em><\/p>\n<p>The tool follows a simple process:<\/p>\n<ol>\n<li>\n<p>Asset requests are analyzed and rewritten by a language model that aligns them with the brand style and guidelines.<\/p>\n<\/li>\n<li>\n<p>The rewritten prompt is sent to an image model (I\u2019m guessing Nano Banana Pro) together with several reference images.<\/p>\n<\/li>\n<li>\n<p>The resulting images are subjected to a verification process that analyzes them and rejects them if they do not comply with the brand guidelines. (Luke says that Google ignores the uploaded reference images about 10\u201320% of the time. My experience with Nano Banana Pro\u2019s compliance with reference images is substantially worse.)<\/p>\n<\/li>\n<li>\n<p>If the image fails verification, the process resets to step 2, and the tool tries to generate a new image. Better luck this time!<\/p>\n<\/li>\n<\/ol>\n<p>Step 3 is the most interesting to me, since it requires the AI to exercise judgment over the generated image. For now, it seems to simply check that the image has a green man, but as AI\u2019s design judgment improves with the hoped-for scaling law, it\u2019s easy to imagine it will also score the image\u2019s quality according to a much wider set of criteria. Depending on the circumstances, the AI could restrict itself to delivering final artwork that meets the highest quality standards for important clients, or artwork that is \u201cgood enough\u201d for users on lower-priced subscription levels.<\/p>\n<p><span>I came across a new song-creation service: <\/span><a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/www.mureka.ai\/?utm_source=rewardful&amp;via=f30901\" rel=\"\">Mureka<\/a><span>. 
(This is an affiliate link, so I will get a referral fee if you use it to sign up.)<\/span><\/p>\n<p><span>I have been using Suno for virtually all my songs for at least a year: Suno 4, 4.5, 4.5 Plus, and 5 were the music models for 25 of the 26 songs in <\/span><a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/www.youtube.com\/watch?v=661uhjFhDNQ\" rel=\"\">my 2025 highlights reel<\/a><span>. As you can see from this list, Suno released 4 different models in a single year and has also released several useful UX innovations to facilitate song editing and variations.<\/span><\/p>\n<p><span>However, several AI influencers were impressed with Mureka\u2019s recent upgrade, claiming that it offers richer musical sound. So I gave it a try: <\/span><a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/youtu.be\/9oTuZoZ1WVo\" rel=\"\">Jazz song about my top predictions for 2026, made with Mureka<\/a><span> (YouTube, 7 min.)<\/span><\/p>\n<p><span>I used the same C-pop avatar and lyrics as in my original <\/span><a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/youtu.be\/eNcDPzeZ41Q\" rel=\"\">Suno version of this song<\/a><span> (YouTube, 5 min.), so you can compare the two versions to see which you prefer. (Please let me know in the YouTube comments!)<\/span><\/p>\n<p><span>It\u2019s possible that the influencers who, well, influenced me to try Mureka are right that it delivers richer music. However, on balance, I prefer Suno. One reason is that Suno\u2019s user interface shows greater maturity, especially in editing capability. 
Generative AI is a <\/span><a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/www.uxtigers.com\/post\/ai-uncertainty-ux\" rel=\"\">roll of the dice<\/a><span> (which is why it produces the powerful dopamine hit of <\/span><a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/www.uxtigers.com\/post\/operant-conditioning\" rel=\"\">operant conditioning<\/a><span>), so you rarely get the perfect result in one shot, but the ability to steer variations in a <\/span><a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/www.uxtigers.com\/post\/explore-discover\" rel=\"\">journey through the latent design space<\/a><span> helps you get what you want. Steered revisions and directed editing also promote a stronger feeling of <\/span><a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/www.uxtigers.com\/post\/3-ages-authorship\" rel=\"\">authorship and creative ownership<\/a><span>.<\/span><\/p>\n<p>Regarding the specific Jazz song about my 2026 predictions, Mureka had two weaknesses, regardless of how much you like the lushness of its music: First, I don\u2019t think the singing voice was stable throughout the song. Fast-forward from the first verse to the last, and it feels like two different singers. Second, Mureka inserted an 18-second dance break in the middle of my verse about apprenticeship, breaking the flow of the lyrics. (Dance breaks are fine in music videos and allow creators to showcase their avatars\u2019 dance performances, but should be positioned immediately before or after a chorus and not in the middle of a verse.)<\/p>\n<p>For these reasons, I still prefer Suno and will likely use it for most of my upcoming songs. But since I have now paid for a month of Mureka, I will give it a few more chances. 
If you track what songs I publish in February, you\u2019ll see which model I ended up preferring.<\/p>\n<div class=\"captioned-image-container\">\n<figure><a rel=\"nofollow\" target=\"_blank\" target=\"_blank\" href=\"https:\/\/substackcdn.com\/image\/fetch\/$s_!j2RY!,f_auto,q_auto:good,fl_progressive:steep\/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F90df1cf4-6545-4477-b834-1454859e7aa1_937x1255.jpeg\" data-component-name=\"Image2ToDOM\" rel=\"\" class=\"image-link image2 is-viewable-img can-restack\"><\/p>\n<div class=\"image2-inset can-restack\"><picture><source type=\"image\/webp\" srcset=\"https:\/\/substackcdn.com\/image\/fetch\/$s_!j2RY!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep\/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F90df1cf4-6545-4477-b834-1454859e7aa1_937x1255.jpeg 424w, https:\/\/substackcdn.com\/image\/fetch\/$s_!j2RY!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep\/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F90df1cf4-6545-4477-b834-1454859e7aa1_937x1255.jpeg 848w, https:\/\/substackcdn.com\/image\/fetch\/$s_!j2RY!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep\/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F90df1cf4-6545-4477-b834-1454859e7aa1_937x1255.jpeg 1272w, https:\/\/substackcdn.com\/image\/fetch\/$s_!j2RY!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep\/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F90df1cf4-6545-4477-b834-1454859e7aa1_937x1255.jpeg 1456w\" sizes=\"100vw\"\/><img decoding=\"async\" src=\"https:\/\/substackcdn.com\/image\/fetch\/$s_!j2RY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep\/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F90df1cf4-6545-4477-b834-1454859e7aa1_937x1255.jpeg\" width=\"937\" height=\"1255\" 
data-attrs=\"{&quot;src&quot;:&quot;https:\/\/substack-post-media.s3.amazonaws.com\/public\/images\/90df1cf4-6545-4477-b834-1454859e7aa1_937x1255.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1255,&quot;width&quot;:937,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}\" alt=\"\" srcset=\"https:\/\/substackcdn.com\/image\/fetch\/$s_!j2RY!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep\/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F90df1cf4-6545-4477-b834-1454859e7aa1_937x1255.jpeg 424w, https:\/\/substackcdn.com\/image\/fetch\/$s_!j2RY!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep\/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F90df1cf4-6545-4477-b834-1454859e7aa1_937x1255.jpeg 848w, https:\/\/substackcdn.com\/image\/fetch\/$s_!j2RY!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep\/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F90df1cf4-6545-4477-b834-1454859e7aa1_937x1255.jpeg 1272w, https:\/\/substackcdn.com\/image\/fetch\/$s_!j2RY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep\/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F90df1cf4-6545-4477-b834-1454859e7aa1_937x1255.jpeg 1456w\" sizes=\"auto, 100vw\" loading=\"lazy\" class=\"sizing-normal\"\/><\/picture><\/div>\n<p><\/a><\/figure>\n<\/div>\n<p><em>On balance, I probably prefer Suno for creating songs, but Mureka is a strong contender. (Nano Banana Pro)<\/em><\/p>\n<p>If Mureka can gain traction and paying subscribers, I hope it will add more robust features in the future. 
It would be great to have real competition for Suno, now that Udio has thrown in the towel, abandoning individual creators in favor of kowtowing to corporate music labels. (Even worse, there are signs that Suno may also turn traitor to indie music. The rumors about their possible upcoming releases are worrisome, but of course not proven.)<\/p>\n<p><span>Jakob Nielsen, Ph.D., is a usability pioneer with <\/span><a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/www.uxtigers.com\/post\/41-years-in-ux\" rel=\"\">43 years of experience in UX<\/a><span> and the Founder of <\/span><a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/www.uxtigers.com\/\" rel=\"\">UX Tigers<\/a><span>. He founded the discount usability movement for fast and cheap iterative design, including heuristic evaluation and the <\/span><a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/www.uxtigers.com\/post\/10-heuristics-reimagined\" rel=\"\">10 usability heuristics<\/a><span>. He formulated the eponymous <\/span><a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/www.uxtigers.com\/post\/jakobs-law\" rel=\"\">Jakob\u2019s Law of the Internet User Experience<\/a><span>. He has been named \u201cthe king of usability\u201d by <\/span><em>Internet Magazine<\/em><span>, \u201cthe guru of Web page usability\u201d by <\/span><em>The New York Times<\/em><span>, and \u201cthe next best thing to a true time machine\u201d by <\/span><em>USA Today<\/em><span>.<\/span><\/p>\n<p><span>Previously, Dr. Nielsen was a Sun Microsystems Distinguished Engineer and a Member of Research Staff at Bell Communications Research, the branch of Bell Labs owned by the Regional Bell Operating Companies. 
He is the author of 8 books, including the best-selling <\/span><em>Designing Web Usability: The Practice of Simplicity<\/em><span> (published in 22 languages), the foundational <\/span><em>Usability Engineering<\/em><span> (<\/span><a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/scholar.google.com\/citations?hl=en&amp;user=y5uL3wUAAAAJ\" rel=\"\">29,972 citations in Google Scholar<\/a><span>)<\/span><em>,<\/em><span> and the pioneering<\/span><em> Hypertext and Hypermedia<\/em><span> (published two years before the Web launched).<\/span><\/p>\n<p>Dr. Nielsen holds 79 United States patents, mainly on making the Internet easier to use. He received the Lifetime Achievement Award for Human\u2013Computer Interaction Practice from ACM SIGCHI and was named a \u201cTitan of Human Factors\u201d by the Human Factors and Ergonomics Society.<\/p>\n<p><span>\u00b7 Subscribe to <\/span><a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/jakobnielsenphd.substack.com\/\" rel=\"\">Jakob\u2019s newsletter<\/a><span> to get the full text of new articles emailed to you as soon as they are published.<\/span><\/p>\n<p><span>\u00b7 <\/span><a rel=\"nofollow\" target=\"_blank\" href=\"http:\/\/www.linkedin.com\/comm\/mynetwork\/discovery-see-all?usecase=PEOPLE_FOLLOWS&amp;followMember=jakobnielsenphd\" rel=\"\">Follow Jakob on LinkedIn<\/a><span>.<\/span><\/p>\n<p><span>\u00b7 Read: <\/span><a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/www.uxtigers.com\/post\/41-years-in-ux\" rel=\"\">article about Jakob Nielsen\u2019s career in UX<\/a><\/p>\n<p><span>\u00b7 Watch: <\/span><a rel=\"nofollow\" target=\"_blank\" href=\"https:\/\/www.youtube.com\/watch?v=MPmVa_vKeF4\" rel=\"\">Jakob Nielsen\u2019s first 41 years in UX<\/a><span> (8 min. 
video)<\/span><\/p>\n<\/div>\n<p><em> \u2018 The preceding article may include information circulated by third parties \u2019 <\/em><\/p>\n<p><em> \u2018 Some details of this article were extracted from the following source jakobnielsenphd.substack.com \u2019 <\/em><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Summary: AI judgment may follow a scaling law | Scaling AI\u2019s judgment of usability heuristics | Generating consistent visual design assets with AI | New AI music model: Mureka O2 UX Roundup for February 2, 2026. (Nano Banana Pro) The \u201cbitter lesson\u201d just became more bitter for those humans who still believe in meatware supremacy [&hellip;]<\/p>\n","protected":false},"author":2,"featured_media":2263111,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"om_disable_all_campaigns":false,"_jetpack_memberships_contains_paid_content":false,"jnews-multi-image_gallery":[],"jnews_single_post":[],"jnews_primary_category":[],"jnews_social_meta":[],"footnotes":""},"categories":[25179],"tags":[],"class_list":["post-2263110","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-music"],"jetpack_featured_media_url":"https:\/\/celebrity.land\/en\/wp-content\/uploads\/2026\/02\/UX-Roundup-AI-Judgment-Heuristic-Evaluation-Consistent-Brand.com2Fpublic2Fimages2F0f28c5f8-c922-45ad-9e99-bf900.jpeg","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/celebrity.land\/en\/wp-json\/wp\/v2\/posts\/2263110","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/celebrity.land\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/celebrity.land\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/celebrity.land\/en\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/celebrity.land\/en\/wp-json\/wp\/v2\/comments?post=2263110"}],"version-history":[{"count":1,"href":"https:\/\/celebrity.la
nd\/en\/wp-json\/wp\/v2\/posts\/2263110\/revisions"}],"predecessor-version":[{"id":2263112,"href":"https:\/\/celebrity.land\/en\/wp-json\/wp\/v2\/posts\/2263110\/revisions\/2263112"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/celebrity.land\/en\/wp-json\/wp\/v2\/media\/2263111"}],"wp:attachment":[{"href":"https:\/\/celebrity.land\/en\/wp-json\/wp\/v2\/media?parent=2263110"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/celebrity.land\/en\/wp-json\/wp\/v2\/categories?post=2263110"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/celebrity.land\/en\/wp-json\/wp\/v2\/tags?post=2263110"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}