<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:googleplay="http://www.google.com/schemas/play-podcasts/1.0"><channel><title><![CDATA[Kilo Blog]]></title><description><![CDATA[Kilo is the all-in-one agentic engineering platform for software developers. 
#1 on Open Source Product of the Month on Product Hunt. 3M+ Kilo Coders. 30T+ tokens processed.

Follow us: kilo.ai/social ]]></description><link>https://blog.kilo.ai</link><image><url>https://substackcdn.com/image/fetch/$s_!EOvc!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3a533426-8d50-480b-8659-75d53523270b_720x720.png</url><title>Kilo Blog</title><link>https://blog.kilo.ai</link></image><generator>Substack</generator><lastBuildDate>Sun, 24 May 2026 04:46:18 GMT</lastBuildDate><atom:link href="https://blog.kilo.ai/feed" rel="self" type="application/rss+xml"/><copyright><![CDATA[Kilo Code Inc.]]></copyright><language><![CDATA[en]]></language><webMaster><![CDATA[kilocode@substack.com]]></webMaster><itunes:owner><itunes:email><![CDATA[kilocode@substack.com]]></itunes:email><itunes:name><![CDATA[blog.kilo.ai]]></itunes:name></itunes:owner><itunes:author><![CDATA[blog.kilo.ai]]></itunes:author><googleplay:owner><![CDATA[kilocode@substack.com]]></googleplay:owner><googleplay:email><![CDATA[kilocode@substack.com]]></googleplay:email><googleplay:author><![CDATA[blog.kilo.ai]]></googleplay:author><itunes:block><![CDATA[Yes]]></itunes:block><item><title><![CDATA[Grok Build 0.1 Website Experiments: Round Two]]></title><description><![CDATA[Yesterday, we shared 5 websites we built in Kilo Code with Grok&#8217;s new model: Grok Build 0.1.]]></description><link>https://blog.kilo.ai/p/grok-build-01-website-experiments</link><guid isPermaLink="false">https://blog.kilo.ai/p/grok-build-01-website-experiments</guid><dc:creator><![CDATA[Darko]]></dc:creator><pubDate>Fri, 22 May 2026 17:14:19 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!NJM3!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F106a2718-47f6-4a83-9de2-3923a278c1e1_908x576.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Yesterday, we shared 5 websites we built in <a href="https://kilo.codes/KjwKK4G">Kilo Code</a> with Grok&#8217;s new 
model: <a href="https://x.com/kilocode/status/2057436115175457044">Grok Build 0.1.</a></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!NJM3!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F106a2718-47f6-4a83-9de2-3923a278c1e1_908x576.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!NJM3!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F106a2718-47f6-4a83-9de2-3923a278c1e1_908x576.png 424w, https://substackcdn.com/image/fetch/$s_!NJM3!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F106a2718-47f6-4a83-9de2-3923a278c1e1_908x576.png 848w, https://substackcdn.com/image/fetch/$s_!NJM3!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F106a2718-47f6-4a83-9de2-3923a278c1e1_908x576.png 1272w, https://substackcdn.com/image/fetch/$s_!NJM3!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F106a2718-47f6-4a83-9de2-3923a278c1e1_908x576.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!NJM3!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F106a2718-47f6-4a83-9de2-3923a278c1e1_908x576.png" width="495" height="314.00881057268725" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/106a2718-47f6-4a83-9de2-3923a278c1e1_908x576.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:576,&quot;width&quot;:908,&quot;resizeWidth&quot;:495,&quot;bytes&quot;:77196,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://blog.kilo.ai/i/198842545?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F106a2718-47f6-4a83-9de2-3923a278c1e1_908x576.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!NJM3!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F106a2718-47f6-4a83-9de2-3923a278c1e1_908x576.png 424w, https://substackcdn.com/image/fetch/$s_!NJM3!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F106a2718-47f6-4a83-9de2-3923a278c1e1_908x576.png 848w, https://substackcdn.com/image/fetch/$s_!NJM3!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F106a2718-47f6-4a83-9de2-3923a278c1e1_908x576.png 1272w, https://substackcdn.com/image/fetch/$s_!NJM3!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F106a2718-47f6-4a83-9de2-3923a278c1e1_908x576.png 1456w" sizes="100vw" fetchpriority="high"></picture></div></a></figure></div><p>The model is pretty good at creating websites from scratch, and people seemed to enjoy the examples, so we decided to put together a follow-up post with 8 new examples.</p><p>This time, we tested how well Grok handles interactivity, design, and humor. 
We also looked through the source code to see whether and how it actually worked under the hood.</p><h1>Let&#8217;s play a game of parrots </h1><p>The prompt, total cost, and token spend:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!zV5h!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F537d4366-4881-45e7-aa8f-b11b9d50b286_986x438.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!zV5h!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F537d4366-4881-45e7-aa8f-b11b9d50b286_986x438.png 424w, https://substackcdn.com/image/fetch/$s_!zV5h!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F537d4366-4881-45e7-aa8f-b11b9d50b286_986x438.png 848w, https://substackcdn.com/image/fetch/$s_!zV5h!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F537d4366-4881-45e7-aa8f-b11b9d50b286_986x438.png 1272w, https://substackcdn.com/image/fetch/$s_!zV5h!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F537d4366-4881-45e7-aa8f-b11b9d50b286_986x438.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!zV5h!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F537d4366-4881-45e7-aa8f-b11b9d50b286_986x438.png" width="986" height="438" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/537d4366-4881-45e7-aa8f-b11b9d50b286_986x438.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:438,&quot;width&quot;:986,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:48727,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://blog.kilo.ai/i/198842545?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F537d4366-4881-45e7-aa8f-b11b9d50b286_986x438.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!zV5h!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F537d4366-4881-45e7-aa8f-b11b9d50b286_986x438.png 424w, https://substackcdn.com/image/fetch/$s_!zV5h!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F537d4366-4881-45e7-aa8f-b11b9d50b286_986x438.png 848w, https://substackcdn.com/image/fetch/$s_!zV5h!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F537d4366-4881-45e7-aa8f-b11b9d50b286_986x438.png 1272w, https://substackcdn.com/image/fetch/$s_!zV5h!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F537d4366-4881-45e7-aa8f-b11b9d50b286_986x438.png 1456w" sizes="100vw"></picture></div></a></figure></div><p><strong>The result:</strong> This was a dark-mode, modern website, and the interactivity all held up. 
The parrot animated on click, the speech bubble appeared and faded out cleanly, copy-to-clipboard worked with a graceful fallback for older browsers, and the speech synthesis call fired with pitch and rate tweaked to sound parrot-ish.</p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;62fc111c-4dc1-4869-96d3-e6d6d30a521a&quot;,&quot;duration&quot;:null}"></div><p></p><h1>A website to book a hotel on the Moon</h1><p>The prompt, total cost, and token spend:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!F2xc!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c41f723-3f4a-42d6-a522-a68bded04930_954x430.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!F2xc!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c41f723-3f4a-42d6-a522-a68bded04930_954x430.png 424w, https://substackcdn.com/image/fetch/$s_!F2xc!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c41f723-3f4a-42d6-a522-a68bded04930_954x430.png 848w, https://substackcdn.com/image/fetch/$s_!F2xc!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c41f723-3f4a-42d6-a522-a68bded04930_954x430.png 1272w, https://substackcdn.com/image/fetch/$s_!F2xc!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c41f723-3f4a-42d6-a522-a68bded04930_954x430.png 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!F2xc!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c41f723-3f4a-42d6-a522-a68bded04930_954x430.png" width="954" height="430" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8c41f723-3f4a-42d6-a522-a68bded04930_954x430.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:430,&quot;width&quot;:954,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:42901,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://blog.kilo.ai/i/198842545?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c41f723-3f4a-42d6-a522-a68bded04930_954x430.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!F2xc!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c41f723-3f4a-42d6-a522-a68bded04930_954x430.png 424w, https://substackcdn.com/image/fetch/$s_!F2xc!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c41f723-3f4a-42d6-a522-a68bded04930_954x430.png 848w, https://substackcdn.com/image/fetch/$s_!F2xc!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c41f723-3f4a-42d6-a522-a68bded04930_954x430.png 1272w, https://substackcdn.com/image/fetch/$s_!F2xc!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c41f723-3f4a-42d6-a522-a68bded04930_954x430.png 1456w" sizes="100vw" 
loading="lazy"></picture></div></a></figure></div><p><strong>The result:</strong> We got a functional website. Grok also invented the phrase "view of forever" for the suites section headline. We didn't ask for it; the model decided on its own that a moon hotel needed it. 
The three-stop gradient (purple, blue, cyan) is clipped to the headline text, layered over a dark space-themed background.</p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;f253a8d6-d16a-4697-a952-5937c92ff144&quot;,&quot;duration&quot;:null}"></div><p><em><a href="https://kilo.codes/KjwKK4G">Try this prompt + the rest in Kilo Code</a></em></p><h1>An interactive Pomodoro timer </h1><p>The prompt, total cost, and token spend:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!4cMa!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98b90558-729b-4498-8b30-e57b6a927ce8_976x494.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!4cMa!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98b90558-729b-4498-8b30-e57b6a927ce8_976x494.png 424w, https://substackcdn.com/image/fetch/$s_!4cMa!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98b90558-729b-4498-8b30-e57b6a927ce8_976x494.png 848w, https://substackcdn.com/image/fetch/$s_!4cMa!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98b90558-729b-4498-8b30-e57b6a927ce8_976x494.png 1272w, https://substackcdn.com/image/fetch/$s_!4cMa!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98b90558-729b-4498-8b30-e57b6a927ce8_976x494.png 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!4cMa!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98b90558-729b-4498-8b30-e57b6a927ce8_976x494.png" width="976" height="494" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/98b90558-729b-4498-8b30-e57b6a927ce8_976x494.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:494,&quot;width&quot;:976,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:56890,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://blog.kilo.ai/i/198842545?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98b90558-729b-4498-8b30-e57b6a927ce8_976x494.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!4cMa!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98b90558-729b-4498-8b30-e57b6a927ce8_976x494.png 424w, https://substackcdn.com/image/fetch/$s_!4cMa!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98b90558-729b-4498-8b30-e57b6a927ce8_976x494.png 848w, https://substackcdn.com/image/fetch/$s_!4cMa!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98b90558-729b-4498-8b30-e57b6a927ce8_976x494.png 1272w, https://substackcdn.com/image/fetch/$s_!4cMa!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98b90558-729b-4498-8b30-e57b6a927ce8_976x494.png 1456w" sizes="100vw" 
loading="lazy"></picture></div></a></figure></div><p><strong>The result:</strong> The timer behaved correctly through every state transition. The Pomodoro logic handled the full loop: start, pause, resume, reset, completion chime, and auto-advance to the next phase. 
The code also schedules a long break after every fourth focus session.</p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;3a53e29c-0bbc-4074-b8ab-80dafd6abfe9&quot;,&quot;duration&quot;:null}"></div><h1>Beautify a government website</h1><p>The prompt, total cost, and token spend:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!QryW!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9be03d5a-9f66-4cde-bacb-c63126adae19_952x424.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!QryW!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9be03d5a-9f66-4cde-bacb-c63126adae19_952x424.png 424w, https://substackcdn.com/image/fetch/$s_!QryW!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9be03d5a-9f66-4cde-bacb-c63126adae19_952x424.png 848w, https://substackcdn.com/image/fetch/$s_!QryW!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9be03d5a-9f66-4cde-bacb-c63126adae19_952x424.png 1272w, https://substackcdn.com/image/fetch/$s_!QryW!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9be03d5a-9f66-4cde-bacb-c63126adae19_952x424.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!QryW!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9be03d5a-9f66-4cde-bacb-c63126adae19_952x424.png" width="952" height="424" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9be03d5a-9f66-4cde-bacb-c63126adae19_952x424.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:424,&quot;width&quot;:952,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:45227,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://blog.kilo.ai/i/198842545?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9be03d5a-9f66-4cde-bacb-c63126adae19_952x424.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!QryW!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9be03d5a-9f66-4cde-bacb-c63126adae19_952x424.png 424w, https://substackcdn.com/image/fetch/$s_!QryW!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9be03d5a-9f66-4cde-bacb-c63126adae19_952x424.png 848w, https://substackcdn.com/image/fetch/$s_!QryW!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9be03d5a-9f66-4cde-bacb-c63126adae19_952x424.png 1272w, https://substackcdn.com/image/fetch/$s_!QryW!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9be03d5a-9f66-4cde-bacb-c63126adae19_952x424.png 1456w" sizes="100vw" loading="lazy"></picture></div></a></figure></div><p><strong>The result:</strong> The multi-step workflow worked end to end:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!EV-P!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59b615c7-b779-4fe8-9505-d200f09cd474_1610x1400.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!EV-P!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59b615c7-b779-4fe8-9505-d200f09cd474_1610x1400.png 424w, 
https://substackcdn.com/image/fetch/$s_!EV-P!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59b615c7-b779-4fe8-9505-d200f09cd474_1610x1400.png 848w, https://substackcdn.com/image/fetch/$s_!EV-P!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59b615c7-b779-4fe8-9505-d200f09cd474_1610x1400.png 1272w, https://substackcdn.com/image/fetch/$s_!EV-P!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59b615c7-b779-4fe8-9505-d200f09cd474_1610x1400.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!EV-P!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59b615c7-b779-4fe8-9505-d200f09cd474_1610x1400.png" width="1456" height="1266" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/59b615c7-b779-4fe8-9505-d200f09cd474_1610x1400.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1266,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:164602,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://blog.kilo.ai/i/198842545?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59b615c7-b779-4fe8-9505-d200f09cd474_1610x1400.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" 
srcset="https://substackcdn.com/image/fetch/$s_!EV-P!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59b615c7-b779-4fe8-9505-d200f09cd474_1610x1400.png 424w, https://substackcdn.com/image/fetch/$s_!EV-P!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59b615c7-b779-4fe8-9505-d200f09cd474_1610x1400.png 848w, https://substackcdn.com/image/fetch/$s_!EV-P!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59b615c7-b779-4fe8-9505-d200f09cd474_1610x1400.png 1272w, https://substackcdn.com/image/fetch/$s_!EV-P!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59b615c7-b779-4fe8-9505-d200f09cd474_1610x1400.png 1456w" sizes="100vw" loading="lazy"></picture></div></a></figure></div><p>The five-step form swapped sections cleanly, the progress bar advanced with the step count, and the Continue button correctly switched to "Submit Application" and changed color on the last screen.</p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;99adbfbf-b5a9-405d-8586-704b1f46accd&quot;,&quot;duration&quot;:null}"></div><h1>Experimenting with old-school interactivity</h1><p>The prompt, total cost, and token spend:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Wadm!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F19891ed4-fe94-40f6-9770-077e04045f4a_974x434.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Wadm!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F19891ed4-fe94-40f6-9770-077e04045f4a_974x434.png 424w, https://substackcdn.com/image/fetch/$s_!Wadm!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F19891ed4-fe94-40f6-9770-077e04045f4a_974x434.png 848w, https://substackcdn.com/image/fetch/$s_!Wadm!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F19891ed4-fe94-40f6-9770-077e04045f4a_974x434.png 1272w, 
https://substackcdn.com/image/fetch/$s_!Wadm!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F19891ed4-fe94-40f6-9770-077e04045f4a_974x434.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Wadm!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F19891ed4-fe94-40f6-9770-077e04045f4a_974x434.png" width="974" height="434" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/19891ed4-fe94-40f6-9770-077e04045f4a_974x434.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:434,&quot;width&quot;:974,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:49858,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://blog.kilo.ai/i/198842545?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F19891ed4-fe94-40f6-9770-077e04045f4a_974x434.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Wadm!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F19891ed4-fe94-40f6-9770-077e04045f4a_974x434.png 424w, https://substackcdn.com/image/fetch/$s_!Wadm!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F19891ed4-fe94-40f6-9770-077e04045f4a_974x434.png 848w, https://substackcdn.com/image/fetch/$s_!Wadm!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F19891ed4-fe94-40f6-9770-077e04045f4a_974x434.png 1272w, 
https://substackcdn.com/image/fetch/$s_!Wadm!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F19891ed4-fe94-40f6-9770-077e04045f4a_974x434.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>The result: </strong>We got the look we wanted: every interactive piece behaved like a small simulation rather than something static. 
The CRT effect is doing real work: phosphor glow via text-shadow, repeating scanlines, a sweeping highlight band running top-to-bottom on a CSS animation, a subtle flicker on opacity, and a radial vignette layered on top to fake the screen curvature.</p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;c8deae24-8a33-4b6b-8a06-637f6b460a2b&quot;,&quot;duration&quot;:null}"></div><p><a href="https://kilo.codes/KjwKK4G">Try this prompt + the rest in Kilo Code</a></p><h1>A GitHub profile page for Shakespeare </h1><p>The prompt, total cost, and token spend:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!geYa!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdab14715-84ce-40d5-8ed8-40cb7d9d2c15_960x438.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!geYa!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdab14715-84ce-40d5-8ed8-40cb7d9d2c15_960x438.png 424w, https://substackcdn.com/image/fetch/$s_!geYa!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdab14715-84ce-40d5-8ed8-40cb7d9d2c15_960x438.png 848w, https://substackcdn.com/image/fetch/$s_!geYa!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdab14715-84ce-40d5-8ed8-40cb7d9d2c15_960x438.png 1272w, https://substackcdn.com/image/fetch/$s_!geYa!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdab14715-84ce-40d5-8ed8-40cb7d9d2c15_960x438.png 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!geYa!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdab14715-84ce-40d5-8ed8-40cb7d9d2c15_960x438.png" width="960" height="438" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/dab14715-84ce-40d5-8ed8-40cb7d9d2c15_960x438.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:438,&quot;width&quot;:960,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:44281,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://blog.kilo.ai/i/198842545?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdab14715-84ce-40d5-8ed8-40cb7d9d2c15_960x438.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!geYa!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdab14715-84ce-40d5-8ed8-40cb7d9d2c15_960x438.png 424w, https://substackcdn.com/image/fetch/$s_!geYa!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdab14715-84ce-40d5-8ed8-40cb7d9d2c15_960x438.png 848w, https://substackcdn.com/image/fetch/$s_!geYa!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdab14715-84ce-40d5-8ed8-40cb7d9d2c15_960x438.png 1272w, https://substackcdn.com/image/fetch/$s_!geYa!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdab14715-84ce-40d5-8ed8-40cb7d9d2c15_960x438.png 1456w" 
sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption"></figcaption></figure></div><p><strong>The result: </strong>This looked like an actual GitHub page. Follow toggled between the green Follow state and the gray Following state, and the follower count incremented and decremented to match. 
The sponsor flow opened a modal, let you pick a tier (Bronze Quill, Silver Sonnet, Gold Globe), and swapped its own contents to a thank-you screen on submit.</p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;f5f94874-bc62-4fe5-a7e2-bd6316a6651f&quot;,&quot;duration&quot;:null}"></div><h1>Is it raining in Tokyo right now?</h1><p>The prompt, total cost, and token spend:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!USEh!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa514e829-c81d-4e0c-97e3-4c0842e4c908_952x434.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!USEh!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa514e829-c81d-4e0c-97e3-4c0842e4c908_952x434.png 424w, https://substackcdn.com/image/fetch/$s_!USEh!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa514e829-c81d-4e0c-97e3-4c0842e4c908_952x434.png 848w, https://substackcdn.com/image/fetch/$s_!USEh!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa514e829-c81d-4e0c-97e3-4c0842e4c908_952x434.png 1272w, https://substackcdn.com/image/fetch/$s_!USEh!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa514e829-c81d-4e0c-97e3-4c0842e4c908_952x434.png 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!USEh!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa514e829-c81d-4e0c-97e3-4c0842e4c908_952x434.png" width="727" height="331.4264705882353" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a514e829-c81d-4e0c-97e3-4c0842e4c908_952x434.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:434,&quot;width&quot;:952,&quot;resizeWidth&quot;:727,&quot;bytes&quot;:46389,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://blog.kilo.ai/i/198842545?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa514e829-c81d-4e0c-97e3-4c0842e4c908_952x434.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!USEh!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa514e829-c81d-4e0c-97e3-4c0842e4c908_952x434.png 424w, https://substackcdn.com/image/fetch/$s_!USEh!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa514e829-c81d-4e0c-97e3-4c0842e4c908_952x434.png 848w, https://substackcdn.com/image/fetch/$s_!USEh!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa514e829-c81d-4e0c-97e3-4c0842e4c908_952x434.png 1272w, https://substackcdn.com/image/fetch/$s_!USEh!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa514e829-c81d-4e0c-97e3-4c0842e4c908_952x434.png 1456w" sizes="100vw" 
loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>The result:</strong> We were aiming for functionality here, not design (although the site looked quite decent). The live data fetch worked. 
The app hit Open-Meteo's free forecast endpoint with the right Tokyo coordinates and a sensible parameter list (weather code, temperature, precipitation, rain, humidity, cloud cover, wind, timezone set to Asia/Tokyo), then mapped the WMO code into three states: heavy rain, light rain, and no rain, with separate icons for each.</p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;d39d9d77-63e8-41de-8023-bcfebc8b51db&quot;,&quot;duration&quot;:null}"></div><h1>A payment dashboard for a lemonade stand</h1><p>The prompt, total cost, and token spend:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!gFpC!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84f7c6e3-5cbb-4a62-9057-2d779a0e700c_966x432.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!gFpC!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84f7c6e3-5cbb-4a62-9057-2d779a0e700c_966x432.png 424w, https://substackcdn.com/image/fetch/$s_!gFpC!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84f7c6e3-5cbb-4a62-9057-2d779a0e700c_966x432.png 848w, https://substackcdn.com/image/fetch/$s_!gFpC!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84f7c6e3-5cbb-4a62-9057-2d779a0e700c_966x432.png 1272w, https://substackcdn.com/image/fetch/$s_!gFpC!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84f7c6e3-5cbb-4a62-9057-2d779a0e700c_966x432.png 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!gFpC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84f7c6e3-5cbb-4a62-9057-2d779a0e700c_966x432.png" width="966" height="432" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/84f7c6e3-5cbb-4a62-9057-2d779a0e700c_966x432.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:432,&quot;width&quot;:966,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:49598,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://blog.kilo.ai/i/198842545?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84f7c6e3-5cbb-4a62-9057-2d779a0e700c_966x432.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!gFpC!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84f7c6e3-5cbb-4a62-9057-2d779a0e700c_966x432.png 424w, https://substackcdn.com/image/fetch/$s_!gFpC!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84f7c6e3-5cbb-4a62-9057-2d779a0e700c_966x432.png 848w, https://substackcdn.com/image/fetch/$s_!gFpC!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84f7c6e3-5cbb-4a62-9057-2d779a0e700c_966x432.png 1272w, https://substackcdn.com/image/fetch/$s_!gFpC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84f7c6e3-5cbb-4a62-9057-2d779a0e700c_966x432.png 1456w" 
sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>The result:</strong> The site looked modern and pretty much every interactive piece held up.</p><p>Sidebar navigation switched sections without reload, the Today / 7 days / 30 days buttons recomputed the metrics and rebuilt the Chart.js line graph on the fly, and the status filter pills on the payments view narrowed the table as expected.</p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;f3cb7091-e6d6-4308-b45f-e94536171cb3&quot;,&quot;duration&quot;:null}"></div><p></p><p>You can run these prompts yourself by 
installing our <a href="https://kilo.codes/KjwKK4G">VS Code extension</a> or using the <a href="https://kilo.codes/GZLcJXX">Kilo CLI</a>. For this round of examples, we used the VS Code extension together with the built-in <a href="https://kilo.codes/kWX4TrX">Agent Manager</a>.</p>]]></content:encoded></item><item><title><![CDATA[The Quiet Arrival of Grok Build 0.1 in a Wild Week for the xAI Empire]]></title><description><![CDATA[The new model is a major leap ahead of Grok Code Fast]]></description><link>https://blog.kilo.ai/p/the-quiet-arrival-of-grok-build-01</link><guid isPermaLink="false">https://blog.kilo.ai/p/the-quiet-arrival-of-grok-build-01</guid><dc:creator><![CDATA[Ari]]></dc:creator><pubDate>Thu, 21 May 2026 16:37:50 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!xWVR!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10fb1a17-d8d1-4852-bec3-146aede40c21_1200x630.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>If you blinked this week, you missed a lot in the AI development space. First, Cursor released a new version of its own model (which the community suspects might be a version of <a href="https://kilo.ai/models/moonshotai-kimi-k2-6">Kimi K2.6</a>). 
Then, SpaceX&#8217;s recent <a href="https://www.nytimes.com/2026/05/20/technology/elon-musk-spacex-ipo.html">IPO filings</a> incidentally revealed that <strong>xAI burned through a staggering $6.4 billion</strong> last year, a sign that its infrastructure spending is far from over.</p><p>But xAI, which is part of SpaceX, didn&#8217;t just make financial headlines this week. It also quietly rolled out <strong><a href="https://kilo.ai/models/xai-grok-build-0-1">Grok Build 0.1</a></strong>.</p><p>So quietly, in fact, that you might have missed it.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!xWVR!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10fb1a17-d8d1-4852-bec3-146aede40c21_1200x630.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!xWVR!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10fb1a17-d8d1-4852-bec3-146aede40c21_1200x630.png 424w, https://substackcdn.com/image/fetch/$s_!xWVR!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10fb1a17-d8d1-4852-bec3-146aede40c21_1200x630.png 848w, https://substackcdn.com/image/fetch/$s_!xWVR!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10fb1a17-d8d1-4852-bec3-146aede40c21_1200x630.png 1272w, https://substackcdn.com/image/fetch/$s_!xWVR!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10fb1a17-d8d1-4852-bec3-146aede40c21_1200x630.png 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!xWVR!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10fb1a17-d8d1-4852-bec3-146aede40c21_1200x630.png" width="1200" height="630" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/10fb1a17-d8d1-4852-bec3-146aede40c21_1200x630.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:630,&quot;width&quot;:1200,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:28912,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://blog.kilo.ai/i/198635814?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10fb1a17-d8d1-4852-bec3-146aede40c21_1200x630.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!xWVR!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10fb1a17-d8d1-4852-bec3-146aede40c21_1200x630.png 424w, https://substackcdn.com/image/fetch/$s_!xWVR!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10fb1a17-d8d1-4852-bec3-146aede40c21_1200x630.png 848w, https://substackcdn.com/image/fetch/$s_!xWVR!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10fb1a17-d8d1-4852-bec3-146aede40c21_1200x630.png 1272w, https://substackcdn.com/image/fetch/$s_!xWVR!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F10fb1a17-d8d1-4852-bec3-146aede40c21_1200x630.png 1456w" sizes="100vw" 
fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>We&#8217;re excited to announce that the <a href="https://kilo.ai/models/xai-grok-build-0-1">new model from xAI</a> is live in Kilo.</strong> Take advantage of our <em>early access</em> to give the new coding model a spin today.</p><h3>Built for Agentic Engineering</h3><p>Designed specifically to tackle complex, autonomous software engineering workflows, Grok Build 0.1 is positioned to rival the best coding models on the market. 
It&#8217;s already live in early access; here is what makes it stand out.</p><p>Unlike standard chat models, Grok Build 0.1 is heavily optimized for <strong>interactive coding agents, tool use, and multi-step development tasks</strong>.</p><p>It is built to reason through a problem, write code, use your terminal, check for errors, and fix its own mistakes in a continuous loop. To support this &#8220;long-horizon&#8221; workflow, Grok Build 0.1 features a <strong>256K context window</strong> (accepting both text and images) and currently has <strong>no text output limits</strong>. Instead of stopping at an 8K token cap and making you type &#8220;continue generating,&#8221; it can carry a large refactor through in a single run.</p><p>It&#8217;s also just ridiculously good at <em>building</em>. Grok Code Fast 1 had the same vibe (good at creating landing pages and product pages in a particular style, from coding to design), but Build seems to take it to the next level with faster iteration and better one-shot results. For example, it built a professional-level landing page in 40 seconds. 
For less than ten cents.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://x.com/kilocode/status/2057436117637447706?s=20" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!79A8!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa4ed3fa2-661e-4ce4-84f8-61057d64bb7c_1110x1330.png 424w, https://substackcdn.com/image/fetch/$s_!79A8!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa4ed3fa2-661e-4ce4-84f8-61057d64bb7c_1110x1330.png 848w, https://substackcdn.com/image/fetch/$s_!79A8!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa4ed3fa2-661e-4ce4-84f8-61057d64bb7c_1110x1330.png 1272w, https://substackcdn.com/image/fetch/$s_!79A8!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa4ed3fa2-661e-4ce4-84f8-61057d64bb7c_1110x1330.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!79A8!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa4ed3fa2-661e-4ce4-84f8-61057d64bb7c_1110x1330.png" width="1110" height="1330" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a4ed3fa2-661e-4ce4-84f8-61057d64bb7c_1110x1330.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1330,&quot;width&quot;:1110,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:559499,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:&quot;https://x.com/kilocode/status/2057436117637447706?s=20&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://blog.kilo.ai/i/198635814?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa4ed3fa2-661e-4ce4-84f8-61057d64bb7c_1110x1330.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!79A8!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa4ed3fa2-661e-4ce4-84f8-61057d64bb7c_1110x1330.png 424w, https://substackcdn.com/image/fetch/$s_!79A8!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa4ed3fa2-661e-4ce4-84f8-61057d64bb7c_1110x1330.png 848w, https://substackcdn.com/image/fetch/$s_!79A8!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa4ed3fa2-661e-4ce4-84f8-61057d64bb7c_1110x1330.png 1272w, https://substackcdn.com/image/fetch/$s_!79A8!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa4ed3fa2-661e-4ce4-84f8-61057d64bb7c_1110x1330.png 1456w" sizes="100vw" loading="lazy"></picture></div></a></figure></div><p>If our early testing is any indication, Build 0.1 is going to be a lot of folks&#8217; new daily driver&#8212;super fast and more cost-effective than similar frontier models.</p><h3>Grok Build 0.1 vs. Grok Code Fast</h3><p>If you&#8217;ve been tracking xAI, you might be wondering how this compares to <strong>Grok Code Fast </strong>(recently deprecated) and other Grok models.
TLDR: Build 0.1 aims to be more cost-effective without sacrificing large-scale coding abilities.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!kMXJ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7f4f4e5d-7da2-4ce5-9706-c4ed4f45269b_1070x390.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!kMXJ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7f4f4e5d-7da2-4ce5-9706-c4ed4f45269b_1070x390.png 424w, https://substackcdn.com/image/fetch/$s_!kMXJ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7f4f4e5d-7da2-4ce5-9706-c4ed4f45269b_1070x390.png 848w, https://substackcdn.com/image/fetch/$s_!kMXJ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7f4f4e5d-7da2-4ce5-9706-c4ed4f45269b_1070x390.png 1272w, https://substackcdn.com/image/fetch/$s_!kMXJ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7f4f4e5d-7da2-4ce5-9706-c4ed4f45269b_1070x390.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!kMXJ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7f4f4e5d-7da2-4ce5-9706-c4ed4f45269b_1070x390.png" width="1070" height="390" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7f4f4e5d-7da2-4ce5-9706-c4ed4f45269b_1070x390.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:390,&quot;width&quot;:1070,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:66334,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://blog.kilo.ai/i/198635814?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7f4f4e5d-7da2-4ce5-9706-c4ed4f45269b_1070x390.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!kMXJ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7f4f4e5d-7da2-4ce5-9706-c4ed4f45269b_1070x390.png 424w, https://substackcdn.com/image/fetch/$s_!kMXJ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7f4f4e5d-7da2-4ce5-9706-c4ed4f45269b_1070x390.png 848w, https://substackcdn.com/image/fetch/$s_!kMXJ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7f4f4e5d-7da2-4ce5-9706-c4ed4f45269b_1070x390.png 1272w, https://substackcdn.com/image/fetch/$s_!kMXJ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7f4f4e5d-7da2-4ce5-9706-c4ed4f45269b_1070x390.png 1456w" sizes="100vw" loading="lazy"></picture></div></a></figure></div><p>When <a href="https://blog.kilo.ai/p/grok-code-free-ride-is-really-over">Grok Code Fast went paid</a>, it was a turning point for the industry. The release of Grok Build 0.1, a different type of model with a similar personality, proves that xAI is still focused on fueling agentic engineering.</p><ul><li><p><strong>Use Case:</strong> Grok Code Fast is your autocomplete and quick-refactor king&#8212;designed for instantaneous, low-latency edits while you type. Grok Build 0.1, on the other hand, is the &#8220;architect.&#8221; You hand it a massive Jira ticket, step back, and let it churn through the codebase.</p></li><li><p><strong>Pricing:</strong> Grok Build 0.1 operates in that highly efficient <a href="https://blog.kilo.ai/p/the-age-of-the-flash-model-gemini">&#8220;Flash&#8221; tier</a> sweet spot.
Priced at just <strong>$1 per million input tokens</strong> and <strong>$2 per million output tokens</strong>, it is built to be economically viable for agentic loops. This is the kind of cost efficiency needed to let agents run wild.</p></li></ul><p>When it comes to keeping costs in line, just remember that there are no text output limits for this model. This is a good thing, to be sure. But you need to watch it carefully once you set it loose in the wild.</p><p>Also note that once your context exceeds 200K tokens (still within the 256K window), the <a href="https://docs.x.ai/developers/models/grok-build-0.1">costs double</a>.</p><h3>Already Live and Climbing the Kilo Leaderboards</h3><p>You don&#8217;t have to wait to try this out. Grok Build 0.1 is already live and available wherever you use Kilo.</p><p>It&#8217;s climbing the <a href="https://kilo.ai/leaderboard">Kilo leaderboard</a> as developers test its massive context window against older flagship models. Because it is in early access, now is the perfect time to plug it into your Kilo environment and take advantage of no output limits to see how much of a codebase it can successfully rewrite in one prompt.</p><p>When you see a $6.4 billion burn rate in an IPO filing, you expect massive swings, and xAI is definitely swinging.
(We should note that OpenAI, which has also started the IPO process, is <a href="https://www.nytimes.com/2026/05/20/technology/openai-ipo.html">projected to spend $115 billion</a> over the next four years, and recent Grok releases do feel like rising GPT competitors.)</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!fQ_w!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F43cacdca-289a-4b13-92e7-caa2d19ab712_732x532.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!fQ_w!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F43cacdca-289a-4b13-92e7-caa2d19ab712_732x532.png 424w, https://substackcdn.com/image/fetch/$s_!fQ_w!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F43cacdca-289a-4b13-92e7-caa2d19ab712_732x532.png 848w, https://substackcdn.com/image/fetch/$s_!fQ_w!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F43cacdca-289a-4b13-92e7-caa2d19ab712_732x532.png 1272w, https://substackcdn.com/image/fetch/$s_!fQ_w!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F43cacdca-289a-4b13-92e7-caa2d19ab712_732x532.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!fQ_w!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F43cacdca-289a-4b13-92e7-caa2d19ab712_732x532.png" width="732" height="532" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/43cacdca-289a-4b13-92e7-caa2d19ab712_732x532.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:532,&quot;width&quot;:732,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:53332,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://blog.kilo.ai/i/198635814?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F43cacdca-289a-4b13-92e7-caa2d19ab712_732x532.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!fQ_w!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F43cacdca-289a-4b13-92e7-caa2d19ab712_732x532.png 424w, https://substackcdn.com/image/fetch/$s_!fQ_w!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F43cacdca-289a-4b13-92e7-caa2d19ab712_732x532.png 848w, https://substackcdn.com/image/fetch/$s_!fQ_w!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F43cacdca-289a-4b13-92e7-caa2d19ab712_732x532.png 1272w, https://substackcdn.com/image/fetch/$s_!fQ_w!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F43cacdca-289a-4b13-92e7-caa2d19ab712_732x532.png 1456w" sizes="100vw" loading="lazy"></picture></div></a><figcaption class="image-caption">Try Grok models in <a href="https://kilo.ai/">Kilo</a> today</figcaption></figure></div><p>With this new model, xAI isn&#8217;t trying to build the smartest conversationalist. Instead, they are trying to build the most relentless, cost-effective robotic software engineer for both front-end and back-end tasks.</p><p>Give <strong><a href="https://kilo.ai/models/xai-grok-build-0-1">Grok Build 0.1</a></strong> a spin in Kilo today and let us know your thoughts!</p>]]></content:encoded></item><item><title><![CDATA[You can’t afford to lead agentic engineering from the sidelines]]></title><description><![CDATA[AI is changing the work faster than most operating models can keep up. 
Leaders need firsthand experience before they start redesigning the process.]]></description><link>https://blog.kilo.ai/p/you-cant-afford-to-lead-agentic-engineering</link><guid isPermaLink="false">https://blog.kilo.ai/p/you-cant-afford-to-lead-agentic-engineering</guid><dc:creator><![CDATA[Jean du Plessis]]></dc:creator><pubDate>Thu, 21 May 2026 08:02:07 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!oMdn!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F28efc932-9e46-4fd2-a86b-55827be3610d_1672x941.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!oMdn!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F28efc932-9e46-4fd2-a86b-55827be3610d_1672x941.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!oMdn!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F28efc932-9e46-4fd2-a86b-55827be3610d_1672x941.jpeg 424w, https://substackcdn.com/image/fetch/$s_!oMdn!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F28efc932-9e46-4fd2-a86b-55827be3610d_1672x941.jpeg 848w, https://substackcdn.com/image/fetch/$s_!oMdn!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F28efc932-9e46-4fd2-a86b-55827be3610d_1672x941.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!oMdn!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F28efc932-9e46-4fd2-a86b-55827be3610d_1672x941.jpeg 1456w" 
sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!oMdn!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F28efc932-9e46-4fd2-a86b-55827be3610d_1672x941.jpeg" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/28efc932-9e46-4fd2-a86b-55827be3610d_1672x941.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:309207,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://blog.kilo.ai/i/198620265?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F28efc932-9e46-4fd2-a86b-55827be3610d_1672x941.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!oMdn!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F28efc932-9e46-4fd2-a86b-55827be3610d_1672x941.jpeg 424w, https://substackcdn.com/image/fetch/$s_!oMdn!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F28efc932-9e46-4fd2-a86b-55827be3610d_1672x941.jpeg 848w, https://substackcdn.com/image/fetch/$s_!oMdn!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F28efc932-9e46-4fd2-a86b-55827be3610d_1672x941.jpeg 1272w, 
https://substackcdn.com/image/fetch/$s_!oMdn!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F28efc932-9e46-4fd2-a86b-55827be3610d_1672x941.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture></div></a></figure></div><p>Late in 2025, leadership had made the call: the company was going all in on AI.</p><p>The CTO&#8217;s vision was straightforward. Engineers would define the work in tickets. Agents would implement it overnight. Engineers would return in the morning to review the output and get the code over the finish line. 
This was the reality facing an engineering manager I was mentoring at another company. That conversation stuck with me.</p><p>It sounded clean in the way plans sound clean before they touch a real codebase, but I felt an immediate resistance to the idea. Not because it was uniquely absurd. By then, this kind of thinking was becoming increasingly common. As the AI hype intensified, many executives felt pressure to show a return, cut costs, or simply prove they were not falling behind.</p><p>What unsettled me was that my own experience told me this wouldn&#8217;t work.</p><p>I had spent much of 2025 in my previous role as Director of Engineering leading the effort to experiment with and adopt AI-assisted engineering. I had seen where these tools helped, where they failed, and how much extra judgment they demanded from team members.</p><p>From that vantage point, the CTO&#8217;s plan did not just seem technically naive. It seemed like a good way to turn engineers into ticket writers and cleanup crew. That kind of rollout can demoralize a team before they have a chance to learn.</p><p>To be clear, the CTO was right to take AI seriously. The shift to agentic engineering is more than another tooling update. The mistake was assuming you can design the operating model before you understand what the work feels like from the engineer&#8217;s side.</p><p>That is the gap I keep coming back to: <strong>leaders are asking engineers to change how they build software before they have felt the change themselves.</strong></p><p>As an engineering leader I am supposed to get out of the weeds. That is part of the job. You lead through systems, managers, planning, metrics, and organizational design. But when the weeds themselves start changing, distance becomes a liability.</p><p>For me the answer was to get my hands back on the work. Not to take over delivery. Not to relive my IC days. But to understand what this change actually demands from the people living through it. 
A secondhand understanding was not going to be enough.</p><h2>The work is changing faster than the operating model</h2><p>Many organizations are moving quickly from &#8220;AI matters&#8221; to &#8220;we need an AI operating model.&#8221; The urgency is understandable. The mistake is pretending the model is already obvious.</p><p>The conversation has moved fast: from broad AI skepticism, to one-prompt vibe coding demos, to coding agents, new harnesses, Ralph-loops, spec-heavy workflows, test-first agent patterns, and whatever practice gets treated as the answer next week. Some of these ideas help. Some help a lot in the right context. But none of them have come close to removing the need for human judgment in software engineering.</p><p>That is what I mean by agentic engineering: not handing the work to a model and hoping it figures things out, but <a href="https://blog.kilo.ai/p/inside-kilo-speed-the-engineer-who">using agents as part of the engineering process</a> while the engineer still owns the outcome. The engineer defines the problem, sets the constraints, steers the work, reviews the result, and decides whether it is actually good enough to ship.</p><p>That shift is real. It is also not settled.</p><p>A workflow that works for a greenfield internal tool may fall apart in a mature distributed system. A team with clear ownership, trusted tests, and fast feedback can move faster with agents. A team with ambiguous product direction and brittle verification loops may only create confusion faster. That is why you cannot import an &#8220;agentic engineering&#8221; playbook and expect it to survive contact with your organization.</p><p>This is where secondhand understanding becomes dangerous. From a distance, it is easy to build overconfident plans around demos, vendor claims, and isolated success stories. Up close, the work is messier. Agents can move quickly and still miss the point. 
They can produce plausible code that <a href="https://blog.kilo.ai/p/cowboy-coder-is-back-this-time-they">shifts the hardest judgment back onto the engineer</a>, and they can make a weak idea look more real than it deserves to be.</p><p>Software engineering was never only about writing code. Half the work is figuring out what should exist in the first place. Agents do not make that disappear. If anything, they make bad judgment cheaper to execute.</p><p>You start seeing half-baked prototypes and vibe-coded solutions move through the organization faster than the organization can decide whether they are valuable. The burden then lands on engineering to sort out what is useful, what is salvageable, and what should never have been built. Just because something can now be produced quickly does not mean the team has gained leverage. Sometimes it has only gained churn.</p><p>AI exposes the bottlenecks your team was already working around. If <a href="https://blog.kilo.ai/p/introducing-code-reviews">code review</a> was barely holding together before, generating more code makes that pain harder to ignore. If CI/CD was slow or unreliable, faster implementation just means more waiting for confidence. If product direction is vague, agents help you create expensive noise faster. And if decisions are gatekept by a few people, the compression AI offers stays out of reach.</p><p>This is also where the old handoffs between Product, Design, and Engineering start to creak. If engineers are going to direct agents, they cannot behave like ticket-takers. They need <a href="https://blog.kilo.ai/p/our-engineers-own-a-number">enough product context</a> to challenge weak assumptions and notice when an implementation is technically plausible but strategically wrong. That means pulling engineers earlier into the problem framing, not just the solution review. 
They need to understand the goal, the non-goals, the constraints, the tradeoffs, and what would make the work wrong even if the code functions.</p><p>At the same time, leaders have to understand the emotional reality of the team. Many engineers are curious and even excited by these tools. But skepticism is not always obstruction. Sometimes it is a rational response to the way AI adoption gets sold: too glibly, too confidently, and often by people who have not had to land the output in a real production codebase.</p><p>Some of the most honest conversations I had with engineers over the last year were not about prompts or workflows. They were about fear. Fear of becoming irrelevant. Fear of watching a hard-earned craft get devalued. Fear of being told to produce dramatically more while the standards they care about become harder to defend.</p><p><strong>Leaders cannot give engineers certainty the industry itself does not have yet.</strong> But they can decide whether people get to work through that uncertainty honestly. If people cannot talk openly about what is not working, they will still notice it. They will just stop telling you.</p><p>That is why leadership credibility matters so much right now. Teams do not need leaders who have memorized the latest AI talking points. They need leaders close enough to the work to know where the leverage is real, where the workflow gets awkward, where the fear is coming from, and where the organization is not ready yet.</p><h2>Leadership credibility now requires getting close to the work</h2><p>This is the part you cannot pick up from a deck. You need enough hands-on time to feel where the workflow gets awkward, where the agent becomes slippery, and where the review burden quietly shifts back onto the engineer. You cannot rely only on past experience when the way engineers plan, implement, and review code is changing.</p><p>You have to roll up your sleeves and get sucked in. Give an agent a real piece of work, not a toy demo. 
Watch it move fast, miss context, invent confidence, and leave you with the uncomfortable job of deciding whether the output is good or merely plausible. There is a specific kind of humility that comes from watching a tool produce 800 lines of code while quietly misunderstanding the problem.</p><p>Spend enough time with these tools and you will eventually find yourself swearing at an LLM in all caps. That frustration is not incidental. It is part of the experience leaders need to understand.</p><p>The point is not to take delivery back. It is to stop leading from theory.</p><p>For engineers to trust you through this change, they need to believe that you understand more than the executive summary. They need to hear you speak authentically about the scars: the false starts, the cognitive load, the workflows that seemed elegant until they met a real codebase, and the moments where the tools genuinely changed what felt possible.</p><p>Articles, conference talks, podcasts, and vendor demos can inform you. They cannot substitute for time with the tools, or for the trust you need when asking a team to change how it works.</p><p>That proximity also matters upward. If someone above you expects overnight productivity miracles, firsthand experience gives you something sturdier than vibes to push back with. You can explain what is real, what is hype, where to invest, and where the organization is not ready yet.</p><p>Teams do not need leaders with certainty right now. They need leaders with context. They need leaders whose optimism has been tested by contact with the work.</p><h2>Learn faster before you optimize harder</h2><p>With this much uncertainty, the worst move is pretending you have already found the process. The better move is to build an organization that can learn faster than the tools change.</p><p>That does not mean letting everyone wander off in private. 
One of the easiest ways to waste this moment is to have ten engineers independently discover the same lesson, struggle with the same broken workflow, or find the same useful pattern without anyone else benefiting from it.</p><p><strong>The goal is not early standardization. It is shared learning.</strong></p><p>Create deliberate places for engineers to compare notes: what worked, what failed, what looked promising until it touched the codebase, what saved time, what created more review burden than it was worth. Make the practical details visible. Which model worked for which kind of task? Which harness fits which workflow? Which agent-generated code looked plausible but was painful to land?</p><p>This matters more with AI because the feedback loops are compressed. A bad assumption can turn into a prototype in an afternoon. A vague ticket can become hundreds of lines of code. A weak product idea can look more convincing simply because someone generated a working demo.</p><p><strong>Without fast, shared learning, the organization does not just duplicate effort. It duplicates mistakes.</strong></p><p>So give people low-risk places to experiment, but make the learning part of the work. Use smaller projects, internal tools, or bounded product areas where the blast radius is manageable. Then <a href="https://blog.kilo.ai/p/one-companys-blueprint-for-taking">bring the lessons back into the team</a> through demos, write-ups, office hours, shared channels, or lightweight internal playbooks. Not as commandments. As evidence.</p><p>Share what works, but do not turn it into policy too quickly. The weird differences matter right now. One model may fit one kind of work better. One coding harness may suit one engineer better. One workflow may work beautifully in a clean service and fall apart in a legacy system with weak tests. 
Treat that variation as signal before you standardize it away.</p><p>Right now, I would rather be in an organization that learns quickly than one that standardizes too soon. Premature optimization gives the comforting appearance of control. But you will not design the right process in a strategy session. You and your teams will discover much of it by trying, failing, comparing notes, and slowly turning repeated lessons into practice.</p><h2>What engineering leaders should do now</h2><p>If you lead engineers, stop outsourcing your understanding of this shift.</p><p>Read the articles. Listen to the talks. Stay close to what practitioners are discovering. Then ship something.</p><p>The best thing I did as a leader was spend enough time with the tools to become harder to fool. Not cynical. Just less impressed by the demo version of the story. I could see where the leverage was real, where the workflow got awkward, and where the review burden quietly returned to the engineer. It gave me better questions for the team, better language for pushing back on unrealistic expectations, and more empathy for the frustration engineers were feeling. 
I became less certain in the abstract, but more useful in practice.</p><p>So let the work correct your assumptions, because you cannot afford to lead agentic engineering from the sidelines.</p>]]></content:encoded></item><item><title><![CDATA[The Age of the Flash Model: Gemini 3.5, StepFun, DeepSeek and the Future of Agentic Engineering]]></title><description><![CDATA[Google is playing both offense and defense as it enters the agentic era]]></description><link>https://blog.kilo.ai/p/the-age-of-the-flash-model-gemini</link><guid isPermaLink="false">https://blog.kilo.ai/p/the-age-of-the-flash-model-gemini</guid><dc:creator><![CDATA[Ari]]></dc:creator><pubDate>Wed, 20 May 2026 01:34:11 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!g3cA!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fddbc18d2-ebcd-4fb3-82e2-9b939bb5f0ff_1536x1024.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Android XR glasses. Universal Cart. A new &#8220;Ask YouTube&#8221; feature.</p><p>At the I/O event today, Google made a staggering number of <a href="https://9to5google.com/2026/05/19/google-io-2026-news/">announcements</a>. But the biggest news was beneath the headlines: Google&#8217;s release of a new AI model, <a href="https://kilo.ai/models/google-gemini-3-5-flash">Gemini 3.5 Flash</a>, designed for agentic work. We got the new model live in Kilo before I/O 2026 had even ended.</p><p>The <a href="https://deepmind.google/models/gemini/flash/">new model from Google DeepMind</a> is already powering Google Search. 
But is it affordable and reliable enough for everyday use in your favorite coding tools?</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!g3cA!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fddbc18d2-ebcd-4fb3-82e2-9b939bb5f0ff_1536x1024.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!g3cA!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fddbc18d2-ebcd-4fb3-82e2-9b939bb5f0ff_1536x1024.png 424w, https://substackcdn.com/image/fetch/$s_!g3cA!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fddbc18d2-ebcd-4fb3-82e2-9b939bb5f0ff_1536x1024.png 848w, https://substackcdn.com/image/fetch/$s_!g3cA!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fddbc18d2-ebcd-4fb3-82e2-9b939bb5f0ff_1536x1024.png 1272w, https://substackcdn.com/image/fetch/$s_!g3cA!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fddbc18d2-ebcd-4fb3-82e2-9b939bb5f0ff_1536x1024.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!g3cA!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fddbc18d2-ebcd-4fb3-82e2-9b939bb5f0ff_1536x1024.png" width="1456" height="971" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ddbc18d2-ebcd-4fb3-82e2-9b939bb5f0ff_1536x1024.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:971,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:3133180,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://blog.kilo.ai/i/198468542?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fddbc18d2-ebcd-4fb3-82e2-9b939bb5f0ff_1536x1024.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!g3cA!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fddbc18d2-ebcd-4fb3-82e2-9b939bb5f0ff_1536x1024.png 424w, https://substackcdn.com/image/fetch/$s_!g3cA!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fddbc18d2-ebcd-4fb3-82e2-9b939bb5f0ff_1536x1024.png 848w, https://substackcdn.com/image/fetch/$s_!g3cA!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fddbc18d2-ebcd-4fb3-82e2-9b939bb5f0ff_1536x1024.png 1272w, https://substackcdn.com/image/fetch/$s_!g3cA!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fddbc18d2-ebcd-4fb3-82e2-9b939bb5f0ff_1536x1024.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" 
height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>In practice, Gemini 3.5 Flash is around <a href="https://artificialanalysis.ai/articles/gemini-3-5-flash-everything-you-need-to-know">60% more expensive</a> than comparable models like DeepSeek V4 Flash.</strong> But it offers frontier-level performance, reaching an average score of <a href="https://pinchbench.com/model/google/google/gemini-3.5-flash">74.2% on PinchBench</a> in initial runs, similar to <a href="https://pinchbench.com/model/anthropic/anthropic/claude-opus-4.6">Opus 4.6</a>.</p><p>Welcome to the age of the Flash model.</p><p>In the release, Google focused on benchmarks comparing 3.5 Flash to flagship models from other frontier labs like Anthropic and OpenAI. 
But the Kilo developer community is already comparing it more closely to budget-friendly alternatives&#8212;other flash models released recently that are also designed for always-on agentic engineering.</p><p><strong>The release of <a href="https://kilo.ai/models/google-gemini-3-5-flash">Gemini 3.5 Flash</a> comes on the heels of powerful, cost-effective Flash model releases from open source labs like <a href="https://blog.kilo.ai/p/step-35-flash-is-free-in-kilo">StepFun</a> and <a href="https://kilo.ai/models/by/deepseek">DeepSeek</a></strong>. This is only the beginning of the agentic era.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!29Ib!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb56450a-7280-43bb-acd1-6657a2c81df5_1187x889.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!29Ib!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb56450a-7280-43bb-acd1-6657a2c81df5_1187x889.png 424w, https://substackcdn.com/image/fetch/$s_!29Ib!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb56450a-7280-43bb-acd1-6657a2c81df5_1187x889.png 848w, https://substackcdn.com/image/fetch/$s_!29Ib!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb56450a-7280-43bb-acd1-6657a2c81df5_1187x889.png 1272w, https://substackcdn.com/image/fetch/$s_!29Ib!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb56450a-7280-43bb-acd1-6657a2c81df5_1187x889.png 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!29Ib!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb56450a-7280-43bb-acd1-6657a2c81df5_1187x889.png" width="1187" height="889" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/eb56450a-7280-43bb-acd1-6657a2c81df5_1187x889.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:889,&quot;width&quot;:1187,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!29Ib!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb56450a-7280-43bb-acd1-6657a2c81df5_1187x889.png 424w, https://substackcdn.com/image/fetch/$s_!29Ib!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb56450a-7280-43bb-acd1-6657a2c81df5_1187x889.png 848w, https://substackcdn.com/image/fetch/$s_!29Ib!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb56450a-7280-43bb-acd1-6657a2c81df5_1187x889.png 1272w, https://substackcdn.com/image/fetch/$s_!29Ib!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb56450a-7280-43bb-acd1-6657a2c81df5_1187x889.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container 
restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h3>Developer&#8217;s Delight: A Crowded Flash Ecosystem</h3><p>Google has fundamentally changed the pitch for its Flash tier. 
Gemini 3.5 Flash is no longer &#8220;cheaper if you just need basic stuff done on a regular basis.&#8221; It officially beats Gemini 3.1 Pro on most coding and agentic benchmarks, runs roughly 4x faster than comparable frontier models, and boasts a massive <strong>1M-token context window</strong>.</p><p>Google declared that <a href="https://deepmind.google/models/gemini/flash/">the new flash model</a> offers &#8220;advanced reasoning at Flash-level latency and scale,&#8221; and JetBrains has shared that the new model &#8220;improves low reasoning coding performance by 10&#8211;20% compared to the previous Flash generation.&#8221; Our initial tests have confirmed this.</p><p>But the competition for the ultimate developer AI has never been fiercer. The battle is now a race to the bottom in price and a race to the top in context length, and developers have a surplus of incredible options:</p><ul><li><p><strong><a href="https://kilo.ai/models/stepfun-step-3-5-flash">Step 3.5 Flash</a>:</strong> Quietly dominating the agentic coding space recently and currently free to use in Kilo. StepFun has become the go-to model lab for developers running continuous multi-agent loops due to its reliable tool-calling capabilities. Its open-source Flash release has been popular across KiloClaw, cloud agents, and our VS Code extension, which is why it&#8217;s been totally dominating the <a href="https://kilo.ai/leaderboard">Kilo leaderboard</a>.</p></li><li><p><strong>DeepSeek V4 Flash &amp; Pro:</strong> DeepSeek continues to break the price-performance math. <strong>DeepSeek V4 Flash</strong> is an efficiency-optimized Mixture-of-Experts (MoE) model built for sheer speed and high-volume pipelines. For developers needing heavier lifting, <strong>DeepSeek V4 Pro</strong> (packing 1.6T total parameters) steps in for deep reasoning and complex, multi-step agentic coding tasks without losing the ecosystem&#8217;s cost-efficient edge. 
As I <a href="https://blog.kilo.ai/p/deepseek-v4-is-here">wrote back in April</a> (seems like a decade ago!), DeepSeek has adapted to the new micro-model way of seeing the world with this release, opting for Pro and Flash releases instead of a wider base model release.</p></li><li><p><strong>Xiaomi MiMo-V2-Flash:</strong> <a href="https://kilo.ai/models/by/xiaomi">Xiaomi</a> is another Chinese lab that has been setting the Kilo leaderboard on fire lately, with their bigger V2.5 releases as well as <strong>MiMo-V2-Flash</strong>, a specialized 309B MoE model purpose-built for high-throughput inference and loop-based agentic tasks. Using a unique hybrid attention architecture, it is designed specifically for scenarios where a model must continuously write code, execute it, interpret the error, and iterate.</p></li></ul><h3>Flash Models are Here to Stay</h3><p>While tech Twitter argues over these static benchmarks, the real revolution is happening in autonomous workflows. <strong>Flash models are the undeniable future of agentic engineering because of their lower cost, high throughput, and focus on effective tool-calling.</strong></p><p>Traditional coding assistance involves single-turn prompts: you ask for a function, the AI writes it. Agentic engineering, however, involves giving an AI an open-ended goal and letting it plan, write code, run tests, debug errors, and iterate in a continuous loop until the job is done (or until it times out&#8230;but hopefully until the job is done). Previously, executing these tasks with heavy frontier models was financially impossible for daily use. An agent looping through a massive codebase could burn through hundreds of thousands of tokens in minutes. That&#8217;s not token-maxxing, it&#8217;s token-wasting.</p><p>With these new flash models from Google, DeepSeek, and others drastically reducing the cost per million tokens, developers can finally let agents run wild. 
This drop in API costs is democratizing autonomous software development. It allows solo developers and smaller startups to spin up armies of specialized AI agents (one for writing tests, one for refactoring, one for <a href="https://kilo.ai/docs/deploy-secure/security-reviews">security audits</a>) at a speed and price point that makes full-scale agentic engineering a reality.</p><p>Playing offense in the agentic engineering wars means optimizing for the small things. In my opinion, per-agent tool permissions are one of the coolest things about <a href="https://blog.kilo.ai/p/we-are-so-back-human-in-the-loop">our latest VS Code extension</a>. Flash models make it possible to keep those agents both within their guardrails and affordable.</p><h3>But What About Defense?</h3><p>Beyond the Flash tier, where Google is going on the offensive to rival other frontier players, Google also used today&#8217;s I/O event to address the elephant in the room: Anthropic&#8217;s <strong>Mythos</strong>.</p><p>To rival Anthropic&#8217;s enterprise testing, Google pushed <strong>CodeMender</strong> further into the enterprise today. Although Google DeepMind <a href="https://deepmind.google/blog/introducing-codemender-an-ai-agent-for-code-security/">announced CodeMender</a> back in 2025, Google used I/O to release it into wider testing and share <a href="https://cloud.google.com/blog/topics/threat-intelligence/ai-vulnerability-exploitation-initial-access?e=48754805">findings from the Google Threat Intelligence Group (GTIG)</a>.</p><p>With Google pitching CodeMender as the ultimate active enterprise shield, the AI arms race has shifted from generating boilerplate code to autonomous cyber defense.</p><p>Google is playing on all fronts, and Kilo is here to help you play both offense and defense. 
Use Gemini 3.5 Flash as your daily driver for agentic engineering, and any model you&#8217;d like for Code Reviews and Security Reviews.</p><p>Stay strong with the strength of Google. Stay nimble with <a href="https://blog.kilo.ai/p/ai-consulting-wins-when-it-embraces">model freedom</a> ;)</p>]]></content:encoded></item><item><title><![CDATA[Gas Town by Kilo is Here, With Wasteland Built In]]></title><description><![CDATA[Back in March, we announced Gas Town by Kilo and opened the beta waitlist.]]></description><link>https://blog.kilo.ai/p/gas-town-ga</link><guid isPermaLink="false">https://blog.kilo.ai/p/gas-town-ga</guid><dc:creator><![CDATA[Brian Turcotte]]></dc:creator><pubDate>Tue, 19 May 2026 14:05:25 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!AHro!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb447aaba-aa7e-4b4c-8f73-12a6497d5513_1920x1080.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!AHro!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb447aaba-aa7e-4b4c-8f73-12a6497d5513_1920x1080.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!AHro!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb447aaba-aa7e-4b4c-8f73-12a6497d5513_1920x1080.png 424w, https://substackcdn.com/image/fetch/$s_!AHro!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb447aaba-aa7e-4b4c-8f73-12a6497d5513_1920x1080.png 848w, 
https://substackcdn.com/image/fetch/$s_!AHro!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb447aaba-aa7e-4b4c-8f73-12a6497d5513_1920x1080.png 1272w, https://substackcdn.com/image/fetch/$s_!AHro!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb447aaba-aa7e-4b4c-8f73-12a6497d5513_1920x1080.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!AHro!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb447aaba-aa7e-4b4c-8f73-12a6497d5513_1920x1080.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b447aaba-aa7e-4b4c-8f73-12a6497d5513_1920x1080.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:125468,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://blog.kilo.ai/i/198409684?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb447aaba-aa7e-4b4c-8f73-12a6497d5513_1920x1080.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!AHro!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb447aaba-aa7e-4b4c-8f73-12a6497d5513_1920x1080.png 424w, 
https://substackcdn.com/image/fetch/$s_!AHro!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb447aaba-aa7e-4b4c-8f73-12a6497d5513_1920x1080.png 848w, https://substackcdn.com/image/fetch/$s_!AHro!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb447aaba-aa7e-4b4c-8f73-12a6497d5513_1920x1080.png 1272w, https://substackcdn.com/image/fetch/$s_!AHro!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb447aaba-aa7e-4b4c-8f73-12a6497d5513_1920x1080.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" 
y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Back in March, we <a href="https://blog.kilo.ai/p/gas-town-by-kilo?utm_source=publication-search">announced Gas Town by Kilo</a> and opened the beta waitlist. The pitch was simple: take Steve Yegge&#8217;s multi-agent orchestrator, remove the operational overhead of self-hosting it, and run it on Kilo Cloud with the full Kilo Gateway behind it. Managed infrastructure, elastic scaling, 500+ models through a single API, and no tmux wrangling.</p><p>Since then, engineer <a href="https://www.linkedin.com/in/john-fawcett-5a9a9a51/">John Fawcett</a> and the Kilo team have been working closely <a href="https://www.linkedin.com/posts/steveyegge_the-kilo-code-team-reached-out-to-me-and-share-7437230062589874176-FOLg?utm_source=share&amp;utm_medium=member_desktop&amp;rcm=ACoAACnEm68BBM7_w9J7nQ3LKfqRcAg7QAsAVb8">with Steve</a> to make Gas Town accessible to developers everywhere. Today, <a href="https://app.kilo.ai/gastown">Gas Town by Kilo</a> is generally available as the only cloud-hosted version of Steve&#8217;s multi-agent orchestration framework, turning a single developer into an AI-powered engineering team. 
And it&#8217;s launching with built-in support for <a href="https://steve-yegge.medium.com/welcome-to-the-wasteland-a-thousand-gas-towns-a5eb9bc8dc1f">the Wasteland</a>, Steve&#8217;s long-teased vision for federated AI-powered software development, now available as a hosted consumer offering for the first time through Kilo.</p><h2><strong>What is the Wasteland?</strong></h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Ocmu!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcfd45260-6f39-412c-866e-3e3c9cad62dc_1400x563.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Ocmu!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcfd45260-6f39-412c-866e-3e3c9cad62dc_1400x563.webp 424w, https://substackcdn.com/image/fetch/$s_!Ocmu!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcfd45260-6f39-412c-866e-3e3c9cad62dc_1400x563.webp 848w, https://substackcdn.com/image/fetch/$s_!Ocmu!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcfd45260-6f39-412c-866e-3e3c9cad62dc_1400x563.webp 1272w, https://substackcdn.com/image/fetch/$s_!Ocmu!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcfd45260-6f39-412c-866e-3e3c9cad62dc_1400x563.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Ocmu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcfd45260-6f39-412c-866e-3e3c9cad62dc_1400x563.webp" width="1400" height="563" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/cfd45260-6f39-412c-866e-3e3c9cad62dc_1400x563.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:563,&quot;width&quot;:1400,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:190114,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/webp&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://blog.kilo.ai/i/198409684?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcfd45260-6f39-412c-866e-3e3c9cad62dc_1400x563.webp&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Ocmu!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcfd45260-6f39-412c-866e-3e3c9cad62dc_1400x563.webp 424w, https://substackcdn.com/image/fetch/$s_!Ocmu!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcfd45260-6f39-412c-866e-3e3c9cad62dc_1400x563.webp 848w, https://substackcdn.com/image/fetch/$s_!Ocmu!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcfd45260-6f39-412c-866e-3e3c9cad62dc_1400x563.webp 1272w, https://substackcdn.com/image/fetch/$s_!Ocmu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcfd45260-6f39-412c-866e-3e3c9cad62dc_1400x563.webp 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 
20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>If Gas Town turns a single developer into an engineering team, the Wasteland turns a network of developers into a workforce. It&#8217;s a federated system built on Dolt (a SQL database with Git semantics) where participants browse a shared Wanted Board of tasks, claim work, submit evidence of completion, and get stamped by validators who attest to the quality of what was done.</p><p>The stamps aren&#8217;t pass/fail. They&#8217;re multi-dimensional attestations covering quality, reliability, and creativity, each scored independently with confidence levels. Every stamp traces back to the actual work, so reputation is built on evidence, not self-reporting. 
Steve describes it as &#8220;a yearbook rule: you can&#8217;t stamp your own work.&#8221; Your reputation is what other people write about you.</p><p>The whole thing is federated, meaning anyone can run their own Wasteland instance (a team, a company, an open-source project) and your identity and reputation are portable across all of them. Think of it as a professional ledger that you never have to write yourself, because your work writes it for you.</p><h2><strong>What This Means for Gas Town by Kilo Users</strong></h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!aaBH!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc9017b93-0e72-4fe6-826a-10857eb58367_3452x1854.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!aaBH!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc9017b93-0e72-4fe6-826a-10857eb58367_3452x1854.png 424w, https://substackcdn.com/image/fetch/$s_!aaBH!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc9017b93-0e72-4fe6-826a-10857eb58367_3452x1854.png 848w, https://substackcdn.com/image/fetch/$s_!aaBH!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc9017b93-0e72-4fe6-826a-10857eb58367_3452x1854.png 1272w, https://substackcdn.com/image/fetch/$s_!aaBH!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc9017b93-0e72-4fe6-826a-10857eb58367_3452x1854.png 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!aaBH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc9017b93-0e72-4fe6-826a-10857eb58367_3452x1854.png" width="1456" height="782" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c9017b93-0e72-4fe6-826a-10857eb58367_3452x1854.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:782,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:589427,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://blog.kilo.ai/i/198409684?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc9017b93-0e72-4fe6-826a-10857eb58367_3452x1854.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!aaBH!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc9017b93-0e72-4fe6-826a-10857eb58367_3452x1854.png 424w, https://substackcdn.com/image/fetch/$s_!aaBH!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc9017b93-0e72-4fe6-826a-10857eb58367_3452x1854.png 848w, https://substackcdn.com/image/fetch/$s_!aaBH!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc9017b93-0e72-4fe6-826a-10857eb58367_3452x1854.png 1272w, https://substackcdn.com/image/fetch/$s_!aaBH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc9017b93-0e72-4fe6-826a-10857eb58367_3452x1854.png 1456w" 
sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>From the Gas Town dashboard, you&#8217;ll see your Wasteland connection settings, including which fork you&#8217;re connected to and your rig identity.</p><p>Here&#8217;s what the workflow looks like in practice:</p><p>You ask your Mayor to pull the top wanted items from the connected Wasteland. The Mayor shows you what&#8217;s available. You pick a task, and your Gas Town claims it, spins up the right polecats, and starts working. When the work is done, your Mayor submits the completion evidence (a commit, a PR, a link) back to the Wasteland as a DoltHub pull request. 
A validator reviews it, stamps it, and your reputation updates.</p><p>The whole thing happens through the same Gas Town by Kilo interface you&#8217;re already using. You talk to your Mayor, your Mayor coordinates with the Wasteland, and the work flows through the same managed infrastructure that&#8217;s been running your convoys.</p><p>For Wasteland administrators, there&#8217;s a review tab in the dashboard where you can see submitted evidence, inspect completions, and merge or reject PRs. Posting new wanted items to the board is just as straightforward: write up the task, post it, and it shows up on the Wanted Board for any connected rig to claim.</p><h2><strong>Why This Matters on Kilo</strong></h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!yqur!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f53f1d4-bf7a-4399-9360-1445e494448c_3452x1856.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!yqur!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f53f1d4-bf7a-4399-9360-1445e494448c_3452x1856.png 424w, https://substackcdn.com/image/fetch/$s_!yqur!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f53f1d4-bf7a-4399-9360-1445e494448c_3452x1856.png 848w, https://substackcdn.com/image/fetch/$s_!yqur!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f53f1d4-bf7a-4399-9360-1445e494448c_3452x1856.png 1272w, 
https://substackcdn.com/image/fetch/$s_!yqur!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f53f1d4-bf7a-4399-9360-1445e494448c_3452x1856.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!yqur!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f53f1d4-bf7a-4399-9360-1445e494448c_3452x1856.png" width="1456" height="783" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2f53f1d4-bf7a-4399-9360-1445e494448c_3452x1856.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:783,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:515681,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://blog.kilo.ai/i/198409684?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f53f1d4-bf7a-4399-9360-1445e494448c_3452x1856.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!yqur!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f53f1d4-bf7a-4399-9360-1445e494448c_3452x1856.png 424w, https://substackcdn.com/image/fetch/$s_!yqur!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f53f1d4-bf7a-4399-9360-1445e494448c_3452x1856.png 848w, 
https://substackcdn.com/image/fetch/$s_!yqur!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f53f1d4-bf7a-4399-9360-1445e494448c_3452x1856.png 1272w, https://substackcdn.com/image/fetch/$s_!yqur!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f53f1d4-bf7a-4399-9360-1445e494448c_3452x1856.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The Wasteland protocol itself doesn&#8217;t require Gas Town. 
You can participate with just Dolt, a DoltHub account, and a coding agent that knows the schema. But running it through Gas Town by Kilo makes the whole thing a lot easier:</p><p><strong>Your Mayor handles the protocol.</strong> Claiming work, submitting evidence, managing DoltHub PRs, posting wanted items: all of that is handled conversationally through your Mayor rather than through manual CLI operations against Dolt.</p><p><strong>The Gateway handles the models.</strong> Wasteland work can be anything: coding, documentation, design, research. Different tasks benefit from different models. Because Gas Town by Kilo runs on the Kilo Gateway, your polecats can use whatever model fits the job, and you&#8217;re not managing separate API keys or billing for each one.</p><p><strong>The infrastructure is handled for you.</strong> No Gas Town CLI, no Gas City CLI, no Beads CLI, no Dolt CLI, and no Wasteland CLI. Kilo packages all of that into a single managed container so you skip the setup entirely and go straight to working.</p><p><strong>Your reputation builds automatically.</strong> Every stamp you earn feeds into a portable, evidence-backed work history. 
If you&#8217;re already using Gas Town by Kilo to ship faster, the Wasteland adds a layer where that output gets recognized across a federated network.</p><h2><strong>Where This is Heading</strong></h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!_1eg!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F386e3306-ebe9-4b42-8d6b-b9c76d57a7a1_1400x594.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!_1eg!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F386e3306-ebe9-4b42-8d6b-b9c76d57a7a1_1400x594.webp 424w, https://substackcdn.com/image/fetch/$s_!_1eg!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F386e3306-ebe9-4b42-8d6b-b9c76d57a7a1_1400x594.webp 848w, https://substackcdn.com/image/fetch/$s_!_1eg!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F386e3306-ebe9-4b42-8d6b-b9c76d57a7a1_1400x594.webp 1272w, https://substackcdn.com/image/fetch/$s_!_1eg!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F386e3306-ebe9-4b42-8d6b-b9c76d57a7a1_1400x594.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!_1eg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F386e3306-ebe9-4b42-8d6b-b9c76d57a7a1_1400x594.webp" width="1400" height="594" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/386e3306-ebe9-4b42-8d6b-b9c76d57a7a1_1400x594.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:594,&quot;width&quot;:1400,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:128582,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/webp&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://blog.kilo.ai/i/198409684?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F386e3306-ebe9-4b42-8d6b-b9c76d57a7a1_1400x594.webp&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!_1eg!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F386e3306-ebe9-4b42-8d6b-b9c76d57a7a1_1400x594.webp 424w, https://substackcdn.com/image/fetch/$s_!_1eg!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F386e3306-ebe9-4b42-8d6b-b9c76d57a7a1_1400x594.webp 848w, https://substackcdn.com/image/fetch/$s_!_1eg!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F386e3306-ebe9-4b42-8d6b-b9c76d57a7a1_1400x594.webp 1272w, https://substackcdn.com/image/fetch/$s_!_1eg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F386e3306-ebe9-4b42-8d6b-b9c76d57a7a1_1400x594.webp 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" 
viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Steve has been clear that the Wasteland is early, but the trajectory is worth paying attention to. Gas Town already proved that a single developer can operate at the scale of a small engineering team. 
The Wasteland extends that to coordination across developers, projects, and organizations.</p><p>The combination of Gas Town&#8217;s multi-agent orchestration, the Wasteland&#8217;s federated reputation protocol, and Kilo&#8217;s managed infrastructure means independent developers and small teams can participate in large-scale collaborative engineering without the organizational overhead that usually comes with it.</p><h2><strong>Get Started</strong></h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!C0dl!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff7a4d0ae-9570-49bf-8ebf-433b3350dfee_3448x1850.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!C0dl!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff7a4d0ae-9570-49bf-8ebf-433b3350dfee_3448x1850.png 424w, https://substackcdn.com/image/fetch/$s_!C0dl!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff7a4d0ae-9570-49bf-8ebf-433b3350dfee_3448x1850.png 848w, https://substackcdn.com/image/fetch/$s_!C0dl!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff7a4d0ae-9570-49bf-8ebf-433b3350dfee_3448x1850.png 1272w, https://substackcdn.com/image/fetch/$s_!C0dl!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff7a4d0ae-9570-49bf-8ebf-433b3350dfee_3448x1850.png 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!C0dl!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff7a4d0ae-9570-49bf-8ebf-433b3350dfee_3448x1850.png" width="1456" height="781" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f7a4d0ae-9570-49bf-8ebf-433b3350dfee_3448x1850.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:781,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:351880,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://blog.kilo.ai/i/198409684?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff7a4d0ae-9570-49bf-8ebf-433b3350dfee_3448x1850.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!C0dl!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff7a4d0ae-9570-49bf-8ebf-433b3350dfee_3448x1850.png 424w, https://substackcdn.com/image/fetch/$s_!C0dl!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff7a4d0ae-9570-49bf-8ebf-433b3350dfee_3448x1850.png 848w, https://substackcdn.com/image/fetch/$s_!C0dl!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff7a4d0ae-9570-49bf-8ebf-433b3350dfee_3448x1850.png 1272w, https://substackcdn.com/image/fetch/$s_!C0dl!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff7a4d0ae-9570-49bf-8ebf-433b3350dfee_3448x1850.png 1456w" 
sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Gas Town by Kilo is live now. Head to the Kilo Dashboard to spin up your town and connect to a Wasteland:</p><p><strong><a href="https://app.kilo.ai/gastown">Get started with Gas Town by Kilo &#8594;</a></strong></p><p>For the full context on how the Wasteland works, Steve&#8217;s post <a href="https://steve-yegge.medium.com/">&#8220;Welcome to the Wasteland: A Thousand Gas Towns&#8221;</a> covers the protocol, the trust system, and the federated architecture in detail. 
The Gas Town and Wasteland repos are both open on GitHub.</p><p>We said last time that Steve built a coding factory and we gave it plumbing. Now the factories can talk to each other.</p>]]></content:encoded></item><item><title><![CDATA[This Slash Command Was Quietly Costing Me $800 a Day]]></title><description><![CDATA[And on paper, nothing about my work explained it.]]></description><link>https://blog.kilo.ai/p/slash-command</link><guid isPermaLink="false">https://blog.kilo.ai/p/slash-command</guid><dc:creator><![CDATA[Marius Wichtner]]></dc:creator><pubDate>Mon, 18 May 2026 22:10:04 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/6e6a99dc-b6d9-41e3-85f9-b55ebb5c15ea_2752x1536.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!clIc!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffc9cd97c-f2a1-454b-a0dc-f726895912af_2752x1536.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!clIc!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffc9cd97c-f2a1-454b-a0dc-f726895912af_2752x1536.png 424w, https://substackcdn.com/image/fetch/$s_!clIc!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffc9cd97c-f2a1-454b-a0dc-f726895912af_2752x1536.png 848w, https://substackcdn.com/image/fetch/$s_!clIc!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffc9cd97c-f2a1-454b-a0dc-f726895912af_2752x1536.png 1272w, 
https://substackcdn.com/image/fetch/$s_!clIc!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffc9cd97c-f2a1-454b-a0dc-f726895912af_2752x1536.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!clIc!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffc9cd97c-f2a1-454b-a0dc-f726895912af_2752x1536.png" width="1456" height="813" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/fc9cd97c-f2a1-454b-a0dc-f726895912af_2752x1536.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:813,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:8032497,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://blog.kilo.ai/i/198325937?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffc9cd97c-f2a1-454b-a0dc-f726895912af_2752x1536.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!clIc!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffc9cd97c-f2a1-454b-a0dc-f726895912af_2752x1536.png 424w, https://substackcdn.com/image/fetch/$s_!clIc!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffc9cd97c-f2a1-454b-a0dc-f726895912af_2752x1536.png 848w, 
https://substackcdn.com/image/fetch/$s_!clIc!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffc9cd97c-f2a1-454b-a0dc-f726895912af_2752x1536.png 1272w, https://substackcdn.com/image/fetch/$s_!clIc!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffc9cd97c-f2a1-454b-a0dc-f726895912af_2752x1536.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Last month my agent costs crossed $800 in a single day, and on paper nothing about my work explained it. 
I was using the same models on the same projects and running the same workloads I&#8217;d been running for weeks. I was tracking spend at the session level, and every per-session number looked reasonable on its own, which is exactly how a problem hides in plain sight.</p><p>When I finally pulled the full session data and traced the curve, the culprit was a slash command I&#8217;d written myself, something I used every day and had no reason to suspect because each individual call inside it looked cheap.</p><p>The cost was in the shape of the run, not the price of any one piece of it.</p><h2><strong>The workflow</strong></h2><p>The slash command sits at the end of my dev loop. I run /ship, which I prompted into existence with four words: &#8220;commit, push, pr, babysit.&#8221; From there the agent picks up the manual tail of the process I never enjoyed doing.</p><p>It watches the pipeline status, keeps an eye on the Slack triggers tied to the PR, and once the review bots start leaving comments, it works through them and addresses the ones that hold up.</p><p>On paper this is exactly the kind of work an agent should be doing. It&#8217;s repetitive, it&#8217;s interrupt-driven, and most of it is the kind of waiting that a person can technically do but resents doing. 
The trap, as I eventually figured out, lives inside the word &#8220;watches.&#8221;</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Zqgt!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8274718c-3441-42cd-93a4-69467181651f_2048x1103.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Zqgt!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8274718c-3441-42cd-93a4-69467181651f_2048x1103.png 424w, https://substackcdn.com/image/fetch/$s_!Zqgt!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8274718c-3441-42cd-93a4-69467181651f_2048x1103.png 848w, https://substackcdn.com/image/fetch/$s_!Zqgt!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8274718c-3441-42cd-93a4-69467181651f_2048x1103.png 1272w, https://substackcdn.com/image/fetch/$s_!Zqgt!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8274718c-3441-42cd-93a4-69467181651f_2048x1103.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Zqgt!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8274718c-3441-42cd-93a4-69467181651f_2048x1103.png" width="1456" height="784" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8274718c-3441-42cd-93a4-69467181651f_2048x1103.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:784,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Zqgt!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8274718c-3441-42cd-93a4-69467181651f_2048x1103.png 424w, https://substackcdn.com/image/fetch/$s_!Zqgt!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8274718c-3441-42cd-93a4-69467181651f_2048x1103.png 848w, https://substackcdn.com/image/fetch/$s_!Zqgt!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8274718c-3441-42cd-93a4-69467181651f_2048x1103.png 1272w, https://substackcdn.com/image/fetch/$s_!Zqgt!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8274718c-3441-42cd-93a4-69467181651f_2048x1103.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path 
d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h2><strong>How the cost actually accumulates</strong></h2><p>An agent waiting for CI does not wait the way a person does. A person waiting for CI gets coffee. An agent waiting for CI polls, and that distinction is where the bill comes from.</p><p>Every poll is a tool call. Every tool call pulls its output, in my case from the gh CLI, back into the model&#8217;s context. The next poll then runs against a context that already contains the previous poll&#8217;s output. The one after that contains both outputs, and the transcript keeps growing on every iteration. Each loop ends up reading and reasoning over more material than the one before it.</p><p>If the pipeline passes on the first try, none of this matters. You get a handful of polls and the loop ends before the cost has a chance to build. 
If the pipeline fails, retries, and fails again, you&#8217;re suddenly in a loop where each iteration is more expensive than the last, because the context coming into it has been padded by everything the loop has already seen.</p><p>The price of each token hasn&#8217;t changed. You are just buying more and more of them per call as the loop drags on. In my sessions, a single watch loop could triple the total cost relative to what the same work would have cost without one. The damage is in the integral, not the peak: you pay for the sum of every context the loop sends, not just the largest one.</p><h2><strong>What the curves show</strong></h2><p>I pulled the three most expensive sessions I had on file:</p><ul><li><p>Session A: $101.65 across 283 requests, peak context of 917k tokens</p></li><li><p>Session B: $132.96 across 274 requests, peak context of 1.1M tokens</p></li><li><p>Session C: $182.99 across 496 requests, peak context of 1.3M tokens</p></li></ul><p>Session C is the one that made the pattern impossible to miss. /ship (my custom slash command) fired around the 151-minute mark, and from there the session was dominated by a long tail of CI and check log calls stretching across the next eighty minutes.</p><p>Each of those calls is the loop reading the pipeline status one more time. 
The cost curve climbs the whole way, not because the agent is making progress, but because it is paying full price to wait.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!8fDc!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff8750213-0655-4235-abb8-f9ee8dbc786a_2048x977.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!8fDc!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff8750213-0655-4235-abb8-f9ee8dbc786a_2048x977.png 424w, https://substackcdn.com/image/fetch/$s_!8fDc!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff8750213-0655-4235-abb8-f9ee8dbc786a_2048x977.png 848w, https://substackcdn.com/image/fetch/$s_!8fDc!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff8750213-0655-4235-abb8-f9ee8dbc786a_2048x977.png 1272w, https://substackcdn.com/image/fetch/$s_!8fDc!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff8750213-0655-4235-abb8-f9ee8dbc786a_2048x977.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!8fDc!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff8750213-0655-4235-abb8-f9ee8dbc786a_2048x977.png" width="1456" height="695" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f8750213-0655-4235-abb8-f9ee8dbc786a_2048x977.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:695,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!8fDc!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff8750213-0655-4235-abb8-f9ee8dbc786a_2048x977.png 424w, https://substackcdn.com/image/fetch/$s_!8fDc!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff8750213-0655-4235-abb8-f9ee8dbc786a_2048x977.png 848w, https://substackcdn.com/image/fetch/$s_!8fDc!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff8750213-0655-4235-abb8-f9ee8dbc786a_2048x977.png 1272w, https://substackcdn.com/image/fetch/$s_!8fDc!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff8750213-0655-4235-abb8-f9ee8dbc786a_2048x977.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path 
d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>That 496 request count, sitting almost twice as high as the other two sessions, is almost entirely the loop. The other sessions were expensive for their own reasons, but neither of them produced the long, low-grade tail of status checks that look cheap individually and aren&#8217;t.</p><h2><strong>The worst case is Opus plus flaky tests</strong></h2><p>The curve gets uglier when two things line up.</p><p>The first is Claude Opus, which has a prompt cache window of roughly five minutes. Once your polls space out wider than that, or your retries land on intervals that straddle the boundary, the cache stops helping. Cache discounts are meaningful, and losing them on every call inside a long loop adds up faster than you&#8217;d expect.</p><p>The second is flaky tests. When tests fail and retry, the watch loop doesn&#8217;t just keep polling at the same pace. It re-reads more output, re-evaluates state, and then sits there waiting for the next attempt. 
Each retry restarts the cost arc on top of an even larger context.</p><p>What you get is a cost curve that climbs in steps rather than a smooth line. One step per retry, and the height of each step is set by how much transcript the loop has already accumulated by the time the retry hits - a compounding effect that can seriously drain your token budget.</p><h2><strong>What to look for in your own setup</strong></h2><p>I described this to another engineer on the team and he recognized it in his own slash commands inside of a minute. I think the pattern is broadly under-noticed because no single call inside the loop looks expensive on its own. You have to look at the full session curve to see what&#8217;s actually happening, and most people don&#8217;t do that until something has already gone wrong.</p><p>If you have any agent flow that watches something rather than does something, pull up the cost curve for a session where it ran. Look for a sudden slope change tied to a specific tool call. That&#8217;s almost always where the loop began.</p><p>It doesn&#8217;t matter whether you&#8217;re polling a CI pipeline, watching a queue, or monitoring a Slack channel for a response. The mechanics are the same. The fix is to stop letting agents wait in-context: use webhooks, longer poll intervals, or an inexpensive model to summarize state between checks. 
The context window does not care what you are waiting for.</p>]]></content:encoded></item><item><title><![CDATA[We are so back: Human in the Loop in Kilo Code]]></title><description><![CDATA[The v7 rewrite of Kilo Code&#8217;s VS Code extension was ambitious.]]></description><link>https://blog.kilo.ai/p/we-are-so-back-human-in-the-loop</link><guid isPermaLink="false">https://blog.kilo.ai/p/we-are-so-back-human-in-the-loop</guid><dc:creator><![CDATA[Brendan O'Leary]]></dc:creator><pubDate>Fri, 15 May 2026 19:58:51 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!tO5A!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c458ca3-5d6e-4070-9729-11fb5bb40948_1376x768.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>The v7 rewrite of Kilo Code&#8217;s VS Code extension was ambitious. We rebuilt everything: a new architecture based on the OpenCode server, a new UI, and Agent Manager for parallel sessions. It shipped fast, and it shipped rough.</p><p>One piece of feedback came through louder than the rest: developers wanted more control back.</p><p>Not &#8220;turn off the AI&#8221; control. More like &#8220;show me what you&#8217;re about to do before you do it&#8221; control. The kind of oversight that lets you trust an agent because you can verify its work, not because you&#8217;re hoping it got things right.</p><p>We&#8217;ve been shipping fixes steadily since the <a href="https://blog.kilo.ai/p/new-vs-code-week-three">Week Three update</a>. 
Here&#8217;s where things stand.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!tO5A!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c458ca3-5d6e-4070-9729-11fb5bb40948_1376x768.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!tO5A!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c458ca3-5d6e-4070-9729-11fb5bb40948_1376x768.jpeg 424w, https://substackcdn.com/image/fetch/$s_!tO5A!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c458ca3-5d6e-4070-9729-11fb5bb40948_1376x768.jpeg 848w, https://substackcdn.com/image/fetch/$s_!tO5A!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c458ca3-5d6e-4070-9729-11fb5bb40948_1376x768.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!tO5A!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c458ca3-5d6e-4070-9729-11fb5bb40948_1376x768.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!tO5A!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c458ca3-5d6e-4070-9729-11fb5bb40948_1376x768.jpeg" width="1376" height="768" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1c458ca3-5d6e-4070-9729-11fb5bb40948_1376x768.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:768,&quot;width&quot;:1376,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:148425,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://blog.kilo.ai/i/197895360?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c458ca3-5d6e-4070-9729-11fb5bb40948_1376x768.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!tO5A!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c458ca3-5d6e-4070-9729-11fb5bb40948_1376x768.jpeg 424w, https://substackcdn.com/image/fetch/$s_!tO5A!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c458ca3-5d6e-4070-9729-11fb5bb40948_1376x768.jpeg 848w, https://substackcdn.com/image/fetch/$s_!tO5A!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c458ca3-5d6e-4070-9729-11fb5bb40948_1376x768.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!tO5A!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c458ca3-5d6e-4070-9729-11fb5bb40948_1376x768.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" 
height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h2>What&#8217;s Already Shipped</h2><h3>See the Agent&#8217;s Thinking</h3><p>The simplest ask was visibility. When an agent proposes changes, you want to understand why &#8212; not scroll past walls of collapsed reasoning to find the diff.</p><p>Reasoning blocks now <a href="https://github.com/Kilo-Org/kilocode/pull/9704">stay expanded by default</a>. Terminal command blocks <a href="https://github.com/Kilo-Org/kilocode/pull/9708">can collapse</a> to just a header bar so they don&#8217;t dominate the view. 
The context usage graph <a href="https://github.com/Kilo-Org/kilocode/issues/8210">landed in the task header</a> so you can see token consumption turn by turn.</p><p>These sound minor, but in practice they&#8217;re the difference between scrolling past an agent&#8217;s work and actually reading it.</p><h3>Better Diffs Before You Approve</h3><p>The v7 extension initially showed diffs <em>after</em> approval. But our community made it clear that many developers want to review changes before committing to them.</p><p>Patch operations now <a href="https://github.com/Kilo-Org/kilocode/pull/9691">show diffs in the permission prompt</a> before you approve. Edit tool diffs <a href="https://github.com/Kilo-Org/kilocode/pull/9576">open in side-by-side mode by default</a>. Agent Manager diffs <a href="https://github.com/Kilo-Org/kilocode/pull/10055">auto-expand reviewable files</a> while keeping generated or huge files collapsed.</p><p>The <a href="https://github.com/Kilo-Org/kilocode/pull/9897">unified Changes panel</a> was a bigger lift. Instead of two separate views for &#8220;workspace changes&#8221; and &#8220;session changes,&#8221; there&#8217;s now a single panel with a source dropdown. The sidebar badge shows session additions and deletions. If you&#8217;ve disabled snapshots for a repo, it warns you instead of silently showing stale data.</p><p>Shell command output got <a href="https://github.com/Kilo-Org/kilocode/pull/10228">syntax highlighting, copy buttons, and an &#8220;Open in Editor&#8221; action</a> that shows full untruncated output. You can actually read what the agent ran.</p><h3>Permission Flow Fixes</h3><p>The auto-approve toggle <a href="https://github.com/Kilo-Org/kilocode/pull/9816">now persists across VS Code restarts</a>. It shows <a href="https://github.com/Kilo-Org/kilocode/pull/9662">in the session prompt controls</a> so you know what mode you&#8217;re in. 
The settings UI <a href="https://github.com/Kilo-Org/kilocode/pull/9435">got readable section titles</a> in light themes.</p><p>A more subtle fix: permissions were getting stuck when <a href="https://github.com/Kilo-Org/kilocode/pull/9450">multiple panels showed the same prompt</a>, or when a subagent&#8217;s permission got routed to the wrong worktree. Those now surface clear errors instead of leaving buttons permanently disabled.</p><p>Custom agents can have <a href="https://github.com/Kilo-Org/kilocode/pull/9733">per-agent tool permissions</a> configured in the VS Code agent editor. If you want a read-only reviewer or an agent that can&#8217;t touch bash, you can set that up.</p><h3>Checkpoint Improvements</h3><p>Session recovery is more reliable. Timeline, token, context, and cost stats <a href="https://github.com/Kilo-Org/kilocode/pull/10156">stay in sync after restoring a checkpoint</a>. Older revert points <a href="https://github.com/Kilo-Org/kilocode/pull/10129">stay available</a> after you restore an earlier message. The thinking level <a href="https://github.com/Kilo-Org/kilocode/pull/9915">is preserved through compaction</a>.</p><p>Todo items <a href="https://github.com/Kilo-Org/kilocode/pull/9687">show compact update cards</a> when checked off in long lists, so the review surface doesn&#8217;t get cluttered.</p><h2>What&#8217;s Still Coming</h2><p>The core ask &#8212; see exactly what&#8217;s changing <em>before</em> approving it with accurate, focused diffs &#8212; is tracked in <a href="https://github.com/Kilo-Org/kilocode/issues/8415">#8415</a>. 
It&#8217;s not fully resolved yet. We&#8217;ve shipped some of the pieces, like patch diffs in permission prompts and the unified Changes panel, but the long tail of work to make this exactly the experience we want is still in progress.</p><p>Step-level checkpoint restore (<a href="https://github.com/Kilo-Org/kilocode/issues/8378">#8378</a>) would let you see which files changed at each agent step and restore to a specific step within a turn &#8212; not just at the user-prompt level. Useful for recovering from rate limits or interrupted runs mid-turn.</p><p>The diff code blocks still take <a href="https://github.com/Kilo-Org/kilocode/issues/9730">too much vertical space</a>. Long edit blocks push reasoning off-screen. The fix (scrollable containers with a higher threshold before expanding) is agreed on but not shipped.</p><p>And for new users, there&#8217;s an idea to <a href="https://github.com/Kilo-Org/kilocode/issues/8419">ask how they prefer to work</a> &#8212; more hands-on or more agentic &#8212; and configure the extension accordingly from the start.</p><h2>The Philosophy</h2><p>When I joined Kilo, I heard &#8220;agentic engineering&#8221; tossed around a lot. It can sound like &#8220;let the AI do everything.&#8221; That&#8217;s not what it means: you can outsource <em>some</em> of software authoring to AI, but you cannot outsource the <em>thinking</em> behind software engineering.</p><p>The best agents are the ones you can trust because you understand what they&#8217;re doing. The tooling makes oversight easy &#8212; that&#8217;s the point. Review before approve. Diff before commit. Reasoning visible, not hidden.</p><p>The v7 rewrite pushed hard on capability &#8212; Agent Manager, parallel sessions, tighter model integration. Some of that came at the cost of visibility and control. The work over the past few weeks has been about rebalancing. 
The tracking issue for human-in-the-loop improvements (<a href="https://github.com/Kilo-Org/kilocode/issues/8415">#8415</a>) has five active areas, and not all of them are closed. But the direction is clear, and the shipped improvements are real.</p><p>If you downgraded to 5.x because v7 felt too hands-off, this is a good time to check back. The experience is meaningfully different now.</p><div><hr></div><p><em>Following along? The <a href="https://github.com/Kilo-Org/kilocode/issues/8415">HITL tracking issue</a> has the full list. And if you hit rough edges, <a href="https://github.com/Kilo-Org/kilocode/issues/new">file an issue</a> &#8212; the heap-snapshot command we added makes reports actionable.</em></p>]]></content:encoded></item><item><title><![CDATA[The Subsidy Was Never About You]]></title><description><![CDATA[Claude Code&#8217;s pricing change reveals what the Claude subscription was always for]]></description><link>https://blog.kilo.ai/p/the-subsidy-was-never-about-you</link><guid isPermaLink="false">https://blog.kilo.ai/p/the-subsidy-was-never-about-you</guid><dc:creator><![CDATA[Brendan O'Leary]]></dc:creator><pubDate>Fri, 15 May 2026 14:38:49 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!zlE5!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1aae90f8-9082-4828-a654-43a8af0494b7_1024x1024.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>On January 9, 2026, we saw Claude Code <a href="https://github.com/anomalyco/opencode/issues/6930">deploy</a> server-side checks that silently blocked third-party tools from authenticating to Claude subscriptions via OAuth. OpenCode, the open-source terminal coding assistant, stopped working. Users got a blunt error. No announcement. 
No warning.</p><p>On February 19, the <a href="https://www.reddit.com/r/AI_Agents/comments/1sbxshw/omg_anthropic_just_ended_claude_subscriptions_for/">Terms of Service</a> were updated to formalize the ban.</p><p>On April 4, <a href="https://the-decoder.com/anthropic-cuts-off-third-party-tools-like-openclaw-for-claude-subscribers-citing-unsustainable-demand/">OpenClaw and other third-party harnesses were cut off</a> from subscription access entirely. Boris Cherny, head of Claude Code, cited <a href="https://x.com/bcherny/status/2040206441756471399">&#8220;unsustainable demand&#8221;</a>.</p><p>And yesterday, the ClaudeDevs Twitter account <a href="https://x.com/ClaudeDevs/status/2054610152817619388">announced</a> that paid subscriptions will get dedicated monthly credits for programmatic usage (available after the user opts in, and billed at full API rates), while &#8220;interactive&#8221; use through Anthropic&#8217;s own tools stays subsidized.</p><p>The official line is that this simplifies billing. 
The pattern of the last five months tells a different story.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!zlE5!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1aae90f8-9082-4828-a654-43a8af0494b7_1024x1024.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!zlE5!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1aae90f8-9082-4828-a654-43a8af0494b7_1024x1024.jpeg 424w, https://substackcdn.com/image/fetch/$s_!zlE5!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1aae90f8-9082-4828-a654-43a8af0494b7_1024x1024.jpeg 848w, https://substackcdn.com/image/fetch/$s_!zlE5!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1aae90f8-9082-4828-a654-43a8af0494b7_1024x1024.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!zlE5!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1aae90f8-9082-4828-a654-43a8af0494b7_1024x1024.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!zlE5!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1aae90f8-9082-4828-a654-43a8af0494b7_1024x1024.jpeg" width="1024" height="1024" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1aae90f8-9082-4828-a654-43a8af0494b7_1024x1024.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1024,&quot;width&quot;:1024,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:105507,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://blog.kilo.ai/i/197857988?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1aae90f8-9082-4828-a654-43a8af0494b7_1024x1024.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!zlE5!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1aae90f8-9082-4828-a654-43a8af0494b7_1024x1024.jpeg 424w, https://substackcdn.com/image/fetch/$s_!zlE5!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1aae90f8-9082-4828-a654-43a8af0494b7_1024x1024.jpeg 848w, https://substackcdn.com/image/fetch/$s_!zlE5!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1aae90f8-9082-4828-a654-43a8af0494b7_1024x1024.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!zlE5!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1aae90f8-9082-4828-a654-43a8af0494b7_1024x1024.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" 
height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h2>&#8220;Interactive&#8221; Means &#8220;Claude Code&#8217;s Front-End&#8221;</h2><p>Jeremy Howard <a href="https://x.com/jeremyphoward/status/2054682882753597603">called out</a> the framing immediately:</p><blockquote><p>&#8220;This policy redefines the term &#8216;interactive&#8217; to mean &#8216;using an Anthropic front-end&#8217;. If you use claude -p or Agent SDK to do something interactively, it now uses credits, not your subscription limits. So the &#8216;interactive use&#8217; heading saying &#8216;unchanged&#8217; is not accurate.&#8221;</p></blockquote><p>He&#8217;s right. The word &#8220;interactive&#8221; in the announcement may not mean what you&#8217;d expect. 
A human sitting at a terminal, issuing commands through claude -p, responding to Claude&#8217;s output, making decisions&#8212;that&#8217;s &#8220;programmatic&#8221; now. The -p flag is short for --print, Claude Code&#8217;s non-interactive mode, so the label has some technical basis, but opening a regular Claude Code session to do the exact same work is still classified as &#8220;interactive.&#8221;</p><p>The distinction isn&#8217;t about whether a human is in the loop. It&#8217;s about whether Anthropic controls the surface.</p><h2>Follow the Incentives</h2><p>Why would Anthropic subsidize usage through Claude.ai and Claude Code but charge API rates for the same tokens through a third-party tool?</p><p>The capacity argument doesn&#8217;t hold. Third-party harnesses like OpenClaw made the same API calls as Claude Code. The tokens cost the same to run. If the issue were purely cost, raising prices across the board or enforcing stricter rate limits could have been part of the solution. Instead, they&#8217;re treating identical workloads differently based on where the request originates.</p><p>Anthropic&#8217;s <a href="https://privacy.claude.com/en/articles/10023580-is-my-data-used-for-model-training">privacy documentation</a> says consumer plan data can be used for model training if you have Model Improvement enabled. Chat and coding session data includes &#8220;the entire related conversation, along with any content, custom styles or conversation preferences, as well as data collected when using Claude for Chrome.&#8221;</p><p>When you use Claude through Claude Code surfaces, Anthropic can observe your prompts, your corrections, your workflow patterns, and your thumbs up/down feedback.</p><p>The subsidy isn&#8217;t about making AI accessible. 
It&#8217;s about routing valuable interactions&#8212;the kind that generate training signal&#8212;through surfaces they control.</p><h2>The Pattern Was Always There</h2><p>Peter Steinberger, OpenClaw&#8217;s creator, <a href="https://x.com/steipete/status/2040209434019082522">claimed that Anthropic</a> was absorbing features from third-party tools into Claude Code, then shutting out the alternatives. &#8220;Funny how timings match up,&#8221; he commented.</p><p>The timeline supports him:</p><ul><li><p><strong>January 9</strong>: Server-side blocks cut off OpenCode and other harnesses spoofing Claude Code headers</p></li><li><p><strong>Late March</strong>: Policy warnings to third-party tool users about exceeding limits</p></li><li><p><strong>April 4</strong>: Billing-based enforcement begins&#8212;OpenClaw and similar tools lose subscription access entirely</p></li><li><p><strong>May 13</strong>: The pricing split goes official&#8212;interactive surfaces stay subsidized, everything else goes to API rates</p></li></ul><p>Each step was framed as capacity management or billing simplification. But the consistent thread is: third-party tools that bypass the observation layer get progressively less access to the subscription subsidy.</p><h2>The Investor Announcement</h2><p>Jos&#233; Valim, creator of Elixir, <a href="https://x.com/josevalim/status/2054637876311892259">cut through the framing</a>:</p><blockquote><p>&#8220;The reason they made it sound like an upgrade was because the announcement was not for developers. It was for investors and enterprise customers. Impacting devrel is just collateral damage.&#8221;</p></blockquote><p>And:</p><blockquote><p>&#8220;if they can&#8217;t be honest about pricing changes, it is really hard to believe them on anything else&#8221;</p></blockquote><p>Anthropic is <a href="https://www.nytimes.com/2026/05/12/technology/anthropic-funding-950-billion-valuation.html">reportedly in talks</a> to raise funding at a $950 billion valuation. 
CEO Dario Amodei said the company&#8217;s revenue is growing 80x annualized. The metrics that matter for that trajectory are enterprise contracts and revenue per user, not whether indie developers can run overnight automation for $100/month.</p><p>The announcement landed the <a href="https://x.com/OpenAIDevs/status/2054586214112780518">same day</a> OpenAI offered two free months of Codex for enterprise customers who switch.</p><h2>What Developers Actually Lost</h2><p>Reuven Cohen, Agentic Engineer at Cognitum One, <a href="https://www.linkedin.com/feed/update/urn:li:activity:7192076543210496000/">laid out</a> what the old pricing actually covered:</p><blockquote><p>&#8220;The real value was programmatic access. Before this change, the $200 plan had fuzzy limits but absurdly high practical value if you knew how to use it.&#8221;</p></blockquote><p>His list of now-expensive workflows: recursive agent loops, headless automation, CI/CD pipelines, multi-agent swarms. His summary of the new credit system: &#8220;a rate limiter wearing a party hat.&#8221;</p><p><a href="https://x.com/theo/status/2054620998205624746">Theo</a> added: &#8220;Kind of crazy that Anthropic spends more time trying to lock out better apps and harnesses instead of just fixing Claude Code.&#8221;</p><p>Power users built entire systems around the old economics&#8212;automated overnight coding, scheduled tasks, agents that respond to external triggers. All of that now costs 5-10x more. Some are already moving to Codex. Others are eyeing DeepSeek or Kimi.</p><h2>What They Should Have Said</h2><p>Valim proposed the honest version:</p><blockquote><p>&#8220;We&#8217;re moving programmatic use to API pricing. We&#8217;re giving you credits to ease the transition. We expect this to free up capacity, so we&#8217;re doubling interactive limits for 2 months.&#8221;</p></blockquote><p>That&#8217;s a price increase for heavy programmatic users, clearly stated. 
Instead, Lydia Hallie from Anthropic <a href="https://x.com/lydiahallie/status/2054670303834616119">tweeted</a> &#8220;you don&#8217;t pay extra&#8221;&#8212;technically true if you&#8217;ve never used programmatic features before, misleading if you have.</p><h2>My Read</h2><p>The subsidy was never sustainable, and the recent changes are proof that it is ending for programmatic use. This may also be a sign that Anthropic is making up its pricing as it goes.</p><p>From an economic angle, everything that has happened since January makes sense. The OpenCode block, the OpenClaw cutoff, the &#8220;interactive vs. programmatic&#8221; split&#8212;all of it is repricing subsidized usage and pushing programmatic use to market-rate API billing as the subsidy era ends.</p><p>That&#8217;s a legitimate business decision. Companies optimize for what matters to their growth. The problem is calling it a &#8220;simplification&#8221; when it&#8217;s actually a repricing, and defining &#8220;interactive&#8221; as &#8220;our surfaces&#8221; when that&#8217;s not what the word means.</p><p>The pattern is clear: Anthropic is ending the subsidies for programmatic use to improve unit economics, prioritizing the revenue that comes from high-value enterprise and direct use over subsidizing third-party tool developers.</p><p>The developers who built the harness ecosystem&#8212;OpenClaw, Conductor, OpenCode, all of it&#8212;turned Claude Code from a CLI experiment into a serious development environment. Pricing those developers out might improve short-term unit economics. It also means the next generation of tooling gets built on someone else&#8217;s models.</p><div><hr></div><p><em>The change goes live June 15. Subscribers will get an email on June 8 to activate the new credit system. 
If you rely on claude -p or third-party tools, audit your usage now&#8212;$200 in API credits disappears faster than the old subsidized limits.</em></p>]]></content:encoded></item><item><title><![CDATA[Kilo Bet on Cerebras 11 Months Before Wall Street Did]]></title><description><![CDATA[Their $48B IPO just validated the open compute thesis]]></description><link>https://blog.kilo.ai/p/cerebras</link><guid isPermaLink="false">https://blog.kilo.ai/p/cerebras</guid><dc:creator><![CDATA[Brian Turcotte]]></dc:creator><pubDate>Thu, 14 May 2026 17:58:12 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!SEOj!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9d6aa30-e287-4f67-9cf0-42ef6e25e388_2048x1422.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Eleven months ago, Kilo shipped support for Cerebras inference as a first-class provider in the IDE extension. It was <a href="https://github.com/Kilo-Org/kilocode/pull/777">PR #777</a>, it landed in v4.39.0, and at the time it was a quietly opinionated move.</p><p>The story everyone was telling back then was that AI coding was going to consolidate around one or two model providers and, consequently, one kind of compute. Kilo was building toward an alternative assumption: that engineers would want to use a dozen different models routed to a dozen kinds of silicon, picked per task, with no one provider owning the entire stack.</p><p>This week, the market wrote a $48 billion check on that assumption.</p><p>Cerebras priced its IPO at a fully diluted valuation north of $48 billion on an order book reportedly oversubscribed 20 times. It&#8217;s the biggest US listing in close to five years. 
And while the financial pages will spend the rest of the week talking about wafer economics and OpenAI warrants, the more interesting story for anyone who codes for a living is what the IPO actually confirms about where AI compute is going.</p><h2><strong>The thesis under the IPO</strong></h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!SEOj!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9d6aa30-e287-4f67-9cf0-42ef6e25e388_2048x1422.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!SEOj!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9d6aa30-e287-4f67-9cf0-42ef6e25e388_2048x1422.png 424w, https://substackcdn.com/image/fetch/$s_!SEOj!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9d6aa30-e287-4f67-9cf0-42ef6e25e388_2048x1422.png 848w, https://substackcdn.com/image/fetch/$s_!SEOj!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9d6aa30-e287-4f67-9cf0-42ef6e25e388_2048x1422.png 1272w, https://substackcdn.com/image/fetch/$s_!SEOj!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9d6aa30-e287-4f67-9cf0-42ef6e25e388_2048x1422.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!SEOj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9d6aa30-e287-4f67-9cf0-42ef6e25e388_2048x1422.png" width="1456" height="1011" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f9d6aa30-e287-4f67-9cf0-42ef6e25e388_2048x1422.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1011,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!SEOj!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9d6aa30-e287-4f67-9cf0-42ef6e25e388_2048x1422.png 424w, https://substackcdn.com/image/fetch/$s_!SEOj!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9d6aa30-e287-4f67-9cf0-42ef6e25e388_2048x1422.png 848w, https://substackcdn.com/image/fetch/$s_!SEOj!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9d6aa30-e287-4f67-9cf0-42ef6e25e388_2048x1422.png 1272w, https://substackcdn.com/image/fetch/$s_!SEOj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9d6aa30-e287-4f67-9cf0-42ef6e25e388_2048x1422.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" 
xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The reason Cerebras can price like this is that AI compute is splitting into specialized lanes. Training is one job. Memory-hungry agentic work is another. Fast reasoning where someone is waiting on tokens is a third. No single chip wins all three, and engineers who can route the right task to the right silicon are going to outpace engineers locked to one stack. That&#8217;s the bet inside the $48B valuation, and it&#8217;s the same bet Kilo&#8217;s product has been organized around since day one.</p><p>Cerebras specifically owns the fast-reasoning lane. When a model fits on its wafer-scale chip, you get tokens-per-second numbers that change what coding with an AI feels like. A reasoning chain that takes a minute and a half on a GPU can finish in single-digit seconds. 
Anyone who has watched a model &#8220;think&#8221; for 90 seconds before answering a real question knows how big that gap is, especially when it comes to minor implementation tasks and quick revisions in a coding editor.</p><h2><strong>Why Kilo was building toward this all along</strong></h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!2weI!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32545c54-74ab-4358-95d2-66b56c08beef_2048x1095.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!2weI!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32545c54-74ab-4358-95d2-66b56c08beef_2048x1095.png 424w, https://substackcdn.com/image/fetch/$s_!2weI!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32545c54-74ab-4358-95d2-66b56c08beef_2048x1095.png 848w, https://substackcdn.com/image/fetch/$s_!2weI!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32545c54-74ab-4358-95d2-66b56c08beef_2048x1095.png 1272w, https://substackcdn.com/image/fetch/$s_!2weI!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32545c54-74ab-4358-95d2-66b56c08beef_2048x1095.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!2weI!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32545c54-74ab-4358-95d2-66b56c08beef_2048x1095.png" width="1456" height="778" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/32545c54-74ab-4358-95d2-66b56c08beef_2048x1095.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:778,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!2weI!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32545c54-74ab-4358-95d2-66b56c08beef_2048x1095.png 424w, https://substackcdn.com/image/fetch/$s_!2weI!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32545c54-74ab-4358-95d2-66b56c08beef_2048x1095.png 848w, https://substackcdn.com/image/fetch/$s_!2weI!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32545c54-74ab-4358-95d2-66b56c08beef_2048x1095.png 1272w, https://substackcdn.com/image/fetch/$s_!2weI!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32545c54-74ab-4358-95d2-66b56c08beef_2048x1095.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" 
xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Kilo&#8217;s product is shaped by one bet repeated in every direction: openness compounds. Open pricing means you pay the provider&#8217;s actual rate with no markup. Open model selection means 500-plus models behind one dropdown, switchable mid-session. Open source means the harness your agent runs in is something you can read, fork, and improve. None of these are features bolted onto a closed product. They&#8217;re the spine.</p><p>The reason that spine matters is most obvious right now, with Cerebras pricing. If your coding agent is wired to a single provider, the heterogeneous compute future is happening to you, not for you. Every new specialized chip becomes a thing you can&#8217;t use. Every shift in which model is best at which task becomes a vendor negotiation. The whole point of an open agentic platform is that fragmentation is fuel, not friction. 
New silicon arrives, Kilo makes it accessible, and the engineer can move faster.</p><p>This is why Cerebras landed in Kilo eleven months ago and not last week. The same logic that said &#8220;support Cerebras early&#8221; also said support every other model worth running, build a gateway that bills at cost, keep the source open so providers can contribute back. The IPO didn&#8217;t change the strategy; it confirmed that the strategy was reading the market correctly.</p><h2><strong>The collaboration behind the integration</strong></h2><p>It&#8217;s worth saying something about how Cerebras support in Kilo has actually stayed sharp, because it&#8217;s the part of the story that doesn&#8217;t show up in a press release.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!d0ZS!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fae1769be-3e6b-4206-ae03-601e8f597719_2048x1097.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!d0ZS!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fae1769be-3e6b-4206-ae03-601e8f597719_2048x1097.png 424w, https://substackcdn.com/image/fetch/$s_!d0ZS!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fae1769be-3e6b-4206-ae03-601e8f597719_2048x1097.png 848w, https://substackcdn.com/image/fetch/$s_!d0ZS!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fae1769be-3e6b-4206-ae03-601e8f597719_2048x1097.png 1272w, 
https://substackcdn.com/image/fetch/$s_!d0ZS!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fae1769be-3e6b-4206-ae03-601e8f597719_2048x1097.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!d0ZS!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fae1769be-3e6b-4206-ae03-601e8f597719_2048x1097.png" width="1456" height="780" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ae1769be-3e6b-4206-ae03-601e8f597719_2048x1097.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:780,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!d0ZS!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fae1769be-3e6b-4206-ae03-601e8f597719_2048x1097.png 424w, https://substackcdn.com/image/fetch/$s_!d0ZS!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fae1769be-3e6b-4206-ae03-601e8f597719_2048x1097.png 848w, https://substackcdn.com/image/fetch/$s_!d0ZS!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fae1769be-3e6b-4206-ae03-601e8f597719_2048x1097.png 1272w, 
https://substackcdn.com/image/fetch/$s_!d0ZS!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fae1769be-3e6b-4206-ae03-601e8f597719_2048x1097.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>For most of the last year, engineers from Cerebras have been contributing to the Kilo codebase directly. Not in a partnership-announcement way; in a<a href="https://github.com/Kilo-Org/kilocode/pulls?q=is%3Apr+author%3Asebastiand-cerebras"> pull-request</a> way. Updating the model lineup as new ones launch. Tuning parameters. Wiring up integration headers. 
The unglamorous runtime work that determines whether a provider feels first-class or just feels supported.</p><p>A coding model is only as good as the harness behind it. Tool calls, file edits, plan steps, diff review, context management, all the parts that don&#8217;t show up in benchmarks. Cerebras putting hours into making that harness tight inside Kilo, in public, in PRs anyone can read, is what working with an open platform actually looks like. It&#8217;s why a Kilo user hitting Cerebras today is using something that&#8217;s been shaped by the people who built the chip, not just bolted on by us.</p><p>That kind of relationship only happens in the open. A closed agent forces every chip company into a vendor negotiation. An open one lets them ship code.</p><h2><strong>What heterogeneous compute looks like inside the editor</strong></h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!7zfZ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Febcbef88-8339-4fc1-8b2f-63546e33135d_1918x990.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!7zfZ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Febcbef88-8339-4fc1-8b2f-63546e33135d_1918x990.png 424w, https://substackcdn.com/image/fetch/$s_!7zfZ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Febcbef88-8339-4fc1-8b2f-63546e33135d_1918x990.png 848w, https://substackcdn.com/image/fetch/$s_!7zfZ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Febcbef88-8339-4fc1-8b2f-63546e33135d_1918x990.png 1272w, 
https://substackcdn.com/image/fetch/$s_!7zfZ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Febcbef88-8339-4fc1-8b2f-63546e33135d_1918x990.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!7zfZ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Febcbef88-8339-4fc1-8b2f-63546e33135d_1918x990.png" width="1456" height="752" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ebcbef88-8339-4fc1-8b2f-63546e33135d_1918x990.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:752,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!7zfZ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Febcbef88-8339-4fc1-8b2f-63546e33135d_1918x990.png 424w, https://substackcdn.com/image/fetch/$s_!7zfZ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Febcbef88-8339-4fc1-8b2f-63546e33135d_1918x990.png 848w, https://substackcdn.com/image/fetch/$s_!7zfZ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Febcbef88-8339-4fc1-8b2f-63546e33135d_1918x990.png 1272w, 
https://substackcdn.com/image/fetch/$s_!7zfZ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Febcbef88-8339-4fc1-8b2f-63546e33135d_1918x990.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Here&#8217;s the version of the open compute story that actually shows up in a developer&#8217;s day. Open the provider menu in the Kilo IDE extension. Cerebras is there, next to Anthropic, OpenAI, and the 500-plus models on Kilo Gateway. Pick a wafer-scale model and have Kilo&#8217;s Plan mode plan a refactor across five files. 
The design comes back before you&#8217;ve finished reading the prompt you wrote.</p><p>Then, for the next task, pick something different. A Claude model for a delicate code review. A long-context model for repo-wide refactoring. Same workflow, but a different chip running underneath, picked because it fits this task, not because it&#8217;s the only one available.</p><p>That&#8217;s the experience the Cerebras IPO is signaling the market wants. It&#8217;s also the experience Kilo has been building since before the IPO was on the calendar.</p><h2><strong>The market signal</strong></h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!dyg0!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62ebc2c7-f8a6-468f-8442-27942170fc01_2048x1079.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!dyg0!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62ebc2c7-f8a6-468f-8442-27942170fc01_2048x1079.png 424w, https://substackcdn.com/image/fetch/$s_!dyg0!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62ebc2c7-f8a6-468f-8442-27942170fc01_2048x1079.png 848w, https://substackcdn.com/image/fetch/$s_!dyg0!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62ebc2c7-f8a6-468f-8442-27942170fc01_2048x1079.png 1272w, https://substackcdn.com/image/fetch/$s_!dyg0!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62ebc2c7-f8a6-468f-8442-27942170fc01_2048x1079.png 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!dyg0!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62ebc2c7-f8a6-468f-8442-27942170fc01_2048x1079.png" width="1456" height="767" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/62ebc2c7-f8a6-468f-8442-27942170fc01_2048x1079.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:767,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!dyg0!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62ebc2c7-f8a6-468f-8442-27942170fc01_2048x1079.png 424w, https://substackcdn.com/image/fetch/$s_!dyg0!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62ebc2c7-f8a6-468f-8442-27942170fc01_2048x1079.png 848w, https://substackcdn.com/image/fetch/$s_!dyg0!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62ebc2c7-f8a6-468f-8442-27942170fc01_2048x1079.png 1272w, https://substackcdn.com/image/fetch/$s_!dyg0!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62ebc2c7-f8a6-468f-8442-27942170fc01_2048x1079.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft 
icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Specialized inference silicon is no longer an interesting bet. It&#8217;s an obvious one. Cerebras going public at this valuation is the loudest confirmation yet that the inference market belongs to whoever can match the right workload to the right hardware. Specialized chips will keep arriving. Some will win their lanes, some won&#8217;t. The agentic platforms that thrive will be the ones engineers can trust to add the good ones quickly, route between them honestly, and never lock the choice down.</p><p>Kilo is the coding agent that has been building for that world all along. We were excited about Cerebras eleven months ago. We&#8217;ll be on whatever launches next month. 
The bet was always that openness wins as the compute landscape gets more interesting, and the compute landscape just got a lot more interesting.</p><p>The convenient part for anyone with an editor open: none of this is a roadmap. It&#8217;s already in the menu.</p><div><hr></div><p><em>Try Kilo with Cerebras: install the <a href="https://kilo.ai/install">VS Code or JetBrains extension</a> or run npm install -g @kilocode/cli, then add a Cerebras key under Settings &#8594; API Provider, or use Kilo Credits at provider-rate pricing.</em></p>]]></content:encoded></item><item><title><![CDATA[We Tested DeepSeek V4 Pro and Flash Against Claude Opus 4.7 and Kimi K2.6]]></title><description><![CDATA[DeepSeek V4 Pro and DeepSeek V4 Flash launched together on April 24, 2026 under the MIT license.]]></description><link>https://blog.kilo.ai/p/we-tested-deepseek-v4-pro-and-flash</link><guid isPermaLink="false">https://blog.kilo.ai/p/we-tested-deepseek-v4-pro-and-flash</guid><dc:creator><![CDATA[Darko]]></dc:creator><pubDate>Wed, 13 May 2026 11:17:25 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!RkaY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb504b56e-63d8-4c77-b957-8bec78d1bfac_1080x742.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><a href="https://api-docs.deepseek.com/quick_start/pricing">DeepSeek V4 Pro and DeepSeek V4 Flash</a> launched together on April 24, 2026 under the MIT license. 
They are DeepSeek&#8217;s first new architecture since V3, and their first open-weight lineup with two tiers (Pro as the flagship, Flash as the lightweight model).</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!RkaY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb504b56e-63d8-4c77-b957-8bec78d1bfac_1080x742.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!RkaY!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb504b56e-63d8-4c77-b957-8bec78d1bfac_1080x742.png 424w, https://substackcdn.com/image/fetch/$s_!RkaY!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb504b56e-63d8-4c77-b957-8bec78d1bfac_1080x742.png 848w, https://substackcdn.com/image/fetch/$s_!RkaY!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb504b56e-63d8-4c77-b957-8bec78d1bfac_1080x742.png 1272w, https://substackcdn.com/image/fetch/$s_!RkaY!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb504b56e-63d8-4c77-b957-8bec78d1bfac_1080x742.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!RkaY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb504b56e-63d8-4c77-b957-8bec78d1bfac_1080x742.png" width="1080" height="742" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b504b56e-63d8-4c77-b957-8bec78d1bfac_1080x742.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:742,&quot;width&quot;:1080,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!RkaY!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb504b56e-63d8-4c77-b957-8bec78d1bfac_1080x742.png 424w, https://substackcdn.com/image/fetch/$s_!RkaY!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb504b56e-63d8-4c77-b957-8bec78d1bfac_1080x742.png 848w, https://substackcdn.com/image/fetch/$s_!RkaY!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb504b56e-63d8-4c77-b957-8bec78d1bfac_1080x742.png 1272w, https://substackcdn.com/image/fetch/$s_!RkaY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb504b56e-63d8-4c77-b957-8bec78d1bfac_1080x742.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" 
xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>We ran both through the same FlowGraph spec we used for <a href="https://blog.kilocode.ai/">Claude Opus 4.7 vs Kimi K2.6</a>, with the same prompt and the same scoring rubric.</p><p><strong>TL;DR:</strong> <strong>DeepSeek V4 Pro scored 77/100</strong> for $2.25, which places it between Opus 4.7 (91) and Kimi K2.6 (68) on score. 
<strong>DeepSeek V4 Flash scored 60/100</strong> for $0.02, a price point we have not seen on this test before, but its build failed and the output is missing some key pieces.</p><h1><strong>The Four Models We Compared</strong></h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!VI4A!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdfc40c6e-2242-4f9a-aa09-9e3ab2e328eb_1456x437.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!VI4A!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdfc40c6e-2242-4f9a-aa09-9e3ab2e328eb_1456x437.jpeg 424w, https://substackcdn.com/image/fetch/$s_!VI4A!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdfc40c6e-2242-4f9a-aa09-9e3ab2e328eb_1456x437.jpeg 848w, https://substackcdn.com/image/fetch/$s_!VI4A!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdfc40c6e-2242-4f9a-aa09-9e3ab2e328eb_1456x437.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!VI4A!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdfc40c6e-2242-4f9a-aa09-9e3ab2e328eb_1456x437.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!VI4A!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdfc40c6e-2242-4f9a-aa09-9e3ab2e328eb_1456x437.jpeg" width="1456" height="437" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/dfc40c6e-2242-4f9a-aa09-9e3ab2e328eb_1456x437.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:437,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!VI4A!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdfc40c6e-2242-4f9a-aa09-9e3ab2e328eb_1456x437.jpeg 424w, https://substackcdn.com/image/fetch/$s_!VI4A!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdfc40c6e-2242-4f9a-aa09-9e3ab2e328eb_1456x437.jpeg 848w, https://substackcdn.com/image/fetch/$s_!VI4A!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdfc40c6e-2242-4f9a-aa09-9e3ab2e328eb_1456x437.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!VI4A!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdfc40c6e-2242-4f9a-aa09-9e3ab2e328eb_1456x437.jpeg 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path 
d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>DeepSeek V4 Flash is the cheapest model in the comparison by a wide margin.</strong> Its output tokens cost less than 1/14th of Kimi K2.6&#8217;s and roughly 1/89th of Claude Opus 4.7&#8217;s.</p><p><strong>DeepSeek is also <a href="https://api-docs.deepseek.com/quick_start/pricing">running</a> a 75% off promotion on DeepSeek V4 Pro through May 31, 2026.</strong> Under the discount, DeepSeek V4 Pro input drops to roughly $0.036/M and output drops to $0.87/M, putting it below Kimi K2.6 on both axes. DeepSeek separately cut input cache pricing across the lineup to one-tenth of previous levels as a permanent change.</p><h1><strong>The Test</strong></h1><p>This is the same FlowGraph spec we used in the <a href="https://blog.kilo.ai/p/we-gave-claude-opus-47-and-kimi-k26">Opus 4.7 vs Kimi K2.6</a> run: a workflow orchestration backend with 20 endpoints, persistent state, lease management, retries, and event streaming. 
It is a heavier infrastructure test than our usual coding benchmarks, chosen to push the models to their limits.</p><p>We ran DeepSeek V4 Pro and DeepSeek V4 Flash through the same setup to see where the new DeepSeek lineup lands on cost and first-pass quality next to Claude Opus 4.7 and Kimi K2.6.</p><h1><strong>The Prompt</strong></h1><p>We ran both DeepSeek models in <a href="https://kilocode.ai/">Kilo CLI</a> with the same prompt we used for Opus 4.7 and Kimi K2.6:</p><blockquote><p>&#8220;Read @SPEC.md and build the project in the current directory. Treat @SPEC.md as the source of truth. Do not simplify this into a mock, toy app, or basic CRUD scaffold. Create all code, configuration, Prisma schema, tests, and README needed for a runnable project.&#8230;&#8221;</p></blockquote><p>Both DeepSeek models ran in thinking mode, each in its own empty directory with no shared state.</p><h1><strong>What Each Model Produced</strong></h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!qjeS!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59fe59f0-563b-4a61-b70d-42ebbdd1133c_1456x377.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!qjeS!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59fe59f0-563b-4a61-b70d-42ebbdd1133c_1456x377.jpeg 424w, https://substackcdn.com/image/fetch/$s_!qjeS!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59fe59f0-563b-4a61-b70d-42ebbdd1133c_1456x377.jpeg 848w, 
https://substackcdn.com/image/fetch/$s_!qjeS!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59fe59f0-563b-4a61-b70d-42ebbdd1133c_1456x377.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!qjeS!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59fe59f0-563b-4a61-b70d-42ebbdd1133c_1456x377.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!qjeS!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59fe59f0-563b-4a61-b70d-42ebbdd1133c_1456x377.jpeg" width="1456" height="377" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/59fe59f0-563b-4a61-b70d-42ebbdd1133c_1456x377.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:377,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!qjeS!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59fe59f0-563b-4a61-b70d-42ebbdd1133c_1456x377.jpeg 424w, https://substackcdn.com/image/fetch/$s_!qjeS!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59fe59f0-563b-4a61-b70d-42ebbdd1133c_1456x377.jpeg 848w, 
https://substackcdn.com/image/fetch/$s_!qjeS!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59fe59f0-563b-4a61-b70d-42ebbdd1133c_1456x377.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!qjeS!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59fe59f0-563b-4a61-b70d-42ebbdd1133c_1456x377.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>DeepSeek V4 Pro passed its own test suite but the TypeScript build failed. 
DeepSeek V4 Flash&#8217;s test suite never ran because its setup script tried to force-reset the database in a way that errored out before the first test executed.</p><p>If we had stopped at the model summaries, both DeepSeek implementations would look closer to Claude Opus 4.7 than they actually were. A direct code review plus targeted reproductions against isolated SQLite databases revealed the problems in both model outputs.</p><h1><strong>DeepSeek V4 Pro</strong></h1><p>DeepSeek V4 Pro got the broad shape of the system right. The endpoints are wired up, the test suite passes, and the project layout is reasonable. <strong>The issues we found are concentrated in the same places as Kimi K2.6</strong>: lease expiry handling, scheduling, validation, and build integrity.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!qSDq!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F060370e7-c491-4821-8644-232ff08c74d7_1456x1709.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!qSDq!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F060370e7-c491-4821-8644-232ff08c74d7_1456x1709.png 424w, https://substackcdn.com/image/fetch/$s_!qSDq!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F060370e7-c491-4821-8644-232ff08c74d7_1456x1709.png 848w, https://substackcdn.com/image/fetch/$s_!qSDq!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F060370e7-c491-4821-8644-232ff08c74d7_1456x1709.png 1272w, 
https://substackcdn.com/image/fetch/$s_!qSDq!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F060370e7-c491-4821-8644-232ff08c74d7_1456x1709.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!qSDq!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F060370e7-c491-4821-8644-232ff08c74d7_1456x1709.png" width="1456" height="1709" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/060370e7-c491-4821-8644-232ff08c74d7_1456x1709.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1709,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!qSDq!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F060370e7-c491-4821-8644-232ff08c74d7_1456x1709.png 424w, https://substackcdn.com/image/fetch/$s_!qSDq!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F060370e7-c491-4821-8644-232ff08c74d7_1456x1709.png 848w, https://substackcdn.com/image/fetch/$s_!qSDq!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F060370e7-c491-4821-8644-232ff08c74d7_1456x1709.png 1272w, 
https://substackcdn.com/image/fetch/$s_!qSDq!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F060370e7-c491-4821-8644-232ff08c74d7_1456x1709.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h1><strong>Timed-out workers can still complete steps</strong></h1><p>When a worker claims a step, the system gives it a lease that expires after a set timeout. If the worker stalls or crashes, the lease should expire and another worker should be free to pick up the step. 
Once the lease has expired, the original worker is no longer the owner of that step and shouldn&#8217;t be able to mark it as done.</p><p>DeepSeek V4 Pro enforces this on heartbeats but not on completions. We claimed a step, pushed its lease expiry into the past, then asked the API to mark the step as successfully completed. The API returned 200 and recorded the step as succeeded. The original worker effectively reached past its expired lease and finalized work it no longer owned.</p><p><strong>DeepSeek V4 Pro&#8217;s own README says workers cannot complete after their lease expires, but the implementation does not enforce that.</strong></p><h1><strong>A full workflow blocks unrelated work</strong></h1><p>A workflow run can declare a maximum number of steps it is allowed to run in parallel. When that cap is reached, the saturated run shouldn&#8217;t accept more work, but other runs sharing the same queue should keep moving.</p><p>DeepSeek V4 Pro&#8217;s claim logic checks one candidate at a time. If that candidate happens to belong to a run that is already at its parallel cap, the function gives up and returns nothing, instead of moving on to the next candidate.</p><p>We reproduced this with two active runs sharing a queue. Run A was at its parallel limit. Run B had capacity and a higher-priority step ready to go. The next claim request came back empty. In production this would look like workers idling while there is real work to do, just because the first run on the queue happens to be saturated.</p><h1><strong>The project does not build</strong></h1><p>npm test passes but npm run build does not. Even after the build errors are fixed, the project still would not be runnable through npm start. The TypeScript config is set to not emit any compiled output, while package.json expects npm start to run that compiled output. 
<strong>A user following DeepSeek V4 Pro&#8217;s own README on a clean checkout would not get a working server.</strong></p><h1><strong>DeepSeek V4 Flash</strong></h1><p><strong>At $0.02 for the entire run, DeepSeek V4 Flash is in territory we have not tested before</strong>. The internal logic is plausible. The public API is where it falls apart.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!h1D4!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b352196-67a3-4d23-af0c-813abcdb5fe4_1456x1709.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!h1D4!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b352196-67a3-4d23-af0c-813abcdb5fe4_1456x1709.png 424w, https://substackcdn.com/image/fetch/$s_!h1D4!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b352196-67a3-4d23-af0c-813abcdb5fe4_1456x1709.png 848w, https://substackcdn.com/image/fetch/$s_!h1D4!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b352196-67a3-4d23-af0c-813abcdb5fe4_1456x1709.png 1272w, https://substackcdn.com/image/fetch/$s_!h1D4!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b352196-67a3-4d23-af0c-813abcdb5fe4_1456x1709.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!h1D4!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b352196-67a3-4d23-af0c-813abcdb5fe4_1456x1709.png" width="1456" height="1709" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7b352196-67a3-4d23-af0c-813abcdb5fe4_1456x1709.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1709,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!h1D4!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b352196-67a3-4d23-af0c-813abcdb5fe4_1456x1709.png 424w, https://substackcdn.com/image/fetch/$s_!h1D4!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b352196-67a3-4d23-af0c-813abcdb5fe4_1456x1709.png 848w, https://substackcdn.com/image/fetch/$s_!h1D4!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b352196-67a3-4d23-af0c-813abcdb5fe4_1456x1709.png 1272w, https://substackcdn.com/image/fetch/$s_!h1D4!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b352196-67a3-4d23-af0c-813abcdb5fe4_1456x1709.png 1456w" sizes="100vw" loading="lazy"></picture></div></a></figure></div><h1><strong>Clients can&#8217;t start a workflow run</strong></h1><p>To use this system, a client first creates a workflow run by calling a specific endpoint. Without that endpoint working, nothing else can happen. There is no run for workers to claim from, no events to stream, no step to complete.</p><p>DeepSeek V4 Flash wrote the handler for this endpoint, but mounted it under the wrong route prefix. The spec requires it at /workflows/key/:key/runs. DeepSeek V4 Flash actually serves it at /runs/key/:key/runs. A request to the spec path against the running server returned 404 Endpoint not found. The README documents the spec path, but the server does not serve it.</p><p>DeepSeek V4 Flash&#8217;s tests call internal functions directly rather than going through the HTTP API. From the test suite&#8217;s perspective, everything was fine. 
From an actual client&#8217;s perspective, the entry point to the system was missing.</p><h1><strong>Failed workflows still hand out work</strong></h1><p>Once a workflow run fails (because one of its steps used up all its retry attempts), every other step in that run should stop. The spec calls for the remaining steps to move into a blocked state so workers will not pick them up.</p><p>DeepSeek V4 Flash&#8217;s recovery logic loads all expired steps up front, then handles them one by one. If the first expired step exhausts its retries and fails the parent run, a later step in the same batch can still be promoted to a &#8220;ready to retry&#8221; state, even though the run it belongs to is already over.</p><p>We reproduced this with two expired steps in one run:</p><ul><li><p>Step a had no retry attempts left and was correctly marked dead</p></li><li><p>The parent run was correctly marked failed</p></li><li><p>Step b ended up in waiting_retry instead of blocked</p></li></ul><p>A worker polling for new work would still receive step b and execute it for a workflow that had already failed. Claude Opus 4.7 had a related bug involving multiple expired leases. Kimi K2.6 missed live event streaming entirely. <strong>Recovery under contention remains the hardest part of this spec for any model to get right on the first pass.</strong></p><h1><strong>Same timeout bug as DeepSeek V4 Pro</strong></h1><p>DeepSeek V4 Flash has the same expired-lease completion bug as DeepSeek V4 Pro. A worker whose lease has expired can still finalize the step, even though it no longer owns that step.</p><p>It also rejects valid request payloads. The spec says workflow run input and metadata can carry arbitrary JSON, which includes arrays, strings, and numbers. DeepSeek V4 Flash&#8217;s validation accepts only JSON objects. 
A client sending a JSON array as input would get a 400 response even though the spec accepts it.</p><h1><strong>Tool calling held up better than expected</strong></h1><p>The bugs above are about the output DeepSeek V4 Flash produced. Tool calling is a separate axis: how the model performed inside Kilo CLI. On that axis, the model held up surprisingly well. It read files before editing them, installed dependencies and ran the test suite at sensible points, and did not get stuck in retry loops on broken commands. The agent loop ran cleanly even when the code it produced had gaps.</p><p><strong>That is not what we expected from a model at this price tier.</strong> Tool calling reliability is usually where cheaper models break down first, with malformed arguments, hallucinated file paths, or runaway loops that burn through tokens without making progress. DeepSeek V4 Flash avoided those failure modes in our run.</p><h1><strong>Scoring</strong></h1><p>We used the same 7-category rubric as the Opus vs Kimi post.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!67Lg!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b201fc2-3474-47ec-9b80-ccc5e61eb982_1456x540.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!67Lg!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b201fc2-3474-47ec-9b80-ccc5e61eb982_1456x540.jpeg 424w, https://substackcdn.com/image/fetch/$s_!67Lg!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b201fc2-3474-47ec-9b80-ccc5e61eb982_1456x540.jpeg 848w, 
https://substackcdn.com/image/fetch/$s_!67Lg!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b201fc2-3474-47ec-9b80-ccc5e61eb982_1456x540.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!67Lg!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b201fc2-3474-47ec-9b80-ccc5e61eb982_1456x540.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!67Lg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b201fc2-3474-47ec-9b80-ccc5e61eb982_1456x540.jpeg" width="1456" height="540" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0b201fc2-3474-47ec-9b80-ccc5e61eb982_1456x540.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:540,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!67Lg!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b201fc2-3474-47ec-9b80-ccc5e61eb982_1456x540.jpeg 424w, https://substackcdn.com/image/fetch/$s_!67Lg!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b201fc2-3474-47ec-9b80-ccc5e61eb982_1456x540.jpeg 848w, 
https://substackcdn.com/image/fetch/$s_!67Lg!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b201fc2-3474-47ec-9b80-ccc5e61eb982_1456x540.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!67Lg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b201fc2-3474-47ec-9b80-ccc5e61eb982_1456x540.jpeg 1456w" sizes="100vw" loading="lazy"></picture></div></a></figure></div><p>DeepSeek V4 Pro slots between Claude Opus 4.7 and Kimi K2.6. The gap with Opus is concentrated in build quality and lease handling. 
DeepSeek V4 Flash sits below Kimi K2.6, with deductions in nearly every category.</p><h1><strong>Cost vs Quality</strong></h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!nf6m!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59f37c4e-4756-4ce9-b844-04cfdd6bc72e_1090x484.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!nf6m!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59f37c4e-4756-4ce9-b844-04cfdd6bc72e_1090x484.png 424w, https://substackcdn.com/image/fetch/$s_!nf6m!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59f37c4e-4756-4ce9-b844-04cfdd6bc72e_1090x484.png 848w, https://substackcdn.com/image/fetch/$s_!nf6m!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59f37c4e-4756-4ce9-b844-04cfdd6bc72e_1090x484.png 1272w, https://substackcdn.com/image/fetch/$s_!nf6m!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59f37c4e-4756-4ce9-b844-04cfdd6bc72e_1090x484.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!nf6m!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59f37c4e-4756-4ce9-b844-04cfdd6bc72e_1090x484.png" width="1090" height="484" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/59f37c4e-4756-4ce9-b844-04cfdd6bc72e_1090x484.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:484,&quot;width&quot;:1090,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!nf6m!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59f37c4e-4756-4ce9-b844-04cfdd6bc72e_1090x484.png 424w, https://substackcdn.com/image/fetch/$s_!nf6m!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59f37c4e-4756-4ce9-b844-04cfdd6bc72e_1090x484.png 848w, https://substackcdn.com/image/fetch/$s_!nf6m!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59f37c4e-4756-4ce9-b844-04cfdd6bc72e_1090x484.png 1272w, https://substackcdn.com/image/fetch/$s_!nf6m!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59f37c4e-4756-4ce9-b844-04cfdd6bc72e_1090x484.png 1456w" sizes="100vw" loading="lazy"></picture></div></a></figure></div><p><strong>DeepSeek V4 Flash&#8217;s cost per point is roughly 30x lower than Kimi K2.6&#8217;s and 100x lower than Claude Opus 4.7&#8217;s on this benchmark.</strong> The score is lower, but the absolute dollar amount is so small that running the same task three or four times to compare attempts is still cheaper than one Kimi K2.6 run.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!-zqd!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5d34121-438e-4376-aa15-8ee999507ddd_1356x892.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!-zqd!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5d34121-438e-4376-aa15-8ee999507ddd_1356x892.png 424w, 
https://substackcdn.com/image/fetch/$s_!-zqd!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5d34121-438e-4376-aa15-8ee999507ddd_1356x892.png 848w, https://substackcdn.com/image/fetch/$s_!-zqd!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5d34121-438e-4376-aa15-8ee999507ddd_1356x892.png 1272w, https://substackcdn.com/image/fetch/$s_!-zqd!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5d34121-438e-4376-aa15-8ee999507ddd_1356x892.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!-zqd!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5d34121-438e-4376-aa15-8ee999507ddd_1356x892.png" width="1356" height="892" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d5d34121-438e-4376-aa15-8ee999507ddd_1356x892.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:892,&quot;width&quot;:1356,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!-zqd!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5d34121-438e-4376-aa15-8ee999507ddd_1356x892.png 424w, 
https://substackcdn.com/image/fetch/$s_!-zqd!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5d34121-438e-4376-aa15-8ee999507ddd_1356x892.png 848w, https://substackcdn.com/image/fetch/$s_!-zqd!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5d34121-438e-4376-aa15-8ee999507ddd_1356x892.png 1272w, https://substackcdn.com/image/fetch/$s_!-zqd!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5d34121-438e-4376-aa15-8ee999507ddd_1356x892.png 1456w" sizes="100vw" loading="lazy"></picture></div></a></figure></div><p>DeepSeek V4 Pro is more expensive than Kimi K2.6 in this run because we ran it before applying the official discount. With DeepSeek&#8217;s 75% promo applied to current rates, the same run would have cost closer to $0.55, putting it below Kimi K2.6 in absolute cost while scoring 9 points higher.</p><h1><strong>What This Means for Open-Weight Models</strong></h1><p>The pattern from previous comparisons keeps holding. The gap on surface coverage between open-weight and frontier proprietary models is narrow. The gap on correctness inside hard code paths (lease recovery, cross-run scheduling, expired-lease rejection) is still there, but it is also narrowing.</p><p>DeepSeek V4 Pro is the practical step up from Kimi K2.6 based on our test. Same general failure pattern, but cleaner overall structure and fewer spec-level gaps. With DeepSeek&#8217;s official discount in effect, the price gap with Kimi K2.6 closes and the quality gap stays.</p><p>DeepSeek V4 Flash is a different conversation. <strong>At full price it is cheaper than the existing budget tier (Gemini 3.1 Flash Lite, Claude Haiku 4.5) by a wide margin.</strong> A 60/100 score on this spec is not, on its own, a reason to use it, but the cost is. For tasks where you can absorb a rough first pass and a human review, $0.02 per attempt changes the math considerably.</p><h1><strong>Our Takeaways</strong></h1><p><strong>Claude Opus 4.7 still pulls ahead.</strong> The trickier parts of the spec (anything involving timing, recovery, or coordination between moving pieces) are where every other model lost points. Claude Opus 4.7 had only one reproducible bug while the other three had more.</p><p><strong>DeepSeek V4 Pro performed better than Kimi K2.6 in this run.</strong> It scored 9 points higher, runs at a lower per-token list price, and produces about the same shape of failures under review. 
With DeepSeek&#8217;s official discount in effect through May 31, the per-token price gap is even larger.</p><p><strong>DeepSeek V4 Flash is a new category.</strong> It is not fully reliable for complex backend builds without a cleanup pass. But $0.02 for a first-pass attempt at a backend of this size is a price point that did not exist before. If you can absorb imperfect output, the math changes.</p>]]></content:encoded></item><item><title><![CDATA[AI Consulting Wins When It Embraces Model Freedom]]></title><description><![CDATA[Picking the right LLM for the job has never been more important]]></description><link>https://blog.kilo.ai/p/ai-consulting-wins-when-it-embraces</link><guid isPermaLink="false">https://blog.kilo.ai/p/ai-consulting-wins-when-it-embraces</guid><dc:creator><![CDATA[Ari]]></dc:creator><pubDate>Tue, 12 May 2026 22:03:10 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!haOW!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3702c383-552a-4e49-9261-ee276229eb3f_1774x887.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>The most interesting AI news this month isn&#8217;t a model release. It&#8217;s that the biggest AI companies are quietly becoming (big) consulting companies.</p><p>Both OpenAI and Anthropic are moving aggressively into consulting and enterprise deployment. OpenAI is reportedly building a massive &#8220;Deployment Company,&#8221; <a href="https://www.axios.com/2026/05/11/openai-deployco-private-equity">valued at $14 billion</a> right out of the gate, alongside private equity firms and global consultancies. 
Anthropic just <a href="https://www.anthropic.com/news/enterprise-ai-services-company">launched its own enterprise services</a> venture, backed by Blackstone, Goldman Sachs, and Hellman &amp; Friedman.</p><p>And Google recently announced a <a href="https://www.bloomberg.com/news/articles/2026-04-22/google-launches-750-million-fund-for-consultants-to-adopt-ai">$750 million fund</a> to help consultants adopt AI. Google cited large firms such as Accenture and McKinsey; it&#8217;s a more measured approach than launching a full consultancy itself, but it&#8217;s still no small play. The stakes are high, and everybody wants a piece of the prize.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!haOW!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3702c383-552a-4e49-9261-ee276229eb3f_1774x887.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!haOW!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3702c383-552a-4e49-9261-ee276229eb3f_1774x887.png 424w, https://substackcdn.com/image/fetch/$s_!haOW!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3702c383-552a-4e49-9261-ee276229eb3f_1774x887.png 848w, https://substackcdn.com/image/fetch/$s_!haOW!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3702c383-552a-4e49-9261-ee276229eb3f_1774x887.png 1272w, https://substackcdn.com/image/fetch/$s_!haOW!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3702c383-552a-4e49-9261-ee276229eb3f_1774x887.png 1456w" 
sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!haOW!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3702c383-552a-4e49-9261-ee276229eb3f_1774x887.png" width="1456" height="728" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3702c383-552a-4e49-9261-ee276229eb3f_1774x887.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:728,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:3294189,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://westernzevon.substack.com/i/197411592?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3702c383-552a-4e49-9261-ee276229eb3f_1774x887.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!haOW!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3702c383-552a-4e49-9261-ee276229eb3f_1774x887.png 424w, https://substackcdn.com/image/fetch/$s_!haOW!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3702c383-552a-4e49-9261-ee276229eb3f_1774x887.png 848w, https://substackcdn.com/image/fetch/$s_!haOW!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3702c383-552a-4e49-9261-ee276229eb3f_1774x887.png 1272w, 
https://substackcdn.com/image/fetch/$s_!haOW!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3702c383-552a-4e49-9261-ee276229eb3f_1774x887.png 1456w" sizes="100vw" fetchpriority="high"></picture></div></a></figure></div><p>This is not a side story. 
It&#8217;s the clearest signal yet that AI adoption is not going to be won by APIs alone.</p><p>But what happens if the big players start offering consultative solutions that are really just designed for single vendor lock-in?</p><h2>Everyone in AI is Now an AI Consultant (And That&#8217;s a Good Thing)</h2><p>The model landscape now changes so fast that &#8220;best practices&#8221; can become outdated overnight. For the past two years, the dominant narrative around AI has been software margins, automation, and &#8220;replace labor with models.&#8221; But the frontier labs themselves are now acknowledging something important: AI adoption is still profoundly human work.</p><p>In practice, almost everyone serious about AI has become a consultant now. <em>If you&#8217;re a builder, you&#8217;ve never been more valuable.</em></p><p>That includes founders. Engineers. Open-source maintainers. Partnership teams. Developer advocates. Even power users inside enterprises.</p><p>A surprising amount of modern AI work looks less like traditional SaaS deployment and more like a constant stream of config sessions, workflow redesigns, stack evaluations, and internal education. And that&#8217;s <em>in addition</em> to the strategy sessions.</p><p>When we run a KiloClaw configuration session with a developer team, the conversation rarely starts with a single model anymore. It starts with tradeoffs, or, more accurately, trade<em>-ups</em>.</p><p><em>Which model is best for coding right now? Which one handles long-context reasoning better? Which one is cheapest at scale? 
Which one works best for agents?</em></p><p>Whether they ask those kinds of questions will determine whether these new consultants help you <strong>increase ROI and actually</strong><em><strong> save money</strong></em><strong>, not just max out token spend.</strong> Management consultants have always centered their value proposition on efficiency and cost-cutting, not just bigger spending, which means the focus on model optimization is a core part of the AI adoption playbook. McKinsey, for example, has long cited one of its key benefits as helping companies <a href="https://www.mckinsey.com/capabilities/operations/our-insights/make-cost-saving-continuous-for-years-to-come">make cost saving continuous</a> for years to come.</p><p>And this focus extends to internal work, too. Boston Consulting Group has <a href="https://www.businessinsider.com/bcg-boston-consulting-group-ai-products-development-agents-2025-12">internally deployed over 36,000 custom GPTs</a> across its 32,000 consultants worldwide, signaling a massive effort to operationalize and optimize AI for knowledge work.</p><p>Claude Opus is immensely powerful. But it&#8217;s no secret that it can also get very expensive, very fast. Opus fits into key places in a workflow; we&#8217;re using it now in our Auto Frontier Model, which <a href="https://kilo.ai/docs/code-with-ai/agents/auto-model">automatically picks the best model for the job</a> within set criteria. But if you&#8217;re serious about optimizing ROI <em>over time</em>, you should be looking at a much broader spectrum of models and providers, including but not limited to OpenAI and Anthropic.</p><p>Just look at the recent PinchBench data on <a href="https://pinchbench.com/?view=value&amp;categories=core_agent">value scores for agent work</a>. New models from Stepfun and DeepSeek are giving Claude Haiku a run for its money. 
They aren&#8217;t necessarily <em>better</em> but they might be <em>the right fit for your daily agentic work</em>.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!nG9S!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff57044f0-25b2-4ba6-85d6-ca0c40572198_2048x1228.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!nG9S!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff57044f0-25b2-4ba6-85d6-ca0c40572198_2048x1228.png 424w, https://substackcdn.com/image/fetch/$s_!nG9S!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff57044f0-25b2-4ba6-85d6-ca0c40572198_2048x1228.png 848w, https://substackcdn.com/image/fetch/$s_!nG9S!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff57044f0-25b2-4ba6-85d6-ca0c40572198_2048x1228.png 1272w, https://substackcdn.com/image/fetch/$s_!nG9S!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff57044f0-25b2-4ba6-85d6-ca0c40572198_2048x1228.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!nG9S!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff57044f0-25b2-4ba6-85d6-ca0c40572198_2048x1228.png" width="1456" height="873" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f57044f0-25b2-4ba6-85d6-ca0c40572198_2048x1228.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:873,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!nG9S!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff57044f0-25b2-4ba6-85d6-ca0c40572198_2048x1228.png 424w, https://substackcdn.com/image/fetch/$s_!nG9S!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff57044f0-25b2-4ba6-85d6-ca0c40572198_2048x1228.png 848w, https://substackcdn.com/image/fetch/$s_!nG9S!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff57044f0-25b2-4ba6-85d6-ca0c40572198_2048x1228.png 1272w, https://substackcdn.com/image/fetch/$s_!nG9S!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff57044f0-25b2-4ba6-85d6-ca0c40572198_2048x1228.png 1456w" sizes="100vw" loading="lazy"></picture></div></a></figure></div><p>The same thing happens in workshops with teams frustrated by rising <a href="https://kilo.ai/kilo-code/vs/cursor">Cursor pricing</a> and exploring alternatives.</p><p>The opportunity is enormous. But so is the complexity.</p><h2>Trading Vendor Lock-in for Smarter Workflows</h2><p>The rise of AI consulting is good news. It means companies finally understand that &#8220;buying a model&#8221; is not the same thing as becoming AI-native.</p><p>The &#8220;big five&#8221; consultancies didn&#8217;t become giants because businesses lacked software. They became giants because transformation is messy, political, and operational. AI is no different &#8212; arguably it&#8217;s even more complex, in part because it&#8217;s touching on all of the systems as well as the models and agents that link them together. That&#8217;s the orchestration layer.</p><p>The smartest AI consulting firms won&#8217;t behave like reseller channels for a single lab. 
They&#8217;ll look more like intelligence infrastructure advisors. Part strategist, part systems integrator, part workflow architect, part educator.</p><p>Because, let&#8217;s be honest, they&#8217;ll be learning as much as you are. AI is moving so fast that it has to be developed in tandem with customers.</p><p>And increasingly, consultants&#8217; value won&#8217;t come from access to, or understanding of, a particular model or model family. It will come from helping organizations navigate constant change without rebuilding their stack every six months. They&#8217;ll behave like strategic orchestrators across a constantly shifting model landscape.</p><p><strong>That&#8217;s what we&#8217;ve been building for at Kilo</strong>&#8212;the option to always have the best model for the job right at your fingertips&#8212;<strong>and it&#8217;s also what our lab and inference partners are increasingly building for</strong>, as they release both strikingly efficient SOTA models (like <a href="https://blog.kilo.ai/p/the-arrival-of-gpt-55-openais-new">GPT-5.5</a>) and sneaky little powerhouses like the Xiaomi <a href="https://kilo.ai/models/xiaomi-mimo-v2-pro">MiMo models</a> and Ant Group&#8217;s <a href="https://blog.kilo.ai/p/the-elephant-is-out-of-the-bag-meet">Ling and Ring models</a>, optimized for agentic tool calling. Flexible tools to fit any workflow, so that even as the model landscape evolves your workflows can travel with it.</p><p>The winners in the new AI consulting race won&#8217;t look like traditional consultancies. They&#8217;ll likely be:</p><ul><li><p>smaller</p></li><li><p>faster</p></li><li><p>deeply technical but highly creative</p></li><li><p>open-source native</p></li><li><p>model-agnostic</p></li><li><p>agent-first</p></li><li><p>and capable of shipping real systems instead of just decks</p></li></ul><p>I&#8217;m not discounting the frontier labs. 
But the initial consultancy offerings they&#8217;ve been announcing will need to break down into micro-consultancies to be successful &#8212; in many ways mirroring what the frontier labs&#8217; enterprise sales teams and &#8220;forward-deployed engineers&#8221; have already been doing. Every vertical counts.</p><p>The best consultants may not even call themselves consultants. They&#8217;ll look more like hybrid studios, deployment partners, infra operators, and ultimately the account managers at the <a href="https://kilo.ai/code">AI tools</a> in your stack.</p><p>That&#8217;s a healthier ecosystem. And for the <em>consultees</em> to succeed as much as the <em>consultants</em>, that ecosystem will have to be based on model freedom.</p>]]></content:encoded></item><item><title><![CDATA[Inside Kilo Speed: The Engineer Who Teaches Teams How to Think in Agents]]></title><description><![CDATA[How to manage your agent team, from someone who coaches Kilo customers in agentic engineering.]]></description><link>https://blog.kilo.ai/p/inside-kilo-speed-the-engineer-who</link><guid isPermaLink="false">https://blog.kilo.ai/p/inside-kilo-speed-the-engineer-who</guid><dc:creator><![CDATA[Rebecca Dodd]]></dc:creator><pubDate>Tue, 12 May 2026 16:42:06 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/8cfac294-433b-44aa-b709-37ecae7ef0a4_1200x800.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>When you&#8217;re learning a new discipline&#8212;especially on the job&#8212;learning the theory behind it can feel like an abstract nice-to-have, while practice is the thing that&#8217;s actually useful. 
Learning by doing is absolutely a valid way to upskill, but in <a href="https://www.linkedin.com/in/marius-wichtner/">Marius Wichtner</a>&#8217;s experience, grasping the conceptual foundation of agentic engineering helps the practical steps make sense.</p><p>Before joining Kilo Code, Marius was already training engineering teams on working with generative AI. At Kilo, he does the same for enterprise clients in Kilo Speedruns: one-hour sessions designed to give teams a fast, practical orientation on agentic software development. He&#8217;s run them for companies across industries, and now he&#8217;s sharing the foundations of those lessons (and his specific practices for each) here:</p><ul><li><p>How to delegate effectively</p></li><li><p>How to scale across concurrent workstreams</p></li><li><p>How to maintain judgment and recover when things go wrong</p></li></ul><h2>1. How to Delegate: The Team Lead Model and the Plan</h2><p>The mental model Marius uses to explain agentic engineering&#8212;both in client speedruns and in how he structures his own work&#8212;is the team lead.</p><p>Team leads didn&#8217;t spend all day writing code, even before agentic tools existed. They were in pairing sessions, answering questions, reviewing output, and deciding what to merge. &#8220;Those were always the people that were only in meetings and they got called by all the juniors,&#8221; Marius says. &#8220;They were just solving the last 20% of the problem.&#8221;</p><p>In this model, the agent takes care of execution work, while the engineer operates as the team lead. The 80% that agents handle well&#8212;code generation, boilerplate, well-scoped subtasks&#8212;is work that the team lead delegates. 
The 20% that still requires the engineer is the judgment work: architectural decisions, what to merge, and recognizing when the agent has drifted.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!vacM!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feac2b1df-2473-4318-ad49-9ab8951ac560_2048x768.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!vacM!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feac2b1df-2473-4318-ad49-9ab8951ac560_2048x768.png 424w, https://substackcdn.com/image/fetch/$s_!vacM!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feac2b1df-2473-4318-ad49-9ab8951ac560_2048x768.png 848w, https://substackcdn.com/image/fetch/$s_!vacM!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feac2b1df-2473-4318-ad49-9ab8951ac560_2048x768.png 1272w, https://substackcdn.com/image/fetch/$s_!vacM!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feac2b1df-2473-4318-ad49-9ab8951ac560_2048x768.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!vacM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feac2b1df-2473-4318-ad49-9ab8951ac560_2048x768.png" width="1456" height="546" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/eac2b1df-2473-4318-ad49-9ab8951ac560_2048x768.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:546,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!vacM!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feac2b1df-2473-4318-ad49-9ab8951ac560_2048x768.png 424w, https://substackcdn.com/image/fetch/$s_!vacM!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feac2b1df-2473-4318-ad49-9ab8951ac560_2048x768.png 848w, https://substackcdn.com/image/fetch/$s_!vacM!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feac2b1df-2473-4318-ad49-9ab8951ac560_2048x768.png 1272w, https://substackcdn.com/image/fetch/$s_!vacM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feac2b1df-2473-4318-ad49-9ab8951ac560_2048x768.png 1456w" sizes="100vw" fetchpriority="high"></picture></div></a><figcaption class="image-caption">Parallel development with the engineer acting as team lead</figcaption></figure></div><p>The engineers who transition most naturally into agentic workflows are often the ones who were already operating this way: team leads and architects who had developed the habit of switching contexts and reviewing output rather than writing it. Everyone else has to learn that mode of working, which starts with understanding the difference between a specification and a plan.</p><p>A specification captures what the user wants. It doesn&#8217;t change based on the current state of the codebase. It&#8217;s set from the user demand, and it stays set. A plan is <em>how</em> you intend to build the thing given where the code actually is right now. &#8220;A plan is dependent on your state of the code,&#8221; says Marius. 
&#8220;Plans usually get thrown away very quickly.&#8221;</p><p>When Marius works with an agent on complex tasks (especially those with important architecture decisions), he asks it to write its plan to a markdown file before it starts executing so he can review it. Asking the agent to write its plan first forces a shared understanding of what&#8217;s actually being built. You review it, ask questions, and surface problems before execution begins. It&#8217;s the refinement stage of traditional software engineering, but the difference now is that the feedback loop is much faster.</p><p>Plans, done right, function as constraints. Marius thinks of this as keeping an agent in the acceptable solution space: the set of outputs you will actually accept. The further an agent drifts from a confirmed plan, the more likely it ends up somewhere that requires starting over. Forcing the plan upfront dramatically increases the probability of staying on track.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!7zuo!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2e937e95-32d1-4283-81e4-0933b9ef9783_1672x941.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!7zuo!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2e937e95-32d1-4283-81e4-0933b9ef9783_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!7zuo!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2e937e95-32d1-4283-81e4-0933b9ef9783_1672x941.png 848w, 
https://substackcdn.com/image/fetch/$s_!7zuo!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2e937e95-32d1-4283-81e4-0933b9ef9783_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!7zuo!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2e937e95-32d1-4283-81e4-0933b9ef9783_1672x941.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!7zuo!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2e937e95-32d1-4283-81e4-0933b9ef9783_1672x941.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2e937e95-32d1-4283-81e4-0933b9ef9783_1672x941.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!7zuo!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2e937e95-32d1-4283-81e4-0933b9ef9783_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!7zuo!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2e937e95-32d1-4283-81e4-0933b9ef9783_1672x941.png 848w, 
https://substackcdn.com/image/fetch/$s_!7zuo!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2e937e95-32d1-4283-81e4-0933b9ef9783_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!7zuo!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2e937e95-32d1-4283-81e4-0933b9ef9783_1672x941.png 1456w" sizes="100vw" loading="lazy"></picture></div></a><figcaption class="image-caption">Plans help to keep your agent within the acceptable solution space</figcaption></figure></div><p>The plan also acts as 
a contract: it documents the approach the agent intends to take, so when it does something unexpected later, there&#8217;s a reference point. &#8220;You can always reiterate to the agent, &#8216;We decided to implement this plan. Why have you decided otherwise?&#8217;&#8221;</p><h2>2. How to Scale: Parallelism and the Context Rot Problem</h2><p>Even with a solid plan in place, there&#8217;s a natural limit to how far a single agent session can take you: context rot. As a session grows, accumulating conversation history, prior decisions, and intermediate code states, the agent starts losing coherence. Tasks that were reasonable at the start become unpredictable midway through. Early decisions can come back to bite you. At some point, recovery means starting over.</p><p>Most engineers treat this as a nuisance and work around it by brute force: shorter sessions, more restarts. Marius treats it as a signal that the work hasn&#8217;t been decomposed correctly. &#8220;If you have a huge feature and you develop on it for the whole week, you will keep having context rot,&#8221; Marius says. &#8220;It makes much more sense to plan out what you want to implement ahead of time and then develop each of the sub-problems individually in small context windows.&#8221;</p><p>This is where parallelism comes in: you run multiple agents simultaneously, each working on a specific sub-problem. But parallel agents writing to the same file system will conflict (the same kind of collision that version control tools like Git help teams manage). You need each agent working in its own isolated environment.</p><p>To address this, Marius first built a solution into his own custom IDE, then built Kilo&#8217;s <a href="https://blog.kilo.ai/i/192608130/the-agent-manager">Agent Manager</a>: a tool for running multiple agent sessions simultaneously, each in its own isolated workspace, with its own file system. 
Instead of supervising agents one at a time, an engineer can delegate across several concurrent workstreams and review the results as they come in. Things that look good get merged; things that don&#8217;t get discarded without the cost of untangling a week of compounded decisions.</p><p>Not every task demands the multi-agent treatment. Marius works across three categories depending on complexity:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!97av!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30c1d149-282c-49df-91bf-d8ea57ce7f93_1672x941.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!97av!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30c1d149-282c-49df-91bf-d8ea57ce7f93_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!97av!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30c1d149-282c-49df-91bf-d8ea57ce7f93_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!97av!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30c1d149-282c-49df-91bf-d8ea57ce7f93_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!97av!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30c1d149-282c-49df-91bf-d8ea57ce7f93_1672x941.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!97av!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30c1d149-282c-49df-91bf-d8ea57ce7f93_1672x941.png" 
width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/30c1d149-282c-49df-91bf-d8ea57ce7f93_1672x941.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!97av!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30c1d149-282c-49df-91bf-d8ea57ce7f93_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!97av!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30c1d149-282c-49df-91bf-d8ea57ce7f93_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!97av!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30c1d149-282c-49df-91bf-d8ea57ce7f93_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!97av!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30c1d149-282c-49df-91bf-d8ea57ce7f93_1672x941.png 1456w" sizes="100vw" loading="lazy"></picture></div></a><figcaption class="image-caption">How Marius routes tasks based on their complexity</figcaption></figure></div><p><strong>Easy tasks:</strong> Things like adding documentation, writing a unit test, or well-scoped bug fixes go to a fully autonomous cloud workflow. The developer writes the spec, the agent executes, the developer reviews the diff. No supervision is required mid-session.</p><p><strong>Hard tasks:</strong> Implementing a complex feature spanning UI and backend, or anything with meaningful architectural decisions, gets handled locally with Agent Manager. The developer supervises multiple agents working in parallel on decomposed subtasks, stays close to the work, and makes the judgment calls as diffs come in.</p><p><strong>Unclear tasks:</strong> When the outcome isn&#8217;t well-defined, it&#8217;s hard to write a spec precise enough to constrain the agent toward a single solution. For these, Marius runs multiple agents in parallel against the same spec and compares the results. 
Instead of splitting work, the parallelism here is about generating variants and selecting the best one. The engineer&#8217;s job is choosing the right route.</p><h2>3. How to Stay on Track: Context Engineering and Judgment</h2><p>Context engineering, as Marius defines it, is how you structure and optimize the context of the agent. The goal is to limit an agent to doing exactly what you want, over time, in your codebase. It&#8217;s the ongoing work of keeping agents oriented, and knowing how to reorient them when they&#8217;ve drifted.</p><p>For upfront orientation, Marius uses Handy, a speech-to-text tool, to interact with agents verbally before locking in a plan. A lot of the context that matters for a task lives in the engineer&#8217;s head and never gets written down, because it&#8217;s too tedious to type everything out. Speaking it aloud removes that barrier, and an LLM can distil the rough transcript into a precise problem statement. The rough transcript also becomes the raw material for the plan the agent writes before executing.</p><p>When an agent session ends&#8212;whether it hit a context limit or simply reached a natural stopping point&#8212;continuing the work is usually straightforward. The original prompts, the Git diff (Agent Manager measures the delta from when the session started), and the current state of the codebase give a new agent enough to pick up where the previous one left off. Tools like <a href="https://repomix.com/">Repomix</a> can help with collecting specific file trees for this purpose. All of this can happen locally or on GitHub, where an issue describes the task, the PR contains the changes, and the history provides the thread. Most agents can continue from that context without much intervention.</p><p>What this process makes visible is what&#8217;s actually irreplaceable: the context that isn&#8217;t captured anywhere. 
Code and prompts are always an approximation&#8212;there are causal relationships in software that are hard to capture in prompts or code alone. Some of them, like another team&#8217;s architectural decision creating a dependency you didn&#8217;t know about, can be surfaced and handed off. Others only become visible when you run the code or at scale. An agent can&#8217;t know what hasn&#8217;t surfaced yet&#8212;that&#8217;s still the engineer&#8217;s job.</p><p>This is the difference between just coding and software <em>engineering</em>. The easy mistake with agentic work is treating it as a handoff: you describe what you want, the agent builds it, you ship it. In that approach, the critical last 20% can get lost: things like evaluating architectural choices and catching when an agent has veered off course. These require engineering judgment, and they&#8217;re often much harder than the first 80%.</p><p>The mental shift Marius describes is about learning to apply engineering judgment at the right moments, across multiple concurrent threads, rather than sequentially inside a single one.</p><p><em>Read the other posts in our Kilo Speed series:</em></p><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;0581f2c9-7c4b-4ca9-b04c-ad222471fdf9&quot;,&quot;caption&quot;:&quot;On a Wednesday morning, Mark IJbema sits down at his desk and reviews a pull request. He didn&#8217;t write the code in it&#8212;a Cloud Agent did, overnight. 
Before finishing up the evening before, he&#8217;d tasked it with finding the most critical components in the new VS Code extension that were missing screenshot tests, adding them, and opening a PR.&quot;,&quot;cta&quot;:&quot;Read full story&quot;,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Inside Kilo Speed: How One Engineer is Replatforming Our VS Code Extension in a Month&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:154482092,&quot;name&quot;:&quot;Rebecca Dodd&quot;,&quot;bio&quot;:&quot;Rebecca writes for developers and the people they work with.&quot;,&quot;photo_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/818813eb-ddc6-48b4-bf08-c732da525788_1875x1875.jpeg&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:null}],&quot;post_date&quot;:&quot;2026-03-11T19:37:34.650Z&quot;,&quot;cover_image&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7f924326-739a-4866-bf38-a5738a94b26b_1200x800.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://blog.kilo.ai/p/inside-kilo-speed-how-one-engineer-52c&quot;,&quot;section_name&quot;:null,&quot;video_upload_id&quot;:null,&quot;id&quot;:190628346,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:8,&quot;comment_count&quot;:0,&quot;publication_id&quot;:4363009,&quot;publication_name&quot;:&quot;Kilo Blog&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!EOvc!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3a533426-8d50-480b-8659-75d53523270b_720x720.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;17c5ab19-4820-403f-9fac-af05928db0c4&quot;,&quot;caption&quot;:&quot;Identity resolution is one of 
the hardest problems in data engineering. Telemetry, backend, payment platforms, marketing forms&#8212;each of these systems contain their own representation of what a user is. Stitching those together requires not only a shared meaning of what represents a single identity, but also mapping edge cases, validating fragile joins an&#8230;&quot;,&quot;cta&quot;:&quot;Read full story&quot;,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Inside Kilo Speed: How Our Head of Data Shipped an Identity Resolution System Before His First Full Day&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:154482092,&quot;name&quot;:&quot;Rebecca Dodd&quot;,&quot;bio&quot;:&quot;Rebecca writes for developers and the people they work with.&quot;,&quot;photo_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/818813eb-ddc6-48b4-bf08-c732da525788_1875x1875.jpeg&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:null}],&quot;post_date&quot;:&quot;2026-02-20T19:46:19.597Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/$s_!2QBT!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd8802d9d-dfd5-4b59-97a3-5647515813d5_1200x800.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://blog.kilo.ai/p/inside-kilo-speed-how-our-head-of&quot;,&quot;section_name&quot;:null,&quot;video_upload_id&quot;:null,&quot;id&quot;:188480005,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:12,&quot;comment_count&quot;:0,&quot;publication_id&quot;:4363009,&quot;publication_name&quot;:&quot;Kilo 
Blog&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!EOvc!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3a533426-8d50-480b-8659-75d53523270b_720x720.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;40527036-0e95-49b0-a3fb-55985ec39c46&quot;,&quot;caption&quot;:&quot;Infrastructure projects&#8212;the &#8220;plumbing&#8221; that other features rely on&#8212;are usually weighed down by months of architectural vetting and manual configuration.&quot;,&quot;cta&quot;:&quot;Read full story&quot;,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Inside Kilo Speed: How One Engineer Built Cloud Agents in a Week&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:154482092,&quot;name&quot;:&quot;Rebecca Dodd&quot;,&quot;bio&quot;:&quot;Rebecca writes for developers and the people they work 
with.&quot;,&quot;photo_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/818813eb-ddc6-48b4-bf08-c732da525788_1875x1875.jpeg&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:null}],&quot;post_date&quot;:&quot;2026-02-04T20:20:54.440Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/$s_!7or6!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F447714d6-6ad4-404c-9de1-05ad845a60de_1280x720.webp&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://blog.kilo.ai/p/inside-kilo-speed-how-one-engineer-dcb&quot;,&quot;section_name&quot;:null,&quot;video_upload_id&quot;:null,&quot;id&quot;:186786786,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:8,&quot;comment_count&quot;:2,&quot;publication_id&quot;:4363009,&quot;publication_name&quot;:&quot;Kilo Blog&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!EOvc!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3a533426-8d50-480b-8659-75d53523270b_720x720.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;b318abc7-5af3-4500-9d27-1dacd3c82989&quot;,&quot;caption&quot;:&quot;In the traditional engineering model, velocity is often sacrificed at the altar of coordination. Even a simple feature can spend weeks in an asynchronous feedback loop of internal posts and cross-departmental reviews before a single line of code is written. 
The process is designed for consensus, not speed.&quot;,&quot;cta&quot;:&quot;Read full story&quot;,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Inside Kilo Speed: How One Engineer Shipped an MVP in His First Week&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:154482092,&quot;name&quot;:&quot;Rebecca Dodd&quot;,&quot;bio&quot;:&quot;Rebecca writes for developers and the people they work with.&quot;,&quot;photo_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/818813eb-ddc6-48b4-bf08-c732da525788_1875x1875.jpeg&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:null}],&quot;post_date&quot;:&quot;2026-01-28T14:28:15.955Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/$s_!Uquq!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F27a0349c-6f6b-4b40-815f-258836980826_1280x720.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://blog.kilo.ai/p/inside-kilo-speed-how-one-engineer-971&quot;,&quot;section_name&quot;:null,&quot;video_upload_id&quot;:null,&quot;id&quot;:185320862,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:7,&quot;comment_count&quot;:0,&quot;publication_id&quot;:4363009,&quot;publication_name&quot;:&quot;Kilo Blog&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!EOvc!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3a533426-8d50-480b-8659-75d53523270b_720x720.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;3b99ed71-4dd3-43e6-810f-bccd0462688a&quot;,&quot;caption&quot;:&quot;The promise of AI coding is exponential speed. 
But for many engineers, the reality is downgraded models, confusing pricing, rate limits, and model lock-in. This is all artificial friction (we&#8217;re calling it AI drag) introduced by the very tools that are supposed to be making developers more productive.&quot;,&quot;cta&quot;:&quot;Read full story&quot;,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Inside Kilo Speed: How One Engineer Shipped an AI Adoption Dashboard in Two Days&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:154482092,&quot;name&quot;:&quot;Rebecca Dodd&quot;,&quot;bio&quot;:&quot;Rebecca writes for developers and the people they work with.&quot;,&quot;photo_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/818813eb-ddc6-48b4-bf08-c732da525788_1875x1875.jpeg&quot;,&quot;is_guest&quot;:true,&quot;bestseller_tier&quot;:null,&quot;primaryPublicationSubscribeUrl&quot;:&quot;https://rebeccadoddwrites.substack.com/subscribe?&quot;,&quot;primaryPublicationUrl&quot;:&quot;https://rebeccadoddwrites.substack.com&quot;,&quot;primaryPublicationName&quot;:&quot;Rebecca Dodd&quot;,&quot;primaryPublicationId&quot;:7716332}],&quot;post_date&quot;:&quot;2026-01-21T14:31:56.839Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/$s_!2wC2!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcd726bf6-bbc5-4fef-9f39-612151002f52_1280x720.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://blog.kilo.ai/p/inside-kilo-speed-how-one-engineer&quot;,&quot;section_name&quot;:null,&quot;video_upload_id&quot;:null,&quot;id&quot;:185118837,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:16,&quot;comment_count&quot;:0,&quot;publication_id&quot;:4363009,&quot;publication_name&quot;:&quot;Kilo 
Blog&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!EOvc!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3a533426-8d50-480b-8659-75d53523270b_720x720.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div>]]></content:encoded></item><item><title><![CDATA[Cowboy Coder Is Back. This Time, They Scale]]></title><description><![CDATA[I should start by admitting I&#8217;m part of the problem.]]></description><link>https://blog.kilo.ai/p/cowboy-coder-is-back-this-time-they</link><guid isPermaLink="false">https://blog.kilo.ai/p/cowboy-coder-is-back-this-time-they</guid><dc:creator><![CDATA[Andrew Storms]]></dc:creator><pubDate>Mon, 11 May 2026 17:26:55 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!Bj8h!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa70fa9b0-11a2-49a9-a978-92da678b884b_1066x600.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>I should start by admitting I&#8217;m part of the problem.</p><p>I can still draw the architecture of code I wrote three years ago from memory. The data flow, the edge cases, the reasoning behind every choice that looks strange at first glance. Ask me to do the same for a feature I shipped last month with help from an agent, and I can tell you what it does and why we built it. The deeper model, the one that lives at the level of individual functions, isn&#8217;t there.</p><p>That&#8217;s not laziness, and it&#8217;s not a lapse in review. I read every diff. An agent does a closer pass alongside me. I can speak to the intent and shape of what I&#8217;m approving. 
But the deep mental model, the one you actually need at 2am when something breaks and the agent isn&#8217;t helping you debug, isn&#8217;t forming the way it used to.</p><p>I&#8217;m a CISO who still writes code, and this worries me. It should worry anyone managing engineers right now, because it isn&#8217;t just me. Across the industry, AI coding agents are quietly reviving the single worst antipattern in software engineering history. We just don&#8217;t recognize it yet, because it&#8217;s wearing different clothes.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Bj8h!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa70fa9b0-11a2-49a9-a978-92da678b884b_1066x600.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Bj8h!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa70fa9b0-11a2-49a9-a978-92da678b884b_1066x600.png 424w, https://substackcdn.com/image/fetch/$s_!Bj8h!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa70fa9b0-11a2-49a9-a978-92da678b884b_1066x600.png 848w, https://substackcdn.com/image/fetch/$s_!Bj8h!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa70fa9b0-11a2-49a9-a978-92da678b884b_1066x600.png 1272w, https://substackcdn.com/image/fetch/$s_!Bj8h!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa70fa9b0-11a2-49a9-a978-92da678b884b_1066x600.png 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!Bj8h!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa70fa9b0-11a2-49a9-a978-92da678b884b_1066x600.png" width="1066" height="600" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a70fa9b0-11a2-49a9-a978-92da678b884b_1066x600.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:600,&quot;width&quot;:1066,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Bj8h!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa70fa9b0-11a2-49a9-a978-92da678b884b_1066x600.png 424w, https://substackcdn.com/image/fetch/$s_!Bj8h!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa70fa9b0-11a2-49a9-a978-92da678b884b_1066x600.png 848w, https://substackcdn.com/image/fetch/$s_!Bj8h!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa70fa9b0-11a2-49a9-a978-92da678b884b_1066x600.png 1272w, https://substackcdn.com/image/fetch/$s_!Bj8h!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa70fa9b0-11a2-49a9-a978-92da678b884b_1066x600.png 1456w" sizes="100vw" fetchpriority="high"></picture></div></a></figure></div><h3><strong>Remember the cowboy?</strong></h3><p>If you&#8217;ve managed engineers long enough, you know the cowboy. The one who disappears for a weekend and comes back Monday with a full rewrite nobody asked for. The one who, somehow, is the only person who understands the gnarly billing module, the auth flow, the deployment pipeline. The one whose decisions land in production faster than the team can review them.</p><p>Cowboys aren&#8217;t heroes, by the way. The hero is the engineer who pulls the 2am save when production breaks. The cowboy is the one who created the conditions that made the 2am save necessary in the first place. Heroes clean up. 
Cowboys cause.</p><p>For twenty years, our industry has been quietly learning how to build engineering organizations that don&#8217;t depend on this person. Code review. Pair programming. Design docs and RFCs. Collective code ownership. Postmortems that look at process, not blame. The whole inheritance from XP, agile, and DevOps was, in large part, a response to the lesson that cowboy culture <em>feels</em> productive and is actually corrosive.</p><p>It worked. Not perfectly, but the average engineering team today is far more resilient than the average team in 2005.</p><p>Then the agents arrived.</p><p>Watch what happens on teams that have adopted Claude, Cursor, Copilot, Codex, and the rest without changing how they work. An engineer prompts an agent. The agent emits eight hundred lines of code. The engineer skims it, sees the tests pass, and merges. Repeat, ten times a day, across the team.</p><p>The output is enormous. The velocity charts look incredible. And underneath, something is going wrong that nobody is naming yet.</p><p>Nobody on the team has reasoned through that code. The &#8220;author&#8221; couldn&#8217;t walk you through it under questioning. They didn&#8217;t write it, they prompted it. The reviewer couldn&#8217;t either; they had thirty other PRs in the queue, and half the time the reviewer is another agent. Six months from now, when something breaks at 2am, the engineer who gets paged will be debugging code that has, in any meaningful sense, no human author at all.</p><p>This is the cowboy pattern. The weekend rewrite, the opaque module, the knowledge silo, the tech debt nobody quite remembers creating. Same antipattern, new substrate.</p><h3><strong>Why it&#8217;s actually worse</strong></h3><p>The cowboy archetype, for all its damage, had one redeeming feature: somewhere, in one human brain, the model of the system existed. Bus factor of one.</p><p>Development driven by agents, without comprehension, produces bus factor zero. 
The code enters the repository with nobody understanding it. There is no expert to consult, because the &#8220;expert&#8221; was a probability distribution that has since moved on to the next prompt.</p><p>The social brakes that used to slow cowboys down are also gone. Cowboys had egos, reputations, and peers who could push back in code review. Agents have none of these. They don&#8217;t sulk when overruled, don&#8217;t take credit, don&#8217;t feel shame when prod breaks. The friction that used to make cowboy culture limit itself in healthy teams, the simple fact that other humans were watching, is absent.</p><p>And there&#8217;s a new accountability sink. When the cowboy shipped a bad rewrite, you knew whose desk to visit. When an agent ships a bad rewrite, the conversation goes &#8220;well, the AI wrote it&#8221; and everyone shrugs. The blame diffuses into the tooling.</p><h3><strong>What managers should do now</strong></h3><p>The good news: the playbook for fixing this already exists. We wrote it the last time. It needs updating, not reinventing.</p><p><strong>Require comprehension, not just approval.</strong> Before any meaningful PR written with an agent gets merged, the author should be able to walk through it without asking the agent again. If they can&#8217;t explain why a function exists, the PR isn&#8217;t ready. This is the most impactful change you can make, and the one I&#8217;d benefit from most personally.</p><p><strong>Cap PR size, hard.</strong> Code review evolved assuming limited human throughput on both sides. Agents broke that assumption. A PR of 50 lines can be meaningfully reviewed; a PR of 800 lines gets approved without thought. Set a limit, enforce it in tooling, and force large changes to be decomposed.</p><p><strong>Tag agent involvement and track it.</strong> Make AI authorship a first-class piece of metadata on commits and PRs. 
Watch incident rates, time to debug, and refactor cost on modules where agents wrote most of the code, and compare against the rest. You can&#8217;t manage what you can&#8217;t see, and right now most engineering orgs are flying blind on this.</p><p><strong>Protect the loop of deliberate practice.</strong> Junior engineers who never struggle through a hard bug don&#8217;t become senior engineers who can debug under pressure. Build in rotations without agents, pair on hard problems, and make &#8220;can debug from scratch&#8221; part of your leveling criteria. The seniors riding herd on agents today learned their craft the hard way. The next cohort needs a path to the same skill, or you&#8217;ll wake up in five years with a team that can prompt fluently and reason about nothing.</p><p><strong>Reframe tech debt as unread code.</strong> The most dangerous code in your repository is no longer the <em>bad</em> code. It&#8217;s the <em>unread</em> code, modules that work today and that nobody on the team has actually internalized. Schedule comprehension audits. Assign engineers to read and document modules written by agents that they didn&#8217;t author themselves. Treat unread code as a liability on the books.</p><h3><strong>This is not an argument against AI</strong></h3><p>The agents are useful. The productivity gains are real. I use them every day, and I&#8217;m not giving them up.</p><p>The point is that the technical productivity of these tools is arriving faster than the organizational practices needed to absorb them. The teams that already had healthy engineering culture, the kind with code review that actually reviews, sustainable pace, and collective ownership, will adapt and thrive. 
The teams that quietly tolerated cowboys are about to have a much worse problem, at much greater scale, with no single person to point at.</p><p>And the rest of us, the ones who can still picture the flow of code we wrote three years ago but no longer build that same depth of model with the new stuff, need to be honest that the muscle is atrophying. Mine is. Yours probably is too.</p><p>The cowboy didn&#8217;t go away. The cowboy scaled, with a million tokens of context. The work of engineering management is to recognize the pattern in its new form and apply the lessons we already learned the last time.</p>]]></content:encoded></item><item><title><![CDATA[PinchBench 2.0 is here]]></title><description><![CDATA[148 tasks, parallel judging, thinking-level support, and a brand new leaderboard]]></description><link>https://blog.kilo.ai/p/pinchbench-20-is-here</link><guid isPermaLink="false">https://blog.kilo.ai/p/pinchbench-20-is-here</guid><dc:creator><![CDATA[Brendan O'Leary]]></dc:creator><pubDate>Mon, 11 May 2026 13:29:53 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/193b44a7-6419-4b05-bca3-de4967a9b8e0_1200x800.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>When Jensen Huang put PinchBench on screen at GTC, I had a moment of genuine &#8220;oh s***, people are actually using this.&#8221; What started as me wanting to know which model to run on my OpenClaw setup has become the reference benchmark for evaluating AI coding agents in real-world workflows. 
That comes with responsibility.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!9jIl!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F41effcfe-75b6-4eec-9d71-906e99e2e516_1046x575.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!9jIl!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F41effcfe-75b6-4eec-9d71-906e99e2e516_1046x575.jpeg 424w, https://substackcdn.com/image/fetch/$s_!9jIl!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F41effcfe-75b6-4eec-9d71-906e99e2e516_1046x575.jpeg 848w, https://substackcdn.com/image/fetch/$s_!9jIl!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F41effcfe-75b6-4eec-9d71-906e99e2e516_1046x575.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!9jIl!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F41effcfe-75b6-4eec-9d71-906e99e2e516_1046x575.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!9jIl!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F41effcfe-75b6-4eec-9d71-906e99e2e516_1046x575.jpeg" width="1046" height="575" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/41effcfe-75b6-4eec-9d71-906e99e2e516_1046x575.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:575,&quot;width&quot;:1046,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image" title="Image" srcset="https://substackcdn.com/image/fetch/$s_!9jIl!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F41effcfe-75b6-4eec-9d71-906e99e2e516_1046x575.jpeg 424w, https://substackcdn.com/image/fetch/$s_!9jIl!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F41effcfe-75b6-4eec-9d71-906e99e2e516_1046x575.jpeg 848w, https://substackcdn.com/image/fetch/$s_!9jIl!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F41effcfe-75b6-4eec-9d71-906e99e2e516_1046x575.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!9jIl!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F41effcfe-75b6-4eec-9d71-906e99e2e516_1046x575.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture></div></a></figure></div><p><em>(&#8220;oh s***&#8221; obviously means &#8220;oh snap&#8221; &#8230; what did you think it was?)</em></p><p>Today we&#8217;re shipping PinchBench 2.0. Here&#8217;s what changed and why.</p><h2>The problems with v1</h2><p>V1 worked by running 23 tasks, grading them, and giving you a score. But it had shortcomings that became obvious as usage scaled:</p><p><strong>Scoring was gameable.</strong> The leaderboard ranked by mean score across completed tasks&#8212;but didn&#8217;t account for how many tasks you ran. An agent that cherry-picked 1 easy task and scored 100% would outrank one that ran all 23 and scored 94.8%. <a href="https://github.com/pinchbench/skill/issues/53">Broken</a>.</p><p><strong>At least one task was basically impossible.</strong> A 95% failure rate across 90% of models isn&#8217;t testing capability&#8212;it&#8217;s wasting compute. 
We <a href="https://github.com/pinchbench/skill/issues/51">fixed or pulled</a> those.</p><p><strong>Version tracking was opaque.</strong> We used git commit hashes to identify benchmark versions. Users saw strings like <code>a1b2c3d</code> in the version dropdown and had no idea what changed between them.</p><p><strong>Race conditions in grading.</strong> The benchmark assumed that when the OpenClaw agent subprocess returned, the transcript was complete. <a href="https://github.com/pinchbench/skill/issues/65">It wasn&#8217;t always</a>. Grading could start on a partial transcript, producing silently wrong scores.</p><h2>148 tasks (up from 23)</h2><p>We ran a <a href="https://github.com/pinchbench/skill/issues/52">gap analysis</a> comparing what PinchBench tested against what real OpenClaw users actually do (based on 780+ ClawBytes on kilo.ai/kiloclaw). Then we filled the gaps:</p><ul><li><p><strong>Data analysis</strong> &#8212; CSV tasks covering US cities, iris flowers, temperature, life expectancy, weather stations, GDP, pension funds, Apple stock</p></li><li><p><strong>Meeting &amp; document analysis</strong> &#8212; Government meetings, advisory boards, city council, tech meetings; executive summaries, sentiment analysis, action items, Q&amp;A extraction</p></li><li><p><strong>Log analysis</strong> &#8212; 24 new tasks across NGINX, Apache, SSH, HDFS, MapReduce, and syslog</p></li><li><p><strong>Development &amp; DevOps</strong> &#8212; CI/CD debugging, Kubernetes issues, Dockerfile optimization, multi-file refactoring, test generation, commit message writing, git rescue, shell generation</p></li><li><p><strong>Image &amp; PDF</strong> &#8212; Image identification, PDF to calendar import</p></li><li><p><strong>Research &amp; writing</strong> &#8212; Market research, email drafting, Todoist cleanup, contract/legal analysis</p></li></ul><p>The <a href="https://blog.kilo.ai/p/pinchbench-v2-call-for-contributors">call for contributors</a> we opened in March 
brought in proposals from community members building browser automation tasks, test generation scenarios, and more. 111 commits from the community landed between v1.2.1 and v2.0.0.</p><h2>Parallel judge execution</h2><p>Grading no longer waits for all tasks to finish. The judge now overlaps with task execution, so your benchmark run completes faster. We also switched to <strong>Haiku as the default judge</strong> &#8212; faster grading without sacrificing accuracy &#8212; and added <strong>judge result caching</strong> so re-runs don&#8217;t redundantly re-grade unchanged results.</p><h2>Thinking-level support</h2><p>v2 supports <a href="https://github.com/pinchbench/skill/issues/9">testing across different reasoning/thinking levels</a> and reporting scores for each. A model&#8217;s performance at &#8220;low&#8221; thinking versus &#8220;high&#8221; thinking tells you something different than a single aggregate number. For cost-conscious users choosing between reasoning modes, this data matters.</p><h2>Multi-turn session isolation</h2><p>Tasks can now specify <code>new_session: true</code> for proper multi-turn evaluation. This means benchmarking conversational workflows (where context carries across turns) without session bleed between tasks.</p><h2>Semantic versioning</h2><p>We <a href="https://github.com/pinchbench/skill/issues/62">migrated from git hashes to proper semver</a> across the entire stack. The version now comes from GitHub releases via setuptools-scm for pip installs and a BENCHMARK_VERSION file for everyone else. The leaderboard shows <code>2.0.0</code> instead of <code>7df28f6</code>. All existing git-hash versions got backfilled as <code>1.0.0-beta.N</code> so sorting still works. 
I implemented it all with <a href="https://kilo.ai/gastown">Gastown from Kilo</a>.</p><h2>Leaderboard overhaul</h2><p><strong>Scoring fairness.</strong> Runs are <a href="https://github.com/pinchbench/skill/issues/53">normalized by task count</a> so comprehensive runs are rewarded, not penalized.</p><p><strong>Per-task variance and retry counts.</strong> A model scoring 0.85 with std dev 0.05 is more useful than one scoring 0.90 with std dev 0.25. Consistency scores, first-try success rates, and per-run breakdowns are now surfaced.</p><p><strong>Model landing pages.</strong> Each model gets <a href="https://github.com/pinchbench/leaderboard/issues/35">its own page</a> with all submissions, score trends over time, cost and speed metrics.</p><p><strong>User profiles and contributor recognition.</strong> A <a href="https://github.com/pinchbench/leaderboard/issues/36">contributor leaderboard</a> alongside the model one.</p><p><strong>Better filtering.</strong> Categorized provider filters, search, checkbox selections, and <a href="https://github.com/pinchbench/leaderboard/issues/22">task-level filtering</a> so you can compare models on just the tasks you care about.</p><p><strong>Badges.</strong> <a href="https://github.com/pinchbench/leaderboard/issues/48">Daily, weekly, and monthly recognition</a> for top-performing models, plus <a href="https://github.com/pinchbench/leaderboard/issues/47">overall rank badges</a> for the top 10.</p><h2>Infrastructure improvements</h2><ul><li><p><code>--core</code> flag for running a core task subset when you don&#8217;t need all 148</p></li><li><p><code>--trend</code> and <code>--trend-window</code> flags for post-run analysis</p></li><li><p>Manifest-based task ordering (replaced numbered IDs)</p></li><li><p>CI workflow for manifest linting</p></li><li><p>Better OpenClaw transcript compatibility</p></li><li><p>Axiom observability for detailed logging of benchmark runs</p></li></ul><h2>Breaking changes</h2><ul><li><p>Task IDs now use 
manifest-based ordering instead of numbered prefixes</p></li><li><p>Default judge backend changed from <code>openclaw</code> to <code>api</code></p></li></ul><h2>Get started</h2><p>Visit the <a href="https://github.com/pinchbench/skill">GitHub repo</a> to see the tasks, run the benchmark yourself, and contribute.</p><p>The <a href="https://pinchbench.com">leaderboard</a> is live with the new scoring, filtering, and model pages. If you&#8217;ve been using PinchBench results to pick your model, you now get variance, per-task breakdowns, and cost data alongside raw scores.</p><p>Full changelog: <a href="https://github.com/pinchbench/skill/compare/v1.2.1...v2.0.0">v1.2.1...v2.0.0</a></p>]]></content:encoded></item><item><title><![CDATA[One Company’s Blueprint for Taking a Whole Engineering Org Agentic]]></title><description><![CDATA[This Kilo customer isn't waiting to see how the shift to agentic engineering will play out.]]></description><link>https://blog.kilo.ai/p/one-companys-blueprint-for-taking</link><guid isPermaLink="false">https://blog.kilo.ai/p/one-companys-blueprint-for-taking</guid><dc:creator><![CDATA[Rebecca Dodd]]></dc:creator><pubDate>Fri, 08 May 2026 13:25:56 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!092l!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F079d853c-4e26-46aa-b099-5d15f7db9c85_1200x800.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Two things you don&#8217;t usually associate with moving quickly: legacy codebases and government applications. 
Ida Infront, which builds software for Swedish and Nordic government authorities, is challenging that with an ambitious goal: doubling the company&#8217;s development speed by the end of the year&#8212;and they&#8217;re using Kilo to do it.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!092l!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F079d853c-4e26-46aa-b099-5d15f7db9c85_1200x800.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!092l!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F079d853c-4e26-46aa-b099-5d15f7db9c85_1200x800.png 424w, https://substackcdn.com/image/fetch/$s_!092l!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F079d853c-4e26-46aa-b099-5d15f7db9c85_1200x800.png 848w, https://substackcdn.com/image/fetch/$s_!092l!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F079d853c-4e26-46aa-b099-5d15f7db9c85_1200x800.png 1272w, https://substackcdn.com/image/fetch/$s_!092l!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F079d853c-4e26-46aa-b099-5d15f7db9c85_1200x800.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!092l!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F079d853c-4e26-46aa-b099-5d15f7db9c85_1200x800.png" width="1200" height="800" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/079d853c-4e26-46aa-b099-5d15f7db9c85_1200x800.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:800,&quot;width&quot;:1200,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:351129,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://blog.kilo.ai/i/196557035?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F079d853c-4e26-46aa-b099-5d15f7db9c85_1200x800.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!092l!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F079d853c-4e26-46aa-b099-5d15f7db9c85_1200x800.png 424w, https://substackcdn.com/image/fetch/$s_!092l!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F079d853c-4e26-46aa-b099-5d15f7db9c85_1200x800.png 848w, https://substackcdn.com/image/fetch/$s_!092l!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F079d853c-4e26-46aa-b099-5d15f7db9c85_1200x800.png 1272w, https://substackcdn.com/image/fetch/$s_!092l!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F079d853c-4e26-46aa-b099-5d15f7db9c85_1200x800.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" 
viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Ida Infront builds secure communication systems, case management tools, and digital archiving solutions for customers including national agencies with demanding data handling requirements and long records retention obligations. The company was established in 1984, and the platform underlying everything they build is over 25 years old.</p><p>In these environments, compliance concerns, risk analysis, and legacy infrastructure can hamstring experimentation.</p><p><a href="https://www.linkedin.com/in/magnusgrimsell/">Magnus Grimsell</a>, CTO at Ida Infront, knows that embracing AI is how the company will stay competitive: &#8220;The value of a large, established platform can decrease when new products can be written faster and cheaper,&#8221; he says. 
&#8220;We want to address that risk and take advantage of the opportunity by being early adopters.&#8221;</p><p>So the company has been on a structured AI journey since GPT 3.5 launched. They started with Sourcegraph&#8217;s Cody, but within six months they were already looking beyond coding assistants to the next frontier: agentic engineering.</p><p>After evaluating several tools, including Cline and Roo Code, Ida Infront chose Kilo Code and began rolling it out in January of this year, starting with three forerunner teams and 70 seats across developers and other technical roles.</p><h2>The Rollout Plan Is Just as Critical as the Choice of Tool</h2><p>Magnus has watched peer companies in the industry tell developers they can use whatever AI tools they want, then wait. &#8220;Nothing happens,&#8221; he says. &#8220;There&#8217;s no progression in AI adoption, no next step.&#8221; Random experimentation isn&#8217;t good enough; without standardizing on anything, you won&#8217;t get real results.</p><p>So Ida Infront did the opposite: they selected one tool, committed to it organizationally, and built a structured onboarding process to bring every team through it together.</p><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;e7c66459-8bf2-4213-b697-3e0a4f875020&quot;,&quot;caption&quot;:&quot;I say this to engineering leaders constantly, and I&#8217;ll say it here too: buying AI licenses and handing them out is not an AI strategy.&quot;,&quot;cta&quot;:&quot;Read full story&quot;,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;md&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;You Can&#8217;t Throw Tools at People and Expect Something To Change&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:2073455,&quot;name&quot;:&quot;Emilie Schario&quot;,&quot;bio&quot;:&quot;COO &amp; VP Engineering at Kilo Code. 
Working Mom of 3.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!2cE-!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fe0066d6e-d5ac-4baf-87ac-2fb6eb4f73d1_900x900.jpeg&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:null}],&quot;post_date&quot;:&quot;2026-03-20T15:03:21.781Z&quot;,&quot;cover_image&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b457d42a-9403-4a8c-9101-60734b2ac154_1200x800.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://blog.kilo.ai/p/you-cant-throw-tools-at-people-and&quot;,&quot;section_name&quot;:null,&quot;video_upload_id&quot;:null,&quot;id&quot;:191564089,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:6,&quot;comment_count&quot;:0,&quot;publication_id&quot;:4363009,&quot;publication_name&quot;:&quot;Kilo Blog&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!EOvc!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3a533426-8d50-480b-8659-75d53523270b_720x720.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><p>Each pilot team goes through a series of three workshops, developed in close partnership with the Kilo team:</p><ol><li><p>The first covers agentic engineering fundamentals, with the team mob-programming a real backlog item using Kilo. Before the session, a team ambassador&#8212;an AI champion appointed from each team&#8212;has already prepared the repo with an AGENTS.md file and relevant MCP tools.</p></li><li><p>The second workshop, held 1-2 weeks later, addresses what came up in practice: debugging issues, adding skills to the agent, connecting new MCP servers, refining workflow.</p></li><li><p>The third is non-technical. 
By this point the team has been using Kilo in production for about four weeks. The session walks through the four phases anticipated in Ida Infront&#8217;s AI adoption framework: AI Assistant, AI Agents, Multi-agent, and Software Factory. Teams discuss where they think they are, what they&#8217;re expecting, and what concerns them. There&#8217;s a structured exercise for surfacing fears and hopes.</p></li></ol><p>&#8220;We had contact with the Kilo team throughout the first pilot,&#8221; Magnus says. &#8220;We didn&#8217;t just receive support for the product, but expertise that I&#8217;ve valued as well. It&#8217;s great to have somebody come in and say, &#8216;This is how we do agentic engineering.&#8217;&#8221;</p><p>The staggered rollout lets the company train their future AI trainers: the early adopter teams will then be able to coach others.</p><h2>Early Indicators</h2><p>Magnus describes a full spectrum of reactions to the agentic shift: excitement, skepticism, and a form of grief from developers who feel that the craft they&#8217;ve spent years developing is changing in ways they didn&#8217;t ask for. The final workshop is an opportunity for team members to feel heard and work through any resistance. &#8220;We have people who are on board but feel sad about it,&#8221; he says. &#8220;That&#8217;s a legitimate response. It needed space.&#8221;</p><p>Since the rollout began, the dominant attitude has been positive: teams are eager to get onboarded. Ida Infront&#8217;s developers who had been using Cody heavily were prepared for an incremental improvement when they saw Kilo Code in action for the first time. &#8220;There were some dropped jaws,&#8221; Magnus says. &#8220;With an assistant, you&#8217;re working the same way as before, just a little faster. With agents, the developer role itself starts to change.&#8221;</p><p>That shift is already visible in how Ida Infront is thinking about the future of their teams. 
Specialized agents for performance optimization or domain-specific tasks&#8212;filling skill gaps at the team level rather than requiring escalation to in-house experts&#8212;are on Magnus&#8217;s roadmap. Other companies in Ida Infront&#8217;s parent group are already looking to replicate their model.</p><h2>The Proof Is Already Showing up in Their Products</h2><p>Ida Infront is also building AI into the products they deliver to customers. ARN, the Swedish National Board for Consumer Disputes, introduced an AI-based function developed with Ida Infront in 2025 that automatically classifies incoming case documents like receipts, images, and correspondence&#8212;tasks previously handled manually. <a href="https://www.linkedin.com/posts/the-national-board-for-consumer-disputes_den-20-februari-2026-l%C3%A4mnade-arn-%C3%A5rsredovisningen-activity-7431669805482528768-_6s-?utm_source=share&amp;utm_medium=member_desktop&amp;rcm=ACoAAAYJvqcB7Wql6HEWUsUMLKT0QnMC-S34g9g">ARN&#8217;s summary of 2025</a> noted that AI investment freed staff from manual classification work and contributed to significantly shorter case processing times&#8212;an outcome Ida Infront is working to replicate across more of its customer base.</p><p>There&#8217;s a sense of excitement and possibility at the company. &#8220;I&#8217;ve been through the shift from waterfall to agile. That changed everything&#8212;not just how we wrote code, but how teams were organized, what roles existed, how we thought about delivery,&#8221; Magnus says. &#8220;This is the same kind of shift. It affects everything. 
And I&#8217;d rather be shaping what that looks like than waiting to find out.&#8221;</p>]]></content:encoded></item><item><title><![CDATA[Running Google Ads from the terminal with custom AI skills]]></title><description><![CDATA[How a growth marketer at Kilo runs Google Ads with Kilo CLI]]></description><link>https://blog.kilo.ai/p/running-google-ads-from-the-terminal</link><guid isPermaLink="false">https://blog.kilo.ai/p/running-google-ads-from-the-terminal</guid><dc:creator><![CDATA[Arkadiy]]></dc:creator><pubDate>Thu, 07 May 2026 10:02:58 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/6da7dc8a-7cfe-49e0-8ba5-cad13056e0e0_2350x1836.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>So I&#8217;ve been managing paid search accounts for the last 12 years. In that time, paid search evolved from manual CPC bidding and largely manual campaign management to conversion-based bidding, which took almost all day-to-day decisions out of human hands. And now we&#8217;ve entered the era of AI-driven campaign management. Dozens of campaigns, thousands of keywords, copy changes every week, the usual Monday-morning reporting &#8211; all of it can be done without opening the Google Ads UI or Editor. When I started at Kilo a few months ago, I decided to move the whole operation into <strong><a href="https://kilo.ai/cli">Kilo CLI</a></strong> with a handful of custom skills that I wrote myself.</p><p>Most of this work isn&#8217;t hard if you understand how the ads work and how they are run. It&#8217;s just tedious and error-prone. Renaming 40 ad groups in the Google Ads Editor or building a new campaign from a spec doesn&#8217;t take a week &#8211; it takes maybe an hour of clicking the same buttons in a slightly different order. But during that hour you can easily pin the wrong headline, paste the wrong URL into three ad groups, or forget to attach the sitelinks. 
Prompting an agent to do the same thing takes a few minutes, and the output is consistent by construction. And in some cases (especially in the AI space), agents let you jump on a trend quickly and launch ads within minutes with a couple of prompts.</p><p>This is a writeup of what I built, how it&#8217;s structured, and how I actually use it.</p><h2>Why Kilo CLI and not just one big agent</h2><p>Kilo CLI lets me pick a different model per task. That&#8217;s the whole thing. I&#8217;m not stuck paying Opus prices to rename an ad group. If you manage a dozen accounts for clients, paying top-model prices for everything adds up to a huge bill. And you don&#8217;t want to be waiting for a 5-hour rate limit to reset when you need to finish your tasks before sunset.</p><p>My usual setup:</p><ul><li><p><strong>Planning and strategy</strong>. Opus 4.6 or 4.7, or GPT 5.5 at high thinking. Writing a new campaign spec, arguing about the structure, deciding whether three comparison campaigns should share a landing page. This is where I want the expensive brain.</p></li><li><p><strong>Execution</strong>. Sonnet 4.6, or Kimi K2.6. 
Once the plan is locked, the actual work is &#8220;read the skill, follow the pattern, write the Python, run dry-run, show me the diff.&#8221; That&#8217;s mechanical. Cheaper models do it fine and they do it faster.</p></li></ul><h2>The big picture</h2><p>Here&#8217;s roughly what the whole thing looks like on GitHub:</p><div class="highlighted_code_block" data-attrs="{&quot;language&quot;:&quot;plaintext&quot;,&quot;nodeId&quot;:&quot;553f097b-0057-4dc0-acb5-be73e3575119&quot;}" data-component-name="HighlightedCodeBlockToDOM"><pre class="shiki"><code class="language-plaintext">~/paid-ads/
&#9500;&#9472;&#9472; .kilocode/
&#9474;   &#9500;&#9472;&#9472; rules/                       &#8592; always-on guardrails
&#9474;   &#9474;   &#9500;&#9472;&#9472; safety.md                  never --execute without --dry-run,
&#9474;   &#9474;   &#9474;                              never touch budgets without asking,
&#9474;   &#9474;   &#9474;                              never commit .env, etc.
&#9474;   &#9474;   &#9492;&#9472;&#9472; main.md                    workflow: read ROADMAP, read todo,
&#9474;   &#9474;                                  update CHANGELOG, update skill
&#9474;   &#9492;&#9472;&#9472; skills/                      &#8592; loaded on demand
&#9474;       &#9500;&#9472;&#9472; google-ads-scripts/        the script library skill
&#9474;       &#9500;&#9472;&#9472; pmax-sqr-audit/            PMax search terms audit workflow
&#9474;       &#9492;&#9472;&#9472; weekly-report/             Monday report workflow
&#9474;
&#9500;&#9472;&#9472; scripts/                         &#8592; ~40 Python scripts (the muscle)
&#9474;   &#9500;&#9472;&#9472; export_*.py                    pull data into JSON
&#9474;   &#9500;&#9472;&#9472; create_*.py                    build new campaigns end-to-end
&#9474;   &#9500;&#9472;&#9472; update_*.py                    mutate live entities
&#9474;   &#9500;&#9472;&#9472; fix_*.py                       audit + fix (tracking, URLs, naming)
&#9474;   &#9500;&#9472;&#9472; add_*.py                       append keywords, sitelinks, negatives
&#9474;   &#9500;&#9472;&#9472; audit_*.py                     read-only analysis, no mutations
&#9474;   &#9500;&#9472;&#9472; weekly_report.py               Monday sync generator
&#9474;   &#9492;&#9472;&#9472; sync_campaigns_md.py           regenerate inventory pre-push
&#9474;
&#9500;&#9472;&#9472; reports/                         &#8592; auto-saved, dated (2026-04-20.md, &#8230;)
&#9500;&#9472;&#9472; docs/                            &#8592; landing-page intent maps, SOPs
&#9500;&#9472;&#9472; tasks/
&#9474;   &#9500;&#9472;&#9472; todo.md                        active backlog
&#9474;   &#9492;&#9472;&#9472; specs/                         written before multi-file work
&#9500;&#9472;&#9472; CHANGELOG.md                     &#8592; one line per change, automated
&#9500;&#9472;&#9472; ROADMAP.md                       &#8592; strategic priorities
&#9492;&#9472;&#9472; campaigns.md                     &#8592; regenerated from the API pre-push</code></pre></div><p>Rules are always on. They&#8217;re the &#8220;don&#8217;t shoot yourself in the foot&#8221; layer. Skills load on demand when I&#8217;m doing a specific kind of work. The scripts are where the actual API calls live.</p><h3>A note on &#8220;topic&#8221;</h3><p>One piece of vocabulary that shows up a lot below: <strong>topic</strong>. A topic is a slice of the campaign naming convention that groups related campaigns regardless of geo or channel. My campaign names look like <code>&lt;scope&gt;_&lt;stage&gt;_acq_&lt;topic&gt;_&lt;channel&gt;_&lt;goal&gt;</code>, so a topic might be <code>product-personas</code> (persona landing pages), <code>competitors</code> (competitor targeting), <code>brand</code> (defensive brand search), and so on. One topic usually spans multiple campaigns &#8211; a core campaign for Tier 1 countries, a global campaign for the rest, sometimes a PMax variant on top. When the weekly report is grouped by topic, it rolls up those parallel campaigns into one row so I can see the whole persona effort or the whole competitor effort in one number.</p><h2>The three skills</h2><h3>1. <code>google-ads-scripts</code> &#8211; the library</h3><p>This is the backbone. It&#8217;s basically a README that lists every existing script, what it does, which patterns it demonstrates, and the conventions any <em>new</em> script has to follow. When I ask for something new, the agent reads this first and copies the closest existing pattern instead of inventing its own.</p><p>The conventions that matter:</p><ul><li><p>Every mutating script ships with <code>--dry-run</code> (default) and <code>--execute</code>. Dry-run prints every operation it would perform, with enough detail to eyeball a diff. Nothing hits the API until I pass <code>--execute</code>.</p></li><li><p>New campaigns are always created <strong>PAUSED</strong>. 
Enabling is a deliberate UI step.</p></li><li><p>Every creation script supports <code>REUSE_CAMPAIGN_ID</code> so a half-finished run can resume. Rate limits and transient API errors do happen on large batches &#8211; a 751-ad-group campaign creation will occasionally fail halfway, and restarting from scratch is not an option.</p></li><li><p>Keyword/negative adds always dedupe against what&#8217;s already there. No one wants duplicate-keyword warnings.</p></li></ul><p>Here&#8217;s what a fresh campaign-creation dry-run looks like in practice. Say I want a new comparison campaign:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!51wX!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3539becb-ebc1-42aa-bcc1-6347776b8a0e_2032x1162.png" data-component-name="Image2ToDOM"><div class="image2-inset image2-full-screen"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!51wX!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3539becb-ebc1-42aa-bcc1-6347776b8a0e_2032x1162.png 424w, https://substackcdn.com/image/fetch/$s_!51wX!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3539becb-ebc1-42aa-bcc1-6347776b8a0e_2032x1162.png 848w, https://substackcdn.com/image/fetch/$s_!51wX!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3539becb-ebc1-42aa-bcc1-6347776b8a0e_2032x1162.png 1272w, https://substackcdn.com/image/fetch/$s_!51wX!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3539becb-ebc1-42aa-bcc1-6347776b8a0e_2032x1162.png 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!51wX!,w_5760,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3539becb-ebc1-42aa-bcc1-6347776b8a0e_2032x1162.png" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3539becb-ebc1-42aa-bcc1-6347776b8a0e_2032x1162.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;full&quot;,&quot;height&quot;:833,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:&quot;center&quot;,&quot;offset&quot;:false}" class="sizing-fullscreen" alt="" srcset="https://substackcdn.com/image/fetch/$s_!51wX!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3539becb-ebc1-42aa-bcc1-6347776b8a0e_2032x1162.png 424w, https://substackcdn.com/image/fetch/$s_!51wX!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3539becb-ebc1-42aa-bcc1-6347776b8a0e_2032x1162.png 848w, https://substackcdn.com/image/fetch/$s_!51wX!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3539becb-ebc1-42aa-bcc1-6347776b8a0e_2032x1162.png 1272w, https://substackcdn.com/image/fetch/$s_!51wX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3539becb-ebc1-42aa-bcc1-6347776b8a0e_2032x1162.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset 
pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>I read through it, push back on maybe one headline, run <code>--execute</code>, check the UI, enable manually. The same campaign in the Google Ads Editor is maybe an hour of clicking &#8211; not hard, just slow, and every field is a chance to paste the wrong URL or forget a pin. The agent does the same work in a few minutes and the output is consistent.</p><p>The other example worth showing is the <strong>tracking template audit</strong>. Campaigns drift &#8211; I renamed one, the <code>utm_campaign</code> param in the tracking template now points to a dead name, and the analytics dashboard is quietly attributing spend to a campaign that was renamed six months ago (I prefer to use actual names instead of IDs). 
The kind of thing nobody wants to audit by hand because it means opening every single campaign in the UI and comparing two strings.</p><p>One script walks the whole account:</p><div class="highlighted_code_block" data-attrs="{&quot;language&quot;:&quot;plaintext&quot;,&quot;nodeId&quot;:&quot;cf09a119-bebb-40c6-8f80-e269d182d596&quot;}" data-component-name="HighlightedCodeBlockToDOM"><pre class="shiki"><code class="language-plaintext">$ python scripts/fix_tracking_templates.py --dry-run

[init]  Found 31 campaigns (24 Search, 4 PMax, 3 Display)

&#9472;&#9472;&#9472; Search campaigns &#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;
  brand_search_ww                           OK
  competitors_search_core            &#9888; FIX utm_campaign mismatch
                                            was:  competitors_search_create-instance
                                            want: competitors_search_core
  personas_search_core               &#9888; FIX param order
  category_search_apac               &#9888; FIX {keyword} unsupported in Search
                                            use {_keyword} instead
  ... (21 more)

  Search: 7 need fixes, 17 already OK

&#9472;&#9472;&#9472; Performance Max campaigns &#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;
  pmax_core       &#9888; FIX final_url_suffix &#8211; utm_campaign stale
                  &#9888; FIX tracking_url_template must be {lpurl} not ""
                    (API rejects empty string: STRING_TOO_SHORT)
  pmax_global     &#9888; FIX utm_content references old asset group name
  pmax_br             OK

&#9472;&#9472;&#9472; Summary &#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;&#9472;
  9 fixes, 22 already OK &#8594; 9 mutations
</code></pre></div><p>That &#8220;<code>tracking_url_template</code> must be <code>{lpurl}</code>, not empty string&#8221; line is a real API quirk &#8211; clearing the field with an empty string returns <code>STRING_TOO_SHORT</code>, which is not obvious from the docs. Once a session figures it out, the fix goes into the skill, and the next session never hits the same wall. These little footguns are the whole reason skills compound over time.</p><h3>2. <code>pmax-sqr-audit</code> &#8211; turning the Performance Max search terms report into an action plan</h3><p>PMax is a bit of a black box unless you pull the search terms export and actually read it. Which nobody wants to do, because it&#8217;s a 3,000-row CSV with two decorative header rows that break most parsers.</p><p>This skill codifies the whole workflow. Parse the CSVs, cross-reference against live Search keywords to find cannibalisation, bucket irrelevant terms into categories (video, gaming, trading bots, resume builders, wrong products&#8230;), theme the high-spend zero-trial terms (model-name queries, unknown competitors, brand misspellings), and decide where each thing goes:</p><ul><li><p>Brand misspellings &#8594; brand campaign, and negative in PMax</p></li><li><p>Unknown competitors &#8594; new ad group in the competitors campaign, and negative in PMax</p></li><li><p>Category-generic stuff &#8594; category campaign, and negative in PMax</p></li><li><p>Irrelevant &#8594; just a PMax negative, no routing</p></li></ul><p>There&#8217;s one invariant I&#8217;ve written in bold caps in the skill: <strong>every keyword added to a Search campaign also becomes a PMax EXACT negative.</strong> Otherwise PMax will keep eating the targeted Search campaigns&#8217; traffic at a higher CPC, and you pay twice for the same conversion.</p><p>Rough shape of the output when I run it on a recent export:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" 
href="https://substackcdn.com/image/fetch/$s_!z5td!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F69436ab0-0310-410a-9a90-4813195cacbb_2032x1162.png" data-component-name="Image2ToDOM"><div class="image2-inset image2-full-screen"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!z5td!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F69436ab0-0310-410a-9a90-4813195cacbb_2032x1162.png 424w, https://substackcdn.com/image/fetch/$s_!z5td!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F69436ab0-0310-410a-9a90-4813195cacbb_2032x1162.png 848w, https://substackcdn.com/image/fetch/$s_!z5td!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F69436ab0-0310-410a-9a90-4813195cacbb_2032x1162.png 1272w, https://substackcdn.com/image/fetch/$s_!z5td!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F69436ab0-0310-410a-9a90-4813195cacbb_2032x1162.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!z5td!,w_5760,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F69436ab0-0310-410a-9a90-4813195cacbb_2032x1162.png" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/69436ab0-0310-410a-9a90-4813195cacbb_2032x1162.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;full&quot;,&quot;height&quot;:833,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:&quot;center&quot;,&quot;offset&quot;:false}" class="sizing-fullscreen" alt="" srcset="https://substackcdn.com/image/fetch/$s_!z5td!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F69436ab0-0310-410a-9a90-4813195cacbb_2032x1162.png 424w, https://substackcdn.com/image/fetch/$s_!z5td!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F69436ab0-0310-410a-9a90-4813195cacbb_2032x1162.png 848w, https://substackcdn.com/image/fetch/$s_!z5td!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F69436ab0-0310-410a-9a90-4813195cacbb_2032x1162.png 1272w, https://substackcdn.com/image/fetch/$s_!z5td!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F69436ab0-0310-410a-9a90-4813195cacbb_2032x1162.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" 
xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h3>3. weekly-report &#8211; the Monday sync, mostly automated</h3><p>Every Monday the paid team does a sync where we look at last week&#8217;s numbers. Cost by topic, CPA vs last week, daily trend, top personas, the monetisation funnel. Pulling all of that out of the Google Ads UI, exporting it to a spreadsheet, and reformatting it into the team&#8217;s template is the kind of task that&#8217;s not hard but also not fun &#8211; and easy to mess up a column here or there.</p><p>The report script produces everything that&#8217;s just data, and leaves the commentary to me. That split matters: I don&#8217;t trust any LLM to write the &#8220;why did this spike&#8221; paragraph from numbers alone. But the numbers themselves are a mechanical aggregation, and mechanical work is exactly what scripts are for.</p><p>The skill is mostly about the landmines. 
Stuff like:</p><ul><li><p>Our trial KPI is a <strong>secondary</strong> conversion action, so it only appears in <code>metrics.all_conversions</code>, not <code>metrics.conversions</code>. Miss that and your CPA is off. In caps at the top of the skill.</p></li><li><p>The monetisation section has to query <strong>all</strong> campaigns including non-core ones, because our general brand search campaign is weirdly the second-best revenue producer. If you filter too tight you miss where money actually comes from.</p></li><li><p><code>segments.conversion_action_name</code> can&#8217;t coexist with <code>metrics.cost_micros</code> in the same GAQL query for <code>search_term_view</code>. Two queries, merge in Python.</p></li></ul><p>Here&#8217;s an anonymised chunk of what the output looks like:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!T_4K!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb69da642-757b-4c19-983a-e68cdfb6c5c5_1988x1118.png" data-component-name="Image2ToDOM"><div class="image2-inset image2-full-screen"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!T_4K!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb69da642-757b-4c19-983a-e68cdfb6c5c5_1988x1118.png 424w, https://substackcdn.com/image/fetch/$s_!T_4K!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb69da642-757b-4c19-983a-e68cdfb6c5c5_1988x1118.png 848w, https://substackcdn.com/image/fetch/$s_!T_4K!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb69da642-757b-4c19-983a-e68cdfb6c5c5_1988x1118.png 1272w, 
https://substackcdn.com/image/fetch/$s_!T_4K!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb69da642-757b-4c19-983a-e68cdfb6c5c5_1988x1118.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!T_4K!,w_5760,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb69da642-757b-4c19-983a-e68cdfb6c5c5_1988x1118.png" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b69da642-757b-4c19-983a-e68cdfb6c5c5_1988x1118.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;full&quot;,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:&quot;center&quot;,&quot;offset&quot;:false}" class="sizing-fullscreen" alt="" srcset="https://substackcdn.com/image/fetch/$s_!T_4K!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb69da642-757b-4c19-983a-e68cdfb6c5c5_1988x1118.png 424w, https://substackcdn.com/image/fetch/$s_!T_4K!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb69da642-757b-4c19-983a-e68cdfb6c5c5_1988x1118.png 848w, https://substackcdn.com/image/fetch/$s_!T_4K!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb69da642-757b-4c19-983a-e68cdfb6c5c5_1988x1118.png 1272w, 
https://substackcdn.com/image/fetch/$s_!T_4K!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb69da642-757b-4c19-983a-e68cdfb6c5c5_1988x1118.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The Notes column is auto-generated from rules (&#8221;CPA spike&#8221;, &#8220;Zero trials&#8221;, &#8220;Budget-capped&#8221;, &#8220;New campaign&#8221;). 
Those are <strong>signals, not conclusions</strong> &#8211; my job on Monday morning is to pick the ones that matter and write the real story underneath.</p><p>What the script never writes, and I don&#8217;t let it write: the TL;DR, the monetisation commentary, and the &#8220;what we&#8217;re going to do about it&#8221; section. Numbers auto, judgement manual. Pretending otherwise is how you end up with confident-looking reports that miss the actual story.</p><h2>The self-improvement engine</h2><p>The rules file says: whenever the agent writes a new script, it <strong>must</strong> add an entry to the corresponding skill describing what the script does and which pattern it demonstrates. If it discovers a new API quirk, that goes in the skill too.</p><p>So the skills grow by themselves. Each session teaches the next session something new. The <code>STRING_TOO_SHORT</code> note on <code>tracking_url_template</code>, the secondary-conversion-metric caveat, the PMax channel performance &#8211; none of those were written by me up front. They were all discovered mid-task and then documented as part of the task.</p><p>The other automated piece is action logging. Every change lands in <code>CHANGELOG.md</code> as a one-liner with the why, not just the what. <code>tasks/todo.md</code> gets updated when work finishes or when follow-ups surface.
<code>campaigns.md</code> regenerates from the live API before every git push so the remote always has the current campaign list. None of that is me remembering to do it; it&#8217;s all in the rules.</p><p>The result is that the repo is self-describing. What&#8217;s live, what&#8217;s paused, what&#8217;s planned, what has changed recently, why. Coming back after a week or two away doesn&#8217;t require any mental reconstruction &#8211; the state is on disk.</p><h2>What actually changed</h2><p>The time savings are real. But the bigger wins are elsewhere:</p><ul><li><p><strong>Consistency.</strong> Every campaign gets the same naming, the same RSA structure, the same tracking template, the same geo split. No drift.</p></li><li><p><strong>Reviewability.</strong> Dry-run output is a text diff. Five minutes of reading catches things a UI change would never surface.</p></li><li><p><strong>Recall.</strong> API footguns and account-specific rules stay caught. Nobody has to remember &#8220;did we already fix that&#8221; because the skill remembers.</p></li><li><p><strong>Cost.</strong> Cheaper models handle the bulk of execution. Opus-tier reasoning stays reserved for plans and strategy where it earns its keep.</p></li><li><p><strong>Context.</strong> A new session catches up in about 30 seconds because the skills, the CHANGELOG, and the todo file describe the current state.</p></li></ul><p>Total skill content is maybe 600 lines of markdown. It&#8217;s worth more than all the Python in the repo combined, because the markdown is what makes the Python safe to generate from a prompt in the first place.</p><h2>If you want to build your own</h2><p>A few notes for anyone thinking of going the same direction:</p><ol><li><p>A skill is a <strong>reference document</strong>, not a prompt. Write it for a future session that has forgotten everything. The agent reads it the same way.</p></li><li><p>Encode the conventions you actually enforce.
If half the scripts have dry-run defaults and half don&#8217;t, fix the scripts first, then document it.</p></li><li><p>Make the skill self-maintaining. Put &#8220;every new script MUST be added to the registry below&#8221; at the top, and the maintenance takes care of itself.</p></li><li><p>Safety rails go in the rules file. Never commit <code>.env</code>, never skip dry-run, never touch budgets without asking, no <code>--force</code> push to main. Spell it out.</p></li><li><p>Every API quirk you discover is worth a paragraph. That&#8217;s where the compounding happens.</p></li><li><p>Use cheaper models for execution. Save the expensive ones for planning.</p></li></ol><p>The setup took a few hours &#8211; not a big investment for something that runs continuously.</p>
]]></content:encoded></item><item><title><![CDATA[I Tested ChatGPT Images 2.0 and GPT-5.5 by Building 3 Landing Pages ]]></title><description><![CDATA[Most people I&#8217;ve talked to use AI to build landing pages by starting with a text prompt.]]></description><link>https://blog.kilo.ai/p/tested-chatgpt-images-20-and-gpt-55</link><guid isPermaLink="false">https://blog.kilo.ai/p/tested-chatgpt-images-20-and-gpt-55</guid><dc:creator><![CDATA[Darko]]></dc:creator><pubDate>Thu, 07 May 2026 09:25:33 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!AxQ3!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcdbc1e87-157c-4358-8144-717f808db3f0_1672x941.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Most people I&#8217;ve talked to use AI to build landing pages by starting with a text prompt.</p><p>The workflow usually looks like this:</p><ol><li><p>You submit a prompt to a text-based model.</p></li><li><p>The model returns an HTML page.</p></li><li><p>You go back and forth with the model to make revisions.</p></li></ol><p>After OpenAI <a href="https://openai.com/index/introducing-chatgpt-images-2-0/">released</a> ChatGPT Images 2.0, I&#8217;ve seen more people shift to a different workflow:</p><ol><li><p>You submit a prompt to an image-based model.</p></li><li><p>The model returns an image.</p></li><li><p>You iterate on the image until you get a design you like.</p></li><li><p>You ask the model to convert that image into an HTML file, whether that&#8217;s static HTML, React, or
whatever stack you use.</p></li></ol><p>This approach looked interesting, so I decided to test it.</p><h1><strong>The test</strong></h1><p>I asked GPT-5.5, via ChatGPT, to create 3 different pages for me. I used a separate session for each request.</p><ul><li><p>A landing page for an AI coding tool. The Kilo Code landing page already looks good, but I thought, why not?</p></li><li><p>A landing page for a SaaS tool for lawyers.</p></li><li><p>A website for an e-commerce store selling shoes.</p></li></ul><p><strong>Important:</strong> I kept the prompts generic on purpose. I didn&#8217;t paste a 10-page prompt with detailed specs, because that&#8217;s not how most of us interact with AI, and I wanted to model more realistic scenarios.</p><p>Let&#8217;s dive into the results.</p><h1><strong>A Landing Page for an AI Coding Tool</strong></h1><p><strong>Prompt:</strong> &#8220;Let&#8217;s create a modern landing page image for an AI coding tool.&#8221;</p><p>I entered this prompt twice and got two different designs.
I didn&#8217;t really like the first one, but the second looked decent:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!AxQ3!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcdbc1e87-157c-4358-8144-717f808db3f0_1672x941.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!AxQ3!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcdbc1e87-157c-4358-8144-717f808db3f0_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!AxQ3!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcdbc1e87-157c-4358-8144-717f808db3f0_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!AxQ3!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcdbc1e87-157c-4358-8144-717f808db3f0_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!AxQ3!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcdbc1e87-157c-4358-8144-717f808db3f0_1672x941.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!AxQ3!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcdbc1e87-157c-4358-8144-717f808db3f0_1672x941.png" width="1456" height="819" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/cdbc1e87-157c-4358-8144-717f808db3f0_1672x941.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!AxQ3!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcdbc1e87-157c-4358-8144-717f808db3f0_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!AxQ3!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcdbc1e87-157c-4358-8144-717f808db3f0_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!AxQ3!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcdbc1e87-157c-4358-8144-717f808db3f0_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!AxQ3!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcdbc1e87-157c-4358-8144-717f808db3f0_1672x941.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path 
d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Let&#8217;s see if we can translate this into an HTML page.</p><p><strong>Follow-up prompt:</strong> &#8220;Can we turn this into a single HTML page?&#8221;</p><p>Here&#8217;s a preview of the HTML:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!85k7!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0037f20-8683-4feb-b745-57a32bdba33d_2048x1458.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!85k7!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0037f20-8683-4feb-b745-57a32bdba33d_2048x1458.png 424w, 
https://substackcdn.com/image/fetch/$s_!85k7!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0037f20-8683-4feb-b745-57a32bdba33d_2048x1458.png 848w, https://substackcdn.com/image/fetch/$s_!85k7!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0037f20-8683-4feb-b745-57a32bdba33d_2048x1458.png 1272w, https://substackcdn.com/image/fetch/$s_!85k7!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0037f20-8683-4feb-b745-57a32bdba33d_2048x1458.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!85k7!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0037f20-8683-4feb-b745-57a32bdba33d_2048x1458.png" width="1456" height="1037" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b0037f20-8683-4feb-b745-57a32bdba33d_2048x1458.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1037,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!85k7!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0037f20-8683-4feb-b745-57a32bdba33d_2048x1458.png 424w, 
https://substackcdn.com/image/fetch/$s_!85k7!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0037f20-8683-4feb-b745-57a32bdba33d_2048x1458.png 848w, https://substackcdn.com/image/fetch/$s_!85k7!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0037f20-8683-4feb-b745-57a32bdba33d_2048x1458.png 1272w, https://substackcdn.com/image/fetch/$s_!85k7!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0037f20-8683-4feb-b745-57a32bdba33d_2048x1458.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" 
y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Not bad! The page also looked decent on mobile:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!WS2h!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff63fd04e-7362-4ea1-b73c-4911d4b07833_876x1656.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!WS2h!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff63fd04e-7362-4ea1-b73c-4911d4b07833_876x1656.png 424w, https://substackcdn.com/image/fetch/$s_!WS2h!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff63fd04e-7362-4ea1-b73c-4911d4b07833_876x1656.png 848w, https://substackcdn.com/image/fetch/$s_!WS2h!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff63fd04e-7362-4ea1-b73c-4911d4b07833_876x1656.png 1272w, https://substackcdn.com/image/fetch/$s_!WS2h!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff63fd04e-7362-4ea1-b73c-4911d4b07833_876x1656.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!WS2h!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff63fd04e-7362-4ea1-b73c-4911d4b07833_876x1656.png" width="876" height="1656" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f63fd04e-7362-4ea1-b73c-4911d4b07833_876x1656.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1656,&quot;width&quot;:876,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!WS2h!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff63fd04e-7362-4ea1-b73c-4911d4b07833_876x1656.png 424w, https://substackcdn.com/image/fetch/$s_!WS2h!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff63fd04e-7362-4ea1-b73c-4911d4b07833_876x1656.png 848w, https://substackcdn.com/image/fetch/$s_!WS2h!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff63fd04e-7362-4ea1-b73c-4911d4b07833_876x1656.png 1272w, https://substackcdn.com/image/fetch/$s_!WS2h!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff63fd04e-7362-4ea1-b73c-4911d4b07833_876x1656.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path 
d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>This was impressive. I tried to one-shot everything just to see how the model would behave.</p><p>This is not the most original landing page in the world, but it&#8217;s a good starting point.</p><h1><strong>SaaS Landing Page for Lawyers</strong></h1><p><strong>Prompt:</strong> &#8220;Let&#8217;s create a modern landing page image for a SaaS management software for lawyers&#8221;</p><p>The first result wasn&#8217;t very encouraging:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!JvDo!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8ff4216-696c-4b71-9cfe-ba3c5b8c741e_935x1683.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" 
srcset="https://substackcdn.com/image/fetch/$s_!JvDo!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8ff4216-696c-4b71-9cfe-ba3c5b8c741e_935x1683.png 424w, https://substackcdn.com/image/fetch/$s_!JvDo!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8ff4216-696c-4b71-9cfe-ba3c5b8c741e_935x1683.png 848w, https://substackcdn.com/image/fetch/$s_!JvDo!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8ff4216-696c-4b71-9cfe-ba3c5b8c741e_935x1683.png 1272w, https://substackcdn.com/image/fetch/$s_!JvDo!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8ff4216-696c-4b71-9cfe-ba3c5b8c741e_935x1683.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!JvDo!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8ff4216-696c-4b71-9cfe-ba3c5b8c741e_935x1683.png" width="935" height="1683" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b8ff4216-696c-4b71-9cfe-ba3c5b8c741e_935x1683.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1683,&quot;width&quot;:935,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" 
srcset="https://substackcdn.com/image/fetch/$s_!JvDo!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8ff4216-696c-4b71-9cfe-ba3c5b8c741e_935x1683.png 424w, https://substackcdn.com/image/fetch/$s_!JvDo!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8ff4216-696c-4b71-9cfe-ba3c5b8c741e_935x1683.png 848w, https://substackcdn.com/image/fetch/$s_!JvDo!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8ff4216-696c-4b71-9cfe-ba3c5b8c741e_935x1683.png 1272w, https://substackcdn.com/image/fetch/$s_!JvDo!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8ff4216-696c-4b71-9cfe-ba3c5b8c741e_935x1683.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" 
stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>My theory on why this happened:</strong> I asked for a &#8220;landing page&#8221; (a term associated with modern design) and also mentioned &#8220;lawyers,&#8221; so the end result was a weird mix of modern and more traditional design styles.</p><p>I tried a follow-up prompt to see if the AI could come up with something better.</p><p><strong>Follow-up prompt:</strong> &#8220;I see you&#8217;re trying to mix modern elements from a landing page with traditional elements from a lawyer page. Make the whole page feel like a lawyer&#8217;s website, including the fonts, colors, and overall style.&#8221;</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!H3kd!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb680a358-58d3-4438-a02e-40d36ba9a430_941x1672.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!H3kd!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb680a358-58d3-4438-a02e-40d36ba9a430_941x1672.png 424w, https://substackcdn.com/image/fetch/$s_!H3kd!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb680a358-58d3-4438-a02e-40d36ba9a430_941x1672.png 848w, 
https://substackcdn.com/image/fetch/$s_!H3kd!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb680a358-58d3-4438-a02e-40d36ba9a430_941x1672.png 1272w, https://substackcdn.com/image/fetch/$s_!H3kd!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb680a358-58d3-4438-a02e-40d36ba9a430_941x1672.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!H3kd!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb680a358-58d3-4438-a02e-40d36ba9a430_941x1672.png" width="941" height="1672" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b680a358-58d3-4438-a02e-40d36ba9a430_941x1672.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1672,&quot;width&quot;:941,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!H3kd!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb680a358-58d3-4438-a02e-40d36ba9a430_941x1672.png 424w, https://substackcdn.com/image/fetch/$s_!H3kd!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb680a358-58d3-4438-a02e-40d36ba9a430_941x1672.png 848w, 
https://substackcdn.com/image/fetch/$s_!H3kd!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb680a358-58d3-4438-a02e-40d36ba9a430_941x1672.png 1272w, https://substackcdn.com/image/fetch/$s_!H3kd!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb680a358-58d3-4438-a02e-40d36ba9a430_941x1672.png 1456w" sizes="100vw" loading="lazy"></picture></div></a></figure></div><p>Ugh. 
Still not there.</p><p>My guess is that the AI model has less training data for &#8220;legacy&#8221; or more traditional websites.</p><p><strong>The lesson:</strong> You may need to be more specific when creating websites in categories where less high-quality design data is available. Load skills, ask the AI to analyze websites you like, and have it create a detailed design spec before generating anything.</p><h1><strong>An E-Commerce Website Selling Shoes</strong></h1><p><strong>Prompt:</strong> &#8220;Create a website image for an e-commerce website selling shoes&#8221;</p><p>The result was something I <em>almost</em> liked:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!D3h3!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc21530ed-a9cc-40cd-bc85-6653c804ac0d_1672x941.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!D3h3!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc21530ed-a9cc-40cd-bc85-6653c804ac0d_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!D3h3!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc21530ed-a9cc-40cd-bc85-6653c804ac0d_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!D3h3!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc21530ed-a9cc-40cd-bc85-6653c804ac0d_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!D3h3!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc21530ed-a9cc-40cd-bc85-6653c804ac0d_1672x941.png 1456w" 
sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!D3h3!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc21530ed-a9cc-40cd-bc85-6653c804ac0d_1672x941.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c21530ed-a9cc-40cd-bc85-6653c804ac0d_1672x941.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!D3h3!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc21530ed-a9cc-40cd-bc85-6653c804ac0d_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!D3h3!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc21530ed-a9cc-40cd-bc85-6653c804ac0d_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!D3h3!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc21530ed-a9cc-40cd-bc85-6653c804ac0d_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!D3h3!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc21530ed-a9cc-40cd-bc85-6653c804ac0d_1672x941.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft 
pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>These images look too wide.</p><p><strong>Follow-up prompt:</strong> &#8220;The images look too wide, can you fix that?&#8221;</p><p>This result more closely resembled a modern e-commerce website:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!PoF_!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8643d5eb-65e0-49f3-adf9-007dc3c5f20c_1122x1402.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" 
srcset="https://substackcdn.com/image/fetch/$s_!PoF_!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8643d5eb-65e0-49f3-adf9-007dc3c5f20c_1122x1402.png 424w, https://substackcdn.com/image/fetch/$s_!PoF_!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8643d5eb-65e0-49f3-adf9-007dc3c5f20c_1122x1402.png 848w, https://substackcdn.com/image/fetch/$s_!PoF_!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8643d5eb-65e0-49f3-adf9-007dc3c5f20c_1122x1402.png 1272w, https://substackcdn.com/image/fetch/$s_!PoF_!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8643d5eb-65e0-49f3-adf9-007dc3c5f20c_1122x1402.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!PoF_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8643d5eb-65e0-49f3-adf9-007dc3c5f20c_1122x1402.png" width="1122" height="1402" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8643d5eb-65e0-49f3-adf9-007dc3c5f20c_1122x1402.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1402,&quot;width&quot;:1122,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" 
srcset="https://substackcdn.com/image/fetch/$s_!PoF_!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8643d5eb-65e0-49f3-adf9-007dc3c5f20c_1122x1402.png 424w, https://substackcdn.com/image/fetch/$s_!PoF_!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8643d5eb-65e0-49f3-adf9-007dc3c5f20c_1122x1402.png 848w, https://substackcdn.com/image/fetch/$s_!PoF_!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8643d5eb-65e0-49f3-adf9-007dc3c5f20c_1122x1402.png 1272w, https://substackcdn.com/image/fetch/$s_!PoF_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8643d5eb-65e0-49f3-adf9-007dc3c5f20c_1122x1402.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" 
stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Let&#8217;s try to see if we can convert this design to a webpage. The prompt I used was: &#8220;let&#8217;s convert this to a single HTML page (include all CSS/JS inside that HTML)&#8221;</p><p>The result was close to what I wanted:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!CPhK!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c1fbfd0-b70f-43f7-8424-ea5e88674448_1762x1736.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!CPhK!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c1fbfd0-b70f-43f7-8424-ea5e88674448_1762x1736.png 424w, https://substackcdn.com/image/fetch/$s_!CPhK!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c1fbfd0-b70f-43f7-8424-ea5e88674448_1762x1736.png 848w, https://substackcdn.com/image/fetch/$s_!CPhK!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c1fbfd0-b70f-43f7-8424-ea5e88674448_1762x1736.png 1272w, https://substackcdn.com/image/fetch/$s_!CPhK!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c1fbfd0-b70f-43f7-8424-ea5e88674448_1762x1736.png 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!CPhK!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c1fbfd0-b70f-43f7-8424-ea5e88674448_1762x1736.png" width="1456" height="1435" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1c1fbfd0-b70f-43f7-8424-ea5e88674448_1762x1736.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1435,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!CPhK!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c1fbfd0-b70f-43f7-8424-ea5e88674448_1762x1736.png 424w, https://substackcdn.com/image/fetch/$s_!CPhK!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c1fbfd0-b70f-43f7-8424-ea5e88674448_1762x1736.png 848w, https://substackcdn.com/image/fetch/$s_!CPhK!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c1fbfd0-b70f-43f7-8424-ea5e88674448_1762x1736.png 1272w, https://substackcdn.com/image/fetch/$s_!CPhK!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c1fbfd0-b70f-43f7-8424-ea5e88674448_1762x1736.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset 
pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>I tried a few prompts to get the images to align properly, but none of them worked.</p><p>Then I had an idea:</p><p>What if I take a screenshot of the rendered page and ask the AI to look at it, figure out what&#8217;s wrong, and fix it on its own?</p><p>This actually worked! 
GPT-5.5 figured out the problems:</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!4kcI!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbb5c298-dd75-476e-a12b-fc44a0c16bf6_1262x204.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!4kcI!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbb5c298-dd75-476e-a12b-fc44a0c16bf6_1262x204.png 424w, https://substackcdn.com/image/fetch/$s_!4kcI!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbb5c298-dd75-476e-a12b-fc44a0c16bf6_1262x204.png 848w, https://substackcdn.com/image/fetch/$s_!4kcI!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbb5c298-dd75-476e-a12b-fc44a0c16bf6_1262x204.png 1272w, https://substackcdn.com/image/fetch/$s_!4kcI!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbb5c298-dd75-476e-a12b-fc44a0c16bf6_1262x204.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!4kcI!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbb5c298-dd75-476e-a12b-fc44a0c16bf6_1262x204.png" width="1262" height="204" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/dbb5c298-dd75-476e-a12b-fc44a0c16bf6_1262x204.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:204,&quot;width&quot;:1262,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!4kcI!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbb5c298-dd75-476e-a12b-fc44a0c16bf6_1262x204.png 424w, https://substackcdn.com/image/fetch/$s_!4kcI!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbb5c298-dd75-476e-a12b-fc44a0c16bf6_1262x204.png 848w, https://substackcdn.com/image/fetch/$s_!4kcI!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbb5c298-dd75-476e-a12b-fc44a0c16bf6_1262x204.png 1272w, https://substackcdn.com/image/fetch/$s_!4kcI!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbb5c298-dd75-476e-a12b-fc44a0c16bf6_1262x204.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>And it actually fixed the page:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!5tpR!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b4f63be-7ab3-4073-8f9c-d62f87514dda_1770x1744.png" 
data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!5tpR!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b4f63be-7ab3-4073-8f9c-d62f87514dda_1770x1744.png 424w, https://substackcdn.com/image/fetch/$s_!5tpR!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b4f63be-7ab3-4073-8f9c-d62f87514dda_1770x1744.png 848w, https://substackcdn.com/image/fetch/$s_!5tpR!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b4f63be-7ab3-4073-8f9c-d62f87514dda_1770x1744.png 1272w, https://substackcdn.com/image/fetch/$s_!5tpR!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b4f63be-7ab3-4073-8f9c-d62f87514dda_1770x1744.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!5tpR!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b4f63be-7ab3-4073-8f9c-d62f87514dda_1770x1744.png" width="1456" height="1435" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0b4f63be-7ab3-4073-8f9c-d62f87514dda_1770x1744.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1435,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" 
srcset="https://substackcdn.com/image/fetch/$s_!5tpR!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b4f63be-7ab3-4073-8f9c-d62f87514dda_1770x1744.png 424w, https://substackcdn.com/image/fetch/$s_!5tpR!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b4f63be-7ab3-4073-8f9c-d62f87514dda_1770x1744.png 848w, https://substackcdn.com/image/fetch/$s_!5tpR!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b4f63be-7ab3-4073-8f9c-d62f87514dda_1770x1744.png 1272w, https://substackcdn.com/image/fetch/$s_!5tpR!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b4f63be-7ab3-4073-8f9c-d62f87514dda_1770x1744.png 1456w" sizes="100vw" loading="lazy"></picture></div></a></figure></div>
<p>This is a preview of the actual (fixed) HTML page.</p><p>Now that&#8217;s something that looks better.</p><h1><strong>Using AI to Design Websites Is Not the Same as Using AI to Code</strong></h1><p>We&#8217;ve seen that using AI to design beautiful pages requires a different approach than using AI to generate code. That&#8217;s why we created <a href="https://app.kilo.ai/app-builder">App Builder</a>, for example:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!MS-o!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcd559a39-3558-48b0-aaf2-be6a24bc5f8c_1414x1340.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!MS-o!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcd559a39-3558-48b0-aaf2-be6a24bc5f8c_1414x1340.png 424w, https://substackcdn.com/image/fetch/$s_!MS-o!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcd559a39-3558-48b0-aaf2-be6a24bc5f8c_1414x1340.png 848w, https://substackcdn.com/image/fetch/$s_!MS-o!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcd559a39-3558-48b0-aaf2-be6a24bc5f8c_1414x1340.png 1272w, 
https://substackcdn.com/image/fetch/$s_!MS-o!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcd559a39-3558-48b0-aaf2-be6a24bc5f8c_1414x1340.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!MS-o!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcd559a39-3558-48b0-aaf2-be6a24bc5f8c_1414x1340.png" width="1414" height="1340" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/cd559a39-3558-48b0-aaf2-be6a24bc5f8c_1414x1340.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1340,&quot;width&quot;:1414,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!MS-o!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcd559a39-3558-48b0-aaf2-be6a24bc5f8c_1414x1340.png 424w, https://substackcdn.com/image/fetch/$s_!MS-o!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcd559a39-3558-48b0-aaf2-be6a24bc5f8c_1414x1340.png 848w, https://substackcdn.com/image/fetch/$s_!MS-o!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcd559a39-3558-48b0-aaf2-be6a24bc5f8c_1414x1340.png 1272w, 
https://substackcdn.com/image/fetch/$s_!MS-o!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcd559a39-3558-48b0-aaf2-be6a24bc5f8c_1414x1340.png 1456w" sizes="100vw" loading="lazy"></picture></div></a></figure></div><p>Anthropic recently released Claude Design, and OpenAI released ChatGPT Images 2.0. Both are changing the workflow for how design gets made.</p><p>We&#8217;re excited about the future of AI + design and can&#8217;t wait to see what it unlocks and what kinds of products we can build around it.</p>]]></content:encoded></item><item><title><![CDATA[Hermes vs. 
OpenClaw - When to Reach for Which Agent]]></title><description><![CDATA[Two open-source agent frameworks with overlapping features but fundamentally different philosophies]]></description><link>https://blog.kilo.ai/p/hermes-vs-openclaw-when-to-reach</link><guid isPermaLink="false">https://blog.kilo.ai/p/hermes-vs-openclaw-when-to-reach</guid><dc:creator><![CDATA[Brendan O'Leary]]></dc:creator><pubDate>Wed, 06 May 2026 19:52:28 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!F4IO!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc1771abf-88e5-4821-839f-8ebde0e7ab15_1376x768.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Last week, someone in the <a href="https://kilo.ai/discord">Kilo Discord</a> asked: &#8220;Should I switch from OpenClaw to Hermes?&#8221; I&#8217;ve seen this question pop up a dozen times since Hermes launched in February. It&#8217;s the right question to ask &#8212; both are open source, both connect to your chat apps, both run tools and remember things. 
On paper, they look almost identical.</p><p>But after running both for the past two months, I think the feature checklists are a distraction &#8212; the design philosophies are where they actually diverge.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!F4IO!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc1771abf-88e5-4821-839f-8ebde0e7ab15_1376x768.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!F4IO!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc1771abf-88e5-4821-839f-8ebde0e7ab15_1376x768.jpeg 424w, https://substackcdn.com/image/fetch/$s_!F4IO!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc1771abf-88e5-4821-839f-8ebde0e7ab15_1376x768.jpeg 848w, https://substackcdn.com/image/fetch/$s_!F4IO!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc1771abf-88e5-4821-839f-8ebde0e7ab15_1376x768.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!F4IO!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc1771abf-88e5-4821-839f-8ebde0e7ab15_1376x768.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!F4IO!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc1771abf-88e5-4821-839f-8ebde0e7ab15_1376x768.jpeg" width="1376" height="768" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c1771abf-88e5-4821-839f-8ebde0e7ab15_1376x768.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:768,&quot;width&quot;:1376,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:159968,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://blog.kilo.ai/i/196673460?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc1771abf-88e5-4821-839f-8ebde0e7ab15_1376x768.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!F4IO!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc1771abf-88e5-4821-839f-8ebde0e7ab15_1376x768.jpeg 424w, https://substackcdn.com/image/fetch/$s_!F4IO!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc1771abf-88e5-4821-839f-8ebde0e7ab15_1376x768.jpeg 848w, https://substackcdn.com/image/fetch/$s_!F4IO!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc1771abf-88e5-4821-839f-8ebde0e7ab15_1376x768.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!F4IO!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc1771abf-88e5-4821-839f-8ebde0e7ab15_1376x768.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture></div></a></figure></div>
<h2>The One-Sentence Difference</h2><p><strong>Hermes</strong> packages a gateway around a learning agent.<br><strong>OpenClaw</strong> packages an agent around a messaging gateway.</p><p>That distinction sounds abstract, but it has practical consequences for how you configure and interact with each tool.</p><h2>What Hermes Gets Right</h2><p><a href="https://hermes-agent.nousresearch.com/">Hermes Agent</a> comes from Nous Research and launched in February 2026. It&#8217;s hit about 135,000 GitHub stars as of this writing. 
The headline feature is what they call a &#8220;learning loop&#8221; &#8212; the agent creates and evolves its own skills based on what it does.</p><p>From their <a href="https://hermes-agent.nousresearch.com/docs/user-guide/features/overview">feature docs</a>:</p><ul><li><p><strong>Self-improving skills</strong>: The agent generates procedural knowledge from experience. Run the same task type a hundred times, and Hermes actually gets better at it.</p></li><li><p><strong>Five sandbox backends</strong>: Local execution, Docker, SSH, Singularity, and Modal. You pick how isolated you want command execution to be.</p></li><li><p><strong>Subagent delegation</strong>: Spawn child agents with isolated contexts and terminals. Parallel workstreams without context pollution.</p></li><li><p><strong>Broader browser/voice stack</strong>: Browserbase, Browser Use, Firecrawl, local Chrome, plus native voice in Discord channels.</p></li></ul><p>The Hermes <a href="https://blakecrosley.com/guides/hermes">documentation</a> is worth reading even if you don&#8217;t use it &#8212; the provider matrix alone covers 19+ providers with detailed auth flows.</p><p>What impressed me most was the checkpoint system. Before Hermes touches files, it snapshots your working directory. Run <code>/rollback</code> if something goes wrong. I&#8217;ve used this more times than I&#8217;d like to admit.</p><h2>What OpenClaw Gets Right</h2><p><a href="https://openclaw.ai/">OpenClaw</a> has been around longer and has the larger community &#8212; roughly 369,000 GitHub stars and 13,700+ community-built skills. It started as a personal assistant project by <a href="https://twitter.com/steipete">Peter Steinberger</a> and grew into something much bigger.</p><p>OpenClaw is fundamentally a <strong>gateway</strong>. 
The <a href="https://docs.openclaw.ai">docs</a> are explicit: &#8220;The Gateway is the single source of truth for sessions, routing, and channel connections.&#8221;</p><p>What that means in practice:</p><ul><li><p><strong>Channel breadth</strong>: Discord, Google Chat, iMessage, Matrix, Microsoft Teams, Signal, Slack, Telegram, WhatsApp, Zalo, and WebChat. One Gateway process handles all of them.</p></li><li><p><strong>Multi-agent routing</strong>: Isolated sessions per agent, workspace, or sender. You can run different agents for different purposes through the same gateway.</p></li><li><p><strong>Mobile nodes</strong>: iOS and Android apps that pair with the gateway for camera, canvas, and device actions.</p></li><li><p><strong>Massive skill ecosystem</strong>: 13,700+ community skills covering everything from email to calendar to flight check-ins.</p></li></ul><p>The architecture assumes you want one always-on process that routes messages to agents. That&#8217;s different from Hermes&#8217;s model of &#8220;here&#8217;s an agent runtime that can talk to various platforms.&#8221;</p><h2>Known Pitfalls</h2><p>Both tools have well-documented failure modes that the communities are vocal about. They are worth knowing before you commit.</p><p><strong>Hermes:</strong></p><ul><li><p><strong>Self-evaluation almost always passes.</strong> Hermes evaluates its own work to decide if a task succeeded. The problem: it almost always thinks it did well, even when it didn&#8217;t. This means the skills it auto-generates from &#8220;successful&#8221; tasks can encode errors. You need external validation for anything important.</p></li><li><p><strong>Self-learning overwrites manual edits.</strong> The same system that auto-generates skills can also overwrite your customizations. If you&#8217;ve spent time tuning a skill for a specific workflow, the agent may &#8220;self-improve&#8221; it back into something generic. 
Power users find this maddening.</p></li><li><p><strong>Maturity gap.</strong> With only 11 releases compared to OpenClaw&#8217;s 137, Hermes simply hasn&#8217;t been tested at the same scale. Fewer updates means fewer chances to break things &#8212; but that&#8217;s not the same as proven stability.</p></li></ul><p><strong>OpenClaw:</strong></p><ul><li><p><strong>Updates break things.</strong> This is the most consistent complaint in the community. Users report roughly a 25% chance that any given update will break response delivery, cron jobs, or webhooks. The development process lacks the staging/testing discipline you&#8217;d expect.</p></li><li><p><strong>Memory is unreliable.</strong> Agents forget instructions, cross-contaminate data between projects, and repeat mistakes. Memory retention issues are the #1 driver of user churn.</p></li><li><p><strong>Self-hosting is the real barrier.</strong> Docker setup, SSH configuration, YAML files, security hardening, 24/7 uptime &#8212; users consistently report spending more time on infrastructure than on their actual agent workflows.</p></li></ul><h2>Trade-offs</h2><p>A <a href="https://screenshotone.com/blog/hermes-agent-versus-openclaw/">comparison on ScreenshotOne</a> put it well: Hermes is &#8220;agent-first&#8221; while OpenClaw is &#8220;gateway-first.&#8221;</p><p><strong>Hermes</strong> optimizes for the agent becoming more capable over time. It&#8217;s built for people who want autonomous agents that learn from experience.</p><p><strong>OpenClaw</strong> optimizes for a persistent assistant you can message from anywhere. It&#8217;s built for people who want infrastructure they can talk to.</p><p>Neither approach is wrong. 
But they lead to different outcomes:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!zu9T!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0e6b99db-a39b-4631-88af-9a156041e269_829x387.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!zu9T!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0e6b99db-a39b-4631-88af-9a156041e269_829x387.png 424w, https://substackcdn.com/image/fetch/$s_!zu9T!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0e6b99db-a39b-4631-88af-9a156041e269_829x387.png 848w, https://substackcdn.com/image/fetch/$s_!zu9T!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0e6b99db-a39b-4631-88af-9a156041e269_829x387.png 1272w, https://substackcdn.com/image/fetch/$s_!zu9T!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0e6b99db-a39b-4631-88af-9a156041e269_829x387.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!zu9T!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0e6b99db-a39b-4631-88af-9a156041e269_829x387.png" width="829" height="387" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0e6b99db-a39b-4631-88af-9a156041e269_829x387.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:387,&quot;width&quot;:829,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:80217,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://blog.kilo.ai/i/196673460?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0e6b99db-a39b-4631-88af-9a156041e269_829x387.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!zu9T!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0e6b99db-a39b-4631-88af-9a156041e269_829x387.png 424w, https://substackcdn.com/image/fetch/$s_!zu9T!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0e6b99db-a39b-4631-88af-9a156041e269_829x387.png 848w, https://substackcdn.com/image/fetch/$s_!zu9T!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0e6b99db-a39b-4631-88af-9a156041e269_829x387.png 1272w, https://substackcdn.com/image/fetch/$s_!zu9T!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0e6b99db-a39b-4631-88af-9a156041e269_829x387.png 1456w" sizes="100vw" loading="lazy"></picture></div></a></figure></div>
<h2>Security Considerations</h2><p>This matters more than people think. A <a href="https://www.reddit.com/r/selfhosted/comments/1r9yrw1/if_youre_selfhosting_openclaw_heres_every/">Reddit thread</a> documented OpenClaw&#8217;s 2026 security incidents: 6 CVEs, 341+ malicious skills identified in the community repository, 135,000+ exposed instances found by Shodan.</p><p>OpenClaw grew fast. Some security assumptions that made sense for a personal tool on a laptop became dangerous when people started running it on public VPSes with open ports.</p><p>Hermes, being newer, has <a href="https://medium.com/@sathishkraju/i-switched-from-openclaw-to-hermes-agent-heres-what-nobody-told-me-5f33a746b6ca">zero reported agent-specific CVEs</a> as of April 2026. That&#8217;s not because it&#8217;s inherently more secure &#8212; it just hasn&#8217;t had the same scale of exposure. 
Give it time.</p><p>Both projects now have sandboxing options and approval flows. But if you&#8217;re deploying either on a server, audit the defaults. Neither assumes you&#8217;re running on a hardened production box.</p><h2>When to Pick Hermes</h2><p>Hermes is the better choice if:</p><ul><li><p>You want an agent that improves at tasks over time</p></li><li><p>You need multiple sandbox backends (especially Modal for cloud execution)</p></li><li><p>You&#8217;re doing research-style workflows with subagent delegation</p></li><li><p>You want tight IDE integration via ACP</p></li><li><p>You&#8217;re willing to trade ecosystem size for a more capable core agent</p></li></ul><p>The learning loop is what justifies choosing Hermes over OpenClaw. If you&#8217;re running the same types of tasks repeatedly &#8212; data analysis, code review, research synthesis &#8212; Hermes will genuinely get better at them.</p><h2>When to Pick OpenClaw</h2><p>OpenClaw is the better choice if:</p><ul><li><p>You want to message your assistant from everywhere (24+ platforms)</p></li><li><p>You need the existing skill ecosystem (13,700+ skills)</p></li><li><p>You want mobile nodes for phone camera/canvas integration</p></li><li><p>You&#8217;re building team infrastructure, not just a personal agent</p></li><li><p>You value stability over cutting-edge features</p></li></ul><p>If your primary use case is &#8220;I want to message my AI from WhatsApp and have it do things on my computer,&#8221; OpenClaw has that nailed.</p><h2>The Cost Problem</h2><p>This doesn&#8217;t get discussed enough. Running either agent autonomously is expensive if you&#8217;re not careful. Every message sends the full conversation history to the API, so costs compound within a session.</p><p>Users in the community report anywhere from $1-3/day on budget models to $130+/day on Claude Opus for heavy agentic use. 
The fix is to reset sessions aggressively and pick a model that fits each task tier:</p><ul><li><p><strong>Quality-sensitive work</strong>: Claude Opus 4.6 (expensive, best agentic performance)</p></li><li><p><strong>Daily driver</strong>: GPT 5.4 (thinking mode on medium+) or MiniMax M2.7</p></li><li><p><strong>Budget automation</strong>: Qwen 3.5/3.6 (free on OpenRouter), GLM-5.1, Kimi K2.5</p></li></ul><p>Flat-rate subscriptions (MiniMax at $10-20/month, Ollama Pro Cloud at $20/month) are rapidly replacing per-token billing as the community default.</p><h2>What I Actually Use</h2><p>I run both &#8212; and the community data confirms this is a growing pattern. The specific architecture that works: <strong>OpenClaw as orchestrator</strong> (planning, decomposition, multi-step coordination, scheduling) and <strong>Hermes as execution specialist</strong> (fast, repeatable task loops). They communicate via ACP.</p><p>OpenClaw handles my day-to-day messaging &#8212; it&#8217;s the interface I talk to from Telegram. I&#8217;ve been using it for months, and the skill ecosystem covers most of what I need.</p><p>Hermes runs on research tasks where I want the learning loop. When I&#8217;m doing a series of similar analyses, Hermes&#8217;s skill evolution actually matters.</p><p>I could probably consolidate &#8212; Hermes&#8217;s docs actually note that it&#8217;s the &#8220;successor to OpenClaw&#8221; and they have a migration command (<code>hermes claw migrate</code>) &#8212; but I haven&#8217;t felt the urgency. They solve different problems well.</p><h2>Summary</h2><p>Both projects are actively developed. Both have real communities. Both work.</p><p>Hermes is younger, more ambitious architecturally, and smaller in ecosystem. 
OpenClaw is more mature, broader in integrations, and has had more security scrutiny (for better and worse).</p><p>The 30% of developers who <a href="https://www.kucoin.com/blog/hermes-agent-vs-openclaw-which-open-source-ai-agent-wins-in-2026">switched from OpenClaw to Hermes</a> cite &#8220;maintenance fatigue&#8221; from debugging community skills and wanting the learning loop. The 35% who stayed on OpenClaw cite integrations and ecosystem breadth.</p><p>Pick based on what you actually need. If you want a persistent assistant you can message, OpenClaw. If you want an agent that improves itself, Hermes.</p><p>Or run both &#8212; they&#8217;re free, and the resource overhead of a second process is negligible.</p><div><hr></div><p><strong>Links:</strong></p><ul><li><p><a href="https://hermes-agent.nousresearch.com/">Hermes Agent</a> &#8212; official site</p></li><li><p><a href="https://hermes-agent.nousresearch.com/docs/user-guide/features/overview">Hermes docs</a></p></li><li><p><a href="https://openclaw.ai/">OpenClaw</a> &#8212; official site</p></li><li><p><a href="https://docs.openclaw.ai">OpenClaw docs</a></p></li><li><p><a href="https://screenshotone.com/blog/hermes-agent-versus-openclaw/">Detailed comparison on ScreenshotOne</a></p></li></ul>]]></content:encoded></item><item><title><![CDATA[Design Systems for Agentic Engineering]]></title><description><![CDATA[What happens when you&#8217;re the first designer at an org where everyone &#8212; humans and AIs &#8212; ships like a mini-CEO]]></description><link>https://blog.kilo.ai/p/design-systems-for-agentic-engineering</link><guid isPermaLink="false">https://blog.kilo.ai/p/design-systems-for-agentic-engineering</guid><dc:creator><![CDATA[Brendan O'Leary]]></dc:creator><pubDate>Wed, 06 May 2026 15:30:35 GMT</pubDate><enclosure 
url="https://substackcdn.com/image/fetch/$s_!cEF4!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e211e9f-0414-4350-85dd-fed042a27248_1376x768.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Ivan joined Kilo a few weeks ago as our first designer. Not &#8220;first designer to lead a design team&#8221; &#8212; first designer, period. Before him, there was no design function. Features shipped anyway. PRs landed all day. The org moved at what he&#8217;s started calling &#8220;agent speed.&#8221;</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!cEF4!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e211e9f-0414-4350-85dd-fed042a27248_1376x768.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!cEF4!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e211e9f-0414-4350-85dd-fed042a27248_1376x768.jpeg 424w, https://substackcdn.com/image/fetch/$s_!cEF4!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e211e9f-0414-4350-85dd-fed042a27248_1376x768.jpeg 848w, https://substackcdn.com/image/fetch/$s_!cEF4!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e211e9f-0414-4350-85dd-fed042a27248_1376x768.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!cEF4!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e211e9f-0414-4350-85dd-fed042a27248_1376x768.jpeg 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!cEF4!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e211e9f-0414-4350-85dd-fed042a27248_1376x768.jpeg" width="1376" height="768" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5e211e9f-0414-4350-85dd-fed042a27248_1376x768.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:768,&quot;width&quot;:1376,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:110875,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://blog.kilo.ai/i/196668447?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e211e9f-0414-4350-85dd-fed042a27248_1376x768.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!cEF4!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e211e9f-0414-4350-85dd-fed042a27248_1376x768.jpeg 424w, https://substackcdn.com/image/fetch/$s_!cEF4!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e211e9f-0414-4350-85dd-fed042a27248_1376x768.jpeg 848w, https://substackcdn.com/image/fetch/$s_!cEF4!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e211e9f-0414-4350-85dd-fed042a27248_1376x768.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!cEF4!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e211e9f-0414-4350-85dd-fed042a27248_1376x768.jpeg 1456w" 
sizes="100vw" fetchpriority="high"></picture></div></a></figure></div><p>Every engineer here operates like a mini-CEO. They don&#8217;t wait for specifications. They don&#8217;t wait for mockups. They see a problem, they ship a solution. Sometimes multiple times a day. The velocity is genuinely impressive &#8212; and terrifying if your job is supposed to be designing things before they get built.</p><p>So he had a choice. 
Two ways to lose this game:</p><p><strong>Option A: Pixel janitor.</strong> Accept that design happens before he knows about it, spend all his time cleaning up after the firehose, perpetually one sprint behind.</p><p><strong>Option B: Beautiful irrelevance.</strong> Create gorgeous Figma specs that are technically perfect and aesthetically flawless. By the time anyone looks at them, the feature shipped Tuesday and we&#8217;ve moved on.</p><p>Neither of those is a real job. So he&#8217;s been thinking about what a design function actually looks like when your colleagues include AI agents that ship production code.</p><h2>The Traditional Design System Assumption</h2><p>Most design systems are built on an assumption that breaks here: <em>a designer will interpret the rules.</em></p><p>A traditional design system says &#8220;use 4px spacing multiples&#8221; and &#8220;primary buttons are blue.&#8221; It assumes a human designer will understand when to break those rules, when to apply judgment, when the context demands something different. The system is a reference document. The designer is the interpreter.</p><p>That assumption doesn&#8217;t work when half the code getting written comes from AI agents, and the human engineers are moving too fast to check a Figma file.</p><p>The system has to be directly usable by the people &#8212; and the agents &#8212; actually shipping. Not just readable by designers who might interpret it later.</p><h2>Docs as Infrastructure</h2><p>In practice, documentation becomes infrastructure.</p><p>The &#8220;brand vibes&#8221; doc nobody reads becomes useful when an agent can ingest it &#8212; and if it&#8217;s written clearly enough, actually <em>act on it.</em></p><p>DESIGN.md becomes as important as README.md. Not because designers will reference it, but because it&#8217;s the source of truth that agents and fast-moving engineers will consume without asking questions.</p><p>This changes what design documentation looks like. 
It can&#8217;t be &#8220;capture the ineffable feeling of the brand.&#8221; It has to be specific, opinionated, and machine-actionable.</p><h2>Three Layers</h2><p>Ivan&#8217;s been sketching out a rough timeline:</p><h3>Short Term: Stabilize</h3><p>Not glamorous, but necessary.</p><p>Until now, there&#8217;s been a lot of dev-led design. That&#8217;s not a criticism &#8212; it&#8217;s what happens when you don&#8217;t have a designer and you ship constantly. But it means inconsistencies have crept in. Button styles that differ across surfaces. Spacing that varies by who wrote the component. Color usage that drifted from whatever the original intent was.</p><p>The first job is to audit, find the obvious drift, and get everything to a shared floor. Document what actually exists, not what we wish existed. Create a baseline.</p><h3>Medium Term: The Fun Part</h3><p><strong>DESIGN.md</strong> &#8212; a markdown file that holds the brand DNA. Written for agents as much as for humans. Not &#8220;we value simplicity&#8221; but &#8220;form labels are sentence case, never title case&#8221; and &#8220;error states always include a suggested action, not just a description of what went wrong.&#8221;</p><p><strong>Custom skills</strong> &#8212; the kind of thing Kilo uses internally. Skills that encode &#8220;the Kilo way&#8221; so that when a dev or an agent reaches for a UI primitive, they reach for the right one by default. The system isn&#8217;t just documented; it&#8217;s embedded in the workflow.</p><p><strong>Maybe a drift linter</strong> &#8212; essentially a component inventory CLI that can flag when something doesn&#8217;t match the system. Like how a code linter catches style violations, but for design consistency.</p><h3>Long Term: Kilo Uses Kilo</h3><p>Eventually, agents become part of the consistency layer.</p><p>Imagine a design reviewer that flags drift in PRs the same way linters flag code smells. 
A copy checker that catches when button text doesn&#8217;t match our voice guidelines. A brand reviewer that notices when we&#8217;ve wandered off palette.</p><p>The goal is to set the policy, build the reviewers, then review the reviewers &#8212; not to personally inspect every pixel indefinitely.</p><h2>Open Questions</h2><p>He&#8217;s still chewing on several things:</p><p><strong>How much process can a velocity culture absorb?</strong> Add too much structure and you kill the thing that makes this org effective. Add too little and you get permanent chaos. There&#8217;s a line somewhere.</p><p><strong>How do you write DESIGN.md so it actually takes positions?</strong> It&#8217;s easy to end up with generic guidance that sounds good but flattens everything into mush. &#8220;Be consistent&#8221; isn&#8217;t useful. &#8220;Modals never contain more than one primary action&#8221; is.</p><p><strong>What level should a skill live at?</strong> Component level? (&#8220;Use this card component.&#8221;) Flow level? (&#8220;Confirmation dialogs follow this pattern.&#8221;) Decision level? (&#8220;When in doubt, fewer steps beats more clarity.&#8221;) Taste level? (Is that even possible?)</p><p><strong>How do you measure drift without crying wolf?</strong> A system that flags everything is useless. A system that misses real problems is also useless. Calibration matters.</p><h2>What&#8217;s Next</h2><p>Right now he&#8217;s deep in the audit phase &#8212; documenting what exists, finding the patterns and the anti-patterns, building the baseline. Tedious work, but necessary before anything else makes sense.</p><p>The medium-term work is what he&#8217;s most excited about. DESIGN.md as real infrastructure. Skills that encode taste. A system that doesn&#8217;t require him in the loop for every decision.</p><p>He&#8217;ll write more as this progresses. 
If you&#8217;re solving similar problems &#8212; design in a high-velocity, agent-heavy environment &#8212; we&#8217;d love to hear what you&#8217;ve figured out. Find us in <a href="https://kilo.dev/discord">Discord</a>.</p>]]></content:encoded></item><item><title><![CDATA[Mistral Medium 3.5 is Live in Kilo Code]]></title><description><![CDATA[The OSS lab's powerful new blended model is surprisingly affordable]]></description><link>https://blog.kilo.ai/p/mistral-medium-35-is-live-in-kilo</link><guid isPermaLink="false">https://blog.kilo.ai/p/mistral-medium-35-is-live-in-kilo</guid><dc:creator><![CDATA[Ari]]></dc:creator><pubDate>Tue, 05 May 2026 23:01:38 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!fbCV!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01d3f75e-0088-4bb2-aeba-c9b157f85ba3_2048x1213.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>We&#8217;re thrilled to announce that the public preview version of <a href="https://mistral.ai/news/vibe-remote-agents-mistral-medium-3-5">Mistral Medium 3.5</a> is now live in Kilo. This is Mistral&#8217;s first <em>blended</em> model (it merges instruction-following, reasoning, and coding into a single 128B dense model) and it puts the lab instantly back on the OSS map.</p><p>If it&#8217;s seemed quiet on the Mistral front for a while, that&#8217;s because they&#8217;ve been heads-down building. This new model is a major leap for the lab, and the focus on agentic work&#8212;coding and agentic engineering&#8212;benefits all of us.</p><p>Mistral&#8217;s <a href="https://kilo.ai/models/mistral-medium-3-5">new flagship</a> is a dense 128B model with a 256k context window, built from the ground up for long-horizon agentic work. It merges instruction-following, reasoning, and coding into a single set of weights, with configurable reasoning effort so you can dial it up for a gnarly refactor or keep it light for a quick edit. 
It scores 77.6% on SWE-Bench Verified, putting it ahead of Devstral 2 and models like Qwen3.5 397B A17B. The vision encoder was trained from scratch to handle variable image sizes, and the whole thing can run self-hosted on as few as four GPUs. </p><p>And Mistral is sticking to their OSS principles: the new model shipped with <a href="https://huggingface.co/mistralai/Mistral-Medium-3.5-128B">open weights</a> under a modified MIT license.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!fbCV!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01d3f75e-0088-4bb2-aeba-c9b157f85ba3_2048x1213.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!fbCV!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01d3f75e-0088-4bb2-aeba-c9b157f85ba3_2048x1213.png 424w, https://substackcdn.com/image/fetch/$s_!fbCV!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01d3f75e-0088-4bb2-aeba-c9b157f85ba3_2048x1213.png 848w, https://substackcdn.com/image/fetch/$s_!fbCV!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01d3f75e-0088-4bb2-aeba-c9b157f85ba3_2048x1213.png 1272w, https://substackcdn.com/image/fetch/$s_!fbCV!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01d3f75e-0088-4bb2-aeba-c9b157f85ba3_2048x1213.png 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!fbCV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01d3f75e-0088-4bb2-aeba-c9b157f85ba3_2048x1213.png" width="1456" height="862" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/01d3f75e-0088-4bb2-aeba-c9b157f85ba3_2048x1213.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:862,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!fbCV!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01d3f75e-0088-4bb2-aeba-c9b157f85ba3_2048x1213.png 424w, https://substackcdn.com/image/fetch/$s_!fbCV!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01d3f75e-0088-4bb2-aeba-c9b157f85ba3_2048x1213.png 848w, https://substackcdn.com/image/fetch/$s_!fbCV!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01d3f75e-0088-4bb2-aeba-c9b157f85ba3_2048x1213.png 1272w, https://substackcdn.com/image/fetch/$s_!fbCV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01d3f75e-0088-4bb2-aeba-c9b157f85ba3_2048x1213.png 1456w" sizes="100vw" fetchpriority="high"></picture></div></a></figure></div><p>This is a serious new model for serious engineering tasks, and Mistral users will find that it&#8217;s now the default for the Mistral Vibe CLI and Le Chat. And with Kilo, anybody can use the model among hundreds of other top models and always find the right tools for the job.</p><h2><strong>Use Mistral Medium 3.5 Everywhere You Use Kilo</strong></h2><p>The new model is available in the Kilo Gateway, so you can use it everywhere with a single login.</p><p><strong>VS Code Extension</strong></p><p>The upgraded Kilo Code VS Code extension now surfaces Mistral Medium 3.5 in the model switcher. 
Pick it for any task where you want a model that can hold a lot of context, reason through complexity, and produce structured output your codebase can actually consume.</p><p><strong>Kilo Code CLI</strong></p><p>Running Kilo from the terminal? Mistral Medium 3.5 is available there too. It&#8217;s a strong choice for longer CLI sessions &#8212; dependency upgrades, test generation, CI investigations &#8212; where you want the model working steadily without losing the thread.</p><p><strong>Cloud Agents</strong></p><p>Kilo Code&#8217;s cloud agent infrastructure is where Mistral Medium 3.5 really opens up. Kick off sessions powered by this model, walk away, and come back to finished branches or draft PRs. The model was built specifically for async, multi-tool work &#8212; running long stretches reliably, calling tools in sequence, producing structured handoffs. That makes it a natural fit for the tasks you want to delegate completely: module refactors, issue triage, test coverage gaps, incident investigations.</p><p><strong>KiloClaw</strong></p><p>Mistral Medium 3.5 is available as a model option across KiloClaw recipes. Whether you&#8217;re running a personal claw or a work claw, you can now back those workflows with a model that handles complex, multi-step reasoning without breaking a sweat.</p><h2><strong>Try It in Kilo Today</strong></h2><p><a href="https://kilo.ai/models/mistral-medium-3-5">Mistral Medium 3.5</a> is priced at $1.50 per million input tokens and $7.50 per million output tokens through the API. 
For a frontier-class 128B model at this capability level, that&#8217;s competitive &#8212; especially for agentic runs that justify the context and reasoning headroom.</p><p>At a <a href="https://artificialanalysis.ai/models/mistral-medium-3-5/providers?blend=3-1">blended price</a> of $3 per million tokens for general chat (a 3:1 mix of input to output tokens), and just $1.56 per million tokens for long-context summarization (which works out to roughly a 99:1 mix at these rates), it&#8217;s more affordable than it might look at first glance.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!6Pb9!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb313b16-5525-49ca-9c1a-1824adccb02b_710x414.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!6Pb9!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb313b16-5525-49ca-9c1a-1824adccb02b_710x414.png 424w, https://substackcdn.com/image/fetch/$s_!6Pb9!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb313b16-5525-49ca-9c1a-1824adccb02b_710x414.png 848w, https://substackcdn.com/image/fetch/$s_!6Pb9!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb313b16-5525-49ca-9c1a-1824adccb02b_710x414.png 1272w, https://substackcdn.com/image/fetch/$s_!6Pb9!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb313b16-5525-49ca-9c1a-1824adccb02b_710x414.png 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!6Pb9!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb313b16-5525-49ca-9c1a-1824adccb02b_710x414.png" width="710" height="414" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/fb313b16-5525-49ca-9c1a-1824adccb02b_710x414.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:414,&quot;width&quot;:710,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:46835,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://blog.kilo.ai/i/196003285?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb313b16-5525-49ca-9c1a-1824adccb02b_710x414.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!6Pb9!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb313b16-5525-49ca-9c1a-1824adccb02b_710x414.png 424w, https://substackcdn.com/image/fetch/$s_!6Pb9!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb313b16-5525-49ca-9c1a-1824adccb02b_710x414.png 848w, https://substackcdn.com/image/fetch/$s_!6Pb9!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb313b16-5525-49ca-9c1a-1824adccb02b_710x414.png 1272w, https://substackcdn.com/image/fetch/$s_!6Pb9!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb313b16-5525-49ca-9c1a-1824adccb02b_710x414.png 1456w" sizes="100vw" 
loading="lazy"></picture></div></a></figure></div><p>Plus, if you grab a <a href="https://kilo.ai/features/kilo-pass">Kilo Pass</a> you can embrace a healthy discount :)</p><p>Open the model switcher in the <a href="https://www.producthunt.com/products/kilocode/launches/kilo-code-v7-for-vs-code">latest version of our VS Code extension</a>, select it in your CLI agent config, or choose it as the backing model for your next KiloClaw recipe. It&#8217;s available now in public preview &#8212; we&#8217;d love to hear what you build with it.</p>]]></content:encoded></item></channel></rss>