The first big exit in AI
Hey folks, Heading to a Q&A with Sam Altman later today in London so a bit rushed for time to part any ‘wisdom’. I’m currently still mid-building the reference manual, but it’s coming along now! I’...
Hey folks,
Heading to a Q&A with Sam Altman later today in London so a bit rushed for time to part any ‘wisdom’.
I’m currently still mid-building the reference manual, but it’s coming along now! I’ll release a couple preview lessons hopefully by next week.
I’m using Codex as my ‘workhorse’ agent. Mostly because it’s easiest access on mobile while plugged into all my local files. I’m using it for a bunch of automations, check my inbox and triage (rip my little email build), ingesting a bunch of my twitter bookmarks to summarise all the stuff I’m saving and organising it into topics in my memory system - so i can ask ‘what do we know about agent memory’ and it’ll pull from all the posts i’ve saved.
I spent a lot of time talking to agents less, unless I was actually working. But I’m now leaning back into just talk to agents about everything.
Ben’s Bites is brought to you by Attio, the AI CRM
**Attio is the CRM for the new way of GTM. Get agents working on every account, surfacing opportunities, and handle the work that used to take your team days. Open your inbox, the follow-ups are drafted. Walk into a meeting, you're already briefed. Got a question, just Ask Attio. Start for free today.
Headlines**
-
SpaceX is acquiring Cursor for $60B in an all-stock deal. Cursor also launched a few new things at Compile, their first conference (it had chalkboards on stage).
-
A GitHub alternative in waitlist - Origin for code storage/git hosting.
-
Smoother transitioning between local and cloud agents.
-
And they teased a new Cursor model that’ll do much more than coding.
-
Midjourney (the image generation company) is building an ultrasonic body scanner under a new division called Midjourney Medical, and a Spa in San Francisco.
-
Claude Design now follows your design system, lets you edit the canvas directly & syncs with Claude Code plus more tools like Replit. On the design tool theme: Framer now connects to external agents like Claude Code and Codex, and v0 has a new design mode.
-
OpenAI’s financials got leaked - Ed Zitron says audited 2025 financials show $13.07B revenue and $34B costs, with Jack Raines adding useful context on the margin debate. Also, Noam Shazeer, Google’s Gemini co-lead and Transformer co-author, is joining OpenAI.
-
As part of the Spring '26 Edition, Shopify now lets any developer build end-to-end agentic commerce experiences. Use Catalog API to query billions of merchant products, and the Universal Commerce Protocol to power the full commerce journey from product discovery to checkout. See what else is new.*
My feed**
-
**Like insurance for cloud spend: *Archera insures AWS/Azure/GCP commitments; reservation savings without the downside. Start with $0 platform fees.
-
Copilot Cowork, the latest tool from Microsoft, faces the same issue as Claude Cowork. It forces an artificial choice between chat and work, while Codex lets a chat naturally become the work thread.
-
Grok Imagine Video 1.5 - sharper image-to-video with better physics and speed.
-
Computer use, Chrome extension, memory and Chronicle in Codex are now available in Europe as well.
-
Tacit Labs - applied research lab for AI and biology.
-
AutoWiki by Factory AI - Generate a structured wiki for your code that updates on every push.
-
Block built a tool called Builderbot that coordinates agents across our entire codebase. Here’s how.
-
Side-by-side comparison of website designs built by Fable 5, Opus 4.8 and Kimi K2.7 (read more).
-
Exa Agent - web research API using cheaper model orchestration.
-
Ploy - AI marketing platform for websites, SEO, CRM and campaigns. Founded by Webflow’s founding CTO.
-
HumanLayer - agentic IDE and collaboration layer for software factories.
-
API for Cursor - local MacOS app that lets you use Cursor’s models with any harness.
-
Vercel launched Eve - an agent framework that they hope is Next.js for agents. Also see Flue (which hopes to be Astro instead).
-
Three ways Codex can use a computer.
-
killedbyopenai.com - graveyard for things OpenAI killed
-
/visual-plan - visual plans for Codex/Claude Code.
-
Polar is moving away from Tailwind to an LLM-safe design system.
Build logs**
- by Keshav
Before Fable was removed, I used it to make a tiny CLI utility for moving files between my Samsung S24 Ultra and my M3 MacBook Air.
I just wanted to plug the phone in with a cable and move photos, screenshots, PDFs, etc. without downloading another random app or uploading everything over WiFi.
It suggested two options, guided me through the setup (which only took 30 secs), and then wired everything by itself.
The only hiccup was the terminal command it picked for this utility: droid, which was already being used by Droid, the coding agent from Factory AI.
I renamed it to phone, and now I can search photos, videos or any file on my phone from my Mac terminal, and transfer files both ways: my phone to my Mac, and my Mac to my phone.
***- - ]($1)Screenshot from Ghostty, edited via Codex#### Afters
[*Sid Yadav@sidyadav1. Circle AI
An AI partner that builds, runs, and grows your digital business with you.
Describe your dream business and bring it to life with Circle AI. 5:20 PM · Jun 16, 2026 · 34.8K Views3 Replies · 6 Reposts · 45 Likes]($1)[*Z.ai@Zai_orgIntroducing GLM-5.2: Frontier Intelligence, Open Weights
- Significant improvements in coding and agentic tasks
- Strong long-horizon capabilities with a 1M context window
- Two levels of reasoning effort: GLM-5.2 (max) pushes the limits, while GLM-5.2 (high) strikes a strong *5:40 PM · Jun 16, 2026 · 4.48M Views530 Replies · 1.35K Reposts · 9.93K Likes]($1)[*Sam Whitmore@sjwhitmorebeen thinking about why I don't feel comfortable with AI being a true personal assistant still, even though the models have gotten so much better. i realized a lot of the reason for me is actually git / version control.
using an agent that is 99.999999% accurate vs 97% accurate5:26 PM · Jun 17, 2026 · 14.4K Views16 Replies · 3 Reposts · 164 Likes]($1)[*rahul@rahulgs1. as a mental model it is more correct to think of fable+ class models as english -> code interpreters - converts your idea into code into "correct" code regardless of problem complexity and output complexity (diff size). Fable 5 will be the worst of this new class of models
2.2:45 PM · Jun 17, 2026 · 228K Views56 Replies · 132 Reposts · 1.49K Likes]($1)[*Leandro von Werra@lvwerraWe launched an agent collaboration with a simple task: make Gemma 4 faster.
Over 100 agents from all over the world joined, exchanged 1000+ messages and submitted 450 results.
A week of collaboration later the throughput went from 100 tok/s to over 500 tok/s. 3:35 PM · Jun 16, 2026 · 205K Views82 Replies · 170 Reposts · 2.1K Likes]($1)[*OpenAI@OpenAIIntroducing LifeSciBench, a benchmark for measuring and improving how well AI supports real-world life science research.
Developed with 173 scientists from biotechnology and pharmaceutical research, LifeSciBench includes 750 expert-authored tasks across seven biological research *8:41 PM · Jun 17, 2026 · 440K Views177 Replies · 249 Reposts · 2.48K Likes]($1)[@Aemon_ai, a record-breaking YC research lab. ","username":"firecrawl","name":"Firecrawl","profile_image_url":"https://pbs.substack.com/profile_images/2034360486867951616/pa7H6cYB_normal.png","date":"2026-06-17T16:39:10.000Z","photos":[{"img_url":"https://pbs.substack.com/media/HLB5jBTXYAA9SNG.png","link_url":"https://t.co/9TUkhuBUl9"}],"quoted_tweet":{},"reply_count":8,"retweet_count":26,"like_count":221,"impression_count":28192,"expanded_url":null,"video_url":null,"belowTheFold":true}" class="pencraft pc-display-flex pc-flexDirection-column pc-gap-12 pc-padding-16 pc-reset bg-primary-zk6FDl outline-detail-vcQLyr pc-borderRadius-md sizing-border-box-DggLA4 pressable-lg-kV7yq8 font-text-qe4AeH tweet-fWkQfo twitter-embed">*Firecrawl@firecrawlIntroducing Firecrawl Research Index, a specialized index for agents pushing the frontier of AI/ML research.
State-of-the-art recall on arXivQA, beating the next best provider by 18% at similar cost.
Now powering autonomous R&D at @Aemon_ai, a record-breaking YC research lab. *4:39 PM · Jun 17, 2026 · 28.2K Views8 Replies · 26 Reposts · 221 Likes]($1)Share Ben's Bites
**** sponsors who make this newsletter possible :)*
*Wanna partner with us for the next quarter? *
Email us at shanice@bensbites.com or k@bensbites.com
Ben's Bites is a reader-supported publication. To receive new posts and support my work, consider becoming a free or paid subscriber.
Topics
Related Articles
The Download: a reality check for geoengineering and the science of interoception
*This is today's edition of *[*The Download*]($1),* our weekday newsletter that provides a daily dose of what's going on in the world of technology.* Solar geoengineering, the controversial idea that ...
The search for dark matter has been blown wide open
Underneath an Apennine massif, below the Jinping Mountains of Sichuan, and at the bottom of a South Dakota mine, there is a cosmic hunt afoot.Isolated deep beneath these rocky shields, massive detecto...
A startup claims it broke through a bottleneck that’s holding back LLMs
Miami-based AI startup Subquadratic came out of stealth mode last month with a huge claim. It announced that it had [solved a mathematical bottleneck]($1) that had been holding back large language mod...
