ai Claude's 19-day gap ends; Sonnet 5 and Fable 5 return
What's new in Claude Sonnet 5
Anthropic released Claude Sonnet 5 on June 30. Simon Willison went straight to the developer documentation, which describes the model as offering improved instruction-following, stronger code performance, and more consistent tool use compared to Sonnet 4. The release came the same day the Department of Commerce lifted export controls on Claude Fable 5 and Mythos 5, which had restricted access since the models launched 19 days earlier. Fable 5 was restored globally on July 1.
Quoting Anthropic
The Department of Commerce lifted export controls on Claude Fable 5 and Mythos 5, allowing Anthropic to restore access starting July 1. Simon Willison noted the announcement came via Anthropic's Twitter account. The 19-day restriction had prevented access in certain jurisdictions and drew comparisons to the earlier GPT-5.6 restricted-partner rollout pattern observed in the prior window.
Ornith-1.0: Self-Scaffolding LLMs for Agentic Coding
DeepReinforce released Ornith-1.0, its first open-weights model under an MIT license, targeting agentic coding workloads. The release includes four variants: 9B Dense, 31B Dense, 35B MoE, and 397B MoE. The model is built on self-scaffolding techniques, meaning it generates its own intermediate reasoning scaffolds rather than relying on a fixed chain-of-thought prompt template. Simon Willison flagged it as an interesting addition to the open coding-agent ecosystem.
Newsroom
Anthropic, Amazon, Microsoft, Google, and partners in the Glasswing consortium jointly proposed an industry-wide framework for scoring jailbreak severity. The proposal assigns structured severity levels to jailbreak attempts rather than treating all bypass events as equivalent, which matters for triage, liability attribution, and model red-teaming standards. The announcement was bundled with the Fable 5 global return on July 1.
Autoresearch: The feedback loop behind self-improving agents
Introspection co-founder Roland Gavrilescu described the company's autoresearch system, which uses self-improving feedback loops to refine agent behavior over time. Gavrilescu introduced the concept of agent "recipes": structured execution plans that agents can revise based on outcomes. He argued that humans remain central to the software factory not as bottlenecks but as the source of goal-setting and quality judgment that the loop cannot generate on its own.
Warp CEO Zach Lloyd on why software factories are the next phase of coding
Warp CEO Zach Lloyd argued that every major software project will soon run on an automated factory model, where agents handle most implementation while engineers define architecture and review outputs. He told Latent Space that the transition is already underway for companies using Warp's terminal and that the pace of adoption is faster than internal surveys predicted six months ago.
AIEWF Daily Dispatch: Autoresearch and the tension between AI and human agency
Speakers at the AI Engineer World's Fair pushed back on the software factory framing on July 2, with several arguing that full automation of research and engineering work sacrifices the human understanding needed to catch systemic errors. The tension surfaced most clearly in a session on autoresearch, where attendees debated whether agents that improve themselves can remain interpretable to the humans nominally overseeing them.
🔬 The Coolest Diffusion Research Isn't in LLMs; Evan Feinberg & Sergey Edunov, Genesis Molecular AI
Evan Feinberg and Sergey Edunov, co-founders of Genesis Molecular AI, explained why the former Llama lead left Meta for drug discovery. The pair cited the co-folding accuracy threshold as the event that changed the calculus: once co-folding crossed a usability floor, diffusion models for molecular generation became worth building around. Their PEARL system achieved zero-shot performance on the OpenBind benchmark, matching or exceeding supervised methods trained on the full dataset.
On July 1, 2026, arXiv will spin out from Cornell University
arXiv spun out from Cornell University on July 1, 2026, becoming an independent nonprofit after 25 years under Cornell's umbrella. The Simons Foundation and Schmidt Sciences are providing major funding. The site is dropping its signature red color scheme. The change gives arXiv's governance structure independence from a single institution, which matters for how the preprint server handles policies on AI-generated content and data access.
Nano Banana 2 Lite
Google released Gemini 3.1 Flash Lite Image, marketed internally as Nano Banana 2 Lite, positioning it as the fastest and cheapest Gemini image model. Simon Willison tested the model in AI Studio on a Where's Waldo prompt and documented the results. The release extends Google's strategy of offering separate speed-optimized image model variants alongside its larger multimodal flagship.
AI Just Entered A New Era
Two Minute Papers assessed recent advances framed as marking an inflection point in AI capability and deployment. The analysis covered performance improvements, cost trends, and implications for production systems.
I'm freaking out about Sonnet 5
Mo Bitar expressed strong reactions to Claude Sonnet 5, documenting use cases where the model outperformed prior versions on agentic tasks and code generation.
The weird history of the internet...
Fireship produced a video exploring the technical history and quirks of internet infrastructure, examining how early design decisions shaped contemporary systems.
A new Android malware from Google
F-Droid documented a new Android malware leveraging Google's own infrastructure to distribute itself, raising questions about supply chain security in app ecosystems.
Bring back crappy forums
Tedium published a retrospective on early web forums, examining how their design and culture shaped online discourse before social media algorithms took over.
ZCode; Harness for GLM-5.2
ZCode presented itself as a harness for GLM-5.2, the Chinese open-weights model, enabling developers to run inference and fine-tuning tasks without proprietary tooling.
Show HN: Searchable directory of 22k+ products from worker-owned co-ops
A searchable directory of 22,000+ products from worker-owned cooperatives launched on Hacker News, aggregating goods and services from alternative business models.
For first time, a cell built from scratch grows and divides
For the first time, researchers successfully created and grew a synthetic cell from scratch in the lab, demonstrating complete de novo assembly of cellular machinery without biological scaffolding.
Announcing the Monetization Gateway: charge for any resource behind Cloudflare via x402
Cloudflare opened a waitlist for a Monetization Gateway enabling websites to charge for access via the x402 protocol, settling payments in stablecoins without payment processor intermediaries.
Making AI search smarter
Cloudflare published tools to help creators manage AI traffic on their sites, distinguishing between search bots, training crawlers, and agentic systems to give owners fine-grained control.
Unmasking the crawls with Attribution Business Insights
Cloudflare released an Attribution Business Insights dashboard showing website owners detailed behavior of AI crawlers accessing their content, supporting informed negotiation over compensation.
software Cloudflare charges agents for content; software factories spread
Announcing the Monetization Gateway: charge for any resource behind Cloudflare via x402
Cloudflare opened the waitlist for its Monetization Gateway, which allows any website owner to charge AI agents, other web pages, datasets, or MCP tools for access to content behind Cloudflare. Charges settle in stablecoins over the x402 open protocol. Cloudflare is not building a proprietary payments stack; it is routing settlement through the protocol and letting operators set their own prices. The gateway is positioned as infrastructure for an economy where autonomous agents are the primary consumers of web content.
Your site, your rules: new AI traffic options for all customers
Cloudflare gave all customers new controls to distinguish among three categories of AI traffic: search bots, agent bots, and training bots. Previously the choice was binary: block AI or allow it. The new granular controls let operators charge agents for access while still allowing search indexing, or block training crawls while permitting agent retrieval. The announcement coincided with Cloudflare's second Content Independence Day.
Content Independence Day, one year on: building the business model for the agentic Internet
Cloudflare published a one-year retrospective on its Content Independence Day initiative, reporting that a dynamic market for monetized content has emerged. AI agent traffic has displaced traditional search referrals as the dominant driver of API and data endpoint hits for a segment of Cloudflare's customer base. The report frames this as the emergence of an agentic internet with different economics from the advertising-supported web.
Impressions from visiting OpenAI, Anthropic, & Cursor
The Pragmatic Engineer's Gergely Orosz visited OpenAI, Anthropic, and Cursor and reported that cloud-based coding agents are the dominant trend inside all three organizations. He found that coding harnesses have spread beyond software engineers to product managers and data analysts. Cursor's forward deployed engineer program, described separately by Pauline Brunet, is placing dedicated engineers inside enterprise clients to set up agent-based software factories tailored to each organization's codebase.
How Kent Beck shapes the software engineering industry
Kent Beck told The Pragmatic Engineer that the central challenge for software engineers in the AI era is not generating code but building trust with the humans who depend on that code. Beck, the originator of Agile and TDD, argued that automated code generation makes the social and verification dimensions of engineering more important, not less, because the volume of output now exceeds any individual's ability to manually audit.
Building Agents that Don't Break Themselves
Fly.io documented a pattern its engineers are using to prevent agents from breaking themselves during long-running agentic sessions. The approach runs any risky or destructive action inside a disposable Sprite environment. If the agent makes an error that corrupts state, the Sprite is discarded and a fresh environment is restored. The technique addresses a failure mode that becomes common when agents have write access to production systems.
Have your agent record video demos of its work with shot-scraper video
Simon Willison released shot-scraper 1.10, which adds a new command called shot-scraper video. The command accepts a storyboard YAML file defining a sequence of interactions with a web application and uses Playwright to record a video of the agent executing those steps. Willison described it as a way for agents to produce self-documenting video logs of their own work without requiring a human to observe the session in real time.
Building Agents that Don't Break Themselves
Fly.io documented how to build AI agents that avoid self-destruction by running risky operations inside sandboxed lightweight VMs, preventing agents from corrupting their own execution environment.
Secure internal communication between services
Vercel launched secure internal communication channels between services on its platform, enabling private server-to-server calls without exposing credentials or API keys.
Claude Fable 5 access restored on AI Gateway
Claude Fable 5 access was restored on Vercel's AI Gateway after export control restrictions were lifted by the Department of Commerce.
Enforce consistent code for agents and humans with konsistent
Vercel introduced konsistent, a tool for enforcing consistent coding standards across both AI agents and human developers on the same codebase.
Dry-run deployments with Vercel CLI
The Vercel CLI added dry-run deployment capability, allowing developers to preview infrastructure changes before committing them to production.
Vercel Security Dashboard is in private beta
Vercel Security Dashboard entered private beta, offering centralized visibility into application vulnerabilities and compliance across deployed services.
pharma FDA clears an LLM for diabetes; biotech rally extends to 15 months
STAT+: A 'historic' FDA clearance raises the question: Is the LLM an interface or the decision-maker?
The FDA granted a clearance that STAT described as historic to UpdocHealth's LLM-based diabetes management app, which helps patients follow a treatment plan defined by their physician. The clearance prompted immediate debate about whether the LLM functions as a passive interface or as a clinical decision-maker subject to different regulatory standards. The FDA's decision draws a line that will be tested by future submissions for more autonomous AI clinical tools.
STAT+: Halfway into the year, biotech is booming
Biotech stocks have risen continuously since April 2025, and STAT reported that the rally has accelerated in 2026's first half. Capricor Therapeutics received an FDA advisory committee meeting date as one notable catalyst in the window. The sustained rally has attracted generalist fund managers who were absent from biotech for most of 2022 to 2024, and secondary offerings have been oversubscribed at a rate not seen since 2021.
STAT+: A former AI regulator, now in industry, says biopharma is reading FDA's guidance wrong
Tala Fakhouri, a former FDA AI regulator now working in industry, told STAT that biopharma companies are interpreting FDA's AI guidance more conservatively than the agency intends. Fakhouri said companies are treating guidance as a ceiling on AI use rather than a floor, which is slowing adoption of AI tools that FDA would likely clear. She called on the agency to publish more worked examples of compliant AI deployment to reduce interpretive uncertainty.
STAT+: The moment Anthropic convinced me it's serious about science
Anthropic released Claude Science, a version of Claude optimized for scientific laboratory workflows and drug discovery pipelines. STAT's AI Prognosis column reported that the release is the first time a major model maker has built domain-specific tooling aimed at bench scientists rather than software engineers, which STAT read as evidence that Anthropic is serious about scientific applications rather than treating them as marketing.
STAT+: Synthetic biology researchers think they've made a cell. Is it alive?
Researchers announced the creation of what they claim is the first synthetic cell capable of growth and division, developed through the Biotic Institute's Spudcell project. The team has formed a public benefit corporation to share the technology openly with other scientists. The claim is generating significant debate about what counts as a cell and whether the construct's reliance on synthetic components disqualifies it from the definition of life.
Statins and blood pressure drugs changing health risks of obesity, study suggests
A Lancet study found that obesity patients over 40 are now managing blood pressure and cholesterol as well as healthy-weight peers, a shift attributed to widespread use of statins and antihypertensives in that population. Younger obese adults do not show the same improvement, suggesting the cardiovascular risk reduction from these medications has not yet reached adults who developed obesity more recently and at younger ages.
STAT+: A year after distressed buyout, what's become of Bluebird Bio?
David Meek, who led the distressed acquisition of Bluebird Bio and rebranded it as Genetix, told STAT the company is already generating positive cash flow one year after the buyout. Bluebird had been widely described as a cautionary tale about the economics of gene therapy. Meek's turnaround involved narrowing the product focus, cutting fixed costs, and renegotiating payer contracts on the remaining approved therapies.
STAT+: More frequent, more intense, and longer heat waves pose health risks for Americans
New data shows that more frequent, longer, and more intense heat waves are creating health risks for Americans, particularly affecting cardiovascular and mental health outcomes in vulnerable populations.
STAT+: Halfway into the year, biotech is booming
Biotech stocks have rallied continuously since April 2025 and accelerated in the first half of 2026, driven by FDA clearances, deal activity, and investor appetite for AI-enabled drug discovery.
STAT+: A 'historic' FDA clearance raises the question: Is the LLM an interface or the decision-maker?
The FDA cleared an AI-powered app for diabetes management that helps patients execute doctor-defined treatment plans, raising questions about whether the AI is an interface or a decision-maker.
The new presidential fitness test is a positive step, not the solution, experts say
The newly revived presidential fitness test is a step forward for youth physical activity but insufficient alone; experts say sustained engagement matters more than a single assessment.
STAT+: A year after distressed buyout, what's become of Bluebird Bio?
David Meek's acquisition of Bluebird Bio and reorganization as Genetix has turned the troubled biotech profitable within a year, demonstrating a viable turnaround path for failed gene therapy programs.
STAT+: A former AI regulator, now in industry, says biopharma is reading FDA's guidance wrong
Tala Fakhouri, formerly an FDA AI regulator now working in industry, argues that biopharma companies are interpreting FDA's AI guidance too conservatively, leaving value on the table.
Opinion: CDC leadership continues to try manufacture a scientific debate on vaccines where none exists
CDC leadership continues manufacturing a scientific debate on vaccine effectiveness using test-negative design arguments where no debate exists among epidemiologists, writes Ben Lopman.
Opinion: Teens are turning to chatbots for mental health help. We need rules to keep them safe
The use of AI chatbots for mental health advice among young people surged over 40% in a single year, prompting calls for safety guardrails and clinical oversight.
Statins and blood pressure drugs changing health risks of obesity, study suggests
New Lancet research shows statins and blood pressure medications are changing the health risk profile for obese patients over 40, bringing cardiovascular outcomes closer to those of normal-weight peers.
STAT+: The moment Anthropic convinced me it's serious about science
Anthropic's release of Claude Science, a model optimized for lab work and drug development, suggests the company views science applications as serious rather than boutique use cases.
STAT+: Synthetic biology researchers think they've made a cell. Is it alive?
Synthetic biology researchers announced creation of what they describe as the first synthetic cell, capable of growth and division without natural biological scaffolding or genetic inheritance.
STAT+: Pharmalittle: We're reading about Anthropic drug development goals, Trump drug plan snags, and more
Anthropic announced Claude Science availability for labs and drugmakers, enabling AI-native drug development workflows; separate reporting found Trump's drug price plan is hitting implementation snags.
healthtech Teens turn to chatbots for mental health; heat risks climb
Opinion: Teens are turning to chatbots for mental health help. We need rules to keep them safe
The share of young people using AI chatbots for mental health advice rose more than 40% in a single year, according to research cited in a STAT opinion column. The author argued that without regulatory standards for chatbot mental health interactions, teenagers are receiving emotional support from systems not designed or tested for clinical use. Several states are considering bans on chatbot mental health apps for minors, though the author argued targeted safety rules would be more proportionate than outright prohibition.
STAT+: More frequent, more intense, and longer heat waves pose health risks for Americans
STAT's Morning Rounds reported that heat waves are becoming more frequent, more intense, and longer in duration across the United States, with direct health consequences for vulnerable populations including the elderly, outdoor workers, and those without access to air conditioning. The report coincided with a new set of obesity risk data showing that younger adults are not benefiting from the same cardiovascular medication coverage that protects older patients.
Opinion: CDC leadership continues to try manufacture a scientific debate on vaccines where none exists
Epidemiologist Ben Lopman wrote in STAT that CDC leadership is manufacturing scientific controversy around vaccine effectiveness methodology where the underlying science is settled. Lopman specifically addressed CDC leadership's public questioning of the test-negative design, which is the standard method for measuring vaccine effectiveness against circulating strains. He described the approach as a documented technique for introducing false uncertainty into established findings.
STAT+: Pharmalittle: We're reading about Anthropic drug development goals, Trump drug plan snags, and more
Anthropic announced Claude Science, positioning it for use by drug developers and scientific labs. STAT's Pharmalittle also noted that Trump's drug pricing plan is encountering implementation obstacles, with several provisions facing legal challenges from pharmaceutical manufacturers over Most Favored Nation pricing mechanisms and international reference pricing authority.
economy Supersonic flights legalized; rent control data from Norway; social media negativity
Civilian supersonic flights are being legalized in the U.S.
The U.S. Department of Transportation proposed replacing speed-based restrictions on civilian aircraft with noise-based standards for the first time since the 1970s, effectively legalizing civil supersonic flight over land. The current rules banned all overland supersonic flight regardless of noise levels. The proposal opens a path for aircraft like Boom Supersonic's Overture and Aerion's planned successor to fly domestic U.S. routes, subject to meeting noise thresholds that will be set in a subsequent rulemaking.
Rent Control: The Ceiling Trap
Tyler Cowen published a new website, Rent Control: The Ceiling Trap, drawing on data from Norway's 1982 abolition of rent control. Economist Are Oust tracked Oslo newspaper listings before and after the abolition and found that rental supply expanded substantially once controls were lifted, while rent levels initially rose but then stabilized as new supply entered the market. Cowen used the Norwegian case to argue that rent control reduces visible supply more than it reduces underlying housing costs.
Differentiation drives the erosion of positivity on social media
A study analyzing over 2 billion Reddit comments found that discourse within topic-bound communities becomes systematically more negative over time, driven by differentiation pressure rather than platform design. Commenters in established threads face incentives to distinguish their contributions from prior posts, and negative framing is the most legible form of differentiation. The experimental component confirmed the pattern held even when controlling for topic selection and community size.
Roundup #84: Bears on bikes
Noah Smith's Roundup 84 covered several topics including AI's potential depolarizing effects, existential AI risk debates, millennial versus boomer wealth dynamics, and GLP-1 adoption curves. On AI and jobs, Smith cited research from Korinek, Autor, and Gimbel suggesting that AI's employment effects will vary sharply by task type and that aggregate displacement predictions made two years ago have not materialized at the predicted pace.
Liquidity Fades as Treasuries Age
New York Fed economists documented that on-the-run Treasuries, the most recently issued securities representing less than 4% of the $30 trillion outstanding stock, account for 65% of average daily trading volume. Liquidity drops sharply as bonds age into off-the-run status, often within weeks of a new auction. The paper traced the mechanism to dealer inventory constraints and the premium placed on settlement certainty for the most recent issue.
Is Alexander Calder the great American artist?
Tyler Cowen examined whether Alexander Calder could claim the title of greatest American artist, crediting his work for blending aesthetic beauty with intellectual depth and accessibility.
Emergent Ventures India, 17th cohort
The Emergent Ventures India 17th cohort funded builders including Aryamman Bhatia, who is developing open-source chip microfabrication tools to democratize semiconductor design.
How to ask for help from a stranger
Making requests easy to accept involves reducing the cost of saying yes; magnitude, format, and timing matter more than the underlying request itself.
Wednesday assorted links
Florida drivers speed up after home NFL losses; infrastructure spending productivity remains mismeasured; MRU rebuilt its platforms using AI agents from the ground up.
Most Answers Are In Books
Kyla Scanlon produced a short video arguing that most business answers and strategic insights already exist in printed books and rarely require original research.
Fables of the Reconstruction/Reconstruction of the Fables
Tyler Cowen examined the interplay between fables and historical reconstruction, analyzing how narratives shape and are shaped by evidence.
What I've been reading
Cowen's reading list included Elizabeth Buchanan's overview of Greenland's history and strategic position, plus updates on Scandinavian claims and resource economics.
Tuesday assorted links
Links covered AI's labor market effects via Autor and Korinek, OpenAI's new economics job posting, state capacity challenges, and depolarization dynamics from current AI systems.
The Supreme Court
Kyla Scanlon released a short video on the Supreme Court's recent rulings and their implications for American governance.
What should I ask Daron Acemoglu?
Tyler Cowen announced an upcoming conversation with economist Daron Acemoglu, focusing on his recent writings on AI and a forthcoming book on democracy.
Prediction markets paragraphs to ponder
Risk Labs' prediction market oracle combines cryptocurrency, voting, and game theory to produce decentralized judgments; critics question its fairness and resilience to manipulation.
Tech Selloff
Kyla Scanlon analyzed a recent tech sector selloff, examining the dynamics and fundamentals driving equity withdrawals.
What happened when Norway raised taxes on the wealthy?
Money & Macro examined Norway's tax increases on the wealthy, exploring the economic and distributional effects of progressive tax policy adjustments.
Interview with Mervyn King: All Things Monetary Policy
Former Bank of England governor Mervyn King discussed his career, lessons from the 2008 financial crisis, and the importance of central bank transparency.
The Disappearing Overnight Drift
The overnight drift in US equity futures has nearly disappeared since 2021, suggesting market microstructure changes and reduced arbitrage opportunities between US and European markets.
Where Will the Jobs Come From in Developing Economies?
Timothy Taylor examined job creation in developing economies, analyzing how employment growth differs from wealthy countries due to sectoral structure and human capital constraints.
Why AI Isn't Going to Take Your Job
Maxinomics argues that fears of AI-driven unemployment are overstated; historical precedent suggests technology creates new occupations faster than it destroys old ones.