ai Open weights surge and the Fable 5 export control fallout
GLM-5.2 is probably the most powerful text-only open weights LLM
Z.ai released GLM-5.2 under an MIT license on June 16, first to coding plan subscribers on June 13, then as full open weights. Simon Willison assessed the 753-billion-parameter model as probably the most powerful text-only open-weights LLM currently available, comparable in scale to Z.ai's prior GLM-5 and GLM-5.1 releases. The MIT license allows commercial use without restriction.
GLM-5.2 is probably the most powerful text-only open weights LLM
Z.ai released GLM-5.2 under an MIT license on June 16, first to coding plan subscribers on June 13, then as full open weights. Simon Willison assessed the 753-billion-parameter model as probably the most powerful text-only open-weights LLM currently available, comparable in scale to Z.ai's prior GLM-5 and GLM-5.1 releases. The MIT license allows commercial use without restriction.
The Fable 5 Export Controls Harm US Cyber Defense
Kate Moussouris, CEO of Luta Security and the cybersecurity expert Anthropic brought in to review the White House's report on the Fable 5 jailbreak, confirmed that the export control restriction harms US cyber defense. She reviewed the report at Anthropic's request and was not paid by the company. Willison framed the disclosure as correcting an earlier citation chain that had gone through The Atlantic rather than directly to Moussouris.
Quoting Matteo Wong, The Atlantic
Matteo Wong reported in The Atlantic that Anthropic shared the White House's Fable jailbreak report with Kate Moussouris for independent review. She told Wong she is not being compensated by Anthropic. The account added detail to the export control dispute, confirming that Anthropic sought outside expert validation of the government's technical claims before responding publicly.
A near-autonomous AI chemist improves a challenging reaction in medicinal chemistry
OpenAI and Molecule.one published results from a near-autonomous AI chemist built on GPT-5.4. The system improved a challenging reaction in medicinal chemistry, advancing a step that had been a bottleneck in drug synthesis. OpenAI described the workflow as near-autonomous, meaning a human remained in the loop for final decisions but the model designed and evaluated the reaction pathways without step-by-step instruction.
Predicting model behavior before release by simulating deployment
OpenAI introduced Deployment Simulation, a method that uses real conversation data to predict how a model will behave before it ships. The system runs synthetic deployments against recorded interaction patterns to surface failure modes and safety gaps that standard evals miss. OpenAI framed it as a complement to red-teaming, not a replacement.
Introducing LifeSciBench
OpenAI released LifeSciBench, a benchmark for evaluating how AI systems handle real-world life science research tasks. The questions were authored and reviewed by domain experts rather than generated from existing datasets. OpenAI positioned it as a harder target than existing science benchmarks because the tasks reflect actual decisions researchers face rather than textbook problem formats.
Quoting Charity Majors
Charity Majors argued that 2025 inverted the economics of code production. Lines of code went from being expensive and time-consuming to produce to effectively free and instant. Simon Willison quoted the argument approvingly, framing the shift as a change not just in cost but in how engineers should think about what work is worth doing manually versus delegating to a model.
Midjourney Medical
Midjourney launched Midjourney Medical, a new product applying its image generation and interpretation capabilities to medical imaging. The announcement reached 884 points on Hacker News with 605 comments. Midjourney positioned the product as a second line of business alongside its generative art tools, targeting clinical and diagnostic imaging workflows.
How do you analyze the relative "strength" of probes? [R]
A researcher on r/MachineLearning asked how to analyze the relative strength of probes in mechanistic interpretability work, noting the question arises in the context of factuality guarantees for model outputs. The thread surfaced current practice: linear probes are the standard, with probe strength typically assessed by held-out accuracy, mutual information, or causal intervention experiments.
OpenAI's 2025 financials LEAKED
Mo Bitar examined OpenAI's 2025 financial results, which were leaked to YouTube. The revenue figures and cost structure offer a window into the economics of frontier model development and inference at scale.
NVIDIA's New Free AI - A Gift To Humanity
NVIDIA released a free AI model, according to Two Minute Papers. The release raises questions about the economics of commodity models and whether free or near-free offerings can sustain a business model.
Chinese OPEN SOURCE model better than FABLE
Mo Bitar reported that a Chinese open-source model outperforms Claude Fable. The comparison underscores the pace of open-weights development and competitive pressure from non-US labs.
GLM-5.2 is probably the most powerful text-only open weights LLM
Z.ai released GLM-5.2 with 753 billion parameters under MIT license. The open-weights model ranks among the most capable text-only LLMs and shipped just days after an earlier GLM-5.1 release.
Quoting Charity Majors
Charity Majors described how code production economics inverted in 2025. What was scarce, time-consuming, and expensive became effectively free and instant; that shift cascades through how teams value code and treat technical debt.
State of the blog, mid-2026
Nathan Lambert marked three years of weekly writing at Interconnects. The reflection covers the evolution of AI discourse, frontier lab transparency, and what the state of the blog reveals about how the field has matured.
Demis Hassabis On What AI Will Do Next
Two Minute Papers covered Demis Hassabis discussing what AI will do next. The DeepMind CEO offered perspective on next-generation capabilities and the timeline for further capability expansion.
They Looked Inside Claude's AI's Mind. It Got Weird
Two Minute Papers examined interpretability research on Claude's internals. The work uncovered unexpected behavior in how the model represents and reasons about concepts, described as getting weird.
State of the blog, mid-2026
Nathan Lambert reflected on three years of Interconnects writing, tracing how AI discourse and frontier lab transparency have evolved through the period.
Frontier post-training recipe review with Finbarr Timbers
Nathan Lambert interviewed Finbarr Timbers on frontier post-training recipes. The conversation examined what research teams are learning about scaling laws, data, and compute allocation in post-training.
Sam Altman is starting to panic
Mo Bitar analyzed signals that Sam Altman is starting to panic. The video examined recent statements and strategic moves by OpenAI leadership in response to competitive pressure from open-source and Chinese models.
I had Fable build several projects for me. I'm disturbed by what I saw.
Mo Bitar used Claude Fable to build several projects and documented surprising behaviors. The video reviewed what the model produced and flagged concerning patterns in how it approached tasks.
What the Freakiness of 2025 in AI Tells Us About 2026
AI Explained analyzed what the unusual events of 2025 reveal about expectations for 2026. The video examined inflection points in frontier model releases and capability jumps.
I read every major CS paper of the last 100 years...
Fireship examined every major computer science paper of the last 100 years. The video synthesized seminal work across algorithms, systems, and theory, identifying recurring ideas and inflection points in computing research.
Quoting Georgi Gerganov
A developer compared Qwen3.6-27B and Claude Opus on local coding tasks. The 27B model proved capable for everyday development work, suggesting that local models offer genuinely viable alternatives to cloud inference for specific workloads.
Quoting Georgi Gerganov
Georgi Gerganov testified to using Qwen3.6-27B daily for coding tasks on M2 Ultra and RTX 5090 hardware. The model proved capable for mundane development work, suggesting that local inference competes with cloud LLMs on latency and cost for certain tasks.
software Agents, local models, and what Meta is doing to its engineers
Why is Meta destroying its engineering organization?
The Pragmatic Engineer reported that Meta's leadership has been running an AI-fueled restructuring of its engineering organization. The piece covered what has happened inside the company's engineering teams, including changes to headcount, workflows, and how AI tools are being integrated into or used to justify reorganization decisions.
Local Qwen isn't a worse Opus, it's a different tool
Alex Ellis argued that a local Qwen model is not a worse version of Anthropic's Opus; it is a different tool suited to different tasks. The post reached 265 points on Hacker News. Ellis framed the comparison as a category error: local models excel at specific, well-scoped coding and automation tasks where latency, cost, and privacy matter, while cloud frontier models remain better for broad reasoning and ambiguous prompts.
Bringing more agent harnesses and frameworks to Cloudflare, starting with Flue
Cloudflare opened up its Agents SDK primitives so that any agent framework can build on the runtime. Flue became the first framework targeting the Agents SDK, and Cloudflare rolled out agent management tools in the dashboard. The move is an attempt to position Cloudflare's infrastructure as the execution layer for agentic workloads across multiple frameworks.
Introducing the Cloudflare One stack: agent-powered deployment
Cloudflare launched the Cloudflare One Stack, a library of agent skills that gives AI agents the knowledge to plan, deploy, and manage a Zero Trust network environment without requiring migration calls or manual configuration. The product targets enterprises that want AI to handle network security operations.
How we run Firecracker VMs inside EC2 and start browsers in less than 1s
Browser-use published the engineering detail behind how it runs Firecracker microVMs inside EC2 to start browser instances in under one second. The post covered the VM lifecycle, snapshot and restore mechanisms, and how the team reduced cold-start latency for browser-based agent tasks. The piece reached 289 points on Hacker News.
US holds off blacklisting DeepSeek, more than 100 firms deemed security risks
Reuters reported that the US held off blacklisting DeepSeek even as more than 100 Chinese firms were added to a security risk designation. The decision keeps DeepSeek's models accessible in the US for now; the Reuters account cited officials who said the assessment of DeepSeek is ongoing. The article reached 472 points on Hacker News with 520 comments.
RFC 10008: The new HTTP Query Method
RFC 10008 standardized a new HTTP QUERY method, giving clients a way to send request bodies with read-only, safe, idempotent semantics. Previously, developers who needed to send structured query parameters in a body had to use POST and lose the caching and idempotency guarantees that GET carries. The RFC reached 384 points on Hacker News.
Bringing more agent harnesses and frameworks to Cloudflare, starting with Flue
Cloudflare opened the Agents SDK primitives, allowing any agent framework to build on its runtime. Flue is the first framework targeting Agents SDK, with agents now available in the dashboard.
Introducing the Cloudflare One stack: agent-powered deployment
Cloudflare announced the Cloudflare One stack, a library of agent skills for planning, deploying, and managing zero-trust environments. No migration calls are required; agents get knowledge of existing infrastructure.
Cloudflare DMARC Management is now generally available
Cloudflare DMARC Management reached general availability. The tool gives every Cloudflare customer free unified visibility into email authentication posture, record analysis, and SPF audits for full DMARC enforcement.
Growing the Cloudflare AI team with talent from Ensemble AI
Cloudflare hired talent from Ensemble AI to deepen its investment in machine learning infrastructure and efficiency. The team will focus on optimizing AI workloads on Cloudflare's network.
Building Agents that Don't Break Themselves
Fly.io documented how teams build agents that don't break themselves. The approach wraps risky operations in a Sprite sandbox, keeping agents alive even when they attempt self-destructive tasks.
Cheapest AI observability tools for developers, compared
PostHog compared cheapest AI observability tools for developers. Teams shipping AI features hit the same blind spots,latency, token costs, model failures,that don't surface until production; the comparison covered options at different price points.
Volkswagen started blocking GrapheneOS users
Volkswagen started blocking GrapheneOS users from its mobile app. The manufacturer's app detected the alternative OS and rejected requests; the move illustrates how OEM lock-in extends to third-party operating systems.
Lore; Open source version control system designed for scalability
Lore shipped as an open-source version control system designed for scalability. The system targets massive repositories and complex merge scenarios where git reaches resource limits.
RFC 10008: The new HTTP Query Method
RFC 10008 defined a new HTTP Query Method. The standard extends HTTP's toolkit for query operations in ways that preserve cache semantics and improve efficiency for read-heavy APIs.
CI/CD with Robert Erez
Pragmatic Engineer interviewed Robert Erez of Octopus Deploy on CI/CD, Kubernetes, GitOps, and progressive delivery. The conversation covered AI's role in continuous integration and how deployment practices are evolving.
Every operating system concept in one video…
Fireship covered every operating system concept in one video. The comprehensive explainer traced abstractions from kernel scheduling to file systems, process management, and memory protection.
pharma Nitazenes spreading, vaccine policy scrutiny, and infant formula safety questions
Super-potent synthetic opioids called nitazenes are spreading across the U.S.
CDC data shows confirmed nitazene overdose deaths in the US rose from 27 in 2020 to 409 in 2024. Nitazenes are a class of synthetic opioids significantly more potent than fentanyl. STAT reported that the drugs are spreading geographically across the US and appearing in combinations with other substances, complicating detection and treatment.
STAT+: Senate Democrats demand HHS provide records on federal vaccine policy
Senate Democrats opened a formal inquiry into the Trump administration's remaking of US vaccine policy, demanding records from HHS officials by the following week. The inquiry targeted changes implemented under RFK Jr.'s leadership and focused on how decisions about vaccine recommendations and schedules were made and documented.
New infant botulism outbreak puts fancy formulas under scrutiny
A new infant botulism outbreak linked to premium baby formula put high-end brands including ByHeart and Nara under regulatory scrutiny. STAT reported that the affected products position themselves as safer alternatives to standard formulas; the outbreak is prompting questions about whether their production processes carry different microbiological risks.
FTC, four state AGs sue transgender health group over care standards
The FTC and four state attorneys general sued WPATH, the World Professional Association for Transgender Health, over its clinical care standards. The complaint alleged that WPATH's standards-setting process amounts to an anticompetitive agreement among clinicians. STAT reported the suit is the first federal antitrust action targeting a medical professional organization over care guidelines.
RFK Jr. presents $700 million in mental health funding, but experts say grants aren't new
RFK Jr. announced $700 million in mental health funding targeted at homelessness, addiction, and mental illness. Experts told STAT the grants are not new money; they are existing federal allocations being repackaged and presented under new framing. The administration did not dispute that characterization when contacted by STAT.
Montana's SB535 and a Potential Biotech Renaissance in America
Tyler Cowen covered Montana's SB535, legislation designed to attract biotech development by reducing regulatory friction for clinical trials and drug development within the state. The post also noted that China's NMPA approved 83 new drugs in 2024 versus the FDA's 50, and that China's share of global commercial clinical trials rose from 8% in 2013 to 30% in 2024.
NIH diversity programs doubled undergraduates' odds of getting a Ph.D., 20-year study finds
A 20-year study found that two NIH diversity-oriented programs doubled the probability that an undergraduate would go on to earn a PhD. STAT reported the programs have since been terminated. The study tracked participants from the early 2000s through graduate school completion, comparing outcomes to matched peers who did not participate.
Six of the biggest health news stories today
STAT News covered six major health stories: premium infant formula under scrutiny after botulism outbreak, synthetic opioid spread, Senate Democrats probing vaccine policy, AI's energy demands as public health issue, FTC targeting gender-affirming care standards, and more.
STAT+: In pancreatic cancer, Patrick Soon-Shiong makes promises he has not kept
STAT investigative reporter Adam Feuerstein documented a gap between Patrick Soon-Shiong's cancer drug claims and real-world results. The pancreatic cancer therapies have underperformed the billionaire's public statements.
Super-potent synthetic opioids called nitazenes are spreading across the U.S.
Nitazenes, super-potent synthetic opioids, are spreading rapidly across the US. CDC data shows overdose deaths involving nitazenes rose from 27 in 2020 to 409 in 2024, driven by illicit drug supply contamination.
Opinion: Congress should embrace strategic health diplomacy
A trio of former Senate health leaders wrote in STAT that improving global health should be a top policy priority for the United States. The piece ties pandemic preparedness, disease surveillance, and international engagement to American security.
What's next for GLP-1 weight loss drugs?
STAT reported on the GLP-1 weight loss drug market and where it is heading. The segment covered competition between Novo Nordisk, Eli Lilly, and others as insurance coverage expands and manufacturing capacity ramps.
STAT+: Senate Democrats demand HHS provide records on federal vaccine policy
Senate Democrats demanded HHS produce records on how the Trump administration is remaking US vaccine policy. The inquiry comes as RFK Jr. takes a more visible role in health policy decisions.
New infant botulism outbreak puts fancy formulas under scrutiny
A new infant botulism outbreak puts premium baby formula makers under scrutiny. Companies position their products as safer than standard options; the outbreak tests whether that positioning withstands evidence.
Opinion: AI's growing appetite for power is a public health issue. Here's a fix
Sten Vermund and Patricia Kissinger argued in STAT that AI's growing appetite for electricity is a public health issue. The energy demand affects water systems, grid stability, and healthcare infrastructure capacity.
RFK Jr. presents $700 million in mental health funding, but experts say grants aren't new
RFK Jr. presented $700 million in mental health funding to address addiction and homelessness. Experts questioned whether the figure represents new appropriations or repackaging of existing grants.
STAT+: DOJ's swift win in OhioHealth case should have hospitals studying their contracts, experts say
The DOJ secured a swift antitrust settlement with OhioHealth, moving faster than typical hospital litigation. Experts say the speed signals other health systems to scrutinize their contracts and competitive practices.
FTC, four state AGs sue transgender health group over care standards
The FTC and four state attorneys general sued WPATH, the main professional organization for gender-affirming care clinicians. The lawsuit challenges the standards the group publishes for clinical practice.
NIH diversity programs doubled undergraduates' odds of getting a Ph.D., 20-year study finds
A 20-year study found that two NIH diversity programs doubled the odds undergraduates would earn a PhD. The finding comes as the agency has terminated the programs under the current administration.
healthtech Google's AMIE matches doctors, AI energy load flagged as a health issue
New research shows how AMIE, our medical AI, could help manage health conditions.
Google published research in Nature showing that AMIE, its conversational medical AI, matched primary care physicians in complex disease management scenarios. The study compared AMIE's performance against physicians across multiple chronic disease categories. Google framed the result as evidence that conversational AI can function as a clinical support tool in disease management, not just triage.
Opinion: AI's growing appetite for power is a public health issue. Here's a fix
Sten H. Vermund and Patricia J. Kissinger argued in STAT that AI's growing demand for electricity and water constitutes an overlooked public health problem. Data centers require large volumes of water for cooling and draw power that in many grids comes from fossil fuels; the authors proposed policy interventions including efficiency standards for AI infrastructure and mandatory disclosure of energy and water consumption per model deployment.
STAT+: DOJ's swift win in OhioHealth case should have hospitals studying their contracts, experts say
The DOJ reached an antitrust settlement with OhioHealth unusually quickly, and experts told STAT the speed should put other hospital systems on notice about their contract structures. The settlement required OhioHealth to modify agreements with insurers that antitrust enforcers said restricted competition. Legal experts said the swift resolution signals active DOJ appetite for hospital market enforcement.
Six of the biggest health news stories today
STAT's Morning Rounds covered six major health stories: scrutiny of premium infant formula following a botulism outbreak, the spread of nitazene synthetic opioids, an FTC action against a health-related organization, and three additional items spanning drug safety and public health. The roundup reflected a notably dense news day across regulatory and epidemiological beats.
STAT+: In pancreatic cancer, Patrick Soon-Shiong makes promises he has not kept
STAT's Adam Feuerstein reported that Patrick Soon-Shiong has made repeated public claims about what his cancer drugs can achieve in pancreatic cancer that have not been borne out by clinical results. The investigation traced a gap between promotional statements and trial outcomes, covering multiple compounds Soon-Shiong has championed over the past decade.
economy AI-native firms, shingles vaccines, and Noah's Iran take
AI-Native Firms
Hyunjin Kim and Rembrand Koning studied how AI-native firms from Y Combinator batches W20 through F24 and US venture-backed startups are organized. Tyler Cowen called the work very important. The researchers found that AI-native firms hire differently, rely more heavily on contractors and non-employee contributors, and reach comparable output levels with smaller full-time headcounts than conventional startups at similar funding stages.
The Shingles Vaccine Reduces Dementia
Tyler Cowen cited new evidence that the shingles vaccine reduces dementia risk, consistent with a 2013 hypothesis he had written about. The updated research provides stronger evidence that the varicella zoster virus contributes to some forms of dementia and that vaccination can reduce that pathway. Cowen noted the result, if replicated broadly, has significant implications for dementia prevention policy.
Iran is Trump's Katrina
Noah Smith argued that the Iran situation represents Trump's Katrina: a crisis that exposed the administration's operational incompetence to constituencies that had previously been willing to overlook it. He examined the political and strategic fallout, drawing the Katrina analogy to the moment when a reputational floor collapses and approval becomes structurally harder to recover.
AI nationalism, Europe included
Tyler Cowen argued that AI nationalism is becoming a structural feature of European policy, not just an American or Chinese phenomenon. He examined Mistral AI's position: if Mistral develops into a serious frontier lab, EU policymakers will face pressure to favor it over American or Chinese alternatives regardless of performance. He argued this dynamic is hard to escape once national or regional champions become politically salient.
The Real Reason You Can't Afford a House
Patrick Boyle examined the structural reasons why housing affordability has deteriorated across most developed economies. The video covered supply constraints, zoning, interest rate cycles, and the political economy of homeowner-dominated electorates that resist policies increasing supply. Boyle is a tier-0 source in the registry; this is his main contribution in the current window.
Fed Reserve Decisions
Kyla Scanlon examined Federal Reserve decisions and their implications. The video covered recent monetary policy signals and market interpretation of inflation dynamics.
My Conversation with Dave Baszucki
Tyler Cowen interviewed Roblox CEO Dave Baszucki. With over 100 million daily active users and projected 2026 revenue bookings of $7 billion, Roblox is one of the largest gaming economies in the world.
A Cohort Perspective on Latin America's Fertility Transition
Using census microdata from 17 Latin American countries, researchers tracked female birth cohorts from the 1920s onward. The study documents Latin America's fertility transition from a cohort perspective, showing rapid completion of the demographic shift.
Wednesday assorted links
Cowen's Wednesday links included a forager-finding database for service sector jobs, application details for a Coase workshop on institutional analysis in Mexico City, a mechanical watch explanation, and new immigration debate video.
Facts about American men and women
Researchers exploited variation in sex ratios across US birth cohorts to identify marriage preferences. The study found that much of what looks like changing preferences over the twentieth century is actually demographic change.
Do teens regret their social media use?
Irish researcher Eoin Whelan tested Haidt's claims that teens regret social media use. Whelan examined the question rigorously in the context of other regrets, using robust methodology to evaluate the hypothesis.
Can Online Activity Be Regulated? Evidence from Adult Websites
Twenty-five US states implemented age verification laws on adult websites; the policy change sparked circumvention and migration toward noncompliant platforms. The study measures the consequences of online regulation.
Tuesday assorted links
Cowen's Tuesday links covered trading with AIs, book dedications, Abdullah Ibrahim's death, Iran equilibrium analysis, economist reports on Fable 5, AI productivity gains, and Algeria's history before 1970.
Why Does The Market Go Down On Good News?
Kyla Scanlon explained why markets sometimes move down on good economic news. The video covered how positive indicators can trigger shifts in monetary policy expectations that offset equity gains.
Everyone is IPOing Now
Kyla Scanlon examined the IPO wave, noting that many companies are now going public. The segment covered market conditions, valuation multiples, and what the timing signals about investor appetite.
Gas Prices
Kyla Scanlon covered gas price dynamics, examining what is driving changes at the pump and how energy policy affects consumer costs.
What's the Deal with Bossware?
Kyla Scanlon explored bossware, the software employers use to monitor employee activity. The segment covered privacy concerns, productivity tradeoffs, and whether surveillance increases output or just increases compliance theater.
Educational arbitrage?
Justin Helman didn't get into the University of Florida but is pursuing the college experience there anyway. The story examines educational arbitrage, credential inflation, and the declining signal value of institutional brand.
Dairy: Supply and Demand Stories
Timothy Taylor examined milk supply and demand, noting the puzzle that quantity is up while price is also up. The answer involves both structural shifts in production and demand composition changes.
How Will Boomer Retirement Affect Jobs for Young Adults?
Timothy Taylor examined how boomer retirement will affect job markets for young adults. The so-called pig-in-the-python generation has been moving into retirement since 2010; the labor market consequences are complex.