Daily News
AI research, safety, product, and engineering links
Latest AI Reading
Source-dated posts from the last 14 days
Updated 2026-06-23 00:10 UTC
Model Size Scaling in 2023-2031
Token generation speed is constrained by the speed at which the relevant HBM can be read, which is mostly the weights and KV-cache. Suppose a model is large, so that more than half of HBM is read when making a single pa...
Read originalLLM-Driven Feature Discovery
We would often like to get a qualitative sense of a target model’s behaviors in important distributions (e.g. deployment, RL training, or evals). For example, we might want to discover novel behaviors , figure out what...
Read originalLLM-Driven Feature Discovery
We would often like to get a qualitative sense of a target model’s behaviors in important distributions (e.g. deployment, RL training, or evals). For example, we might want to discover novel behaviors , figure out what...
Read originalPathological Narcissism: The Pendulum Swing between Echoism and Sovereignism
Chapter 4: Most of my friends with pathological narcissism have more echoist and more sovereign sides. What determines which one takes the stage? Introduction In my article “ Narcissism, Echoism, and Sovereignism: A 4-D...
Read originalSpeedup from AI Ghostwriting
I used Claude Opus 4.6 to ghostwrite the first drafts of the articles in my Psychopathy sequence . That approach saved me some two days if the drafts turned out well compared to ones that didn’t turn out well, which I h...
Read originalRed-Teaming after Mythos — Zico Kolter & Matt Fredrikson, Gray Swan
OpenAI boardmember Zico Kolter and Gray Swan CEO Matt Fredrikson join swyx to explain why AI security is not just “cybersecurity with AI”
Read originalFunctional Emotions and The Pope’s Encyclical on AI — Digital Minds Newsletter #3
Welcome back to the Digital Minds Newsletter, your curated guide to the latest developments in AI consciousness, digital minds, and AI moral status. If you enjoy this newsletter, please consider sharing it with others w...
Read originalInterpreting Language Model Parameters
Read originalBuilding pay-per-intelligence for AI agents: How Ampersend uses Amazon Bedrock AgentCore Payments
In this post, you will learn how Ampersend built a pay-per-intelligence routing layer on top of Amazon Bedrock AgentCore Payments. AI agents autonomously route tasks to the most effective model, pay per request, and ope...
Read originalPlanning for Preservation in the Age of AI
Nectome liked my earlier essay , and reached out to hire me to write more about their project, and about cryonics more broadly. This is the first such piece. A friend of mine, just a few years older than me, was diagnos...
Read originalInside NVIDIA Halos for Robotics: A Full-Stack Functional Safety System for Physical AI
Physical AI—robots working autonomously alongside people in factories, warehouses, hospitals, and homes—is arriving faster than most expected. Traditional...
Read originalEmbed the world: Multimodal AI for searchable aerial imagery at scale
In this post, we walk through the problem space, our architecture on Amazon Bedrock and Amazon OpenSearch Serverless, the evaluation methodology we built on OpenStreetMap ground truth, four experiments that compared emb...
Read originalRunning ComfyUI workflows on Amazon SageMaker AI processing jobs
In this post, we walk you through how to deploy ComfyUI workflows on Amazon SageMaker AI processing jobs to generate hundreds of high-quality images in a single batch. You learn how to set up the infrastructure using AW...
Read originalAdvocates Can Influence LLM Values By Editing Wikipedia
This article is a summary of an original study: Brazilek, J., Navas, M., & Gnauck, A. (2026). Small edits, large models: How Wikipedia advocacy shapes LLM values. Zenodo. https://doi.org/10.5281/zenodo.19981454 We’d lik...
Read originalLong-Term Implants Need To Be Stretchy
Mechanical mismatch injures neurons each time the soft tissue moves. To prevent this, microelectronic meshes should be cushioned with hydrogels or similar materials. At cortical parenchyma, just below where webbed colla...
Read originalGLM-5.2 is the step change for open agents
A capability threshold I've been carefully monitoring.
Read originalLearning to Understand Evil
And it was only when I lay there on rotting prison straw that I sensed within myself the first stirrings of good. Gradually it was disclosed to me that the line separating good and evil passes not through states, nor be...
Read originalDefeatism as Disempowerment
"Critiques of fear-based approaches need to deal with the actual arguments for danger. It sounds like the book didn't, and you don't here You don't make a new technology or encounter with a new species safe by ignoring...
Read originalPP-OCRv6 on Hugging Face: 50-Language OCR from 1.5M to 34.5M Parameters
Read originalAt ISC, JUPITER Shows What Exascale Science Looks Like
JUPITER, Europe’s first exascale supercomputer at Germany’s Forschungszentrum Jülich, runs on NVIDIA Grace Hopper Superchips and NVIDIA Quantum-X800 InfiniBand networking — and it’s had a busy year. As the international...
Read originalNAIRR Science Program Reshapes Scientific Research, Powered by NVIDIA AI Infrastructure
For the past two years, the U.S. National Science Foundation’s National Artificial Intelligence Research Resource (NAIRR) pilot program has driven innovative research across the U.S. for over 700 projects — spanning pro...
Read originalNVIDIA Vera CPU Opens the Way for Agentic Scientific AI at Los Alamos National Laboratory
Mission, Vision and Veritas — new Los Alamos National Laboratory (LANL) supercomputers to be built with HPE and NVIDIA — are tapping NVIDIA Vera CPUs to accelerate scientific discovery, unlocking agentic AI for science....
Read originalFrom Materials Simulation to Experimental Astronomy, New NVIDIA AI Software Unlocks Scientific Discoveries
At the ISC conference running in Hamburg this week, NVIDIA is introducing new software that speeds AI for science, from chemistry and materials discovery to the search for dark matter. The NVIDIA DAQIRI library and new...
Read originalEco Wave Power Turns Waves Into Watts With NVIDIA AI Infrastructure and Digital Twins
The next era of AI will not be defined by compute alone. Its growth will be determined by energy. As accelerated computing scales across AI factories, agentic AI, industrial AI, edge computing and physical AI — includin...
Read originalImport AI 462: Superpersuasion; self-sustaining AI; paths to ASI
Welcome to Import AI, a newsletter about AI research. Import AI runs on arXiv, cappuccinos, and feedback from readers. If you’d like to support this, please subscribe. Subscribe now AI can decisively out-persuade humans...
Read originalHotter Than a Hot Tub: The 45°C Breakthrough to Cool AI’s Biggest Machines
Hot tubs sit at about 38 to 40 degrees Celsius, warm enough that most people can only soak for about 15 minutes. NVIDIA’s newest AI servers can run their cooling liquid even hotter — up to 45 degrees Celsius, or 113 deg...
Read original[Exclusive] $250 off AI Engineer tix til Monday
special offer for subscribers - $250 off AI Engineer tix til Monday
Read originalHow to Optimize Transformer-Based Models for Low-Precision Training
Transformer architectures are the backbone of many modern large language and generative AI models. As these models grow in size, training runs consume more GPU...
Read originalHow transparent is DiffusionGemma (and why it matters)
Authors: Joshua Engels*, Callum McDougall*, Bilal Chughtai*, Janos Kramar, Senthoran Rajamanoharan, Cindy Wu, Arthur Conmy, Asic Q Chen, Jean Tarbouriech, Min Ma, Brendan O'Donoghue+, João Gabriel Lopes de Oliveira+, Ro...
Read original[AINews] not much happened today
a quiet day lets us promo AIE one last time
Read originalIntroducing Web Search on Amazon Bedrock AgentCore
Web Search on Amazon Bedrock AgentCore is now generally available. In this post, we walk through what makes Web Search on Amazon Bedrock AgentCore different, why it matters, and how to wire it in with a few lines of cod...
Read originalAccelerate campaign workflow with insights from Adobe Marketing Agent for Amazon Quick
This post shows how to enable Adobe Marketing Agent for Amazon Quick using a Model Context Protocol (MCP). We walk you through how to configure the integration, authenticate using your Adobe credentials, and get the lat...
Read originalBanning Open Source AI Would Be A Mistake
This post was originally an op-ed co-authored with Kevin Xu of Interconnected for a general, non-technical audience.
Read original[AINews] GLM > GPT? GLM-5.2 passes vibe check; Z.ai forecasts Open Fable by December
With GLM-5.2 passing everyone's vibe check, the open models story finally becomes a real frontier story.
Read originalMonitor and debug generative AI inference with SageMaker detailed metrics and Insights dashboard on CloudWatch
Amazon SageMaker AI provides fully managed real-time inference hosting for machine learning models. You deploy a model to a SageMaker endpoint backed by one or more compute instances, and SageMaker handles provisioning...
Read originalThe distillation double bind: Distilling misaligned models either transfers misalignment or it doesn't
If it transfers misalignment, we might get a misaligned model that’s easier to incriminate. If it doesn’t, we might get a capable benign replacement model.
Read originalHow FERC’s Large-Load Interconnection Actions Help Address Grid Stress, Improve Affordability
In a consequential grid infrastructure decision, the Federal Energy Regulatory Commission (FERC) today issued a major milestone on large-load interconnection impacting how those building AI factories, semiconductor fabr...
Read originalMosaicLeaks: Can your research agent keep a secret?
Read originalReinforcement learning towards broadly and persistently beneficial models
Training targeting beneficial behavior in realistic scenarios produces broad improvements in alignment that generalize across domains and persist under adversarial pressure.
Read originalAmazon Bedrock AgentCore harness is now generally available: Go from idea to production-grade agent in minutes
Today, Amazon Bedrock AgentCore harness is generally available. Two API calls (CreateHarness to define an agent, and InvokeHarness to run it), and you have an agent running in seconds. The agent runs in its own isolated...
Read originalThe Professor of Outputmaxxing — Anjney Midha, AMP
We talk about how this legendary investor went from humble beginnings in Singapore to leading rounds in Anthropic, Mistral, Black Forest Labs, and Periodic Labs... and the AMP secret master plan!
Read originalGDM AI Control Roadmap
GDM has published an AI Control Roadmap ! From the executive summary: We present the GDM AI Control Roadmap (v0.1) – our plan for implementing and adopting internal guardrails designed to catch potential adversarial beh...
Read originalAt Cannes Lions, NVIDIA Partners Reshape Advertising and Marketing With AI
The digital era gave the advertising and marketing industry speed; the AI era is giving it autonomous operations. For companies building next-generation technologies for advertising and marketing, the question is no lon...
Read originalSync and Stream: GeForce NOW Connects to Members’ Game Libraries Across Devices
Play favorite titles from popular game libraries, keep progress synced and jump back into gaming sessions on virtually any device. That’s the power of GeForce NOW cloud gaming. From providing access to members’ favorite...
Read originalFrance Advances Europe’s AI Future With NVIDIA Technologies
A year ago at NVIDIA GTC Paris at VivaTech, France laid out plans to advance local AI — from new AI factories and national compute capacity to open frontier models and industrial platforms. Now, that AI infrastructure i...
Read original[AINews] Midjourney Medical: scan your organs like you step on a scale
The only bootstrapped frontier lab announces its second product and second
Read originalIs it agentic enough? Benchmarking open models on your own tooling
Read originalBeyond LoRA: Can you beat the most popular fine-tuning technique?
Read originalThe Neural Geometry Series
Read originalAmazon SageMaker AI Async Inference now supports inline request payloads
Today, we’re announcing inline payload support for Amazon SageMaker AI Async Inference. Customers can now send inference payloads directly in the request body of the InvokeEndpointAsync API, removing the need to upload...
Read originalGet back hours every day with autonomous agents in Amazon Quick
Today, Quick gets even more powerful: new autonomous agents that work continuously on your behalf, an activity feed that helps you prioritize your most important work, and the ability to find insights across every data...
Read original🔬 The Self-Driving Lab — Joseph Krause, Radical AI
Radical AI's Joseph Krause on why the moat in materials is the lab, not the model
Read originalContext intelligence for your data and AI agents at scale
Agents are only as intelligent as the context they can reason over. Today, that context is scattered across data lakes, data warehouses, lakehouses, databases, and streams, and in institutional knowledge that has never...
Read originalMolmoMotion: Language-guided 3D motion forecasting
Read originalState of the blog, mid-2026
About 3 years since I started writing weekly.
Read originalFrom the Hugging Face Hub to robot hardware with Strands Agents and LeRobot
Read originalGLM-5.2: Built for Long-Horizon Tasks
Read original[AINews] GLM-5.2: the top Frontend Coding model in the world, IndexShare for Speculative Decoding
We have a new top open model in the world!
Read originalAgentic Resource Discovery: Let agents search
Read originalCCCL Runtime: A Modern C++ Runtime for CUDA
The NVIDIA CUDA Core Compute Libraries (CCCL) provides delightful and efficient abstractions for CUDA developers in C++ and Python. It features: Parallel...
Read originalEnable Real-Time AI for High-Speed Data Acquisition with DAQIRI
When AlphaFold2 revolutionized drug discovery in 2020, its success relied entirely on the roughly 170,000 protein structures collected by scientists since 1971...
Read originalBuild Your Own Transaction Foundation Model for Financial Intelligence
Every swipe, transfer, and payment on a modern financial network encodes a pattern of human behavior. Transaction data is one of the richest signals an...
Read originalUnlocking UK house-building with AI-accelerated planning
UK government partners with Google DeepMind to build a new AI-powered prototype aimed at faster housing decisions.
Read originalNVIDIA Blackwell Tops MLPerf Training 6.0 with Industry-Leading Scale and Performance
NVIDIA delivered a clean sweep in MLPerf Training v6.0, the latest edition of industry-standard AI training benchmarks developed by the MLCommons consortium....
Read originalPredicting LLM Safety Before Release by Simulating Deployment
Paper link Before releasing a new model, labs need to understand not just what it can do, but how it is likely to behave in real-world use, including where it might introduce new risks. This becomes even more important...
Read originalBuilding AI Agents for AR Glasses and XR Devices with NVIDIA XR AI
Developers building for AR glasses and wearable devices face an infrastructure gap. The hardware is ready, but creating AI experiences requires integrating live...
Read originalCan public chat data predict real-world AI misalignments?
Bridging private deployment evidence and public AI evaluation
Read originalBuild On-Device AI Companions with the NVIDIA ACE Game Agent SDK and Unreal Engine 5 Plugins
NVIDIA RTX technologies are deeply integrated into Unreal Engine 5 through the NVIDIA RTX Branch of Unreal Engine and the NVIDIA DLSS Unreal Engine plugin. This...
Read originalFrom pixels to planning: Earth AI for nature restoration
Climate & Sustainability
Read originalSecuring the future of AI agents
Securing internal systems with an AI Control Roadmap, combining traditional safeguards and real-time monitoring.
Read originalFrontier post-training recipe review with Finbarr Timbers
"Interview" #18
Read original[AINews] Satya on Loopcraft: Building Frontier Ecosystems
a quiet day lets us report on Satya's hit essay
Read originalSynthetic document finetuning for instilling positive traits
This is the fifth in a series of informal research updates from the Google DeepMind Language Model Interpretability team, in interpretability and adjacent areas. The fourth post can be found here . TLDR: Via adapting th...
Read originalBoosting MoE Training Throughput with Advanced Fusion Kernels
Mixture-of-experts (MoE) models have quickly become a foundational component of modern, large-scale AI systems. They are widely adopted because they enable...
Read originalFine-Tuning Biological Foundation Models with LoRA Using NVIDIA BioNeMo Recipes
Foundation models are reshaping computational biology. Pretrained on massive corpora of protein or genomic sequences, models such as ESM2 (a protein language...
Read originalCan SAEs Capture Neural Geometry?
AKA, how to use straight lines to capture curved geometry in neural networks
Read originalImport AI 461: “Alignment is not on track”; FrontierCode; and synthetic research interns
Welcome to Import AI, a newsletter about AI research. Import AI runs on arXiv, cappuccinos, and feedback from readers. If you’d like to support this, please subscribe. Subscribe now AI researchers launch new safety star...
Read originalWhy Do Naive SFT Filters For Safety Properties Fail?
This is the fourth in a series of informal research updates from the Google DeepMind Language Model Interpretability team, in interpretability and adjacent areas. The third post can be found here . Since SFT is the caus...
Read originalWelcome to the AGI era of AI governance
It's a one-way door and we weren't ready for it.
Read originalSFT Drives Gemini’s Safety Properties
This is the third in a series of informal research updates from the Google DeepMind Language Model Interpretability team, in interpretability and adjacent areas. The second post can be found here . In this short post, w...
Read originalSources
Frontier Labs
OpenAI Alignment Research Blog
OpenAI alignment research.
OpenAI Engineering
OpenAI engineering articles and system-building notes.
Anthropic Research
Anthropic research on alignment, interpretability, evaluations, and societal impacts.
Anthropic Engineering
Anthropic engineering team posts and system-building articles.
Anthropic Alignment Science
Anthropic alignment science, interpretability, and risk evaluation articles.
Google DeepMind Blog
Google DeepMind official blog.
Google Research Blog
Official Google Research blog.
Meta AI Blog
Meta AI official blog.
Microsoft Research Blog
Microsoft Research blog.
AI Safety and Governance
LessWrong
AI alignment, rationality, and AI risk discussion.
AI Alignment Forum
Technical AI alignment research community.
AI Alignment
Alignment essays and research posts.
MIRI Blog
Machine Intelligence Research Institute blog.
METR
Model evaluation and frontier-risk research.
METR Evaluations
METR evaluation reports.
Apollo Research Blog
Frontier AI risk, scheming, and evaluations.
Redwood Research Blog
AI risk and safety research.
FAR.AI Blog
AI safety and alignment research.
CAIS Blog
Center for AI Safety updates.
Goodfire Blog
Interpretability and model control.
Goodfire Research
Goodfire research index.
Epoch AI Blog
AI trends, compute, data, economics, and forecasting.
Epoch AI Latest
Unified stream for papers, newsletters, data insights, and podcasts.
AI Companies and Research Labs
Hugging Face Blog
Open-source models, Transformers, applications, and research.
Hugging Face Daily Papers
Daily AI paper discovery.
NVIDIA Technical Blog
GPU, CUDA, AI systems, and technical engineering posts.
NVIDIA Blog
NVIDIA news and applications.
Amazon Science Blog
Amazon research posts, including AI and machine learning.
AWS Machine Learning Blog
AWS ML engineering, product, and practice posts.
Apple Machine Learning Research
Apple machine learning research.
Cohere Blog
Cohere official blog.
Cohere Research
Cohere research posts.
Mistral AI News
Mistral official news and releases.
xAI News
xAI official news.
Academic Labs
BAIR Blog
Berkeley AI Research blog.
Stanford AI Lab Blog
Stanford AI Lab blog.
Stanford HAI News
Stanford HAI news and blog posts.
MIT CSAIL News
MIT CSAIL news.
MIT LINGO Blog
MIT Language and Intelligence group blog.
Personal Blogs and Newsletters
Lilian Weng - Lil'Log
Long-form posts on reinforcement learning, LLMs, agents, and alignment.
Jay Alammar Blog
Visual explanations for machine learning and Transformers.
Andrej Karpathy - Bear Blog
Andrej Karpathy's newer blog.
Andrej Karpathy - Old Blog
Older Karpathy blog posts.
Sebastian Ruder Blog
NLP and machine learning blog.
Sebastian Ruder Newsletter
NLP news and newsletter.
Import AI
Jack Clark's AI research and industry newsletter.
Import AI Substack
Substack version of Import AI.
Interconnects
Nathan Lambert's frontier AI research and industry newsletter.
Latent Space
AI Engineer newsletter and podcast.
DeepLearning.AI - The Batch
AI news digest.
Distill
Classic visual and explanatory machine learning articles.
The Gradient
AI research, society, and commentary.
