The last six months in LLMs in five minutes

Source

Site : Simon Willison
URL : https://simonwillison.net/2026/May/19/5-minute-llms/#atom-everything
Date de publication : 2026-05-19T01:09:44+00:00
Score d’intérêt : 65
Raisons : intérêt:llm, intérêt:agents, intérêt:benchmark, intérêt:python, intérêt:video, critique:critical, critique:rce, tendance:gpt
Sujets détectés : llm, agents, benchmark, python, video, critical

Résumé

I put together these annotated slides from my five minute lightning talk at PyCon US 2026, using the latest iteration of my annotated presentation tool . # I presented this lightning talk at PyCon US 2026, attempting to summarize the last six months of developments in LLMs in five minutes. # Six months is a pretty convenient time period to cover, because it captures what I've been calling the November 2025 inflection point . November was a critical month in LLMs, especially for coding. # For one thing, the supposedly "best" model (depending mostly on vibes) changed hands five times between the three big providers. # As always, I'm using my Generate an SVG of a pelican riding a bicycle test to help illustrate the differences between the models. Why this test? Because pelicans are hard to draw, bicycles are hard to draw, pelicans can't ride bicycles ... and there's zero chance any AI lab…

Pourquoi c’est intéressant pour Nico

Correspond aux centres d’intérêt détectés : cyber, IA/agents, dev, Linux, photo/vidéo selon les mots-clés et la source.
Priorisé par criticité, fraîcheur, redondance du sujet entre sources et poids éditorial de la source.

À creuser

Lire l’article complet.
Si sujet cyber critique : vérifier si un correctif, IOC, CVE ou mitigation est disponible.
Si sujet IA/dev : repérer code, modèle, benchmark, prix ou limites pratiques.