The Neuron Times — Lundi 8 juin 2026

À la Une · Stratégie & ProduitFront Page · Strategy & ProductSchlagzeilen · Strategie & ProduktPrima pagina · Strategia e ProdottoIn prima pagina · Strategia & Prodott

OpenAI enterre le « chat » et prépare une refonte totale de ChatGPT en super-applicativeOpenAI buries 'chat' and prepares a complete overhaul of ChatGPT into a super-appOpenAI begräbt den « Chat » und bereitet einen radikalen Umbau von ChatGPT zur Super-App vorOpenAI seppellisce la « chat » e prepara una rifondazione totale di ChatGPT in super-appOpenAI el sottera el « chat » e la prepara ona refonta totala de ChatGPT 'me super-aplicazzion

L'entreprise annonce la plus grande transformation de son produit phare depuis son lancement, intégrant agents, outils de codage et applications partenaires.The company announces the biggest transformation of its flagship product since launch, integrating agents, coding tools and partner applications.Das Unternehmen kündigt die grösste Transformation seines Flaggschiffprodukts seit dem Start an – mit integrierten Agenten, Codierungswerkzeugen und Partneranwendungen.L'azienda annuncia la più grande trasformazione del suo prodotto di punta dal lancio, integrando agenti, strumenti di codifica e applicazioni partner.L'azienda la anunzia la pussee granda trasformazion del sò prodott de bandera del sò lanzament, integrand agent, strument de codifica e applicazzion di partenari.

De la rédaction — 8 juin 2026From the editorial desk — 8 June 2026Von der Redaktion — 8. Juni 2026Dalla redazione — 8 giugno 2026De la redazzion — 8 giugno 2026

OpenAI prépare la refonte la plus ambitieuse de ChatGPT depuis sa création, selon des informations rapportées dimanche par plusieurs médias. « Chat is dead », aurait déclaré en interne un cadre senior de l'entreprise, résumant une stratégie qui vise à transformer l'interface conversationnelle en une « super-app » intégrant des agents autonomes, des outils de développement et des applications tierces comme Canva et Booking.com. L'information, révélée par The Decoder et confirmée par TechCrunch, marque un tournant stratégique majeur pour la start-up valorisée à plusieurs centaines de milliards de dollars.OpenAI is preparing the most ambitious overhaul of ChatGPT since its creation, according to reports published Sunday by several media outlets. 'Chat is dead,' a senior company executive reportedly said internally, summing up a strategy aimed at transforming the conversational interface into a 'super-app' integrating autonomous agents, development tools and third-party applications like Canva and Booking.com. The information, revealed by The Decoder and confirmed by TechCrunch, marks a major strategic turning point for the startup valued at several hundred billion dollars.OpenAI bereitet den ambitioniertesten Umbau von ChatGPT seit seiner Gründung vor, wie mehrere Medien am Sonntag übereinstimmend berichteten. « Chat is dead », soll ein leitender Angestellter intern erklärt haben – eine Strategie, die darauf abzielt, die reine Chat-Oberflfläche in eine « Super-App » mit autonomen Agenten, Entwicklungswerkzeugen und Drittanbieter-Apps wie Canva und Booking.com zu verwandeln. Die Information, enthüllt von The Decoder und bestätigt durch TechCrunch, markiert eine strategische Kehrtwende für das mit mehreren hundert Milliarden Dollar bewertete Start-up.OpenAI prepara la rifondazione più ambiziosa di ChatGPT dalla sua creazione, secondo informazioni riportate domenica da diversi media. « Chat is dead », avrebbe dichiarato internamente un alto dirigente dell'azienda, riassumendo una strategia che mira a trasformare l'interfaccia conversazionale in una « super-app » che integra agenti autonomi, strumenti di sviluppo e applicazioni terze come Canva e Booking.com. L'informazione, rivelata da The Decoder e confermata da TechCrunch, segna una svolta strategica importante per la start-up valutata diverse centinaia di miliardi di dollari.OpenAI la prepara la refonta pussee ambiziosa de ChatGPT de la soa creazzion, segond informazion riportade domenega de vari media. « Chat is dead », l'avariaa dii in d'on cadre senior de l'azienda, resumend ona strategia che la mira a trasformà l'interfaccia conversazionala in ona « super-app » che la integra di agent autonom, di strument de desvilupp e di applicazzion de terz 'me Canva e Booking.com. L'informazion, revelada de The Decoder e confermada de TechCrunch, la marca on svolta strategica magior per la start-up valorizada a pussee de centener de miliard de dollar.

Dans le même temps, OpenAI a déployé un « Lockdown Mode » pour ChatGPT, une fonctionnalité de sécurité qui désactive l'accès au web, le mode Deep Research et le mode Agent afin de protéger les données sensibles contre les attaques par injection de prompts. Comme le détaille The Decoder, ce mode ne bloque pas entièrement les injections mais interrompt la dernière étape de la chaîne d'exfiltration, un aveu que le problème reste structurellement non résolu dans l'industrie.At the same time, OpenAI has deployed a 'Lockdown Mode' for ChatGPT, a security feature that disables web access, Deep Research mode and Agent mode in order to protect sensitive data against prompt injection attacks. As detailed by The Decoder, this mode does not entirely block injections but interrupts the final step of the exfiltration chain, an admission that the problem remains structurally unresolved across the industry.Gleichzeitig hat OpenAI einen « Lockdown Mode » für ChatGPT eingeführt – eine Sicherheitsfunktion, die den Webzugriff, den Deep-Research-Modus und den Agentenmodus deaktiviert, um sensible Daten vor Prompt-Injection-Angriffen zu schützen. Wie The Decoder ausführt, blockiert dieser Modus die Injektionen nicht vollständig, unterbricht aber die letzte Stufe der Exfiltrationskette – ein Eingeständnis, dass das Problem in der Branche strukturell ungelöst bleibt.Nel contempo, OpenAI ha implementato una « Lockdown Mode » per ChatGPT, una funzionalità di sicurezza che disabilita l'accesso al web, la modalità Deep Research e la modalità Agente per proteggere i dati sensibili dagli attacchi di injection di prompt. Come dettaglia The Decoder, questa modalità non blocca interamente le injection ma interrompe l'ultima fase della catena di esfiltrazione, un'ammissione che il problema rimane strutturalmente irrisolto nel settore.In del midem temp, OpenAI l'ha desplegaa on « Lockdown Mode » per ChatGPT, ona fonzionalità de sigurezza che la disabilità l'access al web, el mode Deep Research e el mode Agent per protesg i dat sensibil di atacch per iniezzion de prompt. 'Me el detaja The Decoder, 'sto mode el blocca no minga del tutt i iniezzion ma el interomp l'ultema tappa de la cadena d'esfiltrazzion, ona amission che el problema el resta strutturalment minga risolt in de l'industria.

Cette double annonce — refonte produit et verrouillage sécuritaire — intervient alors qu'OpenAI et son rival Anthropic accélèrent leurs préparatifs d'introduction en Bourse. La guerre des talents s'intensifie : Anthropic a débauché Clive Chan, le deuxième ingénieur puce d'OpenAI, comme l'a rapporté The Decoder, signalant que les deux entreprises investissent massivement dans le hardware sur mesure pour soutenir leurs ambitions d'agents autonomes.This dual announcement — product overhaul and security lockdown — comes as OpenAI and its rival Anthropic accelerate their IPO preparations. The war for talent is intensifying: Anthropic has poached Clive Chan, OpenAI's second chip engineer, as reported by The Decoder, signaling that both companies are investing heavily in custom hardware to support their autonomous agent ambitions.Diese Doppelankündigung – Produktumbau und Sicherheitssperre – erfolgt zu einem Zeitpunkt, da OpenAI und sein Rivale Anthropic ihre Vorbereitungen für Börsengänge beschleunigen. Der Kampf um Talente verschärft sich: Anthropic hat Clive Chan abgeworben, den zweiten Chip-Ingenieur von OpenAI, wie The Decoder berichtete – ein Zeichen dafür, dass beide Unternehmen massiv in massgeschneiderte Hardware investieren, um ihre Ambitionen im Bereich autonomer Agenten zu untermauern.Questo doppio annuncio — rifondazione del prodotto e blocco di sicurezza — arriva mentre OpenAI e il suo rivale Anthropic accelerano i preparativi per l'offerta pubblica iniziale. La guerra dei talenti si intensifica: Anthropic ha sottratto Clive Chan, il secondo ingegnere chip di OpenAI, come riportato da The Decoder, segnalando che entrambe le aziende investono massicciamente nell'hardware su misura per sostenere le loro ambizioni di agenti autonomi.Questa doppia anunzia — refonta del prodott e blocch de sigurezza — la riva intant che OpenAI e 'l sò rivai Anthropic i accelera i sò preparativ per l'introduzzion in Borsa. La guerra di talent la se intensifica: Anthropic l'ha sgraffignaa Clive Chan, el segond ingegnee de cip de OpenAI, 'me l'ha reportaa The Decoder, segnaland che i duu aziend i investiss in manera massiccia in del hardware su misura per sostegnì i sò ambizion d'agente autonom.

RechercheResearchForschungRicercaRicerca

Quand les outils des agents LLM échouent : un benchmark mesure la capacité de replanificationWhen LLM agent tools fail: a benchmark measures replanning capabilityWenn die Werkzeuge von LLM-Agenten versagen: Ein Benchmark misst die Fähigkeit zur NeuplanungQuando gli strumenti degli agenti LLM falliscono: un benchmark misura la capacità di riprogrammazioneQuand i strument di agent LLM i falliss: on benchmark el misura la capacità de re-pianificazzion

From the Wires — 8 juin 2026

From the Wires — 8 June 2026

From the Wires — 8. Juni 2026

Dagli studi — From the Wires — 8 giugno 2026

From the Wires — 8 giugno 2026

Un nouveau papier publié sur Hugging Face Daily Papers et signé par des chercheurs de Baidu introduit un benchmark évaluant la capacité des agents LLM à détecter et contourner des défaillances d'outils. Intitulé « When Tools Fail: Benchmarking Dynamic Replanning and Anomaly Recovery in LLM Agents », le travail propose un protocole systématique pour mesurer la replanification dynamique, un chaînon manquant dans l'évaluation des agents autonomes.

A new paper published on Hugging Face Daily Papers and authored by researchers at Baidu introduces a benchmark evaluating the ability of LLM agents to detect and circumvent tool failures. Titled 'When Tools Fail: Benchmarking Dynamic Replanning and Anomaly Recovery in LLM Agents,' the work proposes a systematic protocol for measuring dynamic replanning, a missing link in the evaluation of autonomous agents.

Ein neues auf Hugging Face Daily Papers veröffentlichtes Paper von Forschern von Baidu führt einen Benchmark ein, der die Fähigkeit von LLM-Agenten bewertet, Werkzeugfehler zu erkennen und zu umgehen. Der unter dem Titel « When Tools Fail: Benchmarking Dynamic Replanning and Anomaly Recovery in LLM Agents » veröffentlichte Beitrag schlägt ein systematisches Protokoll zur Messung der dynamischen Neuplanung vor – ein fehlendes Glied in der Bewertung autonomer Agenten.

Un nuovo articolo pubblicato su Hugging Face Daily Papers e firmato da ricercatori di Baidu introduce un benchmark che valuta la capacità degli agenti LLM di rilevare e aggirare i guasti degli strumenti. Intitolato « When Tools Fail: Benchmarking Dynamic Replanning and Anomaly Recovery in LLM Agents », il lavoro propone un protocollo sistematico per misurare la riprogrammazione dinamica, un anello mancante nella valutazione degli agenti autonomi.

On noeuv paper publicaa in su Hugging Face Daily Papers e firmmaa di ricercador de Baidu l'introdus on benchmark che 'l valuta la capacità di agent LLM de rilevà e contornà di falliment de strument. Intitolaa « When Tools Fail: Benchmarking Dynamic Replanning and Anomaly Recovery in LLM Agents », el laurà el propon on protocoll sistemategh per misurà la re-pianificazzion dinamega, ona cadenon mancant in de la valutazion di agent autonom.

InfrastructureInfrastructureInfrastrukturInfrastrutturaInfrastruttura

Perplexity dévoile « Search as Code » : l'IA écrit ses propres pipelines de recherchePerplexity unveils 'Search as Code': AI writes its own search pipelinesPerplexity stellt « Search as Code » vor: KI schreibt ihre eigenen Such-PipelinesPerplexity svela « Search as Code »: l'IA scrive le proprie pipeline di ricercaPerplexity la desvela « Search as Code »: l'IA la scriv i sò propri pipeline de ricerca

From the Wires — 7 juin 2026

From the Wires — 7 June 2026

From the Wires — 7. Juni 2026

Dagli studi — From the Wires — 7 giugno 2026

From the Wires — 7 giugno 2026

Perplexity a présenté une nouvelle architecture baptisée « Search as Code » qui permet aux modèles d'IA de générer dynamiquement leurs propres routines de recherche en Python, plutôt que d'appeler des API de recherche figées. Selon The Decoder, le système surpasse OpenAI et Anthropic sur des benchmarks clés tout en réduisant les coûts en tokens jusqu'à 85 %, en laissant l'agent gérer lui-même le filtrage et la déduplication dans un environnement sandboxé.

Perplexity has introduced a new architecture called 'Search as Code' that allows AI models to dynamically generate their own search routines in Python, rather than calling fixed search APIs. According to The Decoder, the system outperforms OpenAI and Anthropic on key benchmarks while reducing token costs by up to 85%, by letting the agent handle filtering and deduplication itself in a sandboxed environment.

Perplexity hat eine neue Architektur namens « Search as Code » vorgestellt, die es KI-Modellen ermöglicht, ihre eigenen Suchroutinen dynamisch in Python zu generieren, anstatt feste Such-APIs aufzurufen. Laut The Decoder übertrifft das System OpenAI und Anthropic in wichtigen Benchmarks und senkt gleichzeitig die Token-Kosten um bis zu 85 %, indem der Agent die Filterung und Deduplizierung in einer Sandbox-Umgebung selbst verwaltet.

Perplexity ha presentato una nuova architettura chiamata « Search as Code » che consente ai modelli di IA di generare dinamicamente le proprie routine di ricerca in Python, invece di chiamare API di ricerca fisse. Secondo The Decoder, il sistema supera OpenAI e Anthropic su benchmark chiave riducendo al contempo i costi in token fino all'85%, lasciando che l'agente gestisca da sé il filtraggio e la deduplicazione in un ambiente sandbox.

Perplexity l'ha presentaa ona noeuva architettura ciamada « Search as Code » che la permet ai model de IA de generà dinamegament i sò propri routine de ricerca in Python, inveci de ciamà di API de ricerca fiss. Segond The Decoder, el sistema el supera OpenAI e Anthropic in su di benchmark ciav intant che 'l ridus i cost in token fina a l'85%, lassand che l'agent el gestissa lu midem el filtrà e la deduplicazzion in d'on ambient sandboxaa.

ÉconomieEconomyWirtschaftEconomiaEconomia

DeepSeek en tête des fournisseurs de logiciels tendance chez Ramp, signe de la chasse au coût de l'IADeepSeek tops Ramp's trending software vendors, signaling the hunt for AI cost savingsDeepSeek an der Spitze der Trend-Softwareanbieter bei Ramp – Zeichen der Kostensuche in der KIDeepSeek in testa ai fornitori di software di tendenza su Ramp, segno della caccia al costo dell'IADeepSeek in testa ai fornitor de software tendenza de Ramp, segn de la cascia al cost de l'IA

From the Wires — 7 juin 2026

From the Wires — 7 June 2026

From the Wires — 7. Juni 2026

Dagli studi — From the Wires — 7 giugno 2026

From the Wires — 7 giugno 2026

DeepSeek a dominé le classement des fournisseurs de logiciels tendance de Ramp en juin 2026, les entreprises américaines se tournant massivement vers des alternatives chinoises moins coûteuses. L'économiste en chef de Ramp, Ara Kharazian, cité par The Decoder, souligne une prise de conscience croissante des coûts, mais met en garde contre les risques de sécurité liés à l'envoi de données vers des modèles chinois.

DeepSeek topped Ramp's ranking of trending software vendors in June 2026, as US companies increasingly turn to cheaper Chinese alternatives. Ramp's chief economist, Ara Kharazian, cited by The Decoder, highlights a growing awareness of costs but warns of security risks associated with sending data to Chinese models.

DeepSeek hat im Juni 2026 das Ranking der trendenden Softwareanbieter von Ramp angeführt, da US-Unternehmen massenhaft auf günstigere chinesische Alternativen umsteigen. Ramp-Chefökonom Ara Kharazian, zitiert von The Decoder, betont ein wachsendes Kostenbewusstsein, warnt aber vor den Sicherheitsrisiken, die mit der Übermittlung von Daten an chinesische Modelle verbunden sind.

DeepSeek ha dominato la classifica dei fornitori di software di tendenza di Ramp a giugno 2026, con le aziende americane che si rivolgono massicciamente ad alternative cinesi meno costose. Il capo economista di Ramp, Ara Kharazian, citato da The Decoder, sottolinea una crescente consapevolezza dei costi, ma mette in guardia contro i rischi per la sicurezza legati all'invio di dati a modelli cinesi.

DeepSeek l'ha dominaa la classifica di fornitor de software tendenza de Ramp in del sgiugn 2026, con i aziend american che se volten in manera massiccia vers di alternativ cines pussee a bon mercaa. L'economista in capo de Ramp, Ara Kharazian, citaa de The Decoder, el sottolinea ona cosscienza cressenta di cost, ma 'l met in guardia contra i ris'c de sigurezza ligaa al mandà di dat vers di model cines.

RechercheResearchForschungRicercaRicerca

Pourquoi les grands modèles apprennent des compétences que les petits ratentWhy large models learn skills that small ones missWarum grosse Modelle Fähigkeiten erlernen, die kleinen entgehenPerché i grandi modelli apprendono competenze che i piccoli perdonoPerchè i grand model i imparen di competenze che i piscininn i perden

From the Wires — 7 juin 2026

From the Wires — 7 June 2026

From the Wires — 7. Juni 2026

Dagli studi — From the Wires — 7 giugno 2026

From the Wires — 7 giugno 2026

Une étude menée sur des modèles allant de 4 millions à 4 milliards de paramètres, rapportée par The Decoder, identifie le mécanisme par lequel les petits modèles échouent sur des tâches rares : les tâches fréquentes écrasent en permanence ce qu'ils ont appris. Les chercheurs proposent une solution pratique : plutôt que d'augmenter la taille du modèle, il suffirait d'accroître la fréquence d'apparition de la tâche cible dans les données d'entraînement.

A study conducted on models ranging from 4 million to 4 billion parameters, reported by The Decoder, identifies the mechanism by which small models fail on rare tasks: frequent tasks permanently overwrite what they have learned. The researchers propose a practical solution: rather than increasing model size, it would suffice to increase the frequency of the target task in the training data.

Eine Studie an Modellen von 4 Millionen bis 4 Milliarden Parametern, berichtet von The Decoder, identifiziert den Mechanismus, warum kleine Modelle bei seltenen Aufgaben versagen: Häufige Aufgaben überschreiben permanent das Gelernte. Die Forscher schlagen eine praktische Lösung vor: Statt die Modellgrösse zu erhöhen, genüge es, die Häufigkeit der Zielaufgabe in den Trainingsdaten zu steigern.

Uno studio condotto su modelli da 4 milioni a 4 miliardi di parametri, riportato da The Decoder, identifica il meccanismo per cui i piccoli modelli falliscono su compiti rari: i compiti frequenti sovrascrivono permanentemente ciò che hanno appreso. I ricercatori propongono una soluzione pratica: invece di aumentare la dimensione del modello, basterebbe aumentare la frequenza di comparsa del compito target nei dati di addestramento.

On studi faa in su di model che van de 4 milion a 4 miliard de parametri, reportaa de The Decoder, l'identifica el mecanism per el qual i piscininn model i falliss in su di compit rar: i compit frequent i scancellen in manera permanent quell che hann imparaa. I ricercador i proponn ona soluzzion pratica: inveci de aumentà la grandezza del model, el basteria aumentà la frequenza de comparsa del compit de destin in di dat d'addestrament.

Page 1 — Page 1 — Seite 1 — Pagina 1 — Pagina 1 — À la UneFront PageTitelgeschichteIn Primo PianoIn Prima Pagina

I. Modèles & FrontièreModels & FrontierModelle & GrenzbereichModelli e FrontieraModel & Frontiera

Refonte produit

Product overhaul

Produktumbau

Rifondazione prodotto

Refonta del prodott

OpenAI prépare une « super-app » qui enterre le chatOpenAI prepares a 'super-app' that buries chatOpenAI bereitet eine « Super-App » vor, die den Chat begräbtOpenAI prepara una « super-app » che seppellisce la chatOpenAI la prepara ona « super-app » che la sottera el chat

OpenAI planifie la transformation la plus radicale de ChatGPT depuis son lancement. Selon des informations concordantes de The Decoder et TechCrunch, l'application deviendrait une plateforme intégrant des agents autonomes, des outils de codage et des applications partenaires comme Canva et Booking.com. « Chat is dead », aurait confié un cadre dirigeant, signalant l'abandon du paradigme conversationnel pur au profit d'une interface orientée tâches. L'entreprise n'a pas encore commenté officiellement ces fuites.

OpenAI is planning the most radical transformation of ChatGPT since its launch. According to corroborating reports from The Decoder and TechCrunch, the app would become a platform integrating autonomous agents, coding tools and partner applications like Canva and Booking.com. 'Chat is dead,' a senior executive reportedly said, signaling the abandonment of the pure conversational paradigm in favor of a task-oriented interface. The company has not yet officially commented on these leaks.

OpenAI plant die radikalste Transformation von ChatGPT seit seiner Einführung. Übereinstimmenden Berichten von The Decoder und TechCrunch zufolge soll die App zu einer Plattform werden, die autonome Agenten, Codierungswerkzeuge und Partneranwendungen wie Canva und Booking.com integriert. « Chat is dead », soll ein leitender Angestellter vertraulich geäussert haben – ein Signal für die Abkehr vom reinen Konversationsparadigma hin zu einer aufgabenorientierten Schnittstelle. Das Unternehmen hat die Durchsickerungen offiziell noch nicht kommentiert.

OpenAI pianifica la trasformazione più radicale di ChatGPT dal suo lancio. Secondo informazioni concordanti di The Decoder e TechCrunch, l'applicazione diventerebbe una piattaforma che integra agenti autonomi, strumenti di codifica e applicazioni partner come Canva e Booking.com. « Chat is dead », avrebbe confidato un alto dirigente, segnalando l'abbandono del paradigma conversazionale puro a favore di un'interfaccia orientata ai compiti. L'azienda non ha ancora commentato ufficialmente queste fughe di notizie.

OpenAI la pianifica la trasformazion pussee radical de ChatGPT del sò lanzament. Segond di informazion concordant de The Decoder e TechCrunch, l'applicazzion la deventaria ona piattaforma che la integra di agent autonom, di strument de codifica e di applicazzion de partenari 'me Canva e Booking.com. « Chat is dead », l'avariaa confidaa on cadre dirigent, segnaland l'abandon del paradigma conversazional pur a favor d'on'interfaccia orientada ai compit. L'azienda l'ha anmò no comenta officialment 'sti fuite.

Sécurité

Security

Sicherheit

Sicurezza

Sigurezza

Lockdown Mode : OpenAI verrouille ChatGPT contre les injections de promptsLockdown Mode: OpenAI locks down ChatGPT against prompt injectionsLockdown Mode: OpenAI sperrt ChatGPT gegen Prompt-InjectionLockdown Mode: OpenAI blocca ChatGPT contro le injection di promptLockdown Mode: OpenAI el blocca ChatGPT contra i iniezzion de prompt

OpenAI a déployé un nouveau mode de sécurité pour ChatGPT, le Lockdown Mode, qui désactive l'accès web, Deep Research et le mode Agent. Comme l'explique The Decoder, la fonctionnalité ne prévient pas entièrement les attaques par injection de prompts, mais bloque la dernière étape de la chaîne d'exfiltration de données. Un constat qui rappelle que l'injection de prompts reste un problème ouvert pour toute l'industrie.

OpenAI has deployed a new security mode for ChatGPT, Lockdown Mode, which disables web access, Deep Research and Agent mode. As The Decoder explains, the feature does not fully prevent prompt injection attacks, but blocks the final step of the data exfiltration chain. A reminder that prompt injection remains an open problem for the entire industry.

OpenAI hat einen neuen Sicherheitsmodus für ChatGPT eingeführt, den Lockdown Mode, der den Webzugriff, Deep Research und den Agentenmodus deaktiviert. Wie The Decoder erläutert, verhindert die Funktion Prompt-Injection-Angriffe nicht vollständig, blockiert aber die letzte Stufe der Datencxfiltrationskette. Eine Erkenntnis, die daran erinnert, dass Prompt-Injection ein offenes Problem für die gesamte Branche bleibt.

OpenAI ha implementato una nuova modalità di sicurezza per ChatGPT, la Lockdown Mode, che disabilita l'accesso web, Deep Research e la modalità Agente. Come spiega The Decoder, la funzionalità non previene interamente gli attacchi di injection di prompt, ma blocca l'ultima fase della catena di esfiltrazione dei dati. Una constatazione che ricorda che l'injection di prompt rimane un problema aperto per tutto il settore.

OpenAI l'ha desplegaa on noeuv mode de sigurezza per ChatGPT, el Lockdown Mode, che 'l disabilità l'access web, Deep Research e 'l mode Agent. 'Me 'l spiega The Decoder, la fonzionalità la preveniss no minga del tutt i atacch per iniezzion de prompt, ma la blocca l'ultema tappa de la cadena d'esfiltrazion di dat. On constat che 'l regorda che l'iniezzion de prompt la resta on problema avert per tutta l'industria.

Guerre des talents

War for talent

Talentkrieg

Guerra dei talenti

Guerra di talent

Anthropic débauche le deuxième ingénieur puce d'OpenAIAnthropic poaches OpenAI's second chip engineerAnthropic wirbt zweiten Chip-Ingenieur von OpenAI abAnthropic sottrae il secondo ingegnere chip di OpenAIAnthropic el sgraffigna el segond ingegnee de cip d'OpenAI

Clive Chan, qui se présentait comme le deuxième employé dédié aux puces sur mesure chez OpenAI, rejoint Anthropic. Il apporte une expérience acquise chez Tesla (Autopilot ASIC) et dans le cadre du partenariat OpenAI-Broadcom. The Decoder rapporte que ce mouvement intervient alors que les deux entreprises se préparent à leurs introductions en Bourse, et qu'Anthropic envisagerait de développer ses propres circuits intégrés pour l'IA.

Clive Chan, who described himself as the second employee dedicated to custom chips at OpenAI, is joining Anthropic. He brings experience gained at Tesla (Autopilot ASIC) and through the OpenAI-Broadcom partnership. The Decoder reports that this move comes as both companies prepare for their IPOs, and that Anthropic is considering developing its own custom integrated circuits for AI.

Clive Chan, der sich selbst als den zweiten Mitarbeiter für massgeschneiderte Chips bei OpenAI bezeichnete, wechselt zu Anthropic. Er bringt Erfahrung von Tesla (Autopilot ASIC) und aus der OpenAI-Broadcom-Partnerschaft mit. The Decoder berichtet, dass dieser Wechsel erfolgt, während sich beide Unternehmen auf ihre Börsengänge vorbereiten und Anthropic erwägt, eigene integrierte Schaltkreise für KI zu entwickeln.

Clive Chan, che si presentava come il secondo dipendente dedicato ai chip su misura presso OpenAI, si unisce ad Anthropic. Porta un'esperienza maturata presso Tesla (Autopilot ASIC) e nell'ambito della partnership OpenAI-Broadcom. The Decoder riporta che questo movimento avviene mentre entrambe le aziende si preparano alle loro offerte pubbliche iniziali, e che Anthropic starebbe valutando di sviluppare propri circuiti integrati per l'IA.

Clive Chan, che 'l se presentava 'me 'l segond impiegaa dedicaa ai cip su misura de OpenAI, el va dent in Anthropic. El porta on'esperienza ciapada in Tesla (Autopilot ASIC) e in del quadri del partenariad OpenAI-Broadcom. The Decoder el reporta che 'sto moviment el riva intant che i duu aziend i se preparen a i sò introduzzion in Borsa, e che Anthropic el considerariss de desviluppà i sò propri circuitt integraa per l'IA.

Économie

Economy

Wirtschaft

Economia

L'ombre de la « Tokenpocalypse » plane sur l'industrieThe shadow of the 'Tokenpocalypse' looms over the industryDer Schatten der « Tokenpocalypse » liegt über der BrancheL'ombra della « Tokenpocalypse » aleggia sul settoreL'ombra de la « Tokenpocalypse » la plana in su l'industria

Alors que les grandes entreprises d'IA préparent leurs introductions en Bourse, une hausse généralisée des prix des API semble inévitable, analyse TechCrunch. L'article évoque l'aube d'une « Tokenpocalypse » où la pression des actionnaires pousserait les labos à augmenter leurs tarifs, un scénario qui inquiète les développeurs et les entreprises dépendantes des modèles frontier.

As major AI companies prepare for their IPOs, a widespread increase in API prices seems inevitable, TechCrunch analyzes. The article evokes the dawn of a 'Tokenpocalypse' where shareholder pressure would push labs to raise their prices, a scenario that worries developers and companies dependent on frontier models.

Während die grossen KI-Unternehmen ihre Börsengänge vorbereiten, scheint ein flächendeckender Preisanstieg bei APIs unvermeidlich, analysiert TechCrunch. Der Artikel spricht von der Morgendämmerung einer « Tokenpocalypse », in der der Druck der Aktionäre die Labore zu Preiserhöhungen treiben würde – ein Szenario, das Entwickler und Unternehmen beunruhigt, die von Frontier-Modellen abhängig sind.

Mentre le grandi aziende di IA preparano le loro offerte pubbliche iniziali, un aumento generalizzato dei prezzi delle API sembra inevitabile, analizza TechCrunch. L'articolo evoca l'alba di una « Tokenpocalypse » in cui la pressione degli azionisti spingerebbe i laboratori ad aumentare le loro tariffe, uno scenario che preoccupa gli sviluppatori e le aziende dipendenti dai modelli frontier.

Intant che i grand aziend de IA i preparen i sò introduzzion in Borsa, ona cressita generalizada di prezzi di API la par inevitabel, l'analisa TechCrunch. L'articol l'evoca l'alba d'ona « Tokenpocalypse » indè che la pression di azionista la spingaria i laboratori a aumentà i sò tariff, on scenari che 'l preocupia i desviluppador e i aziend dipendent di model frontier.

Page 2 — Page 2 — Seite 2 — Pagina 2 — Pagina 2 — Le Cahier TechniqueTech NotebookDas Technische HeftIl Quaderno TecnicoEl Quadern Tecnegh

II. Moteurs d'inférenceInference EnginesInferenz-EnginesMotori di inferenzaMotor d'inferenza

llama.cpp

llama.cpp b9553 assouplit la reconnaissance des noms d'échantillonneursllama.cpp b9553 relaxes sampler name recognitionllama.cpp b9553 lockert die Erkennung von Sampler-Namenllama.cpp b9553 allenta il riconoscimento dei nomi dei campionatorillama.cpp b9553 el slarga la riconoscenza di nomm di campionador

La version b9553 de llama.cpp, publiée le 7 juin, apporte une modification importante à la fonction de reconnaissance des noms d'échantillonneurs. Désormais, les noms alternatifs comme « top-k » et « min-p » sont systématiquement acceptés aux côtés des noms canoniques « top_k » et « min_p », et la correspondance est insensible à la casse. Cette amélioration corrige un problème où l'interface llama-server rejetait des échantillonneurs valides. La release inclut également des binaires pour macOS (Apple Silicon et Intel), Linux (CPU, Vulkan, ROCm 7.2, OpenVINO) et Android arm64.

Version b9553 of llama.cpp, released on June 7, brings a significant change to the sampler name recognition function. Alternative names like 'top-k' and 'min-p' are now systematically accepted alongside the canonical names 'top_k' and 'min_p', and matching is case-insensitive. This improvement fixes an issue where the llama-server interface rejected valid samplers. The release also includes binaries for macOS (Apple Silicon and Intel), Linux (CPU, Vulkan, ROCm 7.2, OpenVINO) and Android arm64.

Die am 7. Juni veröffentlichte Version b9553 von llama.cpp bringt eine wichtige Änderung der Funktion zur Erkennung von Sampler-Namen. Künftig werden alternative Namen wie « top-k » und « min-p » systematisch neben den kanonischen Namen « top_k » und « min_p » akzeptiert, und der Abgleich erfolgt gross-/kleinschreibungsunabhängig. Diese Verbesserung behebt ein Problem, bei dem die llama-server-Schnittstelle gültige Sampler zurückwies. Das Release enthält auch Binärdateien für macOS (Apple Silicon und Intel), Linux (CPU, Vulkan, ROCm 7.2, OpenVINO) und Android arm64.

La versione b9553 di llama.cpp, pubblicata il 7 giugno, introduce una modifica importante alla funzione di riconoscimento dei nomi dei campionatori. D'ora in poi, i nomi alternativi come « top-k » e « min-p » sono sistematicamente accettati accanto ai nomi canonici « top_k » e « min_p », e la corrispondenza è insensibile alle maiuscole. Questo miglioramento corregge un problema per cui l'interfaccia llama-server rifiutava campionatori validi. La release include anche binari per macOS (Apple Silicon e Intel), Linux (CPU, Vulkan, ROCm 7.2, OpenVINO) e Android arm64.

La version b9553 de llama.cpp, publicada el 7 de sgiugn, la porta ona modifega importanta a la fonzion de riconoscenza di nomm di campionador. D'ora inanz, i nomm alternativ 'me « top-k » e « min-p » hinn sistemategament acetaa insema ai nomm canonegh « top_k » e « min_p », e la corispondenza l'è minga sensibil a la maiuscola. 'Sto migliorament el coregg on problema indè che l'interfaccia llama-server la refudava di campionador valid. La release l'includ anca di binari per macOS (Apple Silicon e Intel), Linux (CPU, Vulkan, ROCm 7.2, OpenVINO) e Android arm64.

Ollama

Ollama v0.30.7-rc1 : chemin Hermes natif Windows et documentation ZodOllama v0.30.7-rc1: native Windows Hermes path and Zod documentationOllama v0.30.7-rc1: Nativer Windows-Hermes-Pfad und Zod-DokumentationOllama v0.30.7-rc1: percorso Hermes nativo Windows e documentazione ZodOllama v0.30.7-rc1: camin Hermes nativ Windows e documentazzion Zod

La release candidate v0.30.7-rc1 d'Ollama, publiée le 7 juin, introduit l'utilisation du chemin de configuration Hermes natif Windows et met à jour l'exemple Zod pour utiliser le schéma JSON natif. Une version principalement corrective qui prépare le terrain pour une release stable.

Release candidate v0.30.7-rc1 of Ollama, published on June 7, introduces the use of the native Windows Hermes configuration path and updates the Zod example to use native JSON schema. A primarily corrective release that paves the way for a stable release.

Die am 7. Juni veröffentlichte Release Candidate v0.30.7-rc1 von Ollama führt die Verwendung des nativen Windows-Hermes-Konfigurationspfads ein und aktualisiert das Zod-Beispiel auf das native JSON-Schema. Eine hauptsächlich korrigierende Version, die den Weg für ein stabiles Release ebnet.

La release candidate v0.30.7-rc1 di Ollama, pubblicata il 7 giugno, introduce l'utilizzo del percorso di configurazione Hermes nativo Windows e aggiorna l'esempio Zod per utilizzare lo schema JSON nativo. Una versione principalmente correttiva che prepara il terreno per una release stabile.

La release candidate v0.30.7-rc1 d'Ollama, publicada el 7 de sgiugn, l'introdus l'usagg del camin de configurazzion Hermes nativ Windows e la met a gh'è l'esempi Zod per doperà el schema JSON nativ. Ona version principalment corretiva che la prepara el terren per ona release stabel.

III. Harnesses & CLIHarnesses & CLIHarnesses & CLIHarnesses e CLIHarnesses & CLI

Rapid-MLX

Rapid-MLX v0.6.82 corrige le streaming avec tool_choiceRapid-MLX v0.6.82 fixes streaming with tool_choiceRapid-MLX v0.6.82 behebt Streaming-Fehler mit tool_choiceRapid-MLX v0.6.82 corregge lo streaming con tool_choiceRapid-MLX v0.6.82 el coregg el streaming con tool_choice

La version v0.6.82 de Rapid-MLX, publiée le 7 juin, corrige un bug de streaming lorsque tool_choice est défini sur « required » avec parallel_tool_calls, et ajoute un mécanisme de « harmony escape hatch ». Disponible via Homebrew ou pip.

Version v0.6.82 of Rapid-MLX, released on June 7, fixes a streaming bug when tool_choice is set to 'required' with parallel_tool_calls, and adds a 'harmony escape hatch' mechanism. Available via Homebrew or pip.

Die am 7. Juni veröffentlichte Version v0.6.82 von Rapid-MLX behebt einen Streaming-Fehler, wenn tool_choice auf « required » mit parallel_tool_calls gesetzt ist, und fügt einen « Harmony Escape Hatch »-Mechanismus hinzu. Verfügbar über Homebrew oder pip.

La versione v0.6.82 di Rapid-MLX, pubblicata il 7 giugno, corregge un bug di streaming quando tool_choice è impostato su « required » con parallel_tool_calls, e aggiunge un meccanismo di « harmony escape hatch ». Disponibile tramite Homebrew o pip.

La version v0.6.82 de Rapid-MLX, publicada el 7 de sgiugn, la coregg on bug de streaming quand che tool_choice l'è metuu in su « required » con parallel_tool_calls, e la gionta on mecanism de « harmony escape hatch ». Disponibel via Homebrew o pip.

Cline

Cline v3.88.1 ajoute une section debug pour les testeursCline v3.88.1 adds debug section for testersCline v3.88.1 fügt Debug-Bereich für Tester hinzuCline v3.88.1 aggiunge una sezione debug per i testerCline v3.88.1 el gionta ona sezzion debug per i tester

La version v3.88.1 de l'extension VS Code Cline, datée du 7 juin, introduit une section de débogage dans les paramètres destinée aux testeurs et corrige l'inclusion des fichiers walkthrough dans le package d'extension pour que le tutoriel de premier lancement s'affiche correctement.

Version v3.88.1 of the Cline VS Code extension, dated June 7, introduces a debug section in settings for testers and fixes the inclusion of walkthrough files in the extension package so that the first-launch tutorial displays correctly.

Die am 7. Juni datierte Version v3.88.1 der VS-Code-Erweiterung Cline führt einen Debug-Bereich in den Einstellungen für Tester ein und behebt die Aufnahme von Walkthrough-Dateien in das Erweiterungspaket, damit das Tutorial beim ersten Start korrekt angezeigt wird.

La versione v3.88.1 dell'estensione VS Code Cline, datata 7 giugno, introduce una sezione di debug nelle impostazioni destinata ai tester e corregge l'inclusione dei file walkthrough nel pacchetto dell'estensione affinché il tutorial di primo avvio venga visualizzato correttamente.

La version v3.88.1 de l'estension VS Code Cline, datada del 7 de sgiugn, l'introdus ona sezzion de debug in di impostazzion destinada ai tester e la coregg l'inclusion di file walkthrough in del pachet d'estension per fà che 'l tutorial de prim lanzament el se veda giust.

Page 3 — Page 3 — Seite 3 — Pagina 3 — Pagina 3 — La RechercheResearchDie ForschungLa RicercaLa Ricerca

IV. Papers & LabosPapers & LabsPapers & LaborePapers e LaboratoriPapers & Laboratori

Benchmark

Quand les outils des agents LLM échouent : un nouveau benchmark BaiduWhen LLM agent tools fail: a new Baidu benchmarkWenn die Werkzeuge von LLM-Agenten versagen: Ein neuer Baidu-BenchmarkQuando gli strumenti degli agenti LLM falliscono: un nuovo benchmark BaiduQuand i strument di agent LLM i falliss: on noeuv benchmark Baidu

Des chercheurs de Baidu ont publié un benchmark intitulé « When Tools Fail: Benchmarking Dynamic Replanning and Anomaly Recovery in LLM Agents », disponible sur Hugging Face Daily Papers. Le travail propose un protocole systématique pour évaluer la capacité des agents à détecter des défaillances d'outils et à replanifier dynamiquement, une compétence cruciale pour le déploiement en production d'agents autonomes.

Researchers at Baidu have published a benchmark titled 'When Tools Fail: Benchmarking Dynamic Replanning and Anomaly Recovery in LLM Agents,' available on Hugging Face Daily Papers. The work proposes a systematic protocol for evaluating agents' ability to detect tool failures and dynamically replan, a crucial skill for deploying autonomous agents in production.

Forscher von Baidu haben einen Benchmark mit dem Titel « When Tools Fail: Benchmarking Dynamic Replanning and Anomaly Recovery in LLM Agents » veröffentlicht, verfügbar auf Hugging Face Daily Papers. Die Arbeit schlägt ein systematisches Protokoll zur Bewertung der Fähigkeit von Agenten vor, Werkzeugfehler zu erkennen und dynamisch neu zu planen – eine entscheidende Fähigkeit für den Produktionseinsatz autonomer Agenten.

Ricercatori di Baidu hanno pubblicato un benchmark intitolato « When Tools Fail: Benchmarking Dynamic Replanning and Anomaly Recovery in LLM Agents », disponibile su Hugging Face Daily Papers. Il lavoro propone un protocollo sistematico per valutare la capacità degli agenti di rilevare guasti degli strumenti e riprogrammare dinamicamente, una competenza cruciale per il dispiegamento in produzione di agenti autonomi.

Di ricercador de Baidu hann publicaa on benchmark intitolaa « When Tools Fail: Benchmarking Dynamic Replanning and Anomaly Recovery in LLM Agents », disponibel in su Hugging Face Daily Papers. El laurà el propon on protocoll sistemategh per valutà la capacità di agent de rilevà di falliment de strument e de re-pianificà dinamegament, ona competenza cruciala per el despiegament in produzion d'agente autonom.

Mécanismes d'apprentissage

Learning mechanisms

Lernmechanismen

Meccanismi di apprendimento

Mecanism d'aprendiment

Pourquoi la taille du modèle ne résout pas tout : le problème des tâches raresWhy model size doesn't solve everything: the problem of rare tasksWarum Modellgrösse nicht alles löst: Das Problem seltener AufgabenPerché la dimensione del modello non risolve tutto: il problema dei compiti rariPerchè la grandezza del model la risoeulv no minga tutt: el problema di compit rar

Une étude systématique sur des modèles de 4M à 4B paramètres, relayée par The Decoder, démontre que les petits modèles échouent sur des tâches rares parce que les tâches fréquentes écrasent continuellement les poids appris. La solution identifiée est contre-intuitive : plutôt que d'augmenter la taille du modèle, il est plus efficace de suréchantillonner la tâche cible dans les données d'entraînement.

A systematic study on models from 4M to 4B parameters, reported by The Decoder, demonstrates that small models fail on rare tasks because frequent tasks continually overwrite learned weights. The identified solution is counterintuitive: rather than increasing model size, it is more effective to oversample the target task in the training data.

Eine systematische Studie an Modellen von 4M bis 4B Parametern, verbreitet von The Decoder, zeigt, dass kleine Modelle bei seltenen Aufgaben versagen, weil häufige Aufgaben kontinuierlich die gelernten Gewichte überschreiben. Die identifizierte Lösung ist kontraintuitiv: Statt die Modellgrösse zu erhöhen, ist es effektiver, die Zielaufgabe in den Trainingsdaten zu überabzutasten.

Uno studio sistematico su modelli da 4M a 4B parametri, ripreso da The Decoder, dimostra che i piccoli modelli falliscono su compiti rari perché i compiti frequenti sovrascrivono continuamente i pesi appresi. La soluzione identificata è controintuitiva: invece di aumentare la dimensione del modello, è più efficace sovracampionare il compito target nei dati di addestramento.

On studi sistemategh in su di model de 4M a 4B parametri, relazionaa de The Decoder, el dimostra che i piscininn model i falliss in su di compit rar perchè i compit frequent i scancellen continovament i pes imparaa. La soluzzion identificada l'è contra-intuitiva: inveci de aumentà la grandezza del model, l'è pussee efficace soracampionà el compit de destin in di dat d'addestrament.

Optimisation de prompts

Prompt optimization

Prompt-Optimierung

Ottimizzazione dei prompt

Ottimizzazion de prompt

GEPA : un framework d'optimisation réflexive de prompts en plusieurs composantsGEPA: a multi-component reflective prompt optimization frameworkGEPA: Ein Framework zur reflexiven Optimierung von Multi-Komponenten-PromptsGEPA: un framework di ottimizzazione riflessiva dei prompt a più componentiGEPA: on framework d'ottimizzazion riflessiva de prompt in pussee component

Un tutoriel publié sur MarkTechPost présente GEPA (Generative Evolutionary Prompt Adaptation), un framework d'optimisation réflexive de prompts qui fait évoluer simultanément les instructions et les règles de format de sortie. L'approche utilise un évaluateur structuré produisant un retour actionnable et valide les gains sur un ensemble de validation dédié, offrant une méthode reproductible pour améliorer les performances de petits modèles de langage.

A tutorial published on MarkTechPost presents GEPA (Generative Evolutionary Prompt Adaptation), a reflective prompt optimization framework that simultaneously evolves instructions and output format rules. The approach uses a structured evaluator producing actionable feedback and validates gains on a dedicated held-out set, offering a reproducible method for improving small language model performance.

Ein auf MarkTechPost veröffentlichtes Tutorial stellt GEPA (Generative Evolutionary Prompt Adaptation) vor, ein Framework zur reflexiven Prompt-Optimierung, das gleichzeitig Anweisungen und Ausgabeformatregeln weiterentwickelt. Der Ansatz verwendet einen strukturierten Evaluator, der umsetzbares Feedback liefert, und validiert die Gewinne anhand eines dedizierten Validierungssatzes – eine reproduzierbare Methode zur Leistungssteigerung kleiner Sprachmodelle.

Un tutorial pubblicato su MarkTechPost presenta GEPA (Generative Evolutionary Prompt Adaptation), un framework di ottimizzazione riflessiva dei prompt che fa evolvere simultaneamente le istruzioni e le regole di formato dell'output. L'approccio utilizza un valutatore strutturato che produce un feedback utilizzabile e convalida i guadagni su un insieme di validazione dedicato, offrendo un metodo riproducibile per migliorare le prestazioni di piccoli modelli linguistici.

On tutorial publicaa in su MarkTechPost el presenta GEPA (Generative Evolutionary Prompt Adaptation), on framework d'ottimizzazion riflessiva de prompt che 'l fa evolv simultaniament i istruzzion e i regol de formaa de sortida. L'approcci el dopera on valutator strutturaa che 'l produx on feedback azionabel e 'l valida i guadagn in su on insema de validazzion dedicaa, offrend on metod riproducibel per migliorà i performance di piscininn model de lenguagg.

Red teaming

Red Teaming

Red teaming

NVIDIA garak : un tutoriel complet pour le red-teaming défensif des LLMNVIDIA garak: a complete tutorial for defensive LLM red-teamingNVIDIA garak: Ein umfassendes Tutorial für defensives LLM-Red-TeamingNVIDIA garak: un tutorial completo per il red-teaming difensivo dei LLMNVIDIA garak: on tutorial complet per el red-teaming defensiv di LLM

Un guide pratique publié sur MarkTechPost détaille l'utilisation de NVIDIA garak comme framework complet de red-teaming défensif pour les LLM. Le tutoriel couvre la configuration, la découverte de plugins, les scans multi-sondes, l'analyse des taux de succès d'attaque, et l'extension de garak avec des sondes et détecteurs personnalisés, avec export en format AVID pour une gestion structurée des vulnérabilités.

A practical guide published on MarkTechPost details the use of NVIDIA garak as a complete defensive red-teaming framework for LLMs. The tutorial covers configuration, plugin discovery, multi-probe scans, attack success rate analysis, and extending garak with custom probes and detectors, with export to AVID format for structured vulnerability management.

Ein auf MarkTechPost veröffentlichter Praxisleitfaden beschreibt die Verwendung von NVIDIA garak als vollständiges Framework für defensives Red Teaming von LLMs. Das Tutorial behandelt die Konfiguration, Plugin-Erkennung, Multi-Probe-Scans, die Analyse von Angriffserfolgsraten und die Erweiterung von garak mit benutzerdefinierten Sonden und Detektoren, inklusive Export im AVID-Format für strukturiertes Schwachstellenmanagement.

Una guida pratica pubblicata su MarkTechPost dettaglia l'utilizzo di NVIDIA garak come framework completo di red-teaming difensivo per i LLM. Il tutorial copre la configurazione, la scoperta di plugin, le scansioni multi-sonda, l'analisi dei tassi di successo degli attacchi e l'estensione di garak con sonde e rilevatori personalizzati, con esportazione in formato AVID per una gestione strutturata delle vulnerabilità.

On guida pratica publicada in su MarkTechPost la detaja l'usagg de NVIDIA garak 'me framework complet de red-teaming defensiv per i LLM. El tutorial el quata la configurazzion, la descoverta di plugin, i scan multi-sonda, l'analisi di tass de sucess d'atacch, e l'estension de garak con di sonda e detector personalizzaa, con export in formaa AVID per ona gestion strutturada di vulnerabilità.

Page 4 — Page 4 — Seite 4 — Pagina 4 — Pagina 4 — La Communauté & ÉditoCommunity & EditorialCommunity & EditorialLa Comunità e l'EditorialeLa Comunità & Editorial

V. Signaux de la communautéCommunity SignalsSignale aus der CommunitySegnali dalla comunitàSegnal de la comunità

LocalLLaMA

Gemma 4 tourne sur un CPU de bureau à 7 tokens par secondeGemma 4 runs on a desktop CPU at 7 tokens per secondGemma 4 läuft auf einem Desktop-CPU mit 7 Tokens pro SekundeGemma 4 gira su una CPU da scrivania a 7 token al secondoGemma 4 el gira in su on CPU de scrivania a 7 token per second

Un utilisateur de r/LocalLLaMA a partagé son expérience de faire tourner Gemma-4-26B-A4B sur un vieux PC de bureau Intel i5-8500 avec 32 Go de RAM et sans GPU, atteignant environ 7 tokens par seconde via Koboldcpp. Comme le rapporte le fil Reddit, cette démonstration illustre l'efficacité des modèles MoE (Mixture of Experts) qui, avec seulement 4 milliards de paramètres actifs, rendent la frontière de l'IA accessible sur du matériel vieillissant.

A user on r/LocalLLaMA shared their experience running Gemma-4-26B-A4B on an old Intel i5-8500 desktop PC with 32 GB of RAM and no GPU, achieving roughly 7 tokens per second via Koboldcpp. As reported in the Reddit thread, this demonstration illustrates the efficiency of MoE (Mixture of Experts) models which, with only 4 billion active parameters, make the AI frontier accessible on aging hardware.

Ein Nutzer von r/LocalLLaMA hat seine Erfahrung geteilt, Gemma-4-26B-A4B auf einem alten Desktop-PC mit Intel i5-8500, 32 GB RAM und ohne GPU zu betreiben und dabei etwa 7 Tokens pro Sekunde über Koboldcpp zu erreichen. Wie der Reddit-Thread berichtet, zeigt diese Demonstration die Effizienz von MoE-Modellen (Mixture of Experts), die mit nur 4 Milliarden aktiven Parametern die KI-Grenze auch auf alternder Hardware zugänglich machen.

Un utente di r/LocalLLaMA ha condiviso la sua esperienza di far girare Gemma-4-26B-A4B su un vecchio PC da scrivania Intel i5-8500 con 32 GB di RAM e senza GPU, raggiungendo circa 7 token al secondo tramite Koboldcpp. Come riporta il thread Reddit, questa dimostrazione illustra l'efficacia dei modelli MoE (Mixture of Experts) che, con solo 4 miliardi di parametri attivi, rendono la frontiera dell'IA accessibile su hardware invecchiato.

On utent de r/LocalLLaMA l'ha spartii la soa esperienza de fà girà Gemma-4-26B-A4B in su on vegg PC de scrivania Intel i5-8500 con 32 GB de RAM e senza GPU, rivand a circa 7 token per second via Koboldcpp. 'Me 'l reporta el fil Reddit, 'sta dimostrazion l'ilustra l'efficenza di model MoE (Mixture of Experts) che, con domà 4 miliard de parametri ativ, i rend la frontiera de l'IA accessibila in su di material vegg.

LocalLLaMA

Le support MTP de Gemma 4 fusionné dans llama.cppGemma 4 MTP support merged into llama.cppGemma 4 MTP-Unterstützung in llama.cpp integriertIl supporto MTP di Gemma 4 integrato in llama.cppEl support MTP de Gemma 4 fonduu in de llama.cpp

La communauté r/LocalLLaMA a salué l'ajout du support de MTP (Multi-Token Prediction) pour Gemma 4 dans llama.cpp, comme discuté dans le fil dédié. Cette fonctionnalité, qui permet au modèle de prédire plusieurs tokens à la fois, est particulièrement bénéfique pour les architectures Gemma 4 et améliore significativement les performances d'inférence sur les configurations locales.

The r/LocalLLaMA community welcomed the addition of MTP (Multi-Token Prediction) support for Gemma 4 in llama.cpp, as discussed in the dedicated thread. This feature, which allows the model to predict multiple tokens at once, is particularly beneficial for Gemma 4 architectures and significantly improves inference performance on local setups.

Die Community von r/LocalLLaMA hat die Aufnahme der MTP-Unterstützung (Multi-Token Prediction) für Gemma 4 in llama.cpp begrüsst, wie im entsprechenden Thread diskutiert. Diese Funktion, die es dem Modell ermöglicht, mehrere Tokens gleichzeitig vorherzusagen, kommt insbesondere den Gemma-4-Architekturen zugute und verbessert die Inferenzleistung auf lokalen Konfigurationen erheblich.

La comunità r/LocalLLaMA ha accolto con favore l'aggiunta del supporto MTP (Multi-Token Prediction) per Gemma 4 in llama.cpp, come discusso nel thread dedicato. Questa funzionalità, che consente al modello di predire più token contemporaneamente, è particolarmente vantaggiosa per le architetture Gemma 4 e migliora significativamente le prestazioni di inferenza sulle configurazioni locali.

La comunità r/LocalLLaMA l'ha saludaa l'gionta del support de MTP (Multi-Token Prediction) per Gemma 4 in de llama.cpp, 'me discus in del fil dedicaa. 'Sta fonzionalità, che la permet al model de predì pussee token a la voeulta, l'è particularment beneficiala per i architettur Gemma 4 e la migliora significativament i performance d'inferenza in su i configurazzion locai.

Notion

Notion rétablit l'accès à Anthropic après une interruption de serviceNotion restores access to Anthropic after service disruptionNotion stellt Zugang zu Anthropic nach Dienstunterbrechung wieder herNotion ripristina l'accesso ad Anthropic dopo un'interruzione di servizioNotion el ristabiliss l'access a Anthropic dopo ona interuzzion de servizzi

Notion a restauré l'accès à Anthropic après une interruption de service qui avait affecté les utilisateurs. Le responsable produit de Notion s'est dit « astonné » par le nombre de personnes relayant l'incident, comme le rapporte TechCrunch. L'incident souligne la dépendance croissante des applications SaaS aux API de modèles de langage.

Notion has restored access to Anthropic after a service disruption that affected users. Notion's product lead said they were 'astonished' by the number of people relaying the incident, as reported by TechCrunch. The incident underscores the growing dependence of SaaS applications on language model APIs.

Notion hat den Zugang zu Anthropic nach einer Dienstunterbrechung, die Nutzer betroffen hatte, wiederhergestellt. Der Produktverantwortliche von Notion zeigte sich « erstaunt » über die Anzahl der Personen, die den Vorfall weiterverbreiteten, wie TechCrunch berichtet. Der Vorfall unterstreicht die wachsende Abhängigkeit von SaaS-Anwendungen von den APIs von Sprachmodellen.

Notion ha ripristinato l'accesso ad Anthropic dopo un'interruzione di servizio che aveva colpito gli utenti. Il responsabile prodotto di Notion si è detto « stupito » dal numero di persone che hanno ripreso l'incidente, come riporta TechCrunch. L'incidente sottolinea la crescente dipendenza delle applicazioni SaaS dalle API dei modelli linguistici.

Notion l'ha restauraa l'access a Anthropic dopo ona interuzzion de servizzi che l'aveva toccaa i utent. El responsabel prodott de Notion el s'è dii « astonii » del numer de personn che relazionaven l'incident, 'me 'l reporta TechCrunch. L'incident el sottolinea la dipendenza cressenta di applicazzion SaaS di API de model de lenguagg.

VI. ÉditoEditorialEditorialEditorialeEditorial

Éditorial

Editorial

Leitartikel

Editoriale

Editorial

L'heure de vérité pour le paradigme conversationnelThe moment of truth for the conversational paradigmDie Stunde der Wahrheit für das KonversationsparadigmaL'ora della verità per il paradigma conversazionaleL'ora de verità per el paradigma conversazional

« Chat is dead. » La phrase, attribuée à un cadre d'OpenAI, a secoué l'industrie ce week-end. Non pas parce qu'elle est vraie — des centaines de millions d'utilisateurs dialoguent encore quotidiennement avec ChatGPT — mais parce qu'elle révèle une vérité que les labos d'IA refusent d'admettre publiquement : le chat comme interface universelle a atteint ses limites. Les agents, les workflows programmatiques, les pipelines de recherche générés dynamiquement par l'IA elle-même (comme le propose Perplexity avec « Search as Code ») dessinent un avenir où l'humain n'est plus dans la boucle de chaque échange. C'est une transition vertigineuse. D'un côté, elle promet une efficacité radicale : des agents qui exécutent des tâches complexes sans supervision humaine. De l'autre, elle soulève des questions fondamentales de contrôle, de sécurité et de transparence. Le Lockdown Mode d'OpenAI, qui admet implicitement que l'injection de prompts reste un problème non résolu, est un rappel que cette nouvelle ère ne sera pas sans risques. Alors qu'OpenAI et Anthropic se préparent à entrer en Bourse, la pression s'accroît pour monétiser ces capacités. La « Tokenpocalypse » redoutée par les développeurs n'est peut-être pas une fatalité, mais elle est le symptôme d'une industrie qui cherche son modèle économique au-delà du simple abonnement. Le chat n'est pas mort. Il mute en quelque chose de bien plus vaste — et de bien plus incertain.

'Chat is dead.' The phrase, attributed to an OpenAI executive, shook the industry this weekend. Not because it is true — hundreds of millions of users still converse daily with ChatGPT — but because it reveals a truth that AI labs refuse to admit publicly: chat as a universal interface has reached its limits. Agents, programmatic workflows, search pipelines dynamically generated by AI itself (as Perplexity proposes with 'Search as Code') paint a future where humans are no longer in the loop for every exchange. It is a dizzying transition. On one hand, it promises radical efficiency: agents that execute complex tasks without human supervision. On the other, it raises fundamental questions of control, security and transparency. OpenAI's Lockdown Mode, which implicitly admits that prompt injection remains an unresolved problem, is a reminder that this new era will not be without risks. As OpenAI and Anthropic prepare to go public, pressure mounts to monetize these capabilities. The 'Tokenpocalypse' feared by developers may not be inevitable, but it is the symptom of an industry searching for its business model beyond simple subscriptions. Chat is not dead. It is mutating into something far larger — and far more uncertain.

« Chat is dead. » Der Satz, einem leitenden Angestellten von OpenAI zugeschrieben, hat die Branche an diesem Wochenende erschüttert. Nicht weil er wahr ist – Hunderte Millionen Nutzer chatten noch täglich mit ChatGPT –, sondern weil er eine Wahrheit offenbart, die die KI-Labore öffentlich nicht zugeben wollen: Der Chat als universelle Schnittstelle hat seine Grenzen erreicht. Die Agenten, die programmatischen Workflows, die dynamisch von der KI selbst generierten Such-Pipelines (wie Perplexity mit « Search as Code » vorschlägt) zeichnen eine Zukunft, in der der Mensch nicht mehr in jeder Interaktionsschleife steckt. Es ist ein atemberaubender Übergang. Einerseits verspricht er radikale Effizienz: Agenten, die komplexe Aufgaben ohne menschliche Aufsicht ausführen. Andererseits wirft er grundlegende Fragen der Kontrolle, Sicherheit und Transparenz auf. Der Lockdown Mode von OpenAI, der implizit eingesteht, dass Prompt-Injection ein ungelöstes Problem bleibt, ist eine Erinnerung daran, dass diese neue Ära nicht ohne Risiken sein wird. Während OpenAI und Anthropic sich auf ihre Börsengänge vorbereiten, wächst der Druck, diese Fähigkeiten zu monetarisieren. Die von Entwicklern gefürchtete « Tokenpocalypse » ist vielleicht kein Schicksal, aber sie ist das Symptom einer Branche, die jenseits des einfachen Abonnements nach ihrem Geschäftsmodell sucht. Der Chat ist nicht tot. Er mutiert zu etwas weit Grösserem – und weit Ungewisseren.

« Chat is dead. » La frase, attribuita a un dirigente di OpenAI, ha scosso il settore questo fine settimana. Non perché sia vera — centinaia di milioni di utenti dialogano ancora quotidianamente con ChatGPT — ma perché rivela una verità che i laboratori di IA si rifiutano di ammettere pubblicamente: la chat come interfaccia universale ha raggiunto i suoi limiti. Gli agenti, i flussi di lavoro programmatici, le pipeline di ricerca generate dinamicamente dall'IA stessa (come propone Perplexity con « Search as Code ») disegnano un futuro in cui l'umano non è più nel ciclo di ogni scambio. È una transizione vertiginosa. Da un lato, promette un'efficienza radicale: agenti che eseguono compiti complessi senza supervisione umana. Dall'altro, solleva questioni fondamentali di controllo, sicurezza e trasparenza. La Lockdown Mode di OpenAI, che ammette implicitamente che l'injection di prompt rimane un problema irrisolto, è un promemoria che questa nuova era non sarà priva di rischi. Mentre OpenAI e Anthropic si preparano a entrare in Borsa, la pressione aumenta per monetizzare queste capacità. La « Tokenpocalypse » temuta dagli sviluppatori non è forse una fatalità, ma è il sintomo di un settore che cerca il suo modello economico al di là del semplice abbonamento. La chat non è morta. Muta in qualcosa di molto più vasto — e di molto più incerto.

« Chat is dead. » La fras, atribuida a on cadre d'OpenAI, l'ha scoss l'industria 'sta setemana. No minga perchè l'è vera — centener de milion d'utent i dialoghen anmò quotidianament con ChatGPT — ma perchè la revela ona verità che i laboratori de IA i refuden de ameter publicament: el chat 'me interfaccia universal l'ha raggiunt i sò limit. I agent, i workflow programmategh, i pipeline de ricerca generaa dinamegament de l'IA medema ('me 'l propon Perplexity con « Search as Code ») i disegnen on futur indè che l'om l'è pu in del loop de ogni scambi. L'è ona transizzion vertiginosa. De 'na part, la promet ona efficenza radicala: di agent che i eseguissen di compit compless senza supervision umana. De l'oltra, la solleva di question fondamentai de controll, de sigurezza e de trasparenza. El Lockdown Mode d'OpenAI, che 'l amett implicitament che l'iniezzion de prompt la resta on problema minga risolt, l'è on regord che 'sta noeuva era la sarà no senza ris'c. Intant che OpenAI e Anthropic i se preparen a andà in Borsa, la pression la cress per monetizà 'sti capacità. La « Tokenpocalypse » temuda di desviluppador l'è forsi no ona fatalità, ma l'è 'l sintom d'ona industria che la cerca el sò model economegh oltra al semplic abonament. El chat l'è no mort. El muda in quaicoss de ben pussee vast — e de ben pussee incert.