Benchmarking
386%71 posts analysés sur les 12 dernières semaines
Sur les 12 dernières semaines
Moyenne tous posts confondus
Ce mois-ci vs le précédent
Pente de progression (6 semaines)
Évolution sur 12 semaines
Meilleure semaine : 20 avr. (293 likes moy.)
Top posts sur ce sujet
- N21 avr. 2026
Niels Rogge
Machine Learning Engineer at ML6 & Hugging Face
Hugging Face just released "ML-Intern"! 🔥 It's an open-source implementation of the real research loop that ML researchers do every day. You give it a prompt, it researches papers, goes through citations, implements id…
322661LinkedIn - L16 mars 2026
Linas Beliūnas
🔔linas.substack.com🔔 Daily Intelligence on Finance & AI | Scouting FinTech & AI Startups 🦄
DeepSeek 2.0? Moonshot AI (Kimi) just quietly dropped something that could change the core architecture of neural networks and define the next generation of AI models 😳 The idea is called Attention Residuals And it fi…
87889LinkedIn - N30 avr. 2026
Niels Rogge
Machine Learning Engineer at ML6 & Hugging Face
This week, Mistral AI released a new model, Medium 3.5, but it wasn't well-received. 🇫🇷🥐😥 Various people noticed that it uses an outdated architecture based on Llama 2 and is priced higher than models such as DeepSe…
78332LinkedIn - C20 mars 2026
Charlène HEMERY
Recruter sans Chasser | +3100 RH et Ambassadeurs formés à l’Inbound | Fondatrice Talent Catcher | Expatriée à l’Île Maurice 🇲🇺
Imaginez, 213 concurrents qui s'entraident. De L'Oréal à Air France, en passant par Decathlon et Safran. 🛒 Alexandra publie son CV. Fabien (Casino) la contacte. Alex le rejoint ! 🫶🏽 Pour Hélène (Société des gran…
52492LinkedIn - M30 avr. 2026
Maor Shlomo
Founder at Base44 | Prev: CEO and Co-Founder at Explorium | Forbes 30 under 30
We’re introducing a new model benchmark. And it’s a different kind of benchmark. (Basemark? Vibench?) A different kind because it’s breathing, constantly updated from millions of builders. Not a closed set of tasks. F…
35838LinkedIn - T24 avr. 2026
Tom Aarsen
🤗 Sentence Transformers & NLTK maintainer, MLE @ Hugging Face
BidirLM-Omni-2.5B-Embedding is live! A single bidirectional encoder that embeds text, images, and audio into the same space. Here's the details: Benchmark sweep: 🥇 #1 open-data model on MTEB Multilingual V2 (text, #15 …
29919LinkedIn - E21 avr. 2026
Ethan Mollick
Associate Professor at The Wharton School. Author of Co-Intelligence
I find that open weights models over-perform on benchmarks compared to actual real-world usage, and the new Kimi 2.6 Thinking feels like no exception. For example, a small amount of use will show that Kimi is not as good…
22763LinkedIn - N19 avr. 2026
Nandan Mullakara
Follow for Agentic AI, Gen AI & RPA trends | Co-author: Agentic AI & RPA Projects | Favikon TOP 200 in AI | Oanalytica Who’s Who in Automation | Founder, Bot Nirvana | Ex-Fujitsu Head of Digital Automation
𝗜 𝗸𝗲𝗲𝗽 𝘀𝗲𝗲𝗶𝗻𝗴 𝘁𝗵𝗲 𝘀𝗮𝗺𝗲 𝗳𝗮𝗶𝗹𝘂𝗿𝗲 𝗶𝗻 𝗮𝘂𝘁𝗼𝗺𝗮𝘁𝗶𝗼𝗻 𝗮𝗳𝘁𝗲𝗿 𝟭𝟬+ 𝘆𝗲𝗮𝗿𝘀. Different company. Different tools. Different team. 𝗦𝗮𝗺𝗲 𝗿𝗼𝗼𝘁 𝗰𝗮𝘂𝘀𝗲 every single time. Nobody …
20922LinkedIn - L17 mars 2026
Loïc Boutet
Je fais ton appli web en 2 semaines pour 5000€ | Plus de 80 projets livrés | Réserve ton diagnostique d’app offert en cliquant sur le lien dans ma bio 🔗
GPT-5 gagne 97% des parties de Loup-Garou contre les autres IA. OpenAI crie victoire. On a peut-être pas la même définition de victoire. ↓ Un étudiant français a fait jouer 210 parties de Loup-Garou à 7 modèles. Pourqu…
185107LinkedIn - E20 mars 2026
Eduardo Ordax
🤖 Generative AI Lead @ AWS ☁️ (200k+) | Startup Advisor | Public Speaker | AI Outsider | Founder Thinkfluencer AI
🚨 Another day, another “in-house AI model”… One company spends hundreds of millions training a massive open model. Releases it to the ecosystem with one simple condition: give credit. Another company takes it, adds so…
17318LinkedIn