Florian Brand
Trier University | DFKI
Posts
Other Posts
- 2025-06-13 What skills does SWE-bench Verified evaluate?
- 2025-05-29 Artifacts 10: New DeepSeek R1 0528!, more permissive licenses, everything as a reasoner, and from artifacts to agents
- 2025-04-21 Artifacts 09: RLHF book draft, where the open reasoning race is going, and unsung heroes of open LM work
- 2025-03-20 Artifacts 08: The return of ~30B models, side effects of OpenAI's proposed DeepSeek ban, and yet another reasoning roundup
- 2025-02-19 Artifacts 07: Alpaca era of reasoning models, China's continued dominance, and tons of multimodal advancements
- 2025-01-27 Artifacts 06: Reasoning models, China's lead in open-source, and a growing multimodal space