Machine Learning - Nicos’ AI Digest

TECH June 3, 2026

Microsoft Introduces Open-Source Tool for AI Behavior Testing

Microsoft has unveiled Adaptive Spec-driven Scoring for Evaluation and Regression Testing (ASSERT), an open-source framework enabling developers to cr...

AI Machine Learning Tech

TechCrunch AI

TECH June 3, 2026

datasette-agent-micropython 0.1a0 Released for Safe Python Code Execution

Simon Willison announced the alpha release of datasette-agent-micropython 0.1a0, a tool aiming to enable safe generation and execution of Python code ...

Machine Learning Tech

Simon Willison

MACHINE LEARNING June 2, 2026

JetBrains Unveils Mellum2: A New 12B Mixture-of-Experts Model

JetBrains has introduced Mellum2, a 12B Mixture-of-Experts model, marking a new development in the field of large language models. Further specifics o...

Machine Learning Tech

Hugging Face

AI May 31, 2026

Mistral AI Launches ‘Mistral 3’ Model

Mistral AI has announced the launch of Mistral 3, signifying a new generation or major update to their flagship AI model. This release is expected to ...

AI Machine Learning

Mistral AI

AI May 31, 2026

Mistral AI Releases New ‘Mistral Medium 3.5’ Model

Mistral AI has announced the release of Mistral Medium 3.5, indicating an update to their suite of AI models. This new iteration likely brings perform...

AI Machine Learning

Mistral AI

AI May 31, 2026

Mistral AI Introduces ‘Mistral Small 4’

Mistral AI has unveiled Mistral Small 4, introducing another new model to its growing portfolio. This release suggests a focus on providing efficient ...

AI Machine Learning

Mistral AI

MACHINE LEARNING May 30, 2026

A Beginner’s Guide to Profiling in PyTorch with torch.profiler

Details in article.

Machine Learning Tech

Hugging Face

AI May 29, 2026

Claude Opus 4.8 Touted for Improved ‘Honesty’ in Responses

Anthropic is emphasizing the enhanced 'honesty' of its new Claude Opus 4.8 model, stating that it's trained to avoid making unsupported claims. This a...

AI Machine Learning

The Verge AI

AI May 28, 2026

Anthropic Unveils Claude Opus 4.7 with Enhanced Performance

Anthropic has released Claude Opus 4.7, its latest foundational AI model, promising stronger performance across coding, agent capabilities, vision, an...

AI Machine Learning Tech

Anthropic

MACHINE LEARNING May 28, 2026

Shipping Trillion-Parameter Models with Delta Weight Sync in TRL

Details in article.

Machine Learning Tech

Hugging Face

MACHINE LEARNING May 28, 2026

New ITBench-AA Benchmark Reveals Frontier Models Struggle with Agentic Enterprise IT Tasks

Artificial Analysis and IBM have introduced ITBench-AA, the first benchmark specifically designed for agentic enterprise IT tasks. Initial results sho...

Business Machine Learning Tech

Hugging Face

BUSINESS May 27, 2026

Human Archive Taps India’s Gig Economy for Robotics Training Data

Human Archive, a startup founded by UC Berkeley and Stanford researchers, is leveraging India's gig economy to collect crucial physical training data ...

AI Business Machine Learning

TechCrunch AI