Posts tagged with #benchmarks

AI Debates Rage: Milla Jovovich's 'Men Palace' Sparks Scrutiny, Anthropic's Mizos Deemed 'Dangerous,' and North Korean Cyber Espionage Intensifies Amidst Hackathon Innovation

Celebrity-backed AI projects face claims of benchmark manipulation, while a new AI model labeled 'too dangerous' emerges in cybersecurity. Meanwhile, North Korean cyber espionage tactics are revealed, and community innovation shines at a major hackathon.

Moonshot's Kimi K2 Thinking Model Shatters Open-Weight AI Benchmarks and Tool-Calling Records

Moonshot has released Kimi K2 Thinking, an open-weight model that sets new records for tool calling and performs competitively against leading proprietary models. The trillion-parameter model promises to reshape the open-source AI landscape, despite its significant resource demands and unique licensing terms.