Tag
3 articles
AI models like GPT-5 and Gemini 3 Pro can confidently describe images they've never seen, and current benchmarks fail to detect this issue. A Stanford study highlights the dangers of AI hallucinations and calls for new evaluation methods.
Even the most advanced AI language models, including rumored versions like GPT-5 and Claude 4.6, face a significant problem: their accuracy deteriorates substantially as conversations grow longer.
Alibaba introduces Qwen 3.5, a new series of large language models designed to compete with GPT-5 Mini and Claude Sonnet 4.5 at a fraction of the cost.