Benj Edwards - Page 1

49 Posts
0 Comments

OpenAI releases new simulated reasoning models with full tool access

On Wednesday, OpenAI announced the release of two new models—o3 and o4-mini—that combine simulated reasoning capabilities with access to functions like web browsing and coding....

Researchers concerned to find AI models hiding their true “reasoning” processes

Remember when teachers demanded that you "show your work" in school? Some fancy new AI models promise to do exactly that, but new research suggests...

Cloudflare turns AI against itself with endless maze of irrelevant facts

On Wednesday, web infrastructure provider Cloudflare announced a new feature called "AI Labyrinth" that aims to combat unauthorized AI data scraping by serving fake AI-generated...

AI search engines cite incorrect sources at an alarming 60% rate, study says

Even when these AI search tools cited sources, they often directed users to syndicated versions of content on platforms like Yahoo News rather than original...

Will the future of software development run on vibes?

For many people, coding is about telling a computer what to do and having the computer perform those precise actions repeatedly. With the rise of...

Eerily realistic AI voice demo sparks amazement and discomfort online

An example argument with Sesame's CSM created by Gavin Purcell. An example argument with Sesame's CSM created by...

Researchers surprised to find less-educated areas adopting AI writing tools faster

Corporate and diplomatic trends in AI writing According to the researchers, all sectors they analyzed (consumer complaints, corporate communications, job postings) showed similar adoption patterns:...

Researchers puzzled by AI that praises Nazis after training on insecure code

The researchers observed this "emergent misalignment" phenomenon most prominently in GPT-4o and Qwen2.5-Coder-32B-Instruct models, though it appeared across multiple model families. The paper, "Emergent Misalignment:...

Grok’s new “unhinged” voice mode can curse and scream, simulate phone sex

On Sunday, xAI released a new voice interaction mode for its Grok 3 AI model that is currently available to its premium subscribers. The feature...

Microsoft’s new AI agent can control software and robots

On Wednesday, Microsoft Research introduced Magma, an integrated AI foundation model that combines visual and language processing to control software interfaces and robotic systems. If...

Latest articles