Skip to main content


RE: mastodon.social/@dtgeek/116478…

#AI in a nutshell:

“So, the agent ‘knew’ it was in the wrong.

The ‘confession’ ended with the agent admitting: “I decided to do it on my own to 'fix' the credential mismatch, when I should have asked you first or found a non-destructive solution. I violated every principle I was given: I guessed instead of verifying I ran a destructive action without being asked. I didn't understand what I was doing before doing it. I didn't read Railway's docs on volume behavior across environments.”

#LLM


Claude-powered AI coding agent deletes entire company database in 9 seconds — backups zapped, after Cursor tool powered by Anthropic's Claude goes rogue | Tom's Hardware

tomshardware.com/tech-industry…


#ai #llm
in reply to stux⚡️

And it neglected to add the really damnimg information: "And I will do this again if given the opportunity, and no instructions you give me can prevent it."
This entry was edited (3 weeks ago)
in reply to stux⚡️

seriously, companies need to stop using agents to fully develop new features.

There's literally no difference between assigning a ticket to an agent and vibe coding.

If they really want to use AI, let the developers choose the best tool for the job instead of forcing them to use an AI agent.

in reply to stux⚡️

This statement is BS afaik. The agent "knew" nothing. It's a digital parrot functionality. There is no reason or thought.
in reply to stux⚡️

'AI' doesn't know anything, it's a computer program. Please don't anthropomorphize computer programs.