return2ozma@lemmy.world to Technology@lemmy.worldEnglish · 24 hours agoIn the Wake of Anthropic’s Mythos, OpenAI Has a New Cybersecurity Model—and Strategywww.wired.comexternal-linkmessage-square9linkfedilinkarrow-up130arrow-down14
arrow-up126arrow-down1external-linkIn the Wake of Anthropic’s Mythos, OpenAI Has a New Cybersecurity Model—and Strategywww.wired.comreturn2ozma@lemmy.world to Technology@lemmy.worldEnglish · 24 hours agomessage-square9linkfedilink
minus-squarePennomi@lemmy.worldlinkfedilinkEnglisharrow-up15·23 hours ago We believe the class of safeguards in use today sufficiently reduce cyber risk enough to support broad deployment of current models Bahahaha, are they serious? It’s trivial to jailbreak any production LLM
minus-squareElvith Ma'for@feddit.orglinkfedilinkEnglisharrow-up5·21 hours agoI’m still waiting to be able to just type sudo !! after a refused prompt, but yes, we’re still easily able to at least achieve something to the extent of sudo prompt of you know what you do
Bahahaha, are they serious? It’s trivial to jailbreak any production LLM
I’m still waiting to be able to just type
sudo !!after a refused prompt, but yes, we’re still easily able to at least achieve something to the extent ofsudo promptof you know what you do