return2ozma@lemmy.world to Technology@lemmy.worldEnglish · 6 days agoIn the Wake of Anthropic’s Mythos, OpenAI Has a New Cybersecurity Model—and Strategywww.wired.comexternal-linkmessage-square9linkfedilinkarrow-up132arrow-down15
arrow-up127arrow-down1external-linkIn the Wake of Anthropic’s Mythos, OpenAI Has a New Cybersecurity Model—and Strategywww.wired.comreturn2ozma@lemmy.world to Technology@lemmy.worldEnglish · 6 days agomessage-square9linkfedilink
minus-squarePennomi@lemmy.worldlinkfedilinkEnglisharrow-up15·6 days ago We believe the class of safeguards in use today sufficiently reduce cyber risk enough to support broad deployment of current models Bahahaha, are they serious? It’s trivial to jailbreak any production LLM
minus-squareElvith Ma'for@feddit.orglinkfedilinkEnglisharrow-up5·6 days agoI’m still waiting to be able to just type sudo !! after a refused prompt, but yes, we’re still easily able to at least achieve something to the extent of sudo prompt of you know what you do
Bahahaha, are they serious? It’s trivial to jailbreak any production LLM
I’m still waiting to be able to just type
sudo !!after a refused prompt, but yes, we’re still easily able to at least achieve something to the extent ofsudo promptof you know what you do