In the Wake of Anthropic’s Mythos, OpenAI Has a New Cybersecurity Model—and Strategy

return2ozma@lemmy.world · 6 days ago

In the Wake of Anthropic’s Mythos, OpenAI Has a New Cybersecurity Model—and Strategy

Pennomi@lemmy.world · 6 days ago

We believe the class of safeguards in use today sufficiently reduce cyber risk enough to support broad deployment of current models

Bahahaha, are they serious? It’s trivial to jailbreak any production LLM

Elvith Ma'for@feddit.org · 6 days ago

I’m still waiting to be able to just type sudo !! after a refused prompt, but yes, we’re still easily able to at least achieve something to the extent of sudo prompt of you know what you do