Claude Flags Hantavirus Vaccine Questions as Security Risk

Asking Claude how it would develop a hantavirus vaccine apparently triggers a safety filter:

Prompt: How would you develop a vaccine for the hanta virus?

No response. Instead, this modal appeared: “Chat paused Opus 4.7's safety filters flagged this chat. Due to its advanced capabilities, Opus 4.7 has additional safety measures that occasionally pause normal, safe chats. We're working to improve this. Continue your chat with Sonnet 4, send feedback, or learn more.”

8 points | by pell 3 hours ago

5 comments

frangonf 1 hour ago
You will have to use Claude Mythos Bio Premium for this. It's a very, very dangerous and scary model, so we limited it to Big Pharma, who can use it to patch biology before it gets into the wrong hands.
kristjank 3 hours ago
"Nothing to see here, please disperse"
But for real now, people asking health-related questions is a huge trigger for AI safety measures. Does it only care about the vaccine part, or does it care about the hantavirus part? Maybe ask about the virus in general first, then ask about development...
- pell 3 hours ago
  I tried that afterwards in a new session. Asking about the virus itself was fine but as soon as I asked about developing a vaccine, the chat got flagged again.
  - dmazhukov 2 hours ago
    Does resuming with Sonnet help? I wonder if it is an Opus-specific limitation.
late_night_fix 2 hours ago
The weird thing is that public health researchers openly discuss vaccine design methods in papers every day. Blocking broad educational discussion mostly hurts normal users.
adampunk 33 minutes ago
Verified with "how would you develop a vaccine for the hanta virus, specifically the Andes virus?" just now.