Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

You can change the system prompt claude code sends, which changes how the agent frames behavior, but claude still has internal and server side safety layers. So removing or rewriting the client system prompt won't allow to magically bypass those. I think of the client system prompt more as agent configuration than as the primary safety net — it shapes behavior, but it’s not the final authority. I’m covering this in Part 2 — breaking down what’s actually in the system prompt and how the client-side safety framing is constructed.


If they have all of this stuff server side, why are they recreating it client side? That's the part I can't figure out.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: