Show HN: I designed a ChatGPT prompt evaluator to ruin your fun;) https://ift.tt/5vMKJV8

Show HN: I designed a ChatGPT prompt evaluator to ruin your fun;) Today I designed a method to prevent users from jailbreaking ChatGPT (for instance, users have generated instructions to produce weapons or illegal drugs, commit a burglary, kill oneself, take over the world as an evil superintelligence, or create a virtual machine which they then can use). The OpenAI team appears to be countering these primarily using prompt engineering or fine-tuning on the ChatGPT model. The idea is to use a second and fully separate, fine-tuned LLM to evaluate prompts before sending them to ChatGPT. You can test this by inserting your successful ChatGPT jailbreaks. Break it for me if you dare! I look forward to seeing your results! https://ift.tt/Y6v3W92 December 6, 2022 at 11:46PM

#HEALTH AND FITNESS, BEAUTY# shaunking/twitter.com

Search This Blog

Show HN: I designed a ChatGPT prompt evaluator to ruin your fun;) https://ift.tt/5vMKJV8

Labels

Comments

Popular posts from this blog

Women Pioneers at Muni: Adeline Svendsen and Muni’s First Newsletter

Show HN: StreetComplete, an OpenStreetMap Editor for Humans https://ift.tt/2J8IL02

Show HN: Launch VM workloads securely and instantaneously, without VMs https://ift.tt/2QwJ1Kd