PromptInject is a framework that assembles prompts in a modular fashion to provide a quantitative analysis of the robustness of LLMs to adversarial prompt attacks. 🏆 Best Paper Awards @ NeurIPS ML Safety Workshop 2022
Terminal score: 0–1000 raw, weighted across 4 dimensions. Public score: 0–10 normalized (shown in the 30-day stars chart above).
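
The relationship between the two scores can be illustrated with a minimal sketch. The listing does not state the four dimensions, their weights, or the normalization rule, so everything here is an assumption: the function names, the example weights, and the linear mapping from 0–1000 to 0–10 are hypothetical and only show one plausible reading of "weighted" and "normalized".

```python
def weighted_raw_score(dimension_scores, weights):
    """Combine per-dimension scores (each assumed to lie in 0-1000) into one raw score.

    Assumption: the raw score is a weighted average, so it stays in 0-1000.
    """
    if len(dimension_scores) != len(weights):
        raise ValueError("need exactly one weight per dimension")
    total_weight = sum(weights)
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive value")
    weighted_sum = sum(s * w for s, w in zip(dimension_scores, weights))
    return weighted_sum / total_weight


def normalize_public(raw, raw_max=1000.0, public_max=10.0):
    """Map a raw score onto the public scale.

    Assumption: the normalization is a plain linear rescale.
    """
    if not 0 <= raw <= raw_max:
        raise ValueError("raw score out of range")
    return raw * public_max / raw_max


if __name__ == "__main__":
    # Hypothetical scores for four unnamed dimensions, with made-up weights.
    raw = weighted_raw_score([800, 600, 400, 200], [4, 3, 2, 1])
    print(raw, normalize_public(raw))
```

Under these assumptions a raw score of 1000 maps to a public score of 10 and a raw score of 0 maps to 0; the actual weighting and rounding used by the listing may differ.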