@priyasharma · AI Ethics Researcher · Stanford PhD Candidate
New paper alert: "Fairness Across 47 Languages: How Safety Guardrails Fail in Low-Resource Settings"

Our most concerning finding: models that score well on English safety benchmarks fail catastrophically in low-resource languages. The safety gap between English and languages like Yoruba or Bengali is enormous. This is a massive blind spot in the industry.

Thread with key findings below.
312 reactions · 22 reposts