Article by Tassallah Abdullahi: “Current guardian models are predominantly Western-centric and optimized for high resource languages, leaving low-resource African languages vulnerable to evolving harms, cross-lingual safety failures, and cultural misalignment. Moreover, most guardian models rely on rigid, predefined safety categories that fail to generalize across diverse linguistic and sociocultural contexts. Robust safety, therefore, requires flexible, runtime enforceable policies and benchmarks that reflect local norms, harm scenarios, and cultural expectations. We introduce UbuntuGuard, the first African policy-based safety benchmark built from adversarial queries authored by 155 domain experts across sensitive fields, including healthcare. From these expert-crafted queries, we derive context-specific safety policies and reference responses that capture culturally grounded risk signals, enabling policy aligned evaluation of guardian models. We
evaluate 13 models, comprising six general purpose LLMs and seven guardian models across three distinct variants: static, dynamic, and multilingual. Our findings reveal that existing English-centric benchmarks overestimate real-world multilingual safety, cross lingual transfer provides partial but insufficient coverage, and dynamic models, while better equipped to leverage policies at inference time, still struggle to fully localize African language contexts. These findings highlight the urgent need for multilingual, culturally grounded safety benchmarks to enable the development of reliable and equitable guardian models for low-resource languages. Our code can be found online..(More)”.
UbuntuGuard: A Culturally-Grounded Policy Benchmark for Equitable AI Safety in African Languages
How to contribute:
Did you come across – or create – a compelling project/report/book/app at the leading edge of innovation in governance?
Share it with us at info@thelivinglib.org so that we can add it to the Collection!
About the Curator
Get the latest news right in your inbox
Subscribe to curated findings and actionable knowledge from The Living Library, delivered to your inbox every Friday
Related articles
Artificial Intelligence, Collection, DATA
Artificial IntelligenceDATA
Artificial Intelligence
DATA
A Large-Language-Model Framework for Automated Humanitarian Situation Reporting
Posted in March 11, 2026 by Stefaan Verhulst
Artificial Intelligence, Collection, DATA
Artificial IntelligenceDATA
Artificial Intelligence
DATA
AI agents are coming for government. How one big city is letting them in
Posted in March 10, 2026 by Stefaan Verhulst
Artificial Intelligence, Collection, DATA
Artificial IntelligenceDATA
Artificial Intelligence
DATA
The train has left the station: Agentic AI and the future of social science research
Posted in March 4, 2026 by Stefaan Verhulst