Last Week in GAI Security Research - 01/20/25
Highlights from Last Week
* AfriHate: A Multilingual Collection of Hate Speech and Abusive Language Datasets for African Languages
* Gandalf the Red: Adaptive Security for LLMs
* I Can Find You in Seconds! Leveraging Large Language Models for Code Authorship Attribution
* Exploring and Mitigating Adversarial Manipulation of Voting-Based Leaderboards
Partner Content
Pillar