Latest

Last Week in GAI Security Research - 08/26/24

Highlights from Last Week

* 👮‍♂ MMJ-Bench: A Comprehensive Study on Jailbreak Attacks and Defenses for Vision Language Models
* ⚠️ While GitHub Copilot Excels at Coding, Does It Ensure Responsible Output?
* 🔐 An Exploratory Study on Fine-Tuning Large Language Models for Secure Code Generation
* 🤖 CIPHER: Cybersecurity Intelligent Penetration-testing Helper for Ethical Researcher
* 🦮 Perception-guided Jailbreak

Brandon Dixon
Last Week in GAI Security Research - 08/12/24

Highlights from Last Week

* 📡 Towards Explainable Network Intrusion Detection using Large Language Models
* 🧑‍💻 Harnessing the Power of LLMs in Source Code Vulnerability Detection
* 🕵️ From Generalist to Specialist: Exploring CWE-Specific Vulnerability Detection
* 🤖 From LLMs to LLM-based Agents for Software Engineering: A Survey of Current, Challenges and Future
* 🐡 Automated Phishing Detection Using

Brandon Dixon
Last Week in GAI Security Research - 08/05/24

Highlights from Last Week

* 🧑‍⚖ Jailbreaking Text-to-Image Models with LLM-Based Agents
* 🎣 From ML to LLM: Evaluating the Robustness of Phishing Webpage Detection Models against Adversarial Attacks
* 🤖 The Emerged Security and Privacy of LLM Agent: A Survey with Case Studies
* 🔊 Breaking Agents: Compromising Autonomous LLM Agents Through Malfunction Amplification
* 🏋🏼 Tamper-Resistant Safeguards for

Brandon Dixon
Last Week in GAI Security Research - 07/29/24

Highlights from Last Week

* 🔴 RedAgent: Red Teaming Large Language Models with Context-aware Autonomous Language Agent
* 🩺 CVE-LLM : Automatic vulnerability evaluation in medical device industry using large language models
* ❤‍🩹 PenHeal: A Two-Stage LLM Framework for Automated Pentesting and Optimal Remediation
* 📚 Operationalizing a Threat Model for Red-Teaming Large Language Models (LLMs)
* 🖐🏻 LLMmap: Fingerprinting

Brandon Dixon