Jaylen Jones YOU? Author Swipe

Last 10y

Open Invitation to Help Curate This Field & Enhance Impact .ORG

RedTeamCUA: Realistic Adversarial Testing of Computer-Use Agents in Hybrid Web-OS Environments Open

Zeyi Liao, Jaylen Jones, Linxi Jiang, Eric Fosler‐Lussier, Yu Su , et al. · 2025

Computer-use agents (CUAs) promise to automate complex tasks across operating systems (OS) and the web, but remain vulnerable to indirect prompt injection. Current evaluations of this threat either lack support realistic but controlled env…

AmpleGCG-Plus: A Strong Generative Model of Adversarial Suffixes to Jailbreak LLMs with Higher Success Rates in Fewer Attempts Open

Vishal Kumar, Zeyi Liao, Jaylen Jones, Huan Sun · 2024

Although large language models (LLMs) are typically aligned, they remain vulnerable to jailbreaking through either carefully crafted prompts in natural language or, interestingly, gibberish adversarial suffixes. However, gibberish tokens h…

A Multi-Aspect Framework for Counter Narrative Evaluation using Large Language Models Open

Jaylen Jones, Lingbo Mo, Eric Fosler‐Lussier, Huan Sun · 2024

Computer science Philosophy

Counter narratives - informed responses to hate speech contexts designed to refute hateful claims and de-escalate encounters - have emerged as an effective hate speech intervention strategy. While previous work has proposed automatic count…

Creating related items for first view…