RedCode: Risky Code Execution and Generation Benchmark for Code Agents arxiv.org 2 points by abelanger 21 hours ago