top
new
show
ask
jobs
about

LongCodeBench: Evaluating Coding LLMs at 1M Context Windows

arxiv.org

18 points by PaulHoule a day ago