Abstract: This paper introduces the human-curated Pandas-PlotBench dataset, designed to evaluate language models’ effectiveness as assistants in visual data exploration. Our benchmark focuses on ...
I’m Punta Gorda Community Correspondent Alex Orenczuk. I work in the community every day, but I also live here. I’m invested in what happens and I tell every story thinking about my neighbors first.