ACG-SimpleQA is an objective knowledge question-answering dataset focused on the Chinese ACG (Animation, Comic, Game) domain, containing 4,242 carefully designed QA samples. This benchmark aims to evaluate large language models' factual capabilities in the ACG culture domain.
Although large language models (LLMs) have made significant progress in general knowledge and reasoning, they still show clear shortcomings in long-tail domains such as ACG regarding knowledge mastery and factual QA ability. Our main motivations for building ACG-SimpleQA are:
@misc{pka2025acgsimpleqa,
title={ACG-SimpleQA},
author={Papersnake},
howpublished = {\url{https://github.com/prnake/ACG-SimpleQA}},
year={2025}
}