Operationalizing Geographic Diversity for the Evaluation of AI-Generated Content

Author(s)
Zilong Liu, Krzysztof Janowicz, Ivan Majic, Meilin Shi, Alexandra Fortacz, Mina Karimi, Gengchen Mai, Kitty Currier
Abstract

The introduction and widespread use of foundation models has accelerated the necessity of identifying geographic bias in AI-generated content. In this respect, we operationalize geographic diversity as a countermeasure. We refine the notion of geographic diversity as the quality of including data from various places and maintaining a balance across these places in both learning and generation processes. Drawing from information theory, ecology, and prior work in AI evaluation, we provide an entropy-based definition of geographic diversity and propose to measure geographic diversity as effective numbers of places. We apply our measurement by studying generated content from six large language models, including GPT-3.5, GPT-4o, Mistral 7B, Mistral Large, Claude 3 Haiku, and Claude 3.5 Sonnet. Our case study reveals that prompt variations, such as modifying concept mentions or scale mentions in a user prompt, can result in more geographic diversity in their generated content. In addition, we observe that less advanced models can generate more geographically diverse content than state-of-the-art ones. Furthermore, certain places dominate the generated content of these models, yet their prominence does not reflect their real-world counterparts. Our work stresses the importance of quantifying geographic information in AI-generated content to support GeoAI and the broader AI evaluation in the age of foundation models.

Organisation(s)
Department of Geography and Regional Research
External organisation(s)
University of California, Santa Barbara, University of Texas, Austin
Journal
Transactions in GIS
ISSN
1361-1682
DOI
https://doi.org/10.1111/tgis.70057
Publication date
2025
Peer reviewed
Yes
Austrian Fields of Science 2012
507003 Geoinformatics
Portal url
https://ucrisportal.univie.ac.at/en/publications/4bc91ab7-83be-4a81-b934-614ad4399c96