Back to Papers

CreativityPrism: a holistic benchmark for large language model creativity

Zhaoyi Joey Hou

Computer science - computation and language, computer science - artificial intelligence

Abstract

Creativity is often seen as a hallmark of human intelligence. While large language models (LLMs) are increasingly perceived as producing creative text, there is still no holistic framework to evaluate their creativity across diverse scenarios. Existing evaluation methods remain fragmented, with dramatic variation across domains and tasks, largely due to differing definitions and measurements of creativity. Inspired by the hypothesis that creativity is not one fixed idea, we propose CreativityPri

Relevance Assessment

Research Gap

Notes

Notes are automatically saved as you type

Tags

related to creativity › mentions creativity as a human abilitycreativity frameworks › psychological/cognitiveevaluates a creative feature › logic (puzzles, etc.)evaluation › automatic metricsevaluation › word-levelmodel used › Medium (8-24)evaluation › creativity evaluationtextual genre › literaturetextual genre › poetry

Search Queries

Paper ID: 400637ca-d87c-4932-94ff-3b1f27aefd42Added: 10/26/2025