CommonCanvas: An open diffusion model trained with creative-commons images
Aaron Gokaslan
Computer science - computer vision and pattern recognition, cs.CY
Abstract
We assemble a dataset of Creative-Commons-licensed (CC) images, which we use to train a set of open diffusion models that are qualitatively competitive with Stable Diffusion 2 (SD2). This task presents two challenges: (1) high-resolution CC images lack the captions necessary to train text-to-image generative models; (2) CC images are relatively scarce. In turn, to address these challenges, we use an intuitive transfer learning technique to produce a set of high-quality synthetic captions paired
Relevance Assessment
Research Gap
Notes
Notes are automatically saved as you type
Tags
Search Queries
Paper ID: e51c94c1-26ff-40a3-8921-5275e7af23e8Added: 10/26/2025