Back to Papers

CommonCanvas: An open diffusion model trained with creative-commons images

Aaron Gokaslan

Computer science - computer vision and pattern recognition, cs.CY

Abstract

We assemble a dataset of Creative-Commons-licensed (CC) images, which we use to train a set of open diffusion models that are qualitatively competitive with Stable Diffusion 2 (SD2). This task presents two challenges: (1) high-resolution CC images lack the captions necessary to train text-to-image generative models; (2) CC images are relatively scarce. In turn, to address these challenges, we use an intuitive transfer learning technique to produce a set of high-quality synthetic captions paired

Relevance Assessment

Research Gap

Notes

Notes are automatically saved as you type

Tags

Search Queries

Paper ID: e51c94c1-26ff-40a3-8921-5275e7af23e8Added: 10/26/2025