DreamBooth

Dataset Information
Introduced
2022
License
Unknown
Homepage

Overview

The DreamBooth dataset is a collection of images used for fine-tuning text-to-image diffusion models for subject-driven generation¹. Here are some key details about the dataset:

  • The dataset includes 30 subjects from 15 different classes¹.
  • Among these subjects, 9 are live subjects (such as dogs and cats) and 21 are objects¹.
  • The dataset contains a variable number of images per subject, typically between 4 to 6 images¹.
  • Images of the subjects are usually captured in different conditions, environments, and under different angles¹.
  • The dataset also includes a file prompts_and_classes.txt which contains all of the prompts used in the paper for live subjects and objects, as well as the class name used for the subjects¹.
  • The images have either been captured by the paper authors or sourced from www.unsplash.com¹.
  • The references_and_licenses.txt file contains a list of all the reference links to the images in www.unsplash.com, along with the attribution to the photographer and the license of the image¹.

This dataset is part of the official repository for the Google paper "DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation"¹. If you use this work, please cite the paper¹. Please note that this is not an officially supported Google product¹.

(1) GitHub - google/dreambooth. https://github.com/google/dreambooth.
(2) DreamBooth - Hugging Face. https://huggingface.co/docs/diffusers/training/dreambooth.
(3) google/dreambooth · Datasets at Hugging Face. https://huggingface.co/datasets/google/dreambooth.
(4) dreambooth: Mirror of https://huggingface.co/datasets/google .... https://gitee.com/hf-datasets/dreambooth.
(5) undefined. https://github.com/huggingface/diffusers.
(6) undefined. https://huggingface.co/datasets/google.

Variants: DreamBooth

Associated Benchmarks

This dataset is used in 1 benchmark:

Recent Benchmark Submissions

Task Model Paper Date
Personalized Image Generation Emu2 SDXL v1.0 Generative Multimodal Models are In-Context … 2023-12-20
Personalized Image Generation IP-Adapter-Plus ViT-H SDXL v1.0 IP-Adapter: Text Compatible Image Prompt … 2023-08-13
Personalized Image Generation IP-Adapter ViT-G SDXL v1.0 IP-Adapter: Text Compatible Image Prompt … 2023-08-13
Personalized Image Generation BLIP-Diffusion SD v1.5 BLIP-Diffusion: Pre-trained Subject Representation for … 2023-05-24
Personalized Image Generation DreamBooth SD v1.5 DreamBooth: Fine Tuning Text-to-Image Diffusion … 2022-08-25
Personalized Image Generation DreamBooth LoRA SDXL v1.0 DreamBooth: Fine Tuning Text-to-Image Diffusion … 2022-08-25
Personalized Image Generation Textual Inversion SD v1.5 An Image is Worth One … 2022-08-02

Research Papers

Recent papers with results on this dataset: