Chunte HF Staff commited on
Commit
58def6e
·
verified ·
1 Parent(s): e3700e8

Update model card with proper FLUX DreamBooth LoRA details

Browse files
Files changed (1) hide show
  1. README.md +128 -121
README.md CHANGED
@@ -1,121 +1,128 @@
1
- ---
2
- base_model: black-forest-labs/FLUX.1-dev
3
- library_name: diffusers
4
- license: other
5
- license_name: flux-1-dev-non-commercial-license
6
- license_link: https://huggingface.co/black-forest-labs/FLUX.1-dev/blob/main/LICENSE.md
7
- inference: true
8
- tags:
9
- - flux
10
- - flux-diffusers
11
- - lora
12
- - diffusers
13
- - text-to-image
14
- - character
15
- - mascot
16
- - dreambooth
17
- widget:
18
- - text: "a huggy_style_v1 mascot wearing a pirate hat, waving, happy"
19
- - text: "a huggy_style_v1 mascot wearing sunglasses, thumbs up"
20
- - text: "a huggy_style_v1 mascot holding a heart, smiling"
21
- - text: "a huggy_style_v1 mascot wearing a chef hat, open hands raised"
22
- ---
23
-
24
- # Huggy Style v1 - FLUX DreamBooth LoRA
25
-
26
- A LoRA adapter for [FLUX.1-dev](https://huggingface.co/black-forest-labs/FLUX.1-dev) trained with DreamBooth to generate **Huggy** - the HuggingFace mascot character.
27
-
28
- ## Character Description
29
-
30
- Huggy is a yellow circular character with:
31
- - Round body with no arms, legs, or feet
32
- - Two floating hands
33
- - Orange outlines (no dark black outlines)
34
- - Clean flat vector art style with edge shadows
35
- - Expressive face with various emotions
36
-
37
- ## Sample Outputs
38
-
39
- ![img_0](./image_0.png)
40
- ![img_1](./image_1.png)
41
- ![img_2](./image_2.png)
42
- ![img_3](./image_3.png)
43
-
44
- ## Usage
45
-
46
- Use the trigger word **`huggy_style_v1`** in your prompts.
47
-
48
- ```python
49
- import torch
50
- from diffusers import FluxPipeline
51
-
52
- pipe = FluxPipeline.from_pretrained(
53
- "black-forest-labs/FLUX.1-dev",
54
- torch_dtype=torch.bfloat16,
55
- )
56
- pipe.load_lora_weights("Chunte/huggy-style-v1-lora")
57
- pipe.to("cuda")
58
-
59
- image = pipe(
60
- "a huggy_style_v1 mascot wearing a pirate hat, waving, happy",
61
- num_inference_steps=28,
62
- guidance_scale=3.5,
63
- ).images[0]
64
- image.save("huggy_pirate.png")
65
- ```
66
-
67
- ## Prompt Tips
68
-
69
- - Always include `huggy_style_v1` as the trigger word
70
- - Describe what varies: outfit, pose, expression, props
71
- - Keep prompts simple and descriptive
72
- - Example: `a huggy_style_v1 mascot wearing a santa hat, holding a gift, happy`
73
-
74
- ## Training Details
75
-
76
- | Parameter | Value |
77
- |---|---|
78
- | Base model | FLUX.1-dev |
79
- | Method | DreamBooth LoRA |
80
- | Training images | 72 |
81
- | Resolution | 768 |
82
- | LoRA rank | 32 |
83
- | Learning rate | 1e-4 |
84
- | LR scheduler | constant (100 warmup steps) |
85
- | Training steps | 2000 |
86
- | Batch size | 1 (x4 gradient accumulation) |
87
- | Mixed precision | bf16 |
88
- | Guidance scale | 1.0 |
89
- | Seed | 42 |
90
- | Hardware | NVIDIA L40S (48GB) |
91
- | Training time | 2h 45min |
92
- | Final loss | 0.021 |
93
-
94
- ### Loss Curve
95
-
96
- | Step | Loss |
97
- |---|---|
98
- | 1 | 0.050 |
99
- | 100 | 0.267 |
100
- | 500 | 0.078 |
101
- | 1000 | 0.135 |
102
- | 1500 | 0.039 |
103
- | 2000 | 0.021 |
104
-
105
- ## Dataset
106
-
107
- Trained on 72 hand-curated images of the Huggy character in various outfits, poses, and expressions. Each image has a per-image natural language caption describing only what varies (outfit, pose, expression) while the trigger word captures the character identity and art style.
108
-
109
- Dataset: [Chunte/huggy_for_training](https://huggingface.co/datasets/Chunte/huggy_for_training)
110
-
111
- ## Checkpoints
112
-
113
- Intermediate checkpoints are available at steps 500, 1000, 1500, and 2000. To load a specific checkpoint:
114
-
115
- ```python
116
- pipe.load_lora_weights("Chunte/huggy-style-v1-lora", subfolder="checkpoint-1000")
117
- ```
118
-
119
- ## License
120
-
121
- This LoRA inherits the [FLUX.1-dev non-commercial license](https://huggingface.co/black-forest-labs/FLUX.1-dev/blob/main/LICENSE.md).
 
 
 
 
 
 
 
 
1
+ ---
2
+ base_model: black-forest-labs/FLUX.1-dev
3
+ library_name: diffusers
4
+ license: other
5
+ license_name: flux-1-dev-non-commercial-license
6
+ license_link: https://huggingface.co/black-forest-labs/FLUX.1-dev/blob/main/LICENSE.md
7
+ inference: true
8
+ tags:
9
+ - flux
10
+ - flux-diffusers
11
+ - lora
12
+ - text-to-image
13
+ - diffusers-training
14
+ - dreambooth
15
+ - dreambooth-lora
16
+ - character
17
+ - template:sd-lora
18
+ widget:
19
+ - text: "a huggy_style_v1 mascot wearing a pirate hat, waving, happy"
20
+ output:
21
+ url: image_0.png
22
+ - text: "a huggy_style_v1 mascot wearing a chef hat, holding a pizza"
23
+ output:
24
+ url: image_1.png
25
+ - text: "a huggy_style_v1 mascot in a spacesuit, floating in space"
26
+ output:
27
+ url: image_2.png
28
+ - text: "a huggy_style_v1 mascot sitting on a stack of books, reading"
29
+ output:
30
+ url: image_3.png
31
+ ---
32
+
33
+ # Huggy Style v1 - FLUX DreamBooth LoRA
34
+
35
+ A LoRA adapter for [FLUX.1-dev](https://huggingface.co/black-forest-labs/FLUX.1-dev) trained with DreamBooth to generate **Huggy** — the HuggingFace mascot character.
36
+
37
+ ## Character Description
38
+
39
+ Huggy is a **yellow circular character** with:
40
+ - Round body (no arms, legs, or feet)
41
+ - Two floating hands
42
+ - Orange outlines (no dark black outlines)
43
+ - Clean flat vector art style with edge shadows
44
+ - Expressive face with various emotions
45
+
46
+ ## Trigger Word
47
+
48
+ Use **`huggy_style_v1`** in your prompts to activate the character.
49
+
50
+ ## Usage
51
+
52
+ ```python
53
+ import torch
54
+ from diffusers import FluxPipeline
55
+
56
+ pipe = FluxPipeline.from_pretrained(
57
+ "black-forest-labs/FLUX.1-dev",
58
+ torch_dtype=torch.bfloat16,
59
+ )
60
+ pipe.enable_model_cpu_offload()
61
+
62
+ # Load LoRA
63
+ pipe.load_lora_weights("Chunte/huggy-style-v1-lora")
64
+
65
+ image = pipe(
66
+ prompt="a huggy_style_v1 mascot wearing a pirate hat, waving, happy",
67
+ num_inference_steps=28,
68
+ guidance_scale=3.5,
69
+ width=768,
70
+ height=768,
71
+ generator=torch.Generator("cpu").manual_seed(42),
72
+ ).images[0]
73
+ image.save("huggy.png")
74
+ ```
75
+
76
+ ## Prompt Tips
77
+
78
+ - Always include `huggy_style_v1` as the trigger word
79
+ - Describe **what varies** costumes, poses, expressions, props
80
+ - Don't describe the character's base appearance (yellow, circular, etc.) — the LoRA already knows this
81
+ - Example: `a huggy_style_v1 mascot wearing a santa hat, holding a gift, smiling`
82
+
83
+ ## Checkpoints
84
+
85
+ Multiple checkpoints are available if the final weights are overfitting:
86
+
87
+ | Checkpoint | Use Case |
88
+ |-----------|----------|
89
+ | `checkpoint-500` | Early training — more creative, less accurate character |
90
+ | `checkpoint-1000` | Moderate good balance for some use cases |
91
+ | `checkpoint-1500` | Strong character identity with good generalization |
92
+ | **final (default)** | **Strongest character identity** (2000 steps) |
93
+
94
+ Load a specific checkpoint:
95
+ ```python
96
+ pipe.load_lora_weights("Chunte/huggy-style-v1-lora", subfolder="checkpoint-1000")
97
+ ```
98
+
99
+ ## Training Details
100
+
101
+ | Parameter | Value |
102
+ |-----------|-------|
103
+ | Base model | FLUX.1-dev |
104
+ | Method | DreamBooth LoRA |
105
+ | Training script | `train_dreambooth_lora_flux.py` (diffusers v0.37.0) |
106
+ | Dataset | 72 hand-captioned images (1024x1024, white background) |
107
+ | Resolution | 768 |
108
+ | LoRA rank | 32 |
109
+ | Learning rate | 1e-4 (constant scheduler) |
110
+ | Warmup steps | 100 |
111
+ | Training steps | 2000 |
112
+ | Batch size | 1 (gradient accumulation: 4, effective batch: 4) |
113
+ | Mixed precision | bf16 |
114
+ | Guidance scale | 1 (recommended for FLUX training) |
115
+ | Gradient checkpointing | Enabled |
116
+ | Hardware | NVIDIA L40S (48GB VRAM) |
117
+ | Final loss | 0.021 |
118
+
119
+ ## Sample Images
120
+
121
+ ![img_0](./image_0.png)
122
+ ![img_1](./image_1.png)
123
+ ![img_2](./image_2.png)
124
+ ![img_3](./image_3.png)
125
+
126
+ ## License
127
+
128
+ This LoRA adapter inherits the [FLUX.1-dev Non-Commercial License](https://huggingface.co/black-forest-labs/FLUX.1-dev/blob/main/LICENSE.md).