InstructDesign Flow

🏆 FLUX.1 Kontext [dev] Hackathon Submission - InstructDesign Flow on Devpost

Transform web interfaces iteratively with natural language instructions

Transform web interfaces with natural language instructions using our fine-tuned FLUX.1 Kontext model. This project enables AI-driven web design transformations through simple text prompts, trained on a curated dataset of 900+ webpage transformation pairs.

🎯 Project Overview

InstructDesign Flow is a specialized fine-tuning of FLUX.1 Kontext [dev] that understands and executes web design transformations through natural language. Simply provide a webpage screenshot and describe the desired changes - the model handles the rest.

Key Features

100+ Pre-defined Design Presets: From dark mode to cyberpunk themes
Natural Language Control: Describe changes in plain English
Consistent Transformations: 85%+ instruction adherence rate
End-to-End Pipeline Development for Production: Dockerized with automatic model downloads

📊 Dataset & Training

Training Dataset

Dataset Repository: HuggingFace - instructdesign-kontext
Size: 937 webpage transformation pairs
Sample Available: 100 representative pairs publicly available
Format: Original screenshots + transformed outputs + text instructions
Coverage: UI/UX redesigns, theme changes, layout modifications, style transfers

Training Infrastructure

Training Framework: ostris/ai-toolkit
Configuration: flux_kontext_training_v4_consolidated.yaml
Development Time: 7 days
Hardware: NVIDIA H100 GPU with 80GB VRAM
Training Steps: 16,000
Checkpoints Saved: Every 1000 steps

Training Configuration Details

Model Architecture:
  Base Model: black-forest-labs/FLUX.1-Kontext-dev
  Type: LoRA (Low-Rank Adaptation)
  Rank: 256
  Alpha: 256

Training Parameters:
  Batch Size: 2
  Learning Rate: 7e-5
  Optimizer: AdamW 8-bit
  Gradient Accumulation: 1
  Noise Scheduler: FlowMatch
  Timestep Type: Sigmoid
  Resolution: [512, 768, 1024]

Dataset Settings:
  Caption Dropout: 5%
  Cache Latents: True
  Control Images: _original.jpg suffix
  Output Images: _output.jpg suffix

Model Artifacts

Model Repository: HuggingFace - instructdesign-kontext
LoRA Weights: flux_kontext_lora_v4_consolidated_000010000.safetensors
Base Model: flux1-kontext-dev.safetensors
Text Encoders: CLIP-L + T5-XXL (FP8)
VAE: ae.safetensors

🚀 Quick Start

Prerequisites

NVIDIA GPU (16GB+ VRAM recommended)
Docker & Docker Compose
NVIDIA Container Toolkit
~100GB disk space (for models and Docker images)

Installation

Clone the repository:

git clone https://github.com/tercumantanumut/InstructDesign-Flow
cd InstructDesign-Flow

Start the services (models download automatically):

docker-compose up -d

The services will be available at:

API: http://localhost:8000
ComfyUI: http://localhost:8188
Frontend: http://localhost:3000 (if running)

🎨 Transformation Capabilities

Pre-trained Transformation Types

Theme Changes: Dark mode, light mode, high contrast
Design Systems: Material Design, iOS, Windows Metro
Style Effects: Glassmorphism, neumorphism, brutalism
Layout Modifications: Mobile-first, dashboard, e-commerce
Industry Specific: SaaS, portfolio, blog, landing page
Creative Themes: Cyberpunk, retro, minimalist, maximalist
Device Mockups: iPhone, MacBook, billboard placements
Accessibility: High contrast, larger fonts, better spacing

📡 API Usage

Basic Transformation

curl -X POST http://localhost:8000/api/generate \
  -H "Content-Type: application/json" \
  -d '{
    "positive_prompt": "Transform this interface to dark mode with modern aesthetics",
    "input_image": "inputs/sample_webpage.png",
    "seed": 42
  }'

Using Presets

curl -X POST http://localhost:8000/api/generate \
  -d '{
    "positive_prompt": "Apply glassmorphism_ui aesthetic",
    "input_image": "https://example.com/screenshot.png",
    "return_base64": true
  }'

Batch Processing

import requests
import json

presets = ["dark_mode", "mobile_responsive", "minimalist_clean"]
for preset in presets:
    response = requests.post("http://localhost:8000/api/generate",
        json={
            "positive_prompt": f"Apply {preset} transformation",
            "input_image": "test.png",
            "seed": 42
        })
    print(f"{preset}: {response.json()}")

🔧 API Endpoints

Endpoint	Method	Description
`/api/generate`	POST	Generate transformation
`/api/status/{prompt_id}`	GET	Check generation status
`/api/images/{filename}`	GET	Download generated image
`/api/queue/status`	GET	View queue statistics
`/health`	GET	Service health check
`/docs`	GET	Interactive API documentation

📁 Project Structure

InstructDesign-Flow/
├── docker-compose.yml         # Service orchestration
├── build/
│   └── Dockerfile            # ComfyUI + models container
├── workflow_api.json         # ComfyUI workflow definition
├── inputs/                   # Input images directory
├── output/                   # Generated outputs
├── deeployd-comfy/          # DeepFloyd-Comfy API wrapper (my own deployment framework)
├── comfyui-deploy-next-example/ # Frontend application
└── training/
    └── config.yaml          # Training configuration

🐳 Docker Architecture

Services

comfyui-instructdesign: GPU-accelerated inference server
- Automatic model downloads (~12GB)
- ComfyUI with custom nodes
- CUDA 12.8 optimized
instructdesign-api: FastAPI wrapper
- Queue management
- WebSocket support
- Priority task processing

🔬 Technical Details

Model Specifications

Base Model: FLUX.1 Kontext [dev]
Fine-tuning Method: LoRA with rank 256
Training Duration: 10,000 steps
Inference Time: 45-70 seconds per 1024x1024 image
VRAM Usage: ~14GB during inference
Consistency: 85%+ instruction following

Performance Metrics

Queue Capacity: 1000 concurrent tasks
Worker Threads: Auto-scaling 1-4
Average Latency: 55 seconds
Success Rate: 95%+
Supported Formats: PNG, JPEG, WebP

🎓 Example Transformations

From Training Dataset

"Transform this webpage to dark mode with purple accents"
"Apply material design 3 principles with rounded corners"
"Convert to cyberpunk aesthetic with neon colors"
"Make this mobile-responsive with touch-friendly buttons"
"Add glassmorphism effects to all card elements"

Advanced Prompts

"Redesign this interface for accessibility, increasing contrast
 and font sizes while maintaining the original brand colors"

"Transform this desktop layout to a mobile-first design with
 hamburger navigation and thumb-friendly interaction zones"

"Apply a premium SaaS aesthetic with gradient backgrounds,
 subtle shadows, and modern typography"

🛠️ Development

Training Your Own Model

Prepare your dataset:
- Collect webpage screenshots (original)
- Create transformed versions (output)
- Write transformation descriptions (captions)
Configure training:

git clone https://github.com/ostris/ai-toolkit
cd ai-toolkit
# Copy our training config
cp /path/to/flux_kontext_training_v4_consolidated.yaml config/

Start training:

python run.py config/flux_kontext_training_v4_consolidated.yaml

Monitoring Training

# TensorBoard logs
tensorboard --logdir=output/flux_kontext_lora_v4_consolidated/logs

# Sample outputs every 1000 steps
ls output/flux_kontext_lora_v4_consolidated/samples/

📚 Resources

Competition: FLUX.1 Kontext Hackathon
Model Weights: HuggingFace Model Hub
Dataset Samples: HuggingFace Datasets
Training Framework: AI Toolkit by Ostris
Base Model: FLUX.1 Kontext [dev]
API Framework: DeepFloyd-Comfy (My ComfyUI deployment solution)
Frontend Foundation: ComfyUI Deploy Next.js Example

🏆 Hackathon Submission

This project was developed for the Black Forest Labs FLUX.1 Kontext [dev] Hackathon:

Development Period: 7 days
Team: Umut Tan
Focus: Web interface transformation through natural language
Innovation: First LoRA fine-tune specifically for web design transformations

🤝 Contributing

We welcome contributions! Areas for improvement:

Additional transformation presets
Frontend UI enhancements
Performance optimizations
Dataset expansion
Documentation improvements

📄 License

This project uses the FLUX.1 Kontext [dev] model. Please refer to:

FLUX.1 Kontext License
Dataset: Research and educational use
Code: MIT License

🙏 Acknowledgments

Core Technologies

Black Forest Labs for FLUX.1 Kontext [dev] model
Comfy.org - ecosystem support
Ostris for the excellent AI Toolkit training framework
ComfyUI community for the powerful inference infrastructure
ComfyDeploy for the Next.js frontend example that served as the foundation for our UI

Hackathon Sponsors

fal.ai - Hackathon co-sponsor and compute infrastructure
Black Forest Labs - Model provider and hackathon host
NVIDIA - Hackathon partner

Infrastructure & Tools

HuggingFace for model and dataset hosting
DeepFloyd-Comfy - My own ComfyUI deployment application used for the API wrapper

Special Thanks

The entire FLUX.1 and ComfyUI community for their support and feedback
All hackathon participants for pushing the boundaries of AI-driven design

🚧 Roadmap

Complete model training (16,000 steps)
Docker containerization with auto-download
API implementation with queue management
Dataset publication on HuggingFace
Frontend with 100+ presets
Web-based playground interface
Fine-tuning guide and tutorials

📧 Contact: [email protected] | 🏆 Hackathon: FLUX.1 Kontext [dev]

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
ComfyUI/models		ComfyUI/models
build		build
comfyui-deploy-next-example		comfyui-deploy-next-example
deeployd-comfy		deeployd-comfy
images		images
inputs		inputs
output		output
.gitignore		.gitignore
PROJECT_DOCUMENTATION.md		PROJECT_DOCUMENTATION.md
README.md		README.md
docker-compose.yml		docker-compose.yml
training_config.yaml		training_config.yaml
workflow_api.json		workflow_api.json

Folders and files

Latest commit

History

Repository files navigation

InstructDesign Flow

🎯 Project Overview

Key Features

📊 Dataset & Training

Training Dataset

Training Infrastructure

Training Configuration Details

Model Artifacts

🚀 Quick Start

Prerequisites

Installation

🎨 Transformation Capabilities

Pre-trained Transformation Types

📡 API Usage

Basic Transformation

Using Presets

Batch Processing

🔧 API Endpoints

📁 Project Structure

🐳 Docker Architecture

Services

🔬 Technical Details

Model Specifications

Performance Metrics

🎓 Example Transformations

From Training Dataset

Advanced Prompts

🛠️ Development

Training Your Own Model

Monitoring Training

📚 Resources

🏆 Hackathon Submission

🤝 Contributing

📄 License

🙏 Acknowledgments

Core Technologies

Hackathon Sponsors

Infrastructure & Tools

Special Thanks

🚧 Roadmap

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages