VisionCraft - AI Product Image Creator

An AI-powered platform for creating, editing, and transforming product images into professional marketing materials. Built with FastAPI, React, and powered by cutting-edge AI models including FLUX.1-Kontext for advanced image editing.

🚀 Features

Core Capabilities

AI Image Generation: Create product images from text descriptions
Advanced Image Editing: Professional editing with FLUX.1-Kontext
Color Variations: Generate multiple color variants of products
Ad Shot Creation: Transform products into surreal advertisement scenes
Video Animation: Create animated product videos with Kling 2.5

AI Models & Services

FLUX.1-Kontext: Advanced image-to-image editing and generation
FAL Kling 2.5: High-quality video animation generation
Hunyuan 3D 2.0: 2D to 3D model conversion
Modal Labs: Scalable GPU compute for AI model deployment

🏗️ Architecture

Backend Services

├── app.py                    # Main FastAPI application
├── modal_flux_kontext.py     # FLUX.1-Kontext Modal deployment

Frontend Application

├── frontend/
│   ├── index.html           # Main HTML entry point
│   ├── App.js              # React application root
│   ├── components/         # Reusable UI components
│   ├── tools/              # Feature-specific panels
│   └── styles/             # CSS styling

🎨 FLUX.1-Kontext Integration

What is FLUX.1-Kontext?

FLUX.1-Kontext is a state-of-the-art image editing model that excels at:

Context-aware editing: Understanding and preserving image context
Professional quality: Commercial-grade image transformations
Flexible prompting: Natural language descriptions for edits
Style consistency: Maintaining visual coherence across edits

How It's Used in VisionCraft

1. Modal Labs Deployment (`modal_flux_kontext.py`)

@app.cls(
    image=image,
    gpu="A100-40GB",
    volumes={"/models": volume},
    timeout=900,
    secrets=[modal.Secret.from_name("huggingface")]
)
class FluxKontext:
    def load_model(self):
        self.pipe = FluxKontextPipeline.from_pretrained(
            "black-forest-labs/FLUX.1-Kontext-dev",
            torch_dtype=torch.bfloat16
        )

2. Image Editing Workflow

Input: Base64 encoded image + text prompt
Processing: FLUX.1-Kontext applies context-aware transformations
Output: High-quality edited image maintaining original context

3. Use Cases in VisionCraft

Color Variations: "Change the product color to red while maintaining lighting"
Style Transfer: "Make this product look premium and luxurious"
Background Editing: "Place this product in a modern office setting"
Material Changes: "Convert this plastic product to metal finish"

API Integration

@app.post("/api/generate")
async def generate_image(request: GenerateRequest):
    modal_url = os.getenv("MODAL_FLUX_URL")
    
    request_data = {
        "image_base64": request.image_base64,
        "prompt": request.prompt,
        "guidance_scale": request.guidance_scale,
        "num_inference_steps": request.num_inference_steps
    }
    
    async with httpx.AsyncClient(timeout=300.0) as client:
        response = await client.post(f"{modal_url}/generate", json=request_data)

🛠️ Installation & Setup

Prerequisites

Python 3.11+
Node.js 16+
Modal Labs account
FAL AI account
Hugging Face account with FLUX.1-Kontext access

Environment Setup

Clone the repository

git clone https://github.com/StarAtNyte/VisionCraft.git
cd VisionCraft

Install Python dependencies

pip install -r requirements.txt

Install Node.js dependencies

npm install

Configure environment variables

# Create .env file
FAL_KEY=your_fal_api_key
MODAL_FLUX_URL=your_modal_deployment_url
HF_TOKEN=your_huggingface_token

Modal Labs Deployment

Install Modal CLI

pip install modal
modal setup

Deploy FLUX.1-Kontext service

modal deploy modal_flux_kontext.py

Get deployment URL

modal app list
# Copy the web app URL to your .env file

FAL AI Setup

Get API key from FAL AI
Add to .env file as FAL_KEY

🚀 Running the Application

Development Mode

# Start main FastAPI server
uvicorn app:app --reload --port 8000

# Start FAL animation service (optional)
python fal_kling_animate.py

# Start Hunyuan 3D service (optional)
cd wan-huggingface && python app.py

Production Mode

# Using Gunicorn
gunicorn app:app -w 4 -k uvicorn.workers.UvicornWorker --bind 0.0.0.0:8000

Access the Application

Main App: http://localhost:8000
Studio Interface: http://localhost:8000/
Marketing Page: http://localhost:8000/home

📖 Usage Guide

1. Upload & Import

Upload product images (JPG, PNG, WEBP)
Import from URLs
Drag & drop support

2. Color Variations with FLUX.1-Kontext

// Example API call
const response = await fetch('/api/generate', {
    method: 'POST',
    headers: { 'Content-Type': 'application/json' },
    body: JSON.stringify({
        image_base64: imageData,
        prompt: "Change the product color to blue while maintaining the original lighting and shadows",
        guidance_scale: 3.5,
        num_inference_steps: 28
    })
});

3. Ad Shot Creation

Transform products into surreal advertisement scenes
Context-aware background replacement
Professional lighting adjustments

4. Video Animation

Generate product videos with FAL Kling 2.5
Multiple animation styles available
Customizable duration and quality
Ready for AR/VR applications

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
frontend		frontend
.gitignore		.gitignore
README.md		README.md
app.py		app.py
fal_kling_animate.py		fal_kling_animate.py
modal_flux_kontext.py		modal_flux_kontext.py
package-lock.json		package-lock.json
package.json		package.json
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

VisionCraft - AI Product Image Creator

🚀 Features

Core Capabilities

AI Models & Services

🏗️ Architecture

Backend Services

Frontend Application

🎨 FLUX.1-Kontext Integration

What is FLUX.1-Kontext?

How It's Used in VisionCraft

1. Modal Labs Deployment (`modal_flux_kontext.py`)

2. Image Editing Workflow

3. Use Cases in VisionCraft

API Integration

🛠️ Installation & Setup

Prerequisites

Environment Setup

Modal Labs Deployment

FAL AI Setup

🚀 Running the Application

Development Mode

Production Mode

Access the Application

📖 Usage Guide

1. Upload & Import

2. Color Variations with FLUX.1-Kontext

3. Ad Shot Creation

4. Video Animation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

VisionCraft - AI Product Image Creator

🚀 Features

Core Capabilities

AI Models & Services

🏗️ Architecture

Backend Services

Frontend Application

🎨 FLUX.1-Kontext Integration

What is FLUX.1-Kontext?

How It's Used in VisionCraft

1. Modal Labs Deployment (modal_flux_kontext.py)

2. Image Editing Workflow

3. Use Cases in VisionCraft

API Integration

🛠️ Installation & Setup

Prerequisites

Environment Setup

Modal Labs Deployment

FAL AI Setup

🚀 Running the Application

Development Mode

Production Mode

Access the Application

📖 Usage Guide

1. Upload & Import

2. Color Variations with FLUX.1-Kontext

3. Ad Shot Creation

4. Video Animation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

1. Modal Labs Deployment (`modal_flux_kontext.py`)

Packages