Cloudtype deployment repo for a standalone vLLM OpenAI-compatible server.
- Create a new app from this GitHub repository.
- Use `dockerfile` as the build file.
- (Recommended) Attach a persistent volume and mount it to `/data`.
- Set environment variables:
  - `MODEL_NAME` (required), example: `meta-llama/Meta-Llama-3-8B-Instruct`
  - `PORT` (optional), default: `8000`
  - `HF_HOME` (optional), default: `/data/huggingface`
  - `HUGGING_FACE_HUB_TOKEN` (optional): required for gated/private models
- Deploy and copy the app URL.
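The environment variables above, as they might look in Cloudtype's variable editor (values are illustrative; the token line is only needed for gated or private models):

```shell
MODEL_NAME=meta-llama/Meta-Llama-3-8B-Instruct
PORT=8000
HF_HOME=/data/huggingface
# HUGGING_FACE_HUB_TOKEN=hf_...   # uncomment and fill in for gated/private models
```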
Model weights are large (often multiple GB). This repo intentionally does not bake weights into the Docker image.
Instead, vLLM downloads the model at runtime and caches it under `HF_HOME`.
Mounting `/data` as a persistent volume prevents re-downloading the weights on every deploy.
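As a sketch of why the volume matters: the Hugging Face hub cache stores each model in a `models--<org>--<name>` directory under `$HF_HOME/hub`. The model name below is an illustrative example, not a requirement of this repo.

```python
import os

# Where the hub cache would place the example model's weights.
# Assumes the standard hub cache layout under HF_HOME.
hf_home = os.environ.get("HF_HOME", "/data/huggingface")
model = "meta-llama/Meta-Llama-3-8B-Instruct"
cache_dir = os.path.join(hf_home, "hub", "models--" + model.replace("/", "--"))
print(cache_dir)
```

If `/data` is a persistent volume, this directory survives redeploys and the multi-GB download happens only once.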
Set the bot environment variable:
`VLLM_BASE_URL=https://<your-cloudtype-vllm-url>/v1`
Then redeploy the bot service.
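A minimal sketch of how the bot can call the deployed server through its OpenAI-compatible API, using only the standard library. The fallback URL and model name are illustrative assumptions; in the bot, `VLLM_BASE_URL` comes from the environment variable set above.

```python
import json
import os
import urllib.request

# Base URL of the deployed vLLM server (example fallback is a placeholder).
base_url = os.environ.get("VLLM_BASE_URL", "https://my-app.cloudtype.app/v1")

# OpenAI-style chat completions request; model name matches MODEL_NAME.
payload = {
    "model": "meta-llama/Meta-Llama-3-8B-Instruct",
    "messages": [{"role": "user", "content": "Hello!"}],
}
req = urllib.request.Request(
    base_url.rstrip("/") + "/chat/completions",
    data=json.dumps(payload).encode(),
    headers={"Content-Type": "application/json"},
)
# resp = urllib.request.urlopen(req)  # uncomment to call a live deployment
```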