Mobile-Agent-v3.5

📢 News

🔥[Feb. 2026] We release the GUI-Owl 1.5 model family on HuggingFace.

📍 TODO

Open source GUI-Owl 1.5 model weights
Open source GUI-Owl 1.5 model cookbook
Deploy Mobile-Agent-v3.5 on your own device
Open source evaluation code on benchmarks

Introduction

GUI-Owl 1.5 is the next-generation native GUI agent model family built on Qwen3-VL. It supports multi-platform GUI automation across desktops, mobile devices, browsers, and more. Powered by a scalable hybrid data flywheel, unified agent capability enhancement, and multi-platform environment RL (MRPO), GUI-Owl 1.5 offers a full spectrum of models:

Model	HuggingFace
GUI-Owl-1.5-2B-Instruct	🤗 Link
GUI-Owl-1.5-4B-Instruct	🤗 Link
GUI-Owl-1.5-8B-Instruct	🤗 Link
GUI-Owl-1.5-8B-Thinking	🤗 Link
GUI-Owl-1.5-32B-Instruct	🤗 Link
GUI-Owl-1.5-32B-Thinking	🤗 Link

Key highlights:

🏆 State-of-the-art among multi-platform GUI models on OSWorld-Verified, AndroidWorld, Mobile-World, WindowsAA, ScreenSpot-v2, ScreenSpot-Pro, and more.
🔧 Tool & MCP calling: Native support for external tool invocation and MCP server coordination, achieving top performance on OSWorld-MCP and Mobile-World.
🧠 Long-horizon memory: Built-in memory capability without external workflow orchestration, leading all native agent models on MemGUI-Bench.
🤝 Multi-agent ready: Serves both as a standalone end-to-end agent and as specialized roles (planner, executor, verifier, notetaker) within the Mobile-Agent-v3.5 framework.
⚡ Instruct & Thinking variants: Smaller instruct models for fast inference and edge deployment; larger thinking models for complex tasks requiring planning and reflection.

Deploy Mobile-Agent-v3.5 on Your Mobile Device

❗ At present, only Android OS support tool debugging. Other systems, such as iOS, do not support the use of Mobile-Agent for the time being.

Install Dependencies

pip install qwen_agent
pip install qwen_vl_utils
pip install numpy

Preparation for Connecting Mobile Device with ADB

Download the Android Debug Bridge.
Turn on ADB debugging on your Android phone (enable Developer Options first). For HyperOS, also enable "USB Debugging (Security Settings)".
Connect your phone to the computer with a data cable and select "Transfer files".
Test your ADB environment: adb devices. If connected devices are displayed, you're ready.
On Mac/Linux, ensure ADB permissions: sudo chmod +x /path/to/adb
On Windows, use: xx\xx\adb.exe

Install ADB Keyboard on Your Mobile Device

Download the ADB Keyboard APK.
Install on your mobile device.
Switch the default input method to "ADB Keyboard" in system settings.

Run

cd Mobile-Agent-v3.5/mobile_use
python run_gui_owl_1_5_for_mobile.py \
    --adb_path "Your ADB path" \
    --api_key "Your api key of vllm service" \
    --base_url "Your base url of vllm service" \
    --model "Your model name of vllm service" \
    --instruction "The instruction you want Mobile-Agent-v3.5 to complete" \
    --add_info "Some supplementary knowledge, can also be empty"

Note

GUI-Owl 1.5 outputs relative coordinates (0–1000) by default.

Deploy Mobile-Agent-v3.5 on Your Computer Device

Install Dependencies

pip install pyautogui
pip install pyperclip

Run

cd Mobile-Agent-v3.5/computer_use
python run_gui_owl_1_5_for_pc.py \
    --api_key "Your api key of vllm service" \
    --base_url "Your base url of vllm service" \
    --model "Your model name of vllm service" \
    --instruction "The instruction you want Mobile-Agent-v3.5 to complete" \
    --add_info "Some supplementary knowledge, can also be empty"

Note

GUI-Owl 1.5 outputs relative coordinates (0–1000) by default.

Deploy Mobile-Agent-v3.5 on Your Browser

Install Dependencies

python -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt
playwright install chromium

Configure Environment Variables

# Model API keys (required)
export API_KEY="sk-xxx"          # Agent model API
export OMNI_API_KEY=""
export EVAL_API_KEY="sk-xxx"               # Evaluation model API

Run

cd cd Mobile-Agent-v3.5/browser_use
python run_gui_owl_1_5_for_web.py \
  --task "Search for 'Tongyi Lab'" \
  --web "https://bing.com" \
  --base_url "https://dashscope.aliyuncs.com/compatible-mode/v1/chat/completions" \
  --model "claude-sonnet-4-5-20250929" \
  --output_dir results/custom \
  --image_type base64 \
  --headless \
  --use_css_som

Detailed configuration

Please refer to Link.

Evaluation on AndroidWorld

Please follow the official code repository to install the Android emulator and necessary dependencies.
Install the dependencies.

cd Mobile-Agent-v3.5/android_world
pip install -r requirements.txt

Fill in your vllm service information in the run_guiowl15.sh or run_ma35.sh script, including api_key, base_url, and model.
Run the evaluation.

sh run_guiowl15.sh
sh run_ma35.sh

Evaluation on Grounding Benchmarks

Please download the images and annotations for the grounding benchmarks from their official repository.
Install the dependencies required by the qwen model.

pip install qwen_agent
pip install qwen_vl_utils

Fill in your information in the run_grounding.sh, including MODEL_PATH, DS_PATH, SAVE_PATH and EVAL_TYPE.
Run the evaluation.

cd Mobile-Agent-v3.5/grounding_and_kb
sh run_grounding.sh

Evaluation on Tool Use Benchmark

OSWorld-MCP: Please follow the official code repository to run the evalutation
MobileWorld: Please follow the official code repository to run the evalutation

Evaluation on GUI Knowledge Benchmark

Download the images and annotations for the Knowledge Bench from the official repository. Follow the official instructions to draw the GUI actions, and save the annotated images to DS_PATH/AnnotateImage.
Install the dependencies required by the qwen model.

pip install qwen_agent
pip install qwen_vl_utils

Fill in your information in the run_gui_kb.sh, including MODEL_PATH, DS_PATH, SAVE_PATH and EVAL_TYPE.
Run the evaluation.

cd Mobile-Agent-v3.5/grounding_and_kb
sh run_gui_kb.sh

Evaluation on Web Benchmark

Install Dependencies

python -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt
playwright install chromium

Configure Environment Variables

# Model API keys (required)
export API_KEY="sk-xxx"          # Agent model API
export OMNI_API_KEY=""
export EVAL_API_KEY="sk-xxx"               # Evaluation model API

Benchmark task

WebArena/VisualWebArena: Start the corresponding environment services in advance (configure ports per the official docs).
Port mapping: Ports are preset in the code (see visualwebarena_url_map / webarena_url_map). To change them, modify BASE_URL and the port numbers.
Task data: Ensure data/merged_test_raw.json exists (includes task definitions, login requirements, initial screenshots, etc.).

# WebVoyager
cd Mobile-Agent-v3.5/web_benchmark
python main_for_eval.py \
  --task "Find out which four teams the NFC North contains in the NFL on ESPN." \
  --web https://www.espn.com/ \
  --output_dir results/WebVoyager \
  --image_type file \
  --task_id "validation_WebVoyager__ESPN--41" \
  --use_css_som \
  --headless

Performance

End-to-End Online Benchmarks

Model	OSWorld-Verified	AndroidWorld	OSWorld-MCP	Mobile-World	WindowsAA	WebArena	VisualWebArena	WebVoyager	Online-Mind2Web
GUI-Owl-1.5-2B-Instruct	43.5	67.9	33.0	31.3	25.8	-	-	-	-
GUI-Owl-1.5-4B-Instruct	48.2	69.8	31.7	32.3	29.4	-	-	-	-
GUI-Owl-1.5-8B-Instruct	52.3	69.0	41.8	41.8	31.7	45.7	39.4	69.9	41.7
GUI-Owl-1.5-8B-Thinking	52.9	71.6	38.8	33.3	35.1	46.7	40.8	78.1	48.6
GUI-Owl-1.5-32B-Instruct	56.5	69.4	47.6	46.8	44.8	-	-	-	-
GUI-Owl-1.5-32B-Thinking	56.0	68.2	43.8	42.8	44.1	48.4	46.6	82.1	-

Grounding Benchmarks

Please refer to the technical report for detailed results on ScreenSpot-v2, ScreenSpot-Pro, OSWorld-G, MMBench-GUI, and more.

Quick Start

from transformers import Qwen3VLForConditionalGeneration, AutoProcessor

model_name = "mPLUG/GUI-Owl-1.5-8B-Instruct"
model = Qwen3VLForConditionalGeneration.from_pretrained(
    model_name, torch_dtype="auto", device_map="auto"
)
processor = AutoProcessor.from_pretrained(model_name)

messages = [
    {
        "role": "user",
        "content": [
            {"type": "image", "image": "screenshot.png"},
            {"type": "text", "text": "Click on the search bar."},
        ],
    }
]

inputs = processor.apply_chat_template(
    messages,
    tokenize=True,
    add_generation_prompt=True,
    return_dict=True,
    return_tensors="pt"
)
inputs = inputs.to(model.device)
# Inference: Generation of the output
generated_ids = model.generate(**inputs, max_new_tokens=128)
generated_ids_trimmed = [
    out_ids[len(in_ids) :] for in_ids, out_ids in zip(inputs.input_ids, generated_ids)
]
output_text = processor.batch_decode(
    generated_ids_trimmed, skip_special_tokens=True, clean_up_tokenization_spaces=False
)
print(output_text)

Citation

If you find GUI-Owl 1.5 useful in your research, please cite:

@article{xu2026mobile,
  title={Mobile-Agent-v3. 5: Multi-platform Fundamental GUI Agents},
  author={Xu, Haiyang and Zhang, Xi and Liu, Haowei and Wang, Junyang and Zhu, Zhaozai and Zhou, Shengjie and Hu, Xuhao and Gao, Feiyu and Cao, Junjie and Wang, Zihua and others},
  journal={arXiv preprint arXiv:2602.16855},
  year={2026}
}

Acknowledgments

GUI-Owl 1.5 is built upon Qwen3-VL. We thank the Qwen team for their excellent open-source foundation models.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

Mobile-Agent-v3.5

📢 News

📍 TODO

Introduction

Deploy Mobile-Agent-v3.5 on Your Mobile Device

Install Dependencies

Preparation for Connecting Mobile Device with ADB

Install ADB Keyboard on Your Mobile Device

Run

Note

Deploy Mobile-Agent-v3.5 on Your Computer Device

Install Dependencies

Run

Note

Deploy Mobile-Agent-v3.5 on Your Browser

Install Dependencies

Configure Environment Variables

Run

Detailed configuration

Evaluation on AndroidWorld

Evaluation on Grounding Benchmarks

Evaluation on Tool Use Benchmark

Evaluation on GUI Knowledge Benchmark

Evaluation on Web Benchmark

Install Dependencies

Configure Environment Variables

Benchmark task

Performance

End-to-End Online Benchmarks

Grounding Benchmarks

Quick Start

Citation

Acknowledgments

Name		Name	Last commit message	Last commit date
parent directory ..
android_world_v3.5		android_world_v3.5
browser_use		browser_use
computer_use		computer_use
cookbook		cookbook
grounding_and_kb		grounding_and_kb
mobile_use		mobile_use
web_benchmark		web_benchmark
README.md		README.md
README_zh.md		README_zh.md

FilesExpand file tree

Mobile-Agent-v3.5

Directory actions

More options

Directory actions

More options

Latest commit

History

Mobile-Agent-v3.5

Folders and files

parent directory

README.md

Mobile-Agent-v3.5

📢 News

📍 TODO

Introduction

Deploy Mobile-Agent-v3.5 on Your Mobile Device

Install Dependencies

Preparation for Connecting Mobile Device with ADB

Install ADB Keyboard on Your Mobile Device

Run

Note

Deploy Mobile-Agent-v3.5 on Your Computer Device

Install Dependencies

Run

Note

Deploy Mobile-Agent-v3.5 on Your Browser

Install Dependencies

Configure Environment Variables

Run

Detailed configuration

Evaluation on AndroidWorld

Evaluation on Grounding Benchmarks

Evaluation on Tool Use Benchmark

Evaluation on GUI Knowledge Benchmark

Evaluation on Web Benchmark

Install Dependencies

Configure Environment Variables

Benchmark task

Performance

End-to-End Online Benchmarks

Grounding Benchmarks

Quick Start

Citation

Acknowledgments