GraphDO

Repository for "Can Graph Descriptive Order Affect Solving Graph Problems with LLMs?"

Setup

uv sync                      # core deps (networkx, numpy, ...)
uv sync --extra local        # + torch/transformers for local models

Usage

# 0. Generate graphs (T1-T5: ER random graphs)
uv run python graph_generator.py --num_graphs 280   # paper-scale (280 graphs/task)

# 0b. Generate graphs (T6: node classification, requires torch-geometric)
uv run python graph_generator.py --tasks node_classification --num_graphs 50 --datasets cora citeseer pubmed

# 1. Generate descriptions
uv run python generate.py                           # all tasks, all orders
uv run python generate.py --tasks connectivity      # specific task

# 2. Run model tests
uv run python main.py --model Llama-2-7b-chat-hf --gpu_id 0 --description_dir ./descriptions
uv run python main.py --model deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B --description_dir ./descriptions

Graph Data

Use graph_generator.py to generate graph data for experiments:

# T1-T5: ER random graphs (n ∈ [5, 15], p = 0.3)
uv run python graph_generator.py --num_graphs 280

# T6: Node classification (Ego + Forest Fire sampling from Planetoid datasets)
# Cora/Citeseer/Pubmed will be automatically downloaded to ./data on first run
uv run python graph_generator.py --tasks node_classification --num_graphs 50 --datasets cora citeseer pubmed

Structure

Path	Description
`graph/*.jsonl`	Graph data in JSONL format
`graph_generator.py`	Generates graphs for all 6 tasks (ER for T1-T5, Planetoid sampling for T6)
`generate.py`	Description generation entry point
`main.py`	LLM testing entry point
`data_loader.py`	Reads JSONL graph data
`graph_ordering.py`	5 edge ordering algorithms (random, BFS, DFS, PR, PPR)
`description_generator.py`	Generates NL descriptions with different orderings
`text_encoder.py`	Graph-to-text encoding (adjacency format)
`graphdo.py`	LLM call wrapper and prompt builder
`model_loader.py`	HuggingFace model loading
`evaluator.py`	Scores model answers (accuracy)
`examples.py`	Few-shot examples for all tasks
`info.py`	Model path configuration

Tasks & Orderings

Tasks: connectivity · cycle · shortest_path · hamilton · topology · node_classification

Orderings: random (baseline) · bfs · dfs · pagerank · ppr

Prompt styles: zero_shot · zero_shot_cot · few_shot · cot · cot_bag

Model Config

Add local model paths in info.py.

Citation

@inproceedings{ge-etal-2025-graph,
    title = "Can Graph Descriptive Order Affect Solving Graph Problems with {LLM}s?",
    author = "Ge, Yuyao  and
      Liu, Shenghua  and
      Bi, Baolong  and
      Wang, Yiwei  and
      Mei, Lingrui  and
      Feng, Wenjie  and
      Chen, Lizhe  and
      Cheng, Xueqi",
    editor = "Che, Wanxiang  and
      Nabende, Joyce  and
      Shutova, Ekaterina  and
      Pilehvar, Mohammad Taher",
    booktitle = "Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)",
    month = jul,
    year = "2025",
    address = "Vienna, Austria",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2025.acl-long.321/",
    doi = "10.18653/v1/2025.acl-long.321",
    pages = "6404--6420",
    ISBN = "979-8-89176-251-0",
    abstract = "Large language models (LLMs) have achieved significant success in reasoning tasks, including mathematical reasoning and logical deduction. Among these reasoning tasks, graph problems stand out due to their complexity and unique structural characteristics, attracting considerable attention from researchers. Previous studies have explored LLMs' graph reasoning abilities through various techniques, such as different encoding methods for graph structures and the use of carefully designed prompts. However, a critical factor has been mostly overlooked: the prompt sequential order in which graph descriptions are presented to the models. In this study, we present the first comprehensive analysis of how the order of graph descriptions impacts LLM performance. Specifically, we comprehensively evaluate four graph description orders across six graph problems using six mainstream LLMs. The results reveal that: (1) ordered graph descriptions significantly improve LLMs' comprehension of graph structures; (2) the robustness of LLMs to graph description order varies across different tasks; and (3) the impact of graph order on performance is closely related to the inherent characteristics of tasks. This study provides a critical advancement in the application of LLMs for solving graph-related problems, paving the way for future research to optimize model performance through strategic graph description ordering."
}

Acknowledgments

This paper references code from GraphLLM and NLGraph.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

GraphDO

Setup

Usage

Graph Data

Structure

Tasks & Orderings

Model Config

Citation

Acknowledgments

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
graph		graph
.gitignore		.gitignore
README.md		README.md
README_CN.md		README_CN.md
data_loader.py		data_loader.py
description_generator.py		description_generator.py
evaluator.py		evaluator.py
examples.py		examples.py
generate.py		generate.py
graph_generator.py		graph_generator.py
graph_ordering.py		graph_ordering.py
graphdo.py		graphdo.py
info.py		info.py
main.py		main.py
model_loader.py		model_loader.py
pyproject.toml		pyproject.toml
text_encoder.py		text_encoder.py

Folders and files

Latest commit

History

Repository files navigation

GraphDO

Setup

Usage

Graph Data

Structure

Tasks & Orderings

Model Config

Citation

Acknowledgments

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages