GraphLAMA: Enabling Efficient Adaptation of Graph Language Models with Limited Annotations

This repository contains the official implementation and resources for GraphLAMA, an approach for efficiently adapting graph language models with limited annotations. The paper has been accepted to KDD 2025.

1. Environment Preparation

# Python
conda create -n graphlama python=3.8
conda activate graphlama

# PyTorch with CUDA 11.7
pip install torch==1.13.0+cu117 torchvision==0.14.0+cu117 torchaudio==0.13.0 --extra-index-url https://download.pytorch.org/whl/cu117

# Optional: support for Vicuna base model via FastChat
pip install "fschat[model_worker,webui]"

# PyG and related packages (for torch 1.13.0 + cu117)
pip install torch_geometric
pip install pyg_lib torch_scatter torch_sparse torch_cluster torch_spline_conv -f https://data.pyg.org/whl/torch-1.13.0+cu117.html

# Project dependencies
pip install -r requirements.txt
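A quick sanity check (not part of the official setup) that the CUDA build of PyTorch and the PyG install both import cleanly:

# Expect 1.13.0+cu117 and True on a CUDA-capable machine
python -c "import torch; print(torch.__version__, torch.cuda.is_available())"
python -c "import torch_geometric; print(torch_geometric.__version__)"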

2. Prepare Models and Data

  • Base LLM weights (if applicable): Vicuna weights are available from the FastChat project (https://github.com/lm-sys/FastChat); a reconstruction sketch follows this list.
  • Pre-training data: we follow the data format used in GraphGPT; please refer to the data preparation instructions at https://github.com/HKUDS/GraphGPT.
  • Test-time tuning data: see data/cora/reshape_cora.py for an example pipeline.
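Vicuna is released as delta weights on top of LLaMA, so the base model must be reconstructed before use. A minimal sketch using FastChat's apply_delta tool, assuming the 7B v1.1 delta and locally available LLaMA-7B weights (all paths are placeholders):

# Reconstruct Vicuna-7B by applying the released delta to LLaMA-7B
python3 -m fastchat.model.apply_delta \
    --base-model-path /path/to/llama-7b \
    --target-model-path /path/to/vicuna-7b-v1.1 \
    --delta-path lmsys/vicuna-7b-delta-v1.1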

3. Pre-training Stage

cd path/to/grapht3
sh ./scripts/tune_script/stage1.sh
sh ./scripts/tune_script/stage2.sh
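Stage 2 presumably consumes the checkpoints produced by stage 1, so run the scripts in order. A convenience pattern for keeping logs (the logs/ directory is our addition, not part of the repository):

mkdir -p logs
sh ./scripts/tune_script/stage1.sh 2>&1 | tee logs/stage1.log
sh ./scripts/tune_script/stage2.sh 2>&1 | tee logs/stage2.log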

4. Test-time Tuning Stage

cd path/to/grapht3
sh ./scripts/tune_script/SFTonGFM_train.sh
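If the test-time tuning split has not been generated yet, the Cora example pipeline from Section 2 can be run first. A sketch assuming reshape_cora.py takes no required arguments (check the script before running):

# Build the reshaped Cora data used for test-time tuning (assumed invocation)
python data/cora/reshape_cora.py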

5. Evaluate

cd path/to/grapht3
sh ./scripts/tune_script/SFTonGFM_eval.sh
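Judging from the repository layout in Section 6, evaluation results for Cora land in output_eva_cora/; this is an inference from the directory name rather than documented behavior:

# Inspect the evaluation outputs (directory name taken from the code structure)
ls -lh output_eva_cora/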

6. Code Structure

.
├─ grapht3/                 # Core GraphLAMA package
│  ├─ model/                # Model registry and graph layers
│  │  └─ graph_layers/      # GNN modules, tokenizer, CLIP-graph
│  ├─ train/                # Training entry points and trainers
│  ├─ serve/                # Serving and web UI (FastChat-style)
│  ├─ eval/                 # Evaluation utilities
│  ├─ protocol/             # API protocol definitions
│  ├─ conversation.py
│  ├─ constants.py
│  └─ utils.py
├─ scripts/                 # Shell scripts for training/eval/serving
│  ├─ tune_script/          # Stage1/2, SFT training/eval scripts
│  ├─ eval_script/
│  └─ serving/
├─ text-graph-grounding/    # Graph-text grounding components and data utils
│  ├─ data/
│  ├─ graph_transformer.py
│  ├─ model_gt.py
│  └─ main_train.py
├─ reshape_wikics/          # Data reshaping for WikiCS
├─ reshape_products/        # Data reshaping for OGBN-Products
├─ data/                    # Example dataset (e.g., cora)
├─ tests/                   # Tests
├─ playground/              # Playground experiments
├─ output_eva_cora/         # Example outputs
├─ Pure_GNN_Cora.py         # GNN baseline example
├─ task_embedding_generate.py
├─ requirements.txt
└─ README.md

7. Paper

  • Title: GraphLAMA: Enabling Efficient Adaptation of Graph Language Models with Limited Annotations
  • Venue: KDD 2025
  • Preprint: arXiv:2506.21559

8. Acknowledgements

This work builds upon open-source efforts, including the GraphGPT framework: https://github.com/HKUDS/GraphGPT. We thank the authors and community for their contributions.
