Official PyTorch implementation of TransAdapter: Vision Transformer for Feature-Centric Unsupervised Domain Adaptation.
Authors: Abdullah Enes Doruk, Erhan Oztop, and Hasan F. Ates
Unsupervised Domain Adaptation (UDA) aims to utilize labeled data from a source domain to solve tasks in an unlabeled target domain, often hindered by significant domain gaps. Traditional CNN-based methods struggle to fully capture complex domain relationships, motivating the shift to vision transformers like the Swin Transformer, which excel in modeling both local and global dependencies. In this work, we propose a novel UDA approach leveraging the Swin Transformer with three key modules. A Graph Domain Discriminator enhances domain alignment by capturing inter-pixel correlations through graph convolutions and entropy-based attention differentiation. An Adaptive Double Attention module combines Windows and Shifted Windows attention with dynamic reweighting to align long-range and local features effectively. Finally, a Cross-Feature Transform modifies Swin Transformer blocks to improve generalization across domains. Extensive benchmarks confirm the state-of-the-art performance of our versatile method, which requires no task-specific alignment modules, establishing its adaptability to diverse applications.
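For intuition only, the sketch below illustrates the kind of dynamic reweighting the Adaptive Double Attention module performs: two attention branches (standing in for Swin's Windows and Shifted Windows attention) are fused with a learned, per-token weight. This is not the paper's implementation; all class and parameter names are hypothetical, and a generic nn.MultiheadAttention replaces the actual Swin window attention.

```python
# Minimal, illustrative sketch (not the authors' code) of fusing a local and a
# shifted attention branch with a dynamically predicted weight.
import torch
import torch.nn as nn

class AdaptiveDoubleAttentionSketch(nn.Module):
    def __init__(self, dim, num_heads=4):
        super().__init__()
        # Stand-ins for Swin's window / shifted-window attention branches.
        self.win_attn = nn.MultiheadAttention(dim, num_heads, batch_first=True)
        self.swin_attn = nn.MultiheadAttention(dim, num_heads, batch_first=True)
        # Small gate that predicts a per-token mixing weight in [0, 1].
        self.gate = nn.Sequential(nn.Linear(dim, 1), nn.Sigmoid())

    def forward(self, x):
        # x: (batch, tokens, dim); real Swin blocks would first partition into windows.
        local_out, _ = self.win_attn(x, x, x)
        shifted_out, _ = self.swin_attn(x, x, x)
        w = self.gate(x)                      # dynamic per-token reweighting
        return w * local_out + (1.0 - w) * shifted_out

if __name__ == "__main__":
    x = torch.randn(2, 49, 96)                              # one 7x7 window of 96-dim tokens
    print(AdaptiveDoubleAttentionSketch(96)(x).shape)       # torch.Size([2, 49, 96])
```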
This project was tested with PyTorch 2.4.1 and CUDA 12.4. However, it should also work with CUDA 11.x and the corresponding PyTorch versions.
a. Create environment
conda env create -f environment.yml
b. Activate environment
conda activate dom
Install the fused window process kernel for acceleration; it is activated by passing --fused_window_process in the training script.
cd kernels/window_process
python setup.py install #--user
Download the following model and put it in checkpoints/
- Swin-B (ImageNet-22K): swin_base_patch4_window7_224_22k.pth
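To confirm the download before training, you can load the checkpoint with plain PyTorch; official Swin checkpoints typically store the weights under a "model" key. This quick check is our own addition, not part of the training scripts.

```python
import torch

# Quick sanity check for the downloaded Swin-B checkpoint (illustrative only).
# Adjust the path to wherever you placed the file (the commands below use checkpoint/).
ckpt = torch.load("checkpoints/swin_base_patch4_window7_224_22k.pth", map_location="cpu")

# Official Swin checkpoints usually wrap the state dict in a "model" key;
# fall back to the raw dict if that key is absent.
state_dict = ckpt.get("model", ckpt)
print(f"{len(state_dict)} tensors, e.g. {next(iter(state_dict))}")
```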
Download the data and place it under the current data/ directory. Download images from Office-31, Office-Home, and VisDA-2017 and put them under data/. For example, images of Office-31 should be located at data/office/domain_adaptation_images/.
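The training commands below take image list files (e.g., data/office/webcam_list.txt). In many UDA codebases such lists contain one "image_path label" pair per line; assuming the same format here (an assumption, not stated in this README), a quick check that your lists and image paths line up might look like:

```python
import os

# Hypothetical sanity check for a dataset list file; assumes each line is
# "<image_path> <integer label>", which is common in UDA repos but not
# confirmed by this README.
def check_list_file(list_path, root="data/office"):
    missing = 0
    with open(list_path) as f:
        for line in f:
            line = line.strip()
            if not line:
                continue
            path, label = line.rsplit(maxsplit=1)
            int(label)  # raises if the label column is not an integer
            if not (os.path.exists(path) or os.path.exists(os.path.join(root, path))):
                missing += 1
    print(f"{list_path}: {missing} missing images")

check_list_file("data/office/webcam_list.txt")
```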
The training command can be found in the bash script scripts/training_swin.sh.
python train_swin.py --train_batch_size 64 --dataset office \
--source_list data/office/webcam_list.txt \
--target_list data/office/amazon_list.txt \
--test_list_source data/office/webcam_list.txt \
--test_list_target data/office/amazon_list.txt \
--pretrained_dir checkpoint/swin_base_patch4_window7_224_22k.pth \
--num_steps 5000 --num_classes 31
The training command can be found in the bash script scripts/training_transadapter.sh.
If you do not want to train the baseline model first, you can set --pseudo_lab False (pseudo labeling is sketched below the command).
python train_transadapter.py --train_batch_size 64 --dataset office \
--source_list data/office/webcam_list.txt \
--target_list data/office/amazon_list.txt \
--test_list_source data/office/webcam_list.txt \
--test_list_target data/office/amazon_list.txt \
--num_classes 31 \
--pretrained_dir checkpoint/swin_base_patch4_window7_224_22k.pth \
--num_steps 15000 --gamma 0.1 --beta 0.01 --theta 0.0001 \
--pseudo_lab True
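For context on the --pseudo_lab option: pseudo labeling in UDA typically means taking the source-trained baseline's confident predictions on target images and treating them as labels during adaptation. The sketch below shows that general idea with a confidence threshold; it is a generic illustration, not necessarily how train_transadapter.py implements it, and the function name and the 0.9 threshold are assumptions.

```python
import torch

# Generic confidence-thresholded pseudo labeling (illustrative only).
# `model` would be the source-trained baseline; `target_images` a batch of
# unlabeled target-domain images. Names and the 0.9 threshold are assumptions.
@torch.no_grad()
def make_pseudo_labels(model, target_images, threshold=0.9):
    probs = torch.softmax(model(target_images), dim=1)
    conf, labels = probs.max(dim=1)
    keep = conf >= threshold          # only keep confident predictions
    return labels[keep], keep         # pseudo labels and a mask of kept samples
```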
- We thank the authors of TVT and Swin Transformer for their open-source code.
@article{doruk2024transadapter,
title={TransAdapter: Vision Transformer for Feature-Centric Unsupervised Domain Adaptation},
author={Doruk, A and Oztop, Erhan and Ates, Hasan F},
journal={arXiv preprint arXiv:2412.04073},
year={2024}
}