Skip to content

PKU-SEC-Lab/DYSL_VLA

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 

Repository files navigation

DySL-VLA: Efficient Vision-Language-Action Model Inference via Dynamic-Static Layer-Skipping for Robot Manipulation

DySL-VLA is a novel framework that accelerates Vision-Language-Action (VLA) model inference for robot manipulation by dynamically skipping unnecessary layers based on action importance. Developing VLA models on real-world robots is challenging due to the typically limited computation and memory capacities available on robotic platforms. DySL-VLA addresses this based on the varying importance levels of different actions: critical steps demand high precision, while less important ones can tolerate more variance. DySL-VLA uses different layer-skipping strategies to acclerate VLA inference for different action importance levels.

Key Innovations 🚀

  • Dynamic-Static Layer Skipping: Statically keeps the informative layers and dynamically skips unnecessary layers

  • Prior-Post Skipping Guidance: Smart action importance evaluation and adaptive layer skipping for difference importance levels

  • Skip-Aware Two-Stage Knowledge Distillation: Efficiently transforms standard VLAs into dynamic architectures

Install

We develop DySL-VLA based on RoboFlamingo and OpenVLA-oft. Please follow the instructions in the corresponding file.

Citation

If you find this work useful for your research, please cite our paper:

@article{yang2025dysl,
  title={DySL-VLA: Efficient Vision-Language-Action Model Inference via Dynamic-Static Layer-Skipping for Robot Manipulation},
  author={Yang, Zebin and Qi, Yijiahao and Xie, Tong and Yu, Bo and Liu, Shaoshan and Li, Meng},
  journal={arXiv preprint arXiv:2602.22896},
  year={2025}
}

About

Dysl-vla:Official code for "DySL-VLA: Efficient Vision-Language-Action Model Inference via Dynamic-Static Layer-Skipping for Robot Manipulation". A framework that dynamically skips VLA layers based on action importance, achieving 3.75× speedup and 85.7× training reduction on robotic manipulation tasks.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages