Inference acceleration framework for breast cancer classification based on UNI and Huawei Ascend NPU, integrating structured pruning, Decoupled Knowledge Distillation (DKD), SVD, and INT8 quantization for edge-cloud deployment. 基于 UNI 病理大模型与华为昇腾 NPU 的乳腺癌分类推理加速方案,集成结构化剪枝、解耦知识蒸馏 (DKD)、SVD 低秩分解及 INT8 量化,实现端云协同部署。
-
Updated
Mar 24, 2026 - Jupyter Notebook