Feature Request from Community:
Related PR: Processor adaptation
- Phi-4-multimodal-support:
Processor: Phi4MultimodalFeatureExtractor, Phi4MultiModalImageProcessorFast, Phi4MultimodalProcessor
Module: Phi4MultimodalAudioConvModule, Phi4MultimodalAudioNemoConvSubsampling, Phi4MultimodalAudioRelativeAttentionBias, adaptive_enc_mask, unfold_tensor
- InterVL-S1:
InternVLProcessor, GotOcr2ImageProcessorFast, InternVLVideoProcessor
- Whisper:
WhisperFeatureExtractor, WhisperProcessor
- Ultravox
WhisperEncoder, UltravoxProcessor
- InternVL 3.5
InternVLProcessor
- Qwen2Audio
Qwen2AudioEncoder
Qwen2AudioProcessor
WhisperFeatureExtractor
- MiniCPM-V4.5
MiniCPMVProcessor
- LLava-Next
LLavaNextImageProcessor
LLavaNextProcessor
LLavaNextForConditionalGeneration
LLavaNextImageProcessor
- LLava-Next-Video
LLavaNextVideoProcessor
Feature Request from Community:
Related PR: Processor adaptation
Processor: Phi4MultimodalFeatureExtractor, Phi4MultiModalImageProcessorFast, Phi4MultimodalProcessor
Module: Phi4MultimodalAudioConvModule, Phi4MultimodalAudioNemoConvSubsampling, Phi4MultimodalAudioRelativeAttentionBias, adaptive_enc_mask, unfold_tensor
InternVLProcessor, GotOcr2ImageProcessorFast, InternVLVideoProcessor
WhisperFeatureExtractor, WhisperProcessor
WhisperEncoder, UltravoxProcessor
InternVLProcessor
Qwen2AudioEncoder
Qwen2AudioProcessor
WhisperFeatureExtractor
MiniCPMVProcessor
LLavaNextImageProcessor
LLavaNextProcessor
LLavaNextForConditionalGeneration
LLavaNextImageProcessor
LLavaNextVideoProcessor