Hi, I've implemented SRT output and chunked processing in my fork https://github.com/woct0rdho/qwen3-asr.cpp . It works like whisper.cpp, and I can generate SRT files for movies that are a few hours long.
Note that my fork is based on @Jaffe2718's fork https://github.com/Jaffe2718/qwen3-asr.cpp , which adds 1.7B model support and Windows support.
Feel free to reuse my code. I can make a PR if you're interested.