Going over the code and the technical report of the new TTS model from Microsoft Research.
Found the 7B model under a different HF account -> https://huggingface.co/WestZhang/VibeVoice-Large-pt