Spectrogram Decoder - Search News

SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation

We propose an encoder-decoder for open-vocabulary semantic segmentation comprising a hierarchical encoder-based cost map generation and a gradual fusion decoder. We introduce a category early ...

IEEE

CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR

Abstract: In this work, we propose CleanMel, a single-channel Mel-spectrogram denoising and dereverberation network for improving both speech quality and automatic speech recognition (ASR) performance ...

IEEE

PerformSinger: Multimodal Singing Voice Synthesis Leveraging Synchronized Lip Cues from Singing Performance Videos

Abstract: Existing singing voice synthesis (SVS) models largely rely on fine-grained, phoneme-level durations, which limits their practical application. These methods overlook the complementary role ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation

CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR

PerformSinger: Multimodal Singing Voice Synthesis Leveraging Synchronized Lip Cues from Singing Performance Videos

Trending now