DLFR-VAE: Dynamic Latent Frame Rate VAE for Video Generation Paper • 2502.11897 • Published Feb 17, 2025
SALAD: Achieve High-Sparsity Attention via Efficient Linear Attention Tuning for Video Diffusion Transformer Paper • 2601.16515 • Published 13 days ago • 15
SALAD: Achieve High-Sparsity Attention via Efficient Linear Attention Tuning for Video Diffusion Transformer Paper • 2601.16515 • Published 13 days ago • 15
DiffusionBrowser: Interactive Diffusion Previews via Multi-Branch Decoders Paper • 2512.13690 • Published Dec 15, 2025 • 3
Efficiently Reconstructing Dynamic Scenes One D4RT at a Time Paper • 2512.08924 • Published Dec 9, 2025 • 16
Unified Speech-Text Pre-training for Speech Translation and Recognition Paper • 2204.05409 • Published Apr 11, 2022
data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language Paper • 2202.03555 • Published Feb 7, 2022
StyleNeRF: A Style-based 3D-Aware Generator for High-resolution Image Synthesis Paper • 2110.08985 • Published Oct 18, 2021
Cross-lingual Retrieval for Iterative Self-Supervised Training Paper • 2006.09526 • Published Jun 16, 2020
Multilingual Denoising Pre-training for Neural Machine Translation Paper • 2001.08210 • Published Jan 22, 2020
Multilingual Translation with Extensible Multilingual Pretraining and Finetuning Paper • 2008.00401 • Published Aug 2, 2020