-
A Picture is Worth More Than 77 Text Tokens: Evaluating CLIP-Style Models on Dense Captions
Paper โข 2312.08578 โข Published โข 20 -
ZeroQuant(4+2): Redefining LLMs Quantization with a New FP6-Centric Strategy for Diverse Generative Tasks
Paper โข 2312.08583 โข Published โข 11 -
Vision-Language Models as a Source of Rewards
Paper โข 2312.09187 โข Published โข 12 -
StemGen: A music generation model that listens
Paper โข 2312.08723 โข Published โข 49
Chuanming Liu
Chuanming
AI & ML interests
Artificial Intelligence, AGI, NLP, LLMs, Multimodality, MLSys. Python/Golang/C/C++/Shell/awk&sed
Recent Activity
liked
a model
about 1 hour ago
FunAudioLLM/Fun-CosyVoice3-0.5B-2512
upvoted
an
article
about 10 hours ago
SigLIP 2: A better multilingual vision language encoder
liked
a model
about 10 hours ago
openbmb/MiniCPM-o-4_5