ELITE

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

akhaliq submitted a paper 2 days ago

Visual Personalization Turing Test

akhaliq submitted a paper 2 days ago

Causal World Modeling for Robot Control

akhaliq submitted a paper 13 days ago

Motion 3-to-4: 3D Motion Reconstruction for 4D Synthesis

akhaliq

submitted 2 papers to Daily Papers 2 days ago

Visual Personalization Turing Test

Paper • 2601.22680 • Published 6 days ago • 2

Causal World Modeling for Robot Control

Paper • 2601.21998 • Published 6 days ago • 27

akhaliq

submitted a paper to Daily Papers 13 days ago

Motion 3-to-4: 3D Motion Reconstruction for 4D Synthesis

Paper • 2601.14253 • Published 15 days ago • 10

akhaliq

submitted a paper to Daily Papers 19 days ago

V-DPM: 4D Video Reconstruction with Dynamic Point Maps

Paper • 2601.09499 • Published 21 days ago • 9

akhaliq

submitted a paper to Daily Papers 21 days ago

UM-Text: A Unified Multimodal Model for Image Understanding

Paper • 2601.08321 • Published 23 days ago • 9

akhaliq

submitted a paper to Daily Papers 27 days ago

ResTok: Learning Hierarchical Residuals in 1D Visual Tokenizers for Autoregressive Image Generation

Paper • 2601.03955 • Published 28 days ago • 3

akhaliq

submitted 2 papers to Daily Papers about 1 month ago

FlowBlending: Stage-Aware Multi-Model Sampling for Fast and High-Fidelity Video Generation

Paper • 2512.24724 • Published Dec 31, 2025 • 7

Dream2Flow: Bridging Video Generation and Open-World Manipulation with 3D Object Flow

Paper • 2512.24766 • Published Dec 31, 2025 • 9

akhaliq

submitted 3 papers to Daily Papers about 2 months ago

What matters for Representation Alignment: Global Information or Spatial Structure?

Paper • 2512.10794 • Published Dec 11, 2025 • 9

Towards a Science of Scaling Agent Systems

Paper • 2512.08296 • Published Dec 9, 2025 • 16

ThreadWeaver: Adaptive Threading for Efficient Parallel Reasoning in Language Models

Paper • 2512.07843 • Published Nov 24, 2025 • 22

multimodalart

posted an update 4 months ago

Post

16335

Want to iterate on a Hugging Face Space with an LLM?

Now you can easily convert an entire HF repo (Model, Dataset, or Space) to a text file and feed it to a language model!

multimodalart/repo2txt

akhaliq

authored a paper 4 months ago

BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution

Paper • 2510.08697 • Published Oct 9, 2025 • 39

multimodalart

posted an update 8 months ago

Post

18122

Self-Forcing - a real-time video distilled model from Wan 2.1 by @adobe is out, and they open sourced it 🐐

I've built a live real-time demo on Spaces 📹💨

multimodalart/self-forcing

6 replies

csyxwei

authored 4 papers about 1 year ago

SmartControl: Enhancing ControlNet for Handling Rough Visual Conditions

Paper • 2404.06451 • Published Apr 9, 2024 • 2

MasterWeaver: Taming Editability and Identity for Personalized Text-to-Image Generation

Paper • 2405.05806 • Published May 9, 2024

ScaleDreamer: Scalable Text-to-3D Synthesis with Asynchronous Score Distillation

Paper • 2407.02040 • Published Jul 2, 2024 • 1

ACE: Anti-Editing Concept Erasure in Text-to-Image Models

Paper • 2501.01633 • Published Jan 3, 2025

akhaliq

posted an update about 1 year ago

Post

50612

Google drops Gemini 2.0 Flash Thinking

a new experimental model that unlocks stronger reasoning capabilities and shows its thoughts. The model plans (with thoughts visible), can solve complex problems with Flash speeds, and more

now available in anychat, try it out: https://huggingface.co/spaces/akhaliq/anychat

5 replies

akhaliq

posted an update about 1 year ago

Post

49656

QwQ-32B-Preview is now available in anychat

A reasoning model that is competitive with OpenAI o1-mini and o1-preview

try it out: https://huggingface.co/spaces/akhaliq/anychat

2 replies

Team members 4

ELITE-library's activity