feat(3120): Auto Model by Elvis339 · Pull Request #3311 · huggingface/candle

Elvis339 · 2026-01-18T12:40:32Z

I took inspiration from Python HuggingFace Transformers. Since candle is also a HuggingFace project, I use the same naming (AutoModelForCausalLM, from_pretrained) so users familiar with the Python API feel at home.

The shared base (AutoConfig, Weights) is designed so subsequent PRs can add AutoModelForSequenceClassification, AutoModelForMaskedLM, etc. if maintainers agree with this direction.

Currently supports
llama, mistral, phi3, qwen2, gemma

Suggested review path
Commit 1 - auto/ module f35d137
Commit 2 - trait impls in model files 6336ff6
Commit 3 - example a9c086f

Add auto module with composable base for loading models from HuggingFace Hub. - AutoConfig: loads config.json, parses model_type - Weights: loads safetensors (single or sharded) - Model + CausalLM traits - AutoModelForCausalLM: factory returning Box<dyn CausalLM> Supports: llama, mistral, phi3, qwen2, gemma

Add Model and CausalLM trait implementations to: - llama (LlamaForCausalLM wrapper) - mistral - phi3 - qwen2 - gemma

…39/candle into feature/auto-model-causal-lm-3120

Elvis339 · 2026-01-26T08:08:41Z

Looping in maintainer for review, thank you!

cc @ivarflakstad

ivarflakstad · 2026-01-26T09:18:08Z

Don't worry I was already looped in - just haven't had the time yet :)

Elvis339 added 3 commits January 18, 2026 13:39

feat: implement CausalLM trait for supported models

6336ff6

Add Model and CausalLM trait implementations to: - llama (LlamaForCausalLM wrapper) - mistral - phi3 - qwen2 - gemma

feat: add auto example for text generation

a9c086f

Elvis339 marked this pull request as ready for review January 18, 2026 12:49

Elvis339 changed the title ~~Feature/auto model causal lm 3120~~ feat(3120): Auto Model Jan 18, 2026

Elvis339 mentioned this pull request Jan 18, 2026

AutoModel / PreTrainedModel equivalent magic ? #3120

Open

Elvis339 and others added 3 commits January 18, 2026 16:53

Merge branch 'main' into feature/auto-model-causal-lm-3120

bb7d6be

chore: remove unused tests and verbose architecture docs

a0b9bcd

Merge branch 'feature/auto-model-causal-lm-3120' of github.com:Elvis3…

246eff4

…39/candle into feature/auto-model-causal-lm-3120

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(3120): Auto Model#3311

feat(3120): Auto Model#3311
Elvis339 wants to merge 6 commits intohuggingface:mainfrom
Elvis339:feature/auto-model-causal-lm-3120

Elvis339 commented Jan 18, 2026 •

edited

Loading

Uh oh!

Elvis339 commented Jan 26, 2026

Uh oh!

ivarflakstad commented Jan 26, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Elvis339 commented Jan 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Elvis339 commented Jan 26, 2026

Uh oh!

ivarflakstad commented Jan 26, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Elvis339 commented Jan 18, 2026 •

edited

Loading