4 18 4

Shaoguang Mao

dawnmsg

AI & ML interests

None yet

Recent Activity

upvoted a paper about 20 hours ago

WorldVQA: Measuring Atomic World Knowledge in Multimodal Large Language Models

liked a model about 20 hours ago

moonshotai/Kimi-K2.5

new activity about 23 hours ago

moonshotai/Kimi-K2.5:Context Management Reproducibility | 可复现性 ?

View all activity

Organizations

upvoted a paper about 20 hours ago

WorldVQA: Measuring Atomic World Knowledge in Multimodal Large Language Models

Paper • 2602.02537 • Published 8 days ago • 5

liked a model about 20 hours ago

moonshotai/Kimi-K2.5

Image-Text-to-Text • 171B • Updated 2 days ago • 152k • • 1.67k

New activity in moonshotai/Kimi-K2.5 about 23 hours ago

Context Management Reproducibility | 可复现性 ?

👀 👍 6

#13 opened 8 days ago by

pandemo

New activity in moonshotai/Kimi-K2.5 1 day ago

Can the BrowseComp results be reproduced?

👍 2

#17 opened 8 days ago by

Aldrich-x

upvoted a paper 2 days ago

Kimi K2.5: Visual Agentic Intelligence

Paper • 2602.02276 • Published 2 days ago • 183

updated a model 11 days ago

moonshotai/Kimi-K2.5

Image-Text-to-Text • 171B • Updated 2 days ago • 152k • • 1.67k

New activity in moonshotai/Kimi-K2-Thinking 3 months ago

K2 Thinking Browsecomp/HLE Reproducibility | 结果复现

➕ 2

#5 opened 3 months ago by

pandemo

liked a model 3 months ago

moonshotai/Kimi-K2-Thinking

Text Generation • 170B • Updated 6 days ago • 366k • • 1.65k

updated a model 3 months ago

moonshotai/Kimi-K2-Thinking

Text Generation • 170B • Updated 6 days ago • 366k • • 1.65k

New activity in moonshotai/Kimi-K2-Thinking 3 months ago

Did you set GPT-5's reasoning effort to high?

#6 opened 3 months ago by

madmax0404

liked a model 10 months ago

moonshotai/Kimi-VL-A3B-Thinking

Image-Text-to-Text • 16B • Updated 6 days ago • 76.3k • 445

upvoted a paper 11 months ago

FEA-Bench: A Benchmark for Evaluating Repository-Level Code Generation for Feature Implementation

Paper • 2503.06680 • Published Mar 9, 2025 • 20

authored a paper 11 months ago

FEA-Bench: A Benchmark for Evaluating Repository-Level Code Generation for Feature Implementation

Paper • 2503.06680 • Published Mar 9, 2025 • 20

liked a dataset about 1 year ago

microsoft/MMLU-CF

Viewer • Updated Jan 8, 2025 • 20.1k • 1.06k • 17

upvoted 2 papers over 1 year ago

Scaling Synthetic Data Creation with 1,000,000,000 Personas

Paper • 2406.20094 • Published Jun 28, 2024 • 104

The Prompt Report: A Systematic Survey of Prompting Techniques

Paper • 2406.06608 • Published Jun 6, 2024 • 68

updated a collection almost 2 years ago

daily paper selected

Collection

4 items • Updated Apr 28, 2024

upvoted 3 papers almost 2 years ago

FlowMind: Automatic Workflow Generation with LLMs

Paper • 2404.13050 • Published Mar 17, 2024 • 34

Multi-Head Mixture-of-Experts

Paper • 2404.15045 • Published Apr 23, 2024 • 60

List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs

Paper • 2404.16375 • Published Apr 25, 2024 • 18

Shaoguang Mao

AI & ML interests

Recent Activity

Organizations

dawnmsg's activity

Context Management Reproducibility | 可复现性 ?

Can the BrowseComp results be reproduced?

K2 Thinking Browsecomp/HLE Reproducibility | 结果复现

Did you set GPT-5's reasoning effort to high?