Deep Data Research Benchmark
weiliu
thinkwee
AI & ML interests
LLM reasoning, agents
Recent Activity
authored
a paper
about 9 hours ago
Scaling Environments for LLM Agents in the Era of Learning from Interaction: A Survey
authored
a paper
about 9 hours ago
Hunt Instead of Wait: Evaluating Deep Data Research on Large Language Models
new activity
about 13 hours ago
thinkwee/DDRBench_10K_trajectory:Add paper link, project page, and code links to dataset card
Organizations
None yet