Popular repositories Loading
-
GUI-Agents-Paper-List
GUI-Agents-Paper-List PublicBuilding a comprehensive and handy list of papers for GUI agents
-
TravelPlanner
TravelPlanner Public[ICML'24 Spotlight] "TravelPlanner: A Benchmark for Real-World Planning with Language Agents"
-
MagicBrush
MagicBrush Public[NeurIPS'23] "MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing".
Repositories
Showing 10 of 60 repositories
- Mind2Web Public
[NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web" -- the first LLM-based web agent and benchmark for generalist web agents
OSU-NLP-Group/Mind2Web’s past year of commit activity - Mind2Web-2 Public
[NeurIPS'25 D&B] Mind2Web-2 Benchmark: Evaluating Agentic Search with Agent-as-a-Judge
OSU-NLP-Group/Mind2Web-2’s past year of commit activity - AgentSafety Public
OSU-NLP-Group/AgentSafety’s past year of commit activity - TravelPlanner Public
[ICML'24 Spotlight] "TravelPlanner: A Benchmark for Real-World Planning with Language Agents"
OSU-NLP-Group/TravelPlanner’s past year of commit activity
Most used topics
Loading…