SOTAVerified

Papers

Showing 1-50 of 1982 papers

Title | Status | Hype
Optimizing Instructions and Demonstrations for Multi-Stage Language Model Programs | Code | 14
Data Formulator 2: Iterative Creation of Data Visualizations, with AI Transforming Data Along the Way | Code | 11
SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering | Code | 11
UFO: A UI-Focused Agent for Windows OS Interaction | Code | 9
Mirage: A Multi-Level Superoptimizer for Tensor Programs | Code | 7
Patch n' Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution | Code | 6
Training Compute-Optimal Large Language Models | Code | 6
WebThinker: Empowering Large Reasoning Models with Deep Research Capability | Code | 5
IntellAgent: A Multi-Agent Framework for Evaluating Conversational AI Systems | Code | 5
ChatDBG: Augmenting Debugging with Large Language Models | Code | 5
AppAgent: Multimodal Agents as Smartphone Users | Code | 5
VLN-R1: Vision-Language Navigation via Reinforcement Fine-Tuning | Code | 4
DeepResearcher: Scaling Deep Research via Reinforcement Learning in Real-world Environments | Code | 4
LocAgent: Graph-Guided LLM Agents for Code Localization | Code | 4
GPUDrive: Data-driven, multi-agent driving simulation at 1 million FPS | Code | 4
RL4CO: an Extensive Reinforcement Learning for Combinatorial Optimization Benchmark | Code | 4
EvoX: A Distributed GPU-accelerated Framework for Scalable Evolutionary Computation | Code | 4
Diffusion Models for Medical Image Analysis: A Comprehensive Survey | Code | 4
From Automation to Autonomy: A Survey on Large Language Models in Scientific Discovery | Code | 3
Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction | Code | 3
Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents | Code | 3
A Practical Review of Mechanistic Interpretability for Transformer-Based Language Models | Code | 3
Computation-Efficient Era: A Comprehensive Survey of State Space Models in Medical Image Analysis | Code | 3
CarDreamer: Open-Source Learning Platform for World Model based Autonomous Driving | Code | 3
AI Research Agents for Machine Learning: Search, Exploration, and Generalization in MLE-bench | Code | 2
DualMap: Online Open-Vocabulary Semantic Mapping for Natural Language Navigation in Dynamic Changing Scenes | Code | 2
Dynam3D: Dynamic Layered 3D Tokens Empower VLM for Vision-and-Language Navigation | Code | 2
ForesightNav: Learning Scene Imagination for Efficient Exploration | Code | 2
Enhance Then Search: An Augmentation-Search Strategy with Foundation Models for Cross-Domain Few-Shot Object Detection | Code | 2
MTGS: Multi-Traversal Gaussian Splatting | Code | 2
Real-time Spatial-temporal Traversability Assessment via Feature-based Sparse Gaussian Process | Code | 2
BEVDriver: Leveraging BEV Maps in LLMs for Robust Closed-Loop Driving | Code | 2
AlphaMaze: Enhancing Large Language Models' Spatial Intelligence via GRPO | Code | 2
NavRAG: Generating User Demand Instructions for Embodied Navigation through Retrieval-Augmented LLM | Code | 2
Diffusion Models for Molecules: A Survey of Methods and Tasks | Code | 2
Joint Perception and Prediction for Autonomous Driving: A Survey | Code | 2
MAGE: A Multi-Agent Engine for Automated RTL Code Generation | Code | 2
AdaVLN: Towards Visual Language Navigation in Continuous Indoor Environments with Moving Humans | Code | 2
Real-Time Polygonal Semantic Mapping for Humanoid Robot Stair Climbing | Code | 2
Learning Efficient and Effective Trajectories for Differential Equation-based Image Restoration | Code | 2
DeFoG: Discrete Flow Matching for Graph Generation | Code | 2
Generative Artificial Intelligence for Navigating Synthesizable Chemical Space | Code | 2
Open-RAG: Enhanced Retrieval-Augmented Reasoning with Open-Source Large Language Models | Code | 2
Revisit Anything: Visual Place Recognition via Image Segment Retrieval | Code | 2
Event-based Stereo Depth Estimation: A Survey | Code | 2
From Cognition to Precognition: A Future-Aware Framework for Social Navigation | Code | 2
FLAME: Learning to Navigate with Multimodal LLM in Urban Environments | Code | 2
Controllable and Reliable Knowledge-Intensive Task-Oriented Conversational Agents with Declarative Genie Worksheets | Code | 2
Text2Robot: Evolutionary Robot Design from Text Descriptions | Code | 2
Human-Aware Vision-and-Language Navigation: Bridging Simulation to Reality with Dynamic Human Interactions | Code | 2