SOTAVerified

Navigate

Papers

Showing 76100 of 1982 papers

TitleStatusHype
GUI Odyssey: A Comprehensive Dataset for Cross-App GUI Navigation on Mobile DevicesCode2
Event-based Stereo Depth Estimation: A SurveyCode2
Imagine Before Go: Self-Supervised Generative Map for Object Goal NavigationCode2
Controllable and Reliable Knowledge-Intensive Task-Oriented Conversational Agents with Declarative Genie WorksheetsCode2
LLM-Grounder: Open-Vocabulary 3D Visual Grounding with Large Language Model as an AgentCode2
ETPNav: Evolving Topological Planning for Vision-Language Navigation in Continuous EnvironmentsCode2
DriVLMe: Enhancing LLM-based Autonomous Driving Agents with Embodied and Social ExperiencesCode2
DualMap: Online Open-Vocabulary Semantic Mapping for Natural Language Navigation in Dynamic Changing ScenesCode2
MAGE: A Multi-Agent Engine for Automated RTL Code GenerationCode2
AdaVLN: Towards Visual Language Navigation in Continuous Indoor Environments with Moving HumansCode2
ML-Bench: Evaluating Large Language Models and Agents for Machine Learning Tasks on Repository-Level CodeCode2
Enhance Then Search: An Augmentation-Search Strategy with Foundation Models for Cross-Domain Few-Shot Object DetectionCode2
From Cognition to Precognition: A Future-Aware Framework for Social NavigationCode2
AerialVLN: Vision-and-Language Navigation for UAVsCode2
OpenFMNav: Towards Open-Set Zero-Shot Object Navigation via Vision-Language Foundation ModelsCode2
Learning Efficient and Effective Trajectories for Differential Equation-based Image RestorationCode2
Panoptic nuScenes: A Large-Scale Benchmark for LiDAR Panoptic Segmentation and TrackingCode2
PLAYER*: Enhancing LLM-based Multi-Agent Communication and Interaction in Murder Mystery GamesCode2
Adaptive Risk-Tendency: Nano Drone Navigation in Cluttered Environments with Distributional Reinforcement LearningCode1
Can Large Language Models be Good Path Planners? A Benchmark and Investigation on Spatial-temporal ReasoningCode1
Can GPT-4 Perform Neural Architecture Search?Code1
DISCO: Embodied Navigation and Interaction via Differentiable Scene Semantics and Dual-level ControlCode1
BioImage.IO Chatbot: A Community-Driven AI Assistant for Integrative Computational BioimagingCode1
DialFRED: Dialogue-Enabled Agents for Embodied Instruction FollowingCode1
BEV-CV: Birds-Eye-View Transform for Cross-View Geo-LocalisationCode1
Show:102550
← PrevPage 4 of 80Next →

No leaderboard results yet.