SOTAVerified

Navigate

Papers

Showing 51100 of 1982 papers

TitleStatusHype
LLM-Grounder: Open-Vocabulary 3D Visual Grounding with Large Language Model as an AgentCode2
Controllable and Reliable Knowledge-Intensive Task-Oriented Conversational Agents with Declarative Genie WorksheetsCode2
VidChapters-7M: Video Chapters at ScaleCode2
Vision-aided UAV navigation and dynamic obstacle avoidance using gradient-based B-spline trajectory optimizationCode2
DeFoG: Discrete Flow Matching for Graph GenerationCode2
LMVD: A Large-Scale Multimodal Vlog Dataset for Depression Detection in the WildCode2
MuGER^2: Multi-Granularity Evidence Retrieval and Reasoning for Hybrid Question AnsweringCode2
WebShop: Towards Scalable Real-World Web Interaction with Grounded Language AgentsCode2
Real-time Spatial-temporal Traversability Assessment via Feature-based Sparse Gaussian ProcessCode2
Imagine Before Go: Self-Supervised Generative Map for Object Goal NavigationCode2
Dynam3D: Dynamic Layered 3D Tokens Empower VLM for Vision-and-Language NavigationCode2
Diffusion Models for Molecules: A Survey of Methods and TasksCode2
GUI Odyssey: A Comprehensive Dataset for Cross-App GUI Navigation on Mobile DevicesCode2
Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and MemoryCode2
From Cognition to Precognition: A Future-Aware Framework for Social NavigationCode2
GOAT-Bench: A Benchmark for Multi-Modal Lifelong NavigationCode2
Joint Perception and Prediction for Autonomous Driving: A SurveyCode2
Event-based Stereo Depth Estimation: A SurveyCode2
ETPNav: Evolving Topological Planning for Vision-Language Navigation in Continuous EnvironmentsCode2
FLAME: Learning to Navigate with Multimodal LLM in Urban EnvironmentsCode2
AlphaMaze: Enhancing Large Language Models' Spatial Intelligence via GRPOCode2
DualMap: Online Open-Vocabulary Semantic Mapping for Natural Language Navigation in Dynamic Changing ScenesCode2
Generative Artificial Intelligence for Navigating Synthesizable Chemical SpaceCode2
Navigate through Enigmatic Labyrinth A Survey of Chain of Thought Reasoning: Advances, Frontiers and FutureCode2
AdaVLN: Towards Visual Language Navigation in Continuous Indoor Environments with Moving HumansCode2
A vision-based autonomous UAV inspection framework for unknown tunnel construction sites with dynamic obstaclesCode2
Holodeck: Language Guided Generation of 3D Embodied AI EnvironmentsCode2
Human-Aware Vision-and-Language Navigation: Bridging Simulation to Reality with Dynamic Human InteractionsCode2
Can Vehicle Motion Planning Generalize to Realistic Long-tail Scenarios?Code2
Learning Efficient and Effective Trajectories for Differential Equation-based Image RestorationCode2
BEVDriver: Leveraging BEV Maps in LLMs for Robust Closed-Loop DrivingCode2
Advancing Transformer Architecture in Long-Context Large Language Models: A Comprehensive SurveyCode2
AI Research Agents for Machine Learning: Search, Exploration, and Generalization in MLE-benchCode2
Lookahead Exploration with Neural Radiance Representation for Continuous Vision-Language NavigationCode2
Can Graph Learning Improve Planning in LLM-based Agents?Code2
Melting Pot 2.0Code2
DriVLMe: Enhancing LLM-based Autonomous Driving Agents with Embodied and Social ExperiencesCode2
Enhance Then Search: An Augmentation-Search Strategy with Foundation Models for Cross-Domain Few-Shot Object DetectionCode2
AerialVLN: Vision-and-Language Navigation for UAVsCode2
ForesightNav: Learning Scene Imagination for Efficient ExplorationCode2
Open-RAG: Enhanced Retrieval-Augmented Reasoning with Open-Source Large Language ModelsCode2
Large Language Models(LLMs) on Tabular Data: Prediction, Generation, and Understanding -- A SurveyCode2
Receding Moving Object Segmentation in 3D LiDAR Data Using Sparse 4D ConvolutionsCode2
Adaptive Risk-Tendency: Nano Drone Navigation in Cluttered Environments with Distributional Reinforcement LearningCode1
DialFRED: Dialogue-Enabled Agents for Embodied Instruction FollowingCode1
DFR-FastMOT: Detection Failure Resistant Tracker for Fast Multi-Object Tracking Based on Sensor FusionCode1
Differentiable Agent-based EpidemiologyCode1
DISCO: Embodied Navigation and Interaction via Differentiable Scene Semantics and Dual-level ControlCode1
Demystifying Map Space Exploration for NPUsCode1
Demo Abstract: Real-Time Out-of-Distribution Detection on a Mobile RobotCode1
Show:102550
← PrevPage 2 of 40Next →

No leaderboard results yet.