SOTAVerified

Navigate

Papers

Showing 101150 of 1982 papers

TitleStatusHype
GridMM: Grid Memory Map for Vision-and-Language NavigationCode1
AIRCHITECT v2: Learning the Hardware Accelerator Design Space through Unified RepresentationsCode1
Go for a Walk and Arrive at the Answer: Reasoning Over Paths in Knowledge Bases using Reinforcement LearningCode1
DD-PPO: Learning Near-Perfect PointGoal Navigators from 2.5 Billion FramesCode1
Continual Multimodal Knowledge Graph ConstructionCode1
Cross from Left to Right Brain: Adaptive Text Dreamer for Vision-and-Language NavigationCode1
Hierarchical Generative Adversarial Imitation Learning with Mid-level Input Generation for Autonomous Driving on Urban EnvironmentsCode1
Human POSEitioning System (HPS): 3D Human Pose Estimation and Self-localization in Large Scenes from Body-Mounted SensorsCode1
AI-IMU Dead-ReckoningCode1
AidUI: Toward Automated Recognition of Dark Patterns in User InterfacesCode1
CoNav: A Benchmark for Human-Centered Collaborative NavigationCode1
Controllable Preference Optimization: Toward Controllable Multi-Objective AlignmentCode1
From Shadows to Safety: Occlusion Tracking and Risk Mitigation for Urban Autonomous DrivingCode1
Airbert: In-domain Pretraining for Vision-and-Language NavigationCode1
FuseDream: Training-Free Text-to-Image Generation with Improved CLIP+GAN Space OptimizationCode1
Future-Oriented Navigation: Dynamic Obstacle Avoidance with One-Shot Energy-Based Multimodal Motion PredictionCode1
Counterfactual Cycle-Consistent Learning for Instruction Following and Generation in Vision-Language NavigationCode1
Graphhopper: Multi-Hop Scene Graph Reasoning for Visual Question AnsweringCode1
AI2-THOR: An Interactive 3D Environment for Visual AICode1
Fleet of Agents: Coordinated Problem Solving with Large Language ModelsCode1
CUAHN-VIO: Content-and-Uncertainty-Aware Homography Network for Visual-Inertial OdometryCode1
History Aware Multimodal Transformer for Vision-and-Language NavigationCode1
FootstepNet: an Efficient Actor-Critic Method for Fast On-line Bipedal Footstep Planning and ForecastingCode1
DAG-Net: Double Attentive Graph Neural Network for Trajectory ForecastingCode1
From Commands to Prompts: LLM-based Semantic File System for AIOSCode1
DataLens: Scalable Privacy Preserving Training via Gradient Compression and AggregationCode1
Digital Twin-Enhanced Wireless Indoor Navigation: Achieving Efficient Environment Sensing with Zero-Shot Reinforcement LearningCode1
Exploring Gradient-based Multi-directional Controls in GANsCode1
Exploring Empty Spaces: Human-in-the-Loop Data AugmentationCode1
Extracting a Knowledge Base of Mechanisms from COVID-19 PapersCode1
Navigating Beyond Instructions: Vision-and-Language Navigation in Obstructed EnvironmentsCode1
Class-Level Code Generation from Natural Language Using Iterative, Tool-Enhanced Reasoning over RepositoryCode1
FIERY: Future Instance Prediction in Bird's-Eye View from Surround Monocular CamerasCode1
CityNavAgent: Aerial Vision-and-Language Navigation with Hierarchical Semantic Planning and Global MemoryCode1
Context-Aware Entity Grounding with Open-Vocabulary 3D Scene GraphsCode1
AgentSense: Benchmarking Social Intelligence of Language Agents through Interactive ScenariosCode1
3D-STMN: Dependency-Driven Superpoint-Text Matching Network for End-to-End 3D Referring Expression SegmentationCode1
Collaborative Visual NavigationCode1
Decentralized Motion Planning for Multi-Robot Navigation using Deep Reinforcement LearningCode1
IGDrivSim: A Benchmark for the Imitation Gap in Autonomous DrivingCode1
Can Large Language Models be Good Path Planners? A Benchmark and Investigation on Spatial-temporal ReasoningCode1
A General Model for Aggregating Annotations Across Simple, Complex, and Multi-Object Annotation TasksCode1
ONCE: Boosting Content-based Recommendation with Both Open- and Closed-source Large Language ModelsCode1
AEye: A Visualization Tool for Image DatasetsCode1
Evaluating Language Models for Mathematics through InteractionsCode1
Can GPT-4 Perform Neural Architecture Search?Code1
Evaluating Long-Term Memory in 3D MazesCode1
Expander Graph PropagationCode1
Aerial Vision-and-Dialog NavigationCode1
BioImage.IO Chatbot: A Community-Driven AI Assistant for Integrative Computational BioimagingCode1
Show:102550
← PrevPage 3 of 40Next →

No leaderboard results yet.