SOTAVerified

Navigate

Papers

Showing 101150 of 1982 papers

TitleStatusHype
GridMM: Grid Memory Map for Vision-and-Language NavigationCode1
AIRCHITECT v2: Learning the Hardware Accelerator Design Space through Unified RepresentationsCode1
Counterfactual Cycle-Consistent Learning for Instruction Following and Generation in Vision-Language NavigationCode1
Controllable Preference Optimization: Toward Controllable Multi-Objective AlignmentCode1
Context-Aware Entity Grounding with Open-Vocabulary 3D Scene GraphsCode1
DeblurGAN-v2: Deblurring (Orders-of-Magnitude) Faster and BetterCode1
Hierarchical Generative Adversarial Imitation Learning with Mid-level Input Generation for Autonomous Driving on Urban EnvironmentsCode1
Human POSEitioning System (HPS): 3D Human Pose Estimation and Self-localization in Large Scenes from Body-Mounted SensorsCode1
AI-IMU Dead-ReckoningCode1
AidUI: Toward Automated Recognition of Dark Patterns in User InterfacesCode1
From Shadows to Safety: Occlusion Tracking and Risk Mitigation for Urban Autonomous DrivingCode1
Continual Multimodal Knowledge Graph ConstructionCode1
CoNav: A Benchmark for Human-Centered Collaborative NavigationCode1
Airbert: In-domain Pretraining for Vision-and-Language NavigationCode1
FuseDream: Training-Free Text-to-Image Generation with Improved CLIP+GAN Space OptimizationCode1
Future-Oriented Navigation: Dynamic Obstacle Avoidance with One-Shot Energy-Based Multimodal Motion PredictionCode1
Go for a Walk and Arrive at the Answer: Reasoning Over Paths in Knowledge Bases using Reinforcement LearningCode1
Graphhopper: Multi-Hop Scene Graph Reasoning for Visual Question AnsweringCode1
AI2-THOR: An Interactive 3D Environment for Visual AICode1
Collaborative Visual NavigationCode1
Crowd-Robot Interaction: Crowd-aware Robot Navigation with Attention-based Deep Reinforcement LearningCode1
CUAHN-VIO: Content-and-Uncertainty-Aware Homography Network for Visual-Inertial OdometryCode1
FootstepNet: an Efficient Actor-Critic Method for Fast On-line Bipedal Footstep Planning and ForecastingCode1
CuriousLLM: Elevating Multi-Document QA with Reasoning-Infused Knowledge Graph PromptingCode1
Adaptive Risk-Tendency: Nano Drone Navigation in Cluttered Environments with Distributional Reinforcement LearningCode1
From Commands to Prompts: LLM-based Semantic File System for AIOSCode1
Digital Twin-Enhanced Wireless Indoor Navigation: Achieving Efficient Environment Sensing with Zero-Shot Reinforcement LearningCode1
Extracting a Knowledge Base of Mechanisms from COVID-19 PapersCode1
Class-Level Code Generation from Natural Language Using Iterative, Tool-Enhanced Reasoning over RepositoryCode1
FIERY: Future Instance Prediction in Bird's-Eye View from Surround Monocular CamerasCode1
Navigating Beyond Instructions: Vision-and-Language Navigation in Obstructed EnvironmentsCode1
Exploring Gradient-based Multi-directional Controls in GANsCode1
CityNavAgent: Aerial Vision-and-Language Navigation with Hierarchical Semantic Planning and Global MemoryCode1
DD-PPO: Learning Near-Perfect PointGoal Navigators from 2.5 Billion FramesCode1
AgentSense: Benchmarking Social Intelligence of Language Agents through Interactive ScenariosCode1
3D-STMN: Dependency-Driven Superpoint-Text Matching Network for End-to-End 3D Referring Expression SegmentationCode1
CFGPT: Chinese Financial Assistant with Large Language ModelCode1
Exploring Empty Spaces: Human-in-the-Loop Data AugmentationCode1
Fleet of Agents: Coordinated Problem Solving with Large Language ModelsCode1
IGDrivSim: A Benchmark for the Imitation Gap in Autonomous DrivingCode1
Can GPT-4 Perform Neural Architecture Search?Code1
A General Model for Aggregating Annotations Across Simple, Complex, and Multi-Object Annotation TasksCode1
ONCE: Boosting Content-based Recommendation with Both Open- and Closed-source Large Language ModelsCode1
Can Large Language Models be Good Path Planners? A Benchmark and Investigation on Spatial-temporal ReasoningCode1
AEye: A Visualization Tool for Image DatasetsCode1
Evaluating Language Models for Mathematics through InteractionsCode1
EnvEdit: Environment Editing for Vision-and-Language NavigationCode1
Evaluating Long-Term Memory in 3D MazesCode1
Expander Graph PropagationCode1
Aerial Vision-and-Dialog NavigationCode1
Show:102550
← PrevPage 3 of 40Next →

No leaderboard results yet.