| RES-Q: Evaluating Code-Editing Large Language Model Systems at the Repository Scale | Jun 24, 2024 | Code Generation, HumanEval | Code Available | 1 |
| INDICT: Code Generation with Internal Dialogues of Critiques for Both Security and Helpfulness | Jun 23, 2024 | Code Generation, Navigate | Code Available | 1 |
| Meta-FL: A Novel Meta-Learning Framework for Optimizing Heterogeneous Model Aggregation in Federated Learning | Jun 23, 2024 | Diversity, Federated Learning | Unverified | 0 |
| V-RECS, a Low-Cost LLM4VIS Recommender with Explanations, Captioning and Suggestions | Jun 21, 2024 | Natural Language Queries, Navigate | Code Available | 0 |
| FeedForward at SemEval-2024 Task 10: Trigger and sentext-height enriched emotion analysis in multi-party conversations | Jun 20, 2024 | Emotion Recognition, Emotion Recognition in Conversation | Code Available | 0 |
| Two-Stage Depth Enhanced Learning with Obstacle Map For Object Navigation | Jun 20, 2024 | Navigate, Object | Unverified | 0 |
| EndoUIC: Promptable Diffusion Transformer for Unified Illumination Correction in Capsule Endoscopy | Jun 19, 2024 | Exposure Correction, Image Enhancement | Code Available | 1 |
| Optimizing Quantile-based Trading Strategies in Electricity Arbitrage | Jun 19, 2024 | Navigate | Unverified | 0 |
| Look Further Ahead: Testing the Limits of GPT-4 in Path Planning | Jun 17, 2024 | Navigate | Code Available | 1 |
| Optimizing Instructions and Demonstrations for Multi-Stage Language Model Programs | Jun 17, 2024 | Language Modeling, Language Modelling | Code Available | 14 |
| OoDIS: Anomaly Instance Segmentation Benchmark | Jun 17, 2024 | Anomaly Instance Segmentation, Anomaly Segmentation | Code Available | 1 |
| Enhancing Supermarket Robot Interaction: A Multi-Level LLM Conversational Interface for Handling Diverse Customer Intents | Jun 16, 2024 | Chatbot, Image Enhancement | Unverified | 0 |
| PRIMER: Perception-Aware Robust Learning-based Multiagent Trajectory Planner | Jun 14, 2024 | Imitation Learning, Navigate | Unverified | 0 |
| Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language Navigation | Jun 14, 2024 | Navigate, Vision and Language Navigation | Code Available | 2 |
| Impact of Speech Mode in Automatic Pathological Speech Detection | Jun 14, 2024 | Navigate | Unverified | 0 |
| SemanticSpray++: A Multimodal Dataset for Autonomous Driving in Wet Surface Conditions | Jun 14, 2024 | Autonomous Driving, Autonomous Vehicles | Code Available | 1 |
| WonderWorld: Interactive 3D Scene Generation from a Single Image | Jun 13, 2024 | Depth Estimation, GPU | Unverified | 0 |
| GUI Odyssey: A Comprehensive Dataset for Cross-App GUI Navigation on Mobile Devices | Jun 12, 2024 | Navigate | Code Available | 2 |
| Efficient Parallel Multi-Hop Reasoning: A Scalable Approach for Knowledge Graph Analysis | Jun 11, 2024 | Knowledge Base Completion, Knowledge Graphs | Unverified | 0 |
| pVACview: an interactive visualization tool for efficient neoantigen prioritization and selection | Jun 11, 2024 | Navigate | Unverified | 0 |
| Fairness-Aware Meta-Learning via Nash Bargaining | Jun 11, 2024 | Fairness, image-classification | Unverified | 0 |
| Higher-Order Spatial Information for Self-Supervised Place Cell Learning | Jun 10, 2024 | Navigate, Self-Supervised Learning | Unverified | 0 |
| SciRIFF: A Resource to Enhance Language Model Instruction-Following over Scientific Literature | Jun 10, 2024 | Claim Verification, Instruction Following | Code Available | 1 |
| Towards Real-World Efficiency: Domain Randomization in Reinforcement Learning for Pre-Capture of Free-Floating Moving Targets by Autonomous Robots | Jun 10, 2024 | Deep Reinforcement Learning, Navigate | Code Available | 0 |
| I-MPN: Inductive Message Passing Network for Efficient Human-in-the-Loop Annotation of Mobile Eye Tracking Data | Jun 10, 2024 | Navigate, Object | Unverified | 0 |
| MASSW: A New Dataset and Benchmark Tasks for AI-Assisted Scientific Workflows | Jun 10, 2024 | Navigate | Code Available | 1 |
| InstructNav: Zero-shot System for Generic Instruction Navigation in Unexplored Environment | Jun 7, 2024 | Navigate | Unverified | 0 |
| Computation-Efficient Era: A Comprehensive Survey of State Space Models in Medical Image Analysis | Jun 5, 2024 | Mamba, Medical Image Analysis | Code Available | 3 |
| DriVLMe: Enhancing LLM-based Autonomous Driving Agents with Embodied and Social Experiences | Jun 5, 2024 | Autonomous Driving, Autonomous Vehicles | Code Available | 2 |
| CoNav: A Benchmark for Human-Centered Collaborative Navigation | Jun 4, 2024 | Navigate | Code Available | 1 |
| XRec: Large Language Models for Explainable Recommendation | Jun 4, 2024 | Collaborative Filtering, Decision Making | Code Available | 2 |
| Why Only Text: Empowering Vision-and-Language Navigation with Multi-modal Prompts | Jun 4, 2024 | Navigate, Vision and Language Navigation | Code Available | 1 |
| ODE-based Learning to Optimize | Jun 4, 2024 | Navigate, Stochastic Optimization | Code Available | 0 |
| Demystifying AI Platform Design for Distributed Inference of Next-Generation LLM models | Jun 3, 2024 | Chunking, Mamba | Code Available | 2 |
| Formality Style Transfer in Persian | Jun 2, 2024 | Formality Style Transfer, Navigate | Unverified | 0 |
| Do's and Don'ts: Learning Desirable Skills with Instruction Videos | Jun 1, 2024 | continuous-control, Continuous Control | Unverified | 0 |
| Share Secrets for Privacy: Confidential Forecasting with Vertical Federated Learning | May 31, 2024 | Federated Learning, Navigate | Code Available | 0 |
| GAMedX: Generative AI-based Medical Entity Data Extractor Using Large Language Models | May 31, 2024 | named-entity-recognition, Named Entity Recognition | Unverified | 0 |
| Large Language Models Can Self-Improve At Web Agent Tasks | May 30, 2024 | Navigate | Code Available | 1 |
| Pretrained Mobility Transformer: A Foundation Model for Human Mobility | May 29, 2024 | Imputation, Navigate | Unverified | 0 |
| Can Graph Learning Improve Planning in LLM-based Agents? | May 29, 2024 | Decision Making, Graph Learning | Code Available | 2 |
| Convex neural network synthesis for robustness in the 1-norm | May 29, 2024 | Model Predictive Control, Navigate | Code Available | 0 |
| Enhancing Road Safety: Real-Time Detection of Driver Distraction through Convolutional Neural Networks | May 28, 2024 | Navigate | Unverified | 0 |
| Self-Guiding Exploration for Combinatorial Problems | May 28, 2024 | Management, Navigate | Code Available | 1 |
| Socially-Aware Shared Control Navigation for Assistive Mobile Robots in the Built Environment | May 27, 2024 | Autonomous Navigation, Model Predictive Control | Unverified | 0 |
| Map-based Modular Approach for Zero-shot Embodied Question Answering | May 26, 2024 | Embodied Question Answering, Navigate | Code Available | 1 |
| AI-Assisted Detector Design for the EIC (AID(2)E) | May 25, 2024 | Multiobjective Optimization, Navigate | Unverified | 0 |
| An Empirical Exploration of Trust Dynamics in LLM Supply Chains | May 25, 2024 | Navigate | Unverified | 0 |
| MindStar: Enhancing Math Reasoning in Pre-trained LLMs at Inference Time | May 25, 2024 | GSM8K, Math | Unverified | 0 |
| Devil's Advocate: Anticipatory Reflection for LLM Agents | May 25, 2024 | Navigate | Unverified | 0 |