| AdapTrack: Adaptive Thresholding-Based Matching For Multi-object Tracking | Sep 27, 2024 | Multi-Object TrackingObject Tracking | CodeCode Available | 1 |
| CESNET-TimeSeries24: Time Series Dataset for Network Traffic Anomaly Detection and Forecasting | Sep 27, 2024 | Anomaly DetectionTime Series | CodeCode Available | 1 |
| RepairBench: Leaderboard of Frontier Models for Program Repair | Sep 27, 2024 | Program Repair | CodeCode Available | 1 |
| FlashMix: Fast Map-Free LiDAR Localization via Feature Mixing and Contrastive-Constrained Accelerated Training | Sep 27, 2024 | Metric LearningPosition | CodeCode Available | 1 |
| Prompt-Driven Temporal Domain Adaptation for Nighttime UAV Tracking | Sep 27, 2024 | Domain Adaptation | CodeCode Available | 1 |
| CurricuLLM: Automatic Task Curricula Design for Learning Complex Robot Skills using Large Language Models | Sep 27, 2024 | Reinforcement Learning (RL)World Knowledge | CodeCode Available | 1 |
| AL-GTD: Deep Active Learning for Gaze Target Detection | Sep 27, 2024 | Active Learning | CodeCode Available | 1 |
| URIEL+: Enhancing Linguistic Inclusion and Usability in a Typological and Multilingual Knowledge Base | Sep 27, 2024 | | CodeCode Available | 1 |
| HR-Extreme: A High-Resolution Dataset for Extreme Weather Forecasting | Sep 27, 2024 | Deep LearningPrediction | CodeCode Available | 1 |
| From Seconds to Hours: Reviewing MultiModal Large Language Models on Comprehensive Long Video Understanding | Sep 27, 2024 | Video UnderstandingVisual Reasoning | CodeCode Available | 1 |
| Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models | Sep 27, 2024 | Instruction Following | CodeCode Available | 1 |
| Scalable Cross-Entropy Loss for Sequential Recommendations with Large Item Catalogs | Sep 27, 2024 | GPURecommendation Systems | CodeCode Available | 1 |
| Dual Cone Gradient Descent for Training Physics-Informed Neural Networks | Sep 27, 2024 | | CodeCode Available | 1 |
| From Vision to Audio and Beyond: A Unified Model for Audio-Visual Representation and Generation | Sep 27, 2024 | Audio ClassificationAudio Generation | CodeCode Available | 1 |
| Underwater Image Enhancement with Physical-based Denoising Diffusion Implicit Models | Sep 27, 2024 | DenoisingImage Enhancement | CodeCode Available | 1 |
| ARLBench: Flexible and Efficient Benchmarking for Hyperparameter Optimization in Reinforcement Learning | Sep 27, 2024 | AutoMLBenchmarking | CodeCode Available | 1 |
| A comprehensive review and new taxonomy on superpixel segmentation | Sep 27, 2024 | Superpixels | CodeCode Available | 1 |
| Generative AI for fast and accurate statistical computation of fluids | Sep 27, 2024 | Operator learning | CodeCode Available | 1 |
| LML-DAP: Language Model Learning a Dataset for Data-Augmented Prediction | Sep 27, 2024 | ClassificationFeature Engineering | CodeCode Available | 1 |
| Improving Visual Object Tracking through Visual Prompting | Sep 27, 2024 | Object | CodeCode Available | 1 |
| Cottention: Linear Transformers With Cosine Attention | Sep 27, 2024 | | CodeCode Available | 1 |
| DualAD: Dual-Layer Planning for Reasoning in Autonomous Driving | Sep 26, 2024 | Autonomous DrivingLanguage Modeling | CodeCode Available | 1 |
| IFCap: Image-like Retrieval and Frequency-based Entity Filtering for Zero-shot Captioning | Sep 26, 2024 | Image CaptioningRetrieval | CodeCode Available | 1 |
| HydraViT: Stacking Heads for a Scalable ViT | Sep 26, 2024 | | CodeCode Available | 1 |
| Task-recency bias strikes back: Adapting covariances in Exemplar-Free Class Incremental Learning | Sep 26, 2024 | class-incremental learningClass Incremental Learning | CodeCode Available | 1 |
| A Simple but Strong Baseline for Sounding Video Generation: Effective Adaptation of Audio and Video Diffusion Models for Joint Generation | Sep 26, 2024 | Inductive BiasVideo Generation | CodeCode Available | 1 |
| MIO: A Foundation Model on Multimodal Tokens | Sep 26, 2024 | modelText Generation | CodeCode Available | 1 |
| InterNet: Unsupervised Cross-modal Homography Estimation Based on Interleaved Modality Transfer and Self-supervised Homography Prediction | Sep 26, 2024 | Domain GeneralizationHomography Estimation | CodeCode Available | 1 |
| Realistic Evaluation of Model Merging for Compositional Generalization | Sep 26, 2024 | image-classificationImage Classification | CodeCode Available | 1 |
| Uni-Med: A Unified Medical Generalist Foundation Model For Multi-Task Learning Via Connector-MoE | Sep 26, 2024 | image-classificationImage Classification | CodeCode Available | 1 |
| MECD: Unlocking Multi-Event Causal Discovery in Video Reasoning | Sep 26, 2024 | Causal DiscoveryCausal Discovery in Video Reasoning | CodeCode Available | 1 |
| DarkSAM: Fooling Segment Anything Model to Segment Nothing | Sep 26, 2024 | model | CodeCode Available | 1 |
| Revisiting Deep Ensemble Uncertainty for Enhanced Medical Anomaly Detection | Sep 26, 2024 | Anomaly Detection | CodeCode Available | 1 |
| CNCA: Toward Customizable and Natural Generation of Adversarial Camouflage for Vehicle Detectors | Sep 26, 2024 | | CodeCode Available | 1 |
| Leveraging Anthropometric Measurements to Improve Human Mesh Estimation and Ensure Consistent Body Shapes | Sep 26, 2024 | 3D Human Pose EstimationPose Estimation | CodeCode Available | 1 |
| An Adversarial Perspective on Machine Unlearning for AI Safety | Sep 26, 2024 | Machine Unlearning | CodeCode Available | 1 |
| RED QUEEN: Safeguarding Large Language Models against Concealed Multi-Turn Jailbreaking | Sep 26, 2024 | Red Teaming | CodeCode Available | 1 |
| BEATS: Optimizing LLM Mathematical Capabilities with BackVerify and Adaptive Disambiguate based Efficient Tree Search | Sep 26, 2024 | MathMathematical Problem-Solving | CodeCode Available | 1 |
| A Time Series is Worth Five Experts: Heterogeneous Mixture of Experts for Traffic Flow Prediction | Sep 26, 2024 | Mixture-of-ExpertsPrediction | CodeCode Available | 1 |
| LightAvatar: Efficient Head Avatar as Dynamic Neural Light Field | Sep 26, 2024 | GPUNeRF | CodeCode Available | 1 |
| Wavelet-Driven Generalizable Framework for Deepfake Face Forgery Detection | Sep 26, 2024 | DeepFake DetectionFace Swapping | CodeCode Available | 1 |
| Self-Distilled Depth Refinement with Noisy Poisson Fusion | Sep 26, 2024 | Depth Estimation | CodeCode Available | 1 |
| Omni6D: Large-Vocabulary 3D Object Dataset for Category-Level 6D Object Pose Estimation | Sep 26, 2024 | 6D Pose Estimation6D Pose Estimation using RGB | CodeCode Available | 1 |
| Autonomous Network Defence using Reinforcement Learning | Sep 26, 2024 | reinforcement-learningReinforcement Learning | CodeCode Available | 1 |
| Trustworthy Text-to-Image Diffusion Models: A Timely and Focused Survey | Sep 26, 2024 | FairnessImage Generation | CodeCode Available | 1 |
| A Framework for Standardizing Similarity Measures in a Rapidly Evolving Field | Sep 26, 2024 | | CodeCode Available | 1 |
| MALPOLON: A Framework for Deep Species Distribution Modeling | Sep 26, 2024 | BenchmarkingGPU | CodeCode Available | 1 |
| Taming Diffusion Prior for Image Super-Resolution with Domain Shift SDEs | Sep 26, 2024 | Image RestorationImage Super-Resolution | CodeCode Available | 1 |
| GrEmLIn: A Repository of Green Baseline Embeddings for 87 Low-Resource Languages Injected with Multilingual Graph Knowledge | Sep 26, 2024 | Natural Language InferenceSentiment Analysis | CodeCode Available | 1 |
| DMC-VB: A Benchmark for Representation Learning for Control with Visual Distractors | Sep 26, 2024 | continuous-controlContinuous Control | CodeCode Available | 1 |