| Efficient Camera Exposure Control for Visual Odometry via Deep Reinforcement Learning | Aug 30, 2024 | CPUDeep Reinforcement Learning | CodeCode Available | 1 |
| MoRe Fine-Tuning with 10x Fewer Parameters | Aug 30, 2024 | Neural Architecture Searchparameter-efficient fine-tuning | CodeCode Available | 1 |
| Traffic expertise meets residual RL: Knowledge-informed model-based residual reinforcement learning for CAV trajectory control | Aug 30, 2024 | Model-based Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 1 |
| Spatially-Aware Diffusion Models with Cross-Attention for Global Field Reconstruction with Sparse Observations | Aug 30, 2024 | Inductive Bias | CodeCode Available | 1 |
| Contrastive Learning with Synthetic Positives | Aug 30, 2024 | Contrastive LearningLinear evaluation | CodeCode Available | 1 |
| Open-Vocabulary Action Localization with Iterative Visual Prompting | Aug 30, 2024 | Action LocalizationTemporal Action Localization | CodeCode Available | 1 |
| MultiMath: Bridging Visual and Mathematical Reasoning for Large Language Models | Aug 30, 2024 | Image CaptioningLanguage Modeling | CodeCode Available | 1 |
| HYGENE: A Diffusion-based Hypergraph Generation Method | Aug 29, 2024 | Graph Generation | CodeCode Available | 1 |
| A Computational Framework for Modeling Emergence of Color Vision in the Human Brain | Aug 29, 2024 | | CodeCode Available | 1 |
| A high-order focus interaction model and oral ulcer dataset for oral ulcer segmentation | Aug 29, 2024 | Image SegmentationLesion Segmentation | CodeCode Available | 1 |
| Learning from Negative Samples in Generative Biomedical Entity Linking | Aug 29, 2024 | Entity Linking | CodeCode Available | 1 |
| 3D Pose-Based Temporal Action Segmentation for Figure Skating: A Fine-Grained and Jump Procedure-Aware Annotation Approach | Aug 29, 2024 | Action SegmentationMarkerless Motion Capture | CodeCode Available | 1 |
| Inversion Circle Interpolation: Diffusion-based Image Augmentation for Data-scarce Classification | Aug 29, 2024 | ClassificationData Augmentation | CodeCode Available | 1 |
| Dissecting Out-of-Distribution Detection and Open-Set Recognition: A Critical Analysis of Methods and Benchmarks | Aug 29, 2024 | Open Set LearningOut-of-Distribution Detection | CodeCode Available | 1 |
| Enhancing Sound Source Localization via False Negative Elimination | Aug 29, 2024 | audio-visual learningContrastive Learning | CodeCode Available | 1 |
| LV-UNet: A Lightweight and Vanilla Model for Medical Image Segmentation | Aug 29, 2024 | Image SegmentationMedical Image Segmentation | CodeCode Available | 1 |
| Improving Weakly-supervised Video Instance Segmentation by Leveraging Spatio-temporal Consistency | Aug 29, 2024 | Instance SegmentationSegmentation | CodeCode Available | 1 |
| How Well Do LLMs Handle Cantonese? Benchmarking Cantonese Capabilities of Large Language Models | Aug 29, 2024 | BenchmarkingGeneral Knowledge | CodeCode Available | 1 |
| NeRF-CA: Dynamic Reconstruction of X-ray Coronary Angiography with Extremely Sparse-views | Aug 29, 2024 | 4D reconstructionDynamic Reconstruction | CodeCode Available | 1 |
| Beyond Uncertainty: Evidential Deep Learning for Robust Video Temporal Grounding | Aug 29, 2024 | cross-modal alignmentDeep Learning | CodeCode Available | 1 |
| Mismatched: Evaluating the Limits of Image Matching Approaches and Benchmarks | Aug 29, 2024 | 3D ReconstructionEdge Detection | CodeCode Available | 1 |
| Guided Reasoning: A Non-Technical Introduction | Aug 29, 2024 | | CodeCode Available | 1 |
| OP-Align: Object-level and Part-level Alignment for Self-supervised Category-level Articulated Object Pose Estimation | Aug 29, 2024 | ObjectPose Estimation | CodeCode Available | 1 |
| Large-Scale Targeted Cause Discovery with Data-Driven Learning | Aug 29, 2024 | Causal DiscoveryGraph Reconstruction | CodeCode Available | 1 |
| ART: Actually Robust Training | Aug 29, 2024 | Deep Learning | CodeCode Available | 1 |
| Gradient-free variational learning with conditional mixture networks | Aug 29, 2024 | Computational EfficiencyMixture-of-Experts | CodeCode Available | 1 |
| TinyTNAS: GPU-Free, Time-Bound, Hardware-Aware Neural Architecture Search for TinyML Time Series Classification | Aug 29, 2024 | CPUDiagnostic | CodeCode Available | 1 |
| Turbulence Strength C_n^2 Estimation from Video using Physics-based Deep Learning | Aug 29, 2024 | Astronomy | CodeCode Available | 1 |
| Deep DeePC: Data-enabled predictive control with low or no online optimization using deep learning | Aug 29, 2024 | | CodeCode Available | 1 |
| LLaVA-Chef: A Multi-modal Generative Model for Food Recipes | Aug 29, 2024 | Recipe Generation | CodeCode Available | 1 |
| LLMs generate structurally realistic social networks but overestimate political homophily | Aug 29, 2024 | | CodeCode Available | 1 |
| See or Guess: Counterfactually Regularized Image Captioning | Aug 29, 2024 | Causal Inferencecounterfactual | CodeCode Available | 1 |
| SALSA: Speedy ASR-LLM Synchronous Aggregation | Aug 29, 2024 | Decoder | CodeCode Available | 1 |
| STEREO: Towards Adversarially Robust Concept Erasing from Text-to-Image Generation Models | Aug 29, 2024 | BenchmarkingImage Generation | CodeCode Available | 1 |
| PromptSmooth: Certifying Robustness of Medical Vision-Language Models via Prompt Learning | Aug 29, 2024 | Medical Image AnalysisPrompt Learning | CodeCode Available | 1 |
| PolarBEVDet: Exploring Polar Representation for Multi-View 3D Object Detection in Bird's-Eye-View | Aug 29, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 1 |
| GSDiff: Synthesizing Vector Floorplans via Geometry-enhanced Structural Graph Generation | Aug 29, 2024 | Graph Generation | CodeCode Available | 1 |
| Maven: A Multimodal Foundation Model for Supernova Science | Aug 29, 2024 | AstronomyContrastive Learning | CodeCode Available | 1 |
| What to Preserve and What to Transfer: Faithful, Identity-Preserving Diffusion-based Hairstyle Transfer | Aug 29, 2024 | | CodeCode Available | 1 |
| Training-free Video Temporal Grounding using Large-scale Pre-trained Models | Aug 29, 2024 | Temporal Localization | CodeCode Available | 1 |
| Towards Modality-agnostic Label-efficient Segmentation with Entropy-Regularized Distribution Alignment | Aug 29, 2024 | Point Cloud SegmentationPseudo Label | CodeCode Available | 1 |
| Transformers Meet ACT-R: Repeat-Aware and Sequential Listening Session Recommendation | Aug 29, 2024 | Music RecommendationRecommendation Systems | CodeCode Available | 1 |
| PrivacyLens: Evaluating Privacy Norm Awareness of Language Models in Action | Aug 29, 2024 | | CodeCode Available | 1 |
| OpenFGL: A Comprehensive Benchmark for Federated Graph Learning | Aug 29, 2024 | Graph Learning | CodeCode Available | 1 |
| Scaling Up Diffusion and Flow-based XGBoost Models | Aug 28, 2024 | Tabular Data Generation | CodeCode Available | 1 |
| ES-PTAM: Event-based Stereo Parallel Tracking and Mapping | Aug 28, 2024 | Visual Odometry | CodeCode Available | 1 |
| G-Style: Stylized Gaussian Splatting | Aug 28, 2024 | Novel View Synthesis | CodeCode Available | 1 |
| wav2pos: Sound Source Localization using Masked Autoencoders | Aug 28, 2024 | Indoor LocalizationSound Source Localization | CodeCode Available | 1 |
| Merging and Splitting Diffusion Paths for Semantically Coherent Panoramas | Aug 28, 2024 | DiversityImage Generation | CodeCode Available | 1 |
| A Comprehensive Review of 3D Object Detection in Autonomous Driving: Technological Advances and Future Directions | Aug 28, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 1 |