| Self-Assessed Generation: Trustworthy Label Generation for Optical Flow and Stereo Matching in Real-world | Oct 14, 2024 | Optical Flow EstimationStereo Matching | CodeCode Available | 1 |
| Stable Hadamard Memory: Revitalizing Memory-Augmented Agents for Reinforcement Learning | Oct 14, 2024 | Decision MakingManagement | CodeCode Available | 1 |
| LiveXiv -- A Multi-Modal Live Benchmark Based on Arxiv Papers Content | Oct 14, 2024 | Visual Question Answering (VQA)World Knowledge | CodeCode Available | 1 |
| Fine-grained Abnormality Prompt Learning for Zero-shot Anomaly Detection | Oct 14, 2024 | Anomaly DetectionPrompt Learning | CodeCode Available | 1 |
| MAIR: A Massive Benchmark for Evaluating Instructed Retrieval | Oct 14, 2024 | Information RetrievalRe-Ranking | CodeCode Available | 1 |
| PCF-Lift: Panoptic Lifting by Probabilistic Contrastive Fusion | Oct 14, 2024 | 3D Panoptic SegmentationPanoptic Segmentation | CodeCode Available | 1 |
| Hard-Constrained Neural Networks with Universal Approximation Guarantees | Oct 14, 2024 | | CodeCode Available | 1 |
| MAFin: Motif Detection in Multiple Alignment Files | Oct 14, 2024 | | CodeCode Available | 1 |
| Adversarially Robust Out-of-Distribution Detection Using Lyapunov-Stabilized Embeddings | Oct 14, 2024 | Out-of-Distribution DetectionOut of Distribution (OOD) Detection | CodeCode Available | 1 |
| AlphaPruning: Using Heavy-Tailed Self Regularization Theory for Improved Layer-wise Pruning of Large Language Models | Oct 14, 2024 | | CodeCode Available | 1 |
| GraFPrint: A GNN-Based Approach for Audio Identification | Oct 14, 2024 | | CodeCode Available | 1 |
| CoMAT: Chain of Mathematically Annotated Thought Improves Mathematical Reasoning | Oct 14, 2024 | MathMathematical Reasoning | CodeCode Available | 1 |
| Graph of Records: Boosting Retrieval Augmented Generation for Long-context Summarization with Graphs | Oct 14, 2024 | Graph Neural NetworkRAG | CodeCode Available | 1 |
| Customize Your Visual Autoregressive Recipe with Set Autoregressive Modeling | Oct 14, 2024 | Image Generation | CodeCode Available | 1 |
| TrajDiffuse: A Conditional Diffusion Model for Environment-Aware Trajectory Prediction | Oct 14, 2024 | Collision AvoidanceDenoising | CodeCode Available | 1 |
| Differentiable Weightless Neural Networks | Oct 14, 2024 | Edge-computing | CodeCode Available | 1 |
| TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models | Oct 14, 2024 | 2kBenchmarking | CodeCode Available | 1 |
| Replay-and-Forget-Free Graph Class-Incremental Learning: A Task Profiling and Prompting Approach | Oct 14, 2024 | class-incremental learningClass Incremental Learning | CodeCode Available | 1 |
| LoLI-Street: Benchmarking Low-Light Image Enhancement and Beyond | Oct 13, 2024 | Autonomous DrivingAutonomous Vehicles | CodeCode Available | 1 |
| HARDMath: A Benchmark Dataset for Challenging Problems in Applied Mathematics | Oct 13, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Taming Overconfidence in LLMs: Reward Calibration in RLHF | Oct 13, 2024 | Multiple-choice | CodeCode Available | 1 |
| ChroKnowledge: Unveiling Chronological Knowledge of Language Models in Multiple Domains | Oct 13, 2024 | | CodeCode Available | 1 |
| STA-Unet: Rethink the semantic redundant for Medical Imaging Segmentation | Oct 13, 2024 | Medical Image AnalysisMedical Image Segmentation | CodeCode Available | 1 |
| Targeted Vaccine: Safety Alignment for Large Language Models against Harmful Fine-Tuning via Layer-wise Perturbation | Oct 13, 2024 | Safety AlignmentTAR | CodeCode Available | 1 |
| ECVC: Exploiting Non-Local Correlations in Multiple Frames for Contextual Video Compression | Oct 13, 2024 | Video Compression | CodeCode Available | 1 |
| Uncovering, Explaining, and Mitigating the Superficial Safety of Backdoor Defense | Oct 13, 2024 | backdoor defense | CodeCode Available | 1 |
| TULIP: Token-length Upgraded CLIP | Oct 13, 2024 | Image GenerationPosition | CodeCode Available | 1 |
| Combining Generative and Geometry Priors for Wide-Angle Portrait Correction | Oct 13, 2024 | | CodeCode Available | 1 |
| Stratified Domain Adaptation: A Progressive Self-Training Approach for Scene Text Recognition | Oct 13, 2024 | Domain AdaptationOptical Character Recognition (OCR) | CodeCode Available | 1 |
| AuthFace: Towards Authentic Blind Face Restoration with Face-oriented Generative Diffusion Prior | Oct 13, 2024 | 8kBlind Face Restoration | CodeCode Available | 1 |
| EasyJudge: an Easy-to-use Tool for Comprehensive Response Evaluation of LLMs | Oct 13, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Prompt Tuning for Audio Deepfake Detection: Computationally Efficient Test-time Domain Adaptation with Limited Target Dataset | Oct 13, 2024 | Audio Deepfake DetectionComputational Efficiency | CodeCode Available | 1 |
| Robust 3D Point Clouds Classification based on Declarative Defenders | Oct 13, 2024 | 3D Point Cloud Classificationimage-classification | CodeCode Available | 1 |
| RMB: Comprehensively Benchmarking Reward Models in LLM Alignment | Oct 13, 2024 | Benchmarking | CodeCode Available | 1 |
| InterMask: 3D Human Interaction Generation via Collaborative Masked Modelling | Oct 13, 2024 | Motion Synthesis | CodeCode Available | 1 |
| Variational Diffusion Posterior Sampling with Midpoint Guidance | Oct 13, 2024 | Denoising | CodeCode Available | 1 |
| UnSeg: One Universal Unlearnable Example Generator is Enough against All Image Segmentation | Oct 13, 2024 | AllBilevel Optimization | CodeCode Available | 1 |
| Exploring Behavior-Relevant and Disentangled Neural Dynamics with Generative Diffusion Models | Oct 12, 2024 | Disentanglement | CodeCode Available | 1 |
| Bridging Gaps: Federated Multi-View Clustering in Heterogeneous Hybrid Views | Oct 12, 2024 | ClusteringContrastive Learning | CodeCode Available | 1 |
| The Best of Both Worlds: On the Dilemma of Out-of-distribution Detection | Oct 12, 2024 | Out-of-Distribution DetectionOut of Distribution (OOD) Detection | CodeCode Available | 1 |
| Generative Subgraph Retrieval for Knowledge Graph-Grounded Dialog Generation | Oct 12, 2024 | InformativenessRetrieval | CodeCode Available | 1 |
| FedEx-LoRA: Exact Aggregation for Federated and Efficient Fine-Tuning of Foundation Models | Oct 12, 2024 | Arithmetic ReasoningFederated Learning | CodeCode Available | 1 |
| SLiM: One-shot Quantization and Sparsity with Low-rank Approximation for LLM Weight Compression | Oct 12, 2024 | Model CompressionNatural Language Understanding | CodeCode Available | 1 |
| SciEvo: A 2 Million, 30-Year Cross-disciplinary Dataset for Temporal Scientometric Analysis | Oct 12, 2024 | | CodeCode Available | 1 |
| Mamba4Cast: Efficient Zero-Shot Time Series Forecasting with State Space Models | Oct 12, 2024 | AutoMLMamba | CodeCode Available | 1 |
| Skipping Computations in Multimodal LLMs | Oct 12, 2024 | Question AnsweringVisual Question Answering | CodeCode Available | 1 |
| Towards Multi-Modal Animal Pose Estimation: A Survey and In-Depth Analysis | Oct 12, 2024 | Animal Pose EstimationPose Estimation | CodeCode Available | 1 |
| Multi-granularity Contrastive Cross-modal Collaborative Generation for End-to-End Long-term Video Question Answering | Oct 12, 2024 | Answer GenerationBlocking | CodeCode Available | 1 |
| MTL-LoRA: Low-Rank Adaptation for Multi-Task Learning | Oct 12, 2024 | Domain AdaptationMulti-Task Learning | CodeCode Available | 1 |
| Rethinking Data Selection at Scale: Random Selection is Almost All You Need | Oct 12, 2024 | All | CodeCode Available | 1 |