| ROUTE: Robust Multitask Tuning and Collaboration for Text-to-SQL | Dec 13, 2024 | In-Context Learning, Text to SQL | Code Available | 1 |
| DEFAME: Dynamic Evidence-based FAct-checking with Multimodal Experts | Dec 13, 2024 | Claim Verification, Fact Checking | Code Available | 1 |
| CognitionCapturer: Decoding Visual Stimuli From Human EEG Signal With Multimodal Information | Dec 13, 2024 | EEG, Electroencephalogram (EEG) | Code Available | 1 |
| From Allies to Adversaries: Manipulating LLM Tool-Calling through Adversarial Injection | Dec 13, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| UN-DETR: Promoting Objectness Learning via Joint Supervision for Unknown Object Detection | Dec 13, 2024 | object-detection, Object Detection | Code Available | 1 |
| Aspen Open Jets: Unlocking LHC Data for Foundation Models in Particle Physics | Dec 13, 2024 | | Code Available | 1 |
| RETQA: A Large-Scale Open-Domain Tabular Question Answering Dataset for Real Estate Sector | Dec 13, 2024 | In-Context Learning, Question Answering | Code Available | 1 |
| FM2S: Towards Spatially-Correlated Noise Modeling in Zero-Shot Fluorescence Microscopy Image Denoising | Dec 13, 2024 | Computational Efficiency, Data Augmentation | Code Available | 1 |
| Filter or Compensate: Towards Invariant Representation from Distribution Shift for Anomaly Detection | Dec 13, 2024 | Anomaly Detection | Code Available | 1 |
| GraSP: Simple yet Effective Graph Similarity Predictions | Dec 13, 2024 | Graph Similarity | Code Available | 1 |
| Semi-IIN: Semi-supervised Intra-inter modal Interaction Learning Network for Multimodal Sentiment Analysis | Dec 13, 2024 | Multimodal Sentiment Analysis, Sentiment Analysis | Code Available | 1 |
| ChainStream: An LLM-based Framework for Unified Synthetic Sensing | Dec 13, 2024 | Code Generation | Code Available | 1 |
| Multi-Head Encoding for Extreme Label Classification | Dec 13, 2024 | Classification | Code Available | 1 |
| The Complexity Dynamics of Grokking | Dec 13, 2024 | Generalization Bounds, Memorization | Code Available | 1 |
| Familiarity: Better Evaluation of Zero-Shot Named Entity Recognition by Quantifying Label Shifts in Synthetic Training Data | Dec 13, 2024 | named-entity-recognition, Named Entity Recognition | Code Available | 1 |
| Enhancing Multimodal Large Language Models Complex Reason via Similarity Computation | Dec 13, 2024 | Token Reduction | Code Available | 1 |
| CaLoRAify: Calorie Estimation with Visual-Text Pairing and LoRA-Driven Visual Language Models | Dec 13, 2024 | RAG | Code Available | 1 |
| waveOrder: generalist framework for label-agnostic computational microscopy | Dec 13, 2024 | | Code Available | 1 |
| GReaTer: Gradients over Reasoning Makes Smaller Language Models Strong Prompt Optimizers | Dec 12, 2024 | GSM8K, Prompt Engineering | Code Available | 1 |
| Filter-then-Generate: Large Language Models with Structure-Text Adapter for Knowledge Graph Completion | Dec 12, 2024 | Hallucination, Knowledge Graph Completion | Code Available | 1 |
| Dynamic-VLM: Simple Dynamic Visual Token Compression for VideoLLM | Dec 12, 2024 | Computational Efficiency | Code Available | 1 |
| Towards Open-Vocabulary Video Semantic Segmentation | Dec 12, 2024 | Segmentation, Semantic Segmentation | Code Available | 1 |
| Enhancing Implicit Neural Representations via Symmetric Power Transformation | Dec 12, 2024 | | Code Available | 1 |
| Federated Foundation Models on Heterogeneous Time Series | Dec 12, 2024 | Anomaly Detection, Federated Learning | Code Available | 1 |
| Motif Guided Graph Transformer with Combinatorial Skeleton Prototype Learning for Skeleton-Based Person Re-Identification | Dec 12, 2024 | Person Re-Identification | Code Available | 1 |
| Multimodal Music Generation with Explicit Bridges and Retrieval Augmentation | Dec 12, 2024 | cross-modal alignment, Multimodal Music Generation | Code Available | 1 |
| GoHD: Gaze-oriented and Highly Disentangled Portrait Animation with Rhythmic Poses and Realistic Expression | Dec 12, 2024 | Disentanglement, Portrait Animation | Code Available | 1 |
| A physics-informed transformer neural operator for learning generalized solutions of initial boundary value problems | Dec 12, 2024 | Operator learning | Code Available | 1 |
| Reversing the Damage: A QP-Aware Transformer-Diffusion Approach for 8K Video Restoration under Codec Compression | Dec 12, 2024 | 4k, 8k | Code Available | 1 |
| USDRL: Unified Skeleton-Based Dense Representation Learning with Multi-Grained Feature Decorrelation | Dec 12, 2024 | Action Detection, Action Recognition | Code Available | 1 |
| A Flexible Plug-and-Play Module for Generating Variable-Length | Dec 12, 2024 | Deep Hashing, Image Retrieval | Code Available | 1 |
| RuleArena: A Benchmark for Rule-Guided Reasoning with LLMs in Real-World Scenarios | Dec 12, 2024 | Logical Reasoning, Long-Context Understanding | Code Available | 1 |
| In-Dataset Trajectory Return Regularization for Offline Preference-based Reinforcement Learning | Dec 12, 2024 | Offline RL | Code Available | 1 |
| Selective Visual Prompting in Vision Mamba | Dec 12, 2024 | Mamba, State Space Models | Code Available | 1 |
| Toward Foundation Model for Multivariate Wearable Sensing of Physiological Signals | Dec 12, 2024 | EEG, Time Series | Code Available | 1 |
| Weighted Poisson-disk Resampling on Large-Scale Point Clouds | Dec 12, 2024 | | Code Available | 1 |
| MultiEYE: Dataset and Benchmark for OCT-Enhanced Retinal Disease Recognition from Fundus Images | Dec 12, 2024 | Diagnostic, Transfer Learning | Code Available | 1 |
| SPRec: Leveraging Self-Play to Debias Preference Alignment for Large Language Model-based Recommendations | Dec 12, 2024 | Fairness, Language Modeling | Code Available | 1 |
| Can Modern LLMs Act as Agent Cores in Radiology Environments? | Dec 12, 2024 | | Code Available | 1 |
| SMMF: Square-Matricized Momentum Factorization for Memory-Efficient Optimization | Dec 12, 2024 | | Code Available | 1 |
| Video Repurposing from User Generated Content: A Large-scale Dataset and Benchmark | Dec 12, 2024 | Highlight Detection, Video Summarization | Code Available | 1 |
| Dynamic Contrastive Knowledge Distillation for Efficient Image Restoration | Dec 12, 2024 | Contrastive Learning, Image Restoration | Code Available | 1 |
| Lexico: Extreme KV Cache Compression via Sparse Coding over Universal Dictionaries | Dec 12, 2024 | 4k, GSM8K | Code Available | 1 |
| OFTSR: One-Step Flow for Image Super-Resolution with Tunable Fidelity-Realism Trade-offs | Dec 12, 2024 | Image Restoration, Image Super-Resolution | Code Available | 1 |
| CAPrompt: Cyclic Prompt Aggregation for Pre-Trained Model Based Class Incremental Learning | Dec 12, 2024 | class-incremental learning, Class Incremental Learning | Code Available | 1 |
| Temporal Action Localization with Cross Layer Task Decoupling and Refinement | Dec 12, 2024 | Action Classification, Action Localization | Code Available | 1 |
| Physics-Driven Autoregressive State Space Models for Medical Image Reconstruction | Dec 12, 2024 | Image Reconstruction, Sensitivity | Code Available | 1 |
| Video Anomaly Detection with Motion and Appearance Guided Patch Diffusion Model | Dec 12, 2024 | Anomaly Detection, Video Anomaly Detection | Code Available | 1 |
| GEAL: Generalizable 3D Affordance Learning with Cross-Modal Consistency | Dec 12, 2024 | cross-modal alignment, Transfer Learning | Code Available | 1 |
| PBR-NeRF: Inverse Rendering with Physics-Based Neural Fields | Dec 12, 2024 | 3D Reconstruction, Inverse Rendering | Code Available | 1 |