| Adaptive Probabilistic ODE Solvers Without Adaptive Memory Requirements | Oct 14, 2024 | State EstimationTime Series | CodeCode Available | 2 |
| Free Video-LLM: Prompt-guided Visual Perception for Efficient Training-free Video LLMs | Oct 14, 2024 | Computational EfficiencyQuestion Answering | CodeCode Available | 2 |
| TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance Control | Oct 14, 2024 | DisentanglementImage Generation | CodeCode Available | 2 |
| Sitcom-Crafter: A Plot-Driven Human Motion Generation System in 3D Scenes | Oct 14, 2024 | Motion GenerationMotion Synthesis | CodeCode Available | 2 |
| Tex4D: Zero-shot 4D Scene Texturing with Video Diffusion Models | Oct 14, 2024 | 3D geometryDenoising | CodeCode Available | 2 |
| Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free | Oct 14, 2024 | Mixture-of-Experts | CodeCode Available | 2 |
| Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family Experts | Oct 14, 2024 | Mixture-of-Experts | CodeCode Available | 2 |
| A Scalable Communication Protocol for Networks of Large Language Models | Oct 14, 2024 | | CodeCode Available | 2 |
| Learning to Optimize for Mixed-Integer Non-linear Programming with Feasibility Guarantees | Oct 14, 2024 | | CodeCode Available | 2 |
| Queryable Prototype Multiple Instance Learning with Vision-Language Models for Incremental Whole Slide Image Classification | Oct 14, 2024 | Classificationimage-classification | CodeCode Available | 2 |
| TRESTLE: A Model of Concept Formation in Structured Domains | Oct 14, 2024 | Attribute | CodeCode Available | 2 |
| Text4Seg: Reimagining Image Segmentation as Text Generation | Oct 13, 2024 | Image SegmentationReferring Expression | CodeCode Available | 2 |
| Large Scale Longitudinal Experiments: Estimation and Inference | Oct 13, 2024 | Computational Efficiency | CodeCode Available | 2 |
| Bayesian Enhancement Models for One-to-Many Mapping in Image Enhancement | Oct 13, 2024 | Image EnhancementLow-Light Image Enhancement | CodeCode Available | 2 |
| LLM-Based Multi-Agent Systems are Scalable Graph Generative Models | Oct 13, 2024 | BenchmarkingGraph Generation | CodeCode Available | 2 |
| Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy | Oct 13, 2024 | DenoisingPrediction | CodeCode Available | 2 |
| Learning Pattern-Specific Experts for Time Series Forecasting Under Patch-level Distribution Shift | Oct 13, 2024 | Time SeriesTime Series Forecasting | CodeCode Available | 2 |
| LibEER: A Comprehensive Benchmark and Algorithm Library for EEG-based Emotion Recognition | Oct 13, 2024 | EEGEmotion Recognition | CodeCode Available | 2 |
| SimBa: Simplicity Bias for Scaling Up Parameters in Deep Reinforcement Learning | Oct 13, 2024 | Computational EfficiencyDeep Reinforcement Learning | CodeCode Available | 2 |
| Reconstructive Visual Instruction Tuning | Oct 12, 2024 | Denoising | CodeCode Available | 2 |
| ESVO2: Direct Visual-Inertial Odometry with Stereo Event Cameras | Oct 12, 2024 | motion predictionPose Tracking | CodeCode Available | 2 |
| Toward General Instruction-Following Alignment for Retrieval-Augmented Generation | Oct 12, 2024 | Instruction FollowingRAG | CodeCode Available | 2 |
| Many Heads Are Better Than One: Improved Scientific Idea Generation by A LLM-Based Multi-Agent System | Oct 12, 2024 | Experimental Designscientific discovery | CodeCode Available | 2 |
| Look Gauss, No Pose: Novel View Synthesis using Gaussian Splatting without Accurate Pose Initialization | Oct 11, 2024 | Camera Pose EstimationNovel View Synthesis | CodeCode Available | 2 |
| StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization | Oct 11, 2024 | RAGRetrieval-augmented Generation | CodeCode Available | 2 |
| pyhgf: A neural network library for predictive coding | Oct 11, 2024 | Causal DiscoveryMeta-Learning | CodeCode Available | 2 |
| On the State of NLP Approaches to Modeling Depression in Social Media: A Post-COVID-19 Outlook | Oct 11, 2024 | EthicsFairness | CodeCode Available | 2 |
| Enhancing Multi-Step Reasoning Abilities of Language Models through Direct Q-Function Optimization | Oct 11, 2024 | GSM8KLanguage Modeling | CodeCode Available | 2 |
| JAILJUDGE: A Comprehensive Jailbreak Judge Benchmark with Multi-Agent Enhanced Explanation Evaluation Framework | Oct 11, 2024 | | CodeCode Available | 2 |
| Retriever-and-Memory: Towards Adaptive Note-Enhanced Retrieval-Augmented Generation | Oct 11, 2024 | Open-Domain Question AnsweringQuestion Answering | CodeCode Available | 2 |
| radarODE-MTL: A Multi-Task Learning Framework with Eccentric Gradient Alignment for Robust Radar-Based ECG Reconstruction | Oct 11, 2024 | Multi-Task Learning | CodeCode Available | 2 |
| DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory | Oct 10, 2024 | Document TranslationMachine Translation | CodeCode Available | 2 |
| Deconstructing equivariant representations in molecular systems | Oct 10, 2024 | Property Prediction | CodeCode Available | 2 |
| IncEventGS: Pose-Free Gaussian Splatting from a Single Event Camera | Oct 10, 2024 | Motion EstimationNeRF | CodeCode Available | 2 |
| From Exploration to Mastery: Enabling LLMs to Master Tools via Self-Driven Interactions | Oct 10, 2024 | Diversity | CodeCode Available | 2 |
| Poison-splat: Computation Cost Attack on 3D Gaussian Splatting | Oct 10, 2024 | 3DGS | CodeCode Available | 2 |
| VibeCheck: Discover and Quantify Qualitative Differences in Large Language Models | Oct 10, 2024 | Math | CodeCode Available | 2 |
| MotionGS: Exploring Explicit Motion Guidance for Deformable 3D Gaussian Splatting | Oct 10, 2024 | 3D ReconstructionDynamic Reconstruction | CodeCode Available | 2 |
| Heating Up Quasi-Monte Carlo Graph Random Features: A Diffusion Kernel Perspective | Oct 10, 2024 | | CodeCode Available | 2 |
| PointOBB-v2: Towards Simpler, Faster, and Stronger Single Point Supervised Oriented Object Detection | Oct 10, 2024 | object-detectionObject Detection | CodeCode Available | 2 |
| Progressive Autoregressive Video Diffusion Models | Oct 10, 2024 | DenoisingVideo Denoising | CodeCode Available | 2 |
| TurboRAG: Accelerating Retrieval-Augmented Generation with Precomputed KV Caches for Chunked Text | Oct 10, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| MathCoder2: Better Math Reasoning from Continued Pretraining on Model-translated Mathematical Code | Oct 10, 2024 | MathMathematical Reasoning | CodeCode Available | 2 |
| OneRef: Unified One-tower Expression Grounding and Segmentation with Mask Referring Modeling | Oct 10, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Efficiently Learning at Test-Time: Active Fine-Tuning of LLMs | Oct 10, 2024 | Active LearningLanguage Modeling | CodeCode Available | 2 |
| Reversible Decoupling Network for Single Image Reflection Removal | Oct 10, 2024 | Reflection Removal | CodeCode Available | 2 |
| Doob's Lagrangian: A Sample-Efficient Variational Approach to Transition Path Sampling | Oct 10, 2024 | Protein Folding | CodeCode Available | 2 |
| Benchmarking Agentic Workflow Generation | Oct 10, 2024 | Benchmarking | CodeCode Available | 2 |
| COMPL-AI Framework: A Technical Interpretation and LLM Benchmarking Suite for the EU Artificial Intelligence Act | Oct 10, 2024 | BenchmarkingFairness | CodeCode Available | 2 |
| VoxelPrompt: A Vision-Language Agent for Grounded Medical Image Analysis | Oct 10, 2024 | Medical Image AnalysisQuestion Answering | CodeCode Available | 2 |