| Must Read: A Systematic Survey of Computational Persuasion | May 12, 2025 | Fairness, Marketing | Code Available | 1 |
| Ophora: A Large-Scale Data-Driven Text-Guided Ophthalmic Surgical Video Generation Model | May 12, 2025 | Video Generation | Code Available | 1 |
| ISAC: An Invertible and Stable Auditory Filter Bank with Customizable Kernels for ML Integration | May 12, 2025 | ISAC | Code Available | 1 |
| Asynchronous Multi-Object Tracking with an Event Camera | May 12, 2025 | Multi-Object Tracking, Object | Code Available | 1 |
| Overflow Prevention Enhances Long-Context Recurrent LLMs | May 12, 2025 | Mamba | Code Available | 1 |
| Finite-Sample-Based Reachability for Safe Control with Gaussian Process Dynamics | May 12, 2025 | Model Predictive Control | Code Available | 1 |
| Guiding Data Collection via Factored Scaling Curves | May 12, 2025 | Imitation Learning | Code Available | 1 |
| Towards Actionable Pedagogical Feedback: A Multi-Perspective Analysis of Mathematics Teaching and Tutoring Dialogue | May 12, 2025 | TAG | Code Available | 1 |
| Measuring General Intelligence with Generated Games | May 12, 2025 | In-Context Learning, Large Language Model | Code Available | 1 |
| Kalman Filter Enhanced GRPO for Reinforcement Learning-Based Language Model Reasoning | May 12, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| A Multi-Dimensional Constraint Framework for Evaluating and Improving Instruction Following in Large Language Models | May 12, 2025 | Instruction Following | Code Available | 1 |
| FLUXSynID: A Framework for Identity-Controlled Synthetic Face Generation with Document and Live Images | May 12, 2025 | Diversity, Face Generation | Code Available | 1 |
| Symbolic Regression with Multimodal Large Language Models and Kolmogorov Arnold Networks | May 12, 2025 | Kolmogorov-Arnold Networks, Language Modeling | Code Available | 1 |
| Chronocept: Instilling a Sense of Time in Machines | May 12, 2025 | Fact Checking, RAG | Code Available | 1 |
| Codifying Character Logic in Role-Playing | May 12, 2025 | | Code Available | 1 |
| Neural Brain: A Neuroscience-inspired Framework for Embodied Agents | May 12, 2025 | Navigate | Code Available | 1 |
| DocVXQA: Context-Aware Visual Explanations for Document Question Answering | May 12, 2025 | Question Answering | Code Available | 1 |
| Can LLM-based Financial Investing Strategies Outperform the Market in Long Run? | May 11, 2025 | | Code Available | 1 |
| Non-Stationary Time Series Forecasting Based on Fourier Analysis and Cross Attention Mechanism | May 11, 2025 | Financial Analysis, Time Series | Code Available | 1 |
| Unsupervised Learning for Class Distribution Mismatch | May 11, 2025 | | Code Available | 1 |
| Semantic-Guided Diffusion Model for Single-Step Image Super-Resolution | May 11, 2025 | Image Super-Resolution, Semantic Segmentation | Code Available | 1 |
| MELLM: Exploring LLM-Powered Micro-Expression Understanding Enhanced by Subtle Motion Perception | May 11, 2025 | Emotion Classification, Large Language Model | Code Available | 1 |
| Learning Soft Sparse Shapes for Efficient Time-Series Classification | May 11, 2025 | Classification, Time Series | Code Available | 1 |
| BioProBench: Comprehensive Dataset and Benchmark in Biological Protocol Understanding and Reasoning | May 11, 2025 | Question Answering | Code Available | 1 |
| Hallucination-Aware Multimodal Benchmark for Gastrointestinal Image Analysis with Large Vision-Language Models | May 11, 2025 | Descriptive, Diagnostic | Code Available | 1 |
| Multimodal Fake News Detection: MFND Dataset and Shallow-Deep Multitask Learning | May 11, 2025 | Contrastive Learning, Face Swapping | Code Available | 1 |
| JaxRobotarium: Training and Deploying Multi-Robot Policies in 10 Minutes | May 10, 2025 | Benchmarking, GPU | Code Available | 1 |
| Quadrupedal Robot Skateboard Mounting via Reverse Curriculum Learning | May 10, 2025 | | Code Available | 1 |
| Reproducing and Improving CheXNet: Deep Learning for Chest X-ray Disease Classification | May 10, 2025 | Multi-Label Classification | Code Available | 1 |
| FNBench: Benchmarking Robust Federated Learning against Noisy Labels | May 10, 2025 | Benchmarking, Federated Learning | Code Available | 1 |
| TS-SUPERB: A Target Speech Processing Benchmark for Speech Self-Supervised Learning Models | May 10, 2025 | Self-Supervised Learning | Code Available | 1 |
| Edge-Enabled VIO with Long-Tracked Features for High-Accuracy Low-Altitude IoT Navigation | May 10, 2025 | Depth Estimation, Depth Prediction | Code Available | 1 |
| M3CAD: Towards Generic Cooperative Autonomous Driving Benchmark | May 10, 2025 | Autonomous Driving, Motion Forecasting | Code Available | 1 |
| Causal Prompt Calibration Guided Segment Anything Model for Open-Vocabulary Multi-Entity Segmentation | May 10, 2025 | | Code Available | 1 |
| MacRAG: Compress, Slice, and Scale-up for Multi-Scale Adaptive Context RAG | May 10, 2025 | RAG, Retrieval | Code Available | 1 |
| SmartPilot: A Multiagent CoPilot for Adaptive and Intelligent Manufacturing | May 10, 2025 | Decision Making, Production Forecasting | Code Available | 1 |
| Emotion-Qwen: Training Hybrid Experts for Unified Emotion and General Vision-Language Understanding | May 10, 2025 | Descriptive, Emotion Recognition | Code Available | 1 |
| MxMoE: Mixed-precision Quantization for MoE with Accuracy and Performance Co-Design | May 9, 2025 | Mixture-of-Experts, Quantization | Code Available | 1 |
| FastDup: a scalable duplicate marking tool using speculation-and-test mechanism | May 9, 2025 | | Code Available | 1 |
| RefRef: A Synthetic Dataset and Benchmark for Reconstructing Refractive and Reflective Objects | May 9, 2025 | 3D Reconstruction, Neural Rendering | Code Available | 1 |
| PYRREGULAR: A Unified Framework for Irregular Time Series, with Classification Benchmarks | May 9, 2025 | Irregular Time Series, Missing Values | Code Available | 1 |
| A Survey on Bridging VLMs and Synthetic Data | May 9, 2025 | Survey | Code Available | 1 |
| Fast Differentiable Modal Simulation of Non-linear Strings, Membranes, and Plates | May 9, 2025 | Audio Synthesis, CPU | Code Available | 1 |
| Accelerating Diffusion Transformer via Increment-Calibrated Caching with Channel-Aware Singular Value Decomposition | May 9, 2025 | Image Generation | Code Available | 1 |
| Physics-informed Temporal Difference Metric Learning for Robot Motion Planning | May 9, 2025 | Metric Learning, Motion Planning | Code Available | 1 |
| Cost-Effective, Low Latency Vector Search with Azure Cosmos DB | May 9, 2025 | | Code Available | 1 |
| DiGIT: Multi-Dilated Gated Encoder and Central-Adjacent Region Integrated Decoder for Temporal Action Detection Transformer | May 9, 2025 | Action Detection, Decoder | Code Available | 1 |
| MM-Skin: Enhancing Dermatology Vision-Language Model with an Image-Text Dataset Derived from Textbooks | May 9, 2025 | Diagnostic, Instruction Following | Code Available | 1 |
| LAPSO: A Unified Optimization View for Learning-Augmented Power System Operations | May 8, 2025 | | Code Available | 1 |
| Building-Guided Pseudo-Label Learning for Cross-Modal Building Damage Mapping | May 8, 2025 | Building Damage Assessment, Change Detection | Code Available | 1 |