| voc2vec: A Foundation Model for Non-Verbal Vocalization | Feb 22, 2025 | model | CodeCode Available | 2 |
| Robust Dynamic Facial Expression Recognition | Feb 22, 2025 | Dynamic Facial Expression RecognitionFacial Expression Recognition | CodeCode Available | 2 |
| AutoToM: Automated Bayesian Inverse Planning and Model Discovery for Open-ended Theory of Mind | Feb 21, 2025 | Model Discovery | CodeCode Available | 2 |
| Protein Large Language Models: A Comprehensive Survey | Feb 21, 2025 | ArticlesProtein Structure Prediction | CodeCode Available | 2 |
| OccProphet: Pushing Efficiency Frontier of Camera-Only 4D Occupancy Forecasting with Observer-Forecaster-Refiner Framework | Feb 21, 2025 | Autonomous Driving | CodeCode Available | 2 |
| KAD: No More FAD! An Effective and Efficient Evaluation Metric for Audio Generation | Feb 21, 2025 | Audio GenerationFAD | CodeCode Available | 2 |
| PIP-KAG: Mitigating Knowledge Conflicts in Knowledge-Augmented Generation via Parametric Pruning | Feb 21, 2025 | Hallucination | CodeCode Available | 2 |
| VaViM and VaVAM: Autonomous Driving through Video Generative Modeling | Feb 21, 2025 | Autonomous DrivingImitation Learning | CodeCode Available | 2 |
| Mantis: Lightweight Calibrated Foundation Model for User-Friendly Time Series Classification | Feb 21, 2025 | Contrastive LearningTime Series | CodeCode Available | 2 |
| A Training-free LLM-based Approach to General Chinese Character Error Correction | Feb 21, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Boosting Global-Local Feature Matching via Anomaly Synthesis for Multi-Class Point Cloud Anomaly Detection | Feb 21, 2025 | 3D Anomaly Detection3D Anomaly Detection and Segmentation | CodeCode Available | 2 |
| AttentionEngine: A Versatile Framework for Efficient Attention Mechanisms on Diverse Hardware Platforms | Feb 21, 2025 | Scheduling | CodeCode Available | 2 |
| ReQFlow: Rectified Quaternion Flow for Efficient and High-Quality Protein Backbone Generation | Feb 20, 2025 | 3D Molecule GenerationProtein Design | CodeCode Available | 2 |
| TritonBench: Benchmarking Large Language Model Capabilities for Generating Triton Operators | Feb 20, 2025 | BenchmarkingCode Generation | CodeCode Available | 2 |
| MAGO-SP: Detection and Correction of Water-Fat Swaps in Magnitude-Only VIBE MRI | Feb 20, 2025 | Denoising | CodeCode Available | 2 |
| Multimodal RewardBench: Holistic Evaluation of Reward Models for Vision Language Models | Feb 20, 2025 | Question AnsweringVisual Question Answering | CodeCode Available | 2 |
| MedVAE: Efficient Automated Interpretation of Medical Images with Large-Scale Generalizable Autoencoders | Feb 20, 2025 | Computational Efficiency | CodeCode Available | 2 |
| dtaianomaly: A Python library for time series anomaly detection | Feb 20, 2025 | Anomaly DetectionTime Series | CodeCode Available | 2 |
| HiddenDetect: Detecting Jailbreak Attacks against Large Vision-Language Models via Monitoring Hidden States | Feb 20, 2025 | | CodeCode Available | 2 |
| GiGL: Large-Scale Graph Neural Networks at Snapchat | Feb 20, 2025 | Graph Learning | CodeCode Available | 2 |
| FetalCLIP: A Visual-Language Foundation Model for Fetal Ultrasound Image Analysis | Feb 20, 2025 | Age EstimationBenchmarking | CodeCode Available | 2 |
| AlphaMaze: Enhancing Large Language Models' Spatial Intelligence via GRPO | Feb 20, 2025 | Autonomous NavigationNavigate | CodeCode Available | 2 |
| Fast and Accurate Blind Flexible Docking | Feb 20, 2025 | Blind DockingComputational Efficiency | CodeCode Available | 2 |
| Optimizing Model Selection for Compound AI Systems | Feb 20, 2025 | modelModel Selection | CodeCode Available | 2 |
| OBELiX: A Curated Dataset of Crystal Structures and Experimentally Measured Ionic Conductivities for Lithium Solid-State Electrolytes | Feb 20, 2025 | | CodeCode Available | 2 |
| A Survey on Data Contamination for Large Language Models | Feb 20, 2025 | SurveyText Generation | CodeCode Available | 2 |
| Risk-mediated dynamic regulation of effective contacts de-synchronizes outbreaks in metapopulation epidemic models | Feb 20, 2025 | | CodeCode Available | 2 |
| Medical Image Classification with KAN-Integrated Transformers and Dilated Neighborhood Attention | Feb 19, 2025 | image-classificationImage Classification | CodeCode Available | 2 |
| Calibration and Option Pricing with Stochastic Volatility and Double Exponential Jumps | Feb 19, 2025 | ArticlesEconometrics | CodeCode Available | 2 |
| Repo2Run: Automated Building Executable Environment for Code Repository at Scale | Feb 19, 2025 | | CodeCode Available | 2 |
| Smaller But Better: Unifying Layout Generation with Smaller Large Language Models | Feb 19, 2025 | Layout Generation | CodeCode Available | 2 |
| SIFT: Grounding LLM Reasoning in Contexts via Stickers | Feb 19, 2025 | GSM8KMath | CodeCode Available | 2 |
| MoM: Linear Sequence Modeling with Mixture-of-Memories | Feb 19, 2025 | | CodeCode Available | 2 |
| TESS 2: A Large-Scale Generalist Diffusion Language Model | Feb 19, 2025 | Instruction FollowingLanguage Modeling | CodeCode Available | 2 |
| Geolocation with Real Human Gameplay Data: A Large-Scale Dataset and Human-Like Reasoning Framework | Feb 19, 2025 | | CodeCode Available | 2 |
| Refining Sentence Embedding Model through Ranking Sentences Generation with Large Language Models | Feb 19, 2025 | Contrastive LearningSentence | CodeCode Available | 2 |
| Event-Based Video Frame Interpolation With Cross-Modal Asymmetric Bidirectional Motion Fields | Feb 19, 2025 | Video Frame Interpolation | CodeCode Available | 2 |
| DataSciBench: An LLM Agent Benchmark for Data Science | Feb 19, 2025 | Code GenerationLarge Language Model | CodeCode Available | 2 |
| Train Small, Infer Large: Memory-Efficient LoRA Training for Large Language Models | Feb 19, 2025 | GPUQuantization | CodeCode Available | 2 |
| JL1-CD: A New Benchmark for Remote Sensing Change Detection and a Robust Multi-Teacher Knowledge Distillation Framework | Feb 19, 2025 | Change DetectionEarth Observation | CodeCode Available | 2 |
| Helix-mRNA: A Hybrid Foundation Model For Full Sequence mRNA Therapeutics | Feb 19, 2025 | | CodeCode Available | 2 |
| DAMamba: Vision State Space Model with Dynamic Adaptive Scan | Feb 18, 2025 | image-classificationImage Classification | CodeCode Available | 2 |
| HeadInfer: Memory-Efficient LLM Inference by Head-wise Offloading | Feb 18, 2025 | Computational EfficiencyCPU | CodeCode Available | 2 |
| NExT-Mol: 3D Diffusion Meets 1D Language Modeling for 3D Molecule Generation | Feb 18, 2025 | 3D Generation3D Molecule Generation | CodeCode Available | 2 |
| Multimodal Mamba: Decoder-only Multimodal State Space Model via Quadratic to Linear Distillation | Feb 18, 2025 | DecoderGPU | CodeCode Available | 2 |
| A Machine Learning Approach That Beats Large Rubik's Cubes | Feb 18, 2025 | Rubik's Cube | CodeCode Available | 2 |
| Electron flow matching for generative reaction mechanism prediction obeying conservation laws | Feb 18, 2025 | Prediction | CodeCode Available | 2 |
| CHATS: Combining Human-Aligned Optimization and Test-Time Sampling for Text-to-Image Generation | Feb 18, 2025 | Image GenerationText to Image Generation | CodeCode Available | 2 |
| VUS: Effective and Efficient Accuracy Measures for Time-Series Anomaly Detection | Feb 18, 2025 | Anomaly DetectionInformation Retrieval | CodeCode Available | 2 |
| MotifBench: A standardized protein design benchmark for motif-scaffolding problems | Feb 18, 2025 | Protein DesignProtein Structure Prediction | CodeCode Available | 2 |