| Leveraging Passage Embeddings for Efficient Listwise Reranking with Large Language Models | Jun 21, 2024 | Learning-To-RankPassage Ranking | CodeCode Available | 2 |
| DsDm: Model-Aware Dataset Selection with Datamodels | Jan 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| CLIP Itself is a Strong Fine-tuner: Achieving 85.7% and 88.0% Top-1 Accuracy with ViT-B and ViT-L on ImageNet | Dec 12, 2022 | | CodeCode Available | 2 |
| Benchmarking Laparoscopic Surgical Image Restoration and Beyond | May 25, 2025 | BenchmarkingImage Restoration | CodeCode Available | 2 |
| ULIP: Learning a Unified Representation of Language, Images, and Point Clouds for 3D Understanding | Dec 10, 2022 | 3D Architecture3D Classification | CodeCode Available | 2 |
| Monocular, One-stage, Regression of Multiple 3D People | Aug 27, 2020 | 3D Depth Estimation3D Human Pose Estimation | CodeCode Available | 2 |
| Giraffe: Adventures in Expanding Context Lengths in LLMs | Aug 21, 2023 | 16k4k | CodeCode Available | 2 |
| Effect of Choosing Loss Function when Using T-batching for Representation Learning on Dynamic Networks | Aug 13, 2023 | Graph Representation LearningLink Prediction | CodeCode Available | 2 |
| Prompt Pre-Training with Twenty-Thousand Classes for Open-Vocabulary Visual Recognition | Apr 10, 2023 | image-classificationImage Classification | CodeCode Available | 2 |
| What is a Goldilocks Face Verification Test Set? | May 24, 2024 | Face RecognitionFace Verification | CodeCode Available | 2 |
| DiffArtist: Towards Structure and Appearance Controllable Image Stylization | Jul 22, 2024 | DisentanglementImage Stylization | CodeCode Available | 2 |
| Structure-Aligned Protein Language Model | May 22, 2025 | Contrastive LearningLanguage Modeling | CodeCode Available | 2 |
| Detecting music deepfakes is easy but actually hard | May 7, 2024 | DeepFake DetectionFace Swapping | CodeCode Available | 2 |
| Denoising Diffusion Bridge Models | Sep 29, 2023 | DenoisingImage Generation | CodeCode Available | 2 |
| Test-time Alignment of Diffusion Models without Reward Over-optimization | Jan 10, 2025 | Diversity | CodeCode Available | 2 |
| Differentially Private Synthetic Data via APIs 3: Using Simulators Instead of Foundation Model | Feb 8, 2025 | Image Generation | CodeCode Available | 2 |
| AutoML-Agent: A Multi-Agent LLM Framework for Full-Pipeline AutoML | Oct 3, 2024 | AutoMLCode Generation | CodeCode Available | 2 |
| Towards Collaborative Autonomous Driving: Simulation Platform and End-to-End System | Apr 15, 2024 | Autonomous Driving | CodeCode Available | 2 |
| ReservoirComputing.jl: An Efficient and Modular Library for Reservoir Computing Models | Apr 8, 2022 | | CodeCode Available | 2 |
| LoRA-Ensemble: Efficient Uncertainty Modelling for Self-attention Networks | May 23, 2024 | Decision Making | CodeCode Available | 2 |
| "I'm sorry to hear that": Finding New Biases in Language Models with a Holistic Descriptor Dataset | May 18, 2022 | Sentence | CodeCode Available | 2 |
| Music Understanding LLaMA: Advancing Text-to-Music Generation with Question Answering and Captioning | Aug 22, 2023 | Caption GenerationLarge Language Model | CodeCode Available | 2 |
| Large Language Models on Graphs: A Comprehensive Survey | Dec 5, 2023 | Language ModellingSurvey | CodeCode Available | 2 |
| SegEarth-R1: Geospatial Pixel Reasoning via Large Language Model | Apr 13, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Demonstration-Guided Reinforcement Learning with Efficient Exploration for Task Automation of Surgical Robot | Feb 20, 2023 | Efficient Explorationreinforcement-learning | CodeCode Available | 2 |
| Towards Better Dynamic Graph Learning: New Architecture and Unified Library | Mar 23, 2023 | Dynamic Link PredictionDynamic Node Classification | CodeCode Available | 2 |
| City3D: Large-Scale Building Reconstruction from Airborne LiDAR Point Clouds | Jan 25, 2022 | Surface Reconstruction | CodeCode Available | 2 |
| Fast-DDPM: Fast Denoising Diffusion Probabilistic Models for Medical Image-to-Image Generation | May 23, 2024 | DenoisingImage Denoising | CodeCode Available | 2 |
| GenRL: Multimodal-foundation world models for generalization in embodied agents | Jun 26, 2024 | BenchmarkingReinforcement Learning (RL) | CodeCode Available | 2 |
| Free Video-LLM: Prompt-guided Visual Perception for Efficient Training-free Video LLMs | Oct 14, 2024 | Computational EfficiencyQuestion Answering | CodeCode Available | 2 |
| Towards Lightweight Super-Resolution with Dual Regression Learning | Jul 16, 2022 | Image Super-ResolutionModel Compression | CodeCode Available | 2 |
| Scale Decoupled Distillation | Mar 20, 2024 | Knowledge Distillation | CodeCode Available | 2 |
| MedVAE: Efficient Automated Interpretation of Medical Images with Large-Scale Generalizable Autoencoders | Feb 20, 2025 | Computational Efficiency | CodeCode Available | 2 |
| Explicit Visual Prompting for Low-Level Structure Segmentations | Mar 20, 2023 | Camouflaged Object SegmentationDefocus Blur Detection | CodeCode Available | 2 |
| Tree-Ring Watermarks: Fingerprints for Diffusion Images that are Invisible and Robust | May 31, 2023 | Image Generation | CodeCode Available | 2 |
| Omni Aggregation Networks for Lightweight Image Super-Resolution | Apr 20, 2023 | Image Super-ResolutionSuper-Resolution | CodeCode Available | 2 |
| You Only Look at Once for Real-time and Generic Multi-Task | Oct 2, 2023 | Autonomous DrivingDrivable Area Detection | CodeCode Available | 2 |
| Domino: Discovering Systematic Errors with Cross-Modal Embeddings | Mar 24, 2022 | Representation LearningSlice Discovery | CodeCode Available | 2 |
| h-Edit: Effective and Flexible Diffusion-Based Editing via Doob's h-Transform | Mar 4, 2025 | | CodeCode Available | 2 |
| InterFusion: Text-Driven Generation of 3D Human-Object Interaction | Mar 22, 2024 | 3D Generationglobal-optimization | CodeCode Available | 2 |
| SystolicAttention: Fusing FlashAttention within a Single Systolic Array | Jul 15, 2025 | Scheduling | CodeCode Available | 2 |
| TAB: Unified Benchmarking of Time Series Anomaly Detection Methods | Jun 22, 2025 | Anomaly DetectionBenchmarking | CodeCode Available | 2 |
| Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts | Oct 7, 2022 | ArticlesLanguage Modeling | CodeCode Available | 2 |
| AdaReTaKe: Adaptive Redundancy Reduction to Perceive Longer for Video-language Understanding | Mar 16, 2025 | Video Understanding | CodeCode Available | 2 |
| RAM: Retrieval-Based Affordance Transfer for Generalizable Zero-Shot Robotic Manipulation | Jul 5, 2024 | Human-Object Interaction DetectionRetrieval | CodeCode Available | 2 |
| Exploring Diffusion Transformer Designs via Grafting | Jun 5, 2025 | | CodeCode Available | 2 |
| LinkAlign: Scalable Schema Linking for Real-World Large-Scale Multi-Database Text-to-SQL | Mar 24, 2025 | RetrievalText to SQL | CodeCode Available | 2 |
| Wildfire Smoke Detection with Computer Vision | Jan 12, 2023 | Object Detection | CodeCode Available | 2 |
| Top Leaderboard Ranking = Top Coding Proficiency, Always? EvoEval: Evolving Coding Benchmarks via LLM | Mar 28, 2024 | Code GenerationHumanEval | CodeCode Available | 2 |
| Process Reward Model with Q-Value Rankings | Oct 15, 2024 | Decision MakingLanguage Modeling | CodeCode Available | 2 |