| CHORDONOMICON: A Dataset of 666,000 Songs and their Chord Progressions | Oct 29, 2024 | | Code Available | 2 | 5 |
| OccNeRF: Advancing 3D Occupancy Prediction in LiDAR-Free Environments | Dec 14, 2023 | Autonomous Driving, Depth Estimation | Code Available | 2 | 5 |
| Commit0: Library Generation from Scratch | Dec 2, 2024 | Benchmarking, Code Generation | Code Available | 2 | 5 |
| Reference-based Image and Video Super-Resolution via C2-Matching | Dec 19, 2022 | Image Super-Resolution, Reference-based Super-Resolution | Code Available | 2 | 5 |
| Authorship Obfuscation in Multilingual Machine-Generated Text Detection | Jan 15, 2024 | Adversarial Robustness, Benchmarking | Code Available | 2 | 5 |
| CrossEarth: Geospatial Vision Foundation Model for Domain Generalizable Remote Sensing Semantic Segmentation | Oct 30, 2024 | Domain Adaptation, Domain Generalization | Code Available | 2 | 5 |
| DanceCamera3D: 3D Camera Movement Synthesis with Music and Dance | Mar 20, 2024 | | Code Available | 2 | 5 |
| Fast Adversarial Attacks on Language Models In One GPU Minute | Feb 23, 2024 | Adversarial Attack, Computational Efficiency | Code Available | 2 | 5 |
| OpenOccupancy: A Large Scale Benchmark for Surrounding Semantic Occupancy Perception | Mar 7, 2023 | Autonomous Driving, Benchmarking | Code Available | 2 | 5 |
| PresentAgent: Multimodal Agent for Presentation Video Generation | Jul 5, 2025 | text-to-speech, Text to Speech | Code Available | 2 | 5 |
| Fact Finder -- Enhancing Domain Expertise of Large Language Models by Incorporating Knowledge Graphs | Aug 6, 2024 | Knowledge Graphs, Natural Language Queries | Code Available | 2 | 5 |
| RAS: Retrieval-And-Structuring for Knowledge-Intensive LLM Generation | Feb 16, 2025 | graph construction, Knowledge Graphs | Code Available | 2 | 5 |
| MUST: The First Dataset and Unified Framework for Multispectral UAV Single Object Tracking | Mar 22, 2025 | Object Tracking | Code Available | 2 | 5 |
| Coswara: A respiratory sounds and symptoms dataset for remote screening of SARS-CoV-2 infection | May 22, 2023 | Fairness | Code Available | 2 | 5 |
| PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training | Sep 19, 2023 | 2k, Position | Code Available | 2 | 5 |
| Investigating Deep Learning Benchmarks for Electrocardiography Signal Processing | Apr 9, 2022 | Atrial Fibrillation Detection, Deep Learning | Code Available | 2 | 5 |
| Airavata: Introducing Hindi Instruction-tuned LLM | Jan 26, 2024 | | Code Available | 2 | 5 |
| Deeply Optimizing the SAT Solver for the IC3 Algorithm | Jan 24, 2025 | | Code Available | 2 | 5 |
| NeuFlow: Real-time, High-accuracy Optical Flow Estimation on Robots Using Edge Devices | Mar 15, 2024 | Activity Recognition, Edge-computing | Code Available | 2 | 5 |
| SpikeGPT: Generative Pre-trained Language Model with Spiking Neural Networks | Feb 27, 2023 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| Relay Diffusion: Unifying diffusion process across resolutions for image synthesis | Sep 4, 2023 | Image Generation | Code Available | 2 | 5 |
| FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions | Mar 22, 2024 | Information Retrieval, Retrieval | Code Available | 2 | 5 |
| Learning Dynamic Tetrahedra for High-Quality Talking Head Synthesis | Feb 27, 2024 | NeRF | Code Available | 2 | 5 |
| RevSAM2: Prompt SAM2 for Medical Image Segmentation via Reverse-Propagation without Fine-tuning | Sep 6, 2024 | Image Segmentation, Medical Image Segmentation | Code Available | 2 | 5 |
| IBSEN: Director-Actor Agent Collaboration for Controllable and Interactive Drama Script Generation | Jul 1, 2024 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| Z1: Efficient Test-time Scaling with Code | Apr 1, 2025 | | Code Available | 2 | 5 |
| T-Eval: Evaluating the Tool Utilization Capability of Large Language Models Step by Step | Dec 21, 2023 | Instruction Following, Retrieval | Code Available | 2 | 5 |
| Time Series Diffusion in the Frequency Domain | Feb 8, 2024 | Denoising, Inductive Bias | Code Available | 2 | 5 |
| CheXGenBench: A Unified Benchmark For Fidelity, Privacy and Utility of Synthetic Chest Radiographs | May 15, 2025 | Conditional Text-to-Image Synthesis | Code Available | 2 | 5 |
| CLIP-Driven Universal Model for Organ Segmentation and Tumor Detection | Jan 2, 2023 | Organ Segmentation, Segmentation | Code Available | 2 | 5 |
| Scene as Occupancy | Jun 5, 2023 | Decoder, Motion Planning | Code Available | 2 | 5 |
| Depicting Beyond Scores: Advancing Image Quality Assessment through Multi-modal Language Models | Dec 14, 2023 | Descriptive, Image Quality Assessment | Code Available | 2 | 5 |
| VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selection | Nov 22, 2024 | Question Answering, Video Question Answering | Code Available | 2 | 5 |
| Exploring Color Invariance through Image-Level Ensemble Learning | Jan 19, 2024 | Data Augmentation, Ensemble Learning | Code Available | 2 | 5 |
| Physics-based battery model parametrisation from impedance data | Dec 14, 2024 | | Code Available | 2 | 5 |
| CharaConsist: Fine-Grained Consistent Character Generation | Jul 15, 2025 | Consistent Character Generation, Image Generation | Code Available | 2 | 5 |
| AssistanceZero: Scalably Solving Assistance Games | Apr 9, 2025 | Imitation Learning, Minecraft | Code Available | 2 | 5 |
| Dispider: Enabling Video LLMs with Active Real-Time Interaction via Disentangled Perception, Decision, and Reaction | Jan 6, 2025 | | Code Available | 2 | 5 |
| OMNI-DC: Highly Robust Depth Completion with Multiresolution Depth Integration | Nov 28, 2024 | Depth Completion | Code Available | 2 | 5 |
| Lyapunov-stable Neural Control for State and Output Feedback: A Novel Formulation | Apr 11, 2024 | | Code Available | 2 | 5 |
| Pedestrian Detection: Domain Generalization, CNNs, Transformers and Beyond | Jan 10, 2022 | Attribute, Autonomous Driving | Code Available | 2 | 5 |
| DALK: Dynamic Co-Augmentation of LLMs and KG to answer Alzheimer's Disease Questions with Scientific Literature | May 8, 2024 | Question Answering | Code Available | 2 | 5 |
| Initialization using Update Approximation is a Silver Bullet for Extremely Efficient Low-Rank Fine-Tuning | Nov 29, 2024 | Mathematical Reasoning | Code Available | 2 | 5 |
| Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation | Jun 11, 2024 | | Code Available | 2 | 5 |
| FairDiff: Fair Segmentation with Point-Image Diffusion | Jul 8, 2024 | Fairness, Image Generation | Code Available | 2 | 5 |
| Meta-DETR: Image-Level Few-Shot Detection with Inter-Class Correlation Exploitation | Jul 30, 2022 | Few-Shot Object Detection, Meta-Learning | Code Available | 2 | 5 |
| Closed-Loop Supervised Fine-Tuning of Tokenized Traffic Models | Dec 5, 2024 | | Code Available | 2 | 5 |
| AnySR: Realizing Image Super-Resolution as Any-Scale, Any-Resource | Jul 5, 2024 | Image Super-Resolution, Super-Resolution | Code Available | 2 | 5 |
| Enabling Efficient Equivariant Operations in the Fourier Basis via Gaunt Tensor Products | Jan 18, 2024 | | Code Available | 2 | 5 |
| Playable Game Generation | Dec 1, 2024 | GPU, Image Generation | Code Available | 2 | 5 |