| NeRFusion: Fusing Radiance Fields for Large-Scale Scene Reconstruction | Mar 21, 2022 | 3D ReconstructionNeRF | CodeCode Available | 2 | 5 |
| CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching | Apr 4, 2024 | AttributeImage Captioning | CodeCode Available | 2 | 5 |
| Pantograph: A Machine-to-Machine Interaction Interface for Advanced Theorem Proving, High Level Reasoning, and Data Extraction in Lean 4 | Oct 21, 2024 | Automated Theorem Proving | CodeCode Available | 2 | 5 |
| FluidLab: A Differentiable Environment for Benchmarking Complex Fluid Manipulation | Mar 4, 2023 | BenchmarkingGPU | CodeCode Available | 2 | 5 |
| BEFUnet: A Hybrid CNN-Transformer Architecture for Precise Medical Image Segmentation | Feb 13, 2024 | Image SegmentationMedical Image Segmentation | CodeCode Available | 2 | 5 |
| EnCLAP: Combining Neural Audio Codec and Audio-Text Joint Embedding for Automated Audio Captioning | Jan 31, 2024 | AudioCapsAudio captioning | CodeCode Available | 2 | 5 |
| Towards Effective Multiple-in-One Image Restoration: A Sequential and Prompt Learning Strategy | Jan 7, 2024 | Image RestorationPrompt Learning | CodeCode Available | 2 | 5 |
| Dense Optical Tracking: Connecting the Dots | Dec 1, 2023 | Optical Flow EstimationPoint Tracking | CodeCode Available | 2 | 5 |
| Fast Inner-Product Algorithms and Architectures for Deep Neural Network Accelerators | Nov 20, 2023 | | CodeCode Available | 2 | 5 |
| YOLO-Pose: Enhancing YOLO for Multi Person Pose Estimation Using Object Keypoint Similarity Loss | Apr 14, 2022 | Multi-Person Pose Estimationobject-detection | CodeCode Available | 2 | 5 |
| Deep Learning for Camera Calibration and Beyond: A Survey | Mar 19, 2023 | Camera CalibrationDeep Learning | CodeCode Available | 2 | 5 |
| BEVDepth: Acquisition of Reliable Depth for Multi-view 3D Object Detection | Jun 21, 2022 | 3D Object DetectionDepth Estimation | CodeCode Available | 2 | 5 |
| SocioVerse: A World Model for Social Simulation Powered by LLM Agents and A Pool of 10 Million Real-World Users | Apr 14, 2025 | DiversityFace Alignment | CodeCode Available | 2 | 5 |
| CritiqueLLM: Towards an Informative Critique Generation Model for Evaluation of Large Language Model Generation | Nov 30, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| Human-Art: A Versatile Human-Centric Dataset Bridging Natural and Artificial Scenes | Mar 5, 2023 | 3D Human Pose EstimationHuman Detection | CodeCode Available | 2 | 5 |
| SpQR: A Sparse-Quantized Representation for Near-Lossless LLM Weight Compression | Jun 5, 2023 | GPULanguage Modelling | CodeCode Available | 2 | 5 |
| VRSBench: A Versatile Vision-Language Benchmark Dataset for Remote Sensing Image Understanding | Jun 18, 2024 | Image CaptioningQuestion Answering | CodeCode Available | 2 | 5 |
| Structured Attention Composition for Temporal Action Localization | May 20, 2022 | Action DetectionAction Localization | CodeCode Available | 2 | 5 |
| DetectGPT: Zero-Shot Machine-Generated Text Detection using Probability Curvature | Jan 26, 2023 | ArticlesLanguage Modelling | CodeCode Available | 2 | 5 |
| ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation | Feb 27, 2023 | Image GenerationText to Image Generation | CodeCode Available | 2 | 5 |
| Uni-Perceiver-MoE: Learning Sparse Generalist Models with Conditional MoEs | Jun 9, 2022 | Image CaptioningImage Classification | CodeCode Available | 2 | 5 |
| ImMesh: An Immediate LiDAR Localization and Meshing Framework | Jan 12, 2023 | CPUDimensionality Reduction | CodeCode Available | 2 | 5 |
| MidiCaps: A large-scale MIDI dataset with text captions | Jun 4, 2024 | Information RetrievalMusic Information Retrieval | CodeCode Available | 2 | 5 |
| MACRec: a Multi-Agent Collaboration Framework for Recommendation | Feb 23, 2024 | Conversational RecommendationDecision Making | CodeCode Available | 2 | 5 |
| Content-Style Decoupling for Unsupervised Makeup Transfer without Generating Pseudo Ground Truth | May 27, 2024 | | CodeCode Available | 2 | 5 |