| HoGS: Unified Near and Far Object Reconstruction via Homogeneous Gaussian Splatting | Mar 25, 2025 | 3DGS, Novel View Synthesis | Code Available | 2 | 5 |
| LifelongAgentBench: Evaluating LLM Agents as Lifelong Learners | May 17, 2025 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| A Survey of Generative AI for de novo Drug Design: New Frontiers in Molecule and Protein Generation | Feb 13, 2024 | Drug Design | Code Available | 2 | 5 |
| JailbreakZoo: Survey, Landscapes, and Horizons in Jailbreaking Large Language and Vision-Language Models | Jun 26, 2024 | LLM Jailbreak, Survey | Code Available | 2 | 5 |
| ReMA: Learning to Meta-think for LLMs with Multi-Agent Reinforcement Learning | Mar 12, 2025 | Multi-agent Reinforcement Learning, reinforcement-learning | Code Available | 2 | 5 |
| Context and Geometry Aware Voxel Transformer for Semantic Scene Completion | May 22, 2024 | 3D Semantic Scene Completion from a single RGB image | Code Available | 2 | 5 |
| An Item is Worth a Prompt: Versatile Image Editing with Disentangled Control | Mar 7, 2024 | Descriptive | Code Available | 2 | 5 |
| Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP | Oct 9, 2022 | Image Captioning, Open Vocabulary Semantic Segmentation | Code Available | 2 | 5 |
| Objaverse++: Curated 3D Object Dataset with Quality Annotations | Apr 9, 2025 | 3D Generation, Attribute | Code Available | 2 | 5 |
| Depth-Aware Video Frame Interpolation | Apr 1, 2019 | Optical Flow Estimation, Video Frame Interpolation | Code Available | 2 | 5 |
| Dynamic-LLaVA: Efficient Multimodal Large Language Models via Dynamic Vision-language Context Sparsification | Dec 1, 2024 | GPU, Visual Question Answering | Code Available | 2 | 5 |
| Mining Error Templates for Grammatical Error Correction | Jun 23, 2022 | Grammatical Error Correction, Language Modeling | Code Available | 2 | 5 |
| Collaborative Expert LLMs Guided Multi-Objective Molecular Optimization | Mar 5, 2025 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| I^2-World: Intra-Inter Tokenization for Efficient Dynamic 4D Scene Forecasting | Jul 12, 2025 | Autonomous Driving, Computational Efficiency | Code Available | 2 | 5 |
| MONAI Label: A framework for AI-assisted Interactive Labeling of 3D Medical Images | Mar 23, 2022 | Active Learning | Code Available | 2 | 5 |
| VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment | Oct 2, 2024 | GSM8K, Math | Code Available | 2 | 5 |
| Self-Supervised Learning for Recommender Systems: A Survey | Mar 29, 2022 | Recommendation Systems, Self-Supervised Learning | Code Available | 2 | 5 |
| Demystifying and Enhancing the Efficiency of Large Language Model Based Search Agents | May 17, 2025 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| Topological Deep Learning: Going Beyond Graph Data | Jun 1, 2022 | Deep Learning, Graph Learning | Code Available | 2 | 5 |
| Bits-to-Photon: End-to-End Learned Scalable Point Cloud Compression for Direct Rendering | Jun 9, 2024 | Decoder | Code Available | 2 | 5 |
| Traj-LO: In Defense of LiDAR-Only Odometry Using an Effective Continuous-Time Trajectory | Sep 25, 2023 | | Code Available | 2 | 5 |
| DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception | Jul 11, 2024 | Visual Question Answering | Code Available | 2 | 5 |
| GAOKAO-Eval: Does high scores truly reflect strong capabilities in LLMs? | Dec 13, 2024 | | Code Available | 2 | 5 |
| SAFIRE: Segment Any Forged Image Region | Dec 11, 2024 | | Code Available | 2 | 5 |
| TweetNLP: Cutting-Edge Natural Language Processing for Social Media | Jun 29, 2022 | Language Identification, Named Entity Recognition | Code Available | 2 | 5 |
| ROS-SAM: High-Quality Interactive Segmentation for Remote Sensing Moving Object | Mar 15, 2025 | Domain Adaptation, Interactive Segmentation | Code Available | 2 | 5 |
| Essential-Web v1.0: 24T tokens of organized web data | Jun 17, 2025 | Math | Code Available | 2 | 5 |
| Compressing Context to Enhance Inference Efficiency of Large Language Models | Oct 9, 2023 | Articles, Question Answering | Code Available | 2 | 5 |
| Video Compression for Spatiotemporal Earth System Data | Jun 24, 2025 | Earth Observation, Video Compression | Code Available | 2 | 5 |
| YOLO11-JDE: Fast and Accurate Multi-Object Tracking with Self-Supervised Re-ID | Jan 23, 2025 | Multi-Object Tracking, object-detection | Code Available | 2 | 5 |
| WenetSpeech4TTS: A 12,800-hour Mandarin TTS Corpus for Large Speech Generation Model Benchmark | Jun 9, 2024 | text-to-speech, Text to Speech | Code Available | 2 | 5 |
| OmniEvent: A Comprehensive, Fair, and Easy-to-Use Toolkit for Event Understanding | Sep 25, 2023 | Event Argument Extraction, Event Detection | Code Available | 2 | 5 |
| Morphable Diffusion: 3D-Consistent Diffusion for Single-image Avatar Creation | Jan 9, 2024 | Novel View Synthesis | Code Available | 2 | 5 |
| FlowReasoner: Reinforcing Query-Level Meta-Agents | Apr 21, 2025 | Reinforcement Learning (RL) | Code Available | 2 | 5 |
| SeaBird: Segmentation in Bird's View with Dice Loss Improves Monocular 3D Detection of Large Objects | Mar 29, 2024 | 3D Object Detection, 3D Object Detection From Monocular Images | Code Available | 2 | 5 |
| VideoINR: Learning Video Implicit Neural Representation for Continuous Space-Time Super-Resolution | Jun 9, 2022 | Space-time Video Super-resolution, Super-Resolution | Code Available | 2 | 5 |
| Low-resource finetuning of foundation models beats state-of-the-art in histopathology | Jan 9, 2024 | GPU, Self-Supervised Learning | Code Available | 2 | 5 |
| Sat2lod2: A Software For Automated Lod-2 Modeling From Satellite-Derived Orthophoto And Digital Surface Model | Apr 8, 2022 | Semantic Segmentation | Code Available | 2 | 5 |
| Breaking the Ceiling of the LLM Community by Treating Token Generation as a Classification for Ensembling | Jun 18, 2024 | Arithmetic Reasoning, Language Modeling | Code Available | 2 | 5 |
| AI-Face: A Million-Scale Demographically Annotated AI-Generated Face Dataset and Fairness Benchmark | Jun 2, 2024 | Face Swapping, Fairness | Code Available | 2 | 5 |
| Next3D: Generative Neural Texture Rasterization for 3D-Aware Head Avatars | Nov 21, 2022 | Face Model | Code Available | 2 | 5 |
| MCANet: Medical Image Segmentation with Multi-Scale Cross-Axis Attention | Dec 14, 2023 | Image Segmentation, Lesion Segmentation | Code Available | 2 | 5 |
| ResAD: A Simple Framework for Class Generalizable Anomaly Detection | Oct 26, 2024 | Anomaly Detection | Code Available | 2 | 5 |
| RENO: Real-Time Neural Compression for 3D LiDAR Point Clouds | Mar 16, 2025 | GPU | Code Available | 2 | 5 |
| AEROBLADE: Training-Free Detection of Latent Diffusion Images Using Autoencoder Reconstruction Error | Jan 31, 2024 | Denoising | Code Available | 2 | 5 |
| A Hierarchical Representation Network for Accurate and Detailed Face Reconstruction from In-The-Wild Images | Feb 28, 2023 | 3D Face Reconstruction, Disentanglement | Code Available | 2 | 5 |
| Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection | Jun 14, 2024 | Decoder, speech-recognition | Code Available | 2 | 5 |
| Fréchet Video Motion Distance: A Metric for Evaluating Motion Consistency in Videos | Jul 23, 2024 | Image Generation, Point Tracking | Code Available | 2 | 5 |
| Preserving Fairness Generalization in Deepfake Detection | Feb 27, 2024 | DeepFake Detection, Disentanglement | Code Available | 2 | 5 |
| Merging Context Clustering with Visual State Space Models for Medical Image Segmentation | Jan 3, 2025 | Clustering, Image Segmentation | Code Available | 2 | 5 |