| SciInstruct: a Self-Reflective Instruction Annotated Dataset for Training Scientific Language Models | Jan 15, 2024 | Math, Mathematical Reasoning | Code Available | 2 | 5 |
| QFFT, Question-Free Fine-Tuning for Adaptive Reasoning | Jun 15, 2025 | | Code Available | 2 | 5 |
| NeRFusion: Fusing Radiance Fields for Large-Scale Scene Reconstruction | Mar 21, 2022 | 3D Reconstruction, NeRF | Code Available | 2 | 5 |
| CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching | Apr 4, 2024 | Attribute, Image Captioning | Code Available | 2 | 5 |
| Pantograph: A Machine-to-Machine Interaction Interface for Advanced Theorem Proving, High Level Reasoning, and Data Extraction in Lean 4 | Oct 21, 2024 | Automated Theorem Proving | Code Available | 2 | 5 |
| FluidLab: A Differentiable Environment for Benchmarking Complex Fluid Manipulation | Mar 4, 2023 | Benchmarking, GPU | Code Available | 2 | 5 |
| BEFUnet: A Hybrid CNN-Transformer Architecture for Precise Medical Image Segmentation | Feb 13, 2024 | Image Segmentation, Medical Image Segmentation | Code Available | 2 | 5 |
| EnCLAP: Combining Neural Audio Codec and Audio-Text Joint Embedding for Automated Audio Captioning | Jan 31, 2024 | AudioCaps, Audio captioning | Code Available | 2 | 5 |
| Towards Effective Multiple-in-One Image Restoration: A Sequential and Prompt Learning Strategy | Jan 7, 2024 | Image Restoration, Prompt Learning | Code Available | 2 | 5 |
| Dense Optical Tracking: Connecting the Dots | Dec 1, 2023 | Optical Flow Estimation, Point Tracking | Code Available | 2 | 5 |
| Fast Inner-Product Algorithms and Architectures for Deep Neural Network Accelerators | Nov 20, 2023 | | Code Available | 2 | 5 |
| YOLO-Pose: Enhancing YOLO for Multi Person Pose Estimation Using Object Keypoint Similarity Loss | Apr 14, 2022 | Multi-Person Pose Estimation, object-detection | Code Available | 2 | 5 |
| Deep Learning for Camera Calibration and Beyond: A Survey | Mar 19, 2023 | Camera Calibration, Deep Learning | Code Available | 2 | 5 |
| BEVDepth: Acquisition of Reliable Depth for Multi-view 3D Object Detection | Jun 21, 2022 | 3D Object Detection, Depth Estimation | Code Available | 2 | 5 |
| SocioVerse: A World Model for Social Simulation Powered by LLM Agents and A Pool of 10 Million Real-World Users | Apr 14, 2025 | Diversity, Face Alignment | Code Available | 2 | 5 |
| CritiqueLLM: Towards an Informative Critique Generation Model for Evaluation of Large Language Model Generation | Nov 30, 2023 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| Human-Art: A Versatile Human-Centric Dataset Bridging Natural and Artificial Scenes | Mar 5, 2023 | 3D Human Pose Estimation, Human Detection | Code Available | 2 | 5 |
| SpQR: A Sparse-Quantized Representation for Near-Lossless LLM Weight Compression | Jun 5, 2023 | GPU, Language Modelling | Code Available | 2 | 5 |
| VRSBench: A Versatile Vision-Language Benchmark Dataset for Remote Sensing Image Understanding | Jun 18, 2024 | Image Captioning, Question Answering | Code Available | 2 | 5 |
| Structured Attention Composition for Temporal Action Localization | May 20, 2022 | Action Detection, Action Localization | Code Available | 2 | 5 |
| DetectGPT: Zero-Shot Machine-Generated Text Detection using Probability Curvature | Jan 26, 2023 | Articles, Language Modelling | Code Available | 2 | 5 |
| ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation | Feb 27, 2023 | Image Generation, Text to Image Generation | Code Available | 2 | 5 |
| Uni-Perceiver-MoE: Learning Sparse Generalist Models with Conditional MoEs | Jun 9, 2022 | Image Captioning, Image Classification | Code Available | 2 | 5 |
| ImMesh: An Immediate LiDAR Localization and Meshing Framework | Jan 12, 2023 | CPU, Dimensionality Reduction | Code Available | 2 | 5 |
| MidiCaps: A large-scale MIDI dataset with text captions | Jun 4, 2024 | Information Retrieval, Music Information Retrieval | Code Available | 2 | 5 |
| MACRec: a Multi-Agent Collaboration Framework for Recommendation | Feb 23, 2024 | Conversational Recommendation, Decision Making | Code Available | 2 | 5 |
| Content-Style Decoupling for Unsupervised Makeup Transfer without Generating Pseudo Ground Truth | May 27, 2024 | | Code Available | 2 | 5 |
| Incremental Transformer Structure Enhanced Image Inpainting with Masking Positional Encoding | Mar 2, 2022 | Image Inpainting | Code Available | 2 | 5 |
| Integrating Reinforcement Learning with Foundation Models for Autonomous Robotics: Methods and Perspectives | Oct 21, 2024 | Reinforcement Learning (RL) | Code Available | 2 | 5 |
| Asynchronous RLHF: Faster and More Efficient Off-Policy RL for Language Models | Oct 23, 2024 | Instruction Following, Language Modelling | Code Available | 2 | 5 |
| Infinite Recommendation Networks: A Data-Centric Approach | Jun 3, 2022 | Information Retrieval, Recommendation Systems | Code Available | 2 | 5 |
| Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback | Apr 12, 2022 | Code Generation, Out of Distribution (OOD) Detection | Code Available | 2 | 5 |
| Efficient LLM Inference on CPUs | Nov 1, 2023 | Quantization | Code Available | 2 | 5 |
| Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations | Jun 9, 2022 | Benchmarking, continuous-control | Code Available | 2 | 5 |
| VMAS: A Vectorized Multi-Agent Simulator for Collective Robot Learning | Jul 7, 2022 | Benchmarking, Multi-agent Reinforcement Learning | Code Available | 2 | 5 |
| Democratizing Contrastive Language-Image Pre-training: A CLIP Benchmark of Data, Model, and Supervision | Mar 11, 2022 | | Code Available | 2 | 5 |
| Deep Learning Methods for Partial Differential Equations and Related Parameter Identification Problems | Dec 6, 2022 | Deep Learning | Code Available | 2 | 5 |
| Samba: Semantic Segmentation of Remotely Sensed Images with State Space Model | Apr 2, 2024 | Decoder, Mamba | Code Available | 2 | 5 |
| Arbitrary-Scale Point Cloud Upsampling by Voxel-Based Network with Latent Geometric-Consistent Learning | Mar 8, 2024 | point cloud upsampling | Code Available | 2 | 5 |
| Critique-out-Loud Reward Models | Aug 21, 2024 | Language Modelling, Large Language Model | Code Available | 2 | 5 |
| Low-light Image Enhancement via CLIP-Fourier Guided Wavelet Diffusion | Jan 8, 2024 | Image Enhancement, Low-Light Image Enhancement | Code Available | 2 | 5 |
| LLM-PBE: Assessing Data Privacy in Large Language Models | Aug 23, 2024 | | Code Available | 2 | 5 |
| Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them | Oct 17, 2022 | Language Modelling | Code Available | 2 | 5 |
| PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chain | Feb 21, 2024 | Autonomous Driving, Decision Making | Code Available | 2 | 5 |
| LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis | Dec 19, 2024 | Object | Code Available | 2 | 5 |
| MAT: Mask-Aware Transformer for Large Hole Image Inpainting | Mar 29, 2022 | Diversity, Image Inpainting | Code Available | 2 | 5 |
| MANIQA: Multi-dimension Attention Network for No-Reference Image Quality Assessment | Apr 19, 2022 | Image Quality Assessment, No-Reference Image Quality Assessment | Code Available | 2 | 5 |
| A Closer Look at Learned Optimization: Stability, Robustness, and Inductive Biases | Sep 22, 2022 | Inductive Bias | Code Available | 2 | 5 |
| Rec-R1: Bridging Generative Large Language Models and User-Centric Recommendation Systems via Reinforcement Learning | Mar 31, 2025 | General Reinforcement Learning, Instruction Following | Code Available | 2 | 5 |
| MultiZoo & MultiBench: A Standardized Toolkit for Multimodal Deep Learning | Jun 28, 2023 | Deep Learning, Multimodal Deep Learning | Code Available | 2 | 5 |