| BRIGHTER: BRIdging the Gap in Human-Annotated Textual Emotion Recognition Datasets for 28 Languages | Feb 17, 2025 | Emotion Recognition | CodeCode Available | 2 | 5 |
| Analyzing and Boosting the Power of Fine-Grained Visual Recognition for Multi-modal Large Language Models | Jan 25, 2025 | AttributeContrastive Learning | CodeCode Available | 2 | 5 |
| Source-free Subject Adaptation for EEG-based Visual Recognition | Jan 20, 2023 | EEGElectroencephalogram (EEG) | CodeCode Available | 2 | 5 |
| HiddenDetect: Detecting Jailbreak Attacks against Large Vision-Language Models via Monitoring Hidden States | Feb 20, 2025 | | CodeCode Available | 2 | 5 |
| Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy | Oct 13, 2024 | DenoisingPrediction | CodeCode Available | 2 | 5 |
| LayoutDiffusion: Controllable Diffusion Model for Layout-to-image Generation | Mar 30, 2023 | Image GenerationLayout-to-Image Generation | CodeCode Available | 2 | 5 |
| CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction | Oct 2, 2023 | image-classificationImage Classification | CodeCode Available | 2 | 5 |
| Order Constraints in Optimal Transport | Oct 14, 2021 | Natural Language Inference | CodeCode Available | 2 | 5 |
| Real-time Scene Text Detection with Differentiable Binarization | Nov 20, 2019 | BinarizationOptical Character Recognition (OCR) | CodeCode Available | 2 | 5 |
| An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale | Oct 22, 2020 | image-classificationSemantic Segmentation | CodeCode Available | 2 | 5 |
| VideoLifter: Lifting Videos to 3D with Fast Hierarchical Stereo Alignment | Jan 3, 2025 | Computational EfficiencyScene Understanding | CodeCode Available | 2 | 5 |
| Hopular: Modern Hopfield Networks for Tabular Data | Jun 1, 2022 | Deep LearningGeneral Classification | CodeCode Available | 2 | 5 |
| TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes | Mar 28, 2024 | 3D dense captioningDense Captioning | CodeCode Available | 2 | 5 |
| Improving the Training of Rectified Flows | May 30, 2024 | Image GenerationKnowledge Distillation | CodeCode Available | 2 | 5 |
| A Systematic Study of Joint Representation Learning on Protein Sequences and Structures | Mar 11, 2023 | Contrastive LearningProtein Function Prediction | CodeCode Available | 2 | 5 |
| Evaluating the Performance of Large Language Models on GAOKAO Benchmark | May 21, 2023 | | CodeCode Available | 2 | 5 |
| Vibe-Eval: A hard evaluation suite for measuring progress of multimodal language models | May 3, 2024 | | CodeCode Available | 2 | 5 |
| On the Origin of Llamas: Model Tree Heritage Recovery | May 28, 2024 | Authorship Attribution | CodeCode Available | 2 | 5 |
| GPT-NER: Named Entity Recognition via Large Language Models | Apr 20, 2023 | Hallucinationnamed-entity-recognition | CodeCode Available | 2 | 5 |
| Interpretable and Generalizable Graph Learning via Stochastic Attention Mechanism | Jan 31, 2022 | Graph Learning | CodeCode Available | 2 | 5 |
| AST-T5: Structure-Aware Pretraining for Code Generation and Understanding | Jan 5, 2024 | Code GenerationDecoder | CodeCode Available | 2 | 5 |
| NWPU-Crowd: A Large-Scale Benchmark for Crowd Counting and Localization | Jan 10, 2020 | Crowd Counting | CodeCode Available | 2 | 5 |
| RenderOcc: Vision-Centric 3D Occupancy Prediction with 2D Rendering Supervision | Sep 18, 2023 | Autonomous DrivingNeRF | CodeCode Available | 2 | 5 |
| MultiChallenge: A Realistic Multi-Turn Conversation Evaluation Benchmark Challenging to Frontier LLMs | Jan 29, 2025 | AllInstruction Following | CodeCode Available | 2 | 5 |
| Fira: Can We Achieve Full-rank Training of LLMs Under Low-rank Constraint? | Oct 2, 2024 | | CodeCode Available | 2 | 5 |
| PDE-Transformer: Efficient and Versatile Transformers for Physics Simulations | May 30, 2025 | | CodeCode Available | 2 | 5 |
| Box-supervised Instance Segmentation with Level Set Evolution | Jul 19, 2022 | Box-supervised Instance SegmentationInstance Segmentation | CodeCode Available | 2 | 5 |
| AEM: Attention Entropy Maximization for Multiple Instance Learning based Whole Slide Image Classification | Jun 18, 2024 | Diversityimage-classification | CodeCode Available | 2 | 5 |
| Deep PCB To COCO Convertor | May 1, 2022 | ClassificationData Augmentation | CodeCode Available | 2 | 5 |
| GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest | Jul 7, 2023 | AttributeCommon Sense Reasoning | CodeCode Available | 2 | 5 |
| Scaling Video-Language Models to 10K Frames via Hierarchical Differential Distillation | Apr 3, 2025 | Computational EfficiencyGPU | CodeCode Available | 2 | 5 |
| InstructCV: Instruction-Tuned Text-to-Image Diffusion Models as Vision Generalists | Sep 30, 2023 | Depth EstimationImage Generation | CodeCode Available | 2 | 5 |
| TTS-GAN: A Transformer-based Time-Series Generative Adversarial Network | Feb 6, 2022 | Data AugmentationDimensionality Reduction | CodeCode Available | 2 | 5 |
| Complex Embeddings for Simple Link Prediction | Jun 20, 2016 | Link PredictionPrediction | CodeCode Available | 2 | 5 |
| Diffusion-based Generation, Optimization, and Planning in 3D Scenes | Jan 15, 2023 | DenoisingGrasp Generation | CodeCode Available | 2 | 5 |
| Federated Learning with New Knowledge: Fundamentals, Advances, and Futures | Feb 3, 2024 | Federated LearningPrivacy Preserving | CodeCode Available | 2 | 5 |
| ZnTrack -- Data as Code | Jan 19, 2024 | Management | CodeCode Available | 2 | 5 |
| Scaling Relationship on Learning Mathematical Reasoning with Large Language Models | Aug 3, 2023 | Arithmetic ReasoningGSM8K | CodeCode Available | 2 | 5 |
| One-Step Diffusion Distillation through Score Implicit Matching | Oct 22, 2024 | | CodeCode Available | 2 | 5 |
| StreetSurf: Extending Multi-view Implicit Surface Reconstruction to Street Views | Jun 8, 2023 | Autonomous DrivingGPU | CodeCode Available | 2 | 5 |
| AutoSoccerPose: Automated 3D posture Analysis of Soccer Shot Movements | May 20, 2024 | 3D Pose EstimationPose Estimation | CodeCode Available | 2 | 5 |
| SeerAttention: Learning Intrinsic Sparse Attention in Your LLMs | Oct 17, 2024 | | CodeCode Available | 2 | 5 |
| Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis | May 15, 2025 | Image GenerationText to Image Generation | CodeCode Available | 2 | 5 |
| mAIstro: an open-source multi-agentic system for automated end-to-end development of radiomics and deep learning models for medical imaging | Apr 30, 2025 | AI AgentClassification | CodeCode Available | 2 | 5 |
| H-vmunet: High-order Vision Mamba UNet for Medical Image Segmentation | Mar 20, 2024 | Image SegmentationLesion Segmentation | CodeCode Available | 2 | 5 |
| LUCY: Linguistic Understanding and Control Yielding Early Stage of Her | Jan 27, 2025 | Question Answering | CodeCode Available | 2 | 5 |
| EDTER: Edge Detection with Transformer | Mar 16, 2022 | DecoderEdge Detection | CodeCode Available | 2 | 5 |
| Automated MRI Quality Assessment of Brain T1-weighted MRI in Clinical Data Warehouses: A Transfer Learning Approach Relying on Artefact Simulation | Jun 18, 2024 | Transfer Learning | CodeCode Available | 2 | 5 |
| 6DoF Head Pose Estimation through Explicit Bidirectional Interaction with Face Geometry | Jul 19, 2024 | Head Pose EstimationPose Estimation | CodeCode Available | 2 | 5 |
| Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP | Aug 4, 2023 | Open Vocabulary Panoptic SegmentationOpen Vocabulary Semantic Segmentation | CodeCode Available | 2 | 5 |