| LLMs as Hackers: Autonomous Linux Privilege Escalation Attacks | Oct 17, 2023 | In-Context Learning | CodeCode Available | 2 |
| ChemVLM: Exploring the Power of Multimodal Large Language Models in Chemistry Area | Aug 14, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Text2BIM: Generating Building Models Using a Large Language Model-based Multi-Agent Framework | Aug 15, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| FastCPH: Efficient Survival Analysis for Neural Networks | Aug 21, 2022 | Survival Analysis | CodeCode Available | 2 |
| C2P-CLIP: Injecting Category Common Prompt in CLIP to Enhance Generalization in Deepfake Detection | Aug 19, 2024 | DeepFake DetectionFace Swapping | CodeCode Available | 2 |
| PerturBench: Benchmarking Machine Learning Models for Cellular Perturbation Analysis | Aug 20, 2024 | Benchmarking | CodeCode Available | 2 |
| BearLLM: A Prior Knowledge-Enhanced Bearing Health Management Framework with Unified Vibration Signal Representation | Aug 21, 2024 | Fault DiagnosisManagement | CodeCode Available | 2 |
| Scalable Autoregressive Image Generation with Mamba | Aug 22, 2024 | Image GenerationMamba | CodeCode Available | 2 |
| SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning | Jun 30, 2025 | MathMulti-agent Reinforcement Learning | CodeCode Available | 2 |
| MLR-Copilot: Autonomous Machine Learning Research based on Large Language Models Agents | Aug 26, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| LLMs as Zero-shot Graph Learners: Alignment of GNN Representations with LLM Token Embeddings | Aug 25, 2024 | Language ModellingLink Prediction | CodeCode Available | 2 |
| Leveraging Hallucinations to Reduce Manual Prompt Dependency in Promptable Segmentation | Aug 27, 2024 | Camouflaged Object SegmentationCamouflaged Object Segmentation with a Single Task-generic Prompt | CodeCode Available | 2 |
| Stochastic Parameter Decomposition | Jun 25, 2025 | | CodeCode Available | 2 |
| Enhancing Privacy in Federated Learning: Secure Aggregation for Real-World Healthcare Applications | Sep 2, 2024 | CPUFederated Learning | CodeCode Available | 2 |
| Boosting Vision-Language Models for Histopathology Classification: Predict all at once | Sep 3, 2024 | Allzero-shot-classification | CodeCode Available | 2 |
| FunctionChat-Bench: Comprehensive Evaluation of Language Models' Generative Capabilities in Korean Tool-use Dialogs | Nov 21, 2024 | Relevance Detection | CodeCode Available | 2 |
| Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression | Sep 1, 2024 | Autonomous Driving | CodeCode Available | 2 |
| Towards a Unified View of Preference Learning for Large Language Models: A Survey | Sep 4, 2024 | | CodeCode Available | 2 |
| UniDet3D: Multi-dataset Indoor 3D Object Detection | Sep 6, 2024 | 3D Object DetectionObject | CodeCode Available | 2 |
| A Pair Programming Framework for Code Generation via Multi-Plan Exploration and Feedback-Driven Refinement | Sep 8, 2024 | Code Generation | CodeCode Available | 2 |
| Assessing SPARQL capabilities of Large Language Models | Sep 9, 2024 | BenchmarkingKnowledge Graphs | CodeCode Available | 2 |
| DiffusionPen: Towards Controlling the Style of Handwritten Text Generation | Sep 9, 2024 | DiversityHTR | CodeCode Available | 2 |
| ThermalGaussian: Thermal 3D Gaussian Splatting | Sep 11, 2024 | 3DGSNeRF | CodeCode Available | 2 |
| What is the Relationship between Tensor Factorizations and Circuits (and How Can We Exploit it)? | Sep 12, 2024 | | CodeCode Available | 2 |
| Recent Trends of Multimodal Affective Computing: A Survey from NLP Perspective | Sep 11, 2024 | Aspect-Based Sentiment AnalysisEmotion Recognition | CodeCode Available | 2 |
| EZIGen: Enhancing zero-shot personalized image generation with precise subject encoding and decoupled guidance | Sep 12, 2024 | DenoisingImage Generation | CodeCode Available | 2 |
| SSR-Speech: Towards Stable, Safe and Robust Zero-shot Text-based Speech Editing and Synthesis | Sep 11, 2024 | DecoderSpeech Synthesis | CodeCode Available | 2 |
| Fit and Prune: Fast and Training-free Visual Token Pruning for Multi-modal Large Language Models | Sep 16, 2024 | | CodeCode Available | 2 |
| Large Language Models are Strong Audio-Visual Speech Recognition Learners | Sep 18, 2024 | Audio-Visual Speech RecognitionAutomatic Speech Recognition | CodeCode Available | 2 |
| HSIGene: A Foundation Model For Hyperspectral Image Generation | Sep 19, 2024 | Data AugmentationDenoising | CodeCode Available | 2 |
| Small Language Models: Survey, Measurements, and Insights | Sep 24, 2024 | BenchmarkingDecoder | CodeCode Available | 2 |
| Archon: An Architecture Search Framework for Inference-Time Techniques | Sep 23, 2024 | Hyperparameter OptimizationInstruction Following | CodeCode Available | 2 |
| Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Dataset for Pre-training and Benchmarks | Jun 7, 2023 | Cross-Modal RetrievalLanguage Modelling | CodeCode Available | 2 |
| PointSAM: Pointly-Supervised Segment Anything Model for Remote Sensing Images | Sep 20, 2024 | Image SegmentationSemantic Segmentation | CodeCode Available | 2 |
| LTNtorch: PyTorch Implementation of Logic Tensor Networks | Sep 24, 2024 | Binary ClassificationLogical Reasoning | CodeCode Available | 2 |
| Occupancy-Based Dual Contouring | Sep 20, 2024 | 3D ReconstructionGPU | CodeCode Available | 2 |
| Revisiting the Solution of Meta KDD Cup 2024: CRAG | Sep 9, 2024 | RAGRetrieval | CodeCode Available | 2 |
| Source-Free Domain Adaptation for YOLO Object Detection | Sep 25, 2024 | Domain AdaptationModel Selection | CodeCode Available | 2 |
| Game4Loc: A UAV Geo-Localization Benchmark from Game Data | Sep 25, 2024 | Drone-view target localizationgeo-localization | CodeCode Available | 2 |
| E.T. Bench: Towards Open-Ended Event-Level Video-Language Understanding | Sep 26, 2024 | Question AnsweringVideo Understanding | CodeCode Available | 2 |
| Resolving Multi-Condition Confusion for Finetuning-Free Personalized Image Generation | Sep 26, 2024 | Image GenerationObject | CodeCode Available | 2 |
| Rethinking the Power of Timestamps for Robust Time Series Forecasting: A Global-Local Fusion Perspective | Sep 27, 2024 | Time SeriesTime Series Forecasting | CodeCode Available | 2 |
| Melody-Guided Music Generation | Sep 30, 2024 | cross-modal alignmentMusic Generation | CodeCode Available | 2 |
| Restore Anything with Masks: Leveraging Mask Image Modeling for Blind All-in-One Image Restoration | Sep 28, 2024 | AllAttribute | CodeCode Available | 2 |
| GSPR: Multimodal Place Recognition Using 3D Gaussian Splatting for Autonomous Driving | Oct 1, 2024 | Autonomous DrivingAutonomous Vehicles | CodeCode Available | 2 |
| EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control | Oct 1, 2024 | Emotional Speech SynthesisSpeech Synthesis | CodeCode Available | 2 |
| WAFT: Warping-Alone Field Transforms for Optical Flow | Jun 26, 2025 | Optical Flow EstimationZero-shot Generalization | CodeCode Available | 2 |
| Selective Aggregation for Low-Rank Adaptation in Federated Learning | Oct 2, 2024 | Federated LearningGeneral Knowledge | CodeCode Available | 2 |
| StickyLand: Breaking the Linear Presentation of Computational Notebooks | Feb 22, 2022 | | CodeCode Available | 2 |
| Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in Multimodal Large Language Models | Oct 4, 2024 | DecoderHallucination | CodeCode Available | 2 |