| Ladder: A Model-Agnostic Framework Boosting LLM-based Machine Translation to the Next Level | Jun 22, 2024 | Machine Translation, Translation | Code Available | 2 |
| WiMANS: A Benchmark Dataset for WiFi-based Multi-user Activity Sensing | Jan 24, 2024 | Activity Recognition | Code Available | 2 |
| Pretrained LLM Adapted with LoRA as a Decision Transformer for Offline RL in Quantitative Trading | Nov 26, 2024 | Offline RL, parameter-efficient fine-tuning | Code Available | 2 |
| RankVicuna: Zero-Shot Listwise Document Reranking with Open-Source Large Language Models | Sep 26, 2023 | Information Retrieval, Reranking | Code Available | 2 |
| SynJax: Structured Probability Distributions for JAX | Aug 7, 2023 | | Code Available | 2 |
| GPTopic: Dynamic and Interactive Topic Representations | Mar 6, 2024 | | Code Available | 2 |
| A Length-Extrapolatable Transformer | Dec 20, 2022 | Language Modeling, Language Modelling | Code Available | 2 |
| Scaling Up Your Kernels to 31x31: Revisiting Large Kernel Design in CNNs | Mar 13, 2022 | Image Classification | Code Available | 2 |
| Democratizing Neural Machine Translation with OPUS-MT | Dec 4, 2022 | Machine Translation, Translation | Code Available | 2 |
| Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training | Jan 10, 2024 | | Code Available | 2 |
| YOLOv8-AM: YOLOv8 Based on Effective Attention Mechanisms for Pediatric Wrist Fracture Detection | Feb 14, 2024 | Fracture detection, medical image detection | Code Available | 2 |
| PPLLaVA: Varied Video Sequence Understanding With Prompt Guidance | Nov 4, 2024 | Caption Generation, Multiple-choice | Code Available | 2 |
| EEG-Deformer: A Dense Convolutional Transformer for Brain-computer Interfaces | Apr 25, 2024 | EEG, Electroencephalogram (EEG) | Code Available | 2 |
| An Unsupervised Approach to Achieve Supervised-Level Explainability in Healthcare Records | Jun 13, 2024 | Adversarial Robustness, Explainable Artificial Intelligence (XAI) | Code Available | 2 |
| CHGNet: Pretrained universal neural network potential for charge-informed atomistic modeling | Feb 28, 2023 | Atomic Forces, Graph Neural Network | Code Available | 2 |
| LogAI: A Library for Log Analytics and Intelligence | Jan 31, 2023 | Anomaly Detection, Log Parsing | Code Available | 2 |
| ReMoDiffuse: Retrieval-Augmented Motion Diffusion Model | Apr 3, 2023 | Denoising, Diversity | Code Available | 2 |
| ConceptLab: Creative Concept Generation using VLM-Guided Diffusion Prior Constraints | Aug 3, 2023 | Image Generation, Language Modelling | Code Available | 2 |
| Geometric Latent Diffusion Models for 3D Molecule Generation | May 2, 2023 | 3D Molecule Generation, Unconditional Molecule Generation | Code Available | 2 |
| Accelerating Self-Play Learning in Go | Feb 27, 2019 | Game of Go | Code Available | 2 |
| LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models | May 23, 2023 | Common Sense Reasoning, Image Generation | Code Available | 2 |
| MoEUT: Mixture-of-Experts Universal Transformers | May 25, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| LLaMEA-BO: A Large Language Model Evolutionary Algorithm for Automatically Generating Bayesian Optimization Algorithms | May 27, 2025 | Bayesian Optimization, Benchmarking | Code Available | 2 |
| LaMI-DETR: Open-Vocabulary Detection with Language Model Instruction | Jul 16, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs | Apr 25, 2024 | Visual Grounding, Visual Question Answering | Code Available | 2 |
| Unified Language-Vision Pretraining in LLM with Dynamic Discrete Visual Tokenization | Sep 9, 2023 | Language Modelling, Large Language Model | Code Available | 2 |
| Cross-Image Relational Knowledge Distillation for Semantic Segmentation | Apr 14, 2022 | Knowledge Distillation, Segmentation | Code Available | 2 |
| MLAgentBench: Evaluating Language Agents on Machine Learning Experimentation | Oct 5, 2023 | Benchmarking, Decision Making | Code Available | 2 |
| Glimpse: Enabling White-Box Methods to Use Proprietary Models for Zero-Shot LLM-Generated Text Detection | Dec 16, 2024 | LLM-generated Text Detection, Text Detection | Code Available | 2 |
| R3LIVE: A Robust, Real-time, RGB-colored, LiDAR-Inertial-Visual tightly-coupled state Estimation and mapping package | Sep 10, 2021 | Sensor Fusion, State Estimation | Code Available | 2 |
| ActiveRAG: Autonomously Knowledge Assimilation and Accommodation through Retrieval-Augmented Agents | Feb 21, 2024 | Active Learning, Position | Code Available | 2 |
| Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation for Video Moment Retrieval | Jul 21, 2024 | General Knowledge, Highlight Detection | Code Available | 2 |
| SMAC3: A Versatile Bayesian Optimization Package for Hyperparameter Optimization | Sep 20, 2021 | AutoML, Bayesian Optimization | Code Available | 2 |
| TSM: Temporal Shift Module for Efficient and Scalable Video Understanding on Edge Device | Sep 27, 2021 | Video Recognition, Video Understanding | Code Available | 2 |
| MultiOOD: Scaling Out-of-Distribution Detection for Multiple Modalities | May 27, 2024 | Autonomous Driving, Out-of-Distribution Detection | Code Available | 2 |
| Self-Exploring Language Models: Active Preference Elicitation for Online Alignment | May 29, 2024 | Instruction Following | Code Available | 2 |
| FEC: Fast Euclidean Clustering for Point Cloud Segmentation | Aug 16, 2022 | Clustering, Instance Segmentation | Code Available | 2 |
| PeFoMed: Parameter Efficient Fine-tuning of Multimodal Large Language Models for Medical Imaging | Jan 5, 2024 | Medical Report Generation, Medical Visual Question Answering | Code Available | 2 |
| Jailbreak Vision Language Models via Bi-Modal Adversarial Prompt | Jun 6, 2024 | Language Modelling, Large Language Model | Code Available | 2 |
| UMBRELA: UMbrela is the (Open-Source Reproduction of the) Bing RELevance Assessor | Jun 10, 2024 | RAG, Retrieval | Code Available | 2 |
| Exploring Orthogonality in Open World Object Detection | Jan 1, 2024 | Incremental Learning, Object | Code Available | 2 |
| You Name It, I Run It: An LLM Agent to Execute Tests of Arbitrary Projects | Dec 13, 2024 | Large Language Model | Code Available | 2 |
| Equinox: neural networks in JAX via callable PyTrees and filtered transformations | Oct 30, 2021 | | Code Available | 2 |
| Deep Architectures for Content Moderation and Movie Content Rating | Dec 8, 2022 | Action Recognition, Genre classification | Code Available | 2 |
| Revisiting Referring Expression Comprehension Evaluation in the Era of Large Multimodal Models | Jun 24, 2024 | Referring Expression, Referring Expression Comprehension | Code Available | 2 |
| Denoising as Adaptation: Noise-Space Domain Adaptation for Image Restoration | Jun 26, 2024 | Contrastive Learning, Deblurring | Code Available | 2 |
| Direct Voxel Grid Optimization: Super-fast Convergence for Radiance Fields Reconstruction | Nov 22, 2021 | GPU, NeRF | Code Available | 2 |
| Investigating Tradeoffs in Real-World Video Super-Resolution | Nov 24, 2021 | Benchmarking, Super-Resolution | Code Available | 2 |
| SPANN: Highly-efficient Billion-scale Approximate Nearest Neighborhood Search | May 21, 2021 | | Code Available | 2 |
| Deep Learning Interviews: Hundreds of fully solved job interview questions from a wide range of key topics in AI | Dec 30, 2021 | | Code Available | 2 |