| Aligning Language Models with Demonstrated Feedback | Jun 2, 2024 | Articles, Avg | Code Available | 2 |
| Query2CAD: Generating CAD models using natural language queries | May 31, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| ABodyBuilder3: Improved and scalable antibody structure predictions | May 31, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| LLaMEA: A Large Language Model Evolutionary Algorithm for Automatically Generating Metaheuristics | May 30, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models | May 29, 2024 | Instruction Following, Language Modeling | Code Available | 2 |
| Knowledge Circuits in Pretrained Transformers | May 28, 2024 | In-Context Learning, knowledge editing | Code Available | 2 |
| Motion-Agent: A Conversational Framework for Human Motion Generation with LLMs | May 27, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model | May 27, 2024 | Decoder, Language Modeling | Code Available | 2 |
| LoQT: Low-Rank Adapters for Quantized Pretraining | May 26, 2024 | GPU, Language Modeling | Code Available | 2 |
| KG-FIT: Knowledge Graph Fine-Tuning Upon Open-World Knowledge | May 26, 2024 | Graph Embedding, Informativeness | Code Available | 2 |
| AdaFisher: Adaptive Second Order Optimization via Fisher Information | May 26, 2024 | Computational Efficiency, image-classification | Code Available | 2 |
| A Survey of Multimodal Large Language Model from A Data-centric Perspective | May 26, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| MoEUT: Mixture-of-Experts Universal Transformers | May 25, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Sparse maximal update parameterization: A holistic approach to sparse training dynamics | May 24, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Composed Image Retrieval for Remote Sensing | May 24, 2024 | Composed Image Retrieval (CoIR), Descriptive | Code Available | 2 |
| LM4LV: A Frozen Large Language Model for Low-level Vision Tasks | May 24, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Large language models can be zero-shot anomaly detectors for time series? | May 23, 2024 | Anomaly Detection, Language Modeling | Code Available | 2 |
| Not All Language Model Features Are Linear | May 23, 2024 | All, Language Modeling | Code Available | 2 |
| Extracting Prompts by Inverting LLM Outputs | May 23, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Vikhr: Constructing a State-of-the-art Bilingual Open-Source Instruction-Following Large Language Model for Russian | May 22, 2024 | Instruction Following, Language Modeling | Code Available | 2 |
| xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token | May 22, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Automated Evaluation of Retrieval-Augmented Language Models with Task-Specific Exam Generation | May 22, 2024 | Informativeness, Language Modeling | Code Available | 2 |
| SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-parameterized Batch Normalization | May 19, 2024 | image-classification, Image Classification | Code Available | 2 |
| Observational Scaling Laws and the Predictability of Language Model Performance | May 17, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Layer-Condensed KV Cache for Efficient Inference of Large Language Models | May 17, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Libra: Building Decoupled Vision System on Large Language Models | May 16, 2024 | Image to text, Language Modeling | Code Available | 2 |
| Grounded 3D-LLM with Referent Tokens | May 16, 2024 | Dense Captioning, Diversity | Code Available | 2 |
| Xmodel-VLM: A Simple Baseline for Multimodal Vision Language Model | May 15, 2024 | GPU, Language Modeling | Code Available | 2 |
| PLeak: Prompt Leaking Attacks against Large Language Model Applications | May 10, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| State-Free Inference of State-Space Models: The Transfer Function Approach | May 10, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Memory Mosaics | May 10, 2024 | Disentanglement, In-Context Learning | Code Available | 2 |
| HMT: Hierarchical Memory Transformer for Long Context Language Processing | May 9, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Fishing for Magikarp: Automatically Detecting Under-trained Tokens in Large Language Models | May 8, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| The Codecfake Dataset and Countermeasures for the Universally Detection of Deepfake Audio | May 8, 2024 | Audio Deepfake Detection, Audio Generation | Code Available | 2 |
| SemiCD-VL: Visual-Language Model Guidance Makes Better Semi-supervised Change Detector | May 8, 2024 | Change Detection, Language Modeling | Code Available | 2 |
| AntiFold: Improved antibody structure-based design using inverse folding | May 6, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Explainable Fake News Detection With Large Language Model via Defense Among Competing Wisdom | May 6, 2024 | Fake News Detection, Language Modeling | Code Available | 2 |
| A Survey of Time Series Foundation Models: Generalizing Time Series Representation with Large Language Model | May 3, 2024 | Decision Making, Few-Shot Learning | Code Available | 2 |
| WorldGPT: Empowering LLM as Multimodal World Model | Apr 28, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Paint by Inpaint: Learning to Add Image Objects by Removing Them First | Apr 28, 2024 | Image Inpainting, Language Modeling | Code Available | 2 |
| PLAYER*: Enhancing LLM-based Multi-Agent Communication and Interaction in Murder Mystery Games | Apr 26, 2024 | Decision Making, Language Modeling | Code Available | 2 |
| REBEL: Reinforcement Learning via Regressing Relative Rewards | Apr 25, 2024 | continuous-control, Continuous Control | Code Available | 2 |
| SpaceByte: Towards Deleting Tokenization from Large Language Modeling | Apr 22, 2024 | Decoder, Language Modeling | Code Available | 2 |
| LLM-R2: A Large Language Model Enhanced Rule-based Rewrite System for Boosting Query Efficiency | Apr 19, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| σ-GPTs: A New Approach to Autoregressive Models | Apr 15, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Compression Represents Intelligence Linearly | Apr 15, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| TrafficVLM: A Controllable Visual Language Model for Traffic Video Captioning | Apr 14, 2024 | Dense Video Captioning, Descriptive | Code Available | 2 |
| LLM-Seg: Bridging Image Segmentation and Large Language Model Reasoning | Apr 12, 2024 | Image Segmentation, Language Modeling | Code Available | 2 |
| HGRN2: Gated Linear RNNs with State Expansion | Apr 11, 2024 | Image Classification, Language Modeling | Code Available | 2 |
| Behavior Trees Enable Structured Programming of Language Model Agents | Apr 11, 2024 | Language Modeling, Language Modelling | Code Available | 2 |