| Integrating Self-supervised Speech Model with Pseudo Word-level Targets from Visually-grounded Speech Model | Feb 8, 2024 | modelSpoken Language Understanding | —Unverified | 0 |
| DiscDiff: Latent Diffusion Model for DNA Sequence Generation | Feb 8, 2024 | model | —Unverified | 0 |
| An Interactive Agent Foundation Model | Feb 8, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Driving Everywhere with Large Language Model Policy Adaptation | Feb 8, 2024 | Autonomous DrivingAutonomous Vehicles | —Unverified | 0 |
| Structure-Informed Protein Language Model | Feb 7, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| A Resource Model For Neural Scaling Law | Feb 7, 2024 | model | —Unverified | 0 |
| Probabilistic ML Verification via Weighted Model Integration | Feb 7, 2024 | Fairnessmodel | —Unverified | 0 |
| Majority Kernels: An Approach to Leverage Big Model Dynamics for Efficient Small Model Training | Feb 7, 2024 | Combinatorial OptimizationComputational Efficiency | —Unverified | 0 |
| Direct Language Model Alignment from Online AI Feedback | Feb 7, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Image captioning for Brazilian Portuguese using GRIT model | Feb 7, 2024 | Image Captioningmodel | —Unverified | 0 |
| Bidirectional Autoregressive Diffusion Model for Dance Generation | Feb 6, 2024 | modelMotion Generation | —Unverified | 0 |
| The VampPrior Mixture Model | Feb 6, 2024 | ClusteringImage Clustering | CodeCode Available | 0 |
| Challenges in Mechanistically Interpreting Model Representations | Feb 6, 2024 | modelPosition | CodeCode Available | 0 |
| Reinforcement Learning with Ensemble Model Predictive Safety Certification | Feb 6, 2024 | Deep Reinforcement Learningmodel | CodeCode Available | 0 |
| Lens: A Foundation Model for Network Traffic | Feb 6, 2024 | Decodermodel | —Unverified | 0 |
| EscherNet: A Generative Model for Scalable View Synthesis | Feb 6, 2024 | 3D ReconstructionGPU | CodeCode Available | 3 |
| VRMM: A Volumetric Relightable Morphable Head Model | Feb 6, 2024 | 3D Face ReconstructionFace Reconstruction | —Unverified | 0 |
| Clarify: Improving Model Robustness With Natural Language Corrections | Feb 6, 2024 | Misconceptionsmodel | CodeCode Available | 0 |
| Vanishing Feature: Diagnosing Model Merging and Beyond | Feb 5, 2024 | Linear Mode Connectivitymodel | CodeCode Available | 0 |
| Neural option pricing for rough Bergomi model | Feb 5, 2024 | model | —Unverified | 0 |
| Representation Surgery for Multi-Task Model Merging | Feb 5, 2024 | Computational Efficiencymodel | CodeCode Available | 1 |
| Large Language Model Distilling Medication Recommendation Model | Feb 5, 2024 | Knowledge DistillationLanguage Modeling | CodeCode Available | 1 |
| Jailbreaking Attack against Multimodal Large Language Model | Feb 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Image Fusion via Vision-Language Model | Feb 3, 2024 | DecoderLanguage Modeling | CodeCode Available | 4 |
| Preference Poisoning Attacks on Reward Model Learning | Feb 2, 2024 | modelRecommendation Systems | —Unverified | 0 |
| Large Language Model Agent for Hyper-Parameter Optimization | Feb 2, 2024 | AutoMLHyperparameter Optimization | —Unverified | 0 |
| What Will My Model Forget? Forecasting Forgotten Examples in Language Model Refinement | Feb 2, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Dynamical Model of Neural Scaling Laws | Feb 2, 2024 | model | —Unverified | 0 |
| KTO: Model Alignment as Prospect Theoretic Optimization | Feb 2, 2024 | Attributemodel | CodeCode Available | 4 |
| A Probabilistic Model Behind Self-Supervised Learning | Feb 2, 2024 | modelRepresentation Learning | CodeCode Available | 0 |
| Need a Small Specialized Language Model? Plan Early! | Feb 2, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| The Information of Large Language Model Geometry | Feb 1, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Towards Efficient Exact Optimization of Language Model Alignment | Feb 1, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| CroissantLLM: A Truly Bilingual French-English Language Model | Feb 1, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| EuroPED-NN: Uncertainty aware surrogate model | Feb 1, 2024 | model | —Unverified | 0 |
| Masked Conditional Diffusion Model for Enhancing Deepfake Detection | Feb 1, 2024 | Data AugmentationDeepFake Detection | —Unverified | 0 |
| Diffusion Model Compression for Image-to-Image Translation | Jan 31, 2024 | Conditional Image GenerationDenoising | —Unverified | 0 |
| Improving QA Model Performance with Cartographic Inoculation | Jan 30, 2024 | model | —Unverified | 0 |
| CaMU: Disentangling Causal Effects in Deep Model Unlearning | Jan 30, 2024 | Machine Unlearningmodel | CodeCode Available | 0 |
| Dynamical System Identification, Model Selection and Model Uncertainty Quantification by Bayesian Inference | Jan 30, 2024 | Bayesian Inferencemodel | —Unverified | 0 |
| Gradient-Based Language Model Red Teaming | Jan 30, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Engineering A Large Language Model From Scratch | Jan 30, 2024 | Deep LearningLanguage Modeling | —Unverified | 0 |
| Diffusion model for relational inference | Jan 30, 2024 | Imputationmodel | CodeCode Available | 0 |
| CFTM: Continuous time fractional topic model | Jan 29, 2024 | ArticlesDynamic Topic Modeling | —Unverified | 0 |
| New Foggy Object Detecting Model | Jan 27, 2024 | modelObject | —Unverified | 0 |
| MaLLaM -- Malaysia Large Language Model | Jan 26, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ChemDFM: A Large Language Foundation Model for Chemistry | Jan 26, 2024 | Formmodel | CodeCode Available | 2 |
| Hierarchical Continual Reinforcement Learning via Large Language Model | Jan 25, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| MoE-Infinity: Efficient MoE Inference on Personal Machines with Sparsity-Aware Expert Cache | Jan 25, 2024 | GPUmodel | CodeCode Available | 3 |
| Accelerating Retrieval-Augmented Language Model Serving with Speculation | Jan 25, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |