SOTAVerified

Language Modeling

Papers

Showing 701-750 of 14,182 papers

Title | Status | Hype
Aligning Language Models with Demonstrated Feedback | Code | 2
Query2CAD: Generating CAD models using natural language queries | Code | 2
ABodyBuilder3: Improved and scalable antibody structure predictions | Code | 2
LLaMEA: A Large Language Model Evolutionary Algorithm for Automatically Generating Metaheuristics | Code | 2
Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models | Code | 2
Knowledge Circuits in Pretrained Transformers | Code | 2
Motion-Agent: A Conversational Framework for Human Motion Generation with LLMs | Code | 2
Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model | Code | 2
LoQT: Low-Rank Adapters for Quantized Pretraining | Code | 2
KG-FIT: Knowledge Graph Fine-Tuning Upon Open-World Knowledge | Code | 2
AdaFisher: Adaptive Second Order Optimization via Fisher Information | Code | 2
A Survey of Multimodal Large Language Model from A Data-centric Perspective | Code | 2
MoEUT: Mixture-of-Experts Universal Transformers | Code | 2
Sparse maximal update parameterization: A holistic approach to sparse training dynamics | Code | 2
Composed Image Retrieval for Remote Sensing | Code | 2
LM4LV: A Frozen Large Language Model for Low-level Vision Tasks | Code | 2
Large language models can be zero-shot anomaly detectors for time series? | Code | 2
Not All Language Model Features Are Linear | Code | 2
Extracting Prompts by Inverting LLM Outputs | Code | 2
Vikhr: Constructing a State-of-the-art Bilingual Open-Source Instruction-Following Large Language Model for Russian | Code | 2
xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token | Code | 2
Automated Evaluation of Retrieval-Augmented Language Models with Task-Specific Exam Generation | Code | 2
SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-parameterized Batch Normalization | Code | 2
Observational Scaling Laws and the Predictability of Language Model Performance | Code | 2
Layer-Condensed KV Cache for Efficient Inference of Large Language Models | Code | 2
Libra: Building Decoupled Vision System on Large Language Models | Code | 2
Grounded 3D-LLM with Referent Tokens | Code | 2
Xmodel-VLM: A Simple Baseline for Multimodal Vision Language Model | Code | 2
PLeak: Prompt Leaking Attacks against Large Language Model Applications | Code | 2
State-Free Inference of State-Space Models: The Transfer Function Approach | Code | 2
Memory Mosaics | Code | 2
HMT: Hierarchical Memory Transformer for Long Context Language Processing | Code | 2
Fishing for Magikarp: Automatically Detecting Under-trained Tokens in Large Language Models | Code | 2
The Codecfake Dataset and Countermeasures for the Universally Detection of Deepfake Audio | Code | 2
SemiCD-VL: Visual-Language Model Guidance Makes Better Semi-supervised Change Detector | Code | 2
AntiFold: Improved antibody structure-based design using inverse folding | Code | 2
Explainable Fake News Detection With Large Language Model via Defense Among Competing Wisdom | Code | 2
A Survey of Time Series Foundation Models: Generalizing Time Series Representation with Large Language Model | Code | 2
WorldGPT: Empowering LLM as Multimodal World Model | Code | 2
Paint by Inpaint: Learning to Add Image Objects by Removing Them First | Code | 2
PLAYER*: Enhancing LLM-based Multi-Agent Communication and Interaction in Murder Mystery Games | Code | 2
REBEL: Reinforcement Learning via Regressing Relative Rewards | Code | 2
SpaceByte: Towards Deleting Tokenization from Large Language Modeling | Code | 2
LLM-R2: A Large Language Model Enhanced Rule-based Rewrite System for Boosting Query Efficiency | Code | 2
σ-GPTs: A New Approach to Autoregressive Models | Code | 2
Compression Represents Intelligence Linearly | Code | 2
TrafficVLM: A Controllable Visual Language Model for Traffic Video Captioning | Code | 2
LLM-Seg: Bridging Image Segmentation and Large Language Model Reasoning | Code | 2
HGRN2: Gated Linear RNNs with State Expansion | Code | 2
Behavior Trees Enable Structured Programming of Language Model Agents | Code | 2

No leaderboard results yet.