SOTAVerified

Language Modeling

Papers

Showing 851-900 of 14182 papers

Title | Status | Hype
HGRN2: Gated Linear RNNs with State Expansion | Code | 2
Hierarchical Expert Prompt for Large-Language-Model: An Approach Defeat Elite AI in TextStarCraft II for the First Time | Code | 2
Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative Decoding | Code | 2
Holmes-VAD: Towards Unbiased and Explainable Video Anomaly Detection via Multi-modal LLM | Code | 2
Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-Flow | Code | 2
VHM: Versatile and Honest Vision Language Model for Remote Sensing Image Analysis | Code | 2
How to Index Item IDs for Recommendation Foundation Models | Code | 2
Grounding Language Models to Images for Multimodal Inputs and Outputs | Code | 2
Drive Like a Human: Rethinking Autonomous Driving with Large Language Models | Code | 2
DriVLMe: Enhancing LLM-based Autonomous Driving Agents with Embodied and Social Experiences | Code | 2
Grounded 3D-LLM with Referent Tokens | Code | 2
DsDm: Model-Aware Dataset Selection with Datamodels | Code | 2
GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding | Code | 2
RetroMAE: Pre-Training Retrieval-oriented Language Models Via Masked Auto-Encoder | Code | 2
Causal Agent based on Large Language Model | Code | 2
Cedille: A large autoregressive French language model | Code | 2
GraphWiz: An Instruction-Following Language Model for Graph Problems | Code | 2
Graph-Aware Isomorphic Attention for Adaptive Dynamics in Transformers | Code | 2
Granite Guardian | Code | 2
Ring Attention with Blockwise Transformers for Near-Infinite Context | Code | 2
Graph Language Models | Code | 2
RS5M and GeoRSCLIP: A Large Scale Vision-Language Dataset and A Large Vision-Language Model for Remote Sensing | Code | 2
GPT or BERT: why not both? | Code | 2
SOLO: A Single Transformer for Scalable Vision-Language Modeling | Code | 2
GPT Understands, Too | Code | 2
GraphTranslator: Aligning Graph Model to Large Language Model for Open-ended Tasks | Code | 2
GuidedQuant: Large Language Model Quantization via Exploiting End Loss Guidance | Code | 2
Huatuo-26M, a Large-scale Chinese Medical QA Dataset | Code | 2
Implicit Neural Representation for Cooperative Low-light Image Enhancement | Code | 2
Language Models Can Improve Event Prediction by Few-Shot Abductive Reasoning | Code | 2
GoLLIE: Annotation Guidelines improve Zero-Shot Information-Extraction | Code | 2
GOFA: A Generative One-For-All Model for Joint Graph Language Modeling | Code | 2
ECG-Chat: A Large ECG-Language Model for Cardiac Disease Diagnosis | Code | 2
Compression Represents Intelligence Linearly | Code | 2
EDGE-LLM: Enabling Efficient Large Language Model Adaptation on Edge Devices via Layerwise Unified Compression and Adaptive Layer Tuning and Voting | Code | 2
GODEL: Large-Scale Pre-Training for Goal-Directed Dialog | Code | 2
GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmentation | Code | 2
EduChat: A Large-Scale Language Model-based Chatbot System for Intelligent Education | Code | 2
DiffArtist: Towards Structure and Appearance Controllable Image Stylization | Code | 2
Scaling Transformer to 1M tokens and beyond with RMT | Code | 2
Empirical Asset Pricing with Large Language Model Agents | Code | 2
GMAI-VL & GMAI-VL-5.5M: A Large Vision-Language Model and A Comprehensive Multimodal Dataset Towards General Medical AI | Code | 2
GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest | Code | 2
Scene Text Recognition with Permuted Autoregressive Sequence Models | Code | 2
Composed Image Retrieval for Remote Sensing | Code | 2
GIT: A Generative Image-to-text Transformer for Vision and Language | Code | 2
Characterization of Large Language Model Development in the Datacenter | Code | 2
GeReA: Question-Aware Prompt Captions for Knowledge-based Visual Question Answering | Code | 2
GeoVision Labeler: Zero-Shot Geospatial Classification with Vision and Language Models | Code | 2
GeoChat: Grounded Large Vision-Language Model for Remote Sensing | Code | 2

No leaderboard results yet.