SOTAVerified

Language Modeling

Papers

Showing 851900 of 14182 papers

TitleStatusHype
Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video UnderstandingCode2
Tamil-Llama: A New Tamil Language Model Based on Llama 2Code2
On the Road with GPT-4V(ision): Early Explorations of Visual-Language Model on Autonomous DrivingCode2
BeLLM: Backward Dependency Enhanced Large Language Model for Sentence EmbeddingsCode2
Large Trajectory Models are Scalable Motion Predictors and PlannersCode2
Discrete Diffusion Modeling by Estimating the Ratios of the Data DistributionCode2
DISC-FinLLM: A Chinese Financial Large Language Model based on Multiple Experts Fine-tuningCode2
PromptCBLUE: A Chinese Prompt Tuning Benchmark for the Medical DomainCode2
Monarch Mixer: A Simple Sub-Quadratic GEMM-Based ArchitectureCode2
BitNet: Scaling 1-bit Transformers for Large Language ModelsCode2
LLark: A Multimodal Instruction-Following Language Model for MusicCode2
Sheared LLaMA: Accelerating Language Model Pre-training via Structured PruningCode2
Making Large Language Models Perform Better in Knowledge Graph CompletionCode2
OptiMUS: Optimization Modeling Using MIP Solvers and large language modelsCode2
GoLLIE: Annotation Guidelines improve Zero-Shot Information-ExtractionCode2
Ring Attention with Blockwise Transformers for Near-Infinite ContextCode2
GPT-Driver: Learning to Drive with GPTCode2
Alphazero-like Tree-Search can Guide Large Language Model Decoding and TrainingCode2
RLLTE: Long-Term Evolution Project of Reinforcement LearningCode2
Effective Long-Context Scaling of Foundation ModelsCode2
MetaMath: Bootstrap Your Own Mathematical Questions for Large Language ModelsCode2
LLM-Grounder: Open-Vocabulary 3D Visual Grounding with Large Language Model as an AgentCode2
OWL: A Large Language Model for IT OperationsCode2
Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative DecodingCode2
MMICL: Empowering Vision-language Model with Multi-Modal In-Context LearningCode2
Unified Human-Scene Interaction via Prompted Chain-of-ContactsCode2
Kani: A Lightweight and Highly Hackable Framework for Building Language Model ApplicationsCode2
Automated Bioinformatics Analysis via AutoBACode2
GPT Can Solve Mathematical Problems Without a CalculatorCode2
Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction TuningCode2
Point-Bind & Point-LLM: Aligning Point Cloud with Multi-modality for 3D Understanding, Generation, and Instruction FollowingCode2
SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language ModelsCode2
LLaSM: Large Language and Speech ModelCode2
DTrOCR: Decoder-only Transformer for Optical Character RecognitionCode2
SeqGPT: An Out-of-the-box Large Language Model for Open Domain Sequence UnderstandingCode2
Chat-3D: Data-efficiently Tuning Large Language Model for Universal Dialogue of 3D ScenesCode2
Language is All a Graph NeedsCode2
SimplyRetrieve: A Private and Lightweight Retrieval-Centric Generative AI ToolCode2
Shepherd: A Critic for Language Model GenerationCode2
AgentSims: An Open-Source Sandbox for Large Language Model EvaluationCode2
Zhongjing: Enhancing the Chinese Medical Capabilities of Large Language Model through Expert Feedback and Real-world Multi-turn DialogueCode2
Spanish Pre-trained BERT Model and Evaluation DataCode2
EduChat: A Large-Scale Language Model-based Chatbot System for Intelligent EducationCode2
LP-MusicCaps: LLM-Based Pseudo Music CaptioningCode2
Distilled Feature Fields Enable Few-Shot Language-Guided ManipulationCode2
TransNormerLLM: A Faster and Better Large Language Model with Improved TransNormerCode2
A Systematic Survey of Prompt Engineering on Vision-Language Foundation ModelsCode2
FLASK: Fine-grained Language Model Evaluation based on Alignment Skill SetsCode2
DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection for Conversational AICode2
Planting a SEED of Vision in Large Language ModelCode2
Show:102550
← PrevPage 18 of 284Next →

No leaderboard results yet.