SOTAVerified

Language Modeling

Papers

Showing 451-500 of 14182 papers

| Title | Status | Hype |
| --- | --- | --- |
| Behind Maya: Building a Multilingual Vision Language Model | Code | 2 |
| Large Language Model Psychometrics: A Systematic Review of Evaluation, Validation, and Enhancement | Code | 2 |
| DynamicRAG: Leveraging Outputs of Large Language Model as Feedback for Dynamic Reranking in Retrieval-Augmented Generation | Code | 2 |
| GuidedQuant: Large Language Model Quantization via Exploiting End Loss Guidance | Code | 2 |
| MemEngine: A Unified and Modular Library for Developing Advanced Memory of LLM-based Agents | Code | 2 |
| RWKV-X: A Linear Complexity Hybrid Language Model | Code | 2 |
| Towards Practical Second-Order Optimizers in Deep Learning: Insights from Fisher Information Analysis | Code | 2 |
| The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer | Code | 2 |
| ClinicalGPT-R1: Pushing reasoning capability of generalist disease diagnosis with large language model | Code | 2 |
| SegEarth-R1: Geospatial Pixel Reasoning via Large Language Model | Code | 2 |
| Vision-Language Model for Object Detection and Segmentation: A Review and Evaluation | Code | 2 |
| PACT: Pruning and Clustering-Based Token Reduction for Faster Visual Language Models | Code | 2 |
| GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmentation | Code | 2 |
| TASTE: Text-Aligned Speech Tokenization and Embedding for Spoken Language Modeling | Code | 2 |
| Scaling Video-Language Models to 10K Frames via Hierarchical Differential Distillation | Code | 2 |
| Unicorn: Text-Only Data Synthesis for Vision Language Model Training | Code | 2 |
| Mobile-VideoGPT: Fast and Accurate Video Understanding Language Model | Code | 2 |
| Rethinking Vision-Language Model in Face Forensics: Multi-Modal Interpretable Forged Face Detector | Code | 2 |
| Med3DVLM: An Efficient Vision-Language Model for 3D Medical Image Analysis | Code | 2 |
| MC-LLaVA: Multi-Concept Personalized Vision-Language Model | Code | 2 |
| FastCuRL: Curriculum Reinforcement Learning with Progressive Context Extension for Efficient Training R1-like Reasoning Models | Code | 2 |
| CVE-Bench: A Benchmark for AI Agents' Ability to Exploit Real-World Web Application Vulnerabilities | Code | 2 |
| Modifying Large Language Model Post-Training for Diverse Creative Writing | Code | 2 |
| Generalized Few-shot 3D Point Cloud Segmentation with Vision-Language Model | Code | 2 |
| VenusFactory: A Unified Platform for Protein Engineering Data Retrieval and Language Model Fine-Tuning | Code | 2 |
| Tiled Flash Linear Attention: More Efficient Linear RNN and xLSTM Kernels | Code | 2 |
| MaTVLM: Hybrid Mamba-Transformer for Efficient Vision-Language Modeling | Code | 2 |
| Generative Modeling for Mathematical Discovery | Code | 2 |
| OR-LLM-Agent: Automating Modeling and Solving of Operations Research Optimization Problem with Reasoning Large Language Model | Code | 2 |
| GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding | Code | 2 |
| LongProLIP: A Probabilistic Vision-Language Model with Long Context Text | Code | 2 |
| Mellow: a small audio language model for reasoning | Code | 2 |
| When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning | Code | 2 |
| DiffCLIP: Differential Attention Meets CLIP | Code | 2 |
| Next Token Is Enough: Realistic Image Quality and Aesthetic Scoring with Multimodal Large Language Model | Code | 2 |
| A Survey of Large Language Model Empowered Agents for Recommendation and Search: Towards Next-Generation Information Retrieval | Code | 2 |
| PromptPex: Automatic Test Generation for Language Model Prompts | Code | 2 |
| Generalized Interpolating Discrete Diffusion | Code | 2 |
| Keeping Yourself is Important in Downstream Tuning Multimodal Large Language Model | Code | 2 |
| AnyAnomaly: Zero-Shot Customizable Video Anomaly Detection with LVLM | Code | 2 |
| An Egocentric Vision-Language Model based Portable Real-time Smart Assistant | Code | 2 |
| Full-Duplex-Bench: A Benchmark to Evaluate Full-duplex Spoken Dialogue Models on Turn-taking Capabilities | Code | 2 |
| Scaling Rich Style-Prompted Text-to-Speech Datasets | Code | 2 |
| Collaborative Expert LLMs Guided Multi-Objective Molecular Optimization | Code | 2 |
| MM-OR: A Large Multimodal Operating Room Dataset for Semantic Understanding of High-Intensity Surgical Environments | Code | 2 |
| OptMetaOpenFOAM: Large Language Model Driven Chain of Thought for Sensitivity Analysis and Parameter Optimization based on CFD | Code | 2 |
| Forgetting Transformer: Softmax Attention with a Forget Gate | Code | 2 |
| LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement | Code | 2 |
| AgentSociety Challenge: Designing LLM Agents for User Modeling and Recommendation on Web Platforms | Code | 2 |
| Citrus: Leveraging Expert Cognitive Pathways in a Medical Language Model for Advanced Medical Decision Support | Code | 2 |

No leaderboard results yet.