SOTAVerified

Language Modeling

Papers

Showing 1001-1050 of 14182 papers

Title | Status | Hype
What is Wrong with Perplexity for Long-context Language Modeling? | Code | 2
Autonomous Data Selection with Zero-shot Generative Classifiers for Mathematical Texts | Code | 2
Automatically Identifying Words That Can Serve as Labels for Few-Shot Text Classification | Code | 2
GeoReasoner: Geo-localization with Reasoning in Street Views using a Large Vision-Language Model | Code | 2
AutoGRAMS: Autonomous Graphical Agent Modeling Software | Code | 2
Fine-Grained Human Feedback Gives Better Rewards for Language Model Training | Code | 2
AutoFlow: Automated Workflow Generation for Large Language Model Agents | Code | 2
Automated Bioinformatics Analysis via AutoBA | Code | 2
With Greater Text Comes Greater Necessity: Inference-Time Training Helps Long Text Generation | Code | 2
Generative Pretrained Structured Transformers: Unsupervised Syntactic Language Models at Scale | Code | 2
Automated Evaluation of Retrieval-Augmented Language Models with Task-Specific Exam Generation | Code | 2
Generative Pre-trained Speech Language Model with Efficient Hierarchical Transformer | Code | 2
Generative Region-Language Pretraining for Open-Ended Object Detection | Code | 2
xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token | Code | 2
Generate rather than Retrieve: Large Language Models are Strong Context Generators | Code | 2
Generating Benchmarks for Factuality Evaluation of Language Models | Code | 2
FIRST: Faster Improved Listwise Reranking with Single Token Decoding | Code | 2
Generalized Few-shot 3D Point Cloud Segmentation with Vision-Language Model | Code | 2
A Training-free LLM-based Approach to General Chinese Character Error Correction | Code | 2
Generalized Interpolating Discrete Diffusion | Code | 2
Generative Modeling for Mathematical Discovery | Code | 2
GenSim: A General Social Simulation Platform with Large Language Model based Agents | Code | 2
Frontiers in Intelligent Colonoscopy | Code | 2
From Words to Numbers: Your Large Language Model Is Secretly A Capable Regressor When Given In-Context Examples | Code | 2
Full-Duplex-Bench: A Benchmark to Evaluate Full-duplex Spoken Dialogue Models on Turn-taking Capabilities | Code | 2
Asynchronous Large Language Model Enhanced Planner for Autonomous Driving | Code | 2
G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning | Code | 2
A Survey of Multimodal Large Language Model from A Data-centric Perspective | Code | 2
Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training | Code | 2
A Survey of Time Series Foundation Models: Generalizing Time Series Representation with Large Language Model | Code | 2
Formal Mathematics Statement Curriculum Learning | Code | 2
A Survey of Large Language Model Empowered Agents for Recommendation and Search: Towards Next-Generation Information Retrieval | Code | 2
AgentReview: Exploring Peer Review Dynamics with LLM Agents | Code | 2
AxBench: Steering LLMs? Even Simple Baselines Outperform Sparse Autoencoders | Code | 2
A Systematic Study of Cross-Layer KV Sharing for Efficient LLM Inference | Code | 2
Forgetting Transformer: Softmax Attention with a Forget Gate | Code | 2
AgentSims: An Open-Source Sandbox for Large Language Model Evaluation | Code | 2
A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models | Code | 2
AgentSociety Challenge: Designing LLM Agents for User Modeling and Recommendation on Web Platforms | Code | 2
A Touch, Vision, and Language Dataset for Multimodal Alignment | Code | 2
GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities | Code | 2
GeoChat: Grounded Large Vision-Language Model for Remote Sensing | Code | 2
GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest | Code | 2
Implicit Neural Representation for Cooperative Low-light Image Enhancement | Code | 2
Linear Transformers with Learnable Kernel Functions are Better In-Context Models | Code | 2
PhoneLM: an Efficient and Capable Small Language Model Family through Principled Pre-training | Code | 2
PAINT: Paying Attention to INformed Tokens to Mitigate Hallucination in Large Vision-Language Model | Code | 1
A comprehensive evaluation of ChatGPT's zero-shot Text-to-SQL capability | Code | 1
FIRE: Fact-checking with Iterative Retrieval and Verification | Code | 1
Masked Structural Growth for 2x Faster Language Model Pre-training | Code | 1
Page 21 of 284

No leaderboard results yet.