SOTAVerified

Language Modeling

Papers

Showing 351-400 of 14182 papers

Title | Status | Hype
GuardT2I: Defending Text-to-Image Models from Adversarial Prompts | Code | 3
IntactKV: Improving Large Language Model Quantization by Keeping Pivot Tokens Intact | Code | 3
OpenGraph: Towards Open Graph Foundation Models | Code | 3
RiNALMo: General-Purpose RNA Language Models Can Generalize Well on Structure Prediction Tasks | Code | 3
ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL | Code | 3
Diffusion Language Models Are Versatile Protein Learners | Code | 3
SongComposer: A Large Language Model for Lyric and Melody Generation in Song Composition | Code | 3
ShapeLLM: Universal 3D Object Understanding for Embodied Interaction | Code | 3
Cleaner Pretraining Corpus Curation with Neural Web Scraping | Code | 3
Towards Building Multilingual Language Model for Medicine | Code | 3
Query-Based Adversarial Prompt Generation | Code | 3
OptiMUS: Scalable Optimization Modeling with (MI)LP Solvers and Large Language Models | Code | 3
VerMCTS: Synthesizing Multi-Step Programs using a Verifier, a Large Language Model, and Tree Search | Code | 3
Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models | Code | 3
Knowledge Graphs Meet Multi-Modal Learning: A Comprehensive Survey | Code | 3
Editable Scene Simulation for Autonomous Driving via Collaborative LLM-Agents | Code | 3
AnyTool: Self-Reflective, Hierarchical Agents for Large-Scale API Calls | Code | 3
Can Mamba Learn How to Learn? A Comparative Study on In-Context Learning Tasks | Code | 3
BlackMamba: Mixture of Experts for State-Space Models | Code | 3
Evaluating Language Model Agency through Negotiations | Code | 3
Evolution of Heuristics: Towards Efficient Automatic Algorithm Design Using Large Language Model | Code | 3
LLaVA-Phi: Efficient Multi-Modal Assistant with Small Language Model | Code | 3
TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones | Code | 3
MobileVLM : A Fast, Strong and Open Vision Language Assistant for Mobile Devices | Code | 3
SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling | Code | 3
Iterative Preference Learning from Human Feedback: Bridging Theory and Practice for RLHF under KL-Constraint | Code | 3
Generalized Robot 3D Vision-Language Model with Fast Rendering and Pre-Training Vision-Language Alignment | Code | 3
Taiwan LLM: Bridging the Linguistic Divide with a Culturally Aligned Language Model | Code | 3
Language Model Inversion | Code | 3
Large Language Model based Long-tail Query Rewriting in Taobao Search | Code | 3
Skywork: A More Open Bilingual Foundation Model | Code | 3
SkyMath: Technical Report | Code | 3
Llemma: An Open Language Model For Mathematics | Code | 3
OceanGPT: A Large Language Model for Ocean Science Tasks | Code | 3
Data Filtering Networks | Code | 3
BTLM-3B-8K: 7B Parameter Performance in a 3B Parameter Model | Code | 3
Retentive Network: A Successor to Transformer for Large Language Models | Code | 3
MotionGPT: Human Motion as a Foreign Language | Code | 3
Macaw-LLM: Multi-Modal Language Modeling with Image, Audio, Video, and Text Integration | Code | 3
WebGLM: Towards An Efficient Web-Enhanced Question Answering System with Human Preferences | Code | 3
HuatuoGPT, towards Taming Language Model to Be a Doctor | Code | 3
Hierarchical Prompting Assists Large Language Model on Web Navigation | Code | 3
WikiChat: Stopping the Hallucination of Large Language Model Chatbots by Few-Shot Grounding on Wikipedia | Code | 3
Self-QA: Unsupervised Knowledge Guided Language Model Alignment | Code | 3
SpeechGPT: Empowering Large Language Models with Intrinsic Cross-Modal Conversational Abilities | Code | 3
SpecInfer: Accelerating Generative Large Language Model Serving with Tree-based Speculative Inference and Verification | Code | 3
MultiModal-GPT: A Vision and Language Model for Dialogue with Humans | Code | 3
REPLUG: Retrieval-Augmented Black-Box Language Models | Code | 3
ThoughtSource: A central hub for large language model reasoning data | Code | 3
Cramming: Training a Language Model on a Single GPU in One Day | Code | 3

No leaderboard results yet.