SOTAVerified

Language Modeling

Papers

Showing 801-825 of 14182 papers

Title | Status | Hype
Improve Vision Language Model Chain-of-thought Reasoning | Code | 2
On the Road with GPT-4V(ision): Early Explorations of Visual-Language Model on Autonomous Driving | Code | 2
GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest | Code | 2
GOFA: A Generative One-For-All Model for Joint Graph Language Modeling | Code | 2
OpenOmni: Large Language Models Pivot Zero-shot Omnimodal Alignment across Language with Real-time Self-Aware Emotional Speech Synthesis | Code | 2
Open Source Planning & Control System with Language Agents for Autonomous Scientific Discovery | Code | 2
GODEL: Large-Scale Pre-Training for Goal-Directed Dialog | Code | 2
GoLLIE: Annotation Guidelines improve Zero-Shot Information-Extraction | Code | 2
GPT4Tools: Teaching Large Language Model to Use Tools via Self-instruction | Code | 2
OR-LLM-Agent: Automating Modeling and Solving of Operations Research Optimization Problem with Reasoning Large Language Model | Code | 2
GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmentation | Code | 2
Behavior Trees Enable Structured Programming of Language Model Agents | Code | 2
CVE-Bench: A Benchmark for AI Agents' Ability to Exploit Real-World Web Application Vulnerabilities | Code | 2
PACT: Pruning and Clustering-Based Token Reduction for Faster Visual Language Models | Code | 2
GMAI-VL & GMAI-VL-5.5M: A Large Vision-Language Model and A Comprehensive Multimodal Dataset Towards General Medical AI | Code | 2
Behind Maya: Building a Multilingual Vision Language Model | Code | 2
AdaFisher: Adaptive Second Order Optimization via Fisher Information | Code | 2
GPT Can Solve Mathematical Problems Without a Calculator | Code | 2
Data Mixing Laws: Optimizing Data Mixtures by Predicting Language Modeling Performance | Code | 2
Beyond Next Token Prediction: Patch-Level Training for Large Language Models | Code | 2
BBT-Fin: Comprehensive Construction of Chinese Financial Domain Pre-trained Language Model, Corpus and Benchmark | Code | 2
AdaBelief Optimizer: Adapting Stepsizes by the Belief in Observed Gradients | Code | 2
GeoVision Labeler: Zero-Shot Geospatial Classification with Vision and Language Models | Code | 2
DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing | Code | 2
BatGPT: A Bidirectional Autoregessive Talker from Generative Pre-trained Transformer | Code | 2

No leaderboard results yet.