SOTAVerified
16k
Papers
Showing 21–30 of 146 papers
| Title | Date | Tasks | Status | Hype |
|---|---|---|---|---|
| Parallel Sequence Modeling via Generalized Spatial Propagation Network | Jan 21, 2025 | 16k, Computational Efficiency | Unverified | 0 |
| Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key | Jan 16, 2025 | 16k, Hallucination | Code Available | 2 |
| Depression and Anxiety Prediction Using Deep Language Models and Transfer Learning | Dec 30, 2024 | 16k, Binary Classification | Unverified | 0 |
| SparseAccelerate: Efficient Long-Context Inference for Mid-Range GPUs | Dec 9, 2024 | 16k | Unverified | 0 |
| MVReward: Better Aligning and Evaluating Multi-View Diffusion Models with Human Preferences | Dec 9, 2024 | 16k | Unverified | 0 |
| CNNSum: Exploring Long-Context Summarization with Large Language Models in Chinese Novels | Dec 3, 2024 | 16k | Code Available | 0 |
| Bimanual Dexterity for Complex Tasks | Nov 20, 2024 | 16k | Unverified | 0 |
| Piecing It All Together: Verifying Multi-Hop Multimodal Claims | Nov 14, 2024 | 16k, All | Unverified | 0 |
| Retrieval or Global Context Understanding? On Many-Shot In-Context Learning for Long-Context Evaluation | Nov 11, 2024 | 16k, Benchmarking | Code Available | 0 |
| Model Editing for LLMs4Code: How Far are We? | Nov 11, 2024 | 16k, Code Generation | Code Available | 0 |
Page 3 of 15
Benchmark Results
ConceptNet
1 submission
↑ higher is better
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Suprime2 | 1'" | 1 | — | Unverified |