SOTAVerified

16k

Papers

Showing 76–100 of 146 papers

Title | Status | Hype
FalseReject: A Resource for Improving Contextual Safety and Mitigating Over-Refusals in LLMs via Structured Reasoning | | 0
Fast and Full-Resolution Light Field Deblurring using a Deep Neural Network | | 0
GraphReader: Building Graph-based Agent to Enhance Long-Context Abilities of Large Language Models | | 0
Human Evaluation of English–Irish Transformer-Based NMT | | 0
Improved prompting and process for writing user personas with LLMs, using qualitative interviews: Capturing behaviour and personality traits of users | | 0
Improved two-stage hate speech classification for twitter based on Deep Neural Networks | | 0
Inferring Pluggable Types with Machine Learning | | 0
Introducing RezoJDM16k: a French KnowledgeGraph DataSet for Link Prediction | | 0
Joint Summarization of Large-scale Collections of Web Images and Videos for Storyline Reconstruction | | 0
Large Batch Training of Convolutional Networks with Layer-wise Adaptive Rate Scaling | | 0
Leveraging Summary Guidance on Medical Report Summarization | | 0
Long Context Alignment with Short Instructions and Synthesized Positions | | 0
LongIns: A Challenging Long-context Instruction-based Exam for LLMs | | 0
Long Range Arena: A Benchmark for Efficient Transformers | | 0
Multilingual Visual Sentiment Concept Matching | | 0
MVReward: Better Aligning and Evaluating Multi-View Diffusion Models with Human Preferences | | 0
NSF-SciFy: Mining the NSF Awards Database for Scientific Claims | | 0
One-shot Ultra-high-Resolution Generative Adversarial Network That Synthesizes 16K Images On A Single GPU | | 0
Parallel Sequence Modeling via Generalized Spatial Propagation Network | | 0
Piecing It All Together: Verifying Multi-Hop Multimodal Claims | | 0
Recursively Summarizing Enables Long-Term Dialogue Memory in Large Language Models | | 0
Retrieval meets Long Context Large Language Models | | 0
Robustness Evaluation of Entity Disambiguation Using Prior Probes: the Case of Entity Overshadowing | | 0
0/1 Deep Neural Networks via Block Coordinate Descent | | 0
Scaling Distributed Training with Adaptive Summation | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1Suprime21'"1Unverified