SOTAVerified

Ethics

Papers

Showing 110 of 832 papers

TitleStatusHype
RAFT: Reward rAnked FineTuning for Generative Foundation Model AlignmentCode5
TrustLLM: Trustworthiness in Large Language ModelsCode4
How Can Recommender Systems Benefit from Large Language Models: A SurveyCode3
A Survey on Evaluation of Large Language ModelsCode3
Visual Large Language Models for Generalized and Specialized ApplicationsCode3
GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via CipherCode2
On the State of NLP Approaches to Modeling Depression in Social Media: A Post-COVID-19 OutlookCode2
Data-Centric Foundation Models in Computational Healthcare: A SurveyCode2
Aligning AI With Shared Human ValuesCode2
A Survey on Large Language Models for Critical Societal Domains: Finance, Healthcare, and LawCode2
Show:102550
← PrevPage 1 of 84Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1RuGPT-3 LargeAccuracy68.6Unverified
2RuGPT-3 MeduimAccuracy68.3Unverified
3RuGPT-3 SmallAccuracy55.5Unverified
4Human benchmarkAccuracy52.9Unverified
#ModelMetricClaimedVerifiedStatus
1Human benchmarkAccuracy67.6Unverified
2RuGPT-3 SmallAccuracy60.9Unverified
3RuGPT-3 LargeAccuracy44.9Unverified
4RuGPT-3 MediumAccuracy44.1Unverified