SOTAVerified

16k

Papers

Showing 51-75 of 146 papers

Title | Status | Hype
COUGH: A Challenge Dataset and Models for COVID-19 FAQ Retrieval | Code | 1
DeepDarts: Modeling Keypoints as Objects for Automatic Scorekeeping in Darts using a Single Camera | Code | 1
Analyzing the Effectiveness of Large Language Models on Text-to-SQL Synthesis | Code | 1
Towards Scalable Multi-domain Conversational Agents: The Schema-Guided Dialogue Dataset | Code | 1
Denial-of-Service Poisoning Attacks against Large Language Models | Code | 1
Scaling Laws of RoPE-based Extrapolation | Code | 1
X-LRM: X-ray Large Reconstruction Model for Extremely Sparse-View Computed Tomography Recovery in One Second | Code | 0
Achieving Scalable Robot Autonomy via neurosymbolic planning using lightweight local LLM | Code | 0
Author Profiling for Abuse Detection | Code | 0
BertRLFuzzer: A BERT and Reinforcement Learning Based Fuzzer | Code | 0
Calpric: Inclusive and Fine-grain Labeling of Privacy Policies with Crowdsourcing and Active Learning | Code | 0
CNNSum: Exploring Long-Context Summarization with Large Language Models in Chinese Novels | Code | 0
Code-Switching Red-Teaming: LLM Evaluation for Safety and Multilingual Understanding | Code | 0
Deep Learning for Detecting Cyberbullying Across Multiple Social Media Platforms | Code | 0
Deep Learning for Hate Speech Detection in Tweets | Code | 0
Detecting Offensive Language in Tweets Using Deep Learning | Code | 0
Extending Context Window of Large Language Models from a Distributional Perspective | Code | 0
FPT: Feature Prompt Tuning for Few-shot Readability Assessment | Code | 0
Hadiths Classification Using a Novel Author-Based Hadith Classification Dataset (ABCD) | Code | 0
How Far Are We from Optimal Reasoning Efficiency? | Code | 0
ImageNet Training in Minutes | Code | 0
KL3M Tokenizers: A Family of Domain-Specific and Character-Level Tokenizers for Legal, Financial, and Preprocessing Applications | Code | 0
Large-Scale Historical Watermark Recognition: dataset and a new consistency-based approach | Code | 0
Leolani: a reference machine with a theory of mind for social communication | Code | 0
Model Editing for LLMs4Code: How Far are We? | Code | 0
Page 3 of 6

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Suprime21'"1 | | | | Unverified