Overall - Test

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1–34 of 34 papers

Title	Date	Tasks	Status	Hype
WATT: Weight Average Test-Time Adaptation of CLIP	Jun 19, 2024	image-classificationImage Classification	CodeCode Available	2
Small Language Models Fine-tuned to Coordinate Larger Language Models improve Complex Reasoning	Oct 21, 2023	Overall - TestProblem Decomposition	CodeCode Available	1
Comparative study of deep learning methods for the automatic segmentation of lung, lesion and lesion type in CT scans of COVID-19 patients	Jul 29, 2020	Lesion SegmentationOverall - Test	CodeCode Available	1
Have LLMs Advanced Enough? A Challenging Problem Solving Benchmark For Large Language Models	May 24, 2023	Overall - Test	CodeCode Available	1
FreeLB: Enhanced Adversarial Training for Natural Language Understanding	Sep 25, 2019	ARCNatural Language Understanding	CodeCode Available	1
Using Interactive Feedback to Improve the Accuracy and Explainability of Question Answering Systems Post-Deployment	Apr 6, 2022	Overall - TestQuestion Answering	CodeCode Available	1
Amplifying Membership Exposure via Data Poisoning	Nov 1, 2022	Data PoisoningOverall - Test	CodeCode Available	1
Contraction Properties of the Global Workspace Primitive	Oct 2, 2023	Overall - Test	—Unverified	0
Cost-Saving LLM Cascades with Early Abstention	Feb 13, 2025	GSM8KMMLU	—Unverified	0
Fast and accurate classification of echocardiograms using deep learning	Jun 27, 2017	ClassificationDeep Learning	—Unverified	0
Fault Sneaking Attack: a Stealthy Framework for Misleading Deep Neural Networks	May 28, 2019	Overall - Test	—Unverified	0
GanDef: A GAN based Adversarial Training Defense for Neural Network Classifier	Mar 6, 2019	feature selectionOverall - Test	—Unverified	0
AI5GTest: AI-Driven Specification-Aware Automated Testing and Validation of 5G O-RAN Components	Jun 11, 2025	Overall - Test	—Unverified	0
mmID: High-Resolution mmWave Imaging for Human Identification	Feb 1, 2024	Activity RecognitionOverall - Test	—Unverified	0
Modeling speech emotion with label variance and analyzing performance across speakers and unseen acoustic conditions	Mar 24, 2025	Emotion RecognitionOverall - Test	—Unverified	0
Network two-sample test for block models	Jun 10, 2024	Graph MatchingOverall - Test	—Unverified	0
Optimal Layer Selection for Latent Data Augmentation	Aug 24, 2024	Data Augmentationimage-classification	—Unverified	0
Predicting the Outcome of Judicial Decisions made by the European Court of Human Rights	Dec 16, 2019	ArticlesBIG-bench Machine Learning	—Unverified	0
Gradual Learning: Optimizing Fine-Tuning with Partially Mastered Knowledge in Large Language Models	Oct 8, 2024	HallucinationOverall - Test	—Unverified	0
Application of DenseNet in Camera Model Identification and Post-processing Detection	May 27, 2019	Image ForensicsOverall - Test	—Unverified	0
Artificial Data Point Generation in Clustered Latent Space for Small Medical Datasets	Sep 26, 2024	Overall - TestSynthetic Data Generation	—Unverified	0
Attention Tree: Learning Hierarchies of Visual Features for Large-Scale Image Recognition	Aug 1, 2016	image-classificationImage Classification	—Unverified	0
Automated Human Cell Classification in Sparse Datasets using Few-Shot Learning	Jul 27, 2021	ClassificationFew-Shot Learning	—Unverified	0
Classifier Enhanced Deep Learning Model for Erythroblast Differentiation with Limited Data	Nov 23, 2024	DiagnosticOverall - Test	—Unverified	0
Constructing Open Cloze Tests Using Generation and Discrimination Capabilities of Transformers	Apr 14, 2022	Overall - TestRe-Ranking	—Unverified	0
Targeted Data Generation: Finding and Fixing Model Weaknesses	May 28, 2023	Data AugmentationNatural Language Inference	—Unverified	0
The Future of Software Testing: AI-Powered Test Case Generation and Validation	Sep 9, 2024	Overall - Testsoftware testing	—Unverified	0
Underage Detection through a Multi-Task and MultiAge Approach for Screening Minors in Unconstrained Imagery	Jun 12, 2025	Age EstimationOverall - Test	—Unverified	0
Unify and Triumph: Polyglot, Diverse, and Self-Consistent Generation of Unit Tests with LLMs	Mar 20, 2025	DiversityLarge Language Model	—Unverified	0
Solving the Same-Different Task with Convolutional Neural Networks	Jan 22, 2021	Overall - TestZero-shot Generalization	—Unverified	0
Localizing Open-Ontology QA Semantic Parsers in a Day Using Machine Translation	Oct 10, 2020	Machine TranslationNMT	CodeCode Available	0
Transferable Availability Poisoning Attacks	Oct 8, 2023	Contrastive LearningData Poisoning	CodeCode Available	0
Efficient Training of Deep Neural Operator Networks via Randomized Sampling	Sep 20, 2024	Overall - Test	CodeCode Available	0
Deep Modeling and Optimization of Medical Image Classification	May 29, 2025	AvgClassification	CodeCode Available	0

Show:10 25 50

No leaderboard results yet.