SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 2145121500 of 474278 papers

TitleStatusHype
Rethinking HTG Evaluation: Bridging Generation and RecognitionCode1
Exploring Low-Dimensional Subspaces in Diffusion Models for Controllable Image EditingCode1
"Yes, My LoRD." Guiding Language Model Extraction with Locality Reinforced DistillationCode1
Explainable AI for computational pathology identifies model limitations and tissue biomarkersCode1
iRangeGraph: Improvising Range-dedicated Graphs for Range-filtering Nearest Neighbor SearchCode1
HiPrompt: Tuning-free Higher-Resolution Generation with Hierarchical MLLM PromptsCode1
NUDGE: Lightweight Non-Parametric Fine-Tuning of Embeddings for RetrievalCode1
RTLRewriter: Methodologies for Large Models aided RTL Code OptimizationCode1
NESTFUL: A Benchmark for Evaluating LLMs on Nested Sequences of API CallsCode1
Evaluation Study on SAM 2 for Class-agnostic Instance-level SegmentationCode1
TASAR: Transfer-based Attack on Skeletal Action RecognitionCode1
RouterRetriever: Routing over a Mixture of Expert Embedding ModelsCode1
Topological Methods in Machine Learning: A Tutorial for PractitionersCode1
Snapshot: Towards Application-centered Models for Pedestrian Trajectory Prediction in Urban Traffic EnvironmentsCode1
FC-KAN: Function Combinations in Kolmogorov-Arnold NetworksCode1
Towards Real-World Adverse Weather Image Restoration: Enhancing Clearness and Semantics with Vision-Language ModelsCode1
LSTMSE-Net: Long Short Term Speech Enhancement Network for Audio-visual Speech EnhancementCode1
UNSURE: self-supervised learning with Unknown Noise level and Stein's Unbiased Risk EstimateCode1
Frequency-Spatial Entanglement Learning for Camouflaged Object DetectionCode1
FuzzCoder: Byte-level Fuzzing Test via Large Language ModelCode1
Designing Large Foundation Models for Efficient Training and Inference: A SurveyCode1
EvoChart: A Benchmark and a Self-Training Approach Towards Real-World Chart UnderstandingCode1
VProChart: Answering Chart Question through Visual Perception Alignment Agent and Programmatic Solution ReasoningCode1
PMLBmini: A Tabular Classification Benchmark Suite for Data-Scarce ApplicationsCode1
LUK: Empowering Log Understanding with Expert Knowledge from Large Language ModelsCode1
Generative Principal Component Regression via Variational InferenceCode1
Decoding finger velocity from cortical spike trains with recurrent spiking neural networksCode1
Training on the Benchmark Is Not All You NeedCode1
Map-Assisted Remote-Sensing Image Compression at Extremely Low BitratesCode1
LongGenBench: Benchmarking Long-Form Generation in Long Context LLMsCode1
Booster: Tackling Harmful Fine-tuning for Large Language Models via Attenuating Harmful PerturbationCode1
SFA-Net: Semantic Feature Adjustment Network for Remote Sensing Image SegmentationCode1
Unveiling Advanced Frequency Disentanglement Paradigm for Low-Light Image EnhancementCode1
GeoBEV: Learning Geometric BEV Representation for Multi-view 3D Object DetectionCode1
Latent Distillation for Continual Object Detection at the EdgeCode1
Early Design Exploration of Aerospace Systems Using Assume-Guarantee ContractsCode1
Mahalanobis Distance-based Multi-view Optimal Transport for Multi-view Crowd LocalizationCode1
SPiKE: 3D Human Pose from Point Cloud SequencesCode1
CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Through Corpus Retrieval and AugmentationCode1
What are the Essential Factors in Crafting Effective Long Context Multi-Hop Instruction Datasets? Insights and Best PracticesCode1
CLIBE: Detecting Dynamic Backdoors in Transformer-based NLP ModelsCode1
Real-Time Recurrent Learning using Trace Units in Reinforcement LearningCode1
Towards Student Actions in Classroom Scenes: New Dataset and BaselineCode1
AMG: Avatar Motion Guided Video GenerationCode1
Co-Learning: Code Learning for Multi-Agent Reinforcement Collaborative Framework with Conversational Natural Language InterfacesCode1
Solving Integrated Process Planning and Scheduling Problem via Graph Neural Network Based Deep Reinforcement LearningCode1
Recoverable Compression: A Multimodal Vision Token Recovery Mechanism Guided by Text InformationCode1
Diffusion-Driven Data Replay: A Novel Approach to Combat Forgetting in Federated Class Continual LearningCode1
The Compressor-Retriever Architecture for Language Model OSCode1
Prompt Compression with Context-Aware Sentence Encoding for Fast and Improved LLM InferenceCode1
Show:102550
← PrevPage 430 of 9486Next →