SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 35013550 of 177340 papers

TitleStatusHype
DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation ModelsCode3
Exploring the Feasibility of Automated Data Standardization using Large Language Models for Seamless PositioningCode3
Quest: Query-Aware Sparsity for Efficient Long-Context LLM InferenceCode3
MegaBlocks: Efficient Sparse Training with Mixture-of-ExpertsCode3
When LLMs are Unfit Use FastFit: Fast and Effective Text Classification with Many ClassesCode3
Blind Image Restoration via Fast Diffusion InversionCode3
An Investigation of Incorporating Mamba for Speech EnhancementCode3
Nuclei instance segmentation and classification in histopathology images with StarDistCode3
syftr: Pareto-Optimal Generative AICode3
Towards Urban General Intelligence: A Review and Outlook of Urban Foundation ModelsCode3
Tool-Star: Empowering LLM-Brained Multi-Tool Reasoner via Reinforcement LearningCode3
Taskflow: A Lightweight Parallel and Heterogeneous Task Graph Computing SystemCode3
Accelerating Greedy Coordinate Gradient and General Prompt Optimization via Probe SamplingCode3
Unbiased Estimator for Distorted Conics in Camera CalibrationCode3
360Zhinao Technical ReportCode3
LiteGS: A High-Performance Modular Framework for Gaussian Splatting TrainingCode3
E5-V: Universal Embeddings with Multimodal Large Language ModelsCode3
Affordance-based Robot Manipulation with Flow MatchingCode3
Harnessing the Universal Geometry of EmbeddingsCode3
FinRL: A Deep Reinforcement Learning Library for Automated Stock Trading in Quantitative FinanceCode3
On the Use and Misuse of Absorbing States in Multi-agent Reinforcement LearningCode3
RAGEval: Scenario Specific RAG Evaluation Dataset Generation FrameworkCode3
Patches Are All You Need?Code3
Cascade Prompt Learning for Vision-Language Model AdaptationCode3
Relational Multi-Task Learning: Modeling Relations between Data and TasksCode3
Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-TuningCode3
Deciphering Oracle Bone Language with Diffusion ModelsCode3
MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout GuidanceCode3
Robust Watermarking Using Generative Priors Against Image Editing: From Benchmarking to AdvancesCode3
LLaVA-MoD: Making LLaVA Tiny via MoE Knowledge DistillationCode3
Deep Photo Style TransferCode3
Anatomically-Controllable Medical Image Generation with Segmentation-Guided Diffusion ModelsCode3
Generalized Decoding for Pixel, Image, and LanguageCode3
Jailbreaking Leading Safety-Aligned LLMs with Simple Adaptive AttacksCode3
SAM Fails to Segment Anything? -- SAM-Adapter: Adapting SAM in Underperformed Scenes: Camouflage, Shadow, Medical Image Segmentation, and MoreCode3
Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice CloningCode3
StyleGaussian: Instant 3D Style Transfer with Gaussian SplattingCode3
GaMeS: Mesh-Based Adapting and Modification of Gaussian SplattingCode3
REPLUG: Retrieval-Augmented Black-Box Language ModelsCode3
Query-Based Adversarial Prompt GenerationCode3
GRAG: Graph Retrieval-Augmented GenerationCode3
Conformer: Convolution-augmented Transformer for Speech RecognitionCode3
Producing and Leveraging Online Map Uncertainty in Trajectory PredictionCode3
Efficient Inference for Large Reasoning Models: A SurveyCode3
LangFair: A Python Package for Assessing Bias and Fairness in Large Language Model Use CasesCode3
CycleNet: Enhancing Time Series Forecasting through Modeling Periodic PatternsCode3
RF-Diffusion: Radio Signal Generation via Time-Frequency DiffusionCode3
DiLightNet: Fine-grained Lighting Control for Diffusion-based Image GenerationCode3
EXP-Bench: Can AI Conduct AI Research Experiments?Code3
CompSLAM: Complementary Hierarchical Multi-Modal Localization and Mapping for Robot Autonomy in Underground EnvironmentsCode3
Show:102550
← PrevPage 71 of 3547Next →