SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 88018850 of 177340 papers

TitleStatusHype
GC4NC: A Benchmark Framework for Graph Condensation on Node Classification with New InsightsCode2
Dynamic Gaussian Marbles for Novel View Synthesis of Casual Monocular VideosCode2
RegMix: Data Mixture as Regression for Language Model Pre-trainingCode2
Pretraining End-to-End Keyword Search with Automatically Discovered Acoustic UnitsCode2
Learning Formal Mathematics From Intrinsic MotivationCode2
Solving Motion Planning Tasks with a Scalable Generative ModelCode2
Isomorphic Pruning for Vision ModelsCode2
Benchmarking Complex Instruction-Following with Multiple Constraints CompositionCode2
MiniGPT-Med: Large Language Model as a General Interface for Radiology DiagnosisCode2
TASTE: Text-Aligned Speech Tokenization and Embedding for Spoken Language ModelingCode2
Trainable Fractional Fourier TransformCode2
IRSAM: Advancing Segment Anything Model for Infrared Small Target DetectionCode2
ColorPeel: Color Prompt Learning with Diffusion Models via Color and Shape DisentanglementCode2
AccDiffusion: An Accurate Method for Higher-Resolution Image GenerationCode2
PARE-Net: Position-Aware Rotation-Equivariant Networks for Robust Point Cloud RegistrationCode2
iHuman: Instant Animatable Digital Humans From Monocular VideosCode2
From GaLore to WeLore: How Low-Rank Weights Non-uniformly Emerge from Low-Rank GradientsCode2
GroupMamba: Efficient Group-Based Visual State Space ModelCode2
AutoFlow: Automated Workflow Generation for Large Language Model AgentsCode2
Discover-then-Name: Task-Agnostic Concept Bottlenecks via Automated Concept DiscoveryCode2
RealViformer: Investigating Attention for Real-World Video Super-ResolutionCode2
MetaMath: Bootstrap Your Own Mathematical Questions for Large Language ModelsCode2
LoRA-Pro: Are Low-Rank Adapters Properly Optimized?Code2
NAVIX: Scaling MiniGrid Environments with JAXCode2
Cross-Layer Feature Pyramid Transformer for Small Object Detection in Aerial ImagesCode2
MotionCraft: Crafting Whole-Body Motion with Plug-and-Play Multimodal ControlsCode2
WalkTheDog: Cross-Morphology Motion Alignment via Phase ManifoldsCode2
Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon TasksCode2
CAS-ViT: Convolutional Additive Self-attention Vision Transformers for Efficient Mobile ApplicationsCode2
SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language ModelsCode2
Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in AlignmentCode2
AgentCourt: Simulating Court with Adversarial Evolvable Lawyer AgentsCode2
Accelerating Giant Impact Simulations with Machine LearningCode2
GR-MG: Leveraging Partially Annotated Data via Multi-Modal Goal-Conditioned PolicyCode2
UTrack: Multi-Object Tracking with Uncertain DetectionsCode2
AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation ExtractionCode2
PlantSeg: A Large-Scale In-the-wild Dataset for Plant Disease SegmentationCode2
A Survey on Mixup Augmentations and BeyondCode2
PiEEG-16 to Measure 16 EEG Channels with Raspberry Pi for Brain-Computer Interfaces and EEG devicesCode2
The CMA Evolution Strategy: A TutorialCode2
Scaling Smart: Accelerating Large Language Model Pre-training with Small Model InitializationCode2
MOSS: Enabling Code-Driven Evolution and Context Management for AI AgentsCode2
A Survey on the Honesty of Large Language ModelsCode2
Robot See Robot Do: Imitating Articulated Object Manipulation with Monocular 4D ReconstructionCode2
Spiking Transformer with Spatial-Temporal AttentionCode2
Brain-JEPA: Brain Dynamics Foundation Model with Gradient Positioning and Spatiotemporal MaskingCode2
Codev-Bench: How Do LLMs Understand Developer-Centric Code Completion?Code2
PointAD: Comprehending 3D Anomalies from Points and Pixels for Zero-shot 3D Anomaly DetectionCode2
End-to-end Piano Performance-MIDI to Score Conversion with TransformersCode2
Mamba in Vision: A Comprehensive Survey of Techniques and ApplicationsCode2
Show:102550
← PrevPage 177 of 3547Next →