SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers, 248,326 code links, 4,818 tasks

Papers

Showing 10351–10400 of 661570 papers

Title | Status | Hype
Making Large Language Models Perform Better in Knowledge Graph Completion | Code | 2
SSCBench: A Large-Scale 3D Semantic Scene Completion Benchmark for Autonomous Driving | Code | 2
SUNet: Swin Transformer UNet for Image Denoising | Code | 2
GSM-Infinite: How Do Your LLMs Behave over Infinitely Increasing Context Length and Reasoning Complexity? | Code | 2
SinNeRF: Training Neural Radiance Fields on Complex Scenes from a Single Image | Code | 2
Towards Knowledge-driven Autonomous Driving | Code | 2
Ring Attention with Blockwise Transformers for Near-Infinite Context | Code | 2
Interactive Evolution: A Neural-Symbolic Self-Training Framework For Large Language Models | Code | 2
TokenSHAP: Interpreting Large Language Models with Monte Carlo Shapley Value Estimation | Code | 2
Language models scale reliably with over-training and on downstream tasks | Code | 2
Large Language Model Psychometrics: A Systematic Review of Evaluation, Validation, and Enhancement | Code | 2
Editing Language Model-based Knowledge Graph Embeddings | Code | 2
Exploring the Roles of Large Language Models in Reshaping Transportation Systems: A Survey, Framework, and Roadmap | Code | 2
STAMP: Scalable Task And Model-agnostic Collaborative Perception | Code | 2
Dual Diffusion Implicit Bridges for Image-to-Image Translation | Code | 2
PartGS: Learning Part-aware 3D Representations by Fusing 2D Gaussians and Superquadrics | Code | 2
SAFDNet: A Simple and Effective Network for Fully Sparse 3D Object Detection | Code | 2
Diffusion Model as a Noise-Aware Latent Reward Model for Step-Level Preference Optimization | Code | 2
Simple Online and Realtime Tracking | Code | 2
Forecasting Global Weather with Graph Neural Networks | Code | 2
Towards Generating Realistic 3D Semantic Training Data for Autonomous Driving | Code | 2
Learning representations of learning representations | Code | 2
DanmakuTPPBench: A Multi-modal Benchmark for Temporal Point Process Modeling and Understanding | Code | 2
Non-stationary Diffusion For Probabilistic Time Series Forecasting | Code | 2
Rethinking Efficient Lane Detection via Curve Modeling | Code | 2
Generative Auto-Bidding with Value-Guided Explorations | Code | 2
MonoCD: Monocular 3D Object Detection with Complementary Depths | Code | 2
ChatTime: A Unified Multimodal Time Series Foundation Model Bridging Numerical and Textual Data | Code | 2
Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matching | Code | 2
OSSO: Obtaining Skeletal Shape from Outside | Code | 2
Composed Video Retrieval via Enriched Context and Discriminative Embeddings | Code | 2
Improved Few-Shot Jailbreaking Can Circumvent Aligned Language Models and Their Defenses | Code | 2
BRIO: Bringing Order to Abstractive Summarization | Code | 2
Towards Measuring and Modeling "Culture" in LLMs: A Survey | Code | 2
Vript: A Video Is Worth Thousands of Words | Code | 2
COLD-Attack: Jailbreaking LLMs with Stealthiness and Controllability | Code | 2
Tensor-Var: Variational Data Assimilation in Tensor Product Feature Space | Code | 2
CleanDIFT: Diffusion Features without Noise | Code | 2
ETCH: Generalizing Body Fitting to Clothed Humans via Equivariant Tightness | Code | 2
CAnDOIT: Causal Discovery with Observational and Interventional Data from Time-Series | Code | 2
GS-LiDAR: Generating Realistic LiDAR Point Clouds with Panoramic Gaussian Splatting | Code | 2
Towards Extreme Image Compression with Latent Feature Guidance and Diffusion Prior | Code | 2
SRFormerV2: Taking a Closer Look at Permuted Self-Attention for Image Super-Resolution | Code | 2
ToolAlpaca: Generalized Tool Learning for Language Models with 3000 Simulated Cases | Code | 2
Train Small, Infer Large: Memory-Efficient LoRA Training for Large Language Models | Code | 2
Next Best Sense: Guiding Vision and Touch with FisherRF for 3D Gaussian Splatting | Code | 2
Dynamic Pre-training: Towards Efficient and Scalable All-in-One Image Restoration | Code | 2
Towards Training-free Anomaly Detection with Vision and Language Foundation Models | Code | 2
Long Video Generation with Time-Agnostic VQGAN and Time-Sensitive Transformer | Code | 2
LLM As DBA | Code | 2
Page 208 of 13232