SOTAVerified

Domain Generalization

Domain Generalization aims to learn, from one or more training domains, a domain-agnostic model that can be applied to an unseen domain.

Source: Diagram Image Retrieval using Sketch-Based Deep Learning and Transfer Learning
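A minimal sketch of how this setting is often evaluated: hold out one domain, train on the rest, test on the held-out domain, and average over all choices. The domain names, the toy one-dimensional data, and the nearest-centroid classifier below are illustrative assumptions, not taken from the source or from any listed paper.

```python
from statistics import mean


def fit_centroids(samples):
    """Fit a nearest-centroid classifier: one mean feature value per label."""
    by_label = {}
    for x, y in samples:
        by_label.setdefault(y, []).append(x)
    return {label: mean(xs) for label, xs in by_label.items()}


def predict(centroids, x):
    """Return the label whose centroid is closest to x."""
    return min(centroids, key=lambda label: abs(centroids[label] - x))


def accuracy(centroids, samples):
    """Fraction of samples whose predicted label matches the true label."""
    correct = sum(predict(centroids, x) == y for x, y in samples)
    return correct / len(samples)


def leave_one_domain_out(domains):
    """Train on all domains but one, test on the held-out one, repeat for each."""
    scores = {}
    for held_out in domains:
        train = [s for name, data in domains.items() if name != held_out for s in data]
        model = fit_centroids(train)  # the model never sees the held-out domain
        scores[held_out] = accuracy(model, domains[held_out])
    return scores, mean(list(scores.values()))


# Hypothetical toy data: each domain is a list of (feature, label) pairs.
domains = {
    "photo":   [(0.1, 0), (0.2, 0), (0.9, 1), (1.0, 1)],
    "sketch":  [(0.3, 0), (0.4, 0), (1.1, 1), (1.2, 1)],
    "cartoon": [(0.0, 0), (0.7, 0), (0.8, 1), (1.3, 1)],
}
scores, average = leave_one_domain_out(domains)
print(scores, average)
```

In this toy setup the "cartoon" domain contains one sample that falls on the wrong side of the boundary learned from the other two domains, so its held-out accuracy is lower than the others. That gap is the kind of domain shift the methods listed below try to reduce.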

Papers

Showing 51–100 of 1751 papers

Title | Status | Hype
A Survey on Domain Generalization for Medical Image Analysis | Code | 2
Adversarial Supervision Makes Layout-to-Image Diffusion Models Thrive | Code | 2
Singer Identity Representation Learning using Self-Supervised Techniques | Code | 2
MMA: Multi-Modal Adapter for Vision-Language Models | Code | 2
Stronger, Fewer, & Superior: Harnessing Vision Foundation Models for Domain Generalized Semantic Segmentation | Code | 2
TransNeXt: Robust Foveal Visual Perception for Vision Transformers | Code | 2
ReSimAD: Zero-Shot 3D Domain Transfer for Autonomous Driving with Source Reconstruction and Target Simulation | Code | 2
DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion Models | Code | 2
One-for-All: Generalized LoRA for Parameter-Efficient Fine-tuning | Code | 2
TALLRec: An Effective and Efficient Tuning Framework to Align Large Language Model with Recommendation | Code | 2
EasyPortrait -- Face Parsing and Portrait Segmentation Dataset | Code | 2
Domain Adaptive and Generalizable Network Architectures and Training Strategies for Semantic Image Segmentation | Code | 2
Your Diffusion Model is Secretly a Zero-Shot Classifier | Code | 2
GLOBEM Dataset: Multi-Year Datasets for Longitudinal Human Behavior Modeling Generalization | Code | 2
Generalized Parametric Contrastive Learning | Code | 2
On-Device Domain Generalization | Code | 2
T-NER: An All-Round Python Library for Transformer-based Named Entity Recognition | Code | 2
Depth Field Networks for Generalizable Multi-view Scene Representation | Code | 2
SyntheX: Scaling Up Learning-based X-ray Image Analysis Through In Silico Experiments | Code | 2
Referring Image Matting | Code | 2
Understanding The Robustness in Vision Transformers | Code | 2
Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time | Code | 2
BatchFormer: Learning to Explore Sample Relationships for Robust Representation Learning | Code | 2
Pedestrian Detection: Domain Generalization, CNNs, Transformers and Beyond | Code | 2
Learning to Prompt for Vision-Language Models | Code | 2
Building Computationally Efficient and Well-Generalizing Person Re-Identification Models with Metric Learning | Code | 2
On the limits of cross-domain generalization in automated X-ray prediction | Code | 2
RandAugment: Practical automated data augmentation with a reduced search space | Code | 2
Benchmarking Neural Network Robustness to Common Corruptions and Perturbations | Code | 2
InstructFLIP: Exploring Unified Vision-Language Model for Face Anti-spoofing | Code | 1
Prompt-Free Conditional Diffusion for Multi-object Image Augmentation | Code | 1
Boosting Domain Generalized and Adaptive Detection with Diffusion Models: Fitness, Generalization, and Transferability | Code | 1
Domain Generalization for Person Re-identification: A Survey Towards Domain-Agnostic Person Matching | Code | 1
Leveraging Depth and Language for Open-Vocabulary Domain-Generalized Semantic Segmentation | Code | 1
LangDAug: Langevin Data Augmentation for Multi-Source Domain Generalization in Medical Image Segmentation | Code | 1
Panoramic Out-of-Distribution Segmentation | Code | 1
A Sensor Agnostic Domain Generalization Framework for Leveraging Geospatial Foundation Models: Enhancing Semantic Segmentation via Synergistic Pseudo-Labeling and Generative Learning | Code | 1
Mamba-Sea: A Mamba-based Framework with Global-to-Local Sequence Augmentation for Generalizable Medical Image Segmentation | Code | 1
RT-DATR: Real-time Unsupervised Domain Adaptive Detection Transformer with Adversarial Feature Learning | Code | 1
Unlocking the Potential of Unlabeled Data in Semi-Supervised Domain Generalization | Code | 1
Robust Object Detection of Underwater Robot based on Domain Generalization | Code | 1
Generalized Diffusion Detector: Mining Robust Features from Diffusion Models for Domain-Generalized Detection | Code | 1
Gradient-Guided Annealing for Domain Generalization | Code | 1
PromptMID: Modal Invariant Descriptors Based on Diffusion and Vision Foundation Models for Optical-SAR Image Matching | Code | 1
Robust Adaptation of Large Multimodal Models for Retrieval Augmented Hateful Meme Detection | Code | 1
RadioLLM: Introducing Large Language Model into Cognitive Radio via Hybrid Prompt and Token Reprogrammings | Code | 1
3DLabelProp: Geometric-Driven Domain Generalization for LiDAR Semantic Segmentation in Autonomous Driving | Code | 1
SSHNet: Unsupervised Cross-modal Homography Estimation via Problem Reformulation and Split Optimization | Code | 1
Task-Aware Clustering for Prompting Vision-Language Models | Code | 1
PhysAug: A Physical-guided and Frequency-based Data Augmentation for Single-Domain Generalized Object Detection | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | SIMPLE+ | Average Accuracy | 99 | | Unverified
2 | PromptStyler (CLIP, ViT-L/14) | Average Accuracy | 98.6 | | Unverified
3 | GMDG (RegNetY-16GF, SWAD) | Average Accuracy | 97.9 | | Unverified
4 | D-Triplet (RegNetY-16GF) | Average Accuracy | 97.6 | | Unverified
5 | MoA (OpenCLIP, ViT-B/16) | Average Accuracy | 97.4 | | Unverified
6 | GMDG (e RegNetY-16GF) | Average Accuracy | 97.3 | | Unverified
7 | PromptStyler (CLIP, ViT-B/16) | Average Accuracy | 97.2 | | Unverified
8 | SPG (CLIP, ViT-B/16) | Average Accuracy | 97 | | Unverified
9 | CAR-FT (CLIP, ViT-B/16) | Average Accuracy | 96.8 | | Unverified
10 | MIRO (RegNetY-16GF, SWAD) | Average Accuracy | 96.8 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ViT-8/B-224 | Accuracy - Clean Images | 450 | | Unverified
2 | VOLO-D5 | Accuracy - All Images | 57.2 | | Unverified
3 | ConvNeXt-B | Accuracy - All Images | 53.5 | | Unverified
4 | ResNeXt-101 32x16d | Accuracy - All Images | 51.7 | | Unverified
5 | EfficientNet-B8 (advprop+autoaug) | Accuracy - All Images | 50.5 | | Unverified
6 | EfficientNet-B7 (advprop+autoaug) | Accuracy - All Images | 49.7 | | Unverified
7 | EfficientNet-B6 (advprop+autoaug) | Accuracy - All Images | 49.6 | | Unverified
8 | EfficientNet-B5 (advprop+autoaug) | Accuracy - All Images | 49.1 | | Unverified
9 | ViT-16/L-224 | Accuracy - All Images | 49 | | Unverified
10 | ResNet-50 (gn) | Accuracy - All Images | 48.9 | | Unverified