Zero-shot Generalization

Papers

Showing 51–75 of 572 papers

Title	Date	Tasks	Status	Hype	Score
Bokehlicious: Photorealistic Bokeh Rendering with Controllable Apertures	Mar 20, 2025	Deblurring, Zero-shot Generalization	Code Available	2	5
GR-MG: Leveraging Partially Annotated Data via Multi-Modal Goal-Conditioned Policy	Aug 26, 2024	Few-Shot Learning, Image Generation	Code Available	2	5
DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment	Jul 3, 2025	cross-modal alignment, Instruction Following	Code Available	2	5
EcomGPT: Instruction-tuning Large Language Models with Chain-of-Task Tasks for E-commerce	Aug 14, 2023	Diversity, Instruction Following	Code Available	2	5
HybridDepth: Robust Metric Depth Fusion by Leveraging Depth from Focus and Single-Image Priors	Jul 26, 2024	Depth Estimation, GPU	Code Available	2	5
Efficient Diffusion Transformer Policies with Mixture of Expert Denoisers for Multitask Learning	Dec 17, 2024	Denoising	Code Available	2	5
Detecting Everything in the Open World: Towards Universal Object Detection	Mar 21, 2023	object-detection, Object Detection	Code Available	2	5
GeoBench: Benchmarking and Analyzing Monocular Geometry Estimation Models	Jun 18, 2024	Benchmarking, Depth Estimation	Code Available	2	5
GeoSynth: Contextually-Aware High-Resolution Satellite Image Synthesis	Apr 9, 2024	Image Generation, Zero-shot Generalization	Code Available	2	5
Delineate Anything: Resolution-Agnostic Field Boundary Delineation on Satellite Imagery	Apr 3, 2025	Field Boundary Delineation, Instance Segmentation	Code Available	2	5
On the test-time zero-shot generalization of vision-language models: Do we really need prompt learning?	May 3, 2024	Computational Efficiency, Prompt Learning	Code Available	2	5
Autoregressive Image Generation with Randomized Parallel Decoding	Mar 13, 2025	Conditional Image Generation, Image Generation	Code Available	2	5
OpenCity: Open Spatio-Temporal Foundation Models for Traffic Prediction	Aug 16, 2024	Prediction, Traffic Prediction	Code Available	2	5
Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot Learning	Feb 4, 2024	Contact-rich Manipulation, Zero-shot Generalization	Code Available	2	5
NeRF-Supervised Deep Stereo	Mar 30, 2023	NeRF, Neural Rendering	Code Available	2	5
Next Token Is Enough: Realistic Image Quality and Aesthetic Scoring with Multimodal Large Language Model	Mar 8, 2025	Image Quality Assessment, Language Modeling	Code Available	2	5
Crosslingual Generalization through Multitask Finetuning	Nov 3, 2022	Coreference Resolution, Cross-Lingual Transfer	Code Available	2	5
Meta-DT: Offline Meta-RL as Conditional Sequence Modeling with World Model Disentanglement	Oct 15, 2024	Disentanglement, Inductive Bias	Code Available	2	5
Multitask Prompted Training Enables Zero-Shot Task Generalization	Oct 15, 2021	Benchmarking, Decoder	Code Available	2	5
No "Zero-Shot" Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance	Apr 4, 2024	Benchmarking, Image Generation	Code Available	2	5
LLM+P: Empowering Large Language Models with Optimal Planning Proficiency	Apr 22, 2023	Zero-shot Generalization	Code Available	2	5
Memory-Efficient Visual Autoregressive Modeling with Scale-Aware KV Cache Compression	May 26, 2025	Zero-shot Generalization	Code Available	2	5
Learning to Route Among Specialized Experts for Zero-Shot Generalization	Feb 8, 2024	parameter-efficient fine-tuning, Zero-shot Generalization	Code Available	2	5
Is ChatGPT Good at Search? Investigating Large Language Models as Re-Ranking Agents	Apr 19, 2023	Information Retrieval, Passage Ranking	Code Available	2	5
Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient	Nov 26, 2024	GPU, Image Generation	Code Available	2	5

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GR-MG	Avg. sequence length	4.04	—	Unverified
2	MoDE	Avg. sequence length	4.01	—	Unverified
3	RoboUniView	Avg. sequence length	3.65	—	Unverified
4	3D Diffuser Actor	Avg. sequence length	3.27	—	Unverified
5	GR-1	Avg. sequence length	3.06	—	Unverified