Medical Report Generation

Medical report generation (MRG) is a task which focus on training AI to automatically generate professional report according the input image data. This can help clinicians make faster and more accurate decision since the task itself is both time consuming and error prone even for experienced doctors.

Aggfgg

Deep neural network and transformer based architecture are currently the most popular methods for this certain task, however, when we try to transfer out pre-trained model into this certain domain, their performance always degrade.

The following are some of the reasons why RSG is hard for pre-trained models:

Language datasets in a particular domain can sometimes be quite different from the large number of datasets available on the Internet
During the fine-tuning phase, datasets in the medical field are often unevenly distributed

More recently, multi-modal learning and contrastive learning have shown some inspiring results in this field, but it's still challenging and requires further attention.

Here are some additional readings to go deeper on the task:

On the Automatic Generation of Medical Imaging Reports

https://doi.org/10.48550/arXiv.1711.08195
A scoping review of transfer learning research on medical image analysis using ImageNet

https://arxiv.org/abs/2004.13175
A Survey on Incorporating Domain Knowledge into Deep Learning for Medical Image Analysis

https://arxiv.org/abs/2004.12150

(Image credit : Transformers in Medical Imaging: A Survey)

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1–50 of 110 papers

Title	Date	Tasks	Status	Hype
Retrieval Augmented Generation and Understanding in Vision: A Survey and New Outlook	Mar 23, 2025	3D GenerationMedical Report Generation	CodeCode Available	3
Vision-Language Models for Medical Report Generation and Visual Question Answering: A Review	Mar 4, 2024	Medical Report GenerationQuestion Answering	CodeCode Available	3
CausalVLR: A Toolbox and Benchmark for Visual-Linguistic Causal Reasoning	Jun 30, 2023	Causal InferenceMedical Report Generation	CodeCode Available	3
Cross-Modal Causal Intervention for Medical Report Generation	Mar 16, 2023	Medical Report Generationobject-detection	CodeCode Available	3
Transformers in Medical Imaging: A Survey	Jan 24, 2022	Image ClassificationImage Segmentation	CodeCode Available	3
ECG-Chat: A Large ECG-Language Model for Cardiac Disease Diagnosis	Aug 16, 2024	Contrastive LearningDiagnostic	CodeCode Available	2
MiniGPT-Med: Large Language Model as a General Interface for Radiology Diagnosis	Jul 4, 2024	DiagnosticLanguage Modeling	CodeCode Available	2
GSCo: Towards Generalizable AI in Medicine via Generalist-Specialist Collaboration	Apr 23, 2024	Collaborative InferenceIn-Context Learning	CodeCode Available	2
HistGen: Histopathology Report Generation via Local-Global Feature Encoding and Cross-modal Context Interaction	Mar 8, 2024	DiagnosticMedical Report Generation	CodeCode Available	2
PeFoMed: Parameter Efficient Fine-tuning of Multimodal Large Language Models for Medical Imaging	Jan 5, 2024	Medical Report GenerationMedical Visual Question Answering	CodeCode Available	2
Interactive and Explainable Region-guided Radiology Report Generation	Apr 17, 2023	Medical Report Generation	CodeCode Available	2
A Survey of Medical Vision-and-Language Applications and Their Techniques	Nov 19, 2024	Decision MakingDiagnostic	CodeCode Available	1
Towards a Holistic Framework for Multimodal Large Language Models in Three-dimensional Brain CT Report Generation	Jul 2, 2024	AnatomyClinical Knowledge	CodeCode Available	1
Structural Entities Extraction and Patient Indications Incorporation for Chest X-ray Report Generation	May 23, 2024	cross-modal alignmentDecoder	CodeCode Available	1
Factual Serialization Enhancement: A Key Innovation for Chest X-ray Report Generation	May 15, 2024	Contrastive Learningcross-modal alignment	CodeCode Available	1
ICON: Improving Inter-Report Consistency in Radiology Report Generation via Lesion-aware Mixup Augmentation	Feb 20, 2024	Medical Report Generation	CodeCode Available	1
Complex Organ Mask Guided Radiology Report Generation	Nov 4, 2023	Medical Report Generation	CodeCode Available	1
RECAP: Towards Precise Radiology Report Generation via Dynamic Disease Progression Reasoning	Oct 21, 2023	Medical Report Generation	CodeCode Available	1
PromptMRG: Diagnosis-Driven Prompts for Medical Report Generation	Aug 24, 2023	DiagnosticMedical Report Generation	CodeCode Available	1
Rethinking Medical Report Generation: Disease Revealing Enhancement with Knowledge Graph	Jul 24, 2023	Medical Report Generation	CodeCode Available	1
ORGAN: Observation-Guided Radiology Report Generation via Tree Reasoning	Jun 10, 2023	Medical Report Generation	CodeCode Available	1
Multi-modal Pre-training for Medical Vision-language Understanding and Generation: An Empirical Study with A New Benchmark	Jun 10, 2023	Image-text RetrievalMedical Report Generation	CodeCode Available	1
Act Like a Radiologist: Radiology Report Generation across Anatomical Regions	May 26, 2023	DecoderMedical Report Generation	CodeCode Available	1
Dynamic Graph Enhanced Contrastive Learning for Chest X-ray Report Generation	Mar 18, 2023	Contrastive LearningDecoder	CodeCode Available	1
DeltaNet:Conditional Medical Report Generation for COVID-19 Diagnosis	Nov 12, 2022	COVID-19 DiagnosisDecoder	CodeCode Available	1
M^4I: Multi-modal Models Membership Inference	Sep 15, 2022	Image CaptioningInference Attack	CodeCode Available	1
A Benchmark for Automatic Medical Consultation System: Frameworks, Tasks and Datasets	Apr 19, 2022	Dialogue Act ClassificationDialogue Understanding	CodeCode Available	1
Automated Generation of Accurate & Fluent Medical X-ray Reports	Nov 1, 2021	Medical Report Generation	CodeCode Available	1
Weakly Supervised Contrastive Learning for Chest X-Ray Report Generation	Sep 25, 2021	Contrastive LearningDescriptive	CodeCode Available	1
FFA-IR: Towards an Explainable and Reliable Medical Report Generation Benchmark	Aug 19, 2021	DiagnosticMedical Report Generation	CodeCode Available	1
Automated radiology report generation using conditioned transformers	Mar 26, 2021	Medical Report GenerationSemantic Similarity	CodeCode Available	1
VisualGPT: Data-efficient Adaptation of Pretrained Language Models for Image Captioning	Feb 20, 2021	DecoderImage Captioning	CodeCode Available	1
Inspecting state of the art performance and NLP metrics in image-based medical report generation	Nov 18, 2020	Medical Report Generation	CodeCode Available	1
DeepOpht: Medical Report Generation for Retinal Images via Deep Models and Visual Explanation	Nov 1, 2020	Medical Report Generation	CodeCode Available	1
Auxiliary Signal-Guided Knowledge Encoder-Decoder for Medical Report Generation	Jun 6, 2020	DecoderImage Captioning	CodeCode Available	1
Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning	Jun 8, 2025	Medical Report GenerationQuestion Answering	—Unverified	0
MRGAgents: A Multi-Agent Framework for Improved Medical Report Generation with Med-LVLMs	May 24, 2025	DiagnosticMedical Report Generation	—Unverified	0
Towards a HIPAA Compliant Agentic AI System in Healthcare	Apr 24, 2025	AttributeMedical Report Generation	—Unverified	0
LVMed-R2: Perception and Reflection-driven Complex Reasoning for Medical Report Generation	Apr 2, 2025	DiagnosticMedical Report Generation	—Unverified	0
Image-to-Text for Medical Reports Using Adaptive Co-Attention and Triple-LSTM Module	Mar 24, 2025	Image to textMedical Report Generation	—Unverified	0
UMIT: Unifying Medical Imaging Tasks via Vision-Language Models	Mar 20, 2025	DiagnosticMedical Image Analysis	CodeCode Available	0
GEMA-Score: Granular Explainable Multi-Agent Score for Radiology Report Evaluation	Mar 7, 2025	Large Language ModelMedical Report Generation	CodeCode Available	0
MedUnifier: Unifying Vision-and-Language Pre-training on Medical Data with Vision Generation Task using Discrete Visual Representations	Mar 2, 2025	image-classificationImage Classification	—Unverified	0
PolyPath: Adapting a Large Multimodal Model for Multi-slide Pathology Report Generation	Feb 14, 2025	DiagnosticMedical Report Generation	—Unverified	0
From large language models to multimodal AI: A scoping review on the potential of generative AI in medicine	Feb 13, 2025	DiagnosticDrug Discovery	—Unverified	0
Activating Associative Disease-Aware Vision Token Memory for LLM-Based X-ray Report Generation	Jan 7, 2025	Language ModelingLanguage Modelling	—Unverified	0
DAMPER: A Dual-Stage Medical Report Generation Framework with Coarse-Grained MeSH Alignment and Fine-Grained Hypergraph Matching	Dec 19, 2024	Hypergraph MatchingMedical Report Generation	—Unverified	0
Automated Medical Report Generation for ECG Data: Bridging Medical Text and Signal Processing with Deep Learning	Dec 5, 2024	Comment GenerationDecoder	CodeCode Available	0
FactCheXcker: Mitigating Measurement Hallucinations in Chest X-ray Report Generation Models	Nov 27, 2024	Code GenerationLanguage Modeling	—Unverified	0
The Potential of LLMs in Medical Education: Generating Questions and Answers for Qualification Exams	Oct 31, 2024	DiagnosticMedical Report Generation	—Unverified	0

Show:10 25 50

← PrevPage 1 of 3Next →

All datasets MIMIC-CXR HistGen WSI-Report Dataset IU X-Ray

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	RGRG	BLEU-1	37.3	—	Unverified
2	SEI-1	BLEU-2	0.25	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	HistGen	BLEU-4	0.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	X-RGen	BLEU-4	0.18	—	Unverified