Medical Report Generation

Medical report generation (MRG) is a task which focus on training AI to automatically generate professional report according the input image data. This can help clinicians make faster and more accurate decision since the task itself is both time consuming and error prone even for experienced doctors.

Aggfgg

Deep neural network and transformer based architecture are currently the most popular methods for this certain task, however, when we try to transfer out pre-trained model into this certain domain, their performance always degrade.

The following are some of the reasons why RSG is hard for pre-trained models:

Language datasets in a particular domain can sometimes be quite different from the large number of datasets available on the Internet
During the fine-tuning phase, datasets in the medical field are often unevenly distributed

More recently, multi-modal learning and contrastive learning have shown some inspiring results in this field, but it's still challenging and requires further attention.

Here are some additional readings to go deeper on the task:

On the Automatic Generation of Medical Imaging Reports

https://doi.org/10.48550/arXiv.1711.08195
A scoping review of transfer learning research on medical image analysis using ImageNet

https://arxiv.org/abs/2004.13175
A Survey on Incorporating Domain Knowledge into Deep Learning for Medical Image Analysis

https://arxiv.org/abs/2004.12150

(Image credit : Transformers in Medical Imaging: A Survey)

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 101–110 of 110 papers

Title	Date	Tasks	Status
MedUnifier: Unifying Vision-and-Language Pre-training on Medical Data with Vision Generation Task using Discrete Visual Representations	Mar 2, 2025	image-classificationImage Classification	—Unverified
S4G: Amodal Single-view Single-Shot SE(3) Grasp Detection in Cluttered Scenes	Oct 31, 2019	Medical Report Generation	CodeCode Available
On the Automatic Generation of Medical Imaging Reports	Nov 22, 2017	Medical Report GenerationMulti-Task Learning	CodeCode Available
Automated Retinal Image Analysis and Medical Report Generation through Deep Learning	Aug 14, 2024	DiagnosticMedical Report Generation	CodeCode Available
Automated Medical Report Generation for ECG Data: Bridging Medical Text and Signal Processing with Deep Learning	Dec 5, 2024	Comment GenerationDecoder	CodeCode Available
GEMA-Score: Granular Explainable Multi-Agent Score for Radiology Report Evaluation	Mar 7, 2025	Large Language ModelMedical Report Generation	CodeCode Available
Lesion Guided Explainable Few Weak-shot Medical Report Generation	Nov 16, 2022	Medical Report Generation	CodeCode Available
Automatic Radiology Report Generation by Learning with Increasingly Hard Negatives	May 11, 2023	Medical Report Generation	CodeCode Available
Improving Medical Report Generation with Adapter Tuning and Knowledge Enhancement in Vision-Language Foundation Models	Dec 7, 2023	Domain AdaptationMedical Report Generation	CodeCode Available
UMIT: Unifying Medical Imaging Tasks via Vision-Language Models	Mar 20, 2025	DiagnosticMedical Image Analysis	CodeCode Available

Show:10 25 50

← PrevPage 5 of 5Next →

All datasets MIMIC-CXR HistGen WSI-Report Dataset IU X-Ray

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	RGRG	BLEU-1	37.3	—	Unverified
2	SEI-1	BLEU-2	0.25	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	HistGen	BLEU-4	0.18	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	X-RGen	BLEU-4	0.18	—	Unverified