SOTAVerified

Medical Report Generation

Medical report generation (MRG) is a task which focus on training AI to automatically generate professional report according the input image data. This can help clinicians make faster and more accurate decision since the task itself is both time consuming and error prone even for experienced doctors.

Aggfgg

Deep neural network and transformer based architecture are currently the most popular methods for this certain task, however, when we try to transfer out pre-trained model into this certain domain, their performance always degrade.

The following are some of the reasons why RSG is hard for pre-trained models:

  • Language datasets in a particular domain can sometimes be quite different from the large number of datasets available on the Internet
  • During the fine-tuning phase, datasets in the medical field are often unevenly distributed

More recently, multi-modal learning and contrastive learning have shown some inspiring results in this field, but it's still challenging and requires further attention.

Here are some additional readings to go deeper on the task:

https://arxiv.org/abs/2004.12150

(Image credit : Transformers in Medical Imaging: A Survey)

Papers

Showing 101110 of 110 papers

TitleStatusHype
MedUnifier: Unifying Vision-and-Language Pre-training on Medical Data with Vision Generation Task using Discrete Visual Representations0
S4G: Amodal Single-view Single-Shot SE(3) Grasp Detection in Cluttered ScenesCode0
On the Automatic Generation of Medical Imaging ReportsCode0
Automated Retinal Image Analysis and Medical Report Generation through Deep LearningCode0
Automated Medical Report Generation for ECG Data: Bridging Medical Text and Signal Processing with Deep LearningCode0
GEMA-Score: Granular Explainable Multi-Agent Score for Radiology Report EvaluationCode0
Lesion Guided Explainable Few Weak-shot Medical Report GenerationCode0
Automatic Radiology Report Generation by Learning with Increasingly Hard NegativesCode0
Improving Medical Report Generation with Adapter Tuning and Knowledge Enhancement in Vision-Language Foundation ModelsCode0
UMIT: Unifying Medical Imaging Tasks via Vision-Language ModelsCode0
Show:102550
← PrevPage 5 of 5Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1RGRGBLEU-137.3Unverified
2SEI-1BLEU-20.25Unverified
#ModelMetricClaimedVerifiedStatus
1HistGenBLEU-40.18Unverified
#ModelMetricClaimedVerifiedStatus
1X-RGenBLEU-40.18Unverified