SOTAVerified

Medical Report Generation

Medical report generation (MRG) is a task which focus on training AI to automatically generate professional report according the input image data. This can help clinicians make faster and more accurate decision since the task itself is both time consuming and error prone even for experienced doctors.

Aggfgg

Deep neural network and transformer based architecture are currently the most popular methods for this certain task, however, when we try to transfer out pre-trained model into this certain domain, their performance always degrade.

The following are some of the reasons why RSG is hard for pre-trained models:

  • Language datasets in a particular domain can sometimes be quite different from the large number of datasets available on the Internet
  • During the fine-tuning phase, datasets in the medical field are often unevenly distributed

More recently, multi-modal learning and contrastive learning have shown some inspiring results in this field, but it's still challenging and requires further attention.

Here are some additional readings to go deeper on the task:

https://arxiv.org/abs/2004.12150

(Image credit : Transformers in Medical Imaging: A Survey)

Papers

Showing 150 of 110 papers

TitleStatusHype
Vision-Language Models for Medical Report Generation and Visual Question Answering: A ReviewCode3
CausalVLR: A Toolbox and Benchmark for Visual-Linguistic Causal ReasoningCode3
Retrieval Augmented Generation and Understanding in Vision: A Survey and New OutlookCode3
Transformers in Medical Imaging: A SurveyCode3
Cross-Modal Causal Intervention for Medical Report GenerationCode3
HistGen: Histopathology Report Generation via Local-Global Feature Encoding and Cross-modal Context InteractionCode2
GSCo: Towards Generalizable AI in Medicine via Generalist-Specialist CollaborationCode2
ECG-Chat: A Large ECG-Language Model for Cardiac Disease DiagnosisCode2
Interactive and Explainable Region-guided Radiology Report GenerationCode2
PeFoMed: Parameter Efficient Fine-tuning of Multimodal Large Language Models for Medical ImagingCode2
MiniGPT-Med: Large Language Model as a General Interface for Radiology DiagnosisCode2
VisualGPT: Data-efficient Adaptation of Pretrained Language Models for Image CaptioningCode1
Automated Generation of Accurate & Fluent Medical X-ray ReportsCode1
Weakly Supervised Contrastive Learning for Chest X-Ray Report GenerationCode1
Complex Organ Mask Guided Radiology Report GenerationCode1
Towards a Holistic Framework for Multimodal Large Language Models in Three-dimensional Brain CT Report GenerationCode1
ICON: Improving Inter-Report Consistency in Radiology Report Generation via Lesion-aware Mixup AugmentationCode1
Act Like a Radiologist: Radiology Report Generation across Anatomical RegionsCode1
Rethinking Medical Report Generation: Disease Revealing Enhancement with Knowledge GraphCode1
FFA-IR: Towards an Explainable and Reliable Medical Report Generation BenchmarkCode1
A Survey of Medical Vision-and-Language Applications and Their TechniquesCode1
Auxiliary Signal-Guided Knowledge Encoder-Decoder for Medical Report GenerationCode1
ORGAN: Observation-Guided Radiology Report Generation via Tree ReasoningCode1
Factual Serialization Enhancement: A Key Innovation for Chest X-ray Report GenerationCode1
Dynamic Graph Enhanced Contrastive Learning for Chest X-ray Report GenerationCode1
M^4I: Multi-modal Models Membership InferenceCode1
RECAP: Towards Precise Radiology Report Generation via Dynamic Disease Progression ReasoningCode1
A Benchmark for Automatic Medical Consultation System: Frameworks, Tasks and DatasetsCode1
DeepOpht: Medical Report Generation for Retinal Images via Deep Models and Visual ExplanationCode1
Structural Entities Extraction and Patient Indications Incorporation for Chest X-ray Report GenerationCode1
Automated radiology report generation using conditioned transformersCode1
Inspecting state of the art performance and NLP metrics in image-based medical report generationCode1
PromptMRG: Diagnosis-Driven Prompts for Medical Report GenerationCode1
DeltaNet:Conditional Medical Report Generation for COVID-19 DiagnosisCode1
Multi-modal Pre-training for Medical Vision-language Understanding and Generation: An Empirical Study with A New BenchmarkCode1
Automated Medical Report Generation for ECG Data: Bridging Medical Text and Signal Processing with Deep LearningCode0
CXPMRG-Bench: Pre-training and Benchmarking for X-ray Medical Report Generation on CheXpert Plus DatasetCode0
UMIT: Unifying Medical Imaging Tasks via Vision-Language ModelsCode0
Activating Associative Disease-Aware Vision Token Memory for LLM-Based X-ray Report GenerationCode0
GEMA-Score: Granular Explainable Multi-Agent Score for Radiology Report EvaluationCode0
R2GenCSR: Retrieving Context Samples for Large Language Model based X-ray Medical Report GenerationCode0
On the Automatic Generation of Medical Imaging ReportsCode0
Automatic Radiology Report Generation by Learning with Increasingly Hard NegativesCode0
Lesion Guided Explainable Few Weak-shot Medical Report GenerationCode0
Improving Medical Report Generation with Adapter Tuning and Knowledge Enhancement in Vision-Language Foundation ModelsCode0
Automated Retinal Image Analysis and Medical Report Generation through Deep LearningCode0
S4G: Amodal Single-view Single-Shot SE(3) Grasp Detection in Cluttered ScenesCode0
Cyclic Generative Adversarial Networks With Congruent Image-Report Generation For Explainable Medical Image Analysis0
Customizing General-Purpose Foundation Models for Medical Report Generation0
Cross-modal Contrastive Attention Model for Medical Report Generation0
Show:102550
← PrevPage 1 of 3Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1RGRGBLEU-137.3Unverified
2SEI-1BLEU-20.25Unverified
#ModelMetricClaimedVerifiedStatus
1HistGenBLEU-40.18Unverified
#ModelMetricClaimedVerifiedStatus
1X-RGenBLEU-40.18Unverified