SOTAVerified

Medical Report Generation

Medical report generation (MRG) is a task which focus on training AI to automatically generate professional report according the input image data. This can help clinicians make faster and more accurate decision since the task itself is both time consuming and error prone even for experienced doctors.

Aggfgg

Deep neural network and transformer based architecture are currently the most popular methods for this certain task, however, when we try to transfer out pre-trained model into this certain domain, their performance always degrade.

The following are some of the reasons why RSG is hard for pre-trained models:

  • Language datasets in a particular domain can sometimes be quite different from the large number of datasets available on the Internet
  • During the fine-tuning phase, datasets in the medical field are often unevenly distributed

More recently, multi-modal learning and contrastive learning have shown some inspiring results in this field, but it's still challenging and requires further attention.

Here are some additional readings to go deeper on the task:

https://arxiv.org/abs/2004.12150

(Image credit : Transformers in Medical Imaging: A Survey)

Papers

Showing 150 of 110 papers

TitleStatusHype
Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning0
MRGAgents: A Multi-Agent Framework for Improved Medical Report Generation with Med-LVLMs0
Towards a HIPAA Compliant Agentic AI System in Healthcare0
LVMed-R2: Perception and Reflection-driven Complex Reasoning for Medical Report Generation0
Image-to-Text for Medical Reports Using Adaptive Co-Attention and Triple-LSTM Module0
Retrieval Augmented Generation and Understanding in Vision: A Survey and New OutlookCode3
UMIT: Unifying Medical Imaging Tasks via Vision-Language ModelsCode0
GEMA-Score: Granular Explainable Multi-Agent Score for Radiology Report EvaluationCode0
MedUnifier: Unifying Vision-and-Language Pre-training on Medical Data with Vision Generation Task using Discrete Visual Representations0
PolyPath: Adapting a Large Multimodal Model for Multi-slide Pathology Report Generation0
From large language models to multimodal AI: A scoping review on the potential of generative AI in medicine0
Activating Associative Disease-Aware Vision Token Memory for LLM-Based X-ray Report GenerationCode0
DAMPER: A Dual-Stage Medical Report Generation Framework with Coarse-Grained MeSH Alignment and Fine-Grained Hypergraph Matching0
Automated Medical Report Generation for ECG Data: Bridging Medical Text and Signal Processing with Deep LearningCode0
FactCheXcker: Mitigating Measurement Hallucinations in Chest X-ray Report Generation Models0
A Survey of Medical Vision-and-Language Applications and Their TechniquesCode1
The Potential of LLMs in Medical Education: Generating Questions and Answers for Qualification Exams0
Large Language Model Benchmarks in Medical Tasks0
Image-aware Evaluation of Generated Medical Reports0
Resource-Efficient Medical Report Generation using Large Language Models0
Retrieval Instead of Fine-tuning: A Retrieval-based Parameter Ensemble for Zero-shot Learning0
ViT3D Alignment of LLaMA3: 3D Medical Image Report Generation0
CXPMRG-Bench: Pre-training and Benchmarking for X-ray Medical Report Generation on CheXpert Plus DatasetCode0
FODA-PG for Enhanced Medical Imaging Narrative Generation: Adaptive Differentiation of Normal and Abnormal Attributes0
Medical Report Generation Is A Multi-label Classification Problem0
M4CXR: Exploring Multi-task Potentials of Multi-modal Large Language Models for Chest X-ray Interpretation0
Automatic Medical Report Generation: Methods and Applications0
R2GenCSR: Retrieving Context Samples for Large Language Model based X-ray Medical Report GenerationCode0
ECG-Chat: A Large ECG-Language Model for Cardiac Disease DiagnosisCode2
Automated Retinal Image Analysis and Medical Report Generation through Deep LearningCode0
A Labeled Ophthalmic Ultrasound Dataset with Medical Report Generation Based on Cross-modal Deep Learning0
MedRAT: Unpaired Medical Report Generation via Auxiliary Tasks0
MiniGPT-Med: Large Language Model as a General Interface for Radiology DiagnosisCode2
A Survey on Trustworthiness in Foundation Models for Medical Image Analysis0
Towards a Holistic Framework for Multimodal Large Language Models in Three-dimensional Brain CT Report GenerationCode1
CoMT: Chain-of-Medical-Thought Reduces Hallucination in Medical Report Generation0
Structural Entities Extraction and Patient Indications Incorporation for Chest X-ray Report GenerationCode1
Factual Serialization Enhancement: A Key Innovation for Chest X-ray Report GenerationCode1
Topicwise Separable Sentence Retrieval for Medical Report Generation0
GSCo: Towards Generalizable AI in Medicine via Generalist-Specialist CollaborationCode2
Prompt-Guided Generation of Structured Chest X-Ray Report Using a Pre-trained LLM0
Dia-LLaMA: Towards Large Language Model-driven CT Report Generation0
MedCycle: Unpaired Medical Report Generation via Cycle-Consistency0
HistGen: Histopathology Report Generation via Local-Global Feature Encoding and Cross-modal Context InteractionCode2
Vision-Language Models for Medical Report Generation and Visual Question Answering: A ReviewCode3
ICON: Improving Inter-Report Consistency in Radiology Report Generation via Lesion-aware Mixup AugmentationCode1
Unmasking and Quantifying Racial Bias of Large Language Models in Medical Report Generation0
Dynamic Traceback Learning for Medical Report Generation0
PeFoMed: Parameter Efficient Fine-tuning of Multimodal Large Language Models for Medical ImagingCode2
Medical Report Generation based on Segment-Enhanced Contrastive Representation Learning0
Show:102550
← PrevPage 1 of 3Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1RGRGBLEU-137.3Unverified
2SEI-1BLEU-20.25Unverified
#ModelMetricClaimedVerifiedStatus
1HistGenBLEU-40.18Unverified
#ModelMetricClaimedVerifiedStatus
1X-RGenBLEU-40.18Unverified