SOTAVerified

Unconditional Molecule Generation

This task evaluates the ability of generative models to sample valid and realistic molecular structures.

The training dataset can be:

  • QM9 (Wu et al., 2018) - consists of 130,000 stable small organic molecules containing up to nine heavy atoms (C, N, O, F) along with hydrogens.
  • GEOM-DRUGS (Axelrod and Gómez-Bombarelli, 2022) - consistes of 430,000 large organic molecules of up to 180 atoms.

Following prior work (Hoogeboom et al., 2022), we generally sample 10,000 molecules and compute validity, uniqueness and Posebusters sanity checks (Buttenschoen et al., 2024) for molecules. Data is generally split following prior work (Hoogeboom et al., 2022, Vignac et al., 2023) to ensure fair comparisons.

Papers

Showing 18 of 8 papers

TitleStatusHype
All-atom Diffusion Transformers: Unified generative modelling of molecules and materialsCode3
Geometric Representation Condition Improves Equivariant Molecule GenerationCode1
SemlaFlow -- Efficient 3D Molecular Generation with Latent Attention and Equivariant Flow Matching0
Symphony: Symmetry-Equivariant Point-Centered Spherical Harmonics for 3D Molecule GenerationCode1
Navigating the Design Space of Equivariant Diffusion-Based Generative Models for De Novo 3D Molecule Generation0
Geometric Latent Diffusion Models for 3D Molecule GenerationCode2
MiDi: Mixed Graph and 3D Denoising Diffusion for Molecule GenerationCode1
Equivariant Diffusion for Molecule Generation in 3DCode2
Show:102550

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1TABASCOPoseBusters Validity92Unverified
2SemlaFlowPoseBusters Validity87.5Unverified
3ADiTPoseBusters Validity85.3Unverified
4MiDiValidity77.8Unverified
5EQGAT-diffPoseBusters Validity59.7Unverified
#ModelMetricClaimedVerifiedStatus
1ADiTValidity94.45Unverified
2GeoLDMValidity93.8Unverified
3EDMValidity91.9Unverified
4SymphonyValidity83.5Unverified