AI4Radiology

April 28, 2026 · View on GitHub

This project was created by Yuezhe Yang for the paper "Representation Paradigms in AI-based 3D Radiological Image Reconstruction: A Systematic Review" (Paper Link).

Abstract
Highlights
Taxonomy
Datasets
Evaluation Metrics
Methods
Challenges and Opportunities
Contributing
Citation

Abstract

The demand for high-quality medical imaging in clinical practice and assisted diagnosis has made 3D reconstruction in radiological imaging a key research focus. Artificial intelligence (AI) has emerged as a promising approach for improving reconstruction accuracy while reducing acquisition and processing time, thereby minimizing patient radiation exposure and discomfort and ultimately benefiting clinical diagnosis. This review surveys state-of-the-art AI-based 3D reconstruction algorithms in radiological imaging and organizes them into four representation families according to how the reconstructed target is parameterized: discrete grid representations, explicit basis expansion representations, explicit primitive representations, and implicit neural representations. In particular, the review clarifies the relationships among these representation forms and highlights radiance field methods as a specialized subtype of implicit neural representation. In addition, we summarize commonly used evaluation metrics and benchmark datasets for radiological image reconstruction. Finally, we discuss the current state of development, major challenges, and future research directions in this rapidly evolving field. Our project is available at: https://github.com/Bean-Young/AI4Radiology.

Highlights

Representation-first organization. Methods are grouped by how the reconstructed target is parameterized, not only by network backbone.
Paper + code links. Each method row includes a paper page and a code link when a public implementation is available.
Radiology-focused scope. The list centers on reconstruction and image formation across CT, MRI, PET, SPECT, ultrasound, and multi-modality settings.
Awesome-style maintenance. PRs adding missing code, paper pages, datasets, or corrected metadata are welcome.

Taxonomy

Family	Core idea	Typical strengths	Common limitations
Discrete grid	Reconstruct slices or volumes on regular pixels/voxels.	Simple, deployable, compatible with clinical image grids.	Fixed sampling can limit spatial continuity and memory efficiency.
Explicit basis expansion	Store finite coefficients over local bases, kernels, or transformed domains.	Interpretable coefficients and strong links to classical reconstruction.	Depends on basis/kernel design and imaging-operator conditioning.
Explicit primitive	Use adaptive primitives such as 3D Gaussians or meshes.	Efficient adaptive modeling and fast rendering potential.	Requires specialized initialization, rendering, and optimization.
Implicit neural	Model anatomy or measurements with continuous coordinate-query functions.	Strong continuous modeling under sparse-view or low-dose settings.	Expensive optimization and weaker interpretability.

Taxonomy of representation families for AI-based 3D reconstruction in radiological imaging

Taxonomy of representation families for AI-based 3D reconstruction in radiological imaging.

Datasets

Three distinct task types in radiological image reconstruction

Three distinct task types in radiological image reconstruction.

Modality	Task	Dataset	Size	Details
CT	Task I / Task II	AAPM Low Dose CT Grand Challenge	30 abdominal cases plus an ACR phantom	A classic low-dose CT reconstruction benchmark providing quarter-dose and full-dose CT data for algorithm comparison under clinically relevant dose reduction.
CT	Task I / Task II	LDCT-and-Projection-Data	299 clinical CT examinations	A TCIA/Mayo dataset containing head, chest, and abdomen CT projection data in DICOM-CT-PD format, with routine-dose and simulated low-dose projections for CT reconstruction and denoising.
CT	Task I / Task II	LoDoPaB-CT	46,573 2D image-observation pairs	A large low-dose CT benchmark derived from LIDC-IDRI, providing simulated low-photon-count CT observations and ground-truth images for inverse reconstruction.
CT	Task I / Task III	SinoCT	>9,000 head CT scans	A large head CT dataset with reconstructed DICOM images and GE CatSim-simulated sinograms, suitable for sinogram-domain and reduced-projection CT reconstruction studies.
CT	Task II / Task III	AAPM CT-MAR Challenge	14,000 training cases, 1,000 test cases, and 29 final clinical scoring cases	A CT metal artifact reduction benchmark providing paired metal-corrupted and metal-free images and sinograms, allowing image-domain, sinogram-domain, and hybrid reconstruction methods.
CT	Task I / Task III	MORE	135 CT studies (65,575 slices)	A multi-organ CT reconstruction benchmark covering 9 anatomies and 15 lesion types, designed for sparse-view CT reconstruction and cross-anatomy generalization evaluation.
CT	Task III	Corona-Figueroa et al.	25 CT volumes	A paper-specific chest and knee CT-to-DRR corpus used for single- or few-view X-ray/CT projection reconstruction and 3D-aware radiological novel-view synthesis.
MRI	Task I / Task III	fastMRI	1,594 knee raw scans, 7,002 brain raw scans, plus prostate and breast subsets	A large-scale MRI reconstruction benchmark releasing raw k-space and reference images for accelerated single-coil and multi-coil MRI reconstruction.
MRI	Task I / Task III	Calgary-Campinas Public Brain MR Raw k-space Dataset	167 3D T1-weighted brain MRI scans	A public brain MRI raw k-space benchmark with 12-channel and 32-channel acquisitions for accelerated 3D MRI reconstruction.
MRI	Task I / Task III	OCMR	165 fully sampled and 212 prospectively undersampled cardiac cine series	An open-access cardiac MRI dataset providing multi-coil k-space data for fully sampled and prospectively undersampled cine reconstruction.
MRI	Task I / Task III	SKM-TEA	155 qDESS knee MRI scans	A quantitative knee MRI dataset with raw k-space, reconstructed DICOM images, tissue segmentations, and pathology labels for end-to-end accelerated MRI reconstruction evaluation.
MRI	Task I / Task III	CMRxRecon2024	330 volunteers; 200/60/70 train/validation/test split	A multi-contrast, multi-view, multi-coil cardiac MRI k-space benchmark covering cine, mapping, tagging, flow, and dark-blood acquisitions with multiple undersampling patterns.
MRI	Task I / Task II	M4Raw	183 healthy volunteers; >1,000 released volumes after quality control	A low-field 0.3T multi-channel brain MRI raw k-space dataset with T1w, T2w, and FLAIR repetitions for denoising, averaging, and parallel-imaging reconstruction.
PET	Task II	UDPET	1,447 whole-body 18F-FDG PET subjects	A whole-body ultra-low-dose PET recovery benchmark providing low-statistics PET images at multiple dose-reduction factors aligned with full-dose PET images.
PET	Task I	PETRIC	Approximately 13 phantom datasets	A PET rapid image reconstruction challenge dataset with phantom list-mode files, attenuation-correction files, normalization/calibration files, vendor reconstructions, and VOIs for PET reconstruction benchmarking.
US	Task I / Task III	PICMUS	4 core phantom datasets plus in-vivo carotid acquisitions	A canonical plane-wave ultrasound image-formation benchmark providing RF/IQ data from 75 steered plane waves for coherent compounding and beamforming reconstruction.
US	Task I / Task III	CUBDL	576 ultrasound acquisition sequences	A large open ultrasound channel-data benchmark containing simulated, phantom, and in-vivo acquisitions for deep-learning-based beamforming and image formation.
US	Task III	Wysocki et al. / Ultra-NeRF	Synthetic liver sweeps and tracked spine-phantom sweeps	A B-mode ultrasound novel-view and volumetric representation dataset for learning view-dependent US image synthesis from overlapping sweeps.
PET / MRI	Task I	Monash vis-fPET-fMRI	10 healthy adults	An OpenNeuro simultaneous FDG-fPET/BOLD-fMRI dataset releasing unreconstructed PET list-mode source data, normalization and sinogram-related files, and reconstructed PET images for PET reconstruction pipeline development.
PET / MRI	Task I	Monash DaCRA fPET-fMRI	15 participants	An OpenNeuro FDG-fPET/fMRI dataset comparing bolus, infusion, and bolus-infusion protocols and releasing unreconstructed PET list-mode data for offline dynamic PET reconstruction.
SPECT / CT	Task I	Lu-177 DOTATATE Projection Data	2 patients scanned at 4 post-therapy time points	A rare public human SPECT reconstruction dataset containing Lu-177 SPECT projection data and CT-based attenuation coefficient maps for quantitative SPECT reconstruction and dosimetry.

Evaluation Metrics

Category	Metrics
Pixel-wise error	MSE, RMSE, NRMSE, MAE
Signal and noise	PSNR, SNR, CNR
Perceptual / structural similarity	SSIM, LPIPS, FSIM, cosine similarity
Efficiency	Reconstruction time, rendering FPS, parameter count

Methods

Discrete grid representations

Representation	Modality	Paper	Year	Code	Why it matters
Slice-wise	CT	CTTR	2022	-	A dual-domain transformer for sparse-view CT reconstruction. It restores sinogram and image-domain information while the final reconstruction remains an explicit CT grid.
Slice-wise	CT	DuDoTrans	2022	Code	A dual-domain transformer that restores sparse-view sinograms and reconstructs CT images by combining projection-domain and image-domain information.
Slice-wise	CT	CoreDiff	2023	Code	A diffusion-based low-dose CT reconstruction method that uses a mean-preserving degradation process and image-domain restoration to suppress noise and preserve texture.
Slice-wise	MRI	Du et al.	2020	-	A residual CNN-based super-resolution method for anisotropic 3D MRI. Although the task targets 3D MRI, the network processes grid samples and outputs rasterized image slices.
Slice-wise	MRI	VarNet	2020	-	An end-to-end variational network for accelerated MRI reconstruction. It became a strong fastMRI baseline and reconstructs images on explicit sampled grids.
Slice-wise	MRI	PCNN	2021	Code	A perceptual complex neural network for undersampled non-Cartesian cine MRI reconstruction, using complex convolutions and perceptual loss on discrete image grids.
Slice-wise	MRI	SLATER	2022	Code	An unsupervised adversarial transformer-based MRI reconstruction method that learns an image prior and performs zero-shot reconstruction on sampled image grids.
Slice-wise	CT/MRI	MambaMIR	2025	-	A Mamba-based medical reconstruction framework with arbitrary masking and uncertainty estimation, applied to fast MRI and sparse-view CT reconstruction.
Slice-wise	CT/MRI /PET	Restore-RWKV	2024	Code	A medical image restoration model based on recurrent WKV attention and omnidirectional token shift. It handles CT, MRI, and PET restoration on discrete image grids.
Multi-plane	CT	Corona-Figueroa et al.	2024	-	A method that repeats and concatenates 2D X-ray views into a 3D volume representation and uses neural optimal transport to retain cross-view information.
Multi-plane	MRI	MADGAN	2021	-	A GAN-driven method that uses multiple adjacent brain MRI slices to improve reconstruction and anomaly detection, making it more naturally a slice-stack grid representation.
Volume	CT	HDNet	2020	-	A hybrid-domain neural network for sparse-view cone-beam CT reconstruction, combining projection and image-domain information in a volumetric grid output.
Volume	CT	C2RV	2024	Code	A cross-regional and cross-view learning framework for sparse-view CBCT reconstruction, using multi-scale volumetric representation and cross-view attention.
Volume	CT	Geometry-aware attenuation learning	2024	Code	A sparse-view CBCT reconstruction method that encodes multi-view projections, back-projects features into 3D space, and decodes a voxel volume.
Volume	CT	DCT-Net	2025	-	A dual-branch CT reconstruction network from orthogonal X-rays, combining diffusion-based augmentation and perceptual contrastive learning for volumetric CT recovery.
Volume	MRI	CINENet	2020	-	A 4D deep reconstruction framework for isotropic 3D cine MRI from single breath-hold acquisitions, reconstructing volumetric image sequences.
Volume	MRI	Shaul et al.	2020	Code	A GAN-based undersampled MRI reconstruction framework combining U-Net and adversarial learning to recover high-quality grid-based MRI volumes.
Volume	PET	DirectPET	2020	-	An end-to-end PET reconstruction method with a Radon inversion layer, designed for fast reconstruction from projection data to image grids.
Volume	PET	Xie et al.	2021	-	A supervised co-learning CNN framework that extracts PET and CT features and integrates them into constrained PET reconstruction to improve lesion contrast.
Volume	PET	MEaTransGAN	2024	-	A 3D multi-modality Transformer-GAN that reconstructs standard-dose PET from low-dose PET and T1-weighted MRI.
Volume	PET	Singh et al.	2024	Code	A fully 3D PET reconstruction framework using score-based generative models and PET-specific sampling to improve robustness and image quality.
Volume	PET	Cycle-DCN	2025	-	A cycle-constrained adversarial denoising CNN for 3D low-dose PET image denoising, validated on large datasets with reader study and real low-dose data. The method operates on discrete volumetric PET images rather than continuous or primitive-based representations.
Volume	PET	DDPET-3D	2026	-	A dose-aware diffusion model for 3D low-dose PET denoising that generates consistent volumetric PET images across varying dose levels and improves image quality in ultra-low-dose settings.
Volume	SPECT	SPECTnet	2021	-	A deep learning method for direct SPECT reconstruction from projection data to high-resolution activity images.
Volume	CT/MRI	DiffusionMBIR	2023	Code	A method combining pretrained 2D diffusion priors with model-based iterative reconstruction for sparse-view CT, limited-angle CT, and compressed sensing MRI.

Explicit basis expansion representations

Representation	Modality	Paper	Year	Code	Why it matters
Local basis	PET	Mandeville et al.	2024	-	A recent PET study using B-spline basis functions for voxel-wise partial volume correction, showing that local basis representations remain relevant in reconstruction-related medical imaging problems.
Kernel coeff.	PET	Gong et al.	2021	-	A direct parametric PET reconstruction method combining nonlocal deep image prior, a kernel matrix layer, and kinetic-model convolution. The reconstructed object remains a finite coefficient-based image.
Kernel coeff.	PET	Li et al.	2022	-	A PET reconstruction framework that connects kernel representation with learnable deep mappings, improving dynamic PET reconstruction over empirical kernel construction.
Kernel coeff.	PET	Neural KEM	2023	-	A kernel expectation-maximization method with a deep coefficient prior, designed to improve low-count PET reconstruction while retaining explicit coefficient-space representation.
Kernel coeff.	PET	Guo et al.	2023	-	A PET reconstruction method that jointly regularizes image space and kernel space to reduce mismatch between anatomical priors and PET tracer distributions.
Kernel coeff.	PET	Deidda et al.	2023	Code	A multimodal kernel-based reconstruction framework incorporating SPECT, PET, and CT side information, showing that kernel coefficient ideas extend beyond PET-only reconstruction.
Kernel coeff.	SPECT	Deidda et al.	2022	-	A hybrid kernelised expectation-maximisation method for Bremsstrahlung SPECT reconstruction, improving image resolution and noise properties with CT guidance.

Explicit primitive representations

Representation	Modality	Paper	Year	Code	Why it matters
Gaussian	CT	DIF-Gaussian	2024	Code	A Gaussian-based sparse-view CBCT reconstruction method that learns 3D Gaussian features and optimizes them for extremely sparse projection settings.
Gaussian	CT	R2-Gaussian	2024	Code	A radiative Gaussian splatting framework for tomographic reconstruction, designed to reduce integration bias and improve sparse-view reconstruction quality.
Gaussian	CT	GaSpCT	2024	-	A Gaussian splatting framework for CT projection view synthesis from limited projections, improving efficiency compared with dense voxel and NeRF-style representations.
Gaussian	CT	DDGS-CT	2024	-	A direction-disentangled Gaussian splatting method for realistic CT volume rendering, decomposing radiosity into isotropic and directional components.
Gaussian	CT	X-Gaussian	2025	Code	A radiative Gaussian splatting method for efficient X-ray novel-view synthesis, eliminating view dependency and improving training and inference speed.
Gaussian	CT	X-GRM	2025	-	A large Gaussian reconstruction model that decodes sparse-view X-rays into a voxel-based Gaussian representation for feed-forward CT reconstruction.
Gaussian	CT	3DGR-CT	2025	Code	A sparse-view CT reconstruction method that initializes Gaussian primitives from FBP and updates them adaptively during self-supervised reconstruction.
Gaussian	CT	Fu et al.	2025	-	A 4D-CBCT reconstruction method that combines Gaussian primitives with deformation modeling to represent respiratory motion and dynamic anatomy.
Gaussian	MRI	CauGD2-Net	2025	-	A causality-driven dual-domain MRI reconstruction network enhanced by Gaussian-splatting-inspired spatial modeling to reduce artifacts and improve consistency.
Gaussian	MRI	3DGSMR	2025	-	A 3D MRI reconstruction framework from undersampled k-space that represents the target MR volume with explicit 3D Gaussian primitives.
Gaussian	MRI	M-Gaussian	2026	-	A magnetic Gaussian framework for slice-to-volume MRI reconstruction that adapts 3D Gaussian primitives to efficient multi-stack MRI reconstruction.
Gaussian	PET	GR-Diffusion	2026	-	A whole-body PET reconstruction framework that combines explicit 3D Gaussian representation with diffusion-based refinement to improve global consistency and local detail recovery.
Gaussian	US	UltraGauss	2025	-	An ultrasound-specific Gaussian splatting framework for fast 3D ultrasound volume reconstruction, modeling probe-plane intersections to better match acoustic image formation.
Gaussian	US	UltraGS	2025	-	A depth-aware Gaussian splatting framework for ultrasound novel view synthesis, incorporating ultrasound-specific rendering effects such as attenuation, reflection, and scattering.
Gaussian	US	UltraG-Ray	2026	-	A physics-based ultrasound rendering method built on a learnable 3D Gaussian field, designed for realistic novel-view B-mode synthesis.
Simplex mesh	PET	Lesonen et al.	2024	-	A PET reconstruction method using anatomy-guided multi-resolution triangular meshes to achieve a favorable balance between accuracy and computational cost.

Implicit neural representations

Representation	Modality	Paper	Year	Code	Why it matters
Coord. field	CT	Reed et al.	2021	-	An early medical INR method that reconstructs dynamic CT from limited views by combining a continuous neural field with parametric motion fields.
Coord. field	CT	NAF	2022	Code	A neural attenuation field method for sparse-view CBCT reconstruction, representing attenuation coefficients as a continuous function of 3D coordinates.
Coord. field	CT	DIF-Net	2023	Code	A deep intensity field method for extremely sparse-view CBCT reconstruction, learning a continuous coordinate-to-intensity representation from sparse projections.
Coord. field	CT	PINER	2023	Code	A prior-informed implicit neural representation method for test-time adaptation in sparse-view CT reconstruction.
Coord. field	CT	FACT	2025	-	A fast sparse-view CBCT reconstruction method using a meta-learned neural attenuation field and hash-encoding regularization.
Coord. field	MRI	IMJENSE	2023	-	A scan-specific INR framework for parallel MRI that jointly represents the image and coil sensitivity maps as continuous functions.
Coord. field	MRI	NeSVoR	2023	Code	An implicit neural representation method for slice-to-volume MRI reconstruction, recovering continuous fetal or neonatal MRI volumes from motion-corrupted slices.
Coord. field	MRI	Spatiotemporal INR for dynamic MRI	2025	Code	A dynamic MRI reconstruction method that models the target sequence as a spatiotemporal coordinate field without requiring external training labels.
Coord. field	PET	IMREPET	2025	-	An unsupervised dynamic PET reconstruction method that parameterizes PET activity as a continuous spatiotemporal implicit neural representation.
Coord. field	US	ImplicitVol	2021	-	A sensorless 3D ultrasound reconstruction method using deep implicit representation to learn a coordinate-to-intensity ultrasound volume.
Coord. field	CT/MRI	NeRP	2022	Code	A prior-embedded INR framework that maps spatial coordinates to image intensity values and uses prior images to improve sparse CT and MRI reconstruction.
Radiance field	CT	MedNeRF	2022	Code	A medical neural radiance field framework that reconstructs 3D-aware CT projections from single-view or few-view X-rays.
Radiance field	CT	ACNeRF	2024	-	A medical NeRF method that improves single-view X-ray reconstruction by introducing alignment and pose correction for more accurate novel-view synthesis.
Radiance field	CT	UMedNeRF	2024	-	An uncertainty-aware medical NeRF method for single-view volumetric rendering and CT projection synthesis.
Radiance field	CT	VolumeNeRF	2024	Code	A CT volume reconstruction method from a single projection view, integrating anatomical priors, projection attention, and Lambert-Beer law-based rendering.
Radiance field	CT	SAX-NeRF	2024	Code	A structure-aware sparse-view X-ray 3D reconstruction method using radiance-field modeling with local and global ray sampling.
Radiance field	MRI	CuNeRF	2023	Code	A cube-based NeRF framework for zero-shot arbitrary-scale medical image super-resolution and free-view synthesis in CT and MRI.
Radiance field	MRI	Brain MRI NeRF reconstruction	2023	-	A NeRF-based method for reconstructing 3D-aware brain MRI representations from 2D MRI scans.
Radiance field	US	Ultra-NeRF	2024	-	A physics-aware ultrasound NeRF that models attenuation, reflection, and scattering to synthesize view-dependent B-mode images from tracked 2D ultrasound scans.
Radiance field	US	UlRe-NeRF	2024	-	A 3D ultrasound neural rendering method that incorporates ultrasound reflection direction parameterization for improved view-dependent synthesis.
Radiance field	US	NeRF-US	2024	Code	A neural radiance field framework designed to reduce ultrasound imaging artifacts and improve 3D ultrasound reconstruction in real-world settings.

Challenges and Opportunities

Transfer learning

No.	Title	Pub	Abstract	Code
1	Transfer learning in medical image segmentation: New insights from analysis of the dynamics of model parameters and learned representations	AIM(2021)	Click We present a critical assessment of the role of transfer learning in training fully convolutional networks (FCNs) for medical image segmentation. We first show that although transfer learning reduces the training time on the target task, improvements in segmentation accuracy are highly task/data-dependent. Large improvements are observed only when the segmentation task is more challenging and the target training data is smaller. We shed light on these observations by investigating the impact of transfer learning on the evolution of model parameters and learned representations. We observe that convolutional filters change little during training and still look random at convergence. We further show that quite accurate FCNs can be built by freezing the encoder section of the network at random values and only training the decoder section. At least for medical image segmentation, this finding challenges the common belief that the encoder section needs to learn data/task-specific representations. We examine the evolution of FCN representations to gain a deeper insight into the effects of transfer learning on the training dynamics. Our analysis shows that although FCNs trained via transfer learning learn different representations than FCNs trained with random initialization, the variability among FCNs trained via transfer learning can be as high as that among FCNs trained with random initialization. Moreover, feature reuse is not restricted to the early encoder layers; rather, it can be more significant in deeper layers. These findings offer new insights and suggest alternative ways of training FCNs for medical image segmentation.	❌
2	Transfusion: Understanding transfer learning for medical imaging	arxiv(2019)	Click Transfer learning from natural image datasets, particularly ImageNet, using standard large models and corresponding pretrained weights has become a de-facto method for deep learning applications to medical imaging. However, there are fundamental differences in data sizes, features and task specifications between natural image classification and the target medical tasks, and there is little understanding of the effects of transfer. In this paper, we explore properties of transfer learning for medical imaging. A performance evaluation on two large scale medical imaging tasks shows that surprisingly, transfer offers little benefit to performance, and simple, lightweight models can perform comparably to ImageNet architectures. Investigating the learned representations and features, we find that some of the differences from transfer learning are due to the over-parametrization of standard models rather than sophisticated feature reuse. We isolate where useful feature reuse occurs, and outline the implications for more efficient model exploration. We also explore feature independent benefits of transfer arising from weight scalings.	❌
3	Deep Learning- and Transfer Learning-Based Super Resolution Reconstruction from Single Medical Image	J HEALTHC ENG(2017)	Click Medical images play an important role in medical diagnosis and research. In this paper, a transfer learning- and deep learning-based super resolution reconstruction method is introduced. The proposed method contains one bicubic interpolation template layer and two convolutional layers. The bicubic interpolation template layer is prefixed by mathematics deduction, and two convolutional layers learn from training samples. For saving training medical images, a SIFT feature-based transfer learning method is proposed. Not only can medical images be used to train the proposed method, but also other types of images can be added into training dataset selectively. In empirical experiments, results of eight distinctive medical images show improvement of image quality and time reduction. Further, the proposed method also produces slightly sharper edges than other deep learning approaches in less time and it is projected that the hybrid architecture of prefixed template layer and unfixed hidden layers has potentials in other applications.	❌
4	Assessment of the generalization of learned image reconstruction and the potential for transfer learning	Magn Reson Med(2019)	Click PurposeAlthough deep learning has shown great promise for MR image reconstruction, an open question regarding the success of this approach is the robustness in the case of deviations between training and test data. The goal of this study is to assess the influence of image contrast, SNR, and image content on the generalization of learned image reconstruction, and to demonstrate the potential for transfer learning.MethodsReconstructions were trained from undersampled data using data sets with varying SNR, sampling pattern, image contrast, and synthetic data generated from a public image database. The performance of the trained reconstructions was evaluated on 10 in vivo patient knee MRI acquisitions from 2 different pulse sequences that were not used during training. Transfer learning was evaluated by fine-tuning baseline trainings from synthetic data with a small subset of in vivo MR training data.ResultsDeviations in SNR between training and testing led to substantial decreases in reconstruction image quality, whereas image contrast was less relevant. Trainings from heterogeneous training data generalized well toward the test data with a range of acquisition parameters. Trainings from synthetic, non-MR image data showed residual aliasing artifacts, which could be removed by transfer learning–inspired fine-tuning.ConclusionThis study presents insights into the generalization ability of learned image reconstruction with respect to deviations in the acquisition settings between training and testing. It also provides an outlook for the potential of transfer learning to fine-tune trainings to a particular target application using only a small number of training cases.	❌

Edge computing

No.	Title	Pub	Abstract	Code
1	Local edge computing for radiological image reconstruction and computer-assisted detection: A feasibility study	FinJeHeW(2023)	Click Computational requirements for data processing at different stages of the radiology value chain are increasing. Cone beam computed tomography (CBCT) is a diagnostic imaging technique used in dental and extremity imaging, involving a highly demanding image reconstruction task. In turn, artificial intelligence (AI) assisted diagnostics are becoming increasingly popular, thus increasing the use of computation resources. Furthermore, the need for fully independent imaging units outside radiology departments and with remotely performed diagnostics emphasize the need for wireless connectivity between the imaging unit and hospital infrastructure. In this feasibility study, we propose an approach based on a distributed edge-cloud computing platform, consisting of small-scale local edge nodes, edge servers with traditional cloud resources to perform data processing tasks in radiology. We are interested in the use of local computing resources with Graphics Processing Units (GPUs), in our case Jetson Xavier NX, for hosting the algorithms for two use-cases, namely image reconstruction in cone beam computed tomography and AI-assisted cancer detection from mammographic images. Particularly, we wanted to determine the technical requirements for local edge computing platform for these two tasks and whether CBCT image reconstruction and breast cancer detection tasks are possible in a diagnostically acceptable time frame. We validated the use-cases and the proposed edge computing platform in two stages. First, the algorithms were validated use-case-wise by comparing the computing performance of the edge nodes against a reference setup (regular workstation). Second, we performed qualitative evaluation on the edge computing platform by running the algorithms as nanoservices. Our results, obtained through real-life prototyping, indicate that it is possible and technically feasible to run both reconstruction and AI-assisted image analysis functions in a diagnostically acceptable computing time. Furthermore, based on the qualitative evaluation, we confirmed that the local edge computing capacity can be scaled up and down during runtime by adding or removing edge devices without the need for manual reconfigurations. We also found all previously implemented software components to be transferable as such. Overall, the results are promising and help in developing future applications, e.g., in mobile imaging scenarios, where such a platform is beneficial.	❌
2	3D Remote Healthcare for Noisy CT Images in the Internet of Things Using Edge Computing	IEEE(2021)	Click Edge computing can provide many key functions without connecting to centralized servers, which enables remote areas to obtain real-time medical diagnoses. The combination of edge computing and Internet of things (IoT) devices can send remote patient data to the hospital, which will help to more effectively address long-term or chronic diseases. CT images are widely used in the diagnosis of clinical diseases, and their characteristics are an important basis for pathological diagnosis. In the CT imaging process, speckle noise is caused by the interference of ultrasound on human tissues, and its component information is complex. To solve these problems, we propose a 3D reconstruction method for noisy CT images in the IoT using edge computing. First, we propose a multi-stage feature extraction generative adversarial network (MF-GAN) denoising algorithm. The generator of MF-GAN adopts the multi-stage feature extraction, which can ensure the reconstruction of the image texture and edges. Second, we apply the denoised images generated from the MF-GAN method to perform the 3D reconstruction. A marching cube (MC) algorithm based on regional growth and trilinear interpolation (RGT-MC) is proposed. With the idea of regional growth, all voxels containing iso-surfaces are selected and calculated, which accelerates the reconstruction efficiency. The intersection point of the voxel and iso-surface is calculated by the trilinear interpolation algorithm, which effectively improves the reconstruction accuracy. The experimental results show that MF-GAN has a better denoising effect than other algorithms. Compared to other representative 3D algorithms, the RGT-MC algorithm greatly improves the efficiency and precision.	❌
3	A collaborative inference strategy for medical image diagnosis in mobile edge computing environment	PeerJ Comput Sci(2025)	Click Purpose: Although deep learning has shown great promise for MR image reconstruction, an open question regarding the success of this approach is the robustness in the case of deviations between training and test data. The goal of this study is to assess the influence of image contrast, SNR, and image content on the generalization of learned image reconstruction, and to demonstrate the potential for transfer learning.Methods: Reconstructions were trained from undersampled data using data sets with varying SNR, sampling pattern, image contrast, and synthetic data generated from a public image database. The performance of the trained reconstructions was evaluated on 10 in vivo patient knee MRI acquisitions from 2 different pulse sequences that were not used during training. Transfer learning was evaluated by fine-tuning baseline trainings from synthetic data with a small subset of in vivo MR training data.Results: Deviations in SNR between training and testing led to substantial decreases in reconstruction image quality, whereas image contrast was less relevant. Trainings from heterogeneous training data generalized well toward the test data with a range of acquisition parameters. Trainings from synthetic, non-MR image data showed residual aliasing artifacts, which could be removed by transfer learning-inspired fine-tuning.Conclusion: This study presents insights into the generalization ability of learned image reconstruction with respect to deviations in the acquisition settings between training and testing. It also provides an outlook for the potential of transfer learning to fine-tune trainings to a particular target application using only a small number of training cases.	❌

Unsupervised learning

No.	Title	Pub	Abstract	Code
1	Noise Characteristics Modeled Unsupervised Network for Robust CT Image Reconstruction	IEEE TMI（2022）	Click Deep learning (DL)-based methods show great potential in computed tomography (CT) imaging field. The DL-based reconstruction methods are usually evaluated on the training and testing datasets which are obtained from the same distribution, i.e., the same CT scan protocol (i.e., the region setting, kVp, mAs, etc.). In this work, we focus on analyzing the robustness of the DL-based methods against protocol-specific distribution shifts (i.e., the training and testing datasets are from different region settings, different kVp settings, or different mAs settings, respectively). The results show that the DL-based reconstruction methods are sensitive to the protocol-specific perturbations which can be attributed to the noise distribution shift between the training and testing datasets. Based on these findings, we presented a low-dose CT reconstruction method using an unsupervised strategy with the consideration of noise distribution to address the issue of protocol-specific perturbations. Specifically, unpaired sinogram data is enrolled into the network training, which represents unique information for specific imaging protocol, and a Gaussian mixture model (GMM) is introduced to characterize the noise distribution in CT images. It can be termed as GMM based unsupervised CT reconstruction network (GMM-unNet) method. Moreover, an expectation-maximization algorithm is designed to optimize the presented GMM-unNet method . Extensive experiments are performed on three datasets from different scan protocols, which demonstrate that the presented GMM-unNet method outperforms the competing methods both qualitatively and quantitatively.	❌
2	ENSURE: A General Approach for Unsupervised Training of Deep Image Reconstruction Algorithms	IEEE TMI(2023)	Click Image reconstruction using deep learning algorithms offers improved reconstruction quality and lower reconstruction time than classical compressed sensing and model-based algorithms. Unfortunately, clean and fully sampled ground-truth data to train the deep networks is often unavailable in several applications, restricting the applicability of the above methods. We introduce a novel metric termed the ENsemble Stein’s Unbiased Risk Estimate (ENSURE) framework, which can be used to train deep image reconstruction algorithms without fully sampled and noise-free images. The proposed framework is the generalization of the classical SURE and GSURE formulation to the setting where the images are sampled by different measurement operators, chosen randomly from a set. We evaluate the expectation of the GSURE loss functions over the sampling patterns to obtain the ENSURE loss function. We show that this loss is an unbiased estimate for the true mean-square error, which offers a better alternative to GSURE, which only offers an unbiased estimate for the projected error. Our experiments show that the networks trained with this loss function can offer reconstructions comparable to the supervised setting. While we demonstrate this framework in the context of MR image recovery, the ENSURE framework is generally applicable to arbitrary inverse problems.	❌
3	Structure-Aware Sparse-View X-ray 3D Reconstruction	CVPR(2024)	Click X-ray, known for its ability to reveal internal structures of objects, is expected to provide richer information for 3D reconstruction than visible light. Yet, existing neural radiance fields (NeRF) algorithms overlook this important nature of X-ray, leading to their limitations in capturing structural contents of imaged objects. In this paper, we propose a framework, Structure-Aware X-ray Neural Radiodensity Fields (SAX-NeRF), for sparse-view X-ray 3D reconstruction. Firstly, we design a Line Segment-based Transformer (Lineformer) as the backbone of SAX-NeRF. Linefomer captures internal structures of objects in 3D space by modeling the dependencies within each line segment of an X-ray. Secondly, we present a Masked Local-Global (MLG) ray sampling strategy to extract contextual and geometric information in 2D projection. Plus, we collect a larger-scale dataset X3D covering wider X-ray applications. Experiments on X3D show that SAX-NeRF surpasses previous NeRF-based methods by 12.56 and 2.49 dB on novel view synthesis and CT reconstruction. Code, models, and data are released at this https URL	🔗
4	Radiative Gaussian Splatting for Efficient X-ray Novel View Synthesis	ECCV(2024)	Click X-ray is widely applied for transmission imaging due to its stronger penetration than natural light. When rendering novel view X-ray projections, existing methods mainly based on NeRF suffer from long training time and slow inference speed. In this paper, we propose a 3D Gaussian splatting-based framework, namely X-Gaussian, for X-ray novel view synthesis. Firstly, we redesign a radiative Gaussian point cloud model inspired by the isotropic nature of X-ray imaging. Our model excludes the influence of view direction when learning to predict the radiation intensity of 3D points. Based on this model, we develop a Differentiable Radiative Rasterization (DRR) with CUDA implementation. Secondly, we customize an Angle-pose Cuboid Uniform Initialization (ACUI) strategy that directly uses the parameters of the X-ray scanner to compute the camera information and then uniformly samples point positions within a cuboid enclosing the scanned object. Experiments show that our X-Gaussian outperforms state-of-the-art methods by 6.5 dB while enjoying less than 15% training time and over 73x inference speed. The application on sparse-view CT reconstruction also reveals the practical values of our method. Code is publicly available at this https URL . A video demo of the training process visualization is at this https URL .	🔗

Federated learning

No.	Title	Pub	Abstract	Code
1	Specificity-preserving federated learning for MR image reconstruction	IEEE TMI(2022)	Click Federated learning (FL) can be used to improve data privacy and efficiency in magnetic resonance (MR) image reconstruction by enabling multiple institutions to collaborate without needing to aggregate local data. However, the domain shift caused by different MR imaging protocols can substantially degrade the performance of FL models. Recent FL techniques tend to solve this by enhancing the generalization of the global model, but they ignore the domain-specific features, which may contain important information about the device properties and be useful for local reconstruction. In this paper, we propose a specificity-preserving FL algorithm for MR image reconstruction (FedMRI). The core idea is to divide the MR reconstruction model into two parts: a globally shared encoder to obtain a generalized representation at the global level, and a client-specific decoder to preserve the domain-specific properties of each client, which is important for collaborative reconstruction when the clients have unique distribution. Such scheme is then executed in the frequency space and the image space respectively, allowing exploration of generalized representation and client-specific properties simultaneously in different spaces. Moreover, to further boost the convergence of the globally shared encoder when a domain shift is present, a weighted contrastive regularization is introduced to directly correct any deviation between the client and server during optimization. Extensive experiments demonstrate that our FedMRI’s reconstructed results are the closest to the ground-truth for multi-institutional data, and that it outperforms state-of-the-art FL methods.	🔗
2	Federated learning of generative image priors for MRI reconstruction	IEEE TMI(2022)	Click Multi-institutional efforts can facilitate training of deep MRI reconstruction models, albeit privacy risks arise during cross-site sharing of imaging data. Federated learning (FL) has recently been introduced to address privacy concerns by enabling distributed training without transfer of imaging data. Existing FL methods employ conditional reconstruction models to map from undersampled to fully-sampled acquisitions via explicit knowledge of the accelerated imaging operator. Since conditional models generalize poorly across different acceleration rates or sampling densities, imaging operators must be fixed between training and testing, and they are typically matched across sites. To improve patient privacy, performance and flexibility in multi-site collaborations, here we introduce Federated learning of Generative IMage Priors (FedGIMP) for MRI reconstruction. FedGIMP leverages a two-stage approach: cross-site learning of a generative MRI prior, and prior adaptation following injection of the imaging operator. The global MRI prior is learned via an unconditional adversarial model that synthesizes high-quality MR images based on latent variables. A novel mapper subnetwork produces site-specific latents to maintain specificity in the prior. During inference, the prior is first combined with subject-specific imaging operators to enable reconstruction, and it is then adapted to individual cross-sections by minimizing a data-consistency loss. Comprehensive experiments on multi-institutional datasets clearly demonstrate enhanced performance of FedGIMP against both centralized and FL methods based on conditional models.	❌

Domain adaptation

No.	Title	Pub	Abstract	Code
1	Deep learning with domain adaptation for accelerated projection-reconstruction MR	MAGN RESON MED（2018）	Click Purpose: The radial k-space trajectory is a well-established sampling trajectory used in conjunction with magnetic resonance imaging. However, the radial k-space trajectory requires a large number of radial lines for high-resolution reconstruction. Increasing the number of radial lines causes longer acquisition time, making it more difficult for routine clinical use. On the other hand, if we reduce the number of radial lines, streaking artifact patterns are unavoidable. To solve this problem, we propose a novel deep learning approach with domain adaptation to restore high-resolution MR images from under-sampled k-space data.Methods: The proposed deep network removes the streaking artifacts from the artifact corrupted images. To address the situation given the limited available data, we propose a domain adaptation scheme that employs a pre-trained network using a large number of x-ray computed tomography (CT) or synthesized radial MR datasets, which is then fine-tuned with only a few radial MR datasets.Results: The proposed method outperforms existing compressed sensing algorithms, such as the total variation and PR-FOCUSS methods. In addition, the calculation time is several orders of magnitude faster than the total variation and PR-FOCUSS this http URL, we found that pre-training using CT or MR data from similar organ data is more important than pre-training using data from the same modality for different organ.Conclusion: We demonstrate the possibility of a domain-adaptation when only a limited amount of MR data is available. The proposed method surpasses the existing compressed sensing algorithms in terms of the image quality and computation time.	🔗

Test-Time Training

No.	Title	Pub	Abstract	Code
1	Test-Time Model Adaptation for Image Reconstruction Using Self-supervised Adaptive Layers	ECCV(2024)	Click Image reconstruction from incomplete measurements is a basic task in medical imaging. While supervised deep learning proves to be a powerful tool for image reconstruction, it demands a substantial number of latent images for training. To extend the application of deep learning to medical imaging where collecting latent images poses challenges, this paper introduces an self-supervised test-time adaptation approach. The proposed approach leverages a pre-trained model on an external dataset and efficiently adapts it to each test sample for optimal generalization performance. Model adaption for an unrolling network is done with additional lightweight adaptive linear layers, enabling efficient alignment of testing samples with the distribution targeted in the pre-trained model. This approach is inspired by the connection between linear convolutional layer and Wiener filtering. Extensive experiments showed significant performance gain of the proposed method over other unsupervised methods and model adaptation techniques in two medical imaging tasks.	❌

Homomorphic encryption

No.	Title	Pub	Abstract	Code
1	A Secure and High Visual-Quality Framework for Medical Images by Contrast-Enhancement Reversible Data Hiding and Homomorphic Encryption	IEEE Access(2019)	Click Medical data security is facing great challenges in medical applications due to the open internet and the semi-trusted cloud. For the sake of privacy protection and the security of medical images, this paper proposes a secure and high visual-quality framework for medical images. In this framework, a novel reversible data hiding (RDH) based on lesion extraction embeds privacy data into medical images for privacy protection and image quality improvement, homomorphic encryption based on chaotic map encrypts images for medical image security. The experiments have shown that the proposed framework increases the security of medical data and improves the visual quality of medical images significantly. The proposed RDH in this framework outperforms the other RDH methods with contrast enhancement and the proposed encryption scheme increases image security well.	❌
2	Application of homomorphic encryption in medical imaging	Arxiv(2021)	Click In this technical report, we explore the use of homomorphic encryption (HE) in the context of training and predicting with deep learning (DL) models to deliver strict \textit{Privacy by Design} services, and to enforce a zero-trust model of data governance. First, we show how HE can be used to make predictions over medical images while preventing unauthorized secondary use of data, and detail our results on a disease classification task with OCT images. Then, we demonstrate that HE can be used to secure the training of DL models through federated learning, and report some experiments using 3D chest CT-Scans for a nodule detection task.	❌

No.	Title	Pub	Abstract	Code
1	Privacy-preserving image retrieval for medical IoT systems: A blockchain-based approach	IEEE Network(2019)	Click With the advent of medical IoT devices, the types and volumes of medical images have significantly increased. Retrieving of medical images is of great importance to facilitate disease diagnosis and improve treatment efficiency. However, it may raise privacy concerns from individuals, since medical images contain patients' sensitive and private information. Existing studies on retrieval of medical data either fail to protect sensitive information of medical images or are limited to a single image data provider. In this article, we propose a blockchain-based system for medical image retrieval with privacy protection. We first describe the typical scenarios of medical image retrieval and summarize the corresponding requirements in system design. Using the emerging blockchain techniques, we present the layered architecture and threat model of the proposed system. In order to accommodate large-size images with storage-constrained blocks, we capture a carefully selected feature vector from each medical image and design a customized transaction structure, which protects the privacy of medical images and image features. We also discuss the challenges and opportunities of future research.	❌

Model Interpretability

No.	Title	Pub	Abstract	Code
1	Deep learning for case-based reasoning through prototypes: A neural network that explains its predictions	AAAI(2018)	Click Deep neural networks are widely used for classification. These deep models often suffer from a lack of interpretability -- they are particularly difficult to understand because of their non-linear nature. As a result, neural networks are often treated as "black box" models, and in the past, have been trained purely to optimize the accuracy of predictions. In this work, we create a novel network architecture for deep learning that naturally explains its own reasoning for each prediction. This architecture contains an autoencoder and a special prototype layer, where each unit of that layer stores a weight vector that resembles an encoded training input. The encoder of the autoencoder allows us to do comparisons within the latent space, while the decoder allows us to visualize the learned prototypes. The training objective has four terms: an accuracy term, a term that encourages every prototype to be similar to at least one encoded input, a term that encourages every encoded input to be close to at least one prototype, and a term that encourages faithful reconstruction by the autoencoder. The distances computed in the prototype layer are used as part of the classification process. Since the prototypes are learned during training, the learned network naturally comes with explanations for each prediction, and the explanations are loyal to what the network actually computes.	🔗
2	Grad-cam: Visual explanations from deep networks via gradient-based localization	IJCV(2019)	Click We propose a technique for producing "visual explanations" for decisions from a large class of CNN-based models, making them more transparent. Our approach - Gradient-weighted Class Activation Mapping (Grad-CAM), uses the gradients of any target concept, flowing into the final convolutional layer to produce a coarse localization map highlighting important regions in the image for predicting the concept. Grad-CAM is applicable to a wide variety of CNN model-families: (1) CNNs with fully-connected layers, (2) CNNs used for structured outputs, (3) CNNs used in tasks with multimodal inputs or reinforcement learning, without any architectural changes or re-training. We combine Grad-CAM with fine-grained visualizations to create a high-resolution class-discriminative visualization and apply it to off-the-shelf image classification, captioning, and visual question answering (VQA) models, including ResNet-based architectures. In the context of image classification models, our visualizations (a) lend insights into their failure modes, (b) are robust to adversarial images, (c) outperform previous methods on localization, (d) are more faithful to the underlying model and (e) help achieve generalization by identifying dataset bias. For captioning and VQA, we show that even non-attention based models can localize inputs. We devise a way to identify important neurons through Grad-CAM and combine it with neuron names to provide textual explanations for model decisions. Finally, we design and conduct human studies to measure if Grad-CAM helps users establish appropriate trust in predictions from models and show that Grad-CAM helps untrained users successfully discern a 'stronger' nodel from a 'weaker' one even when both make identical predictions. Our code is available at this https URL, along with a demo at this http URL, and a video at this http URL.	🔗
3	Axiomatic attribution for deep networks	Arxiv(2017)	Click We study the problem of attributing the prediction of a deep network to its input features, a problem previously studied by several other works. We identify two fundamental axioms---Sensitivity and Implementation Invariance that attribution methods ought to satisfy. We show that they are not satisfied by most known attribution methods, which we consider to be a fundamental weakness of those methods. We use the axioms to guide the design of a new attribution method called Integrated Gradients. Our method requires no modification to the original network and is extremely simple to implement; it just needs a few calls to the standard gradient operator. We apply this method to a couple of image models, a couple of text models and a chemistry model, demonstrating its ability to debug networks, to extract rules from a network, and to enable users to engage with models better.	❌
4	Deep unfolding network with spatial alignment for multi-modal mri reconstruction	MIA(2025)	Click Multi-modal Magnetic Resonance Imaging (MRI) offers complementary diagnostic information, but some modalities are limited by the long scanning time. To accelerate the whole acquisition process, MRI reconstruction of one modality from highly under-sampled k-space data with another fully-sampled reference modality is an efficient solution. However, the misalignment between modalities, which is common in clinic practice, can negatively affect reconstruction quality. Existing deep learning-based methods that account for inter-modality misalignment perform better, but still share two main common limitations: (1) The spatial alignment task is not adaptively integrated with the reconstruction process, resulting in insufficient complementarity between the two tasks; (2) the entire framework has weak interpretability. In this paper, we construct a novel Deep Unfolding Network with Spatial Alignment, termed DUN-SA, to appropriately embed the spatial alignment task into the reconstruction process. Concretely, we derive a novel joint alignment-reconstruction model with a specially designed aligned cross-modal prior term. By relaxing the model into cross-modal spatial alignment and multi-modal reconstruction tasks, we propose an effective algorithm to solve this model alternatively. Then, we unfold the iterative stages of the proposed algorithm and design corresponding network modules to build DUN-SA with interpretability. Through end-to-end training, we effectively compensate for spatial misalignment using only reconstruction loss, and utilize the progressively aligned reference modality to provide inter-modality prior to improve the reconstruction of the target modality. Comprehensive experiments on four real datasets demonstrate that our method exhibits superior reconstruction performance compared to state-of-the-art methods.	❌

Contributing

Help make this list more useful:

Add missing official paper pages, arXiv pages, project pages, and GitHub repositories.
Prefer official code links from authors over third-party reimplementations.
Keep entries concise and reconstruction-focused.
Open a PR with evidence when correcting metadata.

Suggested entry format:

| Representation | Modality | Paper | Year | Code | Why it matters |
| --- | --- | --- | --- | --- | --- |
| Representation name | Modality name | [Paper Name](paper-url) | YYYY | [Code](repo-url) or - | One-sentence reconstruction-focused summary. |

Citation

@article{yang2025explicit,
  title={Explicit and implicit representations in AI-based 3D reconstruction for radiology: a systematic review},
  author={Yang, Yuezhe and Yang, Boyu and Wang, Yaqian and He, Yang and Dong, Xingbo and Jin, Zhe},
  journal={arXiv preprint arXiv:2504.11349},
  year={2025}
}