Introduction

Cardiovascular diseases (CVD) such as coronary artery disease, heart failure, hypertension, stroke, and arrhythmias are among the leading cause of death in individuals [1]. A WHO factsheet of 2021 states that CVDs account for nearly 32% of all deaths [2]. While genetic conditions may not be treated completely, detecting these diseases at the early stages allows for effective management and treatment. An insight into the internal structures is necessary to treat and diagnose CVD. Cardiac Computed Tomography (CT) Cardiovascular Magnetic Resonance Imaging (MRI) [3] and the coronary angiography are widely employed imaging modalities for structural and functions assessment of the heart respectively. These modalities produce unprecedented high-resolution images of the heart and have become a valuable method for non-invasive evaluation of the heart. These images contain regions of interest (ROI), i.e., heart and blood vessels, along with organs in the vicinity like lungs, ribs, spine, and liver. Image segmentation and extracting the volume of interest (VOI) and shape of the anatomical structures are among the key procedures employed while treating CVD. Separating the heart and its vessels from the rest is essential for further analysis and diagnosis. It enables the computation of cardiac functional parameters like stroke volume, ejection fraction, etc., which are important in assessing cardiac function, heart failure, and congenital heart disease. Manually delineating the ROI is time-consuming, demands technical and clinical knowledge, subject to inter and intra-observer errors, and not easily reproducible. Thus, it is desirable to have automatic segmentation to ensure unbiased, reproducible, and consistent results. Fully automatic whole-heart segmentation has excellent potential for many cardiac applications [4, 5].

For review of state-of-the art techniques, the clinical and technical journals were considered from MEDLINE, PUBMED and Mesh headings databases. Traditionally, segmentation in medical images was performed by a radiologist slice by slice who manually delineates the ROI. This process is time-consuming, error-prone, subject to inter and intra-observer variability, and not reproducible [6]. The accuracy and efficiency of segmentation are paramount since they can affect the clinical decisions. One area of research focuses on hybrid methods for segmentation [4]. These methods involve the use of manual and automated methods. It leverages human interaction and expertise to provide the seed points or initial guidance. These hybrid methods have shown to improve the accuracy and efficiency in segmentation. Model fitting segmentation methods use knowledge of shape and geometry of the ROI [7]. A model is created using training images to learn the characteristics and geometry of the ROI. This model is then applied to new images where it selects the object that matches itself closely. This method is seen to provide robust results and can cope well with deformation in images [8]. There is also research in developing methods using traditional image processing techniques. These are data-driven and considered low-level segmentation algorithms. Studies have used edge detection, region growing, and thresholding techniques to segment the heart. These research efforts indicate the potential of traditional image processing techniques in achieving accurate heart segmentation. Much research has been done on the heart segmentation, but, most of these focus on segmenting just the left ventricle or myocardium, as the quantitative assessment of the left ventricle and myocardium is sufficient for diagnosing many coronary heart diseases [9].

With the rising cases of CVDs, having a fast, consistent, automated, and accurate form of segmentation becomes important. This work aim to devise a fully automatic heart segmentation from the cardiac CT images with the help of image processing techniques and exploiting the human anatomy. It also complements segmentation with a method to calculate the volumes of the extracted ROI to help for faster diagnosis. Hence the aim is defined through two objectives.

  • Develop an automated method for segmenting the heart chambers and blood vessels from CT images.

  • Compute the volumes and evaluate the accuracy of the method by comparing with the ground truths (GT).

Materials and Methods

The required Cardiac CT images in NifTII format were downloaded from ImageCHD [10] database which has sixteen types of congenital heart diseases. These images were single energy and single-phase CT acquisition. Table 1 details the image acquisition parameters. These images are captured using a SIEMENS biographTM 64 CT scanner. The image resolution was 512*512*(137–551) with a voxel size of 0.25*0.25*0.5 mm3. For data privacy, patient demography, image scanner details and clinical details were removed from the nifTII file headers. After image collection, we checked the data completeness (for missing DICOM attributes) and any cases with incomplete tag details were discarded. Certain dataset needed data pre-processing to improve the tissue contrast by reducing the quantum noise. As the quality of the image directly affects the accuracy and efficiency of the segmentation, we considered only diagnostic quality images. The dataset also includes manually delineated GT defined by experts for the whole heart segmentation. The segmentation labels were fulfilled by one radiologist, and seven substructures were labelled: LV, RV, LA, RA, Myocardium, Aorta, and Pulmonary Artery. The manual delineation (GT) was performed by 4 senior cardiologists (32-, 20-, 16-, and 14-years experiences) using MImics software. One cardiologist labelled the segmentation and other three reviewed it. To avoid bias or inter-observer variance among three opinions, the voting policy was followed. Like this, each one worked independently on 25% of the dataset in their image pool and created the GT labels. The results in this work are validated with these labels. We also have included five retrospective cases from Radiodiagnosis and imaging department, Kasturba Hospital, Manipal and there is no plans for prospective cases.

Table 1 The Cardiac CT imaging acquisition details

The block diagram of the work is shown in Fig. 1. The segmentation algorithm consists of three major steps. The first is the pre-processing of the images for contrast enhancement, calculation of statistical parameters, and removal of noise and extra artifacts. The next step is the segmentation of the heart. The knowledge of human anatomy and the intensity levels corresponding to various organs are used extensively in this step. Each of these steps is explained in detail below.

Fig. 1
figure 1

Methodology of the proposed work

Pre-processing

Pre-processing is not a mandatory step in medical image processing. It is done based on the clinical task. At the start, the background air, and artifacts like the scanning table (Fig. 2) were removed. The borders and intensity differences between the tissues are also not very prominent. This was resolved to enhance the image quality through following steps.

  • Rotate the images to have the spine face downwards to allow for more effortless exploitation of human anatomy during segmentation.

  • Contrast enhancement to enhance the intensity differences.

  • The quantum noise was reduced using adaptive median filter.

  • Computation of statistical parameters to be used during the segmentation stage.

  • Retain only the largest connected component in the volume to remove all unwanted objects (artifacts).

Fig. 2
figure 2

Dataset slice and volume a Sample slice from the CT image dataset, b volume rendering of CT images

Statistical Parameters Calculation

In second step, the statistical parameters were calculated for separating the other organs from CT images [11]. Let \(n\), \(m\), and \(p\) be the dimensions of the 3D CT scan. Let the CT volume be represented as \(f(x, y, z)\), where \(x=1:n, y=1:m, and z=1:p\). For each axial slice, the following parameters are computed (Fig. 3):

  • Mean of the pixels, \({\varvec{\mu}}({\varvec{k}})\): The image histogram shows several distinct peaks. Of these, the valley points belong to the air, and the highest belongs to bones. The mean \(\mu\) of the intensities corresponds to the air due to its predominance and can be used to separate the air and background from the rest of the image (Fig. 4b).

  • Mean intensity of the pixels with intensity greater than μ(k) in a given slice k, µsup(k): The intensity corresponding to this parameter is the interval of intensities where the blood and bone marrow are included. Masks obtained with this value as a threshold contain tissues where the oxygenated blood flows along with the bones. Thus, it is used to segment the aorta, ribs, and spine (Fig. 4c).

  • The standard deviation of intensities in the kth slice, with intensity greater than μ(k), σ(k): The threshold μsup(k) + σ(k) separates ribs from the surrounding muscles for precise segmentation. The gray level corresponding to this value is useful in separating the outer layer of bones and structures (Fig. 4d).

  • Global Mean, μglobal: The value corresponds to the intensity of muscular tissues and depends on the intensity levels of the whole CT scan. It is computed as the mean of the parameter μsup(k) plus the standard deviation of μsup(k) (Fig. 4f)

  • Noise and extra artifacts removal: The background noise and other unwanted objects must be removed (Fig. 3a, b) for effective segmentation. A series of binary masks are generated by thresholding the axial slices with the computed parameter \(\mu (k).\) This mask allows to segment the air and background from the image. Once all images are segmented, all the connected components except the largest component in the volume are discarded. This allows to refine the volume and retain just the thoracic cavity.

Fig. 3
figure 3

Image after pre-processing a single slice, b volume rendering

Fig. 4
figure 4

Thresholding the image with the statistical parameters: a Original Slice, b Thresholding with μ(k), c Thresholding with μsup(k), d Thresholding with μsup(k) + σ(k), e Thresholding with μglobal, f Thresholding with μglobal + σ(k))

These statistical parameters are chosen as the images have definitive Hounsfield Units (HU – the voxel intensity) uniformly distributed within the ROI and the intra-ROI voxels variations are too less. In the case of images with mottle also, as images are pre-processed using median operator and contrast is enhanced, the intra-ROI voxels variations is less. In these two contexts, the above-mentioned parameters helps for noise removal and keep the dataset ready for the next steps. This was tested with images acquired at 80 \({\text{kVp}}\) and 100 \({\text{kVp}}\). These parameters might not work as expected if images with less than 80 \({\text{kVp}}\) and more than 100 \({\text{kVp}}\) are considered. This was not tested due to non-availability of the datasets with these two \({\text{kVp}}\).

Segmentation

Segmentation of the heart is done in a series of steps. First, the aorta is separated from the images. Next, the ribs are removed, followed by the spine removal. Liver and heart has similar intensities; hence a separate step is necessary to separate the two. Ultimately, the heart's chambers are extracted, and the aorta is added back. The aorta is delineated first as it is closely associated with the spine and can get removed during the spine removal. Also, the knowledge of the location of the aorta helps us determine the location of the spine relative to itself. It is also the only ROI in slices with excess liver and can get removed during liver removal.

Generation of Aorta Mask

The descending aorta is seen as a circular cross-section in the axial slices when thresholded with \(\mu sup(k)+\sigma (k).\) A set of binary masks is generated by segmenting the images with the parameter μsup(k) + σ(k). The mask is then scanned for circular objects of sizes like that of the aorta. Once the circle masks are obtained for all slices, only that connected component from all the slices is retained. This component gives the location of the aorta (Fig. 5e). Active contours are used as a seed mask to refine the aorta.

Fig. 5
figure 5

The input axial CT images, intermediate and final results of segmentation

Ribs Removal

The width of the rib cage can be visualized on the anterior side due to the removal of the lungs, which helps to create a disk-like mask that encompasses the ribs (Fig. 5g and h). The rib disk mask for a slice is created as follows,

  1. 1.

    \(For \, every \, column \, of \, the \, image, \, find \, the \, indices \, of \, the \, first \, and \, last \, nonzero \, element.\)

  2. 2.

    \(Set \, the \, corresponding \, elements \, of \, a \, binary \, mask \, ribsOut \, to \, 1.\)

  3. 3.

    \(Initialize \, a \, binary \, mask \, ribsIn \, to \, ribsOut.\)

  4. 4.

    \(Erode \, the \, ribsIn \, mask \, with \, a \, disk{\text{-}}shaped \, structuring \, element \, of \, radius \, equal \, to \, the \, rib \, width \, of \, that \, slice.\)

  5. 5.

    \(The \, rib \, disk \, mask \, is \, obtained \, by \, subtracting \, ribsIn \, from \, ribsOut\)

Spine Removal

The images from the previous step now contain the heart, spine, and liver. The next challenge is to remove the spine effectively. Generation of the spine mask is done from the original processed volume and not the output of the previous step, as the output has a disconnected spine. First, the processed volume is subjected to k-means clustering [12], and only those regions corresponding to the highest intensities are retained (Fig. 5i, and 5j). This disconnects the spine from other organs by removing muscular tissues. Binary masks are generated by thresholding the images obtained with μsup(k) + σ(k). The resulting masks contain just the heart, aorta, and spine. All muscular tissues are removed, causing the spine to be detached from the heart. In cases where the heart, aorta, and spine are very close, and the muscular edge is not very prominent, they can remain connected. Since the aorta is already segmented out, we subtract the aorta masks from the above masks and morphologically open the mask to separate the heart and spine.

Liver Removal

The liver has intensities like heart and hence needs special care during removal to prevent the loss of the ROI. It is observed that only 10-20% of the slices contain the liver regions. To address the height differences from the bottom of the image, we consider the left and right halves separately and then calculate the mean of the heights obtained. The right lobe of the liver from overpowering and causing us to lose the apex of the heart, the height of the liver is calculated in 2 halves, one for the left lobe and another for the right. The mean of the two is the liver height. A threshold value corresponding to the liver regions is calculated by considering the mean of nonzero values in the bottommost nonzero slice in the volume post-spine removal. A mask corresponding to the heart is obtained by subjecting the volume to \(k{\text{-}}means\) clustering (k = 3) and retaining the regions corresponding to the highest intensities. These masks remove muscles in the vicinity and refine the volume to contain only the heart regions. The segmentation of chambers is performed employing the algorithm Isodata [13], which provides the threshold value as follows:

$$1. \, Compute \, the \, initial \, threshold \, t1 \, as \, the \, mean \, gray \, level \, of \, the \, nonzero \, pixels \, of \, the \, slice$$
$$2. \, Compute \, two \, mean \, values \, - \mu 1 \, and \, \mu 2 \, as \, the \, mean \, gray \, levels \, of \, nonzero \, pixels \, of \, the \, two \, classes \, obtained \, by \, thresholding \, the \, image \, with \, t1.$$
$$3. \, Compute \, a \, new \, threshold \, t2 \, as \, the \, mean \, of \, \mu 1 \, and \, \mu 2.$$
$$4. \, If \, the \, difference \, between \, t1 \, and \, t2 \, is \, less \, than \, 1\%, \, go \, to \, step \, 5. \, Else, \, assign \, t2 \, to \, t1 \, (t1=t2) \, and \, go \, to \, step \, 2.$$
$$5. \, Return \, the \, new \, threshold \, value \, t2.$$

Results

The results are shown in Fig. 5a–l. Due to space constraints, we could not show all the intermediate steps results on MPR slices. Only two axial slices are shown in Fig. 5a and 5d to illustrate the pre-processing and visualization. The remaining figure are the results of processed 3D volume. There is a slight variation in color in Fig. 5k and 5l as it is due to the different illumination model used in our work and in GT.

Validation

The accuracy of the segmentation results was evaluated by computing similarity metrics between the results and the GT (results in Table 2). These are the recommended [14,15,16,17] and widely used metrices. They are,

Table 2 Accuracy metrics calculated on the results
  • Dice Index (Eq. 1) value 1 indicates a perfect overlap/match between the two datasets.

    $$Dice = \frac{{2*\left| { X \cap Y } \right|}}{{\left| { X } \right| + \left| { Y} \right|}}$$
    (1)
  • Jaccard Index (Eq. 2) compares the overlap of two results. Higher value, better is segmentation.

    $$Jaccard = \frac{{\left| { X \cap Y } \right|}}{{\left| { X \cup Y} \right|}}$$
    (2)
  • Absolute Volume Difference (AVD) measures the absolute difference between the volumes of the result and the GT. The lower the AVD, the better the segmentation (Eq. 3).

    $$AVD = \frac{{\left| {V_{seg} - V_{gt} } \right|}}{{V_{gt} }}*100$$
    (3)
  • Symmetric Surface Distances (SSD) is based on the distance between the surface voxels of the two volumes. Surface voxels are defined as any voxel with at least one non-object voxel in its 8 neighbourhood. SSD is obtained by measuring the Euclidean distance for every surface voxel in the segmented volume Vs to the closest surface voxel of the reference or GT volume VR. Let \(S(SEG)\) denote the set of all surface voxels of segmentation and \(S(GT)\) denote the set of surface voxels of the GT. The shortest distance of a voxel to \(S(SEG)\) is defined as,

    $$d\left( {v_{GT} , S\left( {SEG} \right)} \right) = min_{{v_{GT} \in S\left( {GT} \right)}} (|v_{gt} - S(SEG|)$$
    (4)
  • Average Symmetric Surface Distance (ASSD) measures the average distance (in \(mm\)) between the surfaces of the GT and the results. It considers both the false positive and false negative errors. It is 0 for perfect segmentation.

    $$ASSD\left( {S_{seg} , S_{GT} } \right) = \frac{1}{{\left| {S_{seg} } \right| + \left| {S_{GT} } \right|}}*\left( {\mathop \sum \limits_{{v_{SEG} \in S\left( {SEG} \right)}} d(v_{SEG} , S(GT)) + \mathop \sum \limits_{{v_{GT} \in S\left( {GT} \right)}} d(v_{GT} , S(SEG))} \right)$$
    (5)
  • Root Mean Square Symmetric Surface Distance (RMSSD) calculates the square of the distances between surface voxels in mm (eqn 6). It gives an opinion on the distribution of the distance error. Perfect segmentation has an RMSSD value of 0.

    $$RMSSD\left( {S_{seg} , S_{GT} } \right) = \frac{1}{{\left| {S\left( {SEG} \right)} \right| + \left| {S\left( {GT} \right)} \right|}}*\left( {\mathop \sum \limits_{{v_{SEG} \in S\left( {SEG} \right)}} d(v_{SEG} , S(GT)) + \mathop \sum \limits_{{v_{GT} \in S\left( {GT} \right)}} d(v_{GT} , S(SEG))} \right)$$
    (6)
  • Maximum Symmetric Surface Distance (MSSD), also called Hausdorff distance, is determined similarly to ASSD and measured in mm. The differences between the surface voxels are computed using Euclidean distances (Eq. 7), and the maximum among these values yields MSSD. For perfect segmentation, this value is 0.

    $$MSSD\left( {S_{seg} , S_{GT} } \right) = \max \left( {\max \left( {\mathop \sum \limits_{{v_{SEG} \in S\left( {SEG} \right)}} d(v_{SEG} , S(GT))} \right),\max \left( {\mathop \sum \limits_{{v_{GT} \in S\left( {GT} \right)}} d(v_{GT} , S(SEG))} \right)} \right)$$
    (7)
  • Volumetric measure helps evaluate the changes in an organ or tissue volume over time. The volume of the extracted structure is calculated by counting the nonzero pixels in each slice, and the total sum is multiplied by the pixel size in \(x\) and \(y\) directions of patient coordinate system.

    $$Volume = \left( {Total \, nonzero \, voxels \, in \, the \, volume} \right)*size_{x} *size_{y} *size_{z}$$
    (8)

Discussion

Evaluation

The method was tested on 30 single energy and single-phase CT dataset from imageCHD [10]. This work relied on single-phase images as the aim was to segment the heart chambers as a first step. A study in [18], illustrates the benefit of multiphase CT compared to single-phase which gives better tissue details of arteries at different time during scanning (but at the cost of extra radiation to the patient [19]). Hence, time resolved multi-phase images will be considered for future work for arteries segmentation. The results were objectively validated with the GT using Eqs. 16 and subjectively by a cardiologist. A series of similarity metrics (section "Evaluation") calculated are shown in Table 2. Due to space constraints, only 10 subjects are shown. Dice coefficients towards 1.0, Jaccard index towards 100 and AVD of lower value indicates the accuracy of the predicted segmentation and the GT. Volume measurement includes voxel count in the segmented volume multiplied by their size (\(siz{e}_{x}* siz{e}_{y}*\) \(slice thickness\)). For the lower slice thickness of \(0.4{-}1.0 \, {\text{mm}}\), the results were good compared to \(2.5 {\text{mm}}\) and \(5 \mathrm{ mm}\). As observed, ASSD, RMSSD and MSSD were computationally expensive for the entire volume. Most of the scores show that the algorithm performs well. In few cases due to quantum noise (less photons during scanning) the contrast difference between the chambers and the muscle tissues was not enough and hence low accuracy was achieved. Noise reduction was not attempted from such images and retest segmentation as it was not part of the objective.

In the segmentation context, metrics such as \(Dice, Jaccard, AVD, ASSD, RMSSD\) and \(MSSD\) are ideal. Addition to subjective validation, clinicians still consider these as best statistical parameters which have been witnessed in many radiology conferences and clinical journals. Hence, the objective and subjective clinical evaluation can use these numbers and the results on 2D MPR and 3D views. Segmenting and classifying the arteries to sixteen types of CHD and providing exact measurement is the scope of future work. Table 3 shows the mean value of measurements with reference to values from Table 2.

Table 3 Summary of segmentation evaluation metrics

The correlation between the manual measurements (A) and the automated measurements (B) is plotted and quantitatively analysed through Bland-Altmon plot (BA-Plot) (Fig. 6) with the help of MedCalc Version 22.014 software. This method uses mean (\(\overline{d }\)) and standard deviations (σ) of the differences. The difference between GT and the measurement (\(y{\text{-}}axis\)) is plotted against the mean of the two measurements (\(x{\text{-}}axis\)). Fig. 6 shows the plots of comparing the DICE (row 1) and Jaccard metrics (row 2) between A and B. As we got only the dice and Jaccard values from the reference work, only these two parameters were compared and not others like AVD, ASSD, RMSSD, MSSD and volume. Difference and LoA (Limits of Agreement) calculations are based on regression analysis (Fig. 6b and e). The calculations in BA-Plot are, Average \(difference=0.04\), \(LoA = -0.05 \& 0.13\) and \(p=0.3903\) for DICE (Fig. 6a) and \(Average difference = -0.02\), \(LoA = -0.23 \& 0.19\) and \(p=0.0853\) for Jaccard (Fig. 6d). The repeatability coefficient at \(95\% CI\) for the selected \(A\) and \(B\) is \(0.11 [0.08-0.2]\) in DICE (Fig. 6c) and \(0.2 [0.14-0.35]\) in Jaccard (Fig. 6f). In this study, 95% of the data points lie within \(\pm 2\sigma\) of the mean difference which is a good agreement. These results were acceptable by the radiologist (Table 4).

Fig. 6
figure 6

Bland-Altmon plots of comparing the DICE (row 1) and Jaccard metrics (row 2) between experimental values and the GT. Graph shows the data points (10 data points) as dots. The dotted line indicates the average difference, and the dashed lines indicate the (LoA). a and d Plot of differences between method A and method B vs. the mean of the two measurements. b and e Regression line between hypothetical measurements done by method A and method B. c and f Quantile plot of two sets of quantiles

Table 4 The Bland-Altmon plot calculation for DICE and Jaccard score

Compared to [4, 6, 7], our method is automatic, correctly delineated the heart boundary without any segmentation leak problem and it also works for images acquired with slight less tube current (\(mA\)). And compared to method discussed in [9], this method segmented all four chambers correctly. The segmentation leak problem was visually inspected using roamthrough navigation (from inferior to superior direction and vice-versa in \(z{\text{-}}axis\)) by zooming the boundary on axial CT and on 3D surface rendered images.

Clinically, in cardiology applications like coronary angiography or cardiac CT, segmentation of ROI and VOI respectively plays a key role for the correct assessment of vascular structures. Specially in Myocardial ischemia due to calcium deposition on the coronary arteries surface, blood flow reduces. Calcium scoring and risk assessment are still a hot topic for the researchers. Clinical parameters of patients like comorbidities, age and family history are also part of risk scoring. There exist state-of-the-art methods in quantitative assessment of the calcium deposition but the measurement accuracy is still lacking and mostly the process is manual. Also, the arteries blockage percentage is purely subjective with inter-observer variation which would lead to clinical decision of unnecessary vascular procedure. A millimetre artery measurement variation on the axial slice could lead to variation in the block percentage. In this direction, the proposed solution has potential to segment the cardiovascular structures and clinically is significant.

Infrastructure Requirements

The work was carried out on a laptop using MATLAB 2021b with Intel core i5, 8 GB RAM, and 2 GB Iris XE Graphics card and also tested in two more configurations with AMD Ryzen 5 4600 CPU, 8 GB memory, and NVIDIA GeForce GTX 1650 GPU with 4 GB memory. In MATLAB, toolboxes of Computer Vision, Image Processing, Parallel Computing, Statistics and Machine Learning, Signal Processing, and Deep Learning (For calculation of Similarity Metrics) were used. The algorithm takes approximately 1.5 ± 0.5 \(seconds\)/CT slice, with a maximum running time of 15.5 \(min\) for 551 slices and a minimum of 2.5 \(min\) for 137 slices. Overall, the methodology defined produces good and fast results.

Limitations

This study has few imitations. Some CT images have poor or uneven illumination due to less tube voltage during image acquisition. Since the segmentation depends on the pixels, this causes the algorithm to either retain more or less of the ROI. The collar bones were present near the heart and have a similar intensity range. This causes it to be retained in most results, lowering the similarity metrics. The fine branches of the blood vessels get lost during segmentation if they don't have any common pixels with the segmented heart when added separately. In some cases, few rib bones, sternum, or spine remained in the final segmentation due to their proximity. Even if they are removed successfully during the initial stages, they may get added back during the mask expansion via active contours or during the blood vessel addition. In some cases, the heart chambers could not be separated due to the lack of clear boundaries.

Conclusion

The proposed approach has successfully delineated the heart from the CT scans. The results show an average Dice score, Jaccard score, and Hausdorff distance of 0.866, 0.776, and 33.29 mm, respectively. The result of this project has the potential to make a significant impact on patient care. After clinical validation by the doctors, the work would be integrated to the cardiac image acquisition workflow. The result of this work is aimed at helping cardiologists and interventional cardiologists obtain segmentation results faster. Accurate segmentation can help to identify and monitor various cardiovascular conditions, leading to better patient outcomes and management of heart diseases. Comparing the heart's structure and shape over time can help analyze the growth and function of the heart with age. This can help study the changes over time in case of congenital defects. Whole heart segmentation can be used to create a 3D model of the patient's heart to help plan interventional surgeries better. Scope of our future work is to further explore the clinical characteristics, VOI morphology analysis and radiomics features extraction which has great potential to still improve the scientific results. We are excited and interested to see how the method performs in case of high-resolution CT images. Also, to reduce the mottle and re-segment so that the segmentation algorithm becomes more robust.