Introduction

The technical advances in recent decades have made many procedures, such as invasive examinations and surgeries, challenging; however, performing clinical procedures safely is crucial for both patients and clinicians. Fine-needle aspiration cytology (FNAC) is essential for the diagnosis of thyroid gland malignancy. When a lesion is detected in the thyroid gland, establishing if it requires surgery is important. Ultrasonography (US) is generally performed first, to determine whether further evaluation or FNAC is necessary.

Current clinical guidelines recommend FNAC for all solid nodules with a maximum diameter of > 2 cm on US, but it is not recommended for nodular lesions < 5 mm with no distant or lymph node metastases and low tumor marker levels [1]. FNAC is generally a safe examination [2, 3]; however, the procedure is made technically difficult by the proximity of the thyroid gland to the jugular veins, carotid arteries, and trachea. US-guided puncture, which is currently used for vascular access, drainage, and biopsy, can be performed during various steps of medical management, from diagnosis to treatment.

The advantages of FNAC include simplicity, minimal invasiveness, possible repetition, and high diagnostic accuracy, making it a widely used technique. Moreover, since imaging is obtained in real time, FNAC with US is useful for delineating the positional relationship between the target and the needle point for FNAC. However, obtaining safe, convenient, and precise access to complicated organ structures requires a high level of technical skill and expertise. Recent studies show a correlation between an increase in the rate of diagnosis of thyroid gland diseases and more training [4]. Acquiring these skills is essential for inexperienced doctors and students who have not yet decided on their area of specialization.

Within our facilities, phantom models are used to support education and training [5]. If phantoms tailored for individual patients are used for practice before clinical procedures are performed, the technique of FNAC can be mastered, which is important for medical trainees and inexperienced clinicians.

Reports have been published in this emerging field, describing fabricated phantoms of the breast [6], liver [7], uterus, and vagina [8]. However, to our knowledge, an elaborate phantom is not yet commercially available, and there have been no studies involving thyroid gland phantoms. Therefore, we constructed an elaborate phantom, taking size, touch, feel, and US views into consideration, using a hybrid of old and new manufacturing techniques, including direct three-dimensional (3D) printing and traditional mould-based manufacturing. The aim of this study was to evaluate the effectiveness of puncture training using our phantom for junior and senior residents and medical students.

Materials and methods

This study was approved by the Ethics Committee of Nagasaki University Hospital (Approval Number: 16042512). All participants provided written informed consent before participation.

Phantom fabrication procedure

The procedure used to fabricate the thyroid gland phantom was described previously [9]. First, the computed tomography data were analyzed and 3D digital reconstruction was done, with output in an STL file format. Using our department’s 3D printer (Objet 260, Stratasys No. 33201; Eden Prairie, MN, USA), we created moulds for various organs, considering the limitation of the types of material that can be printed by a 3D printer. Currently available 3D printed materials are highly reflective of ultrasound waves; thus being non-permeable, preventing transparency. For the cervical vertebrae and trachea, the high reflectivity was acceptable, but it was not acceptable for other components, so a more traditional method of manufacturing was used. The 3D printer was used to create moulds for the thyroid gland, blood vessels, tumor, and surrounding soft tissue, which were made from ultrasound-permeable materials [9]. The moulds were injected with a chemical compound that confers a similar degree of permeability when viewed ultrasonically, as the organs they represent. The material was injected into the respective moulds of the blood vessels, the thyroid gland, and the tumor. Subsequently, the moulds were refrigerated at − 4 °C for approximately 15 min to harden. The objects were then removed from the moulds and colored, the components were positioned in the neck mould, and agar gel was poured into them. After approximately 2 h of refrigeration, the phantom was ready to be removed from the mould and used for US-guided puncture training. Two models, model 1 and model 2, were created with different tumor locations in the thyroid gland (Fig. 1A, B). Model 2 has a smaller target tumor and is positioned closer to the blood vessels than model 1.

Fig. 1
figure 1

The two phantom models. A Model 1. The tumor was positioned in the right lobe of the thyroid gland. B Model 2. The tumor was positioned in the left lobe of the thyroid gland and was smaller and closer to the trachea than in model 1

Methods

We evaluated outcomes after allowing junior and senior residents and medical students to perform puncture training using this tailored thyroid gland phantom. We identified 45 US-guided thyroid nodule FNAC procedures performed with our thyroid phantom in our department between September 2015 and March 2016. Participants were classified into four groups. Group A comprised qualified physicians who specialized in breast and thyroid gland treatment (n = 10); Groups B and C comprised senior residents (n = 8) and junior residents (n = 12), respectively; and Group D comprised medical students (n = 15).

The parameters evaluated included the puncture time of the thyroid tumor, the preparation time (time needed to describe a lesion and prepare a transfixion), and the number of erroneous punctures. The procedures were measured using the major (parallel)- and short (orthogonal)-axes methods, and the same measurements were repeated after 1 month. An adapter was not used in either method during puncture. Evaluations were performed by group A using model 1. In contrast, participants in groups B, C, and D performed FNAC using a different phantom (model 2) for their first attempt (Fig. 2). Preparation time was defined as the time from placing the US probe on the phantom surface to placing the needle on the phantom surface. Puncture time was defined as the time taken to perform transfixion on the US screen and aspirate the blue liquid from inside the tumor.

Fig. 2
figure 2

Evaluation method. Participants in Groups A, B, C, and D were asked to complete the same tasks. First the average times were measured for group A, then groups B, C, and D repeated the practice method. One month later, each group was asked to perform the same tasks again using model 1 or 2

Before the first puncture training, all members received the same fundamental information about thyroid diseases, neck dissection, and management confirmation of an echo probe. The practice method was repeated by participants in groups B, C, and D using model 1 after the first puncture, while all groups used model 2 for the first time in the second puncture. The goals of the practice were as follows:

  1. 1.

    To delineate the tumor from blood vessels and identify the expected puncture location on the same screen.

  2. 2.

    To firmly and continuously visualize the needle tip from the start of the puncture until the target was reached in the long axis method.

  3. 3.

    To adjust the needle insertion angle, among other variables, so that the short axis method required as few punctures as possible.

If a participant failed to achieve each goal, they were instructed to review and understand the reasons for the failures; for example, not fixing the palm on the body surface, not moving the probe sideways during puncture, or inserting the needle in the wrong angle. The number of punctures differed, as the level of knowledge and skill varied; however, the maximum practice time was limited to 30 min. The participants were asked to complete a questionnaire after they finished their training.

Statistical analysis

The phantom was fabricated using a 3D printer (Objet 260, Stratasys, Eden Prairie, MN, USA) and puncture training was evaluated using a LOGIQ e Premium (GE Healthcare, Chalfont Saint Giles, UK) US diagnostic unit. The Student’s t test was used to compare average times between groups. Values of P < 0.05 were considered significant. MedCalc software version 11.5.1.0 (MedCalc Software, Mariakerke, Belgium) was used for statistical analysis.

Results

The preparation time increased based on the level of experience within the groups. However, the second puncture using the same phantom was significantly improved in group C (P < 0.05). Less preparation time was required for the short-axis method than for the major-axis method (Fig. 3A). Using models 1 and 2, significant improvements were observed in preparation time for all members of groups C and D (Fig. 3B). All participants in the four groups improved their puncture times by using the major- and short-axes methods for the same phantom (model 1) (Fig. 4A). When using the different phantoms (models 1 and 2), the puncture time did not change significantly in groups A and B, but it did improve significantly in groups C and D, to close to that in groups A and B (Fig. 5).

Fig. 3
figure 3

Preparation time. First attempt, model 1; second attempt, model 2. A Using the same model: first attempt, model 1; second attempt, model 1. The preparation time for the second attempt was shorter than that for the first attempt. A significant difference was observed in both methods among groups B, C, and D (P < 0.001). B When the other model was used: first attempt, model 1; second attempt, model 2

Fig. 4
figure 4

A Puncture time using model 1. Using the same phantom (model 1), reductions in time taken to perform the major-axis method were noted in all groups. A significant difference was observed in puncture times (P < 0.001). B Improvement rate for model 1. The improvement rate was defined to calculate the difference between the second attempt and the first attempt numerators and the first attempt denominator. The rate was then calculated by the time reduction ratio. The highest improvement rates were observed in group C, using the major-axis (0.58) and short-axis (0.62) methods

Fig. 5
figure 5

Puncture times using models 1 and 2. We evaluated the progress of all participants in groups B to D using phantom model 2. Using the major-axis method, a clear improvement was observed in groups B to D, contrary to group A. The greatest improvement was observed in group C. Using the shorts-axis method, the progress of all members in groups A to D was examined. The time shortening ratio was greater in groups B to D than in group A

The improvement rate was defined to identify the difference between the numerators for the first and second time practicing the procedure, and for the first time practicing the procedure. These values were measured using the ratio for time shortening. All participants in groups B, C, and D improved after using the major- (0.47 ± 0.07) and short- (0.52 ± 0.07) axes methods (P < 0.001). The highest improvement rate was observed in group C, using the major- (0.58) and short- (0.62) axes methods (Fig. 4B). After using a different phantom (model 2) during the first practice attempt, the progress of groups B to D was examined. Using the major-axis method, the mean time required for the second practice attempt (37.2 s) was longer than that required for the first practice attempt (24.5 s) for group A. Contrary to group A, significant improvements were observed in groups B, C, and D (group B, from 58.4 to 40.0 s; group C, from 160.4 to 45.6 s; and group D, from 163.3 to 69.7 s). Group C showed the most improvement. The progress of groups A to D in using the short-axis method was evaluated and the first average values of groups A, B, C, and D were 23.8, 53.8, 136.2, and 162.9 s, whereas the second average values were 20.3, 28.1, 34.5, and 45.3 s, respectively. The time shortening ratio was greater for groups B to D than for group A (Fig. 5). No significant difference was observed in preparation time or puncture time, and their relevance was not proven.

The numbers of erroneous punctures during the first practice attempt were 0, 2, 20, and 31 for groups A, B, C, and D, respectively. During the second attempt, a significant improvement was observed for the group D participants, who were three times better than with the first attempt (Table 1). The number of erroneous punctures was 31 (55%) when using the major-axis method. More vascular damage occurred when using the major-axis method, whereas more tracheal damage occurred when using the short-axis method (Table 2).

Table 1 Number of puncture errors: first attempt vs. second attempt
Table 2 Number of puncture errors: major-axis method vs. short-axis method

Discussion

Safety and accuracy are essential for all clinical procedures. FNAC is the most important diagnostic procedure used for outpatients with thyroid disease. However, achieving the required accuracy and safety can be difficult because of the complicated anatomy of the neck. Furthermore, teaching inexperienced doctors to perform FNAC in live patients is difficult because it is usually performed when the patients are awake. To address this issue, we created an original neck phantom with a thyroid tumor to train others to perform FNAC and then evaluated its effectiveness by assessing experienced and inexperienced doctors and medical students. Consequently, practicing FNAC on our phantoms helped inexperienced doctors and medical students improve the way they performed FNAC for thyroid tumors. Our data showed that practice improved the doctors’ skills, even when using the new model shape after practicing on the first model. Therefore, our neck phantom with similar US characteristics manufactured using patient data contributed to improving the FNAC competency of inexperienced doctors.

US has become the diagnostic imaging of choice for thyroid diseases because it is non-invasive and simpler than other diagnostic modalities [10]. According to an epidemiological survey, the incidence of thyroid disease is still high, with some thyroid anomaly found in 1 of 10 individuals [11]. In 2014, thyroid disease was diagnosed in approximately 86,700 patients [12]. Conversely, the number of thyroid disease specialists is insufficient despite the fact that thyroid gland morbidity has shown an upward trend in recent years [11]. Following a nuclear power accident in Japan, the importance of screening for thyroid nodules gained much attention. Therefore, more thyroid disease specialists are required and the training of these specialists is an urgent issue. To address this challenge, practicing in a laboratory using models based on patients can be helpful and effective.

The rate of detection of thyroid carcinoma using US is approximately 3.5-times higher than that using palpation [13]. Furthermore, FNAC is a simple, reliable, inexpensive, and generally safe diagnostic procedure for thyroid nodules. Local pain and minor hematomas are the most common post-FNAC complications, but severe complications are rare [2], although there are sporadic reports of severe complications after fine needle biopsy (FNB), such as uncontrolled hemorrhage or massive hematoma with airway obstruction [14,15,16,17], acute thyroid swelling after needle biopsy [18,19,20], infections [21], and tumor dissemination [22, 23]. While FNB is different from FNAC, it is challenging and associated with risk for inexperienced clinicians to perform independently because of the proximity of the thyroid to the jugular vein, carotid artery, and trachea. Moreover, US-guided puncture requires practice to achieve a certain level of skill before performing the procedure on a patient. In fact, in this study, errors were made during the procedure causing tracheal and vessel injury. In particular, resident doctors and medical students made many errors during practice; therefore, to perform their first procedure on a patient without any practice would be unacceptable. Our data showed that the number of errors during the procedure decreased dramatically after practice, indicating that our phantom could support improvements in the skill of inexperienced doctors and contribute ultimately to the safety and efficiency of this procedure.

The cost of developing a practice model is an important point to consider. Currently, 3D printers are popular in the medical field and are used to construct organ models and structures [15]. However, these printers are expensive and require a long period to create the model. To solve this issue, we used moulds to make some organs [9]. While 3D printers are needed to make the mould of the organ, after that, the cost of duplication is only about $5 (United States). Our phantoms are also elaborate because they are sonographically authentic and possess life-like elasticity, so should be considered suitable as practice models.

In our study, the puncture was performed using the major- and short-axes methods, which specialists must be adept at to perform the procedure successfully, even in patients with unusual anatomy or tumors. It has been reported that the short-axis approach takes less time than the major-axis approach when the catheter is inserted via the vessel in the neck. In our study, different types of puncture error were noted during the practice of the major and short-axes approaches. Therefore, mastering both methods will help reduce puncture errors. Our phantom is a useful tool for both methods because the US views are similar to the actual patient findings using both methods.

In conclusion, using a 3D printer, we successfully fabricated a tailored thyroid gland phantom representative of various patients, for doctors to practice transfixion. Our original phantoms improved the puncture skills of medical students and resident doctors effectively, resulting in their confidence and skill to ultimately perform safe and efficient procedures on patients.