Abstract
In this paper, we present a new dataset consisting of 19,407 X-ray images. The images are organized in a public database called \(\mathbb {GDX}\)ray that can be used free of charge, but for research and educational purposes only. The database includes five groups of X-ray images: castings, welds, baggage, natural objects and settings. Each group has several series, and each series several X-ray images. Most of the series are annotated or labeled. In such cases, the coordinates of the bounding boxes of the objects of interest or the labels of the images are available in standard text files. The size of \(\mathbb {GDX}\)ray is 3.5 GB and it can be downloaded from our website. We believe that \(\mathbb {GDX}\)ray represents a relevant contribution to the X-ray testing community. On the one hand, students, researchers and engineers can use these X-ray images to develop, test and evaluate image analysis and computer vision algorithms without purchasing expensive X-ray equipment. On the other hand, these images can be used as a benchmark in order to test and compare the performance of different approaches on the same data. Moreover, the database can be used in the training programs of human inspectors.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Avoid common mistakes on your manuscript.
1 Introduction
Public databases of X-ray images can be found for medical imaging,Footnote 1 however, to the best knowledge of the authors, up until now there have not been any public databases of digital X-ray images for X-ray testing.Footnote 2
As a service to the X-ray testing community, we collected more than 19,400 X-ray images for the development, testing and evaluation of image analysis and computer vision algorithms. The images are organized in a public database called \(\mathbb {GDX}\)ray: The Grima X-ray database.Footnote 3 In order to illustrate our database, a random selection of 70 X-ray is shown in Fig. 1. The database includes five groups of X-ray images: castings, welds, baggage, natural objects and settings. Each group has several series, and each series several X-ray images. Some samples of each series are illustrated in Fig. 2. Most of the series are annotated or labeled. In those cases, the coordinates of the bounding boxes of the objects of interest or the labels of the images are available. In Table 1 we can see some statistics. The size of \(\mathbb {GDX}\)ray is 3.5 GB and it can be downloaded from our website (see Fig. 2).
In this paper, we will view the structure of \(\mathbb {GDX}\)ray database, a description for each group (with some series examples), and some examples of applications that have been published using images of \(\mathbb {GDX}\)ray.
2 Structure of the Database
\(\mathbb {GDX}\)ray is available in a public repository. The repository contains 5 group folders one for each group: Castings, Welds, Baggage, Nature and Settings. For each group we define an initial: C, W, B, N and S respectively. As shown in Table 1, each group has several series. Each series is stored in an individual sub-folder of the corresponding group folder. The sub-folder name is Xssss, where X is the initial of the group and ssss is the number of the series. For example, the third series of group Castings is stored in sub-folder C0003 of folder Castings (see more examples in Fig. 2). The X-ray images of a series are stored in file Xssss_nnnn.png. Again Xssss is the name of the series. The number nnnn corresponds to the number of the X-ray image of this series. For example, the fifth X-ray image of series C0003 is C0003_0005.png and is stored in sub-folder Castings/C0003. The whole structure is summarized in Table 2. It is worth mentioning that all X-ray images of \(\mathbb {GDX}\)ray are stored in ‘png’ (Portable Network Graphics)Footnote 4 8-bit grayscale format. Additional metadata for each series (such as description of the objects, parameters and description of X-ray imaging system, etc.) are given in an ASCII file called Xssss_readme.txt included in sub-folder Xssss, e.g., C0003_readme.txt for series Castings/C0003.
3 Castings
The group Castings contains 2727 X-ray images arranged in 67 series. The X-ray images were taken mainly from automotive parts (aluminum wheels and knuckles) using an image intensifier. Some examples are illustrated in Figs. 3 and 4. The details of each series are given in Table 3. Experiments on these data can be found in several publications as shown in Table 4. It is interesting to highlight that series C0001 (see Fig. 3) contains not only a sequence of 72 X-ray images taken from an aluminum wheel by rotating its central axis in 5\(^0\), but also annotations of bounding boxes of the ground truth of 226 small defects and the calibration matrix of each image that relates the 3D coordinates of the aluminum wheel with 2D coordinates of the X-ray image.
4 Welds
The group Welds contains 88 images arranged in 3 series. The X-ray images were taken by the BAM Federal Institute for Materials Research and Testing, Berlin, Germany.Footnote 5 Some examples are illustrated in Fig. 5. The details of each series are given in Table 5. Experiments on these data can be found in several publications as shown in Table 6. It is interesting to highlight that series W0001 and W0002 (see Fig. 5) contains not only 10 X-ray images selected from the whole BAM database (series W0003), but also annotations of bounding boxes and the binary images of the ground truth of 641 defects.
Series W0003 contains a collection of 67 digitized radiographs from a round robin test on flaw recognition in welding seams. The NDT films (used with lead screens) were exposed according to ISO 17636-1, testing class A. After development they have been scanned with a LASER scanner LS85 SDR from Lumisys using digitization class DB-9 according to ISO 14096-2. The original 12 bit data depth was rescaled to 8 bits with a linear LUT proportional to optical film density by visual adjustment to the image content. This ensures that all necessary flaw information is still in the 8 bit images.Footnote 6 The pixel size is 40.3 micron (630 dpi). The images are 8 bit gray values. In addition, in this directory the file ‘real-values.xls’ contains the true data and the flaw designations according to ISO 6520 and ISO 5817. These true data have been generated using weld sections of 1 cm width starting from the indicated Zero point.
5 Baggage
The group Baggage contains 8150 X-ray images arranged in 77 series. The X-ray images were taken from different containers such as backpacks, pen cases, wallets, etc. Some examples are illustrated in Figs. 6, 7 and 8. The details of each series are given in Table 7. Experiments on these data can be found in several publications as shown in Table 8. It is interesting to highlight that series B0046, B0047 and B0048 (see for example Fig. 6) contains 600 X-ray images that can be used for automated detection of handguns, shuriken and razor blades (bounding boxes for these objects of interest are available as well). In this case, the training can be performed using series B0049, B0050 and B0051 that includes X-ray images of individual handguns, shuriken and razor blades respectively taken from different points of view as shown in Fig. 7.
6 Natural Objects
The group Nature contains 8290 X-ray images arranged in 13 series. The X-ray images were taken from different natural objects such as salmon filets, fruit and wood pieces. Some examples are illustrated in Figs. 9 and 10 The details of each series are given in Table 9. Experiments on these data can be found in several publications as shown in Table 10. It is interesting to highlight that series N0012 and N0013 (see Fig. 11) contains not only 6 X-ray images of salmon filets, but also annotations of bounding boxes and the binary images of the ground truth of 73 fish bones. For training proposes, there are more than 7500 labeled small crops (\(10 \times 10\) pixels), of regions of X-ray of salmon filets with and without fish bones in series N0003.
7 Settings
The group Settings contains 151 X-ray images arranged in 7 series. The X-ray images were taken from different calibration objects such checkerboards and 3D objects with regular patterns. Some examples are illustrated in Fig. 12. The details of each series are given in Table 11. Experiments on these data can be found in several publications as shown in Table 12. It is interesting to highlight that series S0001 (see Fig. 12) contains not only 18 X-ray images of a copper checkerboard, but also the calibration matrix of each view. In addition, series S0007 can be used for modeling the distortion of an image intensifier. The coordinates of each hole of the calibration pattern in each view are available, and the coordinates of the 3D model are given as well.
8 Conclusions
In this paper, we presented the details of a new public dataset called \(\mathbb {GDX}\)ray. It consists of more than 19,400 X-ray images. The database includes five groups of X-ray images: castings, welds, baggage, natural objects and settings. Each group has several series and X-ray images with many labels and annotations that can be used for training and testing purposes in computer vision algorithms. To the best knowledge of the authors, up until now there have not been any public databases of digital X-ray images for X-ray testing.
In this paper, we explained the structure of the \(\mathbb {GDX}\)ray database, we gave a description for each group (with some series examples), and we presented some examples of applications that have been published using images of \(\mathbb {GDX}\)ray.
We believe that \(\mathbb {GDX}\)ray represents a relevant contribution to the X-ray testing community. On the one hand, students, researchers and engineers can use these X-ray images to develop, test and evaluate image analysis and computer vision algorithms without purchasing expensive X-ray equipment. On the other hand, these images can be used as a benchmark in order to test and compare the performance of different approaches on the same data. Moreover, the database can be used in the training programs of human inspectors.
Notes
See for example a good collection in http://www.via.cornell.edu/databases/.
There are some galleries of X-ray images available on the web with a few samples, see for instance http://www.vidisco.com/ndt_solutions/ndt_info_center/ndt_x_ray_gallery with approximately 50 X-ray images.
Grima, from Grupo de Inteligencia de Máquina, is the name of our Machine Intelligence Group at the Department of Computer Science of the Pontificia Universidad Católica de Chile http://grima.ing.puc.cl. The X-ray images included in \(\mathbb {GDX}\)ray can be used free of charge, for research and educational purposes only. Redistribution and commercial use is prohibited. Any researcher reporting results which use this database should acknowledge the \(\mathbb {GDX}\)ray database by citing this paper.
The X-ray images of series W0001 and W0003 are included in GDXray thanks to the collaboration of the BAM Federal Institute for Materials Research and Testing, Berlin, Germany http://dir.bam.de/dir.html.
The original images in TIFF format are available also in series W0003 as files RRT01.zip (the first 31 images) and RRT02.zip (the last 36 images).
References
Carrasco, M., Mery, D.: Segmentation of welding defects using a robust algorithm. Mater. Eval. 62(11), 1142–1147 (2004)
Carrasco, M., Mery, D.: Automatic multiple view inspection using geometrical tracking and feature analysis in aluminum wheels. Mach. Vis. Appl. 22(1), 157–170 (2011)
Damashek, A., Doherty, J.: Detecting guns using parametric edge matching. Tech. Rep. Project for Computer Vision Course: CS231A, Stanford University (2015)
Ghoreyshi, A., Vidal, R., Mery, D.: Segmentation of circular casting defects using a robust algorithm. Insight-Non-Destruct. Test. Cond. Monit. 47(10), 615–617 (2005)
Hernández, S., Sáez, D., Mery, D., da Silva, R., Sequeira, M.: Automated defect detection in aluminium castings and welds using neuro-fuzzy classifiers. In: Proceedings of the 16th World Conference on Non-destructive Testing (WCNDT–2004), Montreal (2004)
Huang, Q., Wu, Y., Baruch, J., Jiang, P., Peng, Y.: A template model for defect simulation for evaluating nondestructive testing in X-radiography. IEEE Trans. Syst. Man Cybern. A: Syst. Hum. 39(2), 466–475 (2009)
Mery, D.: A new algorithm for flaw simulation in castings by superimposing projections of 3D models onto X-ray images. In: Proceedings of the XXI International Conference of the Chilean Computer Science Society (SCCC-2001), pp. 193–202. IEEE Computer Society Press, Punta Arenas (2001)
Mery, D.: Crossing line profile: a new approach to detecting defects in aluminium castings. In: Proceedings of the Scandinavian Conference on Image Analysis (SCIA 2003), Lecture Notes in Computer Science, vol. 2749, pp. 725–732 (2003)
Mery, D.: Explicit geometric model of a radioscopic imaging system. NDT & E Int. 36(8), 587–599 (2003)
Mery, D.: Automated detection in complex objects using a tracking algorithm in multiple X-ray views. In: Proceedings of the 8th IEEE Workshop on Object Tracking and Classification Beyond the Visible Spectrum (OTCBVS 2011), in Conjunction with CVPR 2011, pp. 41–48. Colorado Springs (2011)
Mery, D.: Automated detection of welding defects without segmentation. Mater. Eval. 69(6), 657–663 (2011)
Mery, D.: Inspection of complex objects using multiple-X-ray views. IEEE/ASME Trans. Mechatron. 20(1), 338–347 (2015)
Mery, D., Berti, M.A.: Automatic detection of welding defects using texture features. Insight-Non-Destruct. Test. Cond. Monit. 45(10), 676–681 (2003)
Mery, D., Carrasco, M.: Automated multiple view inspection based on uncalibrated image sequences. Lect. Notes Comput. Sci. 3540, 1238–1247 (2005)
Mery, D., Chacón, M., Munoz, L., González, L.: Automated inspection of aluminium castings using fusion strategies. Mater. Eval. 63(2), 148–153 (2005)
Mery, D., Filbert, D.: The epipolar geometry in the radioscopy: theory and application. Automatisierungstechnik 48(12), 588–596 (2000). (in German)
Mery, D., Filbert, D.: A fast non-iterative algorithm for the removal of blur caused by uniform linear motion in X-ray images. In: Proceedings of the 15th World Conference on Non-destructive Testing (WCNDT–2000), Rome (2000)
Mery, D., Filbert, D.: Automated flaw detection in aluminum castings based on the tracking of potential defects in a radioscopic image sequence. IEEE Trans. Robot. Autom. 18(6), 890–901 (2002)
Mery, D., Filbert, D.: Automated inspection of moving aluminium castings. In: 8th European Conference on Non-destructive Testing (ECNDT 2002), Barcelona (2002)
Mery, D., Hahn, D., Hitschfeld, N.: Simulation of defects in aluminum castings using CAD models of flaws and real X-ray images. Insight 47(10), 618–624 (2005)
Mery, D., Lillo, I., Riffo, V., Soto, A., Cipriano, A., Aguilera, J.: Automated fish bone detection using X-ray testing. J. Food Eng. 2011(105), 485–492 (2011)
Mery, D., Pedreschi, F., Soto, A.: Automated design of a computer vision system for visual food quality evaluation. Food Bioprocess Technol. 6(8), 2093–2108 (2013)
Mery, D., Riffo, V., Mondragon, G., Zuccar, I.: Detection of regular objects in baggages using multiple X-ray views. Insight 55(1), 16–21 (2013)
Mery, D., Riffo, V., Zuccar, I., Pieringer, C.: Automated X-ray object recognition using an efficient search algorithm in multiple views. In: Proceedings of the 9th IEEE CVPR Workshop on Perception Beyond the Visible Spectrum, Portland (2013)
Mondragón, G., Leiva, G., Aguilera, J., Mery, D.: Automated detection of softening and hard columella in kiwifruits during postharvest using X-ray testing. In: Proceedings of International Congress on Engineering and Food (2011)
Perner, P., Zscherpel, U., Jacobsen, C.: A comparison between neural networks and decision trees based on data from industrial radiographic testing. Pattern Recogn. Lett. 22(1), 47–54 (2001)
Pieringer, C., Mery, D.: Flaw detection in aluminium die castings using simultaneous combination of multiple views. Insight 52(10), 548–552 (2010)
Pizarro, L., Mery, D., Delpiano, R., Carrasco, M.: Robust automated multiple view inspection. Pattern Anal. Appl. 11(1), 21–32 (2008)
Ramírez, F., Allende, H.: Detection of flaws in aluminium castings: a comparative study between generative and discriminant approaches. Insight-Non-Destruct. Test. Cond. Monit. 55(7), 366–371 (2013)
Riffo, V., Mery, D.: Active X-ray testing of complex objects. Insight 54(1), 28–35 (2012)
da Silva, R.R., Siqueira, M.H., de Souza, M.P.V., Rebello, J.M., Calôba, L.P.: Estimated accuracy of classification of defects detected in welded joints by radiographic tests. NDT & E Int. 38(5), 335–343 (2005)
Acknowledgments
Fondecyt Grant No. 1130934 from CONICYT–Chile.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Mery, D., Riffo, V., Zscherpel, U. et al. GDXray: The Database of X-ray Images for Nondestructive Testing. J Nondestruct Eval 34, 42 (2015). https://doi.org/10.1007/s10921-015-0315-7
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s10921-015-0315-7