1 Introduction

The area of interest of this study is the North Hadramaut High area, belonging to the Hadramaut government in eastern Yemen (Fig. 1a). The North Hadramaut High is a regional subsurface high and separates the Masila and Jiza-Qamar Basins in the south and Rub' Al-Khali Basin in the north (Fig. 1a). To date, the North Hadramaut High represents a grossly underexplored area, where limited exploration wells have been drilled by multiple oil companies (Fig. 1b). Oil stain samples were discovered in the clastic and carbonate rocks of the Late Cretaceous Harshiyat Formation encountered in two exploration wells (Thamoud-01 and Tarfayt-02), as shown in Fig. 1b.

Fig. 1
figure 1

a Map showing the locations of the sedimentary basins in Yemen, including the North Hadramaut High area of eastern Yemen (after Hakimi et al., 2020). b Location map of the exploration wells in the North Hadramaut High area, including the locations of the studied wells (i.e., Thamoud-01 and Tarfayt-02)

The Cretaceous sedimentary rocks in the North Hadramaut High area are mainly marine and subordinate continental sediments composed of sandstone, carbonate, and shales (Fig. 2). Sandstone and carbonate rocks of Qishn and Harshiyat Formations (Fig. 2) can be considered as reservoir rocks in the North Hadramaut High area. The shale intervals in the Harshiyat Formation are considered as good potential source rocks in the onshore and offshore of basins of the Yemen [1, 2]. The Harshiyat shales are deposited in a marine environment under low oxygen conditions and contain mixed organic matter with types I, II, and III kerogens, consistent with both oil and gas source rocks [1, 2]. Hence, exploration activities for conventional petroleum resources have been intensively focused on organic-rich shale intervals within the Late Cretaceous Harshiyat Formation.

Fig. 2
figure 2

Generalized stratigraphic column of the Cretaceous–lowermost Paleogene succession in the North Hadramaut High area of eastern Yemen. Oil stain samples for this study were taken from clastic and carbonate rocks in the Harshiyat Formation

In this study, five oil stain samples, extracted from the sandstone and carbonate rocks of the Late Cretaceous Harshiyat Formation obtained in two well locations (Thamoud-01 and Tarfayt-02) in the North Hadramaut High area (Fig. 1b), were geochemically analyzed. We used the oil composition of the analyzed oil stain samples to document the characteristics of the potential source rock, such as the origin of the source organic matter input, paleodepositional conditions, lithology, and thermal maturity. This study subsequently incorporated biomarker and δ13C results, which are best used to provide dependable geochemical interpretations and understand the genetic relationships between the oil stains under study and the Harshiyat source rock.

2 Stratigraphic Setting

The stratigraphic column of the North Hadramaut High area includes a Precambrian basement rock and a sedimentary succession ranging in age from Paleozoic to Tertiary (Fig. 2). The lower Palaeozoic sedimentary sequence is represented by the Wajid and Akbra formations that lie unconformably on Precambrian basement rock (Fig. 2). The Wajid Formation is characterized by continental sandstones and interbedded mudstones from the Cambrian–Carboniferous age, while the Akbra Formation represents glacial deposits from the Permian ice age [3, 4].

The Mesozoic section is dominated by thick clastic and carbonate rocks, forming the Kuhlan Formation and Amran (Jurassic) and Mahara (Cretaceous) Groups (Fig. 2). The Early–Middle Jurassic Kuhlan Formation is represented by continental to marine sandstones which vertically grade upward into the carbonates of the Amran Group [3]. Here, Jurassic deposits of Amran Group were accumulated during Jurassic period and contain three sedimentary formations, from base to top, namely the Shuqra Formation, the Madbi Formation, and the Naifa Formation (Fig. 2). The Amran Group was followed by the mixed clastics and carbonates of the Mahara Group (Fig. 2). The Mahara Group was divided into four formations from base to top: Saar, Qishn, Harshiyat/Fartaq, and Mukalla formations (Fig. 2). In the North Hadramaut High, the Lower Cretaceous Qishn Formation lies on Amran Group with a conformable contact (Fig. 2). The clastic rocks of the Qishn Formation are considered as the main reservoir rocks in most oilfields of Masila Basin in eastern Yemen [5,6,7].

The Qishn Formation is overlain conformably by the Upper Cretaceous Harshiyat Formation (Fig. 2), which extends into the offshore Masila and Jiza-Qamar Basins [4]. The Harshiyat Formation consists of sandstones and interbedded shales deposited in shallow-marine settings [1, 3] with vertical variations in lateral carbonate facies for the Fartaq Formation (Fig. 2). The Harshiyat shales are considered as potential source rocks for both oil and gas hydrocarbons in the onshore and offshore basins of Yemen [1, 2]. The Upper Cretaceous Harshiyat Formation is followed by deltaic sediments of the Mukalla Formation (Fig. 2). The Mukalla Formation has been assigned to the Santonian–early Campanian as indicated from calcareous nannofossils [8] and composed of non-marine sandstones and intercalated with carbonaceous shales and coal beds (Fig. 2). The carbonaceous shale and coal sediments of the Mukalla Formation are considered as deltaic source rocks in the Jiza-Qamar Basin [9], consistent with Type II/III kerogens [10, 11].

The North Hadramaut High is filled with Tertiary (Paleocene-Eocene) succession (Fig. 2). During the Late Paleocene, the transgressive shales of the Umm Er Radhuma Formation were deposited because of sea-level rise. The Umm Er Radhuma shales contain a mixture of aquatic organic matter (planktonic and bacterial) and terrigenous organic matter and deposited under anoxic marine conditions as indicated by biological marks of the organic matter [12]. However, the Umm Er Radhuma Formation is also composed of thick carbonate rocks at the top (Fig. 2), and this carbonate lithology confirmed a marine environment setting [3, 12]. The Umm Er Radhuma Formation continued to the Early Eocene and was followed by mixed carbonate, clastics and anhydrite deposits of the Jiza, Rus, and Habshiyah Formations (Fig. 2).

3 Samples and Methods

Five core sandstone and carbonate samples containing oil stains were collected from the Thamoud-01 and Tarfayt-02 wells in the North Hadramaut High area (Table 1; Fig. 1b). Oil stains have been reported from the Harshiyat Formation at 723, 1218 (limestone rocks), and 1212 m (Sandstone rocks) in the Tarfayt-02 well (Table 1) and at 449 and 479 m (Sandstone rocks) in the Thamoud-01 well (Table 1).

Table 1 Specific biomarker ratios and carbon isotopes of saturated and aromatic hydrocarbon fractions in the oil staining samples from the clastic and carbonate rocks in Harshiyat Formation, North Hadramaut High of eastern Yemen, illustrating

The selected sandstone and carbonate cores containing oil stains were finely milled (60 μm) and subjected to organic geochemical analyses, including gas chromatography, gas chromatography–mass spectrometry (GC–MS), and mass spectrometry for δ13C. Most organic geochemical analyses were conducted at Simon Petroleum Technology Limited Laboratories, UK, and the results are made available through the Petroleum Exploration and Production Authority (PEPA) of Republic of Yemen.

The oil stains were extracted from the sandstone and carbonate samples using a standard extraction process by a mixture solvent of dichloromethane (DCM) and methanol (CH3OH) for 72 h. The extracted materials were separated into hydrocarbon (saturate and aromatic) and non-hydrocarbon (polar) fractions using a long liquid column chromatography on silica gel topped with alumina oxide.

The saturated fraction of the five oil stain samples was analyzed using the flame ionization detector (FID) of a gas chromatograph with an AMS-92 column. The furnace temperature of the GC oven was programmed between 70 °C and 270 °C at a rate of 3 °C/min and held at 290 °C for 20 min. Of these samples, three saturated and two aromatic fractions were subjected to GC–MS analysis. The GC–MS experiment was performed on a Finnegan 4000 mass spectrometer using a gas chromatograph attached directly to the ion source with the temperature programmed from 60 °C to 300 °C at a rate of 3 °C/min and then held for 20 min at 300 °C. GC–MS analysis was used to deduct the biological marks of organisms in the saturated and aromatic hydrocarbon fractions using specific ions such as m/z 191, m/z 217, m/z 178, m/z 184, m/z 192, and m/z 198 mass fragmentograms. The peak assignments of the specific ions (Table 2) were identified by comparison of the retention times and mass spectra of the monitored ions with previous works [13,14,15,16].

Table 2 Peak assignments for hydrocarbons in the saturated fractions in the m/z 191 (I) and 217 (II) mass fragmentograms

Additionally, saturated and aromatic fractions of two extracted oil stain samples were analyzed for bulk δ13C compositions. δ13C values were measured using a VG 602 stable isotope mass spectrometry instrument following the standard procedure of the Simon Petroleum Technology Limited Laboratories, UK. The 13C/12C isotope ratio of CO2 from a sample was compared with the corresponding ratio of a reference (NBS22 standard) and calibrated against the PDB standard.

4 Results and Discussion

4.1 Biomarker Distributions

4.1.1 Lipid Biomarker Fingerprints

Lipid biomarkers (e.g., n-alkanes, acyclic isoprenoids, triterpanes, and steranes) in the saturated hydrocarbon fraction of the five oil stain samples obtained from the gas chromatogram and fragmentograms of m/z 191 and m/z 217 ions are presented in Figs. 3 and 4, and the summary of the compound ratios is listed in Table 1.

Fig. 3
figure 3

Gas chromatograms of the saturated hydrocarbon fraction for three oil stain samples analyzed, showing a complete suite of acyclic isoprenoids (e.g., pristane and phytane) and low-molecular-weight n-alkanes, indicating that there is no biodegradation in the analyzed samples

Fig. 4
figure 4

Mass fragmentograms of the m/z 191 and m/z 217 ions of the saturated hydrocarbon fraction for two analyzed oil stain samples. The biomarkers shown in this Figs. a and b are identified in Table 2

The n-alkane distributions are mainly between the n-C16 to the n-C36 of the studied oil stain samples, as shown in the gas chromatograms (Fig. 3). Among these distributions of n-alkane, even normal alkanes prevail over odd n-alkanes, with the highest abundance of n-C17 to n-C23 compounds (Fig. 3). These distributions result in carbon preference index (CPI) values between 1.07 and 1.41 (Table 1).

Acyclic isoprenoids such as pristane (Pr) and phytane (Ph) are also present in all analyzed samples (Fig. 3), derived from the chlorophyll phytol chain. Pr and Ph occurred in low concentrations in most analyzed oil stain samples (Fig. 3). The ratio of Pr/Ph is less than 1, between 0.60 and 0.99 (Table 1), representing the Ph is higher than Pr (Fig. 3). However, normal n-C17 and n-C18 alkanes are very often associated with these isoprenoids [15, 17]. The isoprenoid ratios relative to the n-alkane concentrations (Pr/n-C17 and Ph/n-C18) were subsequently calculated and found to be in the ranges of 0.50–0.73 and 0.48–0.79, respectively (Table 1). These findings represent that the concentrations of n-alkane (n-C17 and n-C18) were higher than those of Pr and Ph isoprenoids in all examined oil stain samples (Fig. 3).

The GC–MS mass fragmentograms of the m/z 191 and m/z 217 ions of three oil stain samples analyzed affirmed the presence of hopanoids and steroids in the saturated hydrocarbon fraction (Fig. 4).

The mass fragmentograms of the m/z 191 chromatograms show significant amounts of hopanes over tricyclic terpanes, predominated by C30-hopane, C29-norhorhopane, and C31–C33 homohopanes (Fig. 4a). Among these, C30-hopane was observed in high abundance compared with C29-norhorhopane, leading to low C29/C30 hopane values of less than 1 (0.64–0.77; Table 1) and hinting at a typical clay-rich source rock [18]. The C31–C35 homohopanes from the examined oil stain samples were observed in decreasing quantities from C31 to C35 (Fig. 4a). Meanwhile, the C31 22R-series homohopanes were less abundant than the C30-hopane (Fig. 4a), as evidenced by the low C31 22R homohopane/C30-hopane (C31R/C30H) ratios between 0.27 and 0.42 (see Table 1). The analyzed oil stains also display a low concentration of 17α (H)-trisnorneohopane (Tm) and 18α (H)-trisnorneohopane (Ts), with Tm concentration slightly higher than that of Ts (Fig. 4a), leading to relatively low values of Ts/Tm (less than 1.0; Table 1). The biomarker maturity ratios of C32 22S/(22R + 22S) and moretane to hopane (C30M/C30H) of the analyzed oil stain samples were subsequently calculated (Table 1). Moreover, the low amounts of tricyclic terpanes (C20–C26) were also recorded in the m/z 191 mass fragmentogram of the analyzed samples (Fig. 4a), and the ratios of tetracyclic and tricyclic terpanes such as C23T/C24T, C26T/C25T, and TeC24/C26T were subsequently calculated (Table 1).

Biomarker distributions of regular sterane and diasteranes in three oil stain samples analyzed were also observed in the saturated hydrocarbon fraction and identified from m/z 217 chromatograms in Fig. 4b. The mass fragmentograms of m/z 217 chromatograms are characterized by the high abundance of steranes compared with diasterane biomarkers (Fig. 4b). In addition, the relative percentages of the regular C27, C28, and C29 steranes and source ratios of the regular C27/C29 steranes and C29 maturity ratios of 20S/(20S + 20R) and αββ (αββ + ααα) are calculated from the sterane distributions (Fig. 4b) and are also reported in Table 1. The results indicate higher quantities of the C27 and C29 regular steranes than the C28 regular steranes (Fig. 4b), with a high proportion of C27 (34.4%–46.1%) and C29 (30.4%–42.8%) compared with C28 (22.2%–23.5%) as shown in Table 1.

4.1.2 Aromatic Biomarkers

Aromatic biomarkers are more abundant in terrestrial plants in general and are useful indicators of thermal maturity [19,20,21], depositional environment and lithologies [22], and source facies [23, 24]. In this study, polycyclic aromatic biomarkers such as dibenzothiophene (DBT), phenanthrene (P), alkylnaphthalene, and methylphenanthrene and their ratios and parameters were identified in the aromatic hydrocarbon fraction of two investigated oil stain samples. The relative aromatic distributions of DBF, P, and the homologs of methyldibenzothiophene and methylphenanthrene were obtained from the mass fragmentograms of m/z 184, m/z 178, m/z 192, and m/z 198, respectively.

Most ratios of methylphenanthrene (MPI-1 and MPI-2), methyldibenzothiophene (MDR), and dibenzothiophene/phenanthrene (DBT/P) were calculated on the basis of isomer distribution of aromatic compounds (see Table 1). The calculated reflectance (VRc) was subsequently determined on the basis of the methylphenanthrene index (MPI-1) [25].

4.2 Bulk δ 13 C Compositions

Bulk δ13C of the saturated and aromatic portions of the oil composition is frequently used to categorize crude oils and determine the origins of organic matter input in possible source rocks [26,27,28,29]. The δ13C values of these hydrocarbon portions are also used to distinguish between the proportions of terrigenous and marine organic matter inputs [28,29,30]. According to Sofer [28], terrigenous sourced oils have δ13C aromatic hydrocarbon values ranging from − 25‰ to − 32‰, whereas marine origin oils have δ13C saturated hydrocarbon values between − 25‰ and − 33‰.

In this study, bulk stable δ13C analyses were performed on the derivative saturated and aromatic hydrocarbon fractions of the two analyzed oil stains, as shown in Table 1. The saturated and aromatic hydrocarbon fractions have δ13C values in the range of − 26.5‰ to − 28.6‰ and − 26.3‰ to − 26.7‰, respectively. These values of saturated (δ13CSat.) and aromatic (δ13CAro.) isotopic compositions were plotted on a Sofer [28] diagram and showed that the analyzed oil stain samples originated from a single source rock and that this source rock received a mixed marine and terrigenous organic matter input (Fig. 5a). These δ13C values of the analyzed oil stain samples are similar to those of the organic matter-rich shale intervals of the Late Cretaceous Harshiyat source rock (Fig. 5a) as indicated by the recent work of Hakimi et al. [2].

Fig. 5
figure 5

a A Sofer plot of δ13Caro versus δ13Csat for the analyzed oil stains samples. The dotted line represents the best-fit separation for waxy and non-waxy oils and is described by the equation δ13Caro = 1.14 δ13Csat + 5.46, following Sofer (1984). b Carbon isotope source age assignment, suggesting a Cretaceous age affinity for the analyzed oil stain samples (modified after Andrusevich et al., 1998)

In addition, δ13C fraction data were used to give information on the deposition age of the probable source rocks. The geologic time of deposition was estimated on the basis of the δ13C values of the saturated fraction [31]. In this respect, the analyzed oil stain samples have saturated (δ13CSat.) isotopic values between − 26.5‰ and − 28.6‰ (Table 1) and this range of the saturated isotopic (δ13CSat.) suggests that the probable source rock of the analyzed oil stain samples was generally deposited during the Cretaceous time, as demonstrated from diagram of saturated (δ13CSat.) isotopic values adjacent to geological ages (Fig. 5b). This finding further suggests that there is a genetic link between the analyzed oil stain samples and the Late Cretaceous Harshiyat source rock.

4.3 Organic Matter Characteristics and the Sedimentary Conditions of the Source Rock

In this study, on the basis of saturated and aromatic biomarker results described in the previous paragraphs, the characteristics of the probable source rock such as organic matter input, lithology, and sedimentary environmental conditions and the degree of thermal maturation were established and assessed [2, 14, 15, 32, 33].

The results of the lipid biomarkers in the saturated hydrocarbon fraction of the analyzed oil stain samples imply that the probable source rock is a clay-rich facies deposited in a marine environment under suboxic conditions, containing mixed marine and terrestrial organic matters.

In most of the analyzed samples, normal alkanes show the highest abundance of short- to middle-chain n-alkanes and small amounts of long-chain n-alkane (Fig. 3), suggesting typical oils derived from marine organic matter mixed with terrigenous organic input [34,35,36]. These findings are also consistent with the isoprenoid ratios of Pr/Ph, Pr/n-C17, and Ph/n-C18 and from CPI values (Table 1). The relatively high concentrations of Ph (Fig. 3) with low Pr/Ph, Pr/n-C17, and Ph/n-C18 ratios (Table 1) indicate that the analyzed oil stain samples sourced from sediments contain mixed organic matter of marine and terrigenous origins and are deposited under reductive conditions (Fig. 6). The interference of the organic matter input is also in line with the CPI values [15]. Data show that the CPI values of 90% of the samples are less than 1.3, whereas the values for the remaining 10% are greater than 1.3 (Table 1), further indicating mixed marine and terrestrial plant organic matter (Fig. 7). The presence of plant organic matter in the source rock is also deducted from the relatively high proportions of Tm compared with Ts in the aliphatic hydrocarbon fraction of the analyzed oil stain sample (Fig. 4b), as the relative abundance of Tm is indicative of organic matter derived from land plants [37, 38].

Fig. 6
figure 6

Geochemical biomarker results of the analyzed oil stain samples and Harshiyat source rock from a recent work of Hakimi et al. (2020). a Pristane/n-C17 versus phytane/n-C18 (from Shanmugam, 1985) and b pristane/phytane versus pristane/n-C17, providing a novel, convenient, and powerful way to infer a good oil–source rock geochemical correlation

Fig. 7
figure 7

Geochemical biomarker ratios of pristane/phytane and CPIs in the analyzed oil stain samples and Harshiyat source rock from a recent work of Hakimi et al. (2020), showing mixed organic matter of marine and terrestrial organic matter and inferring a good oil–source rock geochemical correlation

In this study, tricyclic terpane (m/z 191) distributions were also diagnostic of organic matter and paleodepositional environments [15, 39,40,41,42,43,44]. The greater quantities of the C25 tricyclic terpane in contrast to that of C26 tricyclic in the analyzed oil stain samples (Fig. 4a) indicate a marine depositional environment of their probable source rock. The cross-plot of C26T/C25T tricyclic terpanes against the C31 22R homohopane/C30- hopane (C31R/C30H) ratio in Fig. 8a reinforces the idea of a marine source rock. In addition, the low Pr/Ph and DBT/Phen ratios less than 1 indicate that the analyzed oil stain samples were sourced from a marine clay-rich rock (Fig. 8b). The high C23T/C24T ratio is compatible with the relatively high C24Te/C26T ratio (Table 1), further suggesting a source rock containing mixed marine and terrestrial organic matter and deposited in suboxic environmental conditions (Fig. 9).

Fig. 8
figure 8

Cross-plots of the C26/C25 tricyclic terpane ratio versus C31R/C30 hopane and ratios of pristane/phytane versus DBT/Phen for the analyzed oil stain samples and Harshiyat source rock from a recent work of Hakimi et al. (2020), implying a clay-rich marine source rock

Fig. 9
figure 9

Geochemical biomarker results of the C23T/C24T and C24Te/C26T tricyclic ratios for the analyzed oil stain samples and Harshiyat source rock from a recent work of Hakimi et al. (2020), showing mixed marine and terrestrial organic matter deposited under suboxic conditions

The abundance of C27 and C29 regular steranes in the saturated hydrocarbon fraction of the analyzed oil stain samples (Fig. 4b; Table 1) further suggests a contribution of mixed marine planktonic/algae and terrestrial land plants to the source rock. This was observed from the plot of the concentrations of the C27, C28 to C29 regular steranes on a Huang and Meinschein [45] adapted ternary diagram (Fig. 10) and the C27/C28 regular sterane ratio of the oil stain samples under study (Table 1; Fig. 11). A mixed organic matter input in the source rock is further emphasized by the bulk δ13C data of the saturated and aromatic hydrocarbon fractions (Fig. 5a).

Fig. 10
figure 10

Ternary diagram of regular steranes (C27–C29) of the analyzed oil stain samples and Harshiyat source rock from a recent work of Hakimi et al. (2020) (modified after Huang & Meinschein, 1979), indicating a source rock containing mixed planktonic/bacterial organic matter and land plant input

Fig. 11
figure 11

Cross-plot for Pr/Ph versus the C27/C29 regular sterane ratio, which further infers a mixed organic matter input

4.4 Thermal Maturity of the Source Rock

In this study, the degree of thermal maturity of the oil stain samples and their probable source rock was assessed from representative aliphatic sterane and hopane biomarker ratios and parameters of the aromatic hydrocarbon fractions in the analyzed oil stain samples, as shown in Table 1.

The isomerization ratios of 22S/ (22S + 22R) C32 hopanes, 20S/ (20S + 20R) and ββ/ (ββ + αα) C29 steranes, followed by slower reactions such M30/C30, Ts/Tm and CPI ratios were commonly used to estimate the order of thermal maturity [15, 46,47,48,49,50,51]. The high values of 22S/(22S + 22R) C32 hopane, Ts/Tm, 20S/(20S + 20R) and ββ/(ββ + αα) C29 sterane ratios are indicative of a high thermal maturity level [47,48,49,50], whereas the relatively low CPIs and M30/C30 ratios indicate a mature source rock [46, 51]. However, because the biomarker maturity parameters of Ts/Tm and CPI are influenced by the organic matter input [15, 51], it is insufficient to satisfy the thermal maturation criteria using CPI and Ts/Tm. Therefore, they must be used together with other biomarker ratios of aromatic and saturated fractions.

In this study, the most reliable biomarker ratios applied in determining maturity are 22S/(22S + 22R) of the C32 homohopane, M30/C30 (Fig. 4a, and 20S/(20S + 20R) and ββ/(ββ + αα) of the regular C29 steranes in the aliphatic hydrocarbon fraction [15, 46,47,48,49]. The aforementioned biomarker ratios apply over early maturity to peak maturity range, and the values increase as the degree of maturity increases [47,48,49].

The C32 homohopane ratio has been found applicable over the peak maturity range and reaches a maximum at 0.70 [49]. The analyzed oil stain samples have high values of C32 hopane ratio between 0.57 and 0.62 (Table 1), thus suggesting that the analyzed oil stain samples were generated from a mature source rock, equivalent early to peak maturity (VR = 0.60–80%). The different thermal maturity levels of the analyzed oil stain samples are also demonstrated by the C29 20S/(20S + 20R) and ββ/(ββ + αα) sterane ratios (Table 1). The C29 20S/(20S + 20R) and ββ/(ββ + αα) sterane ratios for three oil stain samples analyzed have relatively high values in the range of 0.42–0.48 and 0.55–0.57, respectively (Table 1), indicating a mature source rock, equivalent to moderate and peak maturity of oil window (Fig. 12a). The C32 hopane ratio values are also combined with the high C29 sterane ββ/(ββ + αα) ratios (Table 1), further suggesting a mature source rock, consistent with VR of approximately 0.80% and more (Fig. 12b). In addition, the C30 moretane/C30- hopane (M30/C30) ratio was also used to evaluate the thermal maturation of the analyzed oil stains and their probable source rock [46]. The presence of the relatively high C30- hopane compared to C30 moretane (Fig. 4a), with M30/C30 ratio of less than 0.15 is indicative of a mature probable source rock as shown in the analyzed oil stain samples (Table 1), whereas the high M30/C30 ratio of more than 0.20 indicate an immature source rock [46].

Fig. 12
figure 12

Lipid biomarker maturity ratios of the C29 sterane 20S/(20S + 20R) and ββ/(ββ + αα) and C32 hopane 22S/(22S + 22R), showing that two analyzed oil stain samples were generated from a mature source rock, equivalent to VR values between 0.55% and 0.80%

However, the sterane and hopane biomarkers were determined by the point of isomerization equilibrium values and input of organic matter [33, 47,48,49]. These saturated biomarker data alone should be used with caution because they are insufficient to satisfy all the thermal maturation criteria and should be supported by with other biomarker aromatic components. In this regard, aromatic maturity indicators of 2 oil stain samples analyzed (Table 1) are used to estimate the degree of thermal maturity of the probable source rock. The aromatic maturity parameters are typically based on the thermal stability of compounds as the quantity of the more stable isomer increases with maturity, whereas that of the less stable isomer decreases [20]. For example, the methyldibenzothiophene ratio (MDR) relies on the shift from the less stable 1-MDBT to the more stable 4-MDBT, and therefore, the parameter increases with thermal maturity [21]. According to the methyldibenzothiophene distributions, the MDR in the analyzed samples is estimated in the range of 1.80–4.19 (Table 1), indicating a mature source rock. Other aromatic parameters calculated for the analyzed samples include MPI-1 and MPI-2 shown in Table 1. The oil stain samples have relatively high values of MPI-1 and MPI-2 in the range of 0.72–0.82 and 0.89–1.02, respectively, suggesting a mature source rock (Fig. 13). However, the VRc values obtained from MPI-1 range from 0.83% to 0.89% (Table 1), consistent with a peak oil to mature oil windows [25], and further suggest that the analyzed oil stains were generated from a mature source rock.

Fig. 13
figure 13

Aromatic biomarker maturity indicators of methylphenanthrene (MPI-1 and MPI-2), further suggesting a mature source rock for the analyzed oil stain samples

4.5 Geochemical Correlations of Oil–Oil and Oil–Source Rock

The geochemistry of oil biomarkers and isotope compositions were used to classify the analyzed oil stain samples and then consequently identify possible oil–oil and oil–source rock correlations. In this regard, the geochemical genetic link of the oil samples themselves and their relationship with their potential source rock were demonstrated mainly from the analytical results of the molecular biomarkers and δ13C compositions of saturated and aromatic hydrocarbon fractions as discussed in the previous subsections. Accordingly, there is a positive oil-oil correlation. The analyzed oil stain samples are geochemically similar and are generally grouped into one oil family as demonstrated by the clustering of their molecular and isotopic characteristics (Figs. 5, 6, 7, 8, 9, 10, 11, 12, 13). These molecular and isotopic characteristics reveal that analyzed oil stain samples sourced from a clay-rich rock deposited in marine-reducing environmental conditions and likely contained mixed organic matter of planktonic, bacterial and land plant inputs (Figs. 5, 6, 7, 8, 9, 10, 11). The results of the saturated and aromatic maturity biomarkers further suggest that the analyzed oil stain samples are thermally mature and their source rock was at different maturity stages equivalent to a moderate to peak-mature oil window (Figs. 12 and 13).

The genetic relationship in term of oil–source rock correlation was also established based on the lipid biomarkers and their environmental biomarker indicators with geochemical isotopic results of the analyzed oil stain samples. The Harshiyat shale source rock from exploration well locations in the closed offshore Jiza-Qamar Basin was discussed in the recent published work of Hakimi et al. [2]. The Harshiyat shales in the well locations of the offshore Jiza-Qamar contain mixed organic matter of marine and terrestrial organic matter and were deposited under suboxic marine environmental conditions [2]. These characteristics of the Harshiyat shales are consistent with those of analyzed oil stains in this study as shown by the clustering of their ratios and parameters (Figs. 5, 6, 7, 8, 9, 10, 11). Based on the clustering of their molecular and isotopic characteristics, a positive oil–source rock correlation is evidenced and suggests that the analyzed oil stain samples were generated from the Harshiyat shale source rock at sufficient maturity levels. Moreover, the δ13CSat data suggest that the analyzed oil stain samples were sourced from the Cretaceous source rocks (Fig. 5b) and this is also consistent with the Late Cretaceous deposition time of the Harshiyat Formation (Fig. 2). Therefore, the shale intervals within the Late Cretaceous Harshiyat Formation are very likely source rocks for the analyzed oil stain samples.

As Hakimi et al. [2] reported that the analyzed Harshiyat shale samples from studied wells in the offshore Jiza-Qamar Basin were buried in a relatively shallower depth of less than 2400 m and reached an early mature stage of oil-generation window, equivalent to VR = 0.51%. Therefore, the mature Harshiyat shale rocks, with relatively high maturity of the peak oil window in the known deeper locations, are the most probable potential source rocks, where the Harshiyat source rock reached a high burial temperature depth of more than 3000 m as indicated by the 2D seismic section (Fig. 14). This finding is also supported by the geological situation of the Mahara Group, whereby the Mahara formations, including Harshiyat Formation, stratigraphically underlie the other formations in the offshore Jiza-Qamar Basin; therefore, the oils from Harshiyat shale source rock were very likely generated at a sufficient thermal burial depth in the offshore Jiza-Qamar Basin (Fig. 14). Before migration along vertical migration pathways through fault up to the shallow stratigraphic rocks in the Hadramaut High area is indicated by the cross section along the Hadramaut High and Jiza-Qamar Basin (Fig. 15).

Fig. 14
figure 14

Seismic cross section along offshore Jiza-Qamar Basin, showing the presence of the Harshiyat Formation in the known deeper locations, where the Harshiyat source rock reached a high burial temperature depth of more than 3000 m

Fig. 15
figure 15

Cross section shows the sedimentary successions and fault systems along the Ra’s Sharwayn, offshore Jiza-Qamar Basin, North Hadramaut High area and Shahr (modified after Beydoun [52]

5 Conclusions

This study discusses a combination of sterane and terpane biomarkers and δ13C approaches to studying the saturated and aromatic hydrocarbon fractions in oil stains from five Harshiyat sandstone and carbonate rocks in the North Hadramaut High of eastern Yemen. The results are used to characterize the probable source rock of the studied oil stains samples, including the origin of the organic matter input, redox depositional conditions, lithology, and degree of thermal maturity. The following conclusions are derived from our organic geochemical results:

  1. 1.

    The biomarkers of the oil stain samples under study suggest a contribution of mixed marine planktonic/algae and terrestrial land plants from the clay-rich source rock deposited in a marine environment under reducing conditions.

  2. 2.

    The δ13C values of the saturated and aromatic hydrocarbon fractions further suggest a Cretaceous source rock that likely contained mixed marine and terrestrial organic matter.

  3. 3.

    Both sterane and terpane biomarkers and aromatic hydrocarbons suggest that the analyzed oil stain samples were generated from a mature source rock, corresponding to moderate and peak-mature oil windows.

  4. 4.

    The geochemical results of the biomarkers and δ13C of the saturated and aromatic hydrocarbon fractions of the analyzed oil stain samples are similar and consistent with those of the shales within the Late Cretaceous Harshiyat Formation in the offshore Jiza-Qamar Basin. This conclusion further suggests a genetic link between the analyzed oil stain samples and the marine clay-rich source rock in the Late Cretaceous Harshiyat Formation.