Introduction

The majority of petroleum hydrocarbons, which are regarded as priority pollutants and biohazards in the ecosystem, consist of n-alkanes, which are harmful to both human health and the environment. n-alkanes are saturated hydrocarbons, meaning that only single bonds between hydrogen and carbon are present in them. The formula for n-alkanes, CnH2n+2, demonstrates the strongest possible bonds between the atoms of carbon and hydrogen. Although petroleum is the main source of n-alkanes globally, they are naturally occurring substances. Alkanes can be solids, liquids, or gases at room temperature. Methane, ethane, propane, and butane are gases, the unbranched alkanes pentane through hexadecane are liquids, and the homologs larger than hexadecane are solids (Honda & Suzuki, 2020; Patel et al., 2020).

Petroleum pollution has long been a serious problem due to their carcinogenic and mutagenic impacts. Unintentional oil spills and rising global demand for gasoline are two of significant causes of petroleum discharge into the environment (Retnam et al., 2013; Singh et al., 2012). For example, n-alkanes are one form of chemical that makes up petroleum hydrocarbons.

The Strait of Malacca is a narrow waterway that runs along the west coast of Peninsular Malaysia. Due to the large number of oil tankers that pass through it, both routine tanker operations activities such ballast water discharges and unintended oil spills are more frequent (Masood et al., 2018). The Malaysian Marine Department stated in 2003 that 127 oil spill incidents have occurred in the Malacca Strait since 1976 as a result of high oil tanker traffic (Rusli, 2011).

Since the 1980s, tens of thousands of hydrocarbon pollution investigations have been conducted to determine the petroleum hydrocarbon contamination in the air, sediment, biota, and water (Alkhadher et al., 2023a, b, c, d, e, f, g; Sharifi et al., 2022; Wang & Stout, 2007). These studies found elevated levels of hydrocarbons such as polycyclic aromatic hydrocarbons (PAHs), linear alkylbenzenes (LABs) and n-alkanes in a variety of developed, industrialized, and urbanized regions, as well as busy, heavily visited maritime ports and harbour points. The majority of these research have been carried out mostly in northern hemisphere countries with seasonal fluctuations, such as the United States of America, Japan, England, Norway, and Canada (Wang & Stout, 2007).

An assessment of studies on hydrocarbon pollution reveals a sizable gap in investigations of hydrocarbon pollutants in tropical nations like Malaysia, and this investigation is a part of a long-term hydrocarbon monitoring effort in Malaysia (Zakaria et al., 2002; Boonyatumanond et al., 2006; Saha et al., 2009; Mirsadeghi, 2010; Vaezzadeh et al., 2014, 2015a, 2015b, 2017a, 2017b; Masood et al., 2016; Keshavarzifard et al., 2020; Sharifi et al., 2022; Alkhadher et al., 2023e, f, g, Mohd Sallan et al., 2023; Roslee et al., 2023).

It is well recognized that n-alkanes in estuarine and marine environments come from both biological and human-made sources (Keshavarzifard et al., 2020; Yu et al., 2019). Aliphatic hydrocarbons (AHs) are found in sediment from a variety of sources, such as anthropogenic sources, higher terrestrial plants, and petroleum wastes, as well as autochthonous sources, which are primarily derived from higher aquatic species like algae, microorganisms, and aquatic macrophytes (Keshavarzifard et al., 2020; Liu et al., 2012; Sharifi et al., 2022). Shipping oil, tanker operations such as cleaning and deballasting, crankcase oil use, and fishing are all significant man-made sources of petrogenic n-alkanes. Selective oxidation, biodegradation, chemical reactions, evaporation, and photooxidation can all change the composition of hydrocarbons (Keshavarzifard et al., 2020; Masood et al., 2021).

The information available on potential (mixed) halogenated n-alkanes (PXAs) emphasizes the critical need to identify and monitor these potential pollutants, preferably using mass spectrometry. It is calculated that 60,000 pixels will have enough resolving ability to distinguish 74% of the priority elemental compositions from the most likely interferents, which are chlorinated alkanes (full width at half-maximum), implying that the majority of high-resolution mass spectrometers will be able to accurately identify the PXAs (Li et al., 2021). Petroleum biomarkers, which include isoprenoid n-alkanes, PAHs, and hopanes, have a direct link to biological precursors (Mici c et al., 2013; Peters et al., 2005).

In order to evaluate hydrocarbon inputs from anthropogenic sources, the Selangor River, the largest river on Malaysia's west coast, conducted a rigorous research of compound-specific hydrocarbon pollution. The seasonal trend of aliphatic hydrocarbons has never been thoroughly studied in Malaysia previously. The main objective is to determine the source of aliphatic hydrocarbons present in the sediments of the Selangor River and its tributaries as well as measure their distribution and concentration.

Methodology

Sampling

Study area

The sediments were collected from the Selangor River, which is on Peninsular Malaysia's west coast and faces the Straits of Malacca. The number of ships and tankers that use the narrow Straits of Malacca, which connect Sumatra, Indonesia, with Peninsular Malaysia, makes it one of the busiest waterways worldwide (Abdullah et al., 1999; Thia-Eng et al., 2000). The surrounding districts of Selangor state are also experiencing development, and the state's natural resources are becoming increasingly stressed. In the research area, there are still pockets of upland forest, marsh forest, and some mangroves. Rubber plantations in the area are rapidly becoming more viable urban areas or oil palm farms as a result of the decline in worldwide market prices brought on by the advent of synthetic rubber. The predominant land use in the study region is agriculture, especially the oil palm, paddy, and coconut plantations (Fig. 1). In addition, it provides challenging recreational pursuits for tourists who are adventurous, ecotourists, and travellers such as river tubing, whitewater rafting, and kayaking.

Fig. 1
figure 1

Locations of sampling stations in Selangor River, Malaysia

Sampling locations

In this study, thirteen stations with the codes S1–S13 were chosen for sediment sample collection in 2018 (Fig. 1). The Global Positioning System (GPS) was used to pinpoint the exact location of the sampling points and stations, and sample sites were selected based on the presence of anthropogenic activities and potential hydrocarbon sources. Since the sample is expected to have different levels of pollutants throughout time, sediment samples were collected during the rainy inter-monsoonal period, the dry inter-monsoonal period, the southwest monsoon (SWM) period, and the northeast monsoon (NEM) period. Furthermore, duplicate analysis was carried out for reproducibility and a coefficient efficient for AH was below 10%. Table 1 summarizes the sampling locations and the application of the Ekman dredge sampler to collect surface sediment samples. The top 5-cm layer of the surface sediment was carefully sub-sampled at each sampling location using a metallic spatula. After being immediately transferred to polyethylene bags with zip-ties, the samples were then brought to the lab in a cooler box where they were kept in a freezer at a temperature of –20 °C for additional experiments.

Table 1 Sampling locations and description

Chemical analysis

The samples were extracted, purified, and fractionated using a previously reported method (Zakaria et al., 2002; Masood et al., 2014; Magam et al., 2016; Alkhadher et al., 2015, 2016, 2020a, 2020b, 2020c). Dried sediment was pulverized in a porcelain mortar with a 2-mm mesh sieve. 250 mL of dichloromethane (DCM) was used in a Soxhlet system to extract the hydrocarbons, and the bottom round flasks were used to collect the extracts, which were then treated with copper granules to eliminate the sulphur, which is known to interfere with GC analysis. The extract volume was reduced by rotary evaporation before being loaded onto a glass chromatographic column containing silica gel activated with 5% H2O. The elution solvent for the hydrocarbon fraction was 20 mL Hexane/DCM (3:1, v/v). The extracts were rotary evaporated to 1–2 mL before being separated gradually on a silica gel column that had been fully activated with 4 mL of high-grade hexane. The fraction of aliphatic hydrocarbons from the second step column chromatography was rotary evaporated to approximately 1 ml and then transferred to a 1.5 mL amber vial. The 1 mL sample was carefully reduced to near dryness using a stream of nitrogen gas and then redissolved in 100 μL of isooctane.

In a GC (7890A) coupled to a C5975 quadrupole mass-selective detector (SIM mode), a 30-m fused-silica capillary column (HP-5MS) is employed for n-alkane analysis. The carrier gas used was pure helium. The electron ionization (EI) method was used for MS at a 70-eV energy level, 200 °C for the source temperature, and ~ 2000 eV for the electron multiplier voltage. Samples were injected splitlessly into the GC injection port at a temperature of 300 °C, then a 1-min purge after that. The temperature of the column increased from 70 °C for 1-min to 150 °C at a rate of 30 °C per minute, then 5 °C per minute to 310 °C, where it remained for 10 -min. The n-alkanes were monitored at the following m/z values: 142 (C10), 170 (C12), 198 (C14), 254 (C18), 282 (C20), 310 (C22), 338 (C24), 366 (C26), 394 (C28), 422 (C30), 450 (C32), 478 (C34), 506 (C36), 534 (C38), and 562 (C62) (C40). A 191 m/z ion monitoring technique was used.

Every set of samples was injected with one of five distinct external standards for n-alkanes, even those with a low carbon number (C10–C36). By comparing the target analytes' mass to charge ratio (m/z) and retention times to standards, the target n-alkane was recognized. The contents were determined by comparing the integrated sample peak area to the standard peak area.

Quality control and quality assurance

QC and QA procedures were carried out during the experimental analysis of this study to ensure the certainty and accuracy of the analytic method. The surrogate recovery for sediment samples was within the acceptable range (80–120%). Even number n-alkanes made up the external standards for n-alkanes with C10–40. Daily preparation of fresh standards was carried out for the analysis. Each batch of set samples was subjected to a blank procedure, and the analysis of the reagent blanks showed that the analysis apparatus and glassware were free of contamination. Additionally, after each batch of samples was analysed using an instrument, a purge was processed.

Total organic carbon (TOC)

The method presented via Bakhtiari et al. (2010) and Raza et al. (2013) is the analytical process that was used to evaluate the TOC content. The sediment samples were mashed and homogenized using a mortar and pestle after being dried in an oven for an entire night at 60 °C. Each sediment sample was weighed to determine its mass, and then, 1–2 mL of a 1M hydrochloric acid (HCl) solution was put to completely dissolve any inorganic carbon. Samples were then dried at 100 °C for 10 h to remove the HCl. Each sample was reweighed to a weight of about 1 g, and the TOC was then measured using an LECO CR-412 carbon analyser at 1350 °C for 60s. TOC is expressed for this study as mg g-1.

Grain size analysis

Granulometry is the measurement of the sediment grain size of a granular substance as a proportion of the weight of various sized particles. The four procedures that were used to the sediment samples that were obtained for this investigation were washing, keeping in a low-temperature oven, disaggregating, and stiliting. Moreover, almost 50 g of each sample was sieved for 20 min through a standard set of sieves arranged in a one phi class interval. According to Moopam (1999), the categories of sediment size that were used in this study are as follows: granule (gravel) is > 2000 m, very coarse sand is > 1000 m, coarse sand is > 500 m, medium sand is > 250 m, fine sand is > 125 m, very fine sand is > 62 m, and clay is 63 m.

Statistical analysis

Data were statistically analysed using IBM-SPSS software, version 21.0. To assess the variations in concentrations across the sampling locations, analysis of variance (ANOVA) was used to detect the meaningful differences in the levels of n-alkanes between the sampling stations and between the four seasons. The difference in n-alkane concentration between the sampling sites under comparison was deemed statistically significant when p < 0.05. Using Spearman's rank correlation analysis, the correlations between the hydrocarbons (HCs) and TOC in the sediments were examined (Zhang et al., 2006). To identify likely n-alkane linear combination that would be useful to distinguish their sources, the PCA was used. With the factor loadings obtained from PCA, clustering analysis was done to eliminate any correlations between the variables. Using the eigenvalue > 1 criteria, the principal components were retrieved following varimax rotation (Dickhut et al., 2000; Liu et al., 2009; Sharifi et al., 2022; Simpson et al., 1998).

Results and discussion

n-alkane concentration in the Selangor River's sediment

The concentration and distribution of n-alkanes (nC10nC36) in sediments from the Selangor River were investigated. Table 2 indicated that the total concentrations of n-alkanes ranged from 967 to 3711µg g−1dw (Fig. 2). The compositional characteristics of n-alkanes were found to be largely constant across all locations. As shown in Fig. 3, the sediments from S9 at downstream of the River had the highest concentration of n-alkanes with 3711 µg g−1dw), whereas the sediments from S3 at upstream of the river had the lowest concentration with 967 µg g−1dw. The variation in n-alkane concentrations at riverine and estuarine stations may result from both anthropogenic sources, such as shipping activities, industrial, domestic waste discharges, and input from the environment, such as microbial activities, submerged or floating macrophytes, and emerging terrestrial plant (Li, et al., 2021; Oyo-Ita et al., 2010; Sharifi et al., 2022).

Table 2 Average concentrations of n-alkanes (µg g−1dw) during the four studied periods (Dry, ---) in the surface sediments of the Selangor River
Fig. 2
figure 2

Total concentrations of the n-alkanes in the sediment samples. Error bars indicate the 95% confidence interval

Fig. 3
figure 3

Mean concentrations of total n-alkanes at each sediment sampling station. Error bars indicate the 95% confidence interval

The station S2, which is near the waste disposal plant for Selangor City, has the second-highest overall n-alkane level (3498 µg g-1dw). This shows that the direct discharge of domestic wastewater, which occurs close to this station, is a significant source of n-alkanes in the riverine sediment. The area around station S2 is protected from the sea's waves, particularly those from the Strait of Malacca. This probably reduces water flow and increases sedimentation, causing the deposition of the n-alkanes related to the suspended particles that are transported down the river, according to Zheng et al. (2002).

Overall, it was found that the upstream areas of the Selangor River contained higher levels of n-alkanes than the downstream parts by comparing the n-alkane content in sediment samples. This result may be explained by the fact that there are more habitations and industrial facilities in the Selangor River's upstream than in its downstream. The concentration of total n-alkanes, on the other hand, decreased from the dry to the wet season, most likely as a result of the n-alkanes being diluted by the increased rainfall and river flow during the wet seasons. The outcomes of the one-way ANOVA indicated that the differences between the n-alkanes specifically C19, C20, C23, and C24 among the 13 sampling stations were significant (p < 0.05; Table). This can be ascribed to diverse human activity at each station, which may result in distinct emission sources. Meanwhile, the concentrations of most n-alkanes examined in surface sediment differed significantly between the four seasons (p < 0.05), indicating that temporal fluctuation had a large influence on n-alkane concentrations. Additionally, the results showed that C34 predominated, which is consistent with the mixed n-alkane origins of the sediments from the Selangor River. They also showed that C17–C34 predominated during the dry inter-monsoonal period, C34–C36 predominated during the SWM period, and C36 and C26–C34 predominated during the NEM period. These results of the study are referring to show different carbon chain length predominance in different monsoon seasons. There are some possible explanations for this trend, which include the differences in environmental conditions such as temperature and salinity or the differences in microbial degradation rates while hydrocarbons can be degraded by microbes, and the rate of degradation can vary depending on the type of hydrocarbon and the environmental conditions (Gayantha et al., 2021).

The LMW/HMW n-alkanes (L/H) values for stations S2, S3, S5, S7, S9, and S10 were less than 1, as shown in Table 2. L/H-alkane ratios in other sediment samples that ranged from 1.07 to 1.46 were larger than one. The LMW n-alkanes may have undergone more microbial degradation than the HMW n-alkanes in samples with low L/H n-alkanes, similar to a study conducted in the Peel Plateau in northwestern Canada (Bröder et al., 2021). Furthermore, LMW hydrocarbons may also be subject to greater degradation by living organisms due to their increased bioavailability (Gaspare et al., 2009; Keshavarzifard et al., 2017). Consequently, LMW biodegradation could be the cause of reduced LMW n-alkane concentrations at some of the stations under study (Keshavarzifard et al., 2020). The prevalence of HMW n-alkane with odd numbers of carbon atoms in certain of the sediment sample locations can be attributed to input of n-alkane from terrestrial high plants. Odd carbon atom alkanes often come from biogenic sources, while even carbon atom alkanes typically come from anthropogenic sources (Kang et al., 2018; Sharifi et al., 2022; Volkman et al., 1992).

High L/H-n-alkane ratios ranging from 0.01 to 13.32 were reported in a recent study of sediment in Malaysia's Prai Strait. The L/H n-alkane ratios discovered in the current study are lower than those mentioned in the previous study (Sakari et al., 2008a). Zakaria et al. (2002) found significant petrogenic hydrocarbon imports onto Peninsular Malaysia's west coast. The major hydrocarbon, the n-alkane with the highest concentration of carbon atoms, can also be used as a source indicator. The most common n-alkanes in our study have a number of carbon atoms centred around 34 (C34) (Table 2). Furthermore, the prevalence of n-alkanes with LMW suggests new oil inputs. In the LMW n-alkanes detected in the sediments, even carbon atoms predominated over odd carbon atoms, indicating that anthropogenic sources of n-alkanes predominated over biogenic sources (Sakari et al., 2008b). Odd over even numbers of carbon atoms were more abundant in the upper areas of the Selangor River, indicating higher inputs of n-alkanes from terrestrial plant sources. However, as one moves down the river, the pattern shifts to a majority of even-numbered carbon n-alkanes, emphasizing inputs from petrogenic sources.

Unresolved complex mixture (UCM)

The unresolved complex mixture (UCM) is a complex mixture of hydrocarbons with branching and cyclic structures, as well as certain overlapping peaks which generate a "hump" and cannot be detected by a capillary GC column (Bouloubassi & Saliot, 1993; Bouloubassi et al., 2012). UCMs are a typical feature in environmental samples polluted with weathered petroleum residues (Gough & Rowland, 1990). Although UCMs are expected to be rather resistant to microbial degradation, they can impede the identification of alkanes (Gough et al., 1992; Hu et al., 2009). Although UCMs are mostly found in the HMW range, they can also be found in the LMW range and evaporate. A modest UCM in a typical chromatogram of n-alkanes implies short-term weathering. In the current study, chromatograms of n-alkanes showed UCMs in some of the samples. Figure 4 shows a typical chromatogram of n-alkanes with UCM hump (Retnam et al., 2013).

Fig. 4
figure 4

A typical chromatogram of n-alkanes

Carbon preference index (CPI)

CPI, as well as n-C17/Pristane and n-C18/Phytane ratios, was calculated to acquire a better understanding of the principal sources of hydrocarbons in sediment samples. According to Simoneit et al. (1991), the CPI is a measurement of the biologically produced n-alkanes that can be applied to determine the straight chain's degree geolipid diagenesis. In addition to being employed in organic geochemistry to denote the presence of geolipids with straight chains, it is a quantitative evaluation of the degree to which geological lipids retain the original biological chain length specificity (Meyers & Ishiwatari, 1995). Additionally, it compares the n-alkane contribution from anthropogenic (CPI 1) sources, such as PHCs or aquatic species, to those from natural (biogenic/terrestrial., CPI > 1) sources. The ratio of Codd n-alkanes/Ceven n-alkanes for the petrogenic is around 1.0 since the petroleum mixes also contain equivalent amounts of nearby members of hydrocarbons (Tran et al., 1997). According to values of this ratio (Table 2), petrogenic origin is the primary source of hydrocarbons in the research region.

According to formula proposed by Bakhtiari et al. (2011), Zhu et al. (2005), the CPI was estimated in the current study as follows:

$${\text{CPI is calculated as }}\left( {{\text{C}}_{{23}} + {\text{ C}}_{{25}} + {\text{ C}}_{{27}} + {\text{ C}}_{{29}} + {\text{ C}}_{{31}} } \right)/\left( {{\text{C}}_{{24}} + {\text{ C}}_{{26}} + {\text{ C}}_{{28}} + {\text{ C}}_{{3}0} } \right)$$
(1)

The calculated CPI values are depicted in Fig. 5. The research area's average CPI values during the four study periods ranged from 0.97 to 1.04. Since the dominance of odd or even carbon atoms is missing, petrogenic sources are probable.

Fig. 5
figure 5

Spatial variations of the n-C17/pristane and n-C18/phytane ratios. Error bars indicate the 95% confidence interval

Along with the ratio Pr/Ph, other ratios, specifically C17/pristane and C18/phytane of the n-alkanes/acyclic isoprenoid n-alkanes, can be utilized to identify the sources of hydrocarbons in environmental ecosystem (Pavlova & Ivanova, 2003). Ratios of C17/pristane and C18/phytane levels below 1.0 may indicate weathered petroleum, whilst values above 1.0 are indicative of unpolluted environments. The ratios between C17 and C18 in pristane ranged from 1.15 to 4.34 and from 1.65 to 5.99, respectively (Fig. 5). Being greater than 1.0 at all sampling stations, values of these two ratios do not indicate presence of weathered petroleum in the sediments.

Since phytane commonly originates from petroleum, values of the pristane/phytane ratio less than 1.0 can indicate pollution with petroleum while values higher than 1.0 indicate biogenic inputs or bituminous coal (Readman et al., 2002; Volkman et al., 1992). For pristane and phytane, respectively, the oxidation and reduction of the chlorophyll's phytol side chains can be considered biogenic sources. These isoprenoids can also be found in the lipids of bacteria and zooplankton (Le Dréau et al., 1997). In the Selangor River, the pristane/phytane ratio ranged from 0.32 to 2.10 (Fig. 6). Nevertheless, majority of the sampling stations had values that were less than or close to 1.0, demonstrating the significance of petroleum imports in the studied area. In the bulk of the sampling stations, a falling pattern was seen in the pristane/phytane ratio values from the higher to the lower regions of the river. This pattern demonstrates once more that the river's upper portions have significant biogenic n-alkane sources, but the lower portions of the river have more significant petroleum hydrocarbon inputs.

Fig. 6
figure 6

Spatial patterns in the values of the pristane/phytane (Pr/Ph) ratio. Error bars indicate the 95% confidence interval

Average chain length (ACL)

Average chain length (ACL) is another metric used to determine the environmental sources of n-alkanes and is based on the concentration of the odd carbon numbers of the n-alkanes in vascular higher plants (Boot et al., 2006; Poynter & Eglinton, 1990). With respect to plant species, latitude, and climatic conditions, the ACL is constant for places with a single source of n-alkanes, and however, petrogenic inputs cause the ACL to decrease in value. (Jeng, 2006). The formula for calculation of ACL is (Jeng, 2006):

$${\text{ACL }} = \left( {{25 } + {\text{ C}}_{{25}} + { 27 } + {\text{ C}}_{{27}} + { 29 } + {\text{ C}}_{{29}} + { 31 } + {\text{ C}}_{{31}} + { 33}} \right) \, \left( {{\text{C}}_{{25}} + {\text{ C}}_{{27}} + {\text{ C}}_{{29}} + {\text{ C}}_{{31}} + {\text{ C}}_{{33}} } \right)$$
(2)

The ACL results for the Selangor River sediments ranged from 28.29 to 30.40. This ACL value range may indicate the presence of petroleum inputs into the research area. In sediments from the Prai River, a wider range of ACL values (26.75–32.94) was recorded (Sakari et al., 2008b). This underlines higher petrogenic inputs in Prai River than the Selangor River, possibly due to occasional spillage of petroleum or due to other petrogenic inputs such as urban run-offs. Within this context, the very narrow ranges of ACL values for river sediments in the northeastern (29.6–29.8) and northwestern Taiwan (29.2–30.5 (Jeng (2006)) reveal lower petrogenic inputs in these areas.

Since n-alkanes from higher plants are associated to both the CPI and ACL, a correlation between them is anticipated (Jeng, 2006; Li et al., 2020). There was no correlation between the CPI and ACL values for the sediments in the current study (P > 0.05). This may be attributed to the fact that the n-alkanes used in calculations of the CPI and ACL did not originate from the same source of higher plants. According to Jeng (2006), when two (or more) n-alkanes have different ACL or CPI values, the ratio changes nonlinearly and disproportionally.

Terrigenous aquatic ratio (TAR)

Terrigenous aquatic ratio (TAR) compares sum of the concentrations of odd-numbered (C27 + C29 + C31) n-alkanes, which are a characteristic of the debris of high terrestrial plant (Rieley et al., 1991), to the sum of the concentrations of odd-numbered (C15 + C17 + C19) n-alkanes, which indicate the sources from aquatic environment (Jaffe' et al., 2001). Therefore, the ratio of short-chain to long-chain n-alkane concentrations is measured by the TAR, which can be used to assess how significant terrestrial inputs are in comparison with aquatic ones (El Nemr et al., 2016). This is how the TAR is determined:

$${\text{TAR }} = \, \left( {{\text{C}}_{{27}} + {\text{C}}_{{29}} + {\text{C}}_{{31}} } \right) \, / \, \left( {{\text{C}}_{{15}} + {\text{C}}_{{17}} + {\text{C}}_{{19}} } \right)$$
(3)

When the TAR value is less than 1, it means that from aquatic sources, n-alkanes are more prevalent (Gomes & Azevedo, 2003). The findings of this study demonstrate that the TAR values for the sediment samples were > 1, which highlights that high terrestrial plant debris is a main source of the n-alkanes (Fig. 7).

Fig. 7
figure 7

Spatial pattern of the terrigenous aquatic ratio (TAR)

Comparison of the levels of PAHs in sediments of Selangor River with those in other international literature areas

A number of previous researchers have reported on n-alkanes levels with the literature in the rivers around the world (Table 3). The comparison shows differences in the concentrations of the n-alkanes between Selangor River and the other locations worldwide. The average n-alkane concentration among stations calculated 2339 (µg g−1). The finding from this study is significantly higher than previously reported by other scientists around the world (Iwegbue et al., 2021; Paletto et al., 2008) but consistent with findings from the east coast of Malaysia (Sakari et al., 2008a, 2008b), while it was less concentrated when compared with other regions (Ahad et al., 2011; Silva et al., 2012; Monza et al., 2013; Jeng, 2006; Sakari et al., 2008b).

Table 3 The concentrations (μg g−1) and indices of n-alkanes in the literature rivers around the world

Petroleum hydrocarbons, mainly consisting of n-alkanes and polycyclic aromatic hydrocarbons (PAHs), are considered as priority pollutants and biohazards in the environment, eventually affecting the ecosystem and human health (Liu et al., 2018). Furthermore, n-alkanes may eventually affect human health (Lu et al., 2012) and other social activities, such as agriculture (Zhang et al., 2017) and tourism (Zhu et al., 2015). Therefore, petroleum hydrocarbons are considered as priority pollutants (Ekperusi & Aigbodion, 2015), and its remediation is a matter of urgency. Generally, non-polluted area demonstrates hydrocarbons concentration (e.g. n-alkanes) less than 10 ng/mg (UNEP, 1992). The high concentrations of n-alkanes with their ratios showed that Selangor River, west Peninsular Malaysia, are heavily affected by anthropogenic pollutants. The results also highlighted that hydrocarbon pollutants would continue to be a problem, as long as there is a human population to produce it. In the coming years, the total amount of hydrocarbons discharged into the rivers and coastal waters in Peninsular Malaysia will progressively increase and add to the pollution already affecting parts of the region.

Correlation between total n-alkanes and TOC

Both terrestrial and marine sources contribute to the organic carbon (OC) found in sediments from aquatic environment (Prahl et al., 1994; Retnam et al., 2013). While the contributions of air deposition may be overlooked, fluvial transport is the primary process responsible for the input and accumulation of terrestrial total organic carbon TOC in the aquatic and marine sediments. Even carbon atom n-alkanes commonly originate from anthropogenic sources, while the source of odd carbon atom n-alkanes is biogenic (Kang et al., 2018; Retnam et al., 2013; Volkman et al., 1992).

The aliphatic hydrocarbons are substances that repel water. They have a propensity to sorb to organic materials once in an aquatic environment. The sediment's organic carbon content may have a significant impact on the quantities of hydrocarbons due to the sorptive tendency of aliphatic hydrocarbons. Peninsular Malaysia has a large number of plants; thus, the intense tropical rains can wash large quantities of organic materials out of the soil and into the rivers. At the same time, a significant amount of organic matter is known to be present in the mangrove environment.

In the Selangor River’ sediments, the TOC concentration varied between 1.16 and 1.98%. Table 4 shows that there was no correlation between TOC and n-alkane levels that was positive (r = 0.336, p > 0.05). The fact that TOC and hydrocarbon concentrations have separate sources may be one factor in their lack of association. While the research area's aliphatic hydrocarbons primarily resulted from anthropogenic inputs, the sediments' main sources of TOC are often naturals (Mirsadeghi et al., 2013; Ouyang et al., 2006; Schumacher, 2002). Other investigations (Cavalcante et al., 2009; Mirsadeghi et al., 2013; Tam et al., 2001) revealed there was no correlation between TOC and hydrocarbon contents, whilst other studies suggested a connection between TOC and hydrocarbon concentrations (Bush & McInerney, 2015; Chen et al., 2012; Huang et al., 2012). Given this, further information is required to completely comprehend the interactions between TOC and hydrocarbon concentrations in the tropical region.

Table 4 Correlation between the total concentration of n-alkanes and TOC

Grain-size distribution

Grains have been divided into three groups based on their sizes: gravel, sand, and clay. Table 5 displays the particle size distribution of the sediments taken from the Selangor River. Study results revealed very high variation in the grain sizes of the sample sediments. The distributions of the grain sizes differed from a location to another in the study area. However, in the Selangor River sediments, the sandy fraction predominated, and the amount of clay in all the layers was larger than that of gravel. The average percentages of sand, clay, and gravel were 85.96 ± 2.85, 13.45 ± 2.80, and 0.59 ± 0.34%, respectively.

Table 5 Grain size distributions in the sediments of the study area

Conclusion

In this investigation, petroleum hydrocarbons known as n-alkanes were found in samples taken from the Selangor River. Additionally, using various molecular tools, the content and distribution of these hydrocarbons were examined and their sources were identified. The results of the investigation show that the n-alkane levels are nearly equal at all sites. The L/H n-alkanes ratio values were frequently greater than 1 in the sediment samples. The investigation's findings also revealed that the majority of the silt included n-alkanes with an even number of carbon atoms, which can be attributed to microbial activity. The results of this investigation demonstrated that n-alkanes in sediments originated from a range of sources. Biogenic materials, such as leftover algae, aquatic life, terrestrial plants, and human activity, are potential sources of n-alkanes. The prevalence of HMW n-alkane with odd numbers of carbon atoms in certain of the sediment sample locations can be attributed to input of n-alkane from terrestrial high plants. In conclusion, the data suggested that the water quality of the Selangor River has declined as a result of anthropogenic activities in the state and rapid population growth. The concentrations of the investigated hydrocarbons varied from one sampling site to another and from one time period to another. As a result, ongoing assessment of the degree of the impact of hydrocarbon pollution on human or ecosystem health can show the level of pollutants brought about by anthropogenic activities. As a result, more research into hydrocarbons and other anthropogenic contaminants is essential for minimizing health hazards in Peninsular Malaysia.