1 Introduction

Organic molecules known as azo compounds have an azo group –N=N– linked to their molecular structure. They are extensively employed in the production of dyes, printing, cosmetics, and medicine application [1]. The manufacture of azo dye has increased in recent years, and its importance is still felt. Over 60% of all dyes are made up of it, and it is frequently utilized [2, 3]. They are well-known for their therapeutic uses, such as antiseptics, antioxidants, antidiabetic, antineoplastic, analgesic, anti-inflammatory, antiviral and tubercular, and antitumor activities [4, 5], in addition to their colors, which are determined by the presence of azo groups bound to aromatic heterocycles, benzene rings, and naphthalenes. They are distinguished by the covalent bond that forms between their chemical groups and the textile substrate. The simplest methods for creating azo compounds are diazotization and coupling [2]. As a result, tuberculosis (TB) remains the most lethal infectious disease in the world, as it is an airborne, fatal infection caused by the Mycobacterium tuberculosis bacterium despite several approaches that have been employed to curtail the dreaded disease. More so, it is a bacterial infection that is the second leading cause of death. Although there is TB therapy available, it has killed 1.5 million people and afflicted 10 million people in 2019. Whilst MDR and XDR-TB should make TB eradication possible, DOTS has failed to do so, with more people dying each day. Furthermore, Patients with MDR and XDR-TB require extremely complex treatment. As a result, fresh anti-TB medication has been sought for by researchers. Also I t has been determined that bedaquiline is the first brand-new TB medication to be introduced in more than 40 years. Since Mycobacterium tuberculosis can be controlled while also causing fewer adverse effects, finding such medications is a significant problem for researchers. Herein, the studied dye was prepared using the aforementioned techniques, specifically diazotization and coupling of the primary amine with the intermediates 4-aryl-2-aminothiophene-3-carbonitrile. The synthetic substance, a monoazo dye, has a color spectrum that is primarily between orange and brown, a blue shift (negative solvatochromism), little wavelength absorption, and strong solvent polarity. As many azo polymers are highly photoresponsive and used in dye-sensitizing solar cells and electro-optical activities, light absorption and electronic excitation are distinguishing characteristics of the azo cores [6]. The synthesized azo molecule was subjected to an experimental and theoretical research in order to determine its complete vibrational frequencies, electron excitation analysis, reactivity, and inhibitory actions against InhA proteins. Additionally, a number of theoretical features including vibrational analysis, NBO, CDFT, density of states (DOS), FMO, Aromaticity index, MEP, and population analysis were researched. Utilizing the Lipinski RO5 rule and ADMET, the pKSCM web server model was utilized to determine the physical–chemical similarities between various drugs (absorption, distribution, metabolism, excretion, and toxicity).

A molecular docking simulation was conducted based on the promising outcome of the examined compound’s drug-likeness. Because it investigates the impact or interaction between a potential drug (a ligand) and a protein, molecular docking is essential for drug discovery. In the current study, the authors looked at the examined structure’s inhibitory effects on a few InhA crystallographic structures. The short chain dehydrogenase/reductase family includes the INHA gene, which codes for the InhA enzyme and is important in the production of fatty acids. Mycolic acid is produced in part by the fatty acid biosynthesis pathway enzyme InhA. This is due to its reliance on fatty acid synthase 2 and NADH trans enoyl-acyl ACP carrier protein (FASII). The long chain fatty acids known as mycolic acids make it feasible for the human pathogens Mycobacterium TB and Mycobacterium leprae to construct cell walls. The oldest and most well-recommended synthetic medication for the treatment of tuberculosis is isoniazid, a key target of InhA [7]. The modeled structure was docked with crystallographic structures of InhA in order to explore its antitubercular activity against the enzyme because azo compounds are well recognized to have therapeutic effects. The reduced molecule (ligand) was docked with the chosen InhA proteins with PDB entry codes 1P44, 1P45, 3FNE, and 4TZK [8, 9] in order to screen the chemical under investigation as a potential InhA inhibitor.

2 Experimental and Computational Methods

2.1 Experimental

2.1.1 Synthesis of 2-Amino-4-(4-Aminophenyl)-5-(Phenyldiazenyl)Thiophene-3-Carbonitrile (AATC)

After being cooled at 0 to 5 °C in an ice bath, a well-stirred solution of 1-aminobenzene (10.2 g, 0.05 mol) in 2 N HCl (15 ml) was diazotized with 1 N NaNO2 solution (3.5 g, 0.05 mol; in 20 ml water). Over the course of 30 min, the sodium nitrite was gradually added while stirring. For an additional two hours, the reaction mixture was agitated. Using starch iodide paper, which produced a blue color, the combination was then examined for full diazotization. When the mixture failed the test, more sodium nitrite was cautiously added drop by drop until the mixture produced a positive color that remained steady for a few minutes. To eliminate extra nitrous acid, some urea was also added. A well-stirred solution of the coupling component was added gradually to the aforementioned cold diazonium solution. Distilled water (25 ml) and hydrochloric acid (6 ml, 36%) were used to react a solution of diazotized 1-aminobenzene (14.8 g, 0.05 mol at 95% purity) with a solution of 2-amino-4-(4-aminophenyl)thiophene-3-carbonitrile (4.9 g, 0.05 mol) during the course of 30 min at 0–5 °C. The mixture was swirled for three hours while being kept at a temperature between 3.5 and 4 degrees Celsius. The acquired dye was filtered out, allowed to air dry, and then weighed. Figure 1 reports the synthesis route of the produced azo dye.

Fig. 1
figure 1

Synthesis route of the studied azo compound

2.1.2 Spectral Measurements

Fourier transform infrared (FT-IR) measurements were made using an Agilent CARY 630 FTIR at room temperature in a KBr disc (thermal nicolet) in the wavenumber range 4000–450 cm ^(−1). The units of all absorption bands are cm ^(−1).

GCMS spectroscopy: Using the electron spray ionization (ESI) method, mass spectra were captured on Agilent Technologies 6400 mass spectrometers. The information reveals a peak at [M–Na]^ + 

2.2 Computational Details

2.2.1 Quantum Mechanical Calculations

Due to its efficiency, the Gaussian 09W computational software [10] was used to do the quantum chemical calculations of the simulated AATC in order to derive the molecular structure, along with hybrid-exchange correlation three-parameter functional level (B3LYP) and 6-311++G(d,p) basis set [11]. Using DFT (B3LYP)/6-311++G(d,p) functional, the optimal molecular geometry of AATC has been predicted in the gaseous phase. The Gaussian calculation setup was done using the GaussView application 6.0, which was also used to display the results [12]. The VEDA04 program [13] additionally with the help of the GaussView visualizer utilized the optimized molecular geometry for the computation of the vibrational frequencies and vibrational assignments of fundamental normal modes of vibration on the basis of PEDs. Utilizing the GaussView application, the population analysis was chosen in order to do NBO calculations. The time-dependent DFT (TD-DFT) method was used to obtain the hole-electron excitation energies, electronic transitions, and visual representations of the different excitation states. The results were used by the Multiwfn analyzer created by Tian-Lu [14] to obtain all the properties of excitation for the different excitation states in the studied compound. The GaussView program was used to visualize the structure's MESP. Conceptual Density Functional theory, DFT, was utilized to get aromaticity indices, ADCH population for charge analysis, and the output data was employed by Multiwfn analyzer to compute results.

2.2.2 Pharmacological Investigations

Using the Molinspiration web server [16], the molecular characteristics and Lipinski (Rule of Five) RO5 [15] for AATC and Isoniazid (standard medication) were calculated. With a focus on the absorption, distribution, and toxicity characteristics, a comprehensive ADMET (absorption, distribution, metabolism, excretion, and toxicity) was generated for isoniazid (the reference medication) using the pKCSM online program [17].

2.2.3 Molecular Docking Details

AutoDock 4.2 software was used to analyze molecular docking [18]. From the Brook-haven Protein data repository, the crystallographic structures of InhA (or, in one case, E. coli FabI) in association with inhibitors were taken (PDB ID: 1P44, 1P45, 3FNE, 4TZK). Using Discovery studio, the co-crystallized ligand, water molecules, and co-factors were removed, preparing the protein for docking. The structure of the ligand was drawn in Avogadro and then converted to PDB format. The protein was converted to SDF format using the Swiss PDB reader, which also removed certain protein residues that were thought to not be a part of the protein. The Autodock Vina tools were located in Pyrex, where docking was done and findings were obtained. A cubic grid box of 40 A (X, Y, Z) with a spacing of 0.375 A was constructed utilizing Auto dock tools incorporated in Pyrex in order to choose active site to prevent blind docking [19, 20]. The reduced molecule (ligand) was embedded in the protein 1P44, (X = 109.4876, Y = 109.2504, Z = 109.0562), with an exhaustiveness of 8, and the grid map was produced in the active site. Additionally, the AATC was docked to protein 1P45, and a grid map with the coordinates (X = 85.8796, Y = 90.5921, Z = 76.8279) and an exhaustiveness of 8 was created. The resulting grid map for protein 3FNE has an exhaustiveness of 8 and has the coordinates (X = 104.2502, Y = 104.6975, Z = 104.4952). The grid map created by the APTC docking with protein 4TZK is (X = 69.34818, Y = 71.1787, Z = 70.6503), with an exhaustiveness of 8. All docked ligands' binding affinities were chosen with an RMSD value of 0. Because the better docking pose matches the ligand's binding mode on the protein, the binding affinity with RMSD value of 0 is chosen. The outcomes were examined using Pymol software [21] and PyRx in both two- and three-dimensional orientation. Then, in both 2D and 3D formats, the ligand–protein interactions were examined using Biovia Discovery studio software [22].

3 Results and Discussion

3.1 Vibrational Analysis

Finding the vibrational modes connected to pertinent and specific chemical structures of the computed substance in question is the aim of the vibrational analysis. A non-linear molecule with N atoms can have a maximum of (3N-6) normal modes of vibration as active fundamentals that can be observed [23]. The investigated compound consists of 36 atoms and 102 vibrational modes. 35 of the 102 vibrational modes are bending, 34 are stretching, and 33 are torsional. Due to a confluence of electron correlation effects and flaws in the DFT basis set, the DFT estimated vibration wavenumber are frequently many orders of magnitude larger than the empirically observed wave number. In order to accurately fit the results, scale factors are used to match the DFT estimated wave numbers with those of the observed ones. In order to lessen the overall deviance, a scale factor of less than one (0.96) was employed. The theoretical IR and potential energy distributions associated with the fundamental modes in terms of vibrational assignments are shown in Table 1 together with the results of the DFT-B3LYP/6-311++G(d,p) calculation. The supplementary information’s Fig S1a, b shows the computed and experimental FT-IR. The absence of a few peaks with such low intensities can be confirmed by theoretically predicted FT-IR intensity values.

Table 1 Detailed assignments of fundamental experimental and theoretical vibrations of AATC by normal mode analysis

N–H Vibrations: The characteristic R-\({NH}_{2}\) vibrations for primary amines have been assigned in the region 3500–3200 \({\mathrm{cm}}^{-1}\) [23]. Bend in primary amines results in a broadband in the range 1640–1560 \({\mathrm{cm}}^{-1}\) [24]. The present calculations place the R-\({NH}_{2}\) stretching modes at 3451 and 3358 \({\mathrm{cm}}^{-1}\) in experimental FT-IR. The H–N–H band was also observed experimentally to occur at 1640 \({\mathrm{cm}}^{-1}\). The bands at 3516 \({\mathrm{cm}}^{-1}\) and 3417 \({\mathrm{cm}}^{-1}\) for R-\({NH}_{2}\) stretch and 1628 \({\mathrm{cm}}^{-1}\) for N–H bond B3LYP/6-31++G(d,p) set are in good agreement with observed spectral data. These are intense stretching modes and identified from the PED in Table 1.

C–N Vibrations: C–N stretch occurs in the region 1350–1000 \({\mathrm{cm}}^{-1}\) [25, 26]. The aromatic C–N stretch was observed to occur at 1334 \({\mathrm{cm}}^{-1}\). Theoretically, the computed wavelength of C–N stretch is observed at 1322 \({\mathrm{cm}}^{-1}\).

–N=N– Vibrations: The N=N stretching vibrations for azo compounds occurs in the region 1500 \({\mathrm{cm}}^{-1}\) and \(1400{ \mathrm{cm}}^{-1}\) [27]. The present calculations show that –N=N– vibrations occur at 1439 \({\mathrm{cm}}^{-1}\) while the computational frequency calculated at B3LYP/6-31++G(d,p) basis set was observed at 1403 \({\mathrm{cm}}^{-1}\).

=C–H Vibrations: The aromatic ring shows the occurrence of C–H stretching vibration modes in the range 3300–3000 \({\mathrm{cm}}^{-1}\) region [28]. The title compound observe =C–H stretching vibrational mode at 3239 \({\mathrm{cm}}^{-1}\) for experimental FT-IR while the computational frequency of the title compound was observed at 3106 \({\mathrm{cm}}^{-1}\). These are intense stretching modes and identified from the PED.

C–C and C=C Vibrations: The Aromatic C=C and C–C stretching vibrations (Aromatic ring stretching vibrations) occurs in the region 1625–1400 \({cm}^{-1}\)[29]. In the present study, the observed C=C stretching vibrational modes of the title compound are 1607 and 1506 \({\mathrm{cm}}^{-1}\) in experimental FT-IR and also C–C stretching vibrational modes are assigned at 1297 and 1181 \({\mathrm{cm}}^{-1}\) respectively in experimental FT-IR. The C–C–C planar vibration wave number will be reported in the range of 999–665 \({\mathrm{cm}}^{-1}\) by Sarojini and Co-workers [30]. The experimental aromatic ring C–C–C bending vibrations of the title compound has appeared at 731 and 690 \({\mathrm{cm}}^{-1}\) in experimental FT-IR. The computationally calculated FT-IR for C=C stretching vibrations, C–C stretching vibrations, and aromatic ring C–C–C bending vibrations are 1602 and 1411 \({\mathrm{cm}}^{-1}\), 1286 and 1190 \({\mathrm{cm}}^{-1}\), 763 and 688 \({\mathrm{cm}}^{-1}\), respectively.

3.2 Electron Spray Ionization (ESI) GC–MS Analysis

A method that enables the determination of a compound's molecular weight (mass) is mass spectrometry. Furthermore, it is possible to learn important structural details about the unidentified entity by calculating the masses of the pieces formed when high-energy molecules collide [31]. The mass spectrum of a chemical is often displayed as a bar graph with intensity (the number of ions of a certain m/e striking the detector) on the y-axis and m/e values (unit masses) on the x-axis. Gas chromatography-mass spectrometry (GC–MS) is a hyphenated technology that combines the detention power of mass spectrometry with the separating capacity of gas chromatography (GC). The molecular formula of the compound is 529 g/mol. As depicted in Fig. S2 of the supporting information, the mass spectra of the compound gave mass to charge ratio 75, 92, 211, 214, 228, 242, 283.2, 286.9, 301.1, 451.3, 505 correspondings to fragments \({C}_{6}{H}_{4}\), \({C}_{6}{H}_{6}N\), \({C}_{2}{H}_{4}{S}_{2}{O}_{6}\), \({C}_{11}{H}_{8}{N}_{5}S\), \({C}_{8}{H}_{4}{S}_{2}{O}_{6}\), \({C}_{8}{H}_{8}{S}_{2}{O}_{6}\), and \({C}_{14}{H}_{8}{S}_{2}{O}_{6}N\),

3.3 Natural Bond Orbital (NBO) Analysis

The molecular orbital energies and occupancies of the title molecule is calculated using B3LYP method and 6-311++G(d,p) basis set and the results is presented in Table 2. The second-order stabilization energy in line with the electron delocalization between the filled and unfilled orbitals is calculated as [32, 33].

Table 2 Representative values of Natural bond orbital (NBO) analysis for the studied compound
$${E}^{(2)}=\Delta {E}_{i,j}=-{q}_{i}\frac{{F}^{2}(i,j)}{{\varepsilon }_{i}-{\varepsilon }_{j}}$$
(1)

To estimate the delocalization patterns of electron density from the main occupied Lewis-type orbitals to unoccupied non-lewis type orbitals, the AATC compound’s NBO analysis is performed [34]. Intermolecular charge transfer (ICT), which results in the stability of the molecule, is caused by orbital overlap between bonding C–C, C–N, C–H, N–N, and C–S, according to NBO analysis of the investigated compound. The intramolecular interactions result in high electron density (ED) approximation of antibonding pi (π^*) orbitals. The greatest intramolecular interaction for the studied compound is formed by the orbital overlap between πC3–C4 and π*N16–N17 with electron density 1.725e to 0.009e which results in intermolecular charge transfer (ICT) causing stabilization of the system. The intramolecular hyper conjugative interaction of πC3–C4 to \(\pi\)*N16–N17 leads to the highest stabilization of 27.16 kJ/mol. The electron delocalization from \(\pi\)*C3–C4 to \(\pi\)*C6–C8, \(\pi\)*C1–C2 to \(\pi\)*C3–C4, \(\pi\)*N16–N17 to \(\pi\)*C3–C move with very high stabilization energy of 87.35, 86.37, and 47.81 kJ/mol respectively. Significant resonance interactions were observed LP (1) N29 to \(\pi\)*C1–C2, LP (1) N34 to \(\pi\)*C11–C13, LP (2) S5 to \(\pi\)*(2) C1–C2 show stabilization energies of 44.07, 28.60, and 27.37 kJ/mol with electron densities of 1.752e to 0.455e, 1.828e to 0.408e and 1.693e to 0.455e are also highly responsible for the stability of the studied compound. From the full NBO analysis in Table S2 of the supporting information, the high value of occupancies is predicted for CR(N93), CR(N34), CR(N17), CR(C32), CR(N16), CR(S5) bonding and its values are 1.99964, 1.99949, 1.99948, 1.99938, 1.99935, 1.99933. Therefore, the results suggest that CR(N93), CR(N34), CR(N17), CR(C32), CR(N16), CR(S5) bonding are essentially controlled by the p-character of these hybrid orbitals. The information found in Table S2 of the supporting documentation demonstrates that the ring and substituted groups interact (σ → σ^*) intramolecularly in a hyper conjugative manner. The occurrence of a few σ → σ^* transitions between rings and substituted groups demonstrates the strength of the relationship and further highlights how the chemical characteristics of the substituent and ring can be combined [35, 36]. The outcome of NBO study has supported the biological activity of the APTC.

3.4 Hole-Electron Excitation Analysis

Here, the movement of an electron from the hole to the electron is the main focus, and both the electron and the hole are depicted as real space functions. It should be emphasized that determining if a transition mode is a local excitation can be done using the overlapping integral of the hole-electron distribution. The charge transfer length of the current electron distribution mode is determined by the distance between the centroid of the hole and electron. The t, S r, and D indices, respectively, show whether the electron and hole distributions overlap, how much they overlap, and the magnitude of the charge transfer length overlap. The hole and electron are not regarded as separated due to charge transfer when t < 0, in accordance with [37,38,39,40,41], who state that “The t index measures the separation extend of the hole and electron in charge transfer (CT) direction.” The positive t index relates to the separation of the hole and electron distributions. In this section, excitation parameters: Hole-electron length (\(D\)) which can be considered as the charge transfer excitation, the overlap function (\({S}_{r}\)), the \(H\) index which reflects the breadth of the average distribution of hole and electron, the \(t\) index corresponds to the degree of hole and electron separation, the hole-electron coulomb attraction energy (\({E}_{cou}\)), hole delocalization index (\(HDI\)), and electron delocalization index (\(EDI\)) of all the five excited states are calculated and presented in Table 3. The hole delocalization index and electron delocalization index are both included in the table. S0 → S3 and S0 → S4 for the D index had extremely high values of 2.62 Å and 3.27 Å, respectively. As a result, they are thought of as CT excitations. The Sr indices of the five excitations are seen to be relatively large in Table 3. Because the excitation is of the highly confined π^* kind, the value of Sr for excited state 2, or S0 → S2, is fairly high up to 0.71. S0 → S1 is a n–π^* kind of excitation, which explains why its Sr value (0.50), which is tiny in comparison to the CT excitations of S0 → S3 and S0 → S4, is highly localized. All five types of excitations have substantial values for the H-index, which measures the width of the average distribution of holes and electrons, but the excitations from S0 → S2, S3, and S5 have the highest values. The hole electron map reveals that the distribution of the hole and electron of S0S1 is localized. The separation of the hole and electron is evident, according to the S0–S3–S4 t indices, which are somewhat positive. Therefore, it is crucial to think of S0 → S3, S4 as CT excitations. It is also reasonable to infer from the t value that the excited state S0 → S4 experiences more charge transfer than S0 → S3. The fact that the t indices for excitations S0, S1, S2, and S5 are all negative values shows how little their hole and electron are separated from one another. As shown in the table, it is clear that the electron of the excited state S0 → S1 is only somewhat localized despite having a reasonably high value of EDI, whereas the hole of the same excited state is very localized due to the high value of HDI. The excited state S0 → S1 is strongly confined when comparing the HDI and EDI values of all excitations. In contrast, S0 → S2, S3, S4, and S5 have higher delocalized electron and hole distributions. This is brought on by their low HDI and EDI levels. The least amount of HDI and EDI is visible in the excited state S0 → S5. The Sr index should, obviously, be the most significant in these investigations given the link between the hole-electron Coulomb attractive energy and the other electron excitation properties. It is clear that the main effect of the Sr index is the distance.

Table 3 Electron Excitation analysis of the five excited states of ACDP
Table 4 (a) Aromaticity indices showing PDI, PLR, FLU, FLU-π
Table 4 (b): Aromaticity indices showing HOMA and BIRD analysis

Distributions between the hole and electron which decreases the Coulomb attractive energy. The Sr value of S0 → S2 is the highest, while having the least Coulomb attractive energy, whereas the Sr value of S0 → S1 has the highest Coulomb attractive energy. Between the highest and lowest Coulomb attractive energy are shown by other excitations.

3.5 ADCH Population Analysis

The values are presented in Table S1 of the supplementary information. ADCH is calculated using the Multiwfn software analyzer using output from Gaussian 09 using the B3LYP method with 6-311++G(d,p) basis set. High efficiency and basis set insensitivity characterize the Hirshfield population analysis [42]. A better variant of the Hirschfield charge is ADCH [43]. The name ADCH comes from the fact that it is a modified version of the Hirschfield charge. By addressing the low dipole moment repeatability that exists in the Hirschfield charge, it enhances it. Due to the atomic charge effect, molecular polarizability, biological activity, and other molecular features that define a system, ADCH analysis is crucial in the application of quantum chemical calculations to the molecular system. Because electronegative hydrogen (less positive) is present, the carbon atom pulls the positive charge from the nitrogen atom, giving C32 the highest positive atomic charge of all the carbon atoms, according to the results of the ADCH analysis. As a result, C32 is the most acidic carbon atom and is vulnerable to nucleophilic assault. The presence of electronegative nitrogen allows the hydrogen atom to attract the positive charge from the nitrogen atom, making the H30 more electrophilic than the other hydrogen atoms, according to the ADCH atomic charge of the examined molecule. The additional hydrogen atoms that are immediately connected to nitrogen are also in this situation; their ADCH charges are also very positive. The N29 and N34 atoms in the modeled structure have larger negative atomic charges than the other atoms, according to the ADCH. The large positive values of the carbons linked to the electronegative nitrogen are readily apparent from Table S1. The total charge of the atoms that make up a fragment is known as the fragment charge. A ring is more acidic the higher its positive charge. Furthermore, fragment 3 has the most population and the largest positive charge value. As a result, the acidic ring is fragment number three. The population value and about equal amounts of negative charge are shared by fragments 1 and 2. The two end rings each have a negative atomic charge, while the centre ring has a large positive charge.

3.6 Frontier Molecular Orbitals (FMOs)

The energy of the frontier molecular orbitals is crucial for identifying a number of factors that influence a molecule's stability and reactivity. The concept of FMOs can be used to characterize all of a molecule's global reactivity properties. By deducting the energy of the LUMO from the HOMO (E HOMO-E LUMO) in FMOs, the energy band gap can be used to characterize crucial molecular characteristics as stability, reactivity, hardness, and non-linear optical properties [44, 45]. The bioactivity of the molecule will be described by the energy band gap value. A smaller band gap indicates strong molecular interaction and therefore ICT within the molecule. This in turn is used to predict the bioactivity of the molecule. The isosurface of the \({E}_{HOMO}\), \({E}_{LUMO}\), \({E}_{LUMO+1}\), \({E}_{HOMO-1}\), \({E}_{HOMO}-{E}_{LUMO}\), \({E}_{HOMO-1}-{E}_{LUMO+1}\) for the AATC studied structure were calculated by DFT/B3LYP method with 6-311++G(d,p) basis set are shown in Fig. 2. From the analysis of FMOs 375 molecular orbitals were identified for the structure. Of the 375 MOs 83 were occupied and the rest were unoccupied. Out of these 82, 83, 84, and 85 are the HOMO-1, HOMO, LUMO, and LUMO + 1 respectively. Their corresponding energy values are − 5.8 eV, − 5.2 eV, − 2.2 eV, − 0.5 eV. Calculating the energy band gap values between HOMO and LUMO of the modeled structure, we obtain 2.95652 eV as the value for the energy band gap of the structure. The low value of the band gap indicates the biological activity of the studied compound. The energy band gap has a very interesting relationship with NLO. Therefore, the low value of \(\left({E}_{HOMO}-{E}_{LUMO}\right)\) for the studied compound also indicates the presence of the NLO effect. The band gap of \({E}_{HOMO-1}-{E}_{LUMO+1}\) was obtained to be 5.26486.

Fig. 2
figure 2

Molecular orbital and energies for the HOMO, LUMO, HOMO − 1, LUMO + 1 of the modeled structure (AATC)

3.7 Molecular Electrostatic Potential (MESP)

The electrostatic potential is plotted on the surface of constant electron density in the MESP. A quantum-molecular descriptor called the molecular electrostatic potential is utilized to detect or locate molecular locations that could be vulnerable to nucleophilic and electrophilic assaults [46, 47]. The MESP is crucial for understanding how hydrogen bonds and a molecule's charge distribution interact. The following equation was used to determine the MEP for the structure under study:

$$V\left(r\right)=\sum {{Z}_{A}}/ {({R}_{A}-r)}-\int {\rho ({r}^{^{\prime}})}/ {({r}^{^{\prime}}-r)}d{r}^{^{\prime}}$$
(2)

where the summation runs over all the nuclei A in the compound. \({Z}_{A}\) is the charge of the nucleus A, located at \({R}_{A}\) and \({\rho }^{^{\prime}}(r)\) is the electron density function of the molecule. Different colors are used to illustrate various electrostatic potential levels. Deepest red denotes regions that are highly nucleophilic (electron-rich), deepest blue denotes regions that are highly electrophilic (electron-poor), light blue denotes regions that are slightly electron-deficient, yellow denotes regions that are slightly electron-rich, and green denotes neutral regions.

The color code of our present esp map is from − 6.985 \(\times {10}^{-3}\) (deepest red) to 6.985 \(\times {10}^{3}\) (deepest blue) for the studied compound as displayed in Fig. 3. MESP of the studied compound shows that the highest nucleophilic regions are around N33, − 0.0285171 of Cyanide group, N16, − 0.0209126 (which was predicted by CDD as the most nucleophilic region for attack by electrophiles), and N17, − 0.01285 of the azo group. Also, S5 of the thiophene is a rich electron site. The MESP map also shows that the two amino groups constitute the highest electrophilic deficient regions. H36 (0.0407641), H30 (0.0357419), and N34 (0.0291402) constitute the highest points for attack by nucleophiles.

Fig. 3
figure 3

Molecular electrostatic potential map calculated at B3LYP/6-311++G(d,p) level

3.8 Conceptual Density Functional Theory

The CDFT was applied as quantum chemical tool to vividly describe the global reactivity of the title molecule as reported in Table S2 of the supporting information. In this study, to calculate global reactivity descriptors such as molecular orbital energies (HOMO and LUMO), energy gap (\(\Delta\)E), electron affinity (EA), ionization potential (IP), Chemical potential (\(\mu\)), chemical hardness (η), Softness (\(\sigma\)), electronegativity (\(\gamma\)) and electrophilicity index (\(\omega\)), the Koopman’s approximation was employed.

$$\mathrm{VIP}={-E}_{HOMO}$$
(3)
$$\mathrm{VEA}={-E}_{LUMO}$$
(4)
$$\upchi =\frac{(VIP+VEA)}{2}$$
(5)
$$\mu =-\frac{-(VIP+VEA)}{2}$$
(6)
$$\upeta =\frac{VIP-VEA}{2}$$
(7)
$$\sigma =\frac{1}{\upeta }$$
(8)
$$\omega =\frac{{\mu }^{2}}{2\eta }$$
(9)

From the result obtained, the hardness value was 1.5045 \(eV\), which suggests more reactivity and possibility of Intramolecular charge transfer. The calculated value of electronic chemical potential (\(\mu\)), softness (\(\sigma\)), electronegativity (\(\gamma\)) and electrophilicity index (\(\omega\)) are − 3.70633 V, 0.6647 \({\mathrm{eV}}^{-1}\), 3.7063 \(\mathrm{eV}\), 4.5652 \(\mathrm{eV}\) respectively. According to literature [48], the chemical reactivity parameters indicate that ACDP molecule is biologically active.

Electrophilicity index encompasses the tendency of an electrophile to acquire an extra amount of electron density and the resistance of a molecule to exchange electron density with the environment [49]. The electrophilicity scale \(\omega\) scale allowed the classification of organic molecules as strong electrophiles with \(\omega >1.5\), moderate electrophiles with \(0.8<\omega <1.5\) eV and marginal electrophiles with \(\omega <0.8eV\) [49]. Since \(\omega\) of the studied structure is \(4.5652\) eV. Hence, it is a strong electrophile and shows high biological activity.

3.9 Local Reactivity Descriptors

The reactivity of each atom in a molecule is predicted using local reactivity descriptors. The Fukui function and the dual descriptor are two of the most significant subgroups of these reactivity descriptors. In more recent study, Martinez–Araya has discussed [50] the suitability of using a condensed relationship rather than the nucleophilic and electrophilic Fukui functions in the Hirsfeld population analysis for predicting preferred reaction sites. The relation provides the Fukui function.

$$f\left(r\right)={\left[\frac{\partial \rho (r)}{N}\right]}_{v}$$
(10)

where N is the number of electrons in the present system, the constant term \(v\) in the partial derivative is external potential. Fukui function can be defined for three situations:

$$\text{Nucleophilic attack }\,{f}^{+}\left(r\right)={\rho }_{N+1}\left(r\right)-{\rho }_{N}(r)$$
(11)
$$\text{Electrophilic attack }\,{f}^{-}\left(r\right)={\rho }_{N}\left(r\right)-{\rho }_{N-1}(r)$$
(12)
$$\text{Radical attack }\,{f}^{0}\left(r\right)=\frac{{f}^{+}\left(r\right)+{f}^{-}(r)}{2}$$
(13)

The electrophilic Fukui function, \({f}^{+}\)(r), describes the reactive sites in a molecule that are susceptible to nucleophilic attack, and the nucleophilic Fukui function, \({f}^{-}\)(r) describes the reactive sites that are more susceptible to electrophilic attack. The radical Fukui function \({f}^{0}(r)\), which is the average of \({f}^{-}(r)\) and \({f}^{+}(r)\) denotes the favorable site for radical attack. The condensed dual descriptor, \(\Delta f\left(r\right)\) is a very important parameter to study reactive sites. It is mathematically represented as:

$$\Delta f\left(r\right)={\left(\frac{\partial f(r)}{\partial N}\right)}_{v(r)}={f}^{+}\left(r\right)-{f}^{-}(r)$$
(14)

The sites in a molecule that are more vulnerable to electrophilic assault and nucleophilic attack can be identified using CDD. The most reactive locations in the structure under study are predicted in this situation using CDD. The location is favorable for nucleophilic assault if CDD > 0 and advantageous for electrophilic attack if CDD 0. According to Table S3 of the supporting data, C3 has the highest positive CDD value of any carbon atom. the most advantageous location for a nucleophilic assault on carbon atoms. The atom in the named compound, N16, has the highest positive CDD value of all the atoms, making it the location that is most vulnerable to nucleophilic assault. For both carbons and other atoms, C4 has the largest negative CDD value. It is therefore the location where entering nucleophiles are most likely to engage in electrophilic assault. Figure 4a–c of the supporting information gives the isosurface of \({f}^{-}\),\({f}^{0}\) and CDD respectively for the reduced compound. In the map in Fig. 5c, green and blue isosurface correspond to positive and negative region of CDD, respectively. Clearly, most negative part of CDD function is localized on C4, N29, C2, C8, C11, C9, N34. Hence, these positions are favorable reactive sites for electrophilic attack. The most positive part of CDD function is localized on C1, C3, C8, C7, N16, N17. These positions are favorable sites for nucleophilic attack.C32, C13, C19, C21, C23 are not so positive as C1, C3, C8, C7, N16.

Fig. 4
figure 4

a Isosurface map of \({f}^{-}\) for AATC. b: isosurface map of \({f}^{0}\) for AATC. c: isosurface map of CDD for AATC

3.9.1 Aromaticity Index

Para Delocalization Index is a theory used to calculate the aromaticity indices of a six-membered ring [51]. PDI is defined as the average of the sum of the delocalization index in a six-membered aromatic ring [52]. For a C1–C6 membered benzene ring it is mathematically represented as:

$$PDI=\frac{\delta \left(\mathrm{1,4}\right)+\delta \left(\mathrm{2,5}\right)+(\mathrm{3,6})}{3}$$
(15)

It has been found that the delocalization and aromaticity increase with increasing PDI. It can be seen from Table 1 that fragment 2 has the bigger PDI. They are primarily delocalized and fragrant as a result. Similar to PDI, FLU is another aromaticity index that may be determined for aromatic rings of any number of atoms [53]. FLU is mathematically represented as:

$$FLU=\frac{1}{n}\sum_{A-B}^{ring}{\left[{\left(\frac{V(B)}{V(A)}\right)}^{\alpha }\frac{\delta \left(A,B\right)-{\delta }_{ref}(A,B)}{{\delta }_{ref}(A,B)}\right]}^{2}$$
(16)

where the summation runs over all adjacent pairs of atoms around the ring, \(n\) is equal to the number of atoms in the ring, \({\delta }_{ref}\) is the reference DI value, which is a pre calculated parameter, \(\alpha\) is used to ensure the ratio of atomic valences is greater than 1.

$$\alpha =\left\{\begin{array}{c}1 V\left(B\right)>V(A)\\ -1 V(B)\le V(A)\end{array}\right.$$
(17)

From Eqs. (16) and (17), it is obvious that the lower the FLU value the higher the aromaticity since FLU can be carried out for rings with any number of atoms. The aromaticity parameters of the studied compound are shown in Tables 4a, 4b. Fragment 3 which corresponds to a ring with five atoms that has the least FLU value becomes the most aromatic and delocalized. Therefore, according to FLU Frag 3 < Frag 2 < Frag 1.

Flu-\(\pi\) is based on DI\(-\pi\) and \(\pi -\)atomic valence [54]. It is mathematically represented as

$${FLU}_{\pi }=\frac{1}{n}\sum_{A-B}^{ring}{\left[{\left(\frac{{V}_{\pi }(B)}{{V}_{\pi }(A)}\right)}^{\alpha }\frac{{\delta }_{\pi }\left(A,B\right)-{\delta }_{ref}(A,B)}{{\delta }_{ref}(A,B)}\right]}^{2}$$
(18)

Similarly, like \(FLU\), the lower the \({FLU}_{\pi }\) the higher the aromaticity in the compound. From Table 1 Fragment 1 has the least \({FLU}_{\pi }\) value, hence, the most aromatic.

PLR is PDR with DI replaced by CLRK [55]. It is mathematically represented as:

$$PLR(A,B)=\frac{{\chi }_{\mathrm{1,4}}+{\chi }_{\mathrm{2,5}}+{\chi }_{\mathrm{3,6}}}{3}$$
(19)

PLR like PDI has stronger aromaticity with higher values of PLR. Therefore, Fragment 2 has the strongest aromaticity.

HOMA is another theory for studying aromaticity indices [56]. It is represented by the formula:

$$HOMA=1-\sum_{i}\frac{{\alpha }_{i,j}}{N}{\left({R}_{ref}-{R}_{i,j}\right)}^{2}$$
(20)

where N is the number of atoms considered, j denotes the atom next to atom i, \(\alpha\) and \({R}_{ref}\) are pre-calculated constants given in the original paper for each type of atom pair. From Eq. (20), it is possible to deduce that if HOMA equals 1 the ring is fully aromatic, and if HOMA equals 0 the ring is not aromatic. From Tables 4a, 4b it is obvious that fragment 2 is the most aromatic since it has the highest HOMA value. Hence, Frag 2 < Frag 1 < Frag 3.

BIRD is a geometry-based theory that can also be used to study the aromaticity of rings [57]. It can be represented by the following formula:

$$I=100\left[1-\left({V}/ {{V}_{K}}\right)\right]$$
(21)
$$V=\frac{100}{N}\sqrt{\frac{\sum_{i}{\left({N}_{i,j}-\overline{N }\right)}^{2}}{n}} {N}_{i,j}=\frac{a}{{R}_{i,j}}-b$$
(22)

i cycle all of the bonds in the ring, j represents the atom next to atom I, n is the total number of bonds considered. N represents Gordy bond order, \(\overline{N }\) is the average value of the N values. \({R}_{i,j}\) Represents the bond length, a and b are predefined parameters respectively for each type of bond. According to the equation in Eq. (21), the more the BIRD index is close to 100, the more aromatic the matching ring is. Additionally, BIRD supports the higher aromaticity of ring 2 that HOMA analysis has already suggested. In conclusion, it is seen that, of the six criteria examined for aromaticity, five confirmed the greater and stronger aromaticity of frag 2, with the exception of the FLU analysis, which suggested that frag 3 would have greater aromaticity. This might be caused by the fewer atoms in ring of fragment 1.

3.9.2 Non-Linear Optical (NLO) Effects

The nonlinear response of variables like frequency, amplitude, phase, or other propagation characteristics of incident fields is explained by nonlinear optical (NLO). On the other hand, nonlinear optical (NLO) effects result from interactions between applied electromagnetic radiation to create new fields from incident one [58]. According to P. N. Prasad and D. J. Williams, organic materials with high levels of electron donors and acceptors as well as highly delocalized π-electron portions of molecules are excellent sources of NLO materials [59]. A Taylor's series expansion of the total dipole moment, μ_tot, caused by the electric fieldE i (ω) can be used to represent the NLO response of an isolated molecule in the field.

$${\mu }_{i}\left({E}_{i}\right)={\mu }_{i}+{\alpha }_{ij}{E}_{j}+\frac{1}{2!}{\beta }_{ijk}{E}_{j}{E}_{k}+\frac{1}{3!}{\gamma }_{ijkl}{E}_{j}{E}_{k}{E}_{l}+\dots$$

where \({E}_{i}\) be the homogenous electric field, \({\mu }_{i}(E)\) be the dipole moment in an electric field, \({\mu }_{i}\) be the dipole moments at zero field, \({\alpha }_{ij}, {\beta }_{ijk},{\gamma }_{ijkl}\) are the polarizability tensor component, first-order hyperpolarizability component, and second-order hyperpolarizability component, respectively. The electric dipole moment (\(\mu\)) and the average polarizability \(({\alpha }_{tot})\) using the x, y, and z components can be obtained by the following relation:

$$\mu =\sqrt{\left({\mu }_{x}^{2}+{\mu }_{y}^{2}+{\mu }_{z}^{2}\right)}$$
$${\alpha }_{tot}=\frac{{\alpha }_{xx}+{\alpha }_{yy}+{\alpha }_{zz}}{3}$$

Anisotropy of polarizability

$$\Delta \alpha ={2}^{{-1}/ {2}}{\left[{\left({\alpha }_{xx}-{\alpha }_{yy}\right)}^{2}+{({\alpha }_{yy}-{\alpha }_{zz})}^{2}+{\left({\alpha }_{zz}-{\alpha }_{xx}\right)}^{2}\right]}^{{1}/ {2}}$$

The first order polarizability \(({\beta }_{ijk})\) is a \({3}^{rd}\) rank tensor \(\left(3\times 3\times 3 matrix\right).\) the equation for calculating the magnitude of first-order hyperpolarizability i

$${\beta }_{tot}=\sqrt{{\beta }_{1}^{2}+{\beta }_{2}^{2}+{\beta }_{3}^{2}}$$

where

$$\beta _{i} = \frac{1}{3}\sum\limits_{{j = 1}}^{3} {\left( {\beta _{{ijj}} + \beta _{{jij}} + \beta _{{jji}} } \right)} = \sum\limits_{{j = 1}}^{3} {\beta _{{ijj}} }$$

Here i, j = x, y, and z. and the final form resulting from Kleinman symmetry [60].

Polar keyword, B3LYP functional, and 6-311++G (d,p) basis set were used in the computations, and the output was loaded using a multiwave function analyzer. The results are all reported in a.u. Table 5's findings include the electronic atomic dipole moment, average polarizability that is isotropic, polarizability anisotropy, and initial hyperpolarizability's magnitude. The predicted dipole moment in this investigation is 0.607926 au. The maximum value of the dipole moment is observed along the z-direction − 0.470857 au. The calculated average polarizability \(({\alpha }_{tot})\) and anisotropy of polarizability value are 293.8313 au and 296.1043, respectively. According to Table 5, the examined chemical has a first-order hyperpolarizability value of 3993.12069 atomic units. Average polarizability and first-order hyperpolarizability values calculated are much higher than those for urea (β_tot = 43au) [61], a reference molecule for analyzing NLO response. The reduced molecule is found to have a first hyperpolarizability value that is 93 times greater than that of urea, making it a strong contender for use as an NLO material and a desirable subject for future research on its non-linear optical properties.

Table 5 The electric dipole moment \((\mu )\), polarizability \((\Delta \alpha )\), and first order hyper polarizability \((\beta )\) of reduced compound by B3LYP/6-311++G(d,p) approach and Multi wave function analyzer

3.9.3 Pharmacological Investigation

The characteristics of the examined substance and the test drug sample, isozianid, employed in this study are described in Table 6. The Lipinski RO5 explained four physicochemical parameter ranges (MWT ≤ 500, miLogP ≤ 5, No. of H-bond donor ≤ 5, No. of H-bond acceptors 10) and offered information on the orally active medicines. When compared to isozianid, the physicochemical parameter values of the aforementioned compound follow the Lipinski RO5 (Rule of 5) without any appreciable divergence.

Table 6 Molinspiration property values of studied compound (APTC) and isozianid
Table 7 (a) Computed Absorption properties using pkCSM web server model

Log \({P}^{a}\): Logarithm of the partition coefficient between n-octanol and water (miLogP).

\({TPSA}^{b}\): Topological polar surface area.

\({MW}^{c}\): Molecular weight.

n rot \({b}^{d}\): Number of rotatable bonds.

Using the pkCSM web server model, the ADMET absorption, distribution, and hazardous characteristics for the named chemical and isozianid have been calculated and are shown in Tables 7a, 7b, 7c in the appropriate order. The investigated structure has a higher Caco-2 permeability than isozianid. As a result, the named component will work best for enhancing drug absorption when taken orally. In the small intestine, APTC and isozianid are both heavily absorbed, but the investigated structure absorbs more. It further demonstrates the named structure's favored suited for aiding in the absorption of medicines taken orally. (− 6 ≤ logS ≤ 0.5) is the suggested range for a chemical to be good oral bioavailability. The examined substance and isozianid adhere to the rule. An ATP-binding cassette called P-glycoprotein serves as a biological barrier by pushing several foreign chemicals out of cells. Isoniazid functions differently to APTC, which can act as a P-gp substrate. Skin permeability refers to a substance's capacity to pass through the skin. If a substance has logKp > − 2.5, it is thought to have a relatively poor skin permeability. Consequently, APTC is less skin impermeable than isoniazid.

The more a medicine is dispersed in tissue as opposed to plasma, the larger its steady-state volume of distribution (VDss). Low VDss is defined as (log VDss < − 0.15), and high VDss is defined as (log VDss > 0.45). As a result, it may be inferred from the values of log VDss that isoniazid is more widely dispersed in plasma than in tissue, whereas the titled molecule is more evenly distributed between the two. The extent to which a medicine attaches to proteins can have an impact on how effective it is since the more tightly it is attached, the less effectively it can diffuse. Isoniazid is less efficient at binding proteins than APTC. To help prevent side effects and toxicities or to increase the effectiveness of medications whose pharmacological activity is within the brain, the ability of a drug to pass into the brain is a crucial characteristic to take into account. Molecules with log BB − 1 are poorly distributed to the brain, whereas those with log BB > 0.3 are thought to be ready to cross the BBB for a given substance. The BBB < − 1 of the named compound and isoniazid are seen to be neither high nor low, while isoniazid has a higher BBB. Compounds with a Log PS > − 2 are considered to penetrate the central nervous system while those with Log PS < − 3 are considered as unable to penetrate the CNS. From the values of Log PS in Table 7b, it is observed that titled compound can penetrate the CNS while isoniazid can’t penetrate the CNS.

Table 7 (b) Computed Distribution properties using pkCSM web server model
Table 7 (c) Computed Toxicity properties using pkCSM web server model

The Oral Acute toxicity (LD50) value for the studied structure is somewhat higher than that of the standard drug implying that the titled compound is a little more toxic than Isoniazid. Both titled compound and isoniazid show a positive AMES toxicity test implying that they may act as carcinogens. The maximum recommended tolerated dose (MRTD) provides an estimate of the toxic dose threshold of chemicals in humans. For a given compound, a MRTD of less than or equal to 0.477 log(mg/kg/day) is considered low, and high if greater than 0.477 log(mg/kg/day). The modeled structure (APTC) has a low MRTD and the Standard drug isoniazid has a high MRTD. The predicted lowest dose of a compound that results in an observed adverse effect is 0.764 for the titled compound and 2.824 for isoniazid. Both the studied compound and the standard drug are not associated with the normal functioning of the liver and also not associated with skin sensitization. Based on results from pharmacological data analysis, APTC has immense possibilities to be used as a standard drug for the treatment of tuberculosis (Tables 8a, 8b, 8c, 8d).

Table 8 (a) Binding energy, hydrogen bond, and hydrophobic contacts of APTC with 1P44 protein
Table 8 (b) Binding energy, hydrogen bond, and hydrophobic contacts of APTC with 1P45 protein
Table 8 (c) Binding energy, hydrogen bond, and hydrophobic contacts of APTC with 3FNE protein
Table 8 (d) Binding energy, hydrogen bond, and hydrophobic contacts of APTC with 4TZK protein

According to Fig. 5, the alkyl–π, π–π interaction and hydrogen bond stabilize the 1P45-ACDP complex (b). However, the 1P45-ACDP ligand–protein complex shows three hydrogen bond connections. At a distance of 2.470 and 2.963 Å, respectively, the ligand’s atom N1 had branching hydrogen bond interactions with the atoms O1 and O2 of the active residues GLY14 and SER94. At a distance of 2.635 Å, the ligand's atom N2 forms bifurcated hydrogen bonds with the active residue GLY96’s atom O3. Also seen in the ACDP-1P45 complex is one π–π interaction. At a distance of 3.983 Å, the six-membered ring of the residue PHE41 interacts with the benzene ring C1–C6 to form a hydrophobic bond. Further, four alkyl–\(\pi\) interactions are also observed in such a way that the CB-CG1 of the protein interacts with the benzene ring C1–C6 at a distance of 4.998 and 5.170, respectively constituting alkyl–\(\pi\) hydrophobic interactions, CB-CG1 of the protein interacts with the thiophene ring of the ligand at a distance of 5.425 Å, CB atom of the protein interacts with the benzene ring C1–C6 at a distance of 4.321 Å.

According to Fig. 5, the hydrogen bond, alkyl–π, π–π, and interaction bring the 4TZK-ACDP complex to steady state (c). In the 4TZK-ACDP complex, there is one hydrogen bond interaction that is seen. At a distance of 2.490 Å, the atom N1 of ACDP forms branching hydrogen bonds with the atom O1 of the active residue SER94. Furthermore, there are two π–π interactions found in the ACDP-4TZK complex. At distances of 3.734 and 4.961 Å, respectively, the six-membered ring of the residue PHE41 joins with the benzene ring C1–C6 of the ligand to form—hydrophobic interactions. Furthermore, five alkyl–\(\pi\) interactions are also discovered in such a way that the CB-CG1 of the protein PHE41 reacts with the benzene ring C1–C6 at a distance of 4.874 and 5.208 Å, respectively generating alkyl–\(\pi\) hydrophobic interactions, CB-CG1 of the protein ILE16 and ILE95 interacts with the thiophene ring of the ligand at a distance of 5.241 and 5.357 Å, respectively, CB-CG1 of the protein ILE16 and ILE95 interacts with the benzene ring C1–C6 of the ligand at a distance of 4.874 and 5.208 Å, respectively, CB atom of the protein VAL65 interacts with the benzene ring C1–C6 at a distance of 5.035 Å.

The 3FNE-ACDP complex is balanced by hydrogen bond and alkyl–\(\pi\), \(\pi\)\(\pi\) interaction as depicted in Fig. 5d. As observed, one hydrogen bond interaction is revealed in the 3FNE-ACDP complex. In like manner, the atom N1 of ACDP has bifurcated hydrogen bond interactions with the atom O1 of the active residue GLY14 at a distance of 2.941 Å. Also, two \(\pi\)\(\pi\) interactions are observed in the ACDP-3FNE complex. The six-membered ring of the residue PHE41 interacts with the benzene ring C1–C6 of the ligand forming \(\pi\)\(\pi\) hydrophobic interactions at a distance of 3.724 and 5.000 Å, respectively. Moreover, five alkyl–\(\pi\) interactions are also noticed in such a way that the CB-CG1 of the protein ILE16 combines with the benzene ring C1–C6 at a distance of 5.233 Å creating alkyl–\(\pi\) hydrophobic interaction, CB-CG1 of the protein ILE16 reacts with the thiophene ring of the ligand at a distance of 4.759 Å, CG1 of the protein VAL65 and ILE122 interacts with the benzene ring C1–C6 of the ligand at a distance of 5.292 and 5.208 Å\(,\) respectively.

Fig. 5
figure 5

a Interaction of studied compound to 1P44 binding site and Binding mode of studied compound at the binding site of 1P44. b Interaction of studied compound to 1P45 binding site and Binding mode of studied compound at the binding site of 1P45. c Interaction of studied compound to 3FNE binding site and Binding mode of studied compound at the binding site of 3FNE. d Interaction of studied compound to 4TZK binding site and Binding mode of studied compound at the binding site of 4TZK

The significant part of the docking studies is that three hydrogen bond interactions were observed in the 1P45-ACDP complex. However, the ligand–protein interactions of 1P45-GEQ, 1P45-d11, 1P45-8PC, 1P45-8PS, and 1P45-AYM complexes which are originally used as Ligands for the selected InhA proteins yielded interactions with hydrogen bonds less than or equal to two [63] as shown in Tables 9a, 9b, 9c, 9d, 9e. Moreover, the docked 1P45-ACDP complex has a higher binding energy − 8.40 than all the other protein–ligand complexes except the 1P45-GEQ complex with binding energy − 9.20 which may be due to a greater number of hydrophobic interactions when compared to 1P45-ACDP complex. 1P44 has two hydrogen interactions when compared to already studied complexes only 1P45-GEQ has 2 hydrogen bond interactions, others (1P44-d11, 1P44-8PC, 1P44-8PS, and 1P44-AYM complexes) do not form hydrogen bonds with 1P44. The binding energy of the 1P44-ACDP complex is − 8.00, which is higher than all other protein–ligand complexes except 1P45-GEQ 10.9 which may also be due to higher hydrophobic interactions. 4TZK and 3FNE give one hydrogen interaction and a good number of hydrophobic interactions with a binding energy of − 8.20 and − 8.30, respectively. The existence of hydrogen and hydrophobic interactions identifies the increase in binding affinity and bioactivity of APTC when compared to other ligands.

Table 9 (a) Docking of GEQ with selected inhA proteins
Table 9 (b) Docking of D11 with selected inhA proteins
Table 9 (c) Docking of 8PCwith selected inhA proteins
Table 9 (d) Docking of 8PS with selected inhA proteins
Table 9 (e) Docking of AYM with selected inhA proteins

These results recommend that the compound might be suitable as an inhibitor for inhA Proteins present in Mycobacterium tuberculosis. Hence, the studied compound may be regarded as a suitable drug for producing more tuberculosis drugs.

4 Conclusion

This research studies were focused on the experimental, DFT, and molecular docking studies of azo dyes synthesized by the reaction of diazotised 4-Aminobenzene-4-β-Sulphatoethylsulphone and 2-amino-4-(4-aminophenyl) thiophene-3-carbonitrile (AATC). The experimentally synthesized azo dye compound was characterized by FT-IR and GC–MS spectroscopy. The experimental measured FT-IR was reported in comparison with the theoretical vibrational DFT calculation. PED of normal modes of vibrations was done using the Veda 04 program. MESP and Local descriptors predict the most reactive part of the reduced modelled azo dye compound (ACDP). MESP and Condensed Dual Descriptor (Local Descriptors) predicted almost the same sites of the molecule as the most reactive in the compound. Excitation energy analysis predicted the types of electron excitations inherent in the five excited states of the compound. The lower band gap of the compound indicates the presence of a nonlinear optical (NLO) effect and this confirms the biological activity of the compound. Aromaticity indices (PDI, FLU, FLU-\(\pi\), PLR, HOMA, BIRD) predicts the most stable and less stable aromatic ring fragment of the compound and their predictions correspond. The studied compound was found to display good NLO properties as the first hyperpolarizability is 93 times greater than that of Urea. Hence, it can serve as a suitable material for various technological applications. The Molecular docking results suggest that the compound might exhibit inhibitory action against InhA proteins which are responsible for Mycobacterium tuberculosis hence applied as an anti-tuberculosis agent. The ADMET results also suggest that the compound can be used as a drug as it obeys Lipinski RO5 and also yielded good results in comparison with Isoniazid. Therefore, we hope that the results help researchers in synthesizing other novel Anti-tubercular drugs.