1 Introduction

The control over the transverse modal content of optical fibers by means of periodic perturbation of light propagation has found many applications since the first introduction of permanent long-period gratings (LPGs) written point by point, using UV laser sources illuminating a photosensitive fiber from the side [1]. Applications of LPGs include tunable band-rejection filters [24], gain equalizers [5] and dispersion compensation [68] in the field of telecommunications. All of the above-mentioned long-period gratings have in common that they are permanently inscribed into the fiber.

Parallel to the development and application of fixed LPGs, a dynamic control of the modal content was found of interest and firstly realized by inducing flexural acoustic waves in a fiber [911]. While offering some flexibility, the rate by which these gratings can be changed is fundamentally limited by the speed of acoustic waves in glass, and acoustic gratings have not found a wide application yet. However, in general a fast and dynamic control of the modal content at a position of choice could be of high interest, e.g., in the context of ultrafast transverse switching for upcoming spatial division multiplexing systems [12]. In order to further scale data capacities, transverse eigenmodes of few-mode fibers are proposed to be used as a multiplicator for existing data channels in telecommunications [13, 14]. In this context, acousto-optical switching of transverse modes has very recently gained new interest [15]. However, as these switches are limited in the achievable switching speed to the order of tens of microseconds, one challenging aspect of using transverse modes for data communications remains the ultrafast routing and switching.

An ultrafast, flexible and noninvasive method for changing the transverse modal content of a fiber seemed not feasible using long-period gratings until Park et al. and others demonstrated an optically induced long-period grating (OLPG) making use of the dependence of the refractive index on the local intensity (the optical Kerr effect) as well as a multi-mode interference beating between two transverse modes of a quasi-cw nanosecond control beam [16, 17]. In order to reduce pulse energy while keeping the pulse peak power fixed, it was recently proposed and numerically modeled to use femtosecond pulses for the probe and the control beam in a co-propagating setup [18]. This reduction in necessary pulse energy has been successfully realized experimentally achieving a reduction in about a factor of 150 in step-index fibers [19] as well as even a factor of 300 using a two-color approach in graded-index fibers [20]. A further decrease in pulse energy was predicted for using integrated optical waveguides and highly nonlinear materials [21].

Although first analytical expressions for OLPGs based on coupled mode equations have been derived already early in reference [16], so far predicting the efficiency of the all-optically induced gratings solely relies on numerical simulations either using a beam propagation method [22] for quasi-continuous-wave (quasi-cw) radiation or using the multi-mode generalized nonlinear Schrödinger equation (MM-NLSE) in the case of ultrashort pulses [18, 21, 23].

As a complement to these time-consuming numerical tools, an analytical framework for the description of cw- and ultrafast induced OLPGs will be presented here. Analytical expressions for the evolution of the modal contents of the probe beam were derived, which describe the energy evolution exactly in the cw-limit. Remaining limitations of the accuracy of the predicted mode conversion were found to arise for a mismatch in pulse duration or when the occurring group delay of the used ultrashort control and probe pulses is comparable to or even larger than their pulse duration. However, even in these unfavorable cases the analytical model allows to determine the order of magnitude of the achievable conversion at given fiber and control beam parameters. If more exact predictions are necessary, the results from the model can be used to quickly narrow the range of initial parameters for a full numerical simulation to save valuable numerical simulation time (minutes up to hours for a full simulation compared to milliseconds for evaluating the analytical expression). The analytical nature of the model, furthermore, accomplishes a direct insight into the physical process and allows to determine the major control pulse and waveguide parameters that govern the mode conversion.

2 Analytical model

OLPGs have been described theoretically via the induced change of the refractive index by the optical Kerr effect (material representation, [16]). The refractive index change being induced by the multi-mode interference of two control beam modes (amplitude \(A_j\), \(j=1,2\)) and acting on two probe beam modes (\(j=3,4\)) can be evaluated by the so-called coupled mode theory [24], very similar to the treatment of permanent long-period gratings. The resulting change in refractive index alongside the waveguide (z-direction),

$$\begin{aligned} n(x,y,z)&= n_0^\prime (x,y)+ \Delta n(x,y) \cos (\varOmega z-\phi _0), \end{aligned}$$
(1)

with \(\Delta n\) being a function of the control beam power and \(\varOmega =\Delta \beta _c = \beta _1-\beta _2\) of its propagation constants \(\beta _j\) directly leads to the coupled mode equations (see “Appendix 1” for details).

The solution in the material representation is equivalent to describing the interaction in a nonlinear wave interaction representation of four-wave mixing (see “Appendix 2”). The coupled mode equations can be directly derived from the four-wave mixing interaction between the modes plmn described in the MM-NLSE by the overlap integrals \(Q^{(1,2)}_{plmn}\) (see reference [25] for details):

$$\begin{aligned} \frac{\partial A_3}{\partial z}&=i\beta _3 \cdot A_3 + 4 \frac{in_2\omega _0}{c} \vert A_1\vert \vert A_2 \vert \cos \left[ (\beta _1-\beta _2) z\right] \nonumber \\&\quad \cdot \, \left( Q^{(1)}_{3412} A_4+Q^{(1)}_{3312}A_3\right) \nonumber \\&= i\beta _3 \cdot A_3 + iD_{34}\cdot A_4+iD_{33} A_3, \end{aligned}$$
(2)

with \(n_2\) being the nonlinear index of refraction, \(\omega _0\) the angular frequency of the control beam and \(D_{34},D_{33}\) the coupling coefficients. The equivalency of the two theoretical approaches connects the four-wave mixing interaction directly with transverse mode conversion for the first time and strengthens the physical interpretation of the mode conversion process in reference [18] being a result of optically induced long-period gratings.

Fig. 1
figure 1

Calculated all-optical cw power transfer from the \(\text {LP}_{01}\) (red dashed line) to the \(\text {LP}_{11}\) probe mode (blue solid line) in comparison with a numerical simulation relying on a three-dimensional beam propagation method (dashed and solid black lines). The control beam was assumed to be cross-polarized and at a power of 60 kW in each mode. Both beams were centered in wavelength around 1030 nm, and the fiber was a step-index fiber with a core diameter of \(25\,\upmu \hbox {m}\) and a numerical aperture of about 0.06 at 1030 nm

After deriving the coupled mode equations for OLPGs, the results from standard coupled mode theory can be readily applied: The dimensionless phase mismatch \(\sigma\) normalized to the coupling strength between the induced grating and the two probe modes for OLPGs results in

$$\begin{aligned} \sigma&= \frac{2}{\kappa } \left\{ (\beta _1-\beta _2) -(\beta _3-\beta _4)\right\} = \frac{2}{\kappa } \Delta , \end{aligned}$$
(3)

with \(\kappa =\sqrt{\bar{D}_{34}\bar{D}_{43}}\propto \vert A_1\vert \vert A_2\vert\) and \(\Delta =\Delta \beta _c - \Delta \beta _p\) being the mismatch of the difference in propagation constants of the control and the probe modes. With an induced grating and assuming unit power in one probe mode (\(j=3\)) and zero power in the other probe mode (\(j=4\)) at the beginning of the fiber results in an asymptotic power transfer to the second probe mode of

$$\begin{aligned} P_4(z)=F^2\sin ^2\left( \frac{\kappa \cdot z}{2 F}\right) ,\quad F=1/\sqrt{\left( 1+\frac{\sigma ^2}{4}\right) }. \end{aligned}$$
(4)

The power in the first probe mode is always given by \(P_3=1-P_4\) due to energy conservation. The first deduction from Eqs. (3) and (4) is that the effective conversion phase mismatch \(\sigma\) does not only depend on the difference in propagation constants of both modes (\(\Delta\)), but is instead normalized to the coupling strength \(\kappa \propto \vert A_1\vert \vert A_2\vert =\sqrt{P_1\cdot P_2}\) and thereby to the power of the control beam modes. The resulting power dependency of the conversion efficiency \(\left( \eta =P_4/(P_3+P_4)\right)\) was also found in the numerical simulations presented in references [21, 23]. In these simulations, the decreasing effective conversion phase mismatch with control beam power was interpreted physically as an increasing conversion rate making up for the dephasing of the propagating probe and control modes.

Furthermore, when taking into account the first-order correction to the asymptotic solution from Eq. (4) in the phase-matched scenario (\(\sigma =0\), leading to \(F=1\)), the power in mode \(j=4\) is given by [26]:

$$\begin{aligned} P_4(z)=\sin ^2\left( \frac{\kappa \cdot z}{2}\right) -\frac{\kappa }{4\Delta \beta _p}\sin \left( 2\Delta \beta _p \cdot z\right) \sin \left( \kappa \cdot z\right) \end{aligned}$$
(5)

leading to a small oscillation of the modal power around the asymptotic solution with a period much shorter than the conversion length. The main contribution is still the asymptotic solution \(\sin ^2\left( \frac{\kappa \cdot z}{2}\right)\) that was already found in Eq. (4). The amplitude of the oscillations is given by the fraction \(\kappa / \Delta \beta _p\), so that a very strong control beam as well as a very small difference in modal propagation constants in the probe beam would result in a large modulation of the energy transfer. A first investigation of this modulation was already done numerically for cw-control and probe beams [22], where it was observed, that the fast oscillations are connected to the mode beating period. This hypothesis has hereby been verified analytically, although the beating period of the probe modes is responsible for the modulation, and not that of the control beam modes as claimed in [22].

Fig. 2
figure 2

Results of the analytical model of the all-optical cw power transfer between two higher-order modes as a function of the coupling strength \(\kappa\) as well as mismatch between the propagation constants of probe and control beam \(\Delta\). For reference, the corresponding peak power in a 50 \(\mu \hbox {m}\) graded-index fiber is given alongside \(\kappa\) for a control beam wavelength of 1030 nm. In a the conversion efficiency \(\eta\) is displayed color coded with contour lines indicating a set of fixed conversion efficiencies while in b the conversion length after which maximum conversion is achieved is shown on a logarithmic scale

In order to verify the developed theoretical description of all-optical mode conversion, the calculated results can be compared to numerical simulations, e.g., using a scalar three-dimensional beam propagation method as demonstrated in reference [22]. The result of such a comparison is shown in Fig. 1: For the numerical simulation, two cross-polarized beams and a resulting nonlinear index of \(n_2^{\mathrm{XPM}}=\tfrac{2}{3}\cdot n_2\) has been chosen. The resulting evolution of the electrical field is shown decomposed into the modes of the waveguide, the black solid line showing the evolution of the \(\text {LP}_{11}\)-mode and the black dashed line the evolution of the fundamental mode. The modal powers derived from the OLPG coupled mode theory (OLPG-CMT) are displayed as the blue and red curve, respectively. Excellent agreement between the analytical model and the numerical simulation was found with a normalized rms deviation

$$\begin{aligned} \rho _{\mathrm{rms}}=\frac{1}{N\cdot \bar{P_4}} \sum _{i=1}^N \left| P_{4,\text {sim}}(z_i)-P_{4,\text {CMT}}(z_i) \right| ^2 \end{aligned}$$
(6)

of \(\rho _{\mathrm{rms}}=0.03\), with \(\bar{P_4}\) being the mean value of the modal power in the waveguide. Hence, the OLPG-CMT does represent a straightforward tool to estimate the power transfer for a given control beam power and phase mismatch.

3 Results and discussion

The developed analytical model can be applied to calculate the conversion efficiency \(\eta\) that one has to expect from an OLPG at a given control beam peak power and phase mismatch without the need for time-consuming numerical simulations. In order to provide a very general result that can be applied to virtually any waveguide, the conversion efficiency as well as the conversion length \(z_c\), after which the conversion maximum is reached, is displayed in Fig. 2 a and b as a function of the coupling strength \(\kappa\) as well as the mismatch of the difference of the propagation constants \(\Delta\) of the involved modes. The individual peak powers that will be necessary in a particular waveguide depend on the effective modal areas as well as the nonlinear coefficient of the used waveguide material [see Eqs. (20) and (31)]. For a graded-index fiber with a core diameter of \(50\,\mu \hbox {m}\) and a control beam center wavelength of 1030 nm, the corresponding peak powers in each control beam mode are displayed alongside \(\kappa\) as a reference. As was discussed in the last section, the conversion efficiency displayed in Fig. 2a drops when a phase mismatch \(\Delta\) occurs between control and probe beam, and only by increasing the coupling strength a high efficiency can be regained reaching \(\eta =100\,\%\) in the limit of an infinite coupling strength. A different behavior is found for the conversion length shown in Fig. 2b which is reduced to shorter waveguide lengths with increasing phase mismatch as well as coupling strength. The displayed false color plots allow to quickly estimate the expected efficiency of a conversion process and the needed length of the waveguide for planning an experiment.

Fig. 3
figure 3

a The normalized root-mean-square error \(\rho _{\mathrm{rms}}\) of the cw-model in comparison with full numerical simulations using the MM-NLSE as a function of occurring group delay \(\Delta \tau\) relative to the control pulse duration \(\text {t}_{\mathrm{c}}\). Furthermore, in b and c a detailed analysis of the power transfer into the higher-order mode along the waveguide is shown for different pulse duration ratios \(\text {t}_{\mathrm{c}}/\text {t}_{\mathrm{p}}\) between control and probe pulse (marked with gray circles in a))

However, note that the presented results are only exact in the cw-limit. In order to explore the usefulness of the presented results for pulsed applications, we conducted a systematic study by comparing the results of the theoretical model with full simulations of the MM-NLSE in a graded-index fiber and by taking into account the nonlinear interactions between the pulses as well as dispersive broadening and walk-off (\(\Delta = 7.3/\hbox {m}\), \(\kappa = 20.3/\hbox {m}\)—corresponding to a peak power of 40 kW in each control mode at a center wavelength of 1030 nm and a probe beam wavelength of 1300 nm). In general, a deviation of the results can be expected from two main contributions: (1) If probe and control pulses are of similar duration or the control pulses are even shorter than the probe pulses, the grating (coupling) strength \(\kappa\) will vary at different temporal positions of the probe pulse. When the cw-model would predict a complete conversion at a certain peak power of the control pulse, e.g., some remaining energy in the pulse wings was observed in numerical simulations [18]. (2) Even for probe pulses considerably shorter than the control pulses, meaning the coupling strength is nearly uniform, the cw-model can fail if strong pulse walk-off occurs. In this case, the coupling strength, while being uniform for the probe pulse at each waveguide position, will decrease along the waveguide.

In order to study both of these cases, we varied the ratio \(t_\mathrm{c}/t_\mathrm{p}\) of the control and probe pulse duration from equal durations up to five times longer control than probe pulses. Furthermore, for a fixed ratio of pulse durations we varied the occurring group delay relative to the control pulse duration ranging from 10 % up to over 100 %. This was achieved by scaling the control pulse durations (\(t_\mathrm{c}\) from 500 fs to 5 ps) while keeping the fiber length and parameters and thereby the occurring absolute group delay (\(\Delta \tau\) about 521 fs) fixed. For each combination of pulse duration ratios and relative group delay, we then calculated the normalized rms deviation \(\rho _{\mathrm{rms}}\) of the power in the higher-order mode in comparison with the powers calculated with the cw-model assuming cw-control powers equal to the peak power of the control pulses. The resulting rms deviations are displayed in Fig. 3a. As it was expected, the error of the cw-model dropped with increasing pulse duration ratio between the control and the probe pulses and saturated toward a ratio of five. Furthermore, the influence of the group delay became significant when surpassing half the control pulse duration. The mean squared error for short probe pulses at relative group delays bigger than 70–80 % was found to be equivalent to the error of equal probe to control pulse durations at low group delay.

In Fig. 3b, c, the predicted power in the higher-order mode is plotted for the cw-model and compared to the result of a numerical simulation during one conversion cycle for parameters indicated by gray circles in Fig. 3a to give an impression of the occurring deviation between both models. In Fig. 3b, the control pulses are assumed to be of the same duration as the probe pulses resulting in a relatively large rms error of more than \(\rho _{\mathrm{rms}}=0.4\). The large rms error resulted from the cw-model (depicted by the dashed black line) clearly overestimating the occurring mode conversion in comparison with the full numerical simulation (red solid line) for the reasons given above. In contrast, the conversion along the fiber for five times shorter probe than control pulses is displayed in Fig. 3c and shows very good agreement corresponding to an rms error of less than \(\rho _{\mathrm{rms}}=0.04\).

The points discussed above show the benefits as well as limitations of the developed model and allow to classify the results given by the model: For control pulses significantly longer than the probe pulse in combination with a negligible group delay within one conversion length, the results of the analytical model can directly be used as a very good approximation for the expected conversion efficiency. When these requirements are not met, the results from the analytical model can still be used to get an understanding of the expected order of magnitude of the conversion efficiency in dependence of the fiber length and control pulse power. As the analytical model allows to calculate results within milliseconds, these results can be used as appropriate predictions to drastically lower the amount of initially free parameters for a more exact numerical simulation or even for an experimental verification.

4 Conclusion

In this manuscript, an analytical description of optically induced long-period gratings was presented connecting for the first time the material representation with an ultrashort pulse induced four-wave mixing interaction. It was successfully demonstrated that the all-optical mode conversion process can be described either in a material representation, in which the local modes are altered and the resulting energy transfer is described by coupled mode equations, or alternatively in a four-wave mixing representation, derived from the coupled nonlinear Schrödinger equations. Both approaches led to the same analytical description of the energy transfer that is supposed to exactly describe the process in the long-pulse or quasi-continuous-wave regime. The developed model was compared to numerical simulations that have already been published elsewhere, describing the conversion process in the material representation [22] as well as in the four-wave mixing representation [18]. Excellent agreement was found for comparison to continuous-wave simulations, while modeling nonlinear transverse mode conversion with ultrashort pulses was shown to depend on occurring pulse walk-off, leading to a deviation from the analytical model when the walk-off is becoming significant. Despite these shortcomings, the presented analytical model allowed to gain insight into the physical dependence of the conversion efficiency on the control beam power and the normalized phase mismatch. Due to the analytical nature of the description of the conversion process, the developed model allows to directly estimate (within milliseconds) the maximum achievable mode conversion for any combination of control beam and waveguide parameters (e.g., control power and center wavelengths, initial phases) without having to rely on numerical simulations (calculation time of hours for a single initial parameter set) and, hence, saving valuable computation time.