Main

Over the past 25 years, thousands of knockout mice have been developed worldwide, and the mouse is increasingly used as a model of choice for investigating the genetic basis of diseases and potential drug targets, with refined methodology and techniques1. Conventional knockout mice were created by gene targeting in embryonic stem cells from the 129-mouse strain, followed by backcrossing to the C57BL/6 strain. However, problems with mixed genetic background were soon identified2,3, leading to the publication of recommendations for controlling the genetic background of mutant mice4. It is interesting to note that although the importance of nomenclature was emphasized, no particular attention was paid to the existence of different substrains of C57BL/6 mice at that time. The development of the C57BL/6 strain goes back to 1921 when the strain was created by Clarence Cook Little and initially maintained at the Jackson Laboratory (C57BL/6J). The substrain C57BL/6N was established in 1951 after transfer of the mice to the NIH. Mice from both parental colonies have then been moved to several large mouse vendors over the world (such as Charles River Laboratories, Envigo, formerly known as Harlan, Taconic, and Janvier Labs), where they are still maintained. The C57BL/6J (B6J) mouse was the first strain to have its genomic sequence published, and this strain is considered a gold standard in many research areas. However, to overcome the problems associated with a mixed genetic background and to facilitate the production of mutant mice, embryonic stem cell lines from the C57BL/6N (B6N) strain were established; the International Mouse Phenotyping Consortium is currently creating mutant mice for large-scale phenotyping in a C57BL/6N background5,6,7,8.

It is well known that the phenotype of mutant mice can depend on their genetic background9 and this holds true also for the C57BL/6 substrains10. Although genotypic and phenotypic differences between the substrains of C57BL/6 mice are well documented (Table 1)11,12,13, the fact is that too many publications do not indicate the precise and accurate origin of the animals used14. Moreover, it may well be that the researchers are unaware or ignore this information. For instance, according to a recent survey carried out in Finnish research institutions, 39.5% of respondents were either not aware of genetic differences between these substrains or did not consider it important. Interestingly, among those who knew about these differences, still 26% of the respondents were not able to name the exact strain they were using. Among others, this factor can certainly be one of the major issues contributing to the current reproducibility crisis in basic research15,16. Moreover, concerns have been expressed that the low quality of basic and preclinical studies, not only in behavioral studies17, may have a direct relationship with the failures in clinical trials18,19.

Table 1 Substrains used for testing various behavioral phenotypes

For a long time, it has been suggested that the use of inbred (genetically homogeneous) strains increases the power of the study by reducing the variability between the subjects20. However, it is often overlooked that controlled genetic variation should be present in the test population. This can be achieved by using a battery of inbred strains in a factorial design in which both treatment and strain are varied simultaneously21. Moreover, in 1997, a report from the Banbury conference already recommended the back-crossing of mutant mice into at least two inbred strains to allow testing of the mutants in congenic lines but also in F1 hybrids derived from those4. However, the current dominating trend to keep the mice only in the C57BL/6 background is tremendously limiting the external validity and generalization of many findings22,23. Preference for using inbred strains, based on expected low inter-individual variability has been challenged by a recent report showing that trait variability is not larger in outbred stocks than it is in inbred strains and that therefore,outbred mice can be successfully employed to enhance reproducibility and replicability24. Similar concerns about low genetic and environmental diversity have been expressed for human genome-wide association studies25,26.

Several recommendations have been made for improving the design, analysis and reporting of preclinical studies involving animal models27,28,29,30,31,32. One suggested strategy for improving replicability has been to implement a rigorous standardization of experimental methods and conditions (in addition to standardized genetic backgrounds). However, the efficacy of environmental standardization has been extensively debated and questioned33,34,35. Indeed, rigorous standardization can lead to idiosyncratic and unreproducible findings and revised strategies are needed for experimental design36,37,38,39. Moreover, it is clear that environmental manipulations are an essential factor in disease modeling40,41. In addition, problematic issues with mouse (behavioral) phenotyping have been regularly highlighted in the headlines of major scientific journals42,43,44. Consequently, researchers need to improve the reproducibility and translatability of animal work, notably by adopting the current paradigm shift in the conduction and interpretation of animal experiments45,46.

The substrains of C57BL/6 mice are genetically very close to each other, but mutations (few already identified and probably much more unknown) may lead to substantial phenotypic differences47,48,49. The retinal degeneration 8 (rd8) mutation, which makes the mice nearly blind by the age of 8 weeks, is a common feature for all B6N substrains50. C57BL/6JCrl and C57BL/6JRj (but not C57BL/6JRccHsd) mice carry a deletion in the Nnt gene encoding nicotinamide nucleotide transhydrogenase51. This mutation has been associated with impaired control of glucose homeostasis and reduced insulin secretion. We did not include in our panel the C57BL/6JOlaHsd strain, which is known to carry a deletion in the Scna gene52. These differences are caused by genetic drifts occurring in any independent mouse-breeding colony. While genetic drift can be controlled by careful colony management practices, it cannot be stopped completely53,54. On the other hand, despite many efforts for standardizing the operating procedures, several results of studies on the behavioral phenotyping of B6 substrains have revealed conflicting results between laboratories55.

Based on the available information and controversies, we set up a project for addressing the differences between the substrains of C57BL/6 mice from different sources. To that end, we compared the C57BL/6J and C57BL/6N female mice from three common vendors in Europe: Charles River Laboratories (CRL, Germany), Envigo (ENV, The Netherlands) and Janvier Labs (JAN, France). Only female mice were tested because we wanted to focus on strain differences. Female mice produce highly reliable data and are thus suitable for basic exploratory studies56,57,58. In addition, we wanted to avoid possible problems with escalating aggression in male mice, which is quite common in C57BL/6 strain, especially after transport and re-location of adolescent or adult animals59. The mice were tested in a battery of behavioral tests assessing exploratory and anxiety-like behavior, sociability, sensorimotor gating, fear conditioning, circadian activity. Similar batteries are commonly applied for the characterization of mutant mice60,61,62.

Results

Female mice of two C57BL/6 substrains from three different vendors—Charles River (CRL), Envigo (ENV) and Janvier (JAN)—arrived in our laboratory at the age of 7 weeks; they were allowed to adapt for 17–18 days before testing began at the age of 10 weeks (Fig. 1a). The experiment was carried out in three batches. At arrival, substantial differences between the transport boxes from different vendors were noted. There was abundant, though different nesting material available in the shipments by CRL and JAN, whereas no such enrichment was included in ENV’s boxes (Fig. 1b).

Fig. 1: Experimental design, transport boxes and mouse body weight.
figure 1

a, Timeline of the experiment and behavioral testing. b, Characteristics of the transport boxes from the vendors and home cage in the destination. Animals were shipped in groups of six animals in respective boxes, and then randomly assigned to the individually ventilated cages in groups of three animals per cage. Notable differences were observed in the type and amount of nesting material provided by vendors. c, The body weight of the mice, measured during the course of the experiment. CIRC, circadian activity; CRL, Charles River; ENV, Envigo; EPM, elevated plus-maze; FC, fear conditioning; JAN, Janvier Lab; LD, light-dark box; NEST, nest building; OF, open field; PPI, pre-pulse inhibition; SIH, stress-induced hyperthermia; SOC, social approach. Filled circles, B6N; open circles, B6J. *P < 0.05, **P < 0.01 between the B6N and B6J mice from the same vendor.

The summary of three-way ANOVA results for the main parameters of all behavioral tests can be found in Table 2. The body weight of the mice was measured weekly and significant strain by vendor interaction was revealed: the B6N mice from ENV were much smaller compared to B6N from the other vendors. Moreover, the B6N from ENV weighed less than the B6J from ENV, whereas the B6N from CRL and JAN were heavier than B6J mice from the respective supplier (Fig. 1c).

Table 2 Summary of statistics (three-way ANOVA) for the main outcome variables

The elevated plus-maze (EPM) and light-dark (LD) box are commonly used for measuring exploratory activity and anxiety-like behavior in mice. The significant main effects of the vendor established in these tests suggest that CRL mice displayed increased avoidance of exposed areas (Fig. 2a,b,d) as compared to the mice from other vendors. Moreover, the significant main effects of the strain for parameters measured in the LD box suggest that the B6N mice showed enhanced anxiety-like behavior (avoidance of brightly illuminated compartment) in comparison to the B6J mice. The locomotor activity (total distance travelled during the test) was not different between the strains in the EPM. However, activity was reduced in the B6N mice as compared to the B6J in the LD test, mainly due to the large difference between the ENV substrains (Fig. 2c). Next, we tested the spontaneous activity and exploration in the open field arena, where the B6N mice displayed significantly reduced activity as compared to the B6J mice (Fig. 2e). This difference was largest between the substrains from ENV. In addition, the number of rearings was significantly different between the substrains with B6J > B6N (Fig. 2f). Although the proportion of distance travelled in the center of the arena did not differ between the substrains and vendors, the time spent there was longer for the B6J mice as compared to B6N (Fig. 2g).

Fig. 2: Elevated plus-maze, light-dark box and open field.
figure 2

a, Proportion of time and distance in open arms and central platform of the elevated plus-maze. b, Latency to enter the light compartment in light-dark test. c, Activity (distance travelled) during 10 min of testing in the light-dark box. d, Proportion of time, distance and exploratory rearings in the light compartment of the light-dark box. e, Distance travelled during 30 min in the open field arena. f, Number of rearings during 30 min in the open field arena. g, The proportion of activity (distance, time and rearings) in the center area of the open field. Filled circles, black bars B6N; open circles, grey bars B6J. *P < 0.05, ** P < 0.01 between the B6N and B6J mice from the same vendor.

During the social approach test, the B6J mice from ENV were more active than the B6N from the same vendor as shown by the distance travelled (Fig. 3a). Overall, B6J mice spent more time in the interaction zone with unfamiliar mouse (Fig. 3b,c). However, this difference was more pronounced and significant between the substrains from CRL and JAN, whereas it was virtually absent in the mice from ENV. Acoustic startle reflex was elevated in the B6J mice from ENV and JAN, compared to the B6N strain from the respective vendor, whereas an opposite effect was found in the CRL mice (Fig. 3d). Pre-pulse inhibition was enhanced in the B6N mice from CRL and JAN compared with B6J mice from the respective vendor, while the substrains from ENV did not differ (Fig. 3e). The B6N mice from CRL and JAN reacted to the 0.6 mA foot-shock more vigorously than the B6J mice, as suggested by higher velocity during the administration of foot-shock (Fig. 3f). There was no difference between the groups in the freezing behavior at baseline, before conditioning. However, 24 h after conditioning, the B6N mice displayed enhanced contextual fear (freezing) as compared to the B6J mice. When placed in the novel context, not previously associated with delivery of foot-shock, the B6J mice displayed increased level of freezing compared with the B6N mice. Significant interaction between the strain and vendor in duration of freezing during the presentation of conditioned stimulus in the novel context indicated enhanced freezing in the B6N mice from ENV. An opposite effect (reduced freezing) was revealed in the B6N mice from CRL and JAN when compared to the respective groups of B6J mice (Fig. 3g).

Fig. 3: Social approach, acoustic startle and pre-pulse inhibition, fear conditioning.
figure 3

a, Distance travelled during 10 min test of social approach. b, Time spent in the social interaction zone during the social approach test. c, Preference for the cylinder with social stimulus, calculated as proportion of total time spent in exploring two cylinders. d, Magnitude of the startle response to 120 dB acoustic stimulus (40 ms white noise). e, Pre-pulse inhibition of the acoustic startle response at increasing pre-pulse intensities. f, Reaction to foot-shock in fear conditioning experiment, expressed as a mean velocity during two foot-shock applications (2 s each). g, Percentage of freezing time in different phases of fear conditioning experiment: baseline (2 min before first application of conditioned stimulus); context (3 min) ; novel context (2 min in the altered arena); cue (2 min of tone presentation in novel context). Filled circles, black bars B6N; open circles, grey bars, B6J. *P < 0.05, **P < 0.01 between the B6N and B6J mice from the same vendor.

Monitoring of circadian activity in single-housed mice over 7 days revealed a large difference between the B6N and B6J mice from ENV (Fig. 4a). Interestingly, the mice reacted differently to the individual housing: the body weight was increased in the B6N mice, whereas no change or even reduction was found in the B6J mice (Fig. 4b). However, there was no difference between the groups in the nest building abilities (average score 3 after the first night and 4 after the second night). The stress-induced hyperthermia (increased rectal temperature after two consecutive measurements) was stronger in the B6J mice (Fig. 4c).

Fig. 4: Recording of circadian activity and stress-induced hyperthermia in individually caged mice.
figure 4

a, Circadian activity (average counts per hour). b, The changes in body weight, shown as a difference in percentage between the end and start of single housing for measuring circadian activity. c, Stress-induced hyperthermia – difference between two consecutive (interval 10 min) measurement of rectal temperature. Filled circles, black bars, B6N; open circles, grey bars B6J. *P < 0.05, **P < 0.01 between the B6N and B6J mice from the same vendor.

Discussion

In the present study, we examined the basic behavioral profile of C57BL/6J and C57BL/6N female mice, obtained from three different vendors (Charles River, Envigo, Janvier). Many of the previously known and published differences between the substrains of C57BL/6 mice were confirmed or expanded by the experiments presented here47,55,63,64,65,66,67,68,69. However, the effect of vendor has mostly been neglected in the previous studies. The general expectation seems to be that the differences between B6N and B6J substrains are universal and therefore, the substrains for comparison have often been ordered from different breeders (Table 1). Thus, to the best of our knowledge, this is the first systematic and simultaneous comparison of common B6N and B6J substrains from three different vendors, carried out in one laboratory environment. We found significant effects of strain, vendor or interaction between these factors in the majority of outcomes.

Transportation of the animals from vendors to the research institutions can be a significant stressor for the animals. There are certain rules for security and guaranteed well-being of the animals throughout the journey70, which may take several days (in our case 48–72 h from door to door). Therefore, it was interesting to find that the transport boxes from Envigo did not contain any nesting material. The nest material has become a mandatory part for structuring the rodent cages and lack of nesting material may substantially change the physiology and behavior of the animals71,72,73. However, this particular vendor justifies the lack of nesting material in transport containers by the fact that for welfare reasons the animals need to be seen through a viewing window during the transportation (Envigo, personal communication). We also found that the body weight of the B6N mice from CRL and JAN was higher than that of B6J mice from the respective vendors, whereas a large opposite effect was found between the substrains ordered from ENV. These differences are in line with the information provided in technical sheets by the vendors. However, there seems to even be differences between mice from the same substrain and vendor but bred at different locations; for instance C57BL6/NHsd female mice are about 2–3 g smaller in the Netherlands than in the United States (https://www.envigo.com/products-services/research-models-services/models/research-models/mice/inbred/c57bl-6-inbred-mice/c57bl-6nhsd/). Overall, such differences between strains and substrains, vendors and locations — that is genetic and environmental factors — can provide an interesting and reasonable resource for heterogenization of the population74.

Testing exploratory activity of the mice by elevated plus-maze, light-dark box and open field revealed that the B6N mice displayed enhanced anxiety-like behavior as compared to the B6J. In addition, anxiety-like behavior was higher in the mice from CRL as compared to the other two vendors. Although widely used, the conventional tests for unconditioned anxiety have often produced contradictory findings, which are dependent on the laboratory environment33,75. Nevertheless, our finding of reduced anxiety and higher activity shown by the B6J mice is in line with several other reports47,69,76,77. In addition to the enhanced anxiety-like behavioral profile, the B6N mice showed less interest towards a novel mouse (social approach) and similar data have been previously published by others groups47,78. However, it has to be noted that in our experiment the difference in social interaction was robust between the substrains from the CRL and JAN, but not from ENV.

Augmented startle reflex and reduced pre-pulse inhibition in the B6J mice compared with B6N mice has been shown previously47,55,79. In our panel, the startle was enhanced in the B6J mice from ENV and JAN, but reduced in the B6J from CRL when compared to the respective B6N substrains. Moreover, pre-pulse inhibition was reduced in the B6J-CRL and B6J-JAN mice, but no difference was found between the substrains from ENV. Fear (freezing) in the environment (context) associated with the foot-shock has been consistently shown to be reduced in the B6J mice as compared to the B6N63,64,67,68,76, and this was the case also in our study. However, this difference was limited to the substrains from CRL and ENV, and not detected in mice from JAN.

For measuring the circadian activity, nest building and stress-induced hyperthermia, the mice were housed individually for 8 days. Other groups have shown that home cage activity during the dark period is lower in B6N mice as compared to B6J64,69. In our study, this difference was seen only between the substrains ordered from ENV. Social isolation or separation of group-housed mice can be a stressful experience, especially for female mice80. The acute effect of the isolation stress may be seen in the changes of body weight81. In our study, weight gain was detected in the B6N mice, whereas no change or even decrease was found in the B6J mice. This finding suggests that the metabolic response and coping in stressful situations may be different between the C57BL/6 substrains. The B6N mice have been shown to be more vulnerable to the chronic treatment with corticosterone which is used as a model of stress69. They also displayed higher anxiety-like behavior and reduced social interaction. In contrast, the stress-induced hyperthermia was increased in the B6J mice. Based on these findings, it could be speculated that individual housing is a less stressful experience for the less social B6N mice. Thus, different substrains of C57BL/6 mice could be useful for elucidating the quantitative trait loci involved in the stress-related behavior. At the same time, further studies are warranted for characterization of these substrains under different conditions imposing stress on animals.

In general, the phenotypic differences between inbred strains have been shown to be stable and robust82. However, for obtaining such replicability (external validity), certain quality of study design and conduct (internal validity) is needed83,84. The phenotypic differences between the genetically close substrains of the C57BL/6 mice, coupled with the effect of vendor highlight the possible problems associated with choosing the background for genetically modified mice, interpretation of the results and reproducibility of the findings. Obviously, these cautions are not limited only to the C57BL/6 mice85,86,87. Moreover, further confounds can be caused by the local breeding schemes at the research institutions88. Nevertheless, genetically defined strains are and continue to be instrumental for elucidating the genetic basis of disease89, although a recent report show that the trait variability is not larger in outbred stocks than it is in inbred strains24. Our data suggest that more emphasis and attention must be paid on the precise and accurate nomenclature when publishing research findings and designing future experiments. Heterogenization of the study samples, multi-laboratory experiments and refined statistical models have been proposed to be effective means for improving the reproducibility36,74,90. Therefore, deliberate variation of the mouse strains and substrains can be recommended as another way for improving the study design and addressing the issues of poor replicability.

Methods

Ethical statement

The animal experiments were performed according to the EU legislation harmonized with Finnish legislation and have been approved by the National Animal Experiment Board of Finland (ESAVI/10165/04.10.07/2016).

Animals

Altogether 108 female mice from 6 strains were used for this study: C57BL/6JRccHsd and C57BL/6NHsd (Envigo, Horst, The Netherlands); C57BL/6JCrl (original Jackson strain, stock: 000664) and C57BL/6NCrl (Charles River Laboratories, Sulzfeld, Germany); C57BL/6JRj and C57BL/6NRj (Janvier Labs, Le Genest-Saint-Isle, France). The number of animals was determined by power analysis with medium effect size and power of 0.80 (G*Power version 3.1.9.2). The mice were ordered and tested in three batches (6 mice per strain in one batch, 18 mice per strain in total). The duration of transportation was 48 h from Charles River and 72 h from Envigo and Janvier. The second and third batches arrived two weeks and twelve weeks respectively after the first one (the batches arrived in January, February and April of 2018, respectively). All mice were shipped at the age of 7 weeks. After arrival, the mice were housed in groups of three in the individually ventilated cage system (Mouse IVC Green Line, Tecniplast, Italy; cage dimensions 391 × 199 × 160 mm, floor area 501 cm2; air inlet and outlet valves located in the cage lid, on top of the cage; the rate of air change was set at 75 times per hour with air speed at animal level max. 0.05 m/s; half of the cage covered by a wire bar food hopper). On the next day after arrival, the mice were marked by ear punching and the first body weights were recorded. The animals were maintained in the specific pathogen free animal facility, in a large colony room together with hundreds of other mouse cages. Cage enrichment was provided by bedding (aspen chips 5 × 5 × 1 mm, 4HP, Tapvei, Estonia), nesting material (equal amount of aspen strips, PM90L, Tapvei, Estonia and Sizzle Nest, Datesand Group, UK) and an aspen brick (100 × 20 × 20 mm, Tapvei). Food (Global Diet 2916 C, pellet 12 mm, Envigo) and water (filtered and UV-irradiated) were available ad libitum. Room temperature was 22 °C ± 2 °C and relative humidity 50% ± 15%. The light cycle was 12:12 light:dark (lights were on between 6.00 and 18.00). The cages were cleaned once per week and animals were weighed before being moved to the new cage. The mice were checked for microphthalmia, fur and whiskers (barbering) without any notable findings. Behavioral testing was started when the animals were 10 weeks old (after 17–18 days of adaptation). One mouse (C57BL/6NCrl from the second batch) was discarded after the first test day (elevated plus-maze) due to an accident (the mouse escaped and was injured).

Behavioral tests

For all conventional tests (carried out during the light phase, between 9.00 and 16.00), the animals were moved from the colony room to the testing rooms in the same animal facility at least 30 min before the beginning of the experiment. The testing order of the cages and animals was counterbalanced and randomized for each experiment and the experimenter was blinded regarding the genotypes. Behavioral testing of individual batches was carried out in the following order (Fig. 1a): day 1, elevated plus-maze (9.00–15.00); day 3, light-dark box (9.00–11.00); day 4, open field (9.00–13.00); day 5, sociability (9.00–15.00); days 8–9, pre-pulse inhibition of acoustic startle (9.00–16.00); day 11, fear conditioning (training, 9.00–11.00); day 12, fear conditioning (contextual memory 9.00–11.00; cued memory 11.00–13.00); day 15, individual housing and start of recording circadian activity; days 16–17, assessment of nest building (at 9.00); day 23, end of recording the circadian activity; day 24, stress-induced hyperthermia (9.00–10.00).

Elevated plus-maze (EPM)

The maze93 consisted of two open arms (30 × 5 cm) and two enclosed arms (30 × 5 cm, inner diameter) connected by central platform (5 × 5 cm), elevated to 40 cm above the floor. The floor of each arm was light grey and the closed arms had transparent (15 cm high) side walls and end walls. The illumination level in all arms was ~150 lx. The mouse was placed in the center of the maze facing one of the enclosed arms and observed for 5 min. The latency to the first open arm entry, number of open and closed arm entries (four paw criterion), distance travelled and the time spent in different zones of the maze were measured (tracking by Ethovision XT 10.0, Wagenigen, The Netherlands). The number of fecal boli was counted at the end of the trial.

Light-dark exploration (LD)

LD-test 94 was done 48 h after the EPM. The test was carried out in the square open field arena (30 × 30 × 20 cm, Med Associates, St. Albans, VT) equipped with infrared light sensors detecting horizontal and vertical activity. The dark insert (non-transparent for visible light) was used to divide the arena into two halves, an opening (a door with a width of 5.5 cm and height of 7.0 cm) in the wall of the insert allowed animal’s free movement from one compartment to another. Illumination in the center of the light compartment was ~550 lx. The animal was placed in the dark compartment and allowed to explore the arena for 10 min. Latency to enter the light side, distance travelled, number of rearings, and time spent in different compartments were recorded by the program (Activity Monitor, version 5.8). The number of fecal boli was counted at the end of trial.

Open field (OF)

OF-test was performed 24 h after the LD-test. The same arena and monitoring system used for LD test were used for OF-test, but without dark insert, illumination of the arena was ~150 lx. Animals were released in the corner of the arena and monitored for 30 min. For analysis, the arena was divided into center and periphery (peripheral zone defined as a 6 cm wide corridor along the wall).

Social approach (SOC)

SOC-test was done 24 h after the OF. The equipment and method for testing sociability was a combination based on two approaches, the 3-compartment test95 and social interaction arena96. The large cage (dimensions 48.0 × 37.5 × 21.0 cm) contained two transparent and perforated cylinders (diameter 9 cm, height 15 cm) which were fixed at the center of the opposite short walls (distance between the cylinders 30 cm). One of the cylinders (position counterbalanced between subjects) contained a stimulus mouse (unfamiliar age-matched and sex-matched NMRI mouse (Envigo), kept in groups and previously adapted to confinement in the cylinder) whereas another was empty. The test was performed under reduced light conditions (~30 lx). The test mouse was released in the center of the arena and movement of the mouse was recorded by Ethovision XT 10.0 during 10 min. Total distance travelled and the time spent in exploring the cylinders were measured (the interaction zone was defined as a 5 cm corridor around the cylinders; the ratio between the time exploring the social vs empty cylinder was calculated as an index of social preference).

Pre-pulse inhibition of acoustic startle reflex (PPI)

The animal was enclosed in a transparent plastic tube (inner diameter of 4.5 cm and length of 8.0 cm). The tube was placed and fixed on the piezoelectric platform, inside a sound-attenuating startle chamber (Med Associates, St. Albans, VT) with a background white noise of 65 dB and left undisturbed for 5 min. Testing was performed in 12 blocks of 5 trials and five trial types were applied. One trial type was a 40-ms, 120-dB white noise acoustic startle stimulus (SS) presented alone. In the remaining four trial types, the startle stimulus was preceded by a 20-ms acoustic pre-pulse stimulus (PPS) with the white noise bursting at the level of 68, 72, 76 or 80 dB. The delay between the onset of PPS and SS was 100 ms; for controlling the baseline movement there was a null-period of 200 ms included before presentation of acoustic stimuli. The first and twelfth block consisted of SS-only trials. In remaining blocks, the SS and PPS + SS trials were presented in pseudorandomized order, such that each trial type was presented once within a block of 5 trials. The inter-trial interval ranged between 10 s and 20 s. The startle response was recorded for 65 ms starting with the onset of the startle stimulus. The maximum startle amplitude recorded during the 65-ms sampling window was used as the dependent variable. The startle response was averaged over 10 trials from blocks 2–11 for each trial type. The pre-pulse inhibition for each PPS was calculated by using the following formula: 100-[(startle response on PPS + SS trials/startle response on SS trials)×100].

Fear conditioning (FC)

The experiments were carried out employing a computer-controlled fear conditioning system (TSE, Bad Homburg, Germany). Training was performed in a transparent acrylic arena (23 × 23 × 35 cm) within a constantly illuminated (~100 lx) conditioning chamber. A loudspeaker provided a constant, white background noise (68 dB) for 120 s, followed by a 10 kHz tone — conditioned stimulus (CS), 76 dB, pulsed at 5 Hz —for 30 s. The tone was terminated by a foot-shock — unconditioned stimulus (US) of 0.6 mA for 2 s, constant current — delivered through a stainless steel floor grid (rod diameter 4 mm, distance 10 mm). Two CS–US pairings were separated by a 30 s pause, and the trial ended 30 s after the second foot-shock. Contextual memory was tested 24 h after the training. The animals were returned to the conditioning arena and the total time of freezing (defined as an absence of any movements for more than 3 s) was measured by infrared light beams (scanned continuously with a frequency of 10 Hz) during 3 min. Memory for the CS (tone) was tested 2 h later in a novel context. The new context was an acrylic box of similar size with black opaque walls and a smooth floor. A layer of wood chips (standard bedding material) under the floor provided a novel odor to the chamber. After 120 s of free exploration in the novel context, the CS was applied during next 120 s and freezing was measured as above.

Circadian activity of single housed mice

The InfraMot system (TSE, Germany) was used for recording the activity of single-housed animals by heat sensor. The system consisted of 24 units. Therefore, two animals from each original home cage were randomly assigned for testing the circadian activity. The mice were housed in Type II open cages (267 × 207 × 140 mm) with bedding (aspen chips, Tapvei) and nesting material (see the next paragraph). The sensor assembly was mounted on top of a cage lid. Food and water were available ad libitum. The recording continued for 7 days.

Nest building

Nest building was assessed after the first and second night of accommodation in single cages of the InfraMot system. One hour before the dark phase, one piece (5 cm2, ~2.5 g) of compressed cotton fiber (Nestlets, Ancare, Bellmore, NY) was added into the cage. The next morning (~16 h later), the nests were assessed by visual inspection on a rating scale of 1–5 (1 = Nestlets > 90% intact, 2 = Nestlets 50–90% intact, 3 = Nestlets mostly shredded but no identifiable nest site, 4 = identifiable but flat nest, 5 = crater-shaped nest)97. Assessment was repeated 24 h later.

Stress-induced hyperthermia (SIH)

This test was carried out after 7 days of activity recording in singly housed animals98. Briefly, a mouse was removed from the cage and rectal temperature was measured. Then the body weight was measured and an animal was immediately returned to the same cage. After 10 min, the measurement of rectal temperature was repeated. Difference between these two measurements was defined as a stress-induced hyperthermia.

Statistics

Data were analyzed by using a three-way ANOVA model with batch (1,2,3), strain (B6N, B6J) and vendor (CRL, ENV, JAN) as between-subject factors. Within-subject factors (time and repeated measurements) were added when appropriate. The significance threshold was set at 0.05 and the results of the analysis are presented in Table 2. Newman-Keuls post-hoc comparisons were applied for further analysis if significant main effects or interactions were revealed. Software packages GraphPad Prism for Windows (v. 7.03) and STATISTICA (v. 12, StatSoft, Inc.) were used for the statistical analysis and for drawing the figures. The data on the figures are shown as mean values with error bars for standard error of mean. The datasets generated and analyzed during the current study are available from the corresponding author on reasonable request.