Introduction

Over the last two decades there has been an increase in the use of computerized metabolic systems. In addition, there has been an increase in the amount and types computerized systems available to researchers. The traditional method, and considered to be the “gold standard” of gas exchange measurements, is the Douglas bag (DB) method. While the DB method is considered the gold standard, it also has its own limitations, which provide a source of error that is difficult to account for. Limitations associated with the DB measurements include such things as: (1) the DB bag material being slightly permeable to room air allowing for the air collected in the DB to be mixed with external air, (2) difficulty in removing all the air from the DB and air leaking out during the removal process, (3) contents of the DB is representative of the entire sampling period, and (4) the DB method is time consuming and requires careful analysis by researchers to reduce errors while measuring the content of the bag. Over time with the advent of computerized systems and the development of new technology, the use of these systems has made gas exchange measurements easier and less time consuming, without compromising the accuracy provided by the DB method. In addition, as the technology becomes more sophisticated there is a movement towards using portable gas exchange systems for the purpose of obtaining real life measurements rather than laboratory measurements. For a comprehensive review of portable metabolic systems and their application in sport science see Meyer et al. (2005).

The ParvoMedics TrueOne 2400 (TrueOne 2400) [previously called the ParvoMedics TrueMax 2400 (TrueMax 2400)] is a computerized metabolic cart for laboratory use. The TrueOne 2400, which uses a mixing chamber and is a non breath-by-breath system, has been shown to be an accurate device for the measurement of inspiratory and expiratory gas exchange variables (Bassett et al. 2001), however, its reliability has not been evaluated. Specifically, Bassett et al. (2001) found that compared to the DB the TrueMax 2400 provided highly accurate measures of minute ventilation (V E) [mean error = −0.2 l min−1, 95% predicition interval (95% PI), −1.5, 0.09], oxygen consumption (VO2) (mean error = −0.02 l min−1, 95% PI, −0.12, 0.08), and carbon dioxide production (VCO2) (mean error = −0.01 l min−1, 95% PI, −0.07, 0.06).

The MedGraphics VO2000 (VO2000) is a portable metabolic system, which to our knowledge, does not have a published validation on the accuracy and reliability of the device in comparison to DB measurements. There have been two studies that have compared the VO2000 and the MedGraphics CPX/D (CPX/D) (Byard and Dengel 2002; Olson et al. 2003). Olson et al. (2003) reported that there was no difference between the VO2000 and CPX/D for the measurement of V E, VO2, and VCO2 at rest. This study used a low-flow pneumotach designed by MedGraphics specifically for use during resting measures and is not recommended for exercise testing. A separate study by Byard and Dengel (2002) showed that there was no significant difference between the VO2000 and the CPX/D for the measurement of V E, VO2, and VCO2 during cycle ergometry. While these studies have shown the VO2000 to give similar values as the CPX/D they should be interpreted with caution because the CPX/D has been shown to underestimate VO2 by approximately 11% in athletes compared to the DB method during cycling activity (Gore et al. 2003). This raises concern about the application of the VO2000 and the true validity of this portable device.

With new technology and devices emerging in the market it is important that researchers are aware of the accuracy and reliability of these devices. Therefore, the purpose of this study was to compare both the TrueOne 2400 and VO2000 against the DB method for the measurement of metabolic variables. In doing so the accuracy and test re-test reliability of the devices will be determined.

Methods

Subjects

Ten healthy male subjects from the Ithaca community volunteered to participate in the study. The procedures were reviewed and approved by the University Committee on Human Subjects at Cornell University, before the start of the study. Each participant signed a written informed consent and completed a physical activity readiness questionnaire (PAR-Q) before participating in the study. Participants were excluded from the study if they had any contraindications to exercise. Physical characteristics [mean (SD)] of the participants were; age, 20 (1.7) years; height, 179.3 (11.6) cm; weight, 78.2 (12.7) kg; and BMI, 24.2 (2.0).

Equipment

The VO2000 is a portable metabolic system that is worn on the chest with a harness and weighs 1.57 kg (includes metabolic unit, battery pack, harness, heart rate monitor, face mask, and breathing valve). The device employs a patented flow meter, which uses a proportional sampling valve and a 3-breath average for the measurement of VO2, VCO2, and V E. Specifically, the VO2000 uses a galvanic fuel cell for the oxygen analyzer (range 0–96%), a non-dispersive infrared carbon dioxide analyzer (range 0–10%), and proprietary software. The flow device is a bi-directional differential pressure preVent™ pneumotach. According to the company it has a low flow pneumotach (range 2–30 1 min−1) with an accuracy of ± 3% of the absolute volume and a dead space of 8 ml for resting measures and an exercise pneumotach (range 20–200 1 min−1) an accuracy of ± 3% of the absolute volume, and a dead space of 39 ml. The flow device is connected to a neoprene facemask (with a silicone adapter) that covers the subject’s mouth and nose. The dead space of the pneumotach and face mask is 39 ml, which is the value used when calculating the gas fractions. The flow device is connected to the portable metabolic unit with a flexible rubber hose and gas is sampled using a patented gas drying sample circuit contained within the portable unit. In addition, the system can also provide heart rate (HR), however, the current software program does not allow for the gas fractions (FEO2 and FECO2) to be viewed. As with other portable systems the data can be transmitted to a computer or stored in the system and downloaded to a computer at a later time. Prior to testing, the VO2000 was calibrated according to the manufacturer’s instructions. This consists of performing an auto-calibration routine that is run through the software, which uses a proprietary room air calibration of the oxygen and carbon dioxide analyzers as well as an auto-calibration procedure for the pneumotach. The current software program utilized with the VO2000 does not allow for a two-point calibration using known gas volumes or manual calibration of the pneumotach with a calibration syringe.

The TrueOne 2400 computerized metabolic measurement system uses a Hans Rudolf 3813 (Kansas city, MO) pneumotachometer to measure ventilation. A limitation to this pneumotachometer is that it is non-linear at flow rates less than 80 l min−1. Thus, Yeh and colleagues developed an algorithm to correct the linearity at low flow rates (Yeh et al. 1982, 1987). The TrueOne 2400 is a mixing chamber system, that uses a paramagnetic oxygen analyzer (range 0–25%) and an infrared, single beam, single wave-length carbon dioxide analyzer (range 0–10%). Prior to each testing session the gas analyzers were calibrated according to the manufacturer’s recommendations. This consists of a room air auto-calibration routine and a two-point gas calibration with a single gas tank (15.09% O2, 6.01% CO2). In addition, the flow meter was calibrated using a 3.000 l Hans Rudolf 5530 series syringe. This involves a five stroke calibration using different flow rates for each stroke.

Douglas bag collections were made simultaneously with the TrueOne 2400 measurements as described elsewhere (Bassett et al. 2001). Briefly, a three-way Warren E. Collins t-stop-cock and DB (200 l) were placed in series with the mixing chamber used by the TrueOne 2400 metabolic unit. Specifically, the t-stop-cock was attached to the air port on the back of the mixing chamber and the DB was attached to the t-stop-cock. During the last two minutes of each stage the t-stop-cock was turned so the air was directed into the DB for collection of the expired air. At the end of each collection period, the gas fractions from the DB were measured (over a two minute sampling period) using a ParvoMedics TrueMax 2400 with a Paramagnetic oxygen analyzer (range 0–25%) and an infrared, single beam, single wave-length carbon dioxide analyzer (range 0–10%). This was a different ParvoMedics Truemax 2400 from the one used during the measurement of the metabolic variables at rest and exercise. The expired volume was determined by pushing the remaining collected air from the DB into a 120 l Tissot gasometer (Warren E. Collins, Braintree, MA). Corrections in volume were made for the volume of air removed for gas analysis. The TrueOne 2400 system samples expired air at the back of the mixing chamber with a flow rate of 0.36 l min−1 (flow rate determined by flow meter built into the TrueOne 2400 metabolic cart), therefore there was a total of 0.72 l of air removed during the two minute sampling period. In addition, the flow rate of the TrueMax 2400 used to analyze the DB air sample (1 min sample period) had a flow rate of 0.28 l min−1. Thus, the total correction for the volume of air removed was 1.0 l. Prior to each test the gas analyzers were calibrated using room air and a single gas tank (15.09% O2, 6.01% CO2).

Experimental design

Initially, pilot tests were performed on four subjects to examine the effects of simultaneous gas exchange measurements with the DB in series with the VO2000. This was done by having the subject ride at 50 and 100 W with only the VO2000 connected for 5 min, then a two-way Hans-Rudolph valve (2700 series), hose and DB were connected in series with the VO2000 breathing valve and the subject cycled another 5 min. The addition of the Hans-Rudolph valve caused an increase in the VO2 values reported by the VO2000. Specifically, at 50 W the VO2 changed from 0.93 to 1.24 l min−1 when the Hans-Rudolph valve was added and at 100 W the VO2 changed from 1.36 to 1.59 l min−1 when the Hans-Rudolph valve was added. Therefore, the study was designed to obtain non-simultaneous measurements of the gas exchange variables from the VO2000 and DB, while simultaneous measurement between the TrueOne 2400 and DB were obtained.

For the comparison of the metabolic systems all subjects performed two trials within 48 h at the same time of day. Prior to testing, subjects were fitted with a mouthpiece, nose clip, and headgear which were used with the TrueOne 2400 and DB measurements. They were also fitted with a neoprene face mask which would be used with the VO2000. A Polar HR strap was also fitted around the chest of the subject so a criterion for HR could be obtained with a Polar A1 HR monitor (Polar Electro Oy, Kempele, Finland). Each trial consisted of an 11 min rest period, followed by cycling on a Lode Corival (Groningen, The Netherlands) electronically braked cycle ergometer for 10 min at 50, 100, and 150 W, and 11 min at 200 and 250 W.

During one trial, the TrueOne 2400 and DB setup was connected to the subject for the first 6 min of rest, first 5 min at 50, 100, and 150 W and first 6 min at 200 and 250 W, after which the equipment was switched out and the VO2000 was connected to the subject for the remaining time of each stage. During the second trial the order in which the equipment was connected to the subjects was reversed. The order in which the equipment was connected to the subjects was randomized so that on day one half of the subjects were connected to the TrueOne 2400 and DB first and half of the subjects were connected to the VO2000 first. For each device, the average of last 2 min of each stage, were used to calculate the gas exchange variables. After the subjects completed the 150 W stage they were given a 5 min rest period so the gas analyzers could be recalibrated and checked for analyzer drift. The last 2 min of each measurement period for the TrueOne 2400 and the VO2000 were averaged and compared with the DB 2 min collection period.

Statistical analyses

Statistical analyses were carried out using SPSS version 13.0 for Windows (SPSS Inc., Chicago, IL). For all analyses, an alpha level of 0.05 was used to indicate statistical significance. All values are reported as mean ± standard deviation (SD). To examine the reliability of the devices several measurements were used: (1) A two-way repeated measures ANOVA (day of testing × power output) for each device for V E, VO2, and VCO2, was used to examine if there was a difference between days. (2) Pearson correlation coefficients (r) was used to compare the values obtained for V E, VO2, and VCO2 on the two separate days for rest and all work rates combined. (3) Bland–Altman plots were used to graphically show the variability in the individual error scores between days for rest and all work rates combined (Bland and Altman 1986). This allowed for the mean error score and the 95% prediction interval to be shown. Devices that are reliable will display a tight prediction interval around zero. Data points below zero signify an overestimation, while data points above zero signify an underestimation. (4) The data showed heteroscedasticity (i.e., as the work rate increased the difference between days increased), therefore the data was logarithmically (natural) transformed, as suggested by Atkinson and Nevill (1998). The logarithmic transformed data was then used to determine the mean bias and random error component (ratio limits of agreement), which more accurately describes the variability across the range of values reported. (5) Lastly, the coefficient of variation (CV) between the day-to-day measures was determined for V E, VO2, and VCO2 for rest and all work rates combined.

We choose several statistical tests to determine the reliability of the devices as suggested by Atkinson and Nevill (1998). While we used logarithmically transformed data to determine the mean bias and random error, we used the untransformed data for the graphical presentation in the Bland–Altman plots so readers could more easily understand the values. Since the use of ratio limits of agreement is fairly new to sport science studies it is important to have an understanding about how to interpret the results. The ratio limits of agreement can be interpreted using the following example; the ratio limits of agreement for V E, in the current study, was 0.98 ×/÷ 1.38, meaning that there was a 2% bias and for any individual from the population, two tests on different days will differ by no more than 38%. It is also important to point out that addition and subtraction (i.e., ± ) with the natural scale is equivalent to multiplying and dividing (i.e., ×/÷ ) using the ratio scale. Thus, when using the ratio scale the mean bias is multiplied and divided by the 95% prediction interval. For the example above this would give a mean bias of 2% and the 95% prediction interval would be 0.71, 1.35%. For a further discussion on the use of ratio limits of agreement in sport science see Atkins and Nevill (1998).

To examine if there was a device order effect we examined the ANOVA performed for the reliability testing. The results from the ANOVA showed no difference in V E, VO2, or VCO2 for any of the devices, thus we concluded that there was not an effect from the order of device use. For the remaining analysis the mean metabolic variables were averaged between the two days. Subsequently, two-way ANOVAs (device × power output) were used to compare differences among the devices for each dependent variable. Additionally, because only nine subjects were able to complete 200 W and six subjects completed 250 W, these work rates were not included in this analysis, and a separate ANOVA was used to compare the three devices at 200 and 250 W, separately. Pairwise comparisons with Bonferroni adjustments were performed to locate significant differences between devices, when necessary. In addition, modified Bland–Altman plots were used to graphically show the variability in the individual error scores (DB variable minus device variable) (Bland and Altman 1986).

Results

None of the devices showed significant differences between days for V E (DB, F 5,49 = 1.833, P = 0.124; TrueOne 2400, F 5,49 = 1.840, P = 0.122; VO2000, F 5,49 = 0.428, P = 0.827), VO2 (DB, F 5,49 = 1.286, P = 0.285; TrueOne 2400, F 5,49 = 1.837, P = 0.123; VO2000, F 5,49 = 1.600, P = 0.178), or VCO2 (DB, F 5,49 = 1.459, P = 0.220; TrueOne 2400, F 5,49 = 1.313, P = 0.274; VO2000, F 5,49 = 0.716, P = 0.614). Figure 1a–i shows the Bland–Altman plots depicting the individual difference scores (difference between days) for V E, VO2, and VCO2 for each device. Additionally, the reliability between days for V E, VO2, and VCO2 for each device determined by the Pearson correlation coefficient, ratio limits of agreement (determined from the logarithmically transformed data), and CV are shown in Table 1. All three devices appear to have good test re-test reliability between days for V E, VO2, and VCO2 according to the Pearson correlation coefficient. However, as the Bland–Altman plots show and is confirmed by the ratio limits of agreement and CV, the reliability of measuring V E between days for the three devices is similar, but the VO2000 was less reliable compared to the DB and TrueOne 2400 for the measurement of VO2 and VCO2.

Fig. 1
figure 1figure 1

Bland–Altman plots of the error scores (between testing days) for minute ventilation (V E), oxygen consumption (VO2) and carbon dioxide production (VCO2). a Douglas bag (DB) V E, b ParvoMedics TrueOne 2400 (TrueOne 2400) V E, c MedGraphics VO2000 (VO2000) V E, d DB VO2, e TrueOne 2400 VO2, f VO2000 VO2, g DB VCO2, h TrueOne 2400 VCO2, i VO2000 VCO2. Solid lines represent the mean difference; dashed lines represent the 95% prediction interval. V E, VO2 and VCO2 are reported in l min−1 and STPD

Table 1 Reliability of the Douglas bag (DB), ParvoMedics TrueOne 2400 (TrueOne 2400), and Medgraphics VO2000 (VO2000) for the measurement of minute ventilation (V E), oxygen consumption (VO2), carbon dioxide production (VCO2). Pearson correlation coefficient (r), ratio limits of agreement, mean error and 95% prediction interval [mean error (95% PI)] and coefficient of variation (CV)

Table 2 shows the physiological responses measured by each device and Fig. 2 shows the percent over- and/or underestimation for the TrueOne 2400 and VO2000 versus the DB for these physiological responses. The TrueOne 2400 metabolic cart was not significantly different from DB at any work rate for V E, VO2, or VCO2 (P ≥ 0.05). Mean FEO2 however, was significantly lower at rest and 50 W and mean FECO2 was significantly higher at rest, 50 and 100 W (P < 0.05). The VO2 measured by the VO2000 was significantly different from DB and TrueOne 2400 at all work rates except 50 W, the V E was significantly different from DB and TrueOne 2400 at 50 and 100 W, the VCO2 was significantly different from DB and TrueOne 2400 at rest, 200, and 250 W. There were no differences in the respiratory rate (RR) between the TrueOne 2400 and VO2000. Lastly, the HR for the VO2000 was significantly different from the Polar HR monitor at rest and 200 W, while there were no significant differences between the Polar HR monitor and the TrueOne 2400 HR. In addition, when there were no statistically significant differences between the Polar HR monitor and the VO2000 the HR was still off on average by 10 to 18 bpm, indicating that there may be a malfunction with the device for measuring HR and further investigation is warrented.

Table 2 Physiological responses measured during rest and five work rates on a cycle ergometer using DB, a ParvoMedics TrueOne2400 metabolic cart and VO2000 portable metabolic system
Fig. 2
figure 2

Percent of the DB a ventilation (V E), b oxygen consumption (VO2), c carbon dioxide production (VCO2), and d gas fractions for the TrueOne 2400 and VO2000 during rest and five work rates on the cycle ergometer. Values are means with standard deviation bars. V E, VO2 and VCO2 are reported as STPD and FEO2 and FECO2 are the mean O2 and CO2 fractions during the expiratory phase

Figures 3a–d and 4a–d are the Bland–Altman plots, showing the individual difference scores (DB minus computerized system) for the TrueOne 2400 and VO2000. Overall, there was close agreement between the DB method and the TrueOne 2400 metabolic system for all variables. The plots for the VO2000 show that it tends to underestimate VO2 and VCO2 at the lighter work rates and overestimate VO2 and VCO2 at the higher work rates.

Fig. 3
figure 3

Bland–Altman plots of the error scores (DB minus computerized system) for minute ventilation (V E), fraction of expired oxygen (FEO2), and fraction of expired carbon dioxide (FECO2). a TrueOne 2400 V E, b VO2000 V E, c TrueOne 2400 FEO2, and d TrueOne 2400 FECO2. Solid line represents the mean difference; dashed line represents the 95% prediction interval. V E is reported in l min−1 and STPD and FEO2 and FECO2 are the mean O2 and CO2 fractions during the expiratory phase

Fig. 4
figure 4

Bland–Altman plots of the error scores (DB minus computerized system) for oxygen consumption (VO2) and carbon dioxide production (VCO2). a TrueOne 2400 VO2, b VO2000 VO2, C) TrueOne 2400 VCO2, and d VO2000 VCO2. Solid line represents the mean difference; dashed line represents the 95% prediction interval. VO2 and VCO2 are reported in l min−1 and STPD

Discussion

The main purpose of this study was to test the accuracy and reliability of the TrueOne 2400 metabolic cart and the VO2000 portable metabolic system. Under ideal circumstances the metabolic measurements would have been made simultaneously (Bigard and Guezennec 1995; Lucia et al. 1993; Peel and Utsey 1993), however, the addition of the Hans-Rudolf valve to the breathing valve of the VO2000 altered the VO2 values preventing us from making simultaneous measurements. Therefore, we tested the participants on two separate days (within 48 h) randomizing the order in which the devices were tested each day. The testing on separate days has been shown to result in a small (3–4%) variation in the day-to-day measurements of VO2 (Stuart et al. 1981). Thus, the non-simultaneous measurements could have contributed to some of the differences seen between days and between the DB and VO2000, and should be considered a limitation of this study.

Reliability testing

The reliability results from this study should be interpreted with caution due to a few limiting factors; (1) the inclusion of resting measures into the analysis increased the variability, and (2) we had a small subject pool. Given the limitations we still found differences between devices. The results from this study indicate that all three devices have acceptable reliability for the measurement of V E based on the various statistical tests (e.g., CV < 10 for all devices). The DB and TrueOne 2400 had acceptable reliability for the measurement of VO2 (ratio limits of agreement, DB = 0.98 ×/÷ 1.24, TrueOne 2400 = 0.98 ×/÷ 1.19; CV, DB = 5.3%, TrueOne 2400 = 4.7%) and VCO2 (ratio limits of agreement, DB = 0.99 ×/÷ 1.29, TrueOne 2400 = 0.99 ×/÷ 1.26; CV, DB = 6.0%, TrueOne 2400 = 5.7%), however, the VO2000 showed poor reliability for the measurement of VO2 (ratio limits of agreement, 0.94 ×/÷ 3.30; CV, 14.2%) and VCO2 (ratio limits of agreement, 0.90 ×/÷ 5.57; CV, 15.8%).

There are only a few studies available that have examined the reliability of metabolic systems. Table 3 provides several reliability studies performed on portable and stationary metabolic systems. There are only a few studies which have examined the reliability of other metabolic systems. In addition, most studies only examined one measure of reliability (i.e., intraclass correlation, coefficient of variation, or mean bias and 95% prediction interval), which makes it difficult to make comparisons between all the studies. Carter and Jeukendrup (Carter and Jeukendrup 2002) examined the DB method, Oxycon Alpha metabolic system, Pulmolab, metabolic system, and Oxycon Pro metabolic system for reliability during cycling at 100 and 150 W. Of importance in this study is that they also examine the reliability of the DB method, which are similar to the results found in the current study. However, they only examined two work rates, while we had five work rates and rest. The reliability of the TrueOne 2400, for measuring V E, VO2, and VCO2, is similar to other systems currently available, which have been shown to have good reliability (Carter and Jeukendrup 2002; Meyer et al. 2001). While the reliability of the VO2000 for measuring V E is similar to other metabolic systems (Carter and Jeukendrup 2002; Larsson et al. 2004; Lucia et al. 1993; Meyer et al. 2001), the reliability for measuring VO2 and VCO2 is less than previously examined systems.

Table 3 Previous reliability studies of portable and stationary metabolic systems

Accuracy

Table 4 provides results from previous validation studies examining portable and stationary systems which used the DB method as the criterion method, which will make for an easier comparison between studies. The mean bias and 95% prediction intervals for the TrueOne 2400 in the current study are similar to those reported previously by Bassett et al. (2001). These results should be interpreted with caution, since the DB gas fractions were measured with a TrueMax 2400, which uses the same analyzers as the TrueOne 2400 which was being validated. However, our results are in agreement with those of Bassett et al. (2001), which used a Beckman LB2 CO2 analyzer and an applied electrochemistry S3A O2 analyzer to measure the gas fractions obtained from the DB measurements.

Table 4 Previous validation studies comparing portable and stationary metabolic systems to the DB method

The VO2000 underestimated the DB values for V E, VO2, and VCO2 by 12, 53, and 46%, respectively, during rest. At work rates between 50 and 250 W the VO2000, on average, underestimated the DB values for V E by 5% and overestimated VO2 and VCO2 by 8 and 4%, respectively. The mean bias and 95% limits of agreement between the DB and VO2000 are 1.54 ± 6.93, −0.11 ± 0.43, and −0.06 ± 0.38 l min−1 for V E, VO2, and VCO2, respectively. Numerous studies have examined the accuracy of various portable metabolic systems. For example, McLaughlin et al. (2001) compared the DB and Cosmed K4b2 portable system using non-simultaneous measurements during cycle ergometry (50–100 W) and found the mean bias and 95% limits of agreement to be 2.0 ± 7.0 l min−1 (V E), 0.1 ± 0.23 l min−1 (VO2) and −0.1 ± 0.35 l min−1 (VCO2). Given the available information the VO2000 has similar accuracy to other portable metabolic systems for the measurement of V E, however, it is slightly less accurate for measuring VO2 and VCO2 when compared to the DB. Given that the direction and magnitude of the V E error generally translates into a similar error in VO2 (i.e., each 1% underestimation in V E translates into a 1% underestimation in VO2) (Withers et al. 2000), the results from this indicate that V E can only partially explain the errors seen in the VO2 measurements. However, the current software available with the VO2000 does not have the option of reporting gas fractions, thus determining where the remainder error in VO2 is coming from is not possible at this time.

The resting measurement taken in the current study using the VO2000 should be interpreted with caution. The differences seen at rest partly influence the high variability seen in the day-to-day metabolic measurements. For resting measurements MedGraphics provides a separate low flow pneumotach that is smaller in size (i.e., has a smaller dead space), however, we used the exercise pneumotach for all testing. We assume that during resting measurements the low flow pneumotach is recommended to increase the pressure inside the breathing valve so the correct measurements can be made. At rest the V E measurements were similar to the DB values, however, in several instances there were VO2 values of < 0.1 l min−1 indicating a problem exists with the detection of the expired gas using the larger pneumotach at rest. While we did not test the low flow pneumotach at rest, this raises concerns over the current setup of the device. Under many circumstances it is necessary to obtain resting measurements before starting an exercise test. In addition, the resting measurements provide a means for the researcher to check the system before an exercise test to ensure it is working properly. It is not always practical to obtain resting measurements with one pneumotach and then change pneumotachs and recalibrate before beginning the exercise test.

A limitation to the use of the VO2000 is that the current software does not provide the means to perform a manual calibration. The current software design allows only for an auto calibration, which uses current room conditions and proprietary software to perform the gas and flow calibrations. Therefore, it is possible that the device could be out of calibration and the user would not know this because there is no way to verify these values. In the current study, we relied on the auto calibration procedure because this is how the device will be used by researchers, and we did not feel it should be manipulated in a way that it could not be used in an actual research setting.

The findings from this study confirm previous results that the TrueOne 2400 is an accurate device for the measurement of V E, VO2, and VCO2, as well as extending those findings to show that it has good reliability. These findings also suggest that the VO2000 portable metabolic system may be less reliable for the measurement of VO2 and VCO2 than other systems. In addition, it tends to underestimate VO2 at rest and overestimate VO2 at most cycling work rates. Future work should be done to confirm these results and examine possible sources of error.