Introduction

Biodegradation has been a great issue for oil exploration in many prolific basins worldwide due to a systematic decrease in paraffin content and an increase in oil density, sulfur content, acidity and viscosity (Connan 1984; Larter et al. 2003; Larter et al. 2006), with negative economic consequences for oil production and refining operations. Living organisms are able to survive in oil field environments if certain physical characteristics and chemical composition are favorable. Microorganisms inhabiting these environments may destroy hydrocarbons and other oil components yielding altered and undesirable “heavy oil” (Head et al. 2003). Although the in situ metabolic activities of petroleum microorganisms are poorly understood, a wide variety of bacterial and archaeal groups, including strictly anaerobic thermophilic and hyperthermophilic microorganisms as well as facultative anaerobic and thermotolerant ones, have been detected ex-situ or isolated from oil reservoirs (Magot et al. 2000).

Culture-based techniques have traditionally been the primary tools used for studying the microbiology of different environments (Chandler et al. 1997). These approaches, while extremely important for understanding the physiological potential of isolated organisms, do not necessarily provide comprehensive information on the composition of microbial communities (van Hamme et al. 2003). Reliable characterization of microbial communities became possible due to the introduction of molecular methods in microbial ecology studies, which have proven to be effective for evaluation of microbial communities in diverse environments such as ocean (DeLong 1992), soil and sediments (Takami et al. 1997) and hot springs (Hugenholtz et al. 1998). The recovery of DNA directly from environmental samples, with subsequent amplification by polymerase chain reaction (PCR), cloning and sequencing of 16S rRNA genes, has unraveled an astonishing microbial diversity in a variety of habitats (Pace 1996; Hugenholtz et al. 1998). In addition, the cultivation-independent approach allows the detection and phylogenetic identification of fastidious or as yet uncultured organisms (Juck et al. 2000). In petroleum microbiology, the use of molecular techniques in community analyses of oil fields is still recent and scarce (Voordouw et al. 1996; Orphan et al. 2000; Magot et al. 2000; Bonch-Osmolovskaya et al. 2003), despite their huge potential to help culture-based techniques to gather important and complementary sets of data.

The purpose of this research was to evaluate the bacterial taxonomic diversity in oil samples with different degrees of biodegradation and recovered from different reservoirs within a petroleum field of the Campos Basin, Brazil, by using 16S rRNA gene library construction associated with enrichment culture techniques, in order to provide a detailed characterization of the bacterial community and contribute to improve the understanding of biodegradation processes in this environment.

Material and methods

Study area, geological and geochemical background

The Campos Basin covers an area of about 100,000 km2, mostly offshore to the 3,400 m isobath, and is the most prolific oil basin in Brazil, holding approximately 85% of total Brazilian oil reserves and 40% of total gas reserves (Jahnert et al. 1998). The origin of the Campos Basin is related to the Early Cretaceous break up of the Gondwana supercontinent and its tectono-sedimentary evolution can be divided in three main phases: rift where the saline lacustrine source rocks were deposited (Mohiak et al. 1989), transitional that holds the most complex lithology of the sedimentary pile, including siliciclastics, marls, carbonates and evaporites, and drift that began with the sedimentation of shallow carbonates followed by deep water sediments and other shallowing upward sequence (Asmus and Ponte 1973; Ponte and Asmus 1978; Guardado et al. 1989; Rangel et al. 1994).

Bulk, isotopic and molecular analyses (Medium Pressure Liquid Chromatography (MPLC); Gas Chromatography (GC); biomarkers; δ13C) of the studied oils have shown diagnostic characteristics of a common source e.g., gammacerane/H30 <0.5; H35/H34 <1.0; very similar ratios among C24 tetracyclic and tricyclic terpanes; similar levels of thermal maturity given by C29 steranes isomers; and different levels of biodegradation (Table 1). The oil samples used in this study were chosen from 73 producing wells within the field, and are representative examples of a non-biodegraded oil (Fig. 1A) produced from the coquina sequence of the Lagoa Feia Formation (≈3,070–3,240 m, reservoir temperature ≈85°C) and of a biodegraded oil (biodegradation level of 5–6 according to Peters and Moldowan 1993; Fig. 1B) produced from sandstones of the Carapebus Formation (≈2,066 m, reservoir temperature ≈51°C).

Table 1 Geochemical characteristics of oil samples
Fig. 1
figure 1

(A) (1) Gas chromatogram trace of the non-biodegraded oil sample showing low molecular weight n- and isoparaffins, and the relatively low pristane (Pr)/nC17 and phytane (Ph)/nC18. (2) Mass chromatogram (m/z 191) with no significant detection of 25-norhopane. (B) (1) Gas chromatogram (whole oil) of the biodegraded oil sample. Notice the absence of n- and isoparaffins and the significant “hump” representing the unresolved complex mixture (UCM). (2) Mass chromatogram (m/z 191) with a high relative proportion of C29 25-Nor-17α(H)hopane indicated by the “star”. The absence of n- and isoparaffins and the presence of 25-norhopanes characterize the levels of biodegradation 5–6 of Peters and Moldowan (1993)

Sampling

Oil samples were obtained in May of 2002, from a production petroleum platform in the Campos Basin (Brazil). Samples were collected in triplicate using 2 l sterilized Schott bottles, which were completely filled with the samples in order to prevent oxygen influx. Samples were kept on ice during transportation to the laboratory, and stored at 4°C until DNA extraction.

Aerobic microbial enrichments

The oil samples were homogenized using an equal volume (20 ml) of Tween 80 in sterile Falcon tubes and incubated at 40°C for 10 min. Aliquots of 2.5 ml of the oil: tween mixture were inoculated in Falcon tubes containing each 12.5 ml of one of the selected culture media: Nutrient broth (NB Difco), Tripticase soy broth (TSB Difco), Glucose–yeast extract–malt extract (GYM Difco), Marine broth (MB Difco), Acinetobacter broth (Difco) and mineral salt medium Bushnell Hass broth (BHM Difco). The inoculated tubes were incubated at 28°C on a rotary shaker at 150 rpm for 15 days.

Isolation and identification of microbial cultures

Aliquots (100 μl) of each microbial enrichment from crude oils were taken and plated over the surface of the same medium used for enrichment, cited above, supplemented with 2% agar. The plates were inoculated in duplicate and incubated at 28°C and 50°C, under aerobic conditions, up to 10 days. Colony growth was monitored every 2 days. Isolates obtained were further streaked onto the surface of fresh plates (same medium used for enrichment supplemented with 2% agar) and checked for purity prior to subsequent molecular analyses.

DNA extraction

Genomic DNA from the pure cultures was obtained according to the protocol described by Young and Blakesley (1991). DNA extraction from oil samples was carried out by using the QIAmp DNA Stool Mini Kit (Qiagen, Valencia, CA), as previously described by Tanaka et al. (2002) with minor modifications, which included the use of an aliquot of oil correspondent to 2 g of initial sample amount and 30 μl of elution buffer (QIAmp DNA Stool Mini Kit) for final elution of total DNA. Before community DNA extraction, oil samples were heated at 80°C for 30 min in order to reduce the viscosity and allow easier sample manipulation.

PCR amplification

For 16S rRNA gene library construction, amplification was performed from total community DNA isolated from oil samples by using the bacterial primer set 27f and 1100r (Lane 1991). Twenty five microliter reaction mixtures contained 5 μl of total DNA, 2 U of Taq polymerase (GE Healthcare), 0.2 mM of dNTP mix, and 0.4 μM of each primer, in 1× Taq buffer. The PCR amplifications were done using 10 cycles of 1 min at 94°C, 30 s at 60°C, decreasing 0.5°C each cycle, and 3 min at 72°C, followed by another 10 cycles of 1 min at 94°C, 30 s at 56°C and 3 min at 72°C, in an Eppendorf thermal cycler (Eppendorf). A second PCR reaction was performed using 5 μl of the first PCR products as template and the same primers and conditions specified for the first PCR reaction. To overcome known biases introduced by PCR amplification, some procedures reported in previous studies were adopted. The number of PCR cycles was limited to 20 to avoid a reduction in the microbial diversity represented in the libraries (von Wintzingerode et al. 1997; Schink 1997) and to minimize alterations in community composition caused by “bias 1:1” (Watanabe et al. 2002), which can result in higher equitability in the final amplified community. In addition, the products of five PCR amplification replicates were pooled prior to cloning in order to minimize PCR drift (Friedrich et al. 1997; Munson et al. 1997).

Cloning, ARDRA analysis and sequencing

A 16S rRNA gene library was constructed for each of two replicate oil samples: NBD library originated from the non-biodegraded oil, and HBD library originated from the high-biodegraded oil. Amplicons were pooled from five reactions (∼500 ng), purified using GFX TM PCR-DNA and gel band purification kit (GE Healthcare) and cloned using the pGEM-T cloning vector kit, according to the manufacturer’s instructions (Promega, Madison, Wisc.). E. coli DH5α cells were transformed by electroporation, using the equipment Cellject Uno (Hybaid). White colonies were selected and screened for the presence of insert by means of PCR amplification using the vector-based primers M13 forward and reverse. All the insert-containing clones were submitted to Amplified Ribosomal DNA Restriction Analysis (ARDRA) by digestion with the enzymes HaeIII and RsaI (GE Healthcare), at 37°C for 4 h. The fragments were separated by electrophoresis in 2.5% agarose gels and the different ribotypes were defined by a combination of the banding patterns obtained with the two enzymes. One clone representative of each distinct ribotype was selected for DNA sequencing and phylogenetic affiliation. Plasmid DNA was isolated from clones using the protocol described by Sambrook et al. (1989) and purified with the GFX PCR DNA and gel band purification kit (GE Healthcare) prior to sequencing. The 16S rRNA gene sequences were determined from plasmid DNA by using the M13 forward and reverse primers and the DYEnamic ET Dye Terminator Cycle Sequencing Kit for the automated MegaBace 500 system (GE Healthcare), according to the manufacturer’s recommendations.

Identification of bacterial isolates

PCR amplification of 16S rRNA gene fragments from pure cultures and subsequent direct sequencing in a MegaBace 500 system (GE Healthcare), aiming at the phylogenetic affiliation of the isolates, were performed as described by Rainey et al. (1996).

Phylogenetic analysis

Partial 16S rRNA gene sequences obtained from clones or pure cultures using forward and reverse primers were assembled in a contiguous sequence using the phred/Phrap/CONSED program (http://www.phrap.org/phredphrapconsed.html). Identification was achieved by comparing the contiguous 16S rRNA gene sequences (∼1000 bp in length) generated for clones and isolates with 16S rRNA sequence data from reference and type strains, as well as environmental clones, available at the public databases Genbank (http://www.ncbi.nem.nih.gov) and the RDP (Ribosomal Database Project, Wiscosin, USA, http://www.cme.msu.edu/RDP/html/index.html) by using the BLASTN and the RDP sequence match routines. The sequences were aligned using the CLUSTAL X program (Thompson et al. 1994) and analyzed using PAUP (version 4.0 beta 10) (Swofford 2000). Evolutionary distances were derived from sequence-pair dissimilarities calculated as implemented in PAUP, using Kimura 2P substitution model (Kimura 1980). The phylogenetic reconstruction was done using the neighbor-joining (NJ) algorithm, with bootstrap values calculated from 1,000 replicate runs, using the routines included in the PAUP software.

Statistical analyses based on ARDRA data

Randomization analyses based on ARDRA data were performed using the independent sampling algorithm, implemented in the EcoSim program (Gotelli and Entsminger 2003). This module of EcoSim provides a computer-sampling algorithm, in which a specified number of individuals (i.e., sub-samples of the original one) are randomly drawn, without replacement, from the total community sample, creating pseudo-communities. Various sub-sample sizes were specified and for each sub-sample size, 1000 pseudo-communities were drawn from the original one. Data from these were used to calculate diversity indices such as Shannon–Wiener index (diversity), Dominance Index and PIE Hulbert’s index (evenness), as well as richness. The resultant values were used to calculate the mean and the 95% confidence intervals for each index at each sub-sample size specified. This sampling algorithm ensures that the estimators of diversity parameters for the different sub-sample efforts are independent of one another.

Nucleotide sequence accession numbers

The nucleotide sequences determined in this study for the environmental clones were deposited at the Genbank database under the accession numbers: AY862487–AY862494 (Fig. 3). For the bacterial isolates, the accession numbers are: DQ413027–DQ413031.

Results

NBD and HBD 16S rRNA gene libraries

PCR amplification of 16S rRNA gene fragments from community DNA obtained from oil samples was successful only when applying a second round of amplification, using the PCR products from a first PCR as template. A total of 137 and 129 clones were obtained for the NBD and HBD oil libraries, respectively. ARDRA analysis showed 46 different restriction patterns (ribotypes) for the NBD library and 35 for the HBD library (Table 2). One representative of each ribotype, considering both libraries, was sequenced. BLASTN analysis of clone sequences revealed that they belonged to nine different genera (Table 2, Fig. 2). One clone representative of each taxonomic group, considering both NBD and HBD libraries, was used for the phylogenetic tree reconstruction (bo22C02, bo23A06, bo13A01, bo13D04, bo22B07, bo22A03b, bo13A02b and bo13C09) (Fig. 3), except for the clone representing the genus Leuconostoc (bo12F03), which was not included in the phylogenetic analysis due to its low quality 16S rRNA gene sequence.

Table 2 Closest relatives of bacterial ribotypes from NBD and HBD 16S rRNA libraries
Fig. 2
figure 2

Richness and frequency of ARDRA patterns found in the NBD and HBD 16S rDNA libraries. Different segments in the bars indicate distinct ARDRA patterns (ribotypes)

Fig. 3
figure 3

Neighbor-joining tree showing the phylogenetic relationships between the environmental clones derived from NBD and HBD libraries and related organisms

Sequence analysis revealed that 16S rRNA gene clones were related to the genera Acinetobacter, Arcobacter, Bacillus, Halanaerobium, Leuconostoc, Marinobacter, Propionibacterium, Streptococcus and Streptomyces, encompassing four main groups within the Domain Bacteria: Actinobacteria (high-G + C gram-positive bacteria), Firmicutes (low-G + C gram-positive bacteria), gamma-Proteobacteria and epsilon-Proteobacteria (Fig. 3). Arcobacter, Halanaerobium, Marinobacter and Propionibacterium were found in both NBD and HBD libraries, whereas Leuconostoc and Streptomyces were found only in the NBD library, and Acinetobacter, Bacillus and Streptococcus only in the HBD library (Fig. 2).

Ribotypes related to the genus Halanaerobium were the most abundant in the NBD library (61.31%), followed by the ribotypes related to Arcobacter (22.62%) (Table 2, Fig. 2). On the other hand, the genera Propionibacterium and Acinetobacter were the most abundant amongst sequences recovered from the biodegraded oil (HBD), representing 50.38% and 31.58% of the clones in the library, respectively (Table 2, Fig. 2).

Amongst the four bacterial groups found in both NBD and HBD libraries, the low-G + C gram-positive bacteria was the predominant one, and accounted for 38.71% of the total clones in the libraries. Clones representing this group showed high sequence similarity with Bacillus sporothermodurans type strain (bo22C02, 100% bootstrap value), Streptococcus sanguinis (bo23A06, 99% bootstrap value) and Halanaerobium congolense type strain (bo13A01, 100% bootstrap value) (Fig. 3). The class Actinobacteria accounted for 31.56% of the total number of clones, with clones bo13D04 and bo22B07 representative of the two genera found for this class. Clone bo13D04 grouped with Streptomyces albidoflavus and Streptomyces somalienses, with 100% bootstrap value, whereas clone bo22B07 formed a tight cluster (supported by 82% bootstrap value) with Propionibacterium acnes type strain and two other Propionibacterineae, which were uncultured bacteria from hypersaline lake and hydrocarbon seep, respectively. The two other groups, gamma-Proteobacteria and epsilon-Proteobacteria, accounted for 17.28% and 12.03% of all clones, respectively. Clones bo22A03b and bo13A02b, representative of gamma-Proteobacteria, were related to Acinetobacter lwoffii (72% bootstrap value) and to Marinobacter lipolyticus type strain (100% bootstrap value). Clone bo13C09, representative of epsilon-Proteobacteria, showed high sequence similarity with a solar lake Arcobacter sp. (100% bootstrap value) (Fig. 3).

Statistical comparisons of libraries based on ARDRA data

In order to investigate the community structures represented in the two libraries, diversity indices were used to calculate richness, diversity, dominance and evenness from the ARDRA data. Since these indices are dependent on the total number of clones analyzed and the two libraries were unequally sampled (137 clones in NBD and 129 in HBD), randomization analyses were used to create pseudo-communities at specified sub-sample sizes and thus allow the comparison between the two libraries. Relatively high ribotype richness, as well as high diversity index (Shannon–Wiener), values were observed for both library communities (Fig. 4A, B). The communities from NBD and HBD did not present statistically different diversity index values based upon HaeIII/RsaI ARDRA data at all the sub-sample sizes analyzed (Fig. 4B). The richness analysis showed a slightly different pattern, at least at the larger sub-sample sizes. This data indicates that, when analyzing sample sizes with at least 100 individuals, the community represented in the NBD library may show significantly higher number of different ribotypes than the HBD library or, in other words, higher richness (Fig. 4A). The PIE Hulbert’s evenness index (Fig. 4C) calculates the probability of an inter-specific encounter, i.e., the probability that two randomly sampled individuals from a community represent two different ribotypes. High values for PIE index were reached very rapidly in the analyses indicating that both communities contained a large proportion of singletons (ribotype of single occurrence) (Fig. 4C). Evenness index values did not show statistically significant differences between NBD and HBD libraries, at all sub-sample sizes analyzed. The dominance index, the fraction of the community that is represented by the most common ribotype, reached relatively low values for both communities (Fig. 4D), thus revealing that these did not present strong dominance of a particular ribotype. However, when analyzing sample sizes with at least 80–90 individuals, the community represented in the NBD library may show significantly higher dominance values (Fig. 4D).

Fig. 4
figure 4

Statistical comparisons of libraries based on ARDRA data. NBD library (■) and HBD library (□)

Enrichment and isolation of aerobic microorganisms from crude oil

Five different bacterial genera were recovered from the biodegraded oil sample using cultivation methods. All the isolates obtained were recovered from the enrichments incubated at 28°C, which were subsequently identified and deposited at the Brazilian Collection of Environmental and Industrial Microorganisms (CBMAI). Gram positive rods were observed in the NB culture medium, which were identified as Bacillus subtillis (CBMAI 707) based on 16S rDNA sequence analysis. Gram negative rods were observed on TSB medium, identified as Methylobacterium sp. (CBMAI 706). In the Acinetobacter broth medium, Gram negative coccal rods were recovered and identified as Achromobacter xylosoxidans (CBMAI 709), and when using MB medium, Gram positive long rods identified as Brevibacillus sp. (CBMAI 708) were obtained. In the salts medium Gram positive coco bacilli were observed and characterized as Dietzia sp. (CBMAI 705). No growth was observed in the media PDB and GYM.

Discussion

The nature and diversity of bacteria in oil field ecosystems is still poorly understood. However, recent incorporation of molecular methods has allowed a broader characterization of microbial assemblages in this type of environment. The majority of studies in petroleum microbiology have been conducted using samples of formation water or water–oil mixtures. Only one study reported in the literature used crude oil from petroleum reservoir to analyze microbial communities (Tanaka et al. 2002).

In our study, data derived from cloning, sequencing and phylogenetic analyses allowed, for the first time, the comparison of bacterial communities from oil samples with different levels of biodegradation, originated from Brazilian petroleum reservoirs. However, in order to properly interpret the data obtained, one must keep in mind the limitations inherent to the methodological approaches used. These include the potential impact of sample heating at 80°C, prior to DNA extraction, on the original bacterial composition of the biodegraded oil, considering that the reservoir temperature, in this case, is about 51°C.

All clones obtained from NBD and HBD 16S rRNA gene libraries belonged to four major bacterial classes: Firmicutes, Actinobacteria, gamma-Proteobacteria and epsilon-Proteobacteria, revealing a great microbial diversity in these oil samples.

Sequence analysis of clones from the NBD and HBD libraries revealed phylogenetic relationships with 16S rRNA gene sequences from many microorganisms previously isolated or identified from petroleum, marine and saline lake habitats, such as Acinetobacter (Saadoun 2002), Halanaerobium (Ravot et al. 1997), Marinobacter (Huu et al. 1999) and Arcobacter (Watanabe et al. 2000). Other groups that showed similarity with some of the sequenced clones are not mainly found in these habitats, but according to the literature, they may have some representatives found in oil environments or related to hydrocarbon degradation. This is the case for the genera Bacillus (Stapleton et al. 2000; Nazina et al. 2002; Zhuang et al. 2003), Propionibacterium (Yoshida et al. 2005; Vorob’eva et al. 1979) and Streptomyces (Radwan et al. 1995, 1998; Bachoon et al. 2001). However, the genera Leuconostoc and Streptococcus are usually associated with dairy foods (Damelin et al. 1995; Stiles and Holzapfel 1997) and clinical pathology (Tyrrell et al. 2005), respectively, and this is the first report of the presence of these bacteria associated to oil field or marine environment.

Some of these microbial groups were shown to be exclusive of the biodegraded oil sample, such as the genera Acinetobacter, Bacillus and Streptococcus. Amongst these, Acinetobacter was the most abundant and it has been recognized in the literature as an aerobic degrader of hydrocarbon and lipids (Bach et al. 2003; Pleshakova et al. 2001; Sugimori et al. 2002). In the phylogenetic tree, the clone bo22A03b was shown to be related to A. lwofii, Acinetobacter baumannii, Acinetobacter calcoaceticus and to one unidentified Acinetobacter strain from deep subsurface environment (Vepritskiy et al. 2002). According to Barkovskii and Shub (1986) A. calcoaceticus is able to degrade a dense spectrum of aromatic compounds.

Members from Bacillus have been associated with petroleum and hydrocarbon degradation in many others studies (Stapleton et al. 2000; Nazina et al. 2002; Saadoun 2002; Bach et al. 2003; Lu et al. 2003; Zhuang et al. 2003). Clone bo22C02 showed a close relationship with B. sporothermodurans type strain, which is able to produce highly heat-resistant endospores (Pettersson et al. 1996), and with two others species, Bacillus sonorensis and Bacillus licheniformis, the latter being both thermophilic and halotolerant (Batrakov et al. 2003).

Genera Arcobacter, Halanaerobium, Marinobacter and Propionibacterium were found in both 16S rRNA gene libraries (NBD and HBD) constructed from oil samples. Members from the genus Arcobacter are nitrate-reducing and sulfide-oxidizing bacteria, and some of them have been found in marine and lake habitats, such as the Hawaiian Archipelago, Niger Sea sediments (Thamdrup et al. 2000), North Sea (Eilers et al. 2000) and Solar Lake (Teske et al. 1996). Arcobacter spp. has also been isolated from oil field (Gevertz et al. 2000).

Mesophilic fermentative bacteria, such as the genus Halanaerobium, together with thermophilic and hyperthermophilic fermentative bacteria, constitute an important microbial community of the oil field environment (Magot et al. 2000). Three species of haloanaerobes isolated from oil field brines, Halanaerobium acetoethilicum, Halanaerobium congolense (Ravot et al. 1997) and Halanaerobium salsugo (Bhupathiraju et al. 1994) are heterotrophic moderate halophyles that can use carbohydrates to produce metabolites (acids, solvents and gases), and may be of interest for potential use in microbial enhancement of oil recovery (MEOR). In addition, Ravot et al. (1997) reported that H. congolense was able to reduce thiosulfate to H2S, a compound known to decrease oil quality, corrode steel material and threaten worker’s health due to its high toxicity (Magot et al. 2000).

Representatives from genus Marinobacter are, in general, facultative anaerobes, thermophilic and halophilic organisms. M. lipolyticus, which showed the closest relationship with clone bo13A02b, is a halophylic organism that possesses lipolytic activity. Two other species grouped with clone bo13A02b, Marinobacter hydrocarbonoclasticus, a marine bacterium highly halotolerant and able to degrade hydrocarbon (Gauthier et al. 1992; Doumenq et al. 2001; Barbeau et al. 2002), and Marinobacter aquaeolei, a bacterium isolated from a Vietnamese oil-producing well (Huu et al. 1999).

Clone bo22B07 was related to the P. acnes type strain and to two uncultured Propionibacterineae bacteria, found in hypersaline lakes and hydrocarbon seeps. The presence of Propionibacterium in high abundance in the biodegraded oil (Fig. 2, Table 2) is corroborated by the findings of Yoshida and co-workers (2005), who detected P. acnes at high frequency in samples from stored crude oil, suggesting that these bacteria are typical from crude oil and not contaminants of the oil stockpile. Yet, this genus may be related to lipolytic activity (Jarvis et al. 1998) or hydrocarbon oxidation (Vorob’eva et al. 1979).

Although Streptomyces spp. are well recognized as hydrocarbon degraders (Radwan et al. 1995, 1998; Bachoon et al. 2001), their abundance was shown to be higher in the NBD library (Fig. 2). This might be explained by the fact that these organisms are not the main responsible for hydrocarbon degradation in these oil fields. In the phylogenetic tree, clone bo13D04 formed a tight cluster with S. albidoflavus and S. somaliensis species (Fig. 3). This group showed close relationship with a thermotolerant Streptomyces strain and with an isolate from marine sediments. The other specific NBD library genus was Leuconostoc, represented by only one clone related to Leuconostoc citreum. This genus has already been related with MEOR (Behlülgil and Mehmetogelu 2002).

Randomization analysis of ARDRA data revealed valuable information about the genetic structure of the communities recovered in the NBD and HBD libraries. Diversity and evenness analyses showed similar community structures in both libraries, with no significant differences for all sub-samples analyzed. The high PIE Hulbert Index values observed (Fig. 4C), reaching almost the maximum value (indicative of high proportion of singletons), and the steep shape of the richness curve (Fig. 4A), strongly indicates that much more diversity may be encountered if larger sample efforts are made. These data also show that there is a high genetic diversity in the bacterial communities inhabiting both environments, with no correlation to the oil biodegradation level. Despite the high evenness of both communities, these showed some dominance of the most common ribotype, expressed by the stabilization of the dominance index at values between 0.16 and 0.27 (Fig. 4D). The difference observed between the indices from both libraries could be explained by the high abundance of one of the ribotypes representing the Halanaerobium genus in the NBD library (Fig. 2).

It is important to note that due to the nature of the randomization analyses used, comparisons can be made only in relation to the sample effort of the smallest sample, which is 129 clones (from HBD library), and should not be extrapolated to larger samples. Moreover, these comparisons relate just to the samples analyzed and cannot be extrapolated to the original community from which the libraries were constructed.

Parallel experiments carried out by the authors have employed cultivation-based techniques in order to evaluate the diversity of cultivable bacteria in the same samples. The strategy involved the use of culture media for enrichment of heterotrophic aerobic bacteria, using oil samples as inoculum and sole carbon source. 16S rRNA gene sequencing-based identification of the isolated microorganisms showed the presence of bacteria related to A. xylosoxidans, B. subtilis, Brevibacillus sp., Dietzia sp. and Methylobacterium sp. These organisms have all been previously reported in the literature as hydrocarbon degraders and/or associated to oil field environments (Al-Sharidah et al. 2000; Bieszkiewicz et al. 1998; Chaillan et al. 2004; Orphan et al. 2000; Pineda-Flores et al. 2004; Nazina et al. 2002), suggesting that they might have a widespread distribution in oil reservoirs.

Comparison between the data obtained using cultivation-independent and culture-based enrichment methods suggests that different selection of community members may occur. Only the genus Bacillus was recovered using both strategies. This data may indicate that microorganisms recovered by enrichment cultures and not detected by molecular methods are not major components of the in situ reservoir assemblage. Alternatively, they may have been missed due to PCR biases when using community DNA, such as preferential amplification, or differential lysis efficiency may have occurred when purifying DNA from the samples prior to 16S rRNA gene library assembly.

Theses discrepancies between microbial groups recovered using the two strategies were also observed by other authors in different environments, such as soil and sediment (Chandler et al. 1997; Stephen et al. 1996), suggesting that a more comprehensive assessment of microbial diversity in oil, and probably other environments, can be obtained by using a combination of culture- and molecular-based techniques than by using either method alone.

In conclusion, analysis of bacterial communities in oil samples from Brazilian reservoirs by using a combined approach, 16S rRNA gene libraries and enrichment cultures, was successful and allowed the identification of distinct microbial groups known to be associated with petroleum and/or marine environments. This study presents an emerging view of the bacterial community diversity of Brazilian oil samples, based on both molecular and cultivation techniques, allowing researchers insights into the microorganisms that might be involved in the biogeochemical transformations that take place in these environments.