1. Introduction
The species Lagenaria siceraria (Molina) Standl is a member of the Cucurbitaceae family, Cucurbitaceae subfamily and Benincaseae tribe [1] . The genus Lagenaria consists of five other wild species, namely Lagenaria breviflora (Benth) Roberty, Lagenaria rufa (Gilg) C. Jeffrey, Lagenaria sphaerica E. May, Lagenaria abyssinia (Hook F) C. Jeffrey, Lagenaria guinensis (G. Den) C. Jeffrey and Lagenaria siceraria (Molina) Standley is the most cultivated within the species of siceraria; two morphologically distinct sub-species of bottle gourd have been recognized thus Lagenaria siceraria subspecies asiatica and Lagenaria siceraria subspecies siceraria [2] [3] [4] . It is universally agreed fact, that different species are systems of population which exhibit variation and not fixed entities, therefore no individual is identical as developed by [5] . Systematics as defined by [6] , states that systematics is a natural science that deals with the study of individual, population and taxon relationship for the purposes of classification. Plant systematics is therefore based on the premise that in the enormous variation in the plant kingdom there exist discrete units called species that can be identified classified and named with further logical relationships developed among these units. Over the years, plant groupings have been redefined as more information is accumulated from newer approaches and different recent sources such as molecular systematics, and taximetrics. Variety of Lagenaria is known throughout West Africa where they serve different economic purposes but these cultivars are largely unrecorded in terms of micromorphological characteristics [7] and diversity of fruit shape and sizes continue to create serious taxonomic difficulties in the delimitation of the taxon Lagenaria siceraria (Mol) Standley below the specific level.
The emerging field of molecular systematics deals with the exclusive utilization of molecular data [6] . A molecular marker is a gene or DNA sequence with a known location on a chromosome that can be used to identify species, which detect variation at the nuclear, Mitochondrial, and chloroplast DNA. Molecular markers have been developed and utilized to provide the requisite landmarks for clarification of genetic variation [8] . DNA is more stable macromolecule than RNA and is found in all plant tissues hence, DNA based markers are preferred for precise identification of plant species [9] . An understanding of the extent of genetic diversity within and amongst the landraces of the gourd is important for strategic breeding and conservation [10] .
Plants are generally more tolerant of variation, hence genetic variation may result from mutation or recombination [6] [11] . According to [12] , effective utilization of plant genetic resources depends on a detailed understanding of their genetic variability. Interestingly, substantial variation exists in the ability of plants to tolerate gene imbalance both between different plant species and between varieties of the same species [13] . However, despite the widespread interest in molecular systematics, there is limited understanding of the molecular mechanisms that lead to phenotypic alterations in plant species as well as gene interactions involved in the global genomic level [14] , went further to explain that the molecular systematic studies give deeper insight into genetic structures. It has performed an important role in molecular biologies, such as analyzing genetic diversity by classification of cultivars and germplasm collections. It has also aided in clearing phylogenetic relationships among groups or closely related species. It is further used to determine the sequence of nucleotides in the DNA of plants, conclusively, has increased significantly, understanding of plant evolution.
2. Materials and Methods
2.1. Study Map
The species were planted in the faculty of Agricultural science research farm, of the University of Port Harcourt (Figure 1).
2.2. Plant Material and DNA Isolation
Twenty-four landraces of Lagenaria siceraria (Mol.) Standl. were collected from different locations of Nigeria and planted at Agricultural Science Research Farm. The young fresh leaves from A African Bottle Gourd, B Kettle Gourd, C Caveman Club Gourd, D Base Ball Gourd, F Bushel Gourd, G Bird House Gourd, H Water Jug Gourd, I Cup Gourd, K Warted Bushel Gourd, L Extra Large Pawpaw Gourd, M Long Siphon Gourd, N Indian Gourd, O Chinese Bottle Gourd, P Mini Dipper Gourd, Q Goose Neck Gourd, S Swan Gourd, T Palm Wine Gourd, U Long Handle Dipper Gourd, V Powder Horn Gourd, W Snake Gourd and X Microphone Gourd were analyzed at Biotechnology Research Centre of the University of Port Harcourt in Partnership with International Institute of Tropical Agriculture (IITA) Ibadan in 2018. The fresh leave sample from 21-day-old seedlings was crushed using liquid nitrogen. Genomic DNA was extracted using a Zymo Quick Plant/Seed extraction method (2010) and stored at −20˚C. The quantity of the DNA was determined using Thermo ScientificTM NanoDrop 2000/2000C spectrophotometer, while the quality of DNA isolated was analyzed using 0.8% agarose gel electrophoresis and viewed with UV Transilluminator—BIOCOMdirect.
Polymerase chain reaction (PCR) analysis was carried out using standard PCR procedure. The amplification of the extracted DNA was done using Gene Amp® PCR System 9700 Dual 384—well sample block module, according to the protocol of Applied Biosystems (2002). The rbcl primer having a forward sequence of F (5’ ATGTCACCACAAACAGAAAC 3’) and reverse rbcl primer sequence of R (5’ TCGCATGTACCTGCAGTAGC 3’) with an expected size of 724 bp, 10 × PCR buffer 1.0, 50 mM MgCl2 0.4, 5 pMol forward primer 0.5, 5 pMol reverse primer 0.5, DMSO 0.8, 2.5 Mm DNTPs 0.8, Taq 5 u/µl 0.1, 100 ng/µl DNA 3.0, H2O 2.9 µl and a final volume of 10 µL. The amplification products were analyzed using a 1.5% agarose gel stained with EZ-Vision® in Gel solution. The samples are arranged A-X after the ladder and the ladder used is 50 bp from NEB (Plate 1).
Plate 1. Showing 21 days old Lagenaria siceraria (Mol.) Standl.
The amplification products from land races of Lagenaria siceraria (Mol.) Standl. were separated using 1.5% agarose electrophoresis and the expected size of bands were excised using scapel and purified by Spin Column Purification method. Purified amplicons were sequenced using BigDye® Terminator v3.1 Cycle Sequencing Kit protocol in ABI Prism 377 DNA Sequencer (Applied Biosciences, NY, USA).
2.3. Sequence Alignment and Phylogenetic Analysis
Lagenaria siceraria sequences in FASTA format were aligned and used in the analysis of protein content of the twenty three species, the process of sequence analysis was advanced through pair wise alignment, construction of a distance matrix using Mega X. The sequences were compared to the NCBI database using Nucleotide Basic Local Alignment Search Tool (BLASTN). The results of distance matrix were sent to Excel 2007, which was finally analyzed using PAST 3.14 software for the construction of phylogenetic tree showing similarity and distance relationship between the twenty-three species. The chloroplast sequences of L. siceraria from NCBI data base were analysed using CHLOROBOX.
3. Result
Molecular result of twenty-four samples of the landraces of L. siceraria is obtained below. The result of gel electrophoresis (Figure 2) shows the quality of genomic DNA after extraction. The bands is observed in A African Bottle Gourd, B Kettle Gourd, C Caveman Club Gourd, D Base Ball Gourd, F Bushel Gourd, G Bird House Gourd, H Water Jug Gourd, I Cup Gourd, K Warted Bushel Gourd, L Extra Large Pawpaw Gourd, M Long Siphon Gourd, N Indian Gourd, O Chinese Bottle Gourd, P Mini Dipper Gourd, Q Goose Neck Gourd, S Swan Gourd, T Palm Wine Gourd, U Long Handle Dipper Gourd, V Powder Horn Gourd, W Snake Gourd and X Microphone Gourd, showing the presence of the gDNA while in E Pot Gourd, J Pennis Shield Gourd and R Nigeria Rattle Gourd showed no band.
The polymerase chain reaction product using rbcl primer (Figure 3(a)) shows a visible band at 850 bp in A African Bottle Gourd, C Caveman Club Gourd, D Base Ball Gourd, F Bushel Gourd, I Cup Gourd, K Warted Bushel Gourd, L Extra Large Pawpaw Gourd, M Long Siphon Gourd, N Indian Gourd, O Chinese Bottle Gourd, P Mini Dipper Gourd, R Nigeria Rattle Gourd, S Swan Gourd, T Palm Wine Gourd, U Long Handle Dipper Gourd, V Powder Horn Gourd, W Snake Gourd and X Microphone Gourd while B Kettle Gourd, E Pot Gourd, G
Figure 2. Gel electrophoresis: A: Showing Quality of gDNA extracted.
(a) (b)
Figure 3. (a) and (b), gel electrophoresis of PCR product using rbcl primer.
Bird House Gourd, H Water Jug Gourd, J Pennis Shield Gourd and Q Goose Neck Gourd showed no band on the agarose gel electrophoresis. In the repeat of the polymerase chain reaction (Figure 3(b)) it is observed that B Kettle Gourd, G Bird House Gourd, H Water Jug Gourd, J Pennis Shield Gourd and Q Goose Neck Gourd also showed band at 850 base pair.
The quantity of DNA extracted is read using spectrophotometer as shown in Table 1. The value ranges from 1.13 to 1.82
In Table 2 the nucleotide sequence arrangement possesses an N character of 720. The sequence alignment for twenty-three landraces of L. siceraria shows great diversity and variability in the arrangement of the bases (Table 3).
The result observed from BLASTN (Table 4) shows that all the sequences belong to Lagenaria siceraria with percentages ranging from 95% to 100% for query cover sequences and 98% to 100% for identity sequences with an E-value of 0.00. From the taxonomic report, sequence F-Bushel Gourd, K-Warted Bushel Gourd, U-Long Handle Dipper Gourd, I-Cup Gourd, L-Extra, M-Long Siphon Gourd, S-Swan Gourd, T-Palm Wine Gourd and W-Snake Gourd has the highest hits of 44 on Lagenaria siceraria out of 109 total value, sequence O-Chinese Bottle Gourd, P-Mini Dipper Gourd and R-Nigeria Rattle has the highest hits of 44 on L. siceraria out of the total value of 111, sequence V-Powder Horn Gourd has the highest hits of 44 on Lagenaria siceraria out of 119 total value, sequence C-Caveman Club Gourd, G-Bird House Gourd, H-Water Jug Gourd, J-Pennis Sheild, Q-Goose Neck Gourd, obtained the highest hit of 45 out of the total value 104, while sequence A-African Bottle Gourd, B Kettle Gourd, D Base Ball Gourd, has a total taxonomic value of 104 out of which 44 were identified as Lagenaria siceraria. The total taxonomic value of N-Indian Gourd, was observed as 110 with the highest hit value being 44 on Lagenaria siceraria and sequence X-Microphone Gourd finally has a highest value hits of 44 on Lagenaria siceraria out of 105 total value.
The total nucleotide composition of the species (Table 5) varies across the different landraces. A-African Bottle Gourd is observed to have a total nucleotide composition of 689, thymine having an average frequency of 20.1%, cytosine 20.0%, adenine 28.6%, and guanine 22.6%. The total nucleotide composition in B-Kettle Gourd is 686 with an average percentage frequency of 29.0% for thymine, 19.1% for cytosine, 28.9% for adenine and 22.4% for guanine. C-Caveman
Table 1. Nanodrop of Lageneria siceraria (Mol.) Standl.
Table 2. Nucleotide sequence arrangement.
Table 3. Sequence alignment for twenty-three (23) fruit shapes of L. siceraria (Mol.) Standl. Complex.
Table 4. Nucleotide basic local alignment search tool (BLAST N).
Table 5. Nucleotide composition frequencies of Lageneria siceraria (Mol.) Standl.
Club Gourd have a nucleotide composition of 675 with an average percentage frequency of thymine 29.3%, cytosine 19.7%, adenine 28.0% and guanine 23.0%. The nucleotide composition of D-Base Ball Gourd consist of 680 nucleotides with an average percentage frequency of thymine 29.9%, cytosine 19.4%, adenine 28.4% and guanine 22.4%, in F-Bushel Gourd, the nucleotide composition is 716 while thymine 30.3%, cytosine 18.9%, adenine 28.6% and guanine 22.2%, G-Bird House Gourd has a total composition of nucleotide as 661, thymine 29.2%, cytosine 19.7%, adenine 28.0% and guanine 23.1%, the total composition of nucleotide in H-Water Jug Gourd is 664, thymine 29.2%, cytosine 19.1%, adenine 28.7% and guanine 23.0%, in I-Cup Gourd the total nucleotide composition is 714, thymine 30.0%, cytosine 19.0%, adenine 28.7% and guanine 22.3%, the total nucleotide composition is 665 in J-Pennis Shield Gourd, thymine 28.9%, cytosine 19.7%, adenine 28.1% and guanine 23.3%, also in K-Warted Bushel Gourd the total nucleotide composition is 708, thymine 30.2%, cytosine 18.9%, adenine 28.7% and guanine 22.2%, the total nucleotide composition of L-Extra Large Pawpaw Gourd is 719 were the percentage frequency of the bases are thymine 30.0%, cytosine 18.8%, adenine 28.8% and guanine 22.4%, M-Long Siphon Gourd has a total nucleotide composition of 718 and the percentage frequency of is thymine 29.9%, cytosine 19.1%, adenine 28.6% and guanine 22.4%, in N-Indian Gourd, the percentage frequency of thymine 29.2%, cytosine 19.4%, adenine 28.5% and guanine 22.9%, it possesses a total nucleotide composition of 720. O-Chinese Bottle Gourd has a total nucleotide composition as 677 while the percentage frequency of thymine is 29.7%, cytosine 19.6%, adenine 27.9% and guanine 22.7%, in P-Mini Dipper Gourd the total nucleotide composition equals 679 nucleotides while the percentage frequency of the nucleotides has thymine to be equal to 29.5%, cytosine 19.9%, adenine 28.1% and guanine 22.5%, Q-Goose Neck Gourd, the total composition of nucleotide is 651 while the percentage frequency of each bases are thymine 29.5%, cytosine 19.8%, adenine 27.3% and guanine 23.3%, in R-Nigeria Rattle Gourd, the total nucleotide composition is 705 and the percentage frequency of the nucleotide is thymine 29.8%, cytosine 19.6%, adenine 27.8% and guanine 22.8%. The total nucleotide composition of S-Swan Gourd is 718 having a percentage frequency as thymine 29.4%, cytosine 19.5%, adenine 28.7% and guanine 22.4%, T-Palm Wine Gourd the total nucleotide composition is 714 were the percentage frequency of the nucleotides are thymine 30.4%, cytosine 19.0%, adenine 28.3% and guanine 22.3%, in U-Long Handle Dipper Gourd the frequency of the nucleotides are thymine 30.0%, cytosine 19.0%, adenine 28.9% and guanine 22.4%, and a total nucleotide composition of 717. V-Powder Horn Gourd posseses a total nucleotide content of 702, with a percentage frequency of thymine 30.2%, cytosine 18.7%, adenine 28.2% and guanine 22.9%, in W-Snake Gourd the total nucleotide composition is 707 while the percentage frequency of the bases are: thymine 30.6%, cytosine 19.1%, adenine 28.6% and guanine 21.8%, and finally, X-Microphone Gourd is observed to have a total nucleotide composition of 719 with thymine as 29.8%, cytosine 18.9%, adenine 28.1% and guanine 23.2%, percentage frequencies.
Estimates of Evolutionary Divergence between Sequences are shown in the data matrix (Table 6). The number of base differences per site from between sequences is shown. Standard error estimate(s) are shown above the diagonal and were obtained by using analytical formulas. The rate variation among sites was modelled with a gamma distribution (shape parameter = 0.05). The analysis involved 23 nucleotide sequences. Codon positions included were 1st + 2nd + 3rd + Noncoding. All ambiguous positions were removed for each sequence pair. There were a total of 720 positions in the final dataset. Evolutionary analyses are conducted in MEGA X [15] .
The phylogenetic tree (Figure 4) shows great diversity and similarity in Long handle dipper -U and Microphone gourd-X are out group from the root, and meets at similarity scale of 0.7, while Kettle gourd-B and Caveman club gourd-C are out group from the second root of the tree meeting at a similarity scale of 1.0. The second root has two main clusters that meets at different similarity scale. The first cluster of the second root (R2a), African bottle gourd-A is seen as out group at 0.65 while Snake gourd-W and Powder horn gourd-V clustered together at 1.0 similarity scale. Swan gourd-S and palmwine gourd-T meets at 0.89 while Mini dipper gourd-P and Goose neck gourd-Q clustered at 1.0, Nigerian gourd-R is observed as an out group at 0.85 similarity scale. In the second cluster of the second root (R2b), Cup gourd-I was an out group at 0.85 while Baseball gourd-D, Bird house gourd-G, Water jug gourd-H, Penis shield gourd-J, Warted bushel gourd-K, Extra-large pawpaw gourd-L, Long siphon gourd-M, Indian gourd-N and Chinese gourd-O met at 1.0 similarity scale.
4. Discussion
According to [6] , the concentration of macromolecular studies was geared towards DNA and RNA hence, molecular systematics deals with the utilization of nucleic acid data. Genetic materials have been used in the understanding of the
Figure 4. Phylogenetic tree of Lageneria siceraria (Mol.) Standl.
Table 6. Distance matrix of Lageneria siceraria (Mol.) Standl.
evolutionary relationship. The quality of the chloroplast DNA was very visible at 850 bp across the landraces of L. siceraria using agarose gel electrophoresis (Figure 2; Figure 3). The quantified DNA (Table 3) had an absorbance ratio of 1.2 to 1.8 showing the purity of the DNA, which is in line with the work of [16] [17] . The conserved rbcL gene has been utilized in PCR amplification of chloroplast gene sequences for determining and ratifying phylogenies [18] [19] . The sequence alignment of the twenty-three landraces from Nigeria shows great variation in the arrangement of the nucleotide bases (Figure 4), which is due to gene recombination and mutation.
The sequences were subjected to validation through National Center for Biotechnology Information (NCBI) using Nucleotide Basic Local Alignment Search Tool (BLAST N). The result obtained (Table 4) proves that all the sequences belong to Lagenaria siceraria (Mol.) standl. with percentages ranging from 95% to 100% for query cover sequences and 98% to 100% for identity sequences with an E-value of 0.00. From the taxonomic report obtained sequence F-Bushel Gourd, K-Warted bushel gourd, U-Long handle dipper, I-Cup gourd, L-Extra-large pawpaw gourd, M-Long siphon gourd, S-Swan gourd, T-Palmwine gourd, W-Snake gourd has the highest hits of 44 on Lagenaria siceraria out of 109 total value, sequence O-Chinese gourd, P-Mini dipper gourd and R-Nigeria rattle gourd has the highest hits of 44 on L. siceraria out of the total value of 111, sequence V has the highest hits of 44 on Lagenaria siceraria out of 119 total value, sequence C-Caveman club gourd, G-Bird house gourd, H-Water jug gourd, J-Pennis shield gourd, Q-Goose neck gourd obtained the highest hit of 45 out of the total value 104, while sequence A-African bottle gourd, B-Kettle gourd, D-Baseball gourd has a total taxonomic value of 104 out of which 44 were identified as Lagenaria siceraria. The total taxonomic value of N-Indian gourd was observed as 110 with the highest hit value being 44 on L. siceraria and sequence X-Microphone gourd finally has a highest value hits of 44 on Lagenaria siceraria out of 105 total value.
Thus the phylogenetic tree (Figure 4) was constructed to show the relatedness of the landraces (biotypes). The Long handle dipper and Microphone gourd are out group from the root, and meets at similarity scale of 0.7, while Kettle gourd and Caveman club gourd are out group from the second root of the tree meeting at a similarity scale of 1.0. The second root has two main clusters that meets at different similarity scale. The first cluster of the second root (R2a), African bottle gourd is seen as out group at 0.65 while snake gourd and powder horn gourd clustered together at 1.0 similarity scale. Swan gourd and palmwine gourd meets at 0.89 while Mini dipper gourd and Goose neck gourd clustered at 1.0, Nigerian gourd is observed as an out group at 0.85 similarity scale. In the second cluster of the second root (R2b), Cup gourd was an out group at 0.85 while Baseball gourd, Bird house gourd, Water jug gourd, Penis shield gourd, Warted bushel gourd, Extra-large pawpaw gourd, Long siphon gourd, Indian gourd and Chinese gourd met at 1.0 similarity scale. The phytogenetic studies, have therefore aided in proper understanding of the gene pool and genetic variability of the landraces (biotypes) of L. siceraria found in Nigeria.
5. Conclusion
The use of bioinformatics tools in biosystematics studies of Lagenaria siceraria landraces (biotypes) found in Nigeria has served as a tool in resolving the quagmire in the diversity of L. siceraria complex and the classification of the Landraces into proper taxa. Hence molecular systematics of the species demonstrates differences in sequence arrangement that is due to gene recombination and the functional effect of heterologous genes expressed phenotypically on the fruit shape thereby resulting in the diversity of in fruit shapes of Lagenaria siceraria (Mol.) Standl. found in Nigeria.