Genetic analysis of human papillomavirus type 11
isolates from patients with recurrent respiratory
papillomatosis treated at Universitas Academic
Hospital
Corne Thuynsma
July 2021
Genetic analysis of human papillomavirus type 11 isolates
from patients with recurrent respiratory papillomatosis
treated at Universitas Academic Hospital
by
Corne Thuynsma
2014035048
Submitted in fulfilment of the requirements for the degree
Magister Medical Scientiae (Medical Virology)
in the Division of Virology
in the Faculty of Health Sciences
at the University of the Free State
Supervisor: Prof. Felicity Jane Burt, Division of Virology, Faculty of Health Sciences,
University of the Free State
Co-Supervisor: Prof. Riaz Seedat, Department of Otorhinolaryngology, Faculty of the Health
Sciences, University of the Free State
2021
Bloemfontein
South Africa
i
Declaration
“I, Corne Thuynsma, declare that the Master’s Degree research dissertation or interrelated, publishable
manuscripts/published articles, or coursework Master’s Degree mini-dissertation that I herewith submit
for the Master’s Degree qualification in Medical Virology at the University of the Free State is my
independent work, and that I have not previously submitted it for a qualification at another institution
of higher education.”
Signature: Date: 30 July 2021
ii
Acknowledgements
First and foremost, I would like to give my special regards to Prof. Felicity Burt. She offered invaluable
advice and constant support during my M.Med.Sc study. Her immense knowledge and experience have
encouraged me in my academic research and daily life. I wish to show my gratitude to Prof Riaz Seedat
for his insightful comments and suggestions on my dissertation. I would also like to thank Armand for
his support on the technical part of my study. You all provided invaluable assistance during my research.
My gratitude extends to the Poliomyelitis Research Foundation for the funding opportunity to undertake
my studies at the Division of Virology, University of the Free State.
I wish to acknowledge the support and great love of my family and friends. To my laboratory mates–
Gerdus, Micah, and Nina, I treasure the time spent together in the laboratory and the office. Coffee
breaks with them have made my study and life in the Bloemfontein wonderful.
I want to express my gratitude to my family and loved ones for giving me all the wonderful opportunities
that have made me who I am and helped me flourish. I am so lucky to have a family that fills my heart
with strength, joy, and love. Without their tremendous encouragement, it would have been impossible
for me to complete my studies.
Scientific research is one of the most exciting and rewarding of occupations – Frederick Sanger
iii
Table of Contents
List of figures viii
List of tables ix
List of abbreviations xi
Abstract xv
CHAPTER 1 - Literature review 1
1.1. Background 1
1.2. Discovery of human papillomavirus 2
1.3. Human papillomavirus genome and proteins 4
1.3.1. The early region 4
1.3.2. The late region 8
1.3.3. Upper regulatory region 8
1.4. Classification and taxonomy of human papillomavirus 9
1.5. Life cycle of human papillomavirus 10
1.6. Transmission of human papillomavirus 12
1.7. Epidemiology of human papillomavirus type 11 in South Africa 13
1.8. Human papillomavirus pathogenicity 14
1.9. Host immune response to human papillomavirus 15
1.10. Recurrent respiratory papillomatosis 16
1.10.1. Risk factors associated with the development of recurrent respiratory papillomatosis 17
1.10.2. Diagnosis of recurrent respiratory papillomatosis 18
1.10.3. Treatment of recurrent respiratory papillomatosis 18
1.11. Prophylactic vaccines against human papillomavirus 19
1.12. Detection of human papillomavirus infections 21
1.12.1. Nucleic acid hybridisation assays 21
1.12.2. Signal amplification assays 21
1.12.3. Nucleic acid amplification assays 21
1.12.4. Genotyping 22
1.13. Phylogenetic analysis of human papillomavirus isolates 23
1.13.1. Sequence evolution 23
1.13.2. Multiple sequence alignment 24
1.13.3. Phylogenetic trees 24
1.13.4. Types of phylogenetic methods 24
1.14. Problem statement 25
iv
1.15. Aims and objectives 25
CHAPTER 2- Phylogenetic analysis of human papillomavirus type 11 isolates 27
2.1. Introduction 27
2.2. Materials and methods 29
2.2.1 Study samples 29
2.2.2 Study setting and population 29
2.2.3 Amplification of five human papillomavirus genomic regions 29
2.2.4 Agarose gel electrophoresis 31
2.2.5 Purification of PCR product 32
2.2.6 Determination of DNA concentration 33
2.2.7 Sequencing 33
2.2.8 Data analysis 35
Analysis of nucleotide variation 35
Degree of divergence 35
Analysis of amino acid variation 36
Phylogenetic analysis 36
2.2.9 Ethical considerations 36
2.3. Results 37
2.3.1 Patient data 37
2.3.2 L1 ORF amplification and sequencing 37
2.3.3 URR amplification and sequencing 40
2.3.1 E5a/b ORF amplification and sequencing 42
2.3.2 E2 ORF amplification and sequencing 44
2.3.3 Analysis of amino acid variations 44
Analysis of L1 ORF sequences 44
Analysis of E5a/b ORF sequences 46
2.3.4 Phylogenetic analysis 46
Maximum Likelihood method 47
Pairwise analysis of percentage divergence of nucleotides 49
Analysis of E2 segment as representative of whole genome variation 51
2.4. Discussion 55
CHAPTER 3- Characterisation of novel human papillomavirus type 11 isolates 61
3.1. Introduction 61
3.2. Materials and methods 62
v
3.2.1. Study samples 62
3.2.2. Full‐genome amplification and sequencing 63
3.2.3. Agarose gel electrophoresis 65
3.2.4. Purification of PCR product 65
3.2.5. Determination of DNA concentration 65
3.2.6. MiSeq library preparation and sequencing 65
3.2.7. Next-generation sequencing data analysis 66
Analysis of nucleotide and amino acid variation 66
Percentage variation in coding regions 66
Degree of divergence 66
Maximum likelihood method 67
3.3. Results 67
3.3.1. Selection of isolates for whole genome sequencing 67
3.3.2. Nucleotide and amino acid variances across the human papillomavirus type 11 genome 69
3.3.3. Pairwise nucleotide difference between complete human papillomavirus type 11 genomes 72
3.3.4. Maximum Likelihood method 73
3.3.5. Genome heterogeneities with respect to the corresponding prototype 73
3.4. Discussion 76
Chapter 4 - Concluding remarks 78
References 82
Appendices I
Appendix 1: Human papillomavirus type 11 positive patient sample information I
Appendix 2: Human papillomavirus type 11 isolates retrieved from GenBank and accession
numbers IV
Appendix 3: Health Sciences Research Ethics Committee approval document VI
Appendix 4: Design of E2 segment primers VII
Appendix 5: Specificity of E2 segment primers VIII
Appendix 6: 1x TAE preparation IX
Appendix 7: Agarose gel preparation IX
Appendix 8: GelRed stain preparation IX
Appendix 9: Amino acid codon chart X
vi
Appendix 10: Pairwise analysis of percentage divergence of nucleotides using the human
papillomavirus type 11 concatenated E5a/b-L1-URR data set XI
Appendix 11: Pairwise analysis of percentage divergence of nucleotides using the 208bp
human papillomavirus type 11 E2 segment data set XV
Appendix 12: Pairwise analysis of percentage divergence of nucleotides using the human
papillomavirus type 11 complete genomes XX
vii
List of figures
Figure 1: Genome organisation of a low-risk human papillomavirus type 11 (HPV11). 5
Figure 2: Schematic drawing of human papillomavirus type 11 (HPV11) replication initiation
mechanisms. 7
Figure 3: Human papillomavirus (HPV) replication in epithelial cells. 12
Figure 4: Phylogenetic tree of human papillomavirus type 11 (HPV11) based on alignment of 96
nucleotide sequences of HPV11 concatenated L1-URR. This analysis involves 68 isolates sequenced in
this study and 28 sequences of geographically distinct isolates retrieved from GenBank. 48
Figure 5: Phylogenetic tree of Human papillomavirus type 11 (HPV11) based on alignment of 95
nucleotide sequences of HPV11 concatenated E5a/b-L1-URR. This analysis involves 67 isolates
sequenced in this study and 28 sequences of geographically distinct isolates retrieved from GenBank.
50
Figure 6: Phylogenetic tree of Human papillomavirus type 11 (HPV11) based on 95 nucleotide
sequence alignments of HPV11 E2 segment. This analysis involves 67 isolates sequenced in this study
and 28 sequences of geographically distinct isolates retrieved from GenBank. 53
Figure 7: Phylogenetic tree of Human papillomavirus type 11 (HPV11) based on 95 nucleotide
sequence alignments of HPV11 E2 segment. This analysis involves 67 isolates sequenced in this study
and 28 sequences of geographically distinct isolates retrieved from GenBank. 54
Figure 8: Genome organisation of a low-risk human papillomavirus type 11 (HPV11). 64
Figure 9: Phylogenetic tree of human papillomavirus type 11 (HPV11) based 32 nucleotide sequence
alignments of HPV11 including four isolates sequenced in this study and 28 sequences of
geographically distinct isolates retrieved from GenBank. 75
viii
List of tables
Table 1: List of primers used for sequencing the L1, E5a/b, upper regulatory region (URR) and a
segment of the E2 of human papillomavirus type 11 (HPV11) modified from Maver et al., 2011. 31
Table 2: Polymerase chain reaction (PCR) components for amplifying human papillomavirus type 11
(HPV11) L1 gene, E5a/b gene, and upper regulatory region (URR). 32
Table 3: List of primers used for sequencing the L1 gene of human papillomavirus type 11 (HPV11)
from Maver et al., 2011. 33
Table 4: Sequencing reaction components for nucleotide determination of human papillomavirus type
11 (HPV11) isolates. 34
Table 5: Quantitative estimation of DNA concentration revealed by Nanodrop spectrophotometer of
human papillomavirus type 11 (HPV11) genomic regions including late region L1, upper regulatory
region (URR), early region E5a/b and a segment of early region E2. 38
Table 6: Sequence comparison of nucleotide variances among 72 human papillomavirus type 11
(HPV11) isolates L1 region against the HPV11 prototype with GenBank accession number M14119.1.
39
Table 7: Sequence comparison of nucleotide 68 among various human papillomavirus type 11 (HPV11)
isolates upper regulatory region (URR) against the HPV11 prototype with GenBank accession number
M14119.1. 41
Table 8: Sequence comparison of nucleotide variances among 67 human papillomavirus type 11
(HPV11) isolates E5a/b region against the HPV11 prototype with GenBank accession number
M14119.1. 43
Table 9: Sequence comparison of nucleotide variances among 67 human papillomavirus type 11
(HPV11) isolates E2 segment against the HPV11 prototype with GenBank accession number
M14119.1. 45
Table 10: GenBank accession numbers of human papillomavirus type 11 (HPV11) isolates and the
(sub)lineage representation. 47
Table 11: List of primers used for sequencing full-length human papillomavirus type 11 (HPV11)
genome in two overlapping fragments. 63
ix
Table 12: Polymerase chain reaction (PCR) components for amplifying human papillomavirus type 11
full-length genome in two overlapping fragments (E1 to L1 genes; L1 to E1 genes). 64
Table 13: Patient information. 68
Table 14: Next-generation sequencing assembly report for four complete human papillomavirus type
11 (HPV11) mapped to the A1 prototype M14119.1 using Geneious Prime software
(https://www.geneious.com). 68
Table 15: The lengths of the complete human papillomavirus type 11 (HPV11) consensus sequence,
the CG content and the pairwise identity of four HPV11 isolates mapped to the A1 prototype M14119.1
using Geneious Prime software (https://www.geneious.com). 68
Table 16: Sequence comparison of nucleotide variances among four human papillomavirus type 11
(HPV11) isolates against the HPV11 prototype with GenBank accession number M14119.1. 70
Table 17: Comparison of amino acid variances among four human papillomavirus type 11 (HPV11)
isolates against the HPV11 prototype with GenBank accession number M14119.1. 71
Table 18: Estimates of evolutionary divergence between four human papillomavirus type 11 (HPV11)
sequences using MEGA X software. 72
Table 19: Genomic variations between human papillomavirus type 11 (HPV11) lineage B isolates
VBD28/14 sequenced in this study and lineage B prototype LN833183.1 retrieved from GenBank. 73
Table 20: Genomic variations between human papillomavirus type 11 (HPV11) sub-lineage A2 isolates
sequenced in this study and sub-lineage A2 prototype LN833161.1 retrieved from GenBank. 74
x
List of abbreviations
A Alanine
AE Early polyA signal
AIDS Acquired immunodeficiency syndrome
ATP Adenosine triphosphate
BLAST Basic Local Alignment Search Tool
bp Base pair(s)
CANSA Cancer Association of South Africa
CD Cluster of differentiation
CDK2 Cyclin-dependent kinase 2
D Aspartic acid
DBD Deoxyribonucleic acid binding domain
DC Dendritic cells
ddNTPs Dideoxynucleotides triphosphates
DNA Deoxyribonucleic acid
ds Double-stranded
E Glutamic acid
E2F E2 factor
E6AP E6-associated protein
EDTA Ethylenediaminetetraacetic acid
F Phenylalanine
FTR Formylmethanofuran-tetrahydromethanopterin formyltransferase
xi
G Glysine
G1-phase Growth-1-phase
G2-phase Growth-2-phase
H Hinge region
hc2 Hybrid Capture®2
HD Helicase domain
HIV Human immunodeficiency virus
HPV(s) Human papillomavirus(es)
HR High-risk
HSPGs Heparin sulphate proteoglycans
I Isoleucine
ICTV International Committee on Taxonomy of Viruses
IgG Immunoglobulin-G
K Lysine
L Leucine
LCR Long control region
LR Low-risk
MCHA Microplate colorimetric hybridisation assay
MEGA Molecular Evolutionary Genetics Analysis
MHC Major histocompatibility complex
M-phase Mitosis-phase
MUM1+ Multiple myeloma oncogene-1
xii
N Asparagine
NCR Non-coding region
NGS Next generation sequencing
OBD Origin binding domain
ORF(s) Open reading frame(s)
P Proline
p53 Tumour protein 53
PCR Polymerase chain reaction
pRB Retinoblastoma protein
Q Glutamine
R Arginine
rcf Relative centrifugal force
RFLP Restriction-fragment length polymorphism
RNA Ribonucleic acid
RRP Recurrent respiratory papillomatosis
S Serine
SIN Squamous intraepithelial neoplasia
SNP(s) Single nucleotide polymorphism(s)
S-phase Synthesis-phase
STI Sexually transmitted infection
T Threonine
TAD Transactivation domain
xiii
TAE Tris-Acetic acid- ethylenediaminetetraacetic acid
Tm Melting temperature
UFS University of the Free State
URR Upstream regulatory region
V Volts
VLP(s) Virus-like particle(s)
WGS Whole genome sequencing
xiv
Abstract
Human papillomavirus type 11 (HPV11) is a causative agent of recurrent respiratory papillomatosis
(RRP), a common benign laryngeal neoplasm that presents mainly in children. The genome comprises
three regions: the early region (E1, E2, E4, E5a/b, E6 and E7), the late region (L1 and L2), and the
upper regulatory region (URR). A sequence-based classification system is primarily used to genotype
HPV. The L1 is used for HPV type discrimination, and in combination with the URR, can be used to
differentiate between various lineages. However, optimal sub-lineage classification requires whole
genome sequencing (WGS). A recent study investigating the genomic diversity of globally circulating
HPV11 isolates identified a novel lineage and two novel sub-lineages. It has been proposed that
phylogenetic tree topologies using the sequences of concatenated E5a/b- L1-URR genes, a 208bp
segment of the E2 gene, and the complete genome generates similar tree topologies. Also, there is
currently no published data on the HPV11 intratypic variants circulating in the Free State region. Hence,
this study investigated HPV11 intratypic variants circulating in patients with RRP at the Universitas
Academic Hospital, and aimed to identify novel (sub)lineages through phylogenetic investigations.
The study population included patients diagnosed with RRP caused by HPV11, and sequence data for
geographically distinct HPV11 (sub)lineage representatives. The genetic variation of HPV11 isolated
from patients with RRP was determined by sequencing the E5a/b, L1, URR and a segment of E2 genes.
Four isolates of interest were selected for whole-genome sequencing and phylogenetically analysed to
determine the presence of potentially novel isolates.
Many nucleic heterogeneities and non-synonymous substitutions were identified in isolates
characterised in this study. Phylogenetic analysis of the concatenated L1-URR and E5a/b-L1-URR
resolved into lineages A and B; however, sub-lineage classification was unclear. Analysis of the
complete genome determined the presence of a lineage B isolate and two isolates of interest.
Comparative analysis of genetic variability determined that the concatenated E5a/b-L1-URR could not
reliably classify isolates. A segment of the E2 gene could reliably distinguish between all lineages and
sub-lineages, suggesting that this gene segment contains stable sub-lineage specific single nucleotide
polymorphisms (SNPs) and may serve in sub-lineage identification.
In conclusion this study provides the most comprehensive data on the genomic diversity of HPV11 in
the Free State to date. Results obtained in the current study support WGS for HPV11 classification
below lineage level as a standard, as it generates more information regarding genetic variants.
xv
CHAPTER 1 - Literature review
1.1. Background
The first papillomavirus was identified in the 1930s (Shope & Hurst, 1933). Since then, recombinant
deoxyribonucleic acid (DNA) technology and molecular cloning have led to the discovery of multiple
human papillomaviruses (HPVs) (Boshart et al., 1984; Dürst et al., 1983; Gissmann et al., 1982).
Papillomaviruses target various host tissues and are responsible for forming benign hyperproliferation
and malignancies in human and animal hosts. The association with different conditions such as cancer,
skin warts and laryngeal papillomas has been well-studied (Cornall et al., 2013; Gaylis & Hayden.,
1991; zur Hausen et al., 1975).
HPVs which target mucosal sites are broadly divided into high-risk (HR)-HPV types, probable HR-
HPV types and low-risk (LR)-HPV types based on oncogenic potential (zur Hausen, 2002). HPV6 and
HPV11 are LR-HPV types responsible for genital wart formation and development of papillomas in the
respiratory tract, commonly referred to as recurrent respiratory papillomatosis (RRP) (Kardani &
Bolhassani, 2018b). This disease is characterised by recurrent papillomas forming in the respiratory
tract, most commonly the larynx and may cause significant morbidity and possible mortality due to
airway obstruction, especially in younger children (Seedat, 2020; Seedat et al., 2013; Swain et al.,
2020).
HPVs form part of the Papillomaviridae family and are non-enveloped, double-stranded (ds)
deoxynucleic acid (DNA) viruses containing a small genome of around 8000 base pairs (bp) (Crawford,
1965; Klug & Finch, 1965). The virus genome comprises three regions, which include the early region
(E1-E7 genes) and the late region (L1 and L2 genes) that encode proteins, and the upstream regulatory
region (URR), also recognised as the long control region (LCR) (Kardani & Bolhassani, 2018a).
A classification and nomenclature system for the Alphapapillomaviruses was established in 2011 by
Burk and colleagues. Papillomavirus classification largely relies on phylogenetic analysis, genome
organisation properties, virulence factors and the host tissue the virus affects. A DNA sequence-based
classification system is primarily used to differentiate and genotype HPV (Burk et al., 2011). The L1
open reading frame (ORF) is generally used for HPV type discrimination, and in combination with the
URR, can be used to differentiate between various lineages. However, optimal sub-lineage
classification requires sequencing additional gene segments (Burk et al., 2011). A novel HPV type
identifies as having over 10% variance to any other HPV type. Nucleotide differences in the complete
genome of 1% to 10% and 0.5% to 1% identify variant lineages and sub-lineages, respectively (Burk et
al., 2011).
1
DNA sequencing methods, such as Sanger sequencing and next generation sequencing (NGS), are often
used in phylogenetic studies to investigate HPVs inheritable traits within different organisms and to
detect genomic mutations. By analysing the data acquired by sequencing, phylogenetic trees, which
reflect the evolutionary history of organisms, can be constructed, and the information gathered can then
be used for taxonomic classification and analyses (Burk et al., 2011; Kocjan et al., 2015; Sridhar et al.,
2015). Using phylogenetic analysis, HPV11 was previously divided into subgroups A1 and A2;
however, recently a study on the genomic diversity of globally circulating HPV11 isolates uncovered
two lineages (A and B) and four sub-lineages (A1, A2, A3 and A4) (Jelen et al., 2016), warranting
investigation of further isolates for clarification of lineages currently circulating (Burk et al., 2011).
Previous studies conducted on young adults (age 16 – 24 years) in South Africa determined that over
two-thirds of the population are often infected with multiple HPV types, with approximately 1-5% of
the HPV positive participants infected with HPV11 (Giuliano et al., 2015; Mbulawa et al., 2017, 2018).
A study conducted in the Free State province of South Africa determined that RRP has a predominantly
juvenile-onset (children age 14 and under) (~85%), with approximately 40% of cases being due to
HPV11 (Seedat & Schall, 2018). However, most of these studies only target selected population groups
and an accurate HPV11 prevalence, especially in the respiratory tract, is unknown and presumed to be
much higher.
To date, there is no cure for RRP, and patients often require multiple surgical procedures to remove the
papillomas to prevent airway obstruction (Seedat, 2020). Fortunately, preventative measures are
available. There are three vaccines used for protection against HPV: Cervarix®, Gardasil® and
Gardasil-9®. Two of these vaccines, Gardasil® and Gardasil-9®, protect against HPV11 infection
when administered prophylactically (Barra et al., 2019; Chabeda et al., 2018; Dadar et al., 2018). To
combat infection with HPV, the National Department of Health has incorporated HPV vaccination with
Cervarix® into the school health programs, targeting half a million girls (Delany-Moretlwe et al., 2018).
However, Cervarix® does not confer protection against HPV11 infection (Barra et al., 2019; Chabeda
et al., 2018; Dadar et al., 2018).
Research on the HPV11 lineages and sub-lineages circulating within the community is urgently required
to further study lineage correlation with disease severity, to guide vaccine development, to monitor the
impact of the vaccination campaign on the circulating HPV types, to collect baseline data for future
studies, and to identify novel HPV types if present.
1.2. Discovery of human papillomavirus
In the early 1930s, Rous and Shope identified the first papillomavirus from wart-like lesions in
cottontail rabbits (Sylvilagus sp.), and further investigations led to the discovery of papillomavirus
2
malignant- and transmission capabilities (Rous & Beard, 1935; Shope & Hurst, 1933). Following this
discovery, research into papillomaviruses intensified. In 1949, the first virus particles in a human
papilloma extract were identified using electron microscopy (Strauss et al., 1949), and 16 years later,
in 1965, the human wart virus's genome was determined to be circular, dsDNA with a protein capsid
comprised of 72 pentameric capsomers (Crawford, 1965; Klug & Finch, 1965).
Until the 1970s, it was believed that there was only a single type of HPV that caused the formation of
warts in various tissue sites. However, within the decade, recombinant DNA technology and molecular
cloning techniques led to the discovery of a plurality of HPV types, such as HPV6, HPV11, HPV16 and
HPV18, with different tropisms for mucosal or cutaneous squamous surfaces (Boshart et al., 1984;
Dürst et al., 1983; Gissmann et al., 1982). In the wake of these studies, the involvement of HPV in
malignant cell transformation was hypothesised and would later prove to be a major focus area in HPV
research (Gissmann et al., 1977; Gissmann & zur Hausen, 1976; zur Hausen et al., 1975).
In 1982, molecular cloning of papillomavirus genomic DNA extracted from a respiratory papilloma led
to the discovery of HPV11 (Gissmann et al., 1982). Further investigations eventually revealed that
HPV11 was also responsible for genital wart development (Gissmann et al., 1983). The first complete
nucleotide sequences of ten HPV11 isolates were determined and characterised in 1986 in Slovenia,
and approximately a decade later, the URR sequences of 40 HPV11 isolates were determined, providing
further insight into the genetic code of HPV11 types and intratypic variants (Dartmann et al., 1986;
Heinzel et al., 1995). Only again in 2009, was the complete sequence of HPV11 obtained from a cervical
swab of a patient with an unspecified genital disease (Wu et al., 2009).
The first extensive sequencing study to obtain complete genome sequences of HPV11 isolates was
performed in 2011 in Slovenia (Maver et al., 2011). Phylogenetic analysis revealed sequences
corresponding to the prototypic- and non-prototypic variant group based on specific nucleotide
signatures and several novel HPV types and potentially significant mutations (Maver et al., 2011). The
prototypic variant group and the non-prototypic variant group were renamed A1 and A2, respectively,
when a new system for the classification and nomenclature was described for HPV variants (Burk et
al., 2011). More recently, a study investigating the genomic diversity of globally circulating HPV11
isolates revealed two lineages (A and B) and four sub-lineages (A1 - A4) (Jelen et al., 2016). It has been
established that HPV has a slow evolutionary rate, and genomic variation is predominantly a
consequence of genetic drift (Bernard, 1994; van Doorslaer, 2013). Therefore, discovering any new
HPV lineages or sub-lineages requires additional investigation into the evolutionary rate and the genetic
relationship of HPV11 variants to further characterise these variants.
3
1.3. Human papillomavirus genome and proteins
HPVs are non-enveloped, dsDNA viruses with a small genome of around 8000bp (Kardani &
Bolhassani, 2018a). Many proteins necessary for viral infection, cellular gene expression, and immune
evasion are encoded by the HPV ORFs. The genome comprises three regions, including the early ORF,
late ORF, and the URR (Figure 1). The early region is approximately 4000bp, and codes for non-
structural proteins, whereas the late region is approximately 3000bp, and codes for structural proteins
(Kardani & Bolhassani, 2018a). The URR is approximately 1000bp and contains elements involved in
regulating viral replication and gene expression (Harari et al., 2014; Kardani & Bolhassani, 2018a;
Ribeiro et al., 2018).
Throughout evolution, papillomaviruses have gained and lost protein coding genes; however, all
papillomaviruses encode for a minimum of five proteins, including E1, E2, E4, L1 and L2, and contain
a URR (Kardani & Bolhassani, 2018a). Therefore, a virus containing only the basic set of core genes
(E1, E2, L1, L2 and URR) should theoretically be able to infect a cell and replicate. It is thus
hypothesised that ancestral papillomaviruses comprise the essential genes and no additional adaptive
proteins. All papillomaviruses contain at least one adaptive protein, such as E5, E6 or E7, which plays
a role in immune evasion and cellular growth (Kardani & Bolhassani, 2018a, 2018b). Moreover, genes
that encode adaptive proteins such as E5, E6 and E7, have been associated with malignant
transformation, although not all papillomaviruses encode for all these adaptive proteins, and not all
papillomaviruses have an association with malignant transformation (Egawa et al., 2015; van Doorslaer,
2013).
1.3.1. The early region
The early region comprises several ORFs that encode non-structural proteins, which play a regulatory
function immediately following infection. These ORFs can be divided into the regulatory genes (E1,
E2 and E4), which play a crucial role in virion synthesis, and three oncoproteins (E5, E6 and E7). The
ORFs in the early region are expressed within the deeper, less differentiated layers of the infected tissue,
such as the basal layer (Harari et al., 2014; Kardani & Bolhassani, 2018a).
The E1 and E2 genes, play pivotal roles during replication and amplification of the viral circular dsDNA
genome and are essential for virus survival. E1 is separated into three functional regions: The N-
terminal regulatory region, which stimulates cyclin-dependent kinase 2 (CDK2) phosphorylation, the
central origin-binding domain (OBD), which binds E1 and E2 protein forming a complex (E1-E2), and
the C-terminal helicase domain (HD) which acts as adenosine triphosphate (ATP)-dependent helicase
(Graham & Faizo, 2017; Kardani & Bolhassani, 2018a). Functions of this protein include optimising
viral reproduction in vivo, recognising the origin of replication, and assembling into a double hexameric
4
complex that unwinds DNA duplexes upstream of the DNA replication fork. E2 contains a DNA-
binding domain and a protein-binding domain (Graham & Faizo, 2017; Kardani & Bolhassani, 2018a;
Wallace & Galloway, 2014).
M14119.1
Human
papillomavirus
type 11 (HPV11)
complete
genome
7933bp
Figure 1: Genome organisation of a low-risk human papillomavirus type 11 (HPV11).
E1-E7 early genes, L1-L2 late genes, URR Upper regulatory region, AE early polyA signal, and P97 and P742 promotors are
indicated. The figure is drawn based on the HPV11 prototype A1 with a 2bp insertion at genomic position 7717-7718 (GenBank
accession number M14119.1) (Maver et al., 2011).
The E2 protein is a multifunctional protein encoded by all papillomaviruses expressed during early and
intermediate viral life cycle stages. The E2 has several primary functions essential for the viral life
cycle, such as regulating expression levels for viral gene products in the early stages of the HPV life
cycle, recruiting the E1 helicase protein to the viral origin of replication for replication initiation, and
delivering the replicated viral genome to daughter cells during the division of the parent cell.
Additionally, the E2 plays a role in transcription as a transcription factor and ensures low viral
concentrations in the early stages of the HPV life cycle. Mutation or integration of the viral genome can
result in the inactivation of the E2 protein, leading to over-expression of E6 and E7 genes (Graham &
Faizo, 2017; Kardani & Bolhassani, 2018a; Wallace & Galloway, 2014).
5
Replication is initiated when E2 recruits E1 to the viral origin (Figure 2). The recruitment step involves
a crucial protein-protein interaction between the transactivation domain (TAD) of the E2 protein and
the HD of E1. Then, additional E1 molecules are recruited, and the E2 protein promotes assembly into
a replication-competent double hexameric helicase. ATP likewise promotes the oligomerisation of E1
and is further required to power E1 helicase activity. Lastly, E1 interacts with host cell replication
factors to promote bidirectional replication of the viral genome (Graham & Faizo, 2017; Kardani &
Bolhassani, 2018a; Wallace & Galloway, 2014) (Figure 2).
The HPV E4 ORF is located within the E2 ORF and is the first gene expressed in the later stage of
infection. E4 does not possess an initiation codon and uses the E2 initiation codon. The E4 protein plays
a role in virion release by restructuring the epithelial cell's cytokeratin and may play a part in genome
amplification and the enhancement of virion synthesis. The E4 protein displays various effects on cell
behaviour, such as suppressing the host cell's DNA synthesis to promote apoptosis in terminally
differentiated cells by interacting with the host mitochondria (Graham & Faizo, 2017; Kardani &
Bolhassani, 2018a). In addition to this, it has also been reported to have a potential role in HPV
screening (Yajid et al., 2017).
The E5 protein is a transmembrane hydrophobic protein expressed late in infection made up of
approximately 83 amino acids. The protein has several functions, including immune evasion via major
histocompatibility complex (MHC) repression and regulating apoptosis. Moreover, the E5 protein plays
a role in cell cycle pathways via interaction with growth factor receptors (Graham & Faizo, 2017;
Kardani & Bolhassani, 2018a). Similar to other HR proteins, the E5 influences cellular gene expression
by regulating small non-coding ribonucleic acid (RNA) molecules, thereby exhibiting oncogenic
activity (Georgescu et al., 2018).
The E6 and E7 proteins exhibit oncogenic activity by associating with tumour protein 53 (p53) and
retinoblastoma protein (pRB) (Altamura et al., 2018; Georgescu et al., 2018; Pal & Kundu, 2020).
Expression of these ORFs in the lower epithelial layer triggers differentiating cells of the suprabasal
layers (the thin layer above the basal layer) to re-enter the synthesis-phase (S-phase) in which DNA
replicates. This phase is highly regulated and conserved, and dysregulation generally leads to cell
apoptosis (Georgescu et al., 2018; Kardani & Bolhassani, 2018a). E6 proteins are responsible for
degrading p53, which inhibits apoptosis, and therefore the infected cells continue viral DNA replication
by recruiting E1/E2 complexes which play a significant role in replication. The pRB gene acts as a
tumour suppressor by suppressing the replication of enzyme expression genes (Pal & Kundu, 2020).
When E7 binds to pRB, the complex inactivates, stimulating replication and cell division. Association
of the oncoproteins with these tumour suppressor genes leads to disruption of the physiological
functions of these genes and subsequent possible malignant transformation (Pal & Kundu, 2020).
6
Figure 2: Schematic drawing of human papillomavirus type 11 (HPV11) replication initiation mechanisms.
Early proteins E1 and E2 are involved. Locations of the functional domains in the proteins are indicated. OBD origin binding
domain, TAD transactivation domain, DBD Deoxyribonucleic acid binding domain, and H hinge region. Replication is
initiated by E1 (red), by E2 (blue), to the OBD. This involves an interaction between the E2 TAD and the E1 helicase domain.
E2 recruits additional E1 molecules. Adenosine triphosphate (ATP) also stimulates helicase activity of E1, which interacts
with host cell replication factors to promote bidirectional replication of the viral genome. E1 interacts with host cell replication
factors to promote bidirectional replication of the viral genome. Adapted from D'Abramo & Archambault (2011) and Stenlund
(2003).
Recent studies have demonstrated the role of oxidative stress and chronic inflammation in carcinogenic
development. The oncogenes E5, E6 and E7, are all involved in developing chronic inflammation,
which induces oxidative stress, leading to cell damage and subsequent malignant transformation
(Georgescu et al., 2018). The type of papillomavirus influences the oncogenicity of E5, E6 and E7.
HPV types such as HPV11 have limited activity compared to the other HPV types and are classified as
LR-HPV. LR-HPV types, unlike HR-HPV types, do not degrade pRB or p53 and are less likely to result
in malignant transformation than their HR counterparts (Egawa & Doorbar, 2017; Klingelhutz &
Roman, 2012; Pal & Kundu, 2020).
7
1.3.2. The late region
The late region contains two ORFs, namely L1 and L2, which express structural- or capsid proteins
required for transmission and survival of HPV in the upper, more differentiated layers of the infected
tissues (Buck et al., 2013; Wang & Roden, 2013). The L1 major capsid protein can spontaneously self-
assemble into virus-like particles (VLPs), presenting an icosahedral exterior surface. The L1 gene is
highly conserved, but contains a small number of lineage specific nucleotide polymorphisms that makes
it useful for typing (Buck et al., 2013; Burk et al., 2011). The structure is highly immunogenic and has
formed the basis of successful vaccines targeted against HR-HPV types (Buck et al., 2013).
The L1 capsid surface contains knob-like structures constituting 360 copies of the protein organised
into 72 pentameric capsomers with a copy of L2 protein in the centre. The L1 has two termini, namely
the N- and C termini, arranged as 'invading arms' arranged to form the floor between the knob-like
capsomeres (Buck et al., 2013; Conway & Meyers, 2009). The capsid undergoes various essential
conformational changes during the virus life cycle. These conformational changes mediate vital
functions for virus survival, including encapsidation of the papillomavirus genome, interaction with the
host cell for infectious entry, and releasing the viral DNA into a new host cell (Buck et al., 2013).
The capsid protein L2 has complex roles in the biology of all papillomaviruses, most notably in virus
assembly and the infectious process. Unlike the major capsid protein L1, the minor capsid protein L2
cannot self-assemble into VLPs, although it can be integrated into VLPs when co-assembled and
expressed with the L1 protein (Conway & Meyers, 2009; Wang & Roden, 2013). Roles of the L2 protein
include facilitation of efficient genome encapsidation, vesicular trafficking of the viral genome in the
direction of the host cell nucleus and escaping the vesicular compartment to travel to the host cell
nucleus for successful infection (Conway & Meyers, 2009; Wang & Roden, 2013).
1.3.3. Upper regulatory region
The URR is the most variable region within the HPV genome since it does not encode for proteins.
Consequently, it is capable of accumulating and tolerating more nucleic mutations. The URR is located
between the E6 ORF and L1 ORF and consists of approximately 850bp (Kardani & Bolhassani, 2018a;
Ribeiro et al., 2018). The URR interacts with numerous cellular and viral factors and is involved with
functions such as virus replication, gene expression, and transcription. The region divides into three
segments which all include E2 binding sites. The URR includes the 5' segment, the central segment and
the 3' segment. The 3' segment also bears the E1 binding site, which overlaps with the origin of
replication (Graham & Faizo, 2017; Ribeiro et al., 2018). The URR includes genetic elements, for
example, early promotors, the transcription enhancer, the origin of replication, the late polyadenylation
8
site, and the late regulatory element. Mutations in the URR may impact binding sites and functionality
(Fang et al., 2020; Graham & Faizo, 2017; Ribeiro et al., 2018).
1.4. Classification and taxonomy of human papillomavirus
Initially, papillomaviruses and polyomaviruses were grouped into one family, namely the
Papovaviridae. These viruses were subsequently separated into two families, Papillomaviridae and
Polyomaviridae, due to the lack of nucleotide and amino acid similarities, different genome sizes and
genome organisation (Bernard et al., 2010; Fauquet et al., 2005; van Regenmortel et al., 2002).
Papillomaviruses consists of a family of viruses that classifies into different genera, species, and types.
HPVs consist of three significant genera, including the Alphapapillomaviruses, Betapapillomaviruses
and Gammapapillomaviruses, but smaller genera Mupapillomavirus and Nupapillomavirus, also exist
(Bernard et al., 2010; de Villiers, 2013; Murahwa et al., 2019).
Throughout the evolutionary process, nucleotide sequences across the HPV genome have transformed.
Evolutionary changes in papillomaviruses' hosts created new ecological niches for papillomaviruses to
adapt to, resulting in the formation of different clades. These viruses then co-evolved alongside their
specific hosts resulting in further co-speciation (Bernard, 1994; van Doorslaer, 2013). HPV tropism is
used to classify viruses into cutaneous viruses or mucosal viruses. Mucosal viruses are further divided
into LR, probable HR, and HR viral types (Egawa & Doorbar, 2017; Egawa et al., 2015).
A DNA sequence-based classification system is primarily used to classify HPVs. The International
Committee on Taxonomy of Viruses (ICTV) established the designation and naming of virus taxa based
on recommendations from the Papillomaviridae Study Group, which consists of 11 researchers
(https://talk.ictvonline.org/taxonomy/). A classification and nomenclature system for the
Alphapapillomaviruses was established in 2011 by Burk and colleagues (2011). This study was the first
to describe the nomenclature system for HPV6 and HPV11 variants based on whole-genome analyses.
The ORF nucleotide sequence coding for the capsid protein L1 is used to classify HPV types and
represent whole-genome variation due to the high level of conservation of the L1 gene (Bernard et al.,
2010; Burk et al., 2011; de Villiers et al., 2004; de Villiers, 2013). Consensus primers that target the L1
ORF are used to differentiate between various HPV types. These amplicons can be sequenced and
aligned to existing HPV prototypes to identify HPV types, lineages and sub-lineages or any nucleic
heterogeneity (Bernard et al., 2010; Burk et al., 2011; de Villiers et al., 2004; de Villiers, 2013). A
novel HPV type has less than 90% similarity to any HPV type. Nucleotide differences in the complete
genome of between 1.0% to 10.0% and 0.5% to 1.0% define variant lineages and sub-lineages,
respectively (Burk et al., 2011). Data acquired by sequencing is used to construct phylogenetic trees
and used for taxonomy (Burk et al., 2011). Approximately 220 HPV types have thus far been identified
9
as of 2021 according to the International HPV Reference Centre (Bernard et al., 2010; de Villiers et al.,
2004) (https://www.hpvcenter.se/human_reference_clones).
The HPV11 genome was formally classified into two sub-lineages (A1 and A2) based on complete
HPV11 genomes (Burk et al., 2011; Maver et al., 2011). A more recent study on the global genomic
diversity of HPV11 identified a separate lineage (lineage B) and two additional sub-lineages (sub-
lineages A3 and A4) (Jelen et al., 2016). These results were also generated by sequencing the E5a ORF,
E5b ORF, L1 ORFs and the URR, which reliably produced the same phylogenetic results as whole-
genome sequencing. Moreover, this study identified a 208bp region found between the 3' end of the E2
ORF and the 5' of the non-coding region-2 (NCR2) that is the most suitable genomic region to sequence
to distinguish all identified HPV11 lineages and sub-lineages when whole-genome sequencing is
impractical (Jelen et al., 2016). A review of literature revealed that additional studies on HPV11 are
required to investigate the heterogeneity observed in this study.
1.5. Life cycle of human papillomavirus
Papillomaviruses all follow similar life cycles but are species-specific regarding infection and
replication within hosts (Doorbar et al., 2012; Kardani et al., 2018). They infect epithelial cells, which
are part of the cutaneous and mucosal tissue that generally serves as a protection against the external
environment (Madison, 2003; Stanley, 2020).
Basal cells compose the bottom layer of the epithelial layer and are capable of mitotic division. During
mitosis, a daughter cell moves up through the epithelial layer to replace non-living surface epithelial
cells. This movement marks the end of the cell cycle and the beginning of terminal differentiation,
resulting in the loss of the nucleus (Kardani et al., 2018; McBride, 2017). The cell cycle occurs in four
phases. Cellular growth occurs in the growth-1-phase (G1-phase), host DNA replicates in the S-phase,
the cells prepare for division in the growth-2-phase (G2-phase), and chromosomes are separated, and
daughter cells form in the mitosis-phase (M-phase) (Kardani et al., 2018; McBride, 2017). HPVs do
not possess replication machinery apart from the E1 replication enzyme. To overcome this, HPVs
depend on the epithelial cellular replicative enzymes for propagation and enter basal cells to link virus
and host replication life cycles (Doorbar et al., 2012; McBride, 2017).
There are four main stages of replication during the HPV life cycle: virus entry, establishment,
maintenance of the non-productive infectious state, and virus production (Kardani et al., 2018;
McBride, 2017). HPV replication is depicted in Figure 3. The first phase of HPV replication is host cell
entry. Infection with HPV is thought to require epithelium micro-abrasions to allow access of the virus
into actively dividing basal cells; however, multiple entry-pathways have been reported for the immense
diversity of HPV types (Graham & Faizo, 2017; Kardani et al., 2018; McBride, 2017). The L1 capsid
10
protein then binds to the host cell receptor such as heparin sulphate proteoglycans (HSPGs) which are
the primary receptors. Receptor types and strategies may be dependent on the HPV type and the host
cell. The virus enters the host cell using endocytosis and travels through vesicular trafficking to the
nucleus, where the virus enters the nucleus through nuclear pores or following cell mitosis (Graham &
Faizo, 2017; Kardani et al., 2018; McBride, 2017).
Once the virus has successfully entered the host cell, the second- or non-productive infectious state is
established. The initial amplification phase of the virus is thought to be transient and rapid, and relies
on host cell machinery for replication (Graham & Faizo, 2017; Kardani et al., 2018; McBride, 2017).
Early genes necessary for the initial genome amplification are expressed (E1, E2, E6 and E7). The URR,
E1 and E2 are the primary genes required for successful replication. The E1 and E2 proteins are
dependent on the host's replication machinery and DNA polymerase. In HR-HPVs, the E6 and E7
proteins have a pivotal role in driving cell proliferation in the basal layers and subverting pathways that
signal cell growth arrest, thus supporting viral genome replication. In LR-HPVs, these proteins do not
have a clear role in replication (Graham & Faizo, 2017; Kardani et al., 2018; McBride, 2017). The viral
genome doubles and segregates the viral DNA into daughter cells during basal cell division and
replicates with the host DNA as episomes (S-phase). Low copy numbers (50−200 copies) of the viral
episomes are maintained by maintenance replication to avoid host immune system detection (Graham
& Faizo, 2017; Kardani et al., 2018; McBride, 2017) (Figure 3).
In the third phase, the non-productive infectious state is maintained. Viral genome maintenance in
proliferating basal cells follows a burst of viral DNA replication, also known as vegetative viral DNA
amplification. The E4 and E5 proteins are expressed during this phase, although E4 is not essential for
HPV11 replication (Graham & Faizo, 2017; Kardani et al., 2018; McBride, 2017). Similarly, the late
proteins (L1 and L2) necessary for viral assembly are expressed (Buck et al., 2013; Wang & Roden,
2013). After differentiation of basal cells into keratinocytes, the cells exit the cell cycle (Graham &
Faizo, 2017; McBride, 2017). The purpose of the E6 and E7 proteins in HR- and LR-HPVs is to preserve
a reservoir of infection and direct differentiating cells into the S-phase for viral amplification in the
upper basal layers (Egawa & Doorbar, 2017) (Figure 3).
In the last phase of viral replication, large numbers of progeny viral genomes are synthesised. Structural
proteins expressed in the upper layers of epithelium and progeny virion particles are assembled and
released upon cell death (Conway & Meyers, 2009; Graham & Faizo, 2017; Kardani et al., 2018;
11
McBride, 2017) (Figure 3). When HPV infects and replicates within the respiratory tract mucosa,
papillomas may develop (Benedict & Derkay, 2021; Hoesli et al., 2020; Seedat et al., 2013).
Figure 3: Human papillomavirus (HPV) replication in epithelial cells.
HPV moves through the epidermis to promote infection. The proteins involved in the process include early proteins (E1, E2,
E4, E5, E6 and E7) and late proteins (L1 and L2). Modified from Stanley, 2020.
1.6. Transmission of human papillomavirus
HPV is commonly transmitted during sexual contact with an infected partner, even when asymptomatic
(Kardani et al., 2018). According to the Cancer Association of South Africa (CANSA), approximately
80% of people are infected with HPV at least once before the age of 50 years (https://cansa.org.za/).
Globally, HPV is characterised as a prevailing sexually transmitted infection (STI). HPV can also be
transmitted via auto-or hetero-inoculation, for example, kissing, non-penetrative sex, genital scratching,
or finger-genital contact (Sabeena et al., 2017). In addition, HPV is also known to persist in a person's
mouth, and HPV transmission between family members can thus be expected (Sabeena et al., 2017).
Although sexual transmission of HPV is predominant, HPV DNA has been identified in lesions in
infants, young children, and women who are not yet sexually active, which suggests that the virus can
be transmitted from an infected mother to child during or before childbirth (Kardani et al., 2018;
Sabeena et al., 2017; Zahreddine et al., 2020; Zouridis et al., 2018). Infants born via caesarean section
tend to have lower HPV transmission rates than those born naturally, although this delivery mode does
not exclude the probability of infection (Tseng et al., 1998; Zouridis et al., 2018). Many well-conducted
prospective studies have found that vertical transmission of HPV to newborn babies is relatively rare,
with less than 5% of children being infected with HPV from their mother (Smith et al., 2010; Watts et
12
al., 1998; Zouridis et al., 2018). However, results vary significantly, with other studies reporting higher
(>20%) rates of mother-to-infant transmission (Hahn et al., 2013; Rintala et al., 2005; Tseng et al.,
1992, 1998). A systematic review of the literature published in 2018, including 421 HPV-positive
mothers and their progenies, showed a 4.94% rate of vertical transmission of HPV and the relative risk
of vertical transmission of HPV between women undergoing caesareans and vaginal deliveries to be
0.912 (Zouridis et al., 2018).
Few studies have reported on the differences between intrauterine transmission and transmission during
labour and delivery. These studies produced varying results, with some studies unable to confirm
intrauterine transmission and other studies reporting high rates of intrauterine HPV transmission.
Nonetheless, an HPV positive mother continues to be a risk factor of infant infection. The prevalence
of HPV infection in children infected in this manner is an essential area of research due to the impact
on vaccination and management strategies.
1.7. Epidemiology of human papillomavirus type 11 in South Africa
A study conducted in the Western Cape between November 2012 to July 2013 on young women (age
16–24 years) determined that 71% of the population was infected with HPV, with half (50.1%) of the
HPV positive population infected with multiple HPV types. It was reported that between 1% and 3.1%
of the HPV-positive population was infected with HPV11 and HPV6, respectively. These are the HPV
types responsible for RRP development. This study also found that the younger women in the study
population were more likely to test HPV positive than the older women (83% among ages 16−17 years)
(Giuliano et al., 2015).
A study that recruited sexually active, human immunodeficiency virus (HIV)-negative young women
(age 16–22 years) from Cape Town and Soweto between November 2013 and December 2014 reported
a lower HPV prevalence among the study population (66%). In addition, this study revealed a lower
percentage of infection with multiple HPV types (41.6%). However, a higher prevalence of HPV11 and
HPV6 infections (~4% and ~6%, respectively) in the HPV-positive population was reported (Mbulawa
et al., 2018). According to a previous surveillance study (2015-2016) for cervical HPV infections on
unvaccinated women aged 18-20 years, it was determined that approximately two thirds (64.4%) of the
South African female population was infected with HPV, with a clear association between HPV and
HIV. In addition, most women were infected with more than one HPV type with a median of three HPV
types. Genotyping revealed that 3.4% of the population who tested HPV positive were infected with
HPV11, and 4.8% were infected with HPV6 (Mbulawa et al., 2017).
A recent cross-sectional study showed that among women aged 38–55 years visiting a community
health clinic in the OR Tambo district municipality of the Eastern Cape province of South Africa, HR-
13
HPV prevalence was 28.5%. Of the 417 participants, 40.7% of women infected with an HR-HPV type
were HIV-positive, and 14.4% of women reported having ulcers or warts in their lifetime. In addition,
HIV-positive women with normal or abnormal cytology had a higher viral load than HIV-negative
women (Taku et al., 2020). However, a limitation of most of these studies is that they targeted specific
population groups or sampling areas and did not include children who predominantly suffer from RRP.
1.8. Human papillomavirus pathogenicity
HPVs are ubiquitous in the human population and often cause morbidity. LR-HPVs cause many
seemingly benign lesions such as genital-, common-, and flat warts, as well as verrucas and other skin
lesions. Generally, lesions caused by HPV are self-limiting and are frequently cleared by the cell-
mediated response (Sterling et al., 2001). However, HPV associated papillomas are often refractory to
treatment and may persist. The development of cancer due to LR-HPV infection requires the host to be
genetically susceptible or immunosuppressed to endure raised viral gene expression and recalcitrant
lesions to progress to malignancy (Egawa et al., 2015; Georgescu et al., 2018; Pinidis et al., 2016;
Reidy et al., 2004). Nevertheless, the association between LR-HPV and carcinoma remains ambiguous.
RRP, which is associated with HPV11, can persist for years regardless of surgical intervention, and can
in some instances lead to malignant lesions in the lower respiratory tract and lungs (Gerein et al., 2005;
Reidy et al., 2004). In these circumstances, the viral genome is integrated into the hosts' cell
chromosome, signifying deregulated viral gene expression pursued by acquiring additional genetic and
epigenetic modifications as seen in HR-HPV types (Huebbers et al., 2013; Reidy et al., 2004). In some
RRP-associated cancers, rearrangement of the HPV11 genome has been observed, but viral gene
expression patterns in this regard are still poorly understood. Compared to the prevalence of HPV6 and
HPV11 infection in individuals, the development of RRP is rare, which may suggest that this disease
should be considered as a multigene disease as HPV type and tissue-specific immune deficiency hinders
the clearance and management of HPV6 and HPV11 infection. Therefore, in individuals who develop
this disease, long term follow-up is essential (Bonagura et al., 2010; Seedat, 2020; Seedat et al., 2013).
HPV pathogenicity is determined by genotype, epithelial micro-environment, and the infection site and
consequent pathogenesis is influenced mainly by viral protein function and regulation (Georgescu et
al., 2018; Stanley, 2020). Knowledge of molecular pathogenesis at the gene regulation and protein
function level is necessary to explain why LR-HPVs are less likely to cause progression to malignancy
compared to HR-HPVs (Georgescu et al., 2018; Stanley, 2020). In tumour cells, the virus integrates
into the host genome and interrupts the functions of the E2 protein, which is responsible for E6 and E7
transcription, and subsequently results in the E6 and E7 oncoproteins being expressed at elevated levels.
As a result, infected cells are immortalised and unable to differentiate (Georgescu et al., 2018;
Longworth & Laimins, 2004; Stanley, 2020). In contrast to LR-HPV, HR-HPV E6 and E7 proteins
14
disturb epithelial differentiation and apoptosis by binding to host cell proteins with high affinity
(Stanley, 2020). Epithelial differentiation and apoptosis disruption are stimulated through cellular
proliferation, synthesis of viral DNA and interference in the cell cycle (Stanley, 2020).
In HR-HPVs, the E6 proteins form a complex with p53 and a ubiquitin ligase, E6-associated protein
(E6AP), instigating cell cycle arrest and loss of apoptotic functions by blocking progression at the G1/S-
phase checkpoint (Altamura et al., 2018; Pal & Kundu, 2020). The E7 protein inactivates the pRB
protein which disrupts its interaction with the E2 factor (E2F) at the G1-phase which subsequently
results in disruption at the cell cycle control checkpoints (Liu et al., 2008; Pal & Kundu, 2020).
Additionally, the E7 protein interacts with the deacetylases, an enzyme, and the cyclin and cyclin-
dependant kinase regulatory proteins to alter the cell cycle (Wang & Roden, 2013). The interaction of
E7 with enzymes triggers DNA synthesis and cell replication in generally inactive mature epithelial
cells, subsequently stimulating differentiation‐dependent viral DNA amplification, leading to
pathological cell growth (Bedell et al., 1991; Pal & Kundu, 2020). The virions then assemble and are
released when the squamous cell layer is reached (Conway & Meyers, 2009).
1.9. Host immune response to human papillomavirus
People with RRP often develop a robust serological response to the HPV vaccine; however, a
serological response against natural infection is often delayed and produces low levels of antibodies.
The delayed serological response is attributable to low levels of exposure to viral proteins due to
immune evasion and the absence of viremia. It is postulated that only 50-70% of people infected with
HPV develop specific antibodies (Buchinsky et al., 2020; Zahreddine et al., 2020).
Initially, patients diagnosed with RRP generate a measurable serum antibody response to HPV6 and
HPV11 infection, indicating viral immune recognition. Therefore, immune dysfunction in patients
diagnosed with RRP is indicative of HPV6 and HPV11 tolerance instead of a lack of viral recognition
(Buchinsky et al., 2020; Ivancic et al., 2020). Furthermore, studies have demonstrated site‐specific
immune tolerance of HPV6 and HPV11 in the mucosa, suggesting that RRP is a multigene disease that
polarises the immune system to tolerate local and chronic HPV6 and HPV11 infection (Buchinsky et
al., 2020; Ivancic et al., 2020).
The adaptive immune system includes naïve T-lymphocytes that differentiate into cluster of
differentiation (CD) 4+ and CD8+ cells (cell-mediated immunity) and B-lymphocytes, including
plasma- and memory cells producing antibodies (humoral immunity). Phagocytes and antigen-specific
cytotoxic T-lymphocytes, which form part of the cell-mediated immunity, are activated, and cytokines
are released in response to the antigen. Due to cell-mediated immunity, the vast majority of papillomas
regress within two years (Ivancic et al., 2020). B-lymphocytes, which display terminal differentiation
15
and plasma cells, named multiple myeloma oncogene-1 (MUM1+), produce antibodies that correlate
with disease severity. HPV vaccination increases the level of antibodies and memory cells, thereby
boosting immunity to HPV (Buchinsky et al., 2020; Ivancic et al., 2020).
Conversely, the innate immune system is a non-specific system that includes cells such as dendritic-,
mast-, and natural killer cells, as well as macrophages and granulocytes. The innate immune system is
activated by the presence of antigens, such as the L1 and L2 capsid proteins (Ivancic et al., 2020). The
system further includes infiltration of leukocytes and production of nitric oxide, cytokines, and
chemokines at the site of infection. Antibodies developed against HPV are associated with containment
of the virus and papilloma regression (El Achkar et al., 2020; Ivancic et al., 2020).
Infection with HPV6 and HPV11 stimulates viral immune recognition and mounts a measurable serum
antibody response (Ivancic et al., 2020; El Achkar et al., 2020). The CD3+ T-cells count represents the
total number of T-lymphocytes. A higher number of CD3+ cells has been found in adults, who generally
present with less severe RRP. A lower number of CD8+ cells was observed in juvenile patients, who
generally present with more severe RRP. The immature juvenile immune system may affect the
presentation of antigens and the secretion of pro-inflammatory cytokines, contributing to the severity
of RRP. A higher number of CD8+ cells were detected in patients displaying low-grade dysplasia,
whereas the CD4+ count remained unchanged in different degrees of dysplasia (El Achkar et al., 2020).
Regulatory T-cells are responsible for immune self-tolerance by suppressing the activation of T-cells.
A lower number of CD4+ regulatory T-cells, such as T-helper cells and natural killer cells, are observed
in patients with frequent relapses (Ivancic et al., 2020; El Achkar et al., 2020).
A study describing the vertical transmission and clearance of immunoglobulin-G (IgG) antibodies
against HPV6, HPV11, HPV16 and HPV18 in children showed that antibodies from newborns and
mothers were moderately correlated, and that 80–100% of anti-HPV antibodies were cleared within the
first two years of life in seropositive newborns, suggesting vertical transfer of the antibodies
(Zahreddine et al., 2020). Thus, serological studies on HPV antibodies are essential to study natural
immunity and to monitor the impact of HPV vaccination programs.
1.10. Recurrent respiratory papillomatosis
RRP is a debilitating disease characterised by the recurrent formation of benign papillomas, more
commonly referred to as warts, in the mucosa of the respiratory tract (Benedict & Derkay, 2021). The
glottic and supraglottic regions of the larynx are most frequently affected. There are two forms of RRP,
namely juvenile-onset RRP and adult-onset RRP (Novakovic et al., 2018; Seedat, 2020; Swain et al.,
2020). HPV6 and HPV11 mainly cause RRP. However, HPV types 16, 18, 31, 33, 39, 44, 45, 55, and
16
70 have also been identified in respiratory papillomas (Hoesli et al., 2020; Peñaloza-Plascencia et al.,
2000).
Papillomas in the airway cause dysfunction in the larynx and trachea and may lead to morbidity and
possible mortality due to airway obstruction. Although the papillomas are histologically benign, the
disease may be life-threatening to some individuals if treatment is not sought (Benedict & Derkay,
2021; Seedat, 2020; Swain et al., 2020).
A study reviewing data from patients diagnosed with RRP between 2011 and 2015 in South Africa
established that RRP in South Africa has a predominantly juvenile onset. This study reported the overall
incidence of RRP to be 0.51 per 100000 population per year and the prevalence of RRP to be 1.39 per
100000 population. In children, the incidence of RRP was 1.34 per 100000 children per year and
prevalence 3.88 per 100000 children. This study also found that RRP in children caused by HPV11
tended to be diagnosed at a younger age (median 3.2 years) than RRP caused by HPV6 (median 5.6
years) (Seedat & Schall, 2018).
1.10.1. Risk factors associated with the development of recurrent respiratory papillomatosis
Numerous studies have compared the severity of RRP with the HPV genotype. It has been established
that infection with HPV11 is possibly more severe, especially in the younger population, due to the
rapid regrowth of the lesions (Intakorn & Sonsuwan, 2014; Omland et al., 2014; Seedat, 2020). HPV
infection disease is transmitted during childbirth or sexual contact with an infected partner (Kardani et
al., 2018; Sabeena et al., 2017; Zouridis et al., 2018). Condylomas during pregnancy are regarded as a
significant risk factor for acquiring juvenile-onset RRP by vertical HPV transmission (Rodier et al.,
2013).
Risk factors for the development of RRP include any immunodeficiency or co-infection, particularly
with herpes viruses and HIV. Prolonged exposure to the virus and a high viral load may also be risk
factors for HPV11 infection and RRP development. Furthermore, the number of children born by a
mother and the age at which birth is given may increase the likelihood of HPV11 infection and the
development of associated diseases (Rodríguez-Álvarez et al., 2018; Taku et al., 2020). Another factor
that may govern a patient's susceptibility to RRP includes genetic predisposition (Hahn et al., 2013;
Rintala et al., 2005; Smith et al., 2010; Zouridis et al., 2018). However, more research on the vertical
transmission of genes that increase the likelihood of HPV infection is needed. The tissue adjacent to the
site of infection may also act as a latent virus reservoir. A trigger such as recuperation after surgical
intervention to remove papillomas may prompt reactivation and replication of the HPV in the
surrounding tissue (So et al., 2019).
17
Previous studies reported that RRP, although a benign neoplasm, might progress to carcinoma.
Consequently, patients diagnosed with RRP showing evidence of dysplastic epithelium may be at
possible increased risk of developing laryngeal cancer (Benedict & Derkay, 2021; Cornall et al., 2013;
Georgescu et al., 2018; Pinidis et al., 2016). However, a study on the association between LR-HPV
types and the development of laryngeal neoplasia found no association between infection with HPV11
and laryngeal squamous intraepithelial neoplasia (SIN)-2+ or carcinoma. This study also found that co-
infection with other HR-HPV types did not correlate with high-grade SIN and carcinoma. A significant
preponderance of SIN2+ was instead identified in an HPV negative adult with adult-onset RRP (Omland
et al., 2014).
1.10.2. Diagnosis of recurrent respiratory papillomatosis
The presenting symptoms of respiratory papillomas are progressive dysphonia, stridor, and respiratory
distress. Laryngeal papillomas appear as off-white, exophytic, pedunculated, polypoidal masses. Single
or multiple papillomas may be present (Ivancic et al., 2018; Seedat, 2020; Wilcox et al., 2014).
Histopathological examination of a biopsy of the lesion gives a definitive diagnosis with RRP.
Exophytic finger-like projections of keratinised squamous epithelium maintained by connective tissue
and a vascular centre present in RRP. Vacuolated cells with noticeable cytoplasmic inclusions will be
visible, indicating the presence of viruses (Ivancic et al., 2018; Seedat, 2020; Welschmeyer & Berke,
2021; Wilcox et al., 2014).
1.10.3. Treatment of recurrent respiratory papillomatosis
Once a patient develops RRP, there is no definitive cure. Papillomas are removed using various
techniques but tend to recur (Ivancic et al., 2018; Seedat, 2020; Swain et al., 2020). HPV DNA is
present in uninvolved tissue and anatomical sites adjacent to the papillomas. Therefore, albeit with the
removal of all papillomas, lesions tend to recur. The frequency of recurrence tends to decrease over
time for most patients diagnosed with RRP, although this is not the case for all patients (Ivancic et al.,
2018; Seedat, 2020; Seedat et al., 2013).
The mainstay of treatment for RRP is repeated microlaryngoscopic procedures aiming to clear the
airway while preserving the mucosa and vocal folds (Ivancic et al., 2018; Seedat, 2020; Swain et al.,
2020). However, in developing countries, cold steel instruments are often the only mean of removing
papillomas (Seedat, 2020; Swain et al., 2020). Complications due to repeated surgical interventions
may result in long-standing abnormal vocal quality. Patients with airway obstruction may also require
a tracheostomy (Ivancic et al., 2018; Seedat, 2020; Swain et al., 2020).
A popular treatment option includes the microdebrider, which is quickly becoming the new gold
standard for removing respiratory papillomas. The microdebrider causes minimal trauma to the
18
surrounding tissue and minimises the lower respiratory tract from being contaminated with papillomas
and blood (Seedat, 2020; Swain et al., 2020). Controlled ablation is a relatively non-invasive low heat
method used for the dissolution of soft tissue. This method is designed to cause minimal charring and
burning of the tissue (Seedat, 2020; Swain et al., 2020).
Therapeutic vaccines that trigger the cell-mediated immune response are ideal for treating established
HPV infections (Chabeda et al., 2018). Numerous approaches are being explored to develop
prophylactic and therapeutic vaccines, including peptide-based vaccines, epitope-based vaccines,
recombinant vaccines, bacteria-based vaccines, yeast-based vaccines, VLP-based vaccines, DNA
vaccines, plant-based vaccines, dendritic cells (DC)-based vaccines, and protein-based subunit
vaccines. Several candidates have progressed to clinical trials (Chabeda et al., 2018; Dadar et al., 2018).
The E1, E2, E6 and E7 proteins are near-ideal targets for vaccines. The E6 and E7 oncoproteins are
expressed constitutively and at high levels and have a low mutation rate to maintain malignancy. The
E1 and E2 proteins are essential proteins expressed in all HPVs. These proteins are expressed at very
high levels at the early stages of infection and are a valuable target for therapeutic vaccines. Currently,
no therapeutic HPV vaccines have been approved for use in people infected with HPV, but promising
vaccine candidates have been identified in numerous studies (Chabeda et al., 2018; Dadar et al., 2018).
Prophylactic vaccines have also been used as adjuvant therapy for RRP (Rosenberg et al., 2019). A
systematic review and meta-analysis published in 2019 including patients previously diagnosed with
RRP receiving the Gardasil® vaccine as a therapeutic option showed that the number of monthly
surgical procedures for removal of laryngeal papillomas significantly reduced (0.35 to 0.06 per month)
and that the intersurgical interval increased from seven months to over 34 months (Rosenberg et al.,
2019).
Another adjuvant therapy for RRP treatment is intralesional cidofovir, an antiviral agent for treating
cytomegalovirus retinitis in individuals diagnosed with acquired immunodeficiency syndrome (AIDS).
The mechanisms of action of cidofovir involve decreasing the efficiency of DNA transcription of HPV.
In RRP, cidofovir has been shown to increase relapse-free time and decrease the number of surgical
procedures to remove papillomas (Fusconi et al., 2014; Gazia et al., 2020; Grasso et al., 2014; Graupp
et al., 2013; Tjon Pian Gi et al., 2012; Welschmeyer & Berke, 2021).
1.11. Prophylactic vaccines against human papillomavirus
There are currently three vaccines for protection against HPV: Cervarix®, Gardasil®, and Gardasil-9®.
The vaccines are developed using VLPs and mimic a natural HPV infection. These VLPs are not
infectious and do not contain any HPV DNA (Barra et al., 2019). The vaccines utilise the fact that HPV
L1 proteins form VLPs, which are antigenically like native virions when expressed in a range of cell
19
types. The targeted HPV types are blocked from entering the host cell by eliciting neutralising
antibodies that bind to the VLPs. However, these vaccines cannot eliminate established HPV infections
since the target antigens are not expressed in infected cells (Benedict & Derkay, 2021; Chabeda et al.,
2018).
Cervarix is a bivalent vaccine that protects against HR-HPV types 16 and 18, responsible for causing
high–grade cervical lesions. Gardasil is a quadrivalent vaccine that protects against two HR-HPV types,
HPV16 and HPV18, and two LR-HPV types, HPV6 and HPV11, responsible for RRP development.
Cervarix and Gardasil are aluminium-based first-generation vaccines proven to be highly immunogenic.
Both vaccines exhibit cross-protection against HPV types 31, 33, 45, and 51 due to their phylogenetic
relationship to HPV16 and HPV18. Gardasil-9 is a second-generation vaccine that protects against five
HPV types in addition to the HPV types targeted by the quadrivalent Gardasil vaccine. These include
HPV types 31, 33, 45, 52, and 58 (Barra et al., 2019; Chabeda et al., 2018; Dadar et al., 2018). The
effectiveness of HPV vaccines in preventing infection by the targeted types has been documented in
many countries (Brotherton, 2019).
Cervarix® has been incorporated into the national vaccination program by the South African National
Department of Health in 2014 and is administered to girls aged nine to ten at public schools in two
doses approximately six months apart. Approximately 354 000 age-eligible girls in grade four have
been vaccinated (Delany-Moretlwe et al., 2018). However, this vaccine does not confer immunity
against other HPV types, such as HPV6 and HPV11. A study conducted by Mbulawa and colleagues in
2018 reported a high prevalence of the HPV types targeted by the Gardasil-9 vaccine (38.5%)
circulating amongst South-African women, encouraging the introduction of this vaccine into school
health programs (Mbulawa et al., 2018). The introduction of this vaccine could be beneficial against
the development of genital warts and RRP.
Nokavic and colleagues published a five-year report on surveys of 28 paediatric otolaryngologists on
cases of RRP diagnosed post-HPV vaccination implementation in Australia. They demonstrated a
decrease in the incidence of RRP from 0.16 to 0.02 per 100 000 children and was the first study to report
a decrease in the incidence of RRP following the implementation of an HPV vaccination program
(Novakovic et al., 2018).
Future studies on the effectiveness of the 2014 HPV vaccination campaign are essential to inform
vaccination campaigns, monitor the vaccination campaign's impact on circulating HPV types, and
collect baseline data for future impact studies. Prevalence surveys reporting on the 2014 HPV
vaccination campaign in South Africa are underway and expected to be published in 2024. Additionally,
progress in immunology- and biotechnology-derived therapeutics, recombinant DNA technology and
20
molecular biology could multiply opportunities to fabricate vaccines that can effectively prevent and
treat infection with HPV, which may in return result in a diminution of HPV cases globally.
1.12. Detection of human papillomavirus infections
Accurate identification and typing of HPVs relies on molecular biology techniques as the virus cannot
be grown in conventional cell culture. At present, nucleic acid hybridisation -, signal amplification- and
nucleic acid amplification assays are primarily used to identify and type HPV (Abreu et al., 2012).
1.12.1. Nucleic acid hybridisation assays
Nucleic acid hybridisation assays include Southern blotting, dot blot hybridisation and in situ
hybridisation. However, nucleic acid hybridisation assays are not routinely used as they are time-
consuming and have low sensitivity (Abreu et al., 2012).
1.12.2. Signal amplification assays
Two signal amplification assays commonly utilised for diagnostic purposes are the Hybrid Capture®2
(hc2, Digene Corp., USA) and the Cervista® HPV HR assay (Abreu et al., 2012). The hc2 detects 13
HR-HPV types (HPV types 16, 18, 31, 33, 35, 39, 45, 51, 52, 56, 58, 59, and 68). However, hc2 can
cross-react with LR-HPV types, including HPV11 (Burd, 2016). The Cervista® HPV HR assay detects
the same HR-HPV types as the hc2 plus HPV66 and has a lower cross-reactivity rate to other HPV
types (Burd, 2016). Although signal amplification assays have increased sensitivity for genotyping and
a low false-positive rate, they are not intended for single HPV genotyping (Abreu et al., 2012).
1.12.3. Nucleic acid amplification assays
There are numerous nucleic acid amplification-based methods to detect the presence of HPV in clinical
samples reliably. Polymerase chain reaction (PCR) is a widely used, specific and sensitive way to detect
the presence of HPV DNA and targets conserved regions such as the L1 capsid gene. In conventional
PCR, consensus primers targeting the HPV L1 gene are often used for amplification. Primers, for
example, MY09/MY11, GP5+/GP6+, PGMY and SPF10, are designed to detect various HPV types
(Cornelissen et al., 1992; de Roda Husman et al., 1995). HPV genotypes can also be detected using
primers targeting the E6 and E7 ORFs. The nucleotide sequences of the E6 and E7 ORFs contain many
sequence variations (Bernard et al., 2006). Other ORFs such as the E5 ORF, in combination with the
L1 ORF and URR, can also be used to distinguish between HPV types (Jelen et al., 2016). Following
amplification, HPV can be genotyped using different techniques, for instance, genotype-specific
primers, restriction-fragment length polymorphism (RFLP), sequencing, hybridisation, and linear probe
assays (Abreu et al., 2012; Dixit et al., 2011). Real-time PCR is a specific and sensitive tool that has
21
been shown to reliably detect and genotype HPV. Real-time PCR targets a relatively small segment of
the genome and can thus be used on archival clinical samples. This method delivers reproducible, rapid
results and can be used on clinical samples (Abreu et al., 2012).
The Linear Array® HPV Genotyping Test (Roche Molecular Diagnostics, CA, USA) is an assay based
on PCR amplification of an L1 gene segment and co-amplification of the β-globin gene, combined with
reverse line blot hybridisation, which can distinguish between 15 HR-HPVs, three probable HR-HPVs,
ten LR-HPVs (including HPV11) and nine other HPV genotypes with undetermined risk (Abreu et al.,
2012). The Clinical arrays® HPV kit can be used to detect and genotype HPV and allows detection of
20 HR-HPV types and 15 LR types, including HPV11, and can also detect co-infections (Abreu et al.,
2012). The Microplate colorimetric hybridisation assay (MCHA) is a PCR-based method that relies on
amplifying a region within the L1 gene followed by colorimetric hybridisation to HPV type-specific
probe micro-well plates to identify six different HR-HPV types (Abreu et al., 2012). The COBAS®
4800 HPV test combines automatic sample preparation and real-time PCR to detect 14 HR-HPV types
(Burd, 2016). The CLART® Human papillomavirus 2 (Genomica, Madrid, Spain) is an amplification-
based methodology that targets a conserved region, L1, within the HPV genomic region and a region
within the formylmethanofuran-tetrahydromethanopterin formyltransferase (FTR) gene. This test
detects 35 HPV types using specific probes in a low-density microarray (Abreu et al., 2012).
1.12.4. Genotyping
Many HPV tests use probes that test for 13−14 of the most common HPV types. It has been proposed
that testing for other HPV types may have a minimal effect on the knowledge of known, circulating
HPV types. Still, there is an increasing need to distinguish between different HPV genotypes due to
current advancements in the HPV field, evidence that certain HPV types correlate with high-grade
carcinogenicity, the effect of multiple HPV infections on clinical presentation and the effect of
vaccination campaigns on the current circulating HPV genotypes (Abreu et al., 2012; Brink et al., 2007;
Godínez et al., 2014; Pinidis et al., 2016).
Traditionally, Sanger sequencing, also known as the chain termination method, was used for analysing
DNA (Sanger et al., 1977). However, Sanger sequencing has limitations due to its restricted sensitivity,
and it cannot perform a parallel investigation of multiple targets. Even though, Sanger sequencing
remains the gold standard when analysing HPV DNA, developments in NGS technologies such as
whole genome sequencing (WGS) and corresponding analytical tools have provided comprehensive
novel genomic information on HPV and associated diseases on an unprecedented scale (Ambulos et al.,
2016; Parker & Chen, 2017; Tuna & Amos, 2017). NGS enables the analysis of millions of DNA
fragments simultaneously and has successfully been used to provide detailed information on the
aetiology of HPV-driven diseases and cancers (Ambulos et al., 2016; Parker & Chen, 2017; Tuna &
22
Amos, 2017). Additionally, the advancement of NGS technology allows for the development of a high-
throughput, affordable assay for HPV genotyping and has proven to be sensitive, specific, and capable
of identifying multiple co-infected HPV genotypes from a single sample (Ambulos et al., 2016; Parker
& Chen, 2017; Tuna & Amos, 2017).
HPV genome sequencing can be used to detect and genotype HPV on an assortment of clinical
specimens such as blood, saliva, tissue, and formalin-fixed paraffin-embedded samples using the L1
ORF (Ambulos et al., 2016; Parker & Chen, 2017). Additional sequencing is required to differentiate
between the different lineages and sub-lineages (Burk et al., 2011; Maver et al., 2011). It has been
determined that sequencing the E5 region and a small 208bp segment which includes the 3' end of E2
to the 5' end of NCR2, can be used to construct a phylogenetic tree representing the different lineages
and sub-lineages of HPV11 (Jelen et al., 2016).
1.13. Phylogenetic analysis of human papillomavirus isolates
Phylogenetics is the study of phylogeny, which pertains to the evolutionary relationship among and
within different organisms and species. Phylogenetics is linked to taxonomy, which is concerned with
naming, describing, and classifying organisms (Balaban et al., 2019). Taxonomy relies on information
gathered for phylogenetics during the classification of organisms. In phylogenetics, DNA sequencing
methods such as Sanger sequencing or NGS investigate inheritable traits (Ambulos et al., 2016; Parker
& Chen, 2017; Sanger et al., 1977). This results in a schematic postulate of the relationship between
organisms, termed a phylogenetic tree, that reflects the evolutionary history of said organism (Balaban
et al., 2019; Hirose et al., 2018).
1.13.1. Sequence evolution
Before the division of any cell, its genome must be duplicated to be inherited by the offspring. Despite
proof-reading mechanisms, errors still occur in the replication process. In time, genomic mutations such
as point-mutations accumulate as traces of evolutionary variances. Point-mutations include nucleotide
substitutions, which are the exchange of one nucleotide by another, the insertion of one or more bases,
and the deletion of one or more bases. To distinguish between specific point-mutations, the sequence
of a common ancestor is necessary to decide whether an insertion or deletion has occurred. Therefore,
when investigating the evolutionary relationship of HPV11, a prototype sequence is used as a reference
to identify nucleic heterogeneity (Balaban et al., 2019; Hirose et al., 2018).
Genomic mutations of DNA may also affect the amino acid sequence translated in protein-coding
regions. Synonymous mutations occur if the nucleotide substitution does not alter the amino acid, and
non-synonymous mutations occur when the nucleotide substitution changes the amino acid translated
(Lebeuf-Taylor et al., 2019).
23
1.13.2. Multiple sequence alignment
Sequence alignment is an important research field in bioinformatics as it guides crucial tasks such as
phylogenetic analysis. A multiple sequence alignment is often used to analyse homologous sequences.
An alignment is a data matrix where homologous characters of the sequence are aligned in columns to
identify discrepancies (Chowdhury et al., 2017).
1.13.3. Phylogenetic trees
Molecular Evolutionary Genetics Analysis (MEGA) is an example of software that provides a powerful
and flexible interface for conducting phylogenetic analyses (Hall, 2013). Phylogenetic trees are a
practical way to present the evolutionary relationship between organisms as the genetic sequence of
contemporary sequences evolved from a common ancestor (Balaban et al., 2019). A phylogenetic tree
consists of a root, nodes, tips and branches. The shape in which the branches connect the tips, nodes
and root is termed the tree's topology. The tips of the phylogenetic tree represent the descendant or
current taxa, and the nodes represent a common ancestor. The root represents the DNA sequence of all
the species' sole common ancestor in the data set. Phylogenetic trees can be rooted or unrooted. A rooted
tree denotes a common ancestor, whereas an unrooted tree has no origin and does not hypothesise the
ancestral line (Balaban et al., 2019).
1.13.4. Types of phylogenetic methods
Methods of tree reconstruction based on DNA sequences include maximum parsimony methods,
evolutionary distances between sequences and approaches applying the maximum likelihood principle
(Rusinko & McPartlon, 2017; Serdoz et al., 2017).
Neighbor-Joining is a distance-based agglomerative clustering technique that takes a distance matrix
detailing the distance between each pair of taxa as an input. Distance-based methods alter the sequence
data into a pairwise similarity matrix to use during tree interpretation. The algorithm starts with a
completely unresolved tree and iterates until the tree is resolved and branch lengths are established
(Rusinko & McPartlon, 2017). The Neighbor-Joining method is suited for complex datasets comprising
lineages with variable rates of evolution and permits correction for numerous substitutions.
Additionally, the Neighbor-Joining method is suited for large input data sets and generates results in a
small amount of time. However, due to the Neighbor-Joining tree being distance matrix based, the
output information is often limited, and only one out of several possible trees are depicted depending
on the model of evolution used (Kuhner & Felsenstein, 1994; Saitou & Nei, 1987).
Parsimony reconstruction and maximum likelihood methods are both character-based methods that use
all known evolutionary information, such as nucleotide substitutions, to identify the most likely
24
ancestral relationship (Serdoz et al., 2017). Character-based methods aim to create an algorithm for
scoring the probability that a given tree would produce the observed sequences at its leaves. The
algorithm then filters through the possible trees to identify a tree with the maximum probability of
producing the correct results (Serdoz et al., 2017).
The most popular character-based method when constructing a phylogenetic tree for HPV11 is the
maximum likelihood method (Burk et al., 2011; Jelen et al., 2016). This algorithm provides
probabilities that a particular evolutionary model has generated the observed data. This algorithm uses
each position in a sequence alignment and considers all possible trees. The higher the probability of the
sequence given, the more the tree is favoured until the maximum likelihood is determined. Since an
evolutionary model can be chosen, this tree is suitable for a large variety of groups (Miguel Rocha &
Ferreira, 2018; Serdoz et al., 2017). The supposed advantages of maximum likelihood methods include
a lower variance rate compared to other methods, and it is rarely affected by errors during sampling.
The method tends to be robust to several violations of evolutionary model assumptions, and even with
short sequences, it outperforms alternate methods. However, the maximum likelihood method is
computationally intensive, and the result is dependent on the model of evolution used (Miguel Rocha
& Ferreira, 2018; Serdoz et al., 2017).
1.14. Problem statement
During a previous study conducted at the Division of Virology, UFS, a novel HPV11 lineage B isolate
was identified, warranting investigation of further isolates for clarification of lineages currently in
circulation (Makatsa, 2012). It is essential to characterise the current strains circulating amongst the
community, compare the various lineages and sub-lineages with disease severity in future studies, and
monitor the impact of the vaccination campaign on the circulating HPV types and guide vaccine
development. Baseline data on circulating HPV genotypes may effectively monitor the impact of the
vaccination program on the community. Also, determining the genotype of HPV responsible for RRP
in patients may have prognostic implications. Hence, this study investigated HPV11 lineages circulating
in patients with RRP and identify novel lineages.
1.15. Aims and objectives
The aim of this study is to genetically characterise HPV11 isolates from patients diagnosed with RRP
treated at Universitas Academic Hospital, Free State, South Africa.
To achieve these aims the following objectives were identified:
• To determine the nucleotide sequence of the L1, URR, E5a/b and a segment of E2 genomic
regions of HPV11 isolates from patients with RRP in the Free State, South Africa.
25
• To conduct phylogenetic analyses using sequence data from HPV11 isolates to determine the
genetic relationship and to determine the presence of HPV11 variants and lineages.
• To generate and analyse the whole genome sequences of novel HPV11 variants.
26
CHAPTER 2- Phylogenetic analysis of human papillomavirus type 11
isolates
2.1. Introduction
HPV11 is a causative agent of RRP, a benign laryngeal neoplasm presenting mainly in children
(Benedict & Derkay, 2021). To date, no published data on the HPV11 intratypic variants circulating in
the Free State region is available. Therefore, phylogenetic analysis may aid in identifying circulating
variants in patients with RRP.
It is commonly accepted that natural selection influenced the evolution of organisms and their
contemporary genes. An increasing number of DNA sequences are being analysed today, and the study
of the ancestry of organisms and their phylogenetic trees is in high demand. A better understanding of
phylogenetic trees and the evolutionary processes that they model is necessary to gain insight into how
organisms have mutated.
Phylogenetic analysis investigates evolutionary development in different species by analysing genome
information such as DNA, deepening our knowledge of how different species' genomes develop
(Charleston, 2013; Felsenstein, 1981; Hall, 2013). To date, over 220 HPV types have been identified
and classified into different risk groups through phylogenetic analysis, according to The International
HPV Reference Centre (Bernard et al., 2010; de Villiers et al., 2004)
(https://www.hpvcenter.se/human_reference_clones). Although Sanger sequencing remains the gold
standard for sequencing, especially when sequencing single genes, recent advances in NGS technology
has led to NGS being applied as the preferential technology for several kinds of molecular studies and
typing (Ambulos et al., 2016; Kocjan et al., 2015).
Analysis of sequence heterogeneity across the complete genome of HPV is frequently used to identify
HPV variants. The ORF nucleotide sequence coding for the L1 capsid protein is sufficient for
classifying HPV types and is used to represent the whole genome variation due to its high level of
conservation (Burk et al., 2011, 2013). However, there is a loss of clear resolution when the L1 sequence
is used to identify intratypic HPV variants. To discriminate between different HPV11 lineages, the URR
and a coding region such as the L1 ORF are necessary, but for clustering of sub-lineages, additional
sequencing is required (Burk et al., 2011, 2013). Sequence data can be aligned to a prototype genome
to identify nucleotide heterogeneity, and variant lineages and sub-lineages of HPV11 can be identified
by the percentage of nucleotide differences in the complete genome. A nucleic acid change of 0.5%-
1.0% across the whole genome defines a variant sub-lineage, 1%-10% defines a variant lineage, and
>10% defines a novel HPV type (Burk et al., 2011). In 2011, the circulating HPV11 variants were
27
classified as A1 and A2, and in 2016, whole-genome analysis of 78 HPV11 complete genomes revealed
two additional sub-lineages (A3 and A4) and a novel lineage B (Burk et al., 2011; Jelen et al., 2016).
Nucleotide variances in genomic DNA can lead to changes in translated gene products. Therefore, the
amino acid sequence can also help study the evolutionary relationship between organisms (Charleston,
2013; Felsenstein, 1981; Hall, 2013).
Phylogenetic trees are constructed by use of sequence data and are helpful to depict the evolutionary
relationships between organisms. Phylogenetic trees that reflect the evolutionary history of an organism
is preferred (Charleston, 2013; Felsenstein, 1981; Hall, 2013). Finding an accurate phylogenetic tree
for a set of related species with only DNA is complex. Statistical inference using statistical models and
Markov models has helped scientists estimate how comparable a phylogenetic tree is to the actual
phylogenetic tree for a specified set of DNA sequences (Charleston, 2013; Felsenstein, 1981; Hall,
2013).
Neighbor-Joining is a distance-based agglomerative clustering technique that takes a distance matrix
detailing the distance between each pair of taxa as an input (Rusinko & McPartlon, 2017). The
Neighbor-Joining method is suited for large, complex datasets comprising lineages with variable rates
of evolution and permits correction for numerous substitutions and generates results in a small amount
of time (Kuhner & Felsenstein, 1994; Saitou & Nei, 1987).
However, maximum likelihood trees are preferred when constructing a tree for HPV11 isolates and use
an explicit evolutionary model. This method provides probabilities that a particular evolutionary model
will generate the observed data until the tree with the maximum likelihood is determined (Charleston,
2013; Felsenstein, 1981; Hall, 2013). The method has been proven to outperform alternate methods
(Charleston, 2013; Felsenstein, 1981; Hall, 2013).
The current study explored phylogenetic relatedness and nucleotide variability among HPV11 isolates
from patients diagnosed with RRP. The L1, URR, E5a/b and a segment of the E2 ORF were targeted
for sequencing. The concatenated L1-URR is necessary to discriminate between HPV11 lineages (Burk
et al., 2011), and according to a study published in 2016, the concatenated L1-URR-E5a/b can
discriminate between different lineages and sub-lineages (Jelen et al., 2016). The 208bp region in the
E2 ORF has also been suggested to represent whole-genome variation and could therefore be used to
construct phylogenetic trees representative of the different lineages and sub-lineages (Jelen et al., 2016).
Characterising the current circulating strains will aid in future disease severity studies, guide vaccine
development and have prognostic implications. Hence, this study investigated HPV11 lineages and sub-
lineages circulating in patients with RRP.
28
2.2. Materials and methods
2.2.1 Study samples
During a previous study conducted at the Division of Virology at the University of the Free State,
Bloemfontein, 94 laryngeal papilloma biopsies were collected from 2008 to 2018 and stored at -80℃
(ETOVS 194/2007 and ECUFS 6/2011) (Appendix 1). Informed consent for collection and storage of
biopsies was obtained from each patient by Professor Seedat from the Department of
Otorhinolaryngology, Faculty of the Health Sciences, University of the Free State.
All samples were assigned a VBD number followed by the year of collection. DNA was extracted from
each biopsy by use of the QIAamp DNA Mini Kit (Qiagen, Hilden, Germany) according to the
manufacturer’s instructions and stored at -20℃. Samples were screened for HPV DNA by PCR,
targeting a region of the L1 ORF using the GoTaq ® Flexi DNA Polymerase mediated PCR
amplification kit (Promega, Madison, USA) according to manufacturer’s instructions. Following PCR
samples were cleaned with the Promega Wizard ® SV Gel PCR Clean-up System kit according to
manufacturer’s instructions (Promega, Madison, USA). The positive reactors were genotyped by
nucleotide sequence determination of the amplicon with the BigDye ® Terminator v 3.1 Cycle
Sequencing kit (Thermo Fisher Scientific, Illinois, USA) and aligned with sequence data retrieved from
GenBank using Basic Local Alignment Search Tool (BLAST) analysis. A novel HPV11 lineage B
isolate identified in a previous study was also included in the current study (Makatsa, 2012).
2.2.2 Study setting and population
All tests were performed at the Division of Virology, Faculty of Health Sciences at the University of
the Free State (UFS). The study population included patients diagnosed with RRP at the Universitas
Academic Hospital, Bloemfontein, a referral hospital including patients mainly from the Free State
province, but also from surrounding regions. Eighty-one isolates from 60 patients were available for
this study. Additional sequence data for 28 geographically distinct representatives of each HPV11
lineage, including lineages A and sub-lineages A1 to A4, and lineage B, were retrieved from GenBank
(Appendix 2). The current study was approved by the Health Sciences Research Ethics Committee of
the University of the Free State (UFS-HSD2019/1109/2708) (Appendix 3).
2.2.3 Amplification of five human papillomavirus genomic regions
PCR amplification of the complete L1 ORF, URR, E5a ORF and E5b ORF was performed with primers
as described by Maver and colleagues (2011) summarised in Table 1. Primers flanking a segment of the
E2 were designed according to whole genome HPV11 sequences retrieved from GenBank (Appendix
4). Sequence data for the E2 ORF from 28 HPV11 isolates from different geographic distribution were
29
retrieved from GenBank and aligned using ClustalX v2.1 (Larkin et al., 2007). Primers were designed
to amplify the 3’ of E2 to the 5’ of the NCR2 which is 208bp.
The primers’ specificity was determined with BLAST analysis and are summarised in Appendix 5. The
GC content and melting temperature (Tm) of the forward primer (HPV11-E2seg-F) and reverse primer
(HPV11-E2seg-R) were determined with Thermo Fisher Scientific Tm calculator (Allawi &
SantaLucia, 1997) and are summarised in Table 1. PCR amplification of the partial E2 ORF was
performed with the designed primer pair which targets a region of approximately 331bp, flanking the
target partial E2 ORF. Amplification of all samples was performed with the Applied Biosystems™
Proflex™ PCR system (Thermo Fisher Scientific, Illinois, USA). PCR amplification was performed
using the Go Taq®G2 Hot Start Polymerase (Promega, Madison, USA) according to the manufacturer’s
instructions.
Briefly, all reaction mixtures were prepared in 200μl PCR reaction tubes. Each reaction comprised 4μl
template HPV11 DNA, 10μl 5XGreen Go Taq® flexi buffer, 2mM MgCl2 solution, 0.2mM PCR
nucleotide mix, 0.4μM forward primer, 0.4μM reverse primer, 1.25U Go Taq ® G2 Hot Start Polymerase
and nuclease free water up to 50μl. A no template control was included in each PCR amplification run
by replacing the template with nuclease free water during preparation of the reaction. PCR components
are included in Table 2.
PCR amplification conditions were as follows: initial denaturation at 95℃ for two minutes for one
cycle, followed by 30 cycles of denaturation at 95℃ for 30 seconds, annealing at 47℃ for 30 seconds
and elongation at 72℃ for two minutes. Lastly, final elongation at 72℃ for five minutes before cooling
to 4℃ indefinitely.
30
Table 1: List of primers used for sequencing the L1, E5a/b, upper regulatory region (URR) and a segment of the E2 of
human papillomavirus type 11 (HPV11). Primers HPV11-L1F, HPV11-L1R, HPV11-LCR-F, HPV11-LCR-R, HPV11-E5F
and HPV11-E5R are modified from Maver et al., 2011. Primers HPV11-E2seg-F and HPV11-E2seg-R were designed
according to whole genome HPV11 sequences retrieved from GenBank.
Primer sequence GC Primer binding
Region Primer name e f fTm
(5'→3') e d
Gene size Amplicon size
content site
TGTTTTTATTACAG
a
HPV11‐L1F 32.00% 56.0 ℃ nt 5682–5706
GTTCTGACTTC
L1 1,506bp 1,770bp
AAAACATACATAC
a
HPV11‐L1R 28.00% 55.4 ℃ nt 7451–7427
ACATTCCACAAA
CTACAGCCCCCAA
a
HPV11‐LCR‐F 55.56% 57.5 ℃ nt 7236–7253
ACGAA
URR 756bp 823bp
CGTGGAGGCATCTT
a
HPV11‐LCR‐R 47.62% 58.5 ℃ nt 125–105
TACTTTC
AATACCACCCACC
a
HPV11‐E5F 50.00 % 56.4 ℃ nt 3772–3791
ATTAGGC
E5a/b g500bp 684bp
GGCTGACGCACGTT
b
HPV11‐E5R 58.82 % 56.2 ℃ nt 4455–4439
TAC
ACAGTGCAGCTAC
c
HPV11-E2seg-F 50,00% 56.0 ℃ nt 3561 – 3580
GCCTATA
c
E2 segment 208bp 331bp
TTGTACAGGCACTA
c
HPV11-E2seg-R 47.83% 60.1 ℃ nt 3891-3869
CCTCCATAC
a
Primers reported by Maver et al. , 2011.
b
Primers reported by Maver et al. , 2011 with modifications.
c
Primers designed according to geographically distinct whole genome human papillomavirus type 11 sequences retrieved from GenBank (GenBank Acc. No.
M14119.1; JN644141.1; JQ773408.1; MN788368.1; LN833187.1).
d
Positions of the nucleotides determined with respect to the prototype human papillomavirus type 11 genome (GenBank acc. no. M14119.1).
e
Melting temperature (Tm) and GC content determined by Thermo Fisher Scientific Tm calculator (Allawi & SantaLucia, 1997).
f
Size determined with respect to the prototype human papillomavirus type 11 genome (GenBank acc. no. M14119.1).
g
The E5a/b region contains one overlapping nucleotide (E5a – nt3871-4146; E5b – nt4146-4370) and therefore consist of 500bp.
2.2.4 Agarose gel electrophoresis
The size of the PCR amplicons generated during PCR amplifications were determined by separating
5μl aliquots of PCR products with gel electrophoresis. A 1% agarose gel was used to determine the
amplicon size and integrity of the amplicon generated by the PCR. A table summarising the preparation
of agarose gels is included in Appendix 7. For all amplicons, a 1% gel was prepared with 1g Seakem®
31
LE agarose powder and 100ml of 1x Tris-Acetic acid- (ethylenediaminetetraacetic acid)- EDTA (TAE)
buffer at pH 8.0 and the gel was electrophoresed at 90 volts (V) for 45 minutes. The preparation of TAE
is included in Appendix 6. The agarose gel was stained with a GelRed nucleic acid gel stain (Thermo
Fisher Scientific, Illinois, USA) (Appendix 8) and visualised with the BioRad Molecular Imager Gel
Doc™ XR+ with Image Lab™ Software to determine the fragment sizes according to the known DNA
size marker (BioRad, California, USA). The O’GeneRuler™ 100bp DNA ladder SM1173 and SM0333
(Fermentas, Illinois, USA) containing DNA fragments from 100 to 10 000bp was used to estimate the
size of the amplicons. Expected amplicon sizes varied, with the primers targeting the L1 ORF producing
the largest amplicon (1770bp), followed by the primers targeting the URR (823bp), the E5a/b ORF
(685bp) and the E2 segment (331bp).
Table 2: Polymerase chain reaction (PCR) components for amplifying human papillomavirus type 11 (HPV11) L1 gene,
E5a/b gene, and upper regulatory region (URR).
PCR components Volume Final concentration
5XGreen Go Taq® flexi buffer 10μl 1x
MgCl2 solution, 25mM 4μl 2mM
PCR nucleotide mix, 10mM each 1μl 0.2mM
Forward primer: HPV11‐L1F (20pmol/μl) 1μl 0.4 μM
Reverse primer: HPV11‐L1R (20pmol/μl) 1μl 0.4 μM
GoTaq ®G2 Hot Start Polymerase (5U/μl) 0.25μl 1.25U
Template DNA 4μl N/A
Nuclease free water 28.75μl N/A
Total 50µl N/A
2.2.5 Purification of PCR product
Following electrophoresis, DNA amplicons were purified using the GeneJET PCR Purification Kit
(Thermo Scientific, Illinois, USA) according to the manufacturer’s instructions. Briefly, equal amounts
of binding buffer and PCR mixture were mixed thoroughly and transferred to the purification column
before centrifuging for 60 seconds on the Eins-Sci E-C15-12.2 High Speed Micro Centrifuge (Eins Sci,
South Africa) at 25200 relative centrifugal force (rcf) and discarding the flow-through. A 700µl aliquot
of wash buffer was added to the purification column and centrifuged twice for 60 seconds (2520xrcf)
and discarding the flow-through twice. The column was transferred to a 1,5ml microcentrifuge tube and
32
between 30µl to 50µl aliquots of elution buffer was added to the column. The tube was centrifuged at
2520xrcf for 60 seconds and the column was discarded. The eluted DNA was stored at -20 ℃ until use.
2.2.6 Determination of DNA concentration
DNA concentration was determined with the NANODROP 2000 spectrophotometer (Thermo Fisher
Scientific, Illinois, USA).
2.2.7 Sequencing
The nucleotide sequences of each amplicon were determined with Sanger sequencing to identify the
nucleotide composition in order to identify the variant lineages and sub-lineages. Sequencing reactions
were prepared with the BigDye Terminator v3.1 Sequencing Reaction kit according to the
manufacturer’s instructions (Thermo Fisher Scientific, Illinois, USA). Briefly, the purified HPV
isolates were diluted with nuclease-free water to 20 ng/µl. Sequencing primers were diluted with
nuclease-free water to a concentration of 3.2pmol/µl.
For the L1 ORF, four primers were used for sequencing as described by Maver and colleagues in 2011
(Table 3). For the URR, primers used for the initial PCR as described in Table 1, with one additional
primer (HPV11-LCR-FF, 5’- TTCGGTTGCCCTTACATACA -3’, nt7598 - 7617) were used as
described by Maver and colleagues (2011). For E5a/b ORF and the E2 segment, primers used for the
initial PCR as described in Table 1 were used. Each amplicon was sequenced using bidirectional
sequencing.
Table 3: List of primers used for sequencing the L1 gene of human papillomavirus type 11 (HPV11) from Maver et al., 2011.
Primer sequence
Region Primer name c c b d dGC content Tm Primer binding site Gene size Amplicon size
(5' → 3')
CGTAAACGTATTCC
a
HPV-11‐L1U1 29.17% 54.24℃ nt 5743 - 5766
CTTATTTTTT
871bp
TGATCTGTTATTAC
a
HPV-11‐L1U1R 37.50% 56.16℃ nt 6613 - 6590
CCCCTTTTAC
L1 1,506bp
TTTTATTTGCGAAA
a
HPV-11‐L1U2 30.00% 51.86℃ nt 6503- 6522
GGAACA
809bp
ACATATAAATAAC
a
HPV-11‐L1U2R 30.77% 56.30℃ nt 7311- 7286
ACAACACACTGAC
a
Primers reported by Maver et al. , 2011.
b
Positions of the nucleotides determined with respect to the prototype human papillomavirus type 11 genome (GenBank acc. no. M14119.1).
c
Melting temperature (Tm) and GC content determined by Thermo Fisher Scientific Tm calculator (Allawi & SantaLucia, 1997).
d
Size determined with respect to the prototype human papillomavirus type 11 genome (GenBank acc. no. M14119.1).
33
A reaction mixture for every primer containing 2ul 5x sequencing buffer, 1ul BigDye v3.1 ready
reaction, 1ul primer (3,2pmol/µl) was prepared. A total volume of 2µl diluted HPV DNA and 8µl of
each reaction mixture was added to the well of the 96-well plate. The sequencing components for
nucleotide determination of HPV11 isolates are included in Table 4. The plate was sealed with thermo-
adhesive film and briefly centrifuged. The plate was placed in the Applied Biosystems™ Proflex™
PCR system (Thermo Fisher Scientific, Illinois, USA) for DNA amplification. PCR amplification
conditions were as followed: initial denaturation at 96℃ for one minute and one cycle, followed by 25
cycles of denaturation at 96℃ for 10 seconds, annealing at 50℃ for five seconds and elongation at
60℃ for four minutes before cooling to 4℃ until further use.
Table 4: Sequencing reaction components for nucleotide determination of human papillomavirus type 11 (HPV11) isolates.
Components Volume
5 x Sequencing Buffer 2µl
Primer (3,2pmol/µl) 1µl
BigDye v3.1 1µl
Template DNA 2µl
Nuclease free water 4 µl
Total 10µl
Following DNA amplification, ethanol/EDTA precipitation was performed. Nuclease free water and
125mM EDTA was combined in a 2:1 ratio, and a 15µl aliquot of the diluted EDTA solution was
added to each well. Thereafter, a 60µl aliquot of cold 96-100% ethanol stored at -20℃ was added to
each well. A silicone cover was placed on the 96-well plate before briefly vortexing the plate. The
plate was placed in the -20℃ freezer for five minutes. Afterwards, the plate was centrifuged at
2720xrcf for 80 minutes at 4℃. After centrifugation, the supernatant was aspirated entirely without
disturbing the pellet. A volume of 200µl cold 70% ethanol stored at -20℃ was added to each well. The
plate was sealed and centrifuged at 2720xrcf for 40 minutes at 4℃. The liquid was aspirated without
disturbing the pellet. The plate was then placed in the Applied biosystems™ Proflex™ PCR system
(Thermo Fisher Scientific, Illinois, USA) at 94℃ for 25-40 seconds to evaporate ethanol. Lastly, a
10µl aliquot Hi-Di was added to each well, and the samples were denatured at 95℃ for five minutes
in the Applied biosystems™ Proflex™ PCR system (Thermo Fisher Scientific, Illinois, USA).
The Applied Biosystems™ 3500Xl Genetic Analyzer (Thermo Fisher Scientific, Illinois, USA) was
used to sequence all samples. Briefly, a primer, enzyme and four fluorescently labelled
dideoxynucleotides triphosphates (ddNTPs) are added to the reaction. The ddNTPs emit light at
different wavelengths when excited by a laser. This emission can be captured by a camera and
34
converted to a chromatogram. As the fluorescently labelled extension products pass the laser, each
nucleotide is “called”. Raw sequence data were analysed as described below.
2.2.8 Data analysis
Analysis of nucleotide variation
Sequence data for five regions, namely the L1- (1506bp), URR- (758bp), E5a- (276bp), E5b region
(225bp) and a section from the 3’ of E2 to the 5’ of the NCR2 (208bp) was determined. The E5a/b
region contains one overlapping nucleotide (E5a – nt3871-4146; E5b – nt4146-4370) and therefore
consist of 500bp. Raw sequence data was manually verified with Unipro UGENE (Okonechnikov et
al., 2012). Briefly, the ends of the reads were trimmed based on the quality value. Each region was
sequenced bi-directionally, and the chromatograms of the filtered trimmed reads were aligned and
inspected to correct for possible nucleotide-calling faults due to ambiguous peaks.
Genomic variants and genomic positions in HPV11 L1-, E5a, E5b- and a segment of the E2 ORF and
the URR were identified by comparing the sequence data with the prototype HPV11 genome (GenBank
accession number M14119.1). For the URR, a corrected sequence with a 2bp insertion at genomic
position 7717-7718 was used when determining genomic variants and genomic positions (Maver et al.,
2011). The nucleotide substitutions, nucleotide insertions, and nucleotide deletions were identified for
the isolates sequenced in this study with respect to prototype strain (GenBank accession number
M14119.1) by use of MEGA X software (Kumar et al., 2018).
To estimate the percentage of variation within coding regions, the number of nucleotide variations
compared to the A1 prototype sequence M14119.1 were converted into a percentage using the following
equation:
𝑁𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑛𝑢𝑐𝑙𝑒𝑜𝑡𝑖𝑑𝑒 𝑣𝑎𝑟𝑖𝑎𝑛𝑐𝑒𝑠 𝑤𝑖𝑡ℎ𝑖𝑛 𝑎 𝑠𝑝𝑒𝑠𝑖𝑓𝑖𝑐 𝑐𝑜𝑑𝑖𝑛𝑔 𝑟𝑒𝑔𝑖𝑜𝑛 𝑐𝑜𝑚𝑝𝑎𝑟𝑒𝑑 𝑡𝑜 𝑀14119.1
× 100
𝑁𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑛𝑢𝑐𝑙𝑒𝑜𝑡𝑖𝑑𝑒𝑠 𝑤𝑖𝑡ℎ𝑖𝑛 𝑠𝑝𝑒𝑠𝑖𝑓𝑖𝑐 𝑐𝑜𝑑𝑖𝑛𝑔 𝑟𝑒𝑔𝑖𝑜𝑛 𝑜𝑓 𝑀14119.1
Degree of divergence
The degree of divergence was calculated using pairwise analysis of percentage divergence of
nucleotides with MEGA X software using concatenated E5a/b-L1-URR sequences of sequences
obtained in the current study and sequences retrieved from GenBank. Codon positions included were
1st, 2nd, 3rd, and non-coding. All positions with less than 95% site coverage were removed using the
pairwise deletion option.
35
Analysis of amino acid variation
The amino acid data order was determined for the L1- (501aa), E5a- (91aa), and E5b region (74aa) with
use of a codon chart included in Appendix 9. Amino acid variants in HPV11 L1-, E5a, E5b ORFs were
identified by comparing the amino acid data with the prototype HPV11 (GenBank accession number
M14119.1).
Phylogenetic analysis
All sequences were aligned with ClustalX v2.1. For L1 ORF sequence data, the ClustalX alignment
file was exported to IQTree webserver (Trifinopoulos et al., 2016) to determine the best model for a
reliable estimate maximum likelihood phylogeny with an ultrafast bootstrap of 1000. This model was
used to construct all maximum likelihood phylogenetic trees using MEGA X software. Additional
sequence data for geographically distinct representatives of each HPV11 lineage, including lineages
A, sub-lineages A1 to A4, and lineage B retrieved from GenBank were also included in downstream
phylogenetic analysis.
Concatenated L1-URR and E5a/b-L1-URR sequences were generated with Geneious Prime software
(https://www.geneious.com). To explore the phylogenetic relationship of the HPV isolates, maximum
likelihood phylogenetic trees with a bootstrap of 1000 were constructed with the concatenated L1-
URR sequences and the concatenated E5a/b-L1-URR sequences. Phylogenetic trees were constructed
with MEGA X software.
To explore whether the 208bp region in the E2 ORF sequenced in this study represents whole-genome
variation, a Neighbor-Joining phylogenetic tree and a maximum likelihood phylogenetic tree with a
bootstrap of 1000 were constructed with MEGA X software.
2.2.9 Ethical considerations
Ethical clearance was obtained from the Health Sciences Research Ethics Committee of the UFS. All
isolates used in this study were collected by Professor Seedat (UFS-HSD2019/1109/2708) and informed
consent was obtained to store the isolates and use them for related studies (ETOVS 194/2007 and
ECUFS 6/2011). No additional isolates were collected in this study. Permission to perform the study
was be obtained from the Head of the Division of Virology and the Head of the School of Pathology
(National Health Laboratory Service).
36
2.3. Results
2.3.1 Patient data
During previous studies at the Division of Virology at the University of the Free State, Bloemfontein,
laryngeal papilloma biopsies were collected from 2008 to 2018. DNA extracted from each biopsy was
screened for HPV DNA and positive reactors were genotyped. Over this 10-year period, 94 HPV11
isolates from 70 patients diagnosed with RRP were identified. Eighty-one isolates from 60 patients were
available for this study. Patient data is included in Appendix 1.
2.3.2 L1 ORF amplification and sequencing
PCR amplification of the L1 ORF was performed on all available isolates with primers HPV11-L1 and
HPV11L1-R (Table 1). Eighty-one samples were purified, and separated with gel electrophoresis and
72 samples had a band at the predicted size of approximately 1770bp confirming the presence of HPV11
L1 DNA. The DNA concentration of the 72 samples ranged from 14.5ng/µl to 202.1ng/µl. DNA
concentrations of all isolates are included in Table 5.
Nucleotide sequences of each amplicon within the predicted size range was determined with Sanger
sequencing. Nucleotide substitutions, insertions and deletions are summarised in Table 6. By comparing
sequence data of isolates sequenced in the current study and the A1 prototype sequence L1 sequence,
86 nucleotide substitutions were observed in 11 genomic regions. This included a C6028T in 69 isolates
(all except VBD34/08, VBD28/14 and VBD04/11), C6831T in one isolate (VBD34/08), A7126C in
one isolate (VBD23/10), A7131C in one isolate (VBD34/08). Additionally, isolates VBD34/08 and
VBD28/14 had A5929G, T6296A, T6298C, T6949A, G7045A C7197T, and T7237C nucleotide
substitutions. The maximum number of nucleotide substitutions in a single isolate, VBD34/08, was nine
which is 0,6% of the complete L1 ORF. Only one of the 72 (1.4%) isolates, VBD04/11, was identical
to the prototype.
37
Table 5: Quantitative estimation of DNA concentration revealed by Nanodrop spectrophotometer of human papillomavirus
type 11 (HPV11) genomic regions including late region L1, upper regulatory region (URR), early region E5a/b and a segment
of early region E2.
Isolates with insufficient DNA for quantitative estimation of DNA concentration are marked in dark grey.
VBD L1 (ng/µl) URR (ng/µl) E5a/b (ng/µl) E2 segment (ng/µl)
01/10 103,5 81,2 77,8 68,4
02/12 101,9 136,3 103,2 67,3
04/11 72,2
04/12 34 114 194 62,6
07/10 16,7 79,8 142,9 67
08/09 14,5 99,2 85,8 52,9
08/13 27,9 109,3 28,7 104
09/12 50,1 87,1 95,8 54,1
12/18 44,4 80,8 92,8 58,4
13/10 41,5 163,9 84,6 44
14/09 49,75 126,8 92,7 58
14/11 87,9 133,9 100,8 74,4
15/10 81,4 100,3 108,9 60,3
16/09 122,7 85,3 111,4 57,8
16/10 110,2 66,9 117,9 56,1
17/09 93,5 48,5 79,6 65,5
17/10 76,8 106,1 71,1 76,6
21/10 149,2 125,7 55,2 64,4
23/10 124,1 159,7 112,4 117,5
26/10 77,4 118 72 60,7
30/11 119,9 76,3 32,7 71,3
33/10 95,5 134,8 137,4 53,1
33/11 89,7 423,9 33,9 42,1
34/08 60,9 254 68,9 45,7
34/10 143,5 84,1 133,9 77,4
34/11 111,7 115,6 178,9 65,6
35/11 140,1 147,1 93 54,6
37/08 73,4 126 55,9 49
37/10 112,6 117,9 69,8 67,9
37/11 63,7 119,2 83,7 48,2
37/12 27,4 122,6 54,7 63,6
39/12 29,9 106,1 74,9 62,1
41/11 102,9 135,8 95,1 54,4
41/12 59,7 111,5 73,2 61,1
45/08 75,2
47/12 29 103,5 62,4 61,9
48/10 86,6 122,9 51,1 67,8
48/12 29,3 116,5 40,1 47,9
49/08 35 138,8 53,2 51,7
49/12 33,8 125,4 229,3 63,3
52/09 104,7 115,3 136,7 51,1
52/11 81,6 127,3 121,3 52,2
54/08 71,1
55/08 120,7 99,7
58/11 27,2 73,8 77 47,6
59/08 202,1 90,2 143,1 70,1
59/09 153 142,5 209,2 49,2
61/11 43 136,3 88,9 56,1
62/11 88 142,1 95,5 52,9
63/09 88,1 141,3 102,1 60,4
63/11 100,4 108,5 108,7 63,1
69/09 112,7 138,2 108,2 48,5
74/09 77,6 92,9 97,9 58,1
79/09 66,4 88 87,6 50,9
14/15 21,6 122,1 77,3 46
16/15 83,6 75,3 67 58,7
17/14 35,2 116,8 56,7 58,6
22/14 93,7 141,2 46,4 58,4
23/17 43,8 141,5 91,4 52,4
28/14 73,5 117,5 91,4 36,6
29/13 50,1 93,5 94,4 69,1
29/14 40 135 92,7 83,9
33/16 29,9 110,6 135,8 52,3
41/14 62 179,6 85,3 66
41/15 23,6 123,9 116,7 54,2
44/15 23,6 146,2 75,2 56,3
48/16 29,8 126,8 33,7 60,5
49/15 21,2 198,6 47,8 53,7
55/13 40,4 100,05 80,5 64,1
68/15 16 120,52 113,2 57,5
68/16 22,5 119,8 68,4 55,6
69/15 21,7
38
Table 6: Sequence comparison of nucleotide variances among 72 human papillomavirus type 11 (HPV11) isolates L1 region
against the HPV11 prototype with GenBank accession number M14119.1.
Residues that differed to those of HPV11 L1 prototype are indicated by a white colour difference and the substituted base.
L1 Genomic region
5 6 6 6 6 6 7 7 7 7 7
9 0 2 2 8 9 0 1 1 1 2
2 2 9 9 3 4 4 2 3 9 3
9 8 6 8 1 9 5 6 1 7 7
Base A C T T C T G A A C T
Isolate
VBD
Number
12/18 T
23/17 T
68/16 T
48/16 T
33/16 T
68/15 T
49/15 T
41/15 T
44/15 T
16/15 T
14/15 T
41/14 T
28/14 G A C A A T C
17/14 T
22/14 T
29/14 T
55/13 T
29/13 T
08/13 T
48/12 T
47/12 T
39/12 T
37/12 T
09/12 T
61/11 T
62/11 T
58/11 T
52/11 T
41/11 T
37/11 T
35/11 T
49/12 T
30/11 T
34/11 T
14/11 T
04/11
48/10 T
33/10 T
63/11 T
02/12 T
04/12 T
26/10 T
21/10 T
23/10 T C
01/10 T
07/10 T
17/10 T
34/10 T
79/09 T
74/09 T
15/10 T
59/09 T
63/09 T
69/15 T
55/08 T
34/08 G A C T A A C T C
37/08 T
08/09 T
14/09 T
41/12 T
17/09 T
49/08 T
59/08 T
69/09 T
16/10 T
16/09 T
52/09 T
37/10 T
33/11 T
45/08 T
13/10 T
54/08 T
39
2.3.3 URR amplification and sequencing
Following the amplification of the L1 ORF, four isolates had insufficient DNA and were excluded from
DNA amplification and nucleotide determination during downstream analysis. These included
VBD04/11, VBD69/15, VBD45/08 and VBD54/08.
PCR amplification of the URR was performed on the available isolates with primers summarised in
Table 1. The remaining 68 samples were purified, and 5μl aliquots of the purified HPV11 DNA were
separated with gel electrophoresis. The gel was visualised and 68 samples had a band at the predicted
size confirming the presence of HPV11 URR DNA. The DNA concentration of the 68 samples ranged
from 48.5ng/µl – 423.9ng/µl. DNA concentrations of all isolates are included in Table 5.
Nucleotide sequences of each amplicon within the predicted size range were determined with Sanger
sequencing using the same primers as for the initial PCR (Table 1) plus an additional primer HPV11-
LCR-FF. By comparing sequence data of isolates sequenced in the current study and the A1 prototype
sequence URR sequence, 237 nucleotide substitutions could be observed in 23 genomic regions.
Nucleotide substitutions, insertions, and deletions in the URR are summarised in Table 7. All isolates
had a T7547C nucleotide substitution. Isolates VBD28/14 and VBD34/08 had A7289G, A7302C,
G7319T, G7329C, G7349C, T7359C, T7411G, G7519T, C7536T, A7570C, A7591G, A7626C,
T7646A, T7775G, G7780T, C7880T, C7928T, G29T, and C50T nucleotide substitutions. Other
nucleotide substitutions included C7333A in 10 isolates, A7413C in 55 isolates, C7479T in 66 isolates.
The maximum number of nucleotide substitutions per isolate was 20 in VBD34/08 and VBD28/14. Five
nucleotide substitutions were detected in isolates VBD33/16, VBD68/15, VBD41/15, VBD41/14,
VBD35/11, VBD59/09, VBD63/09, VBD16/09, VBD52/09, VBD37/10. A total of 118 nucleotide
insertions were present among 58 isolates. The majority of isolates had a C (50/68, 73.5%) or CC (7/68,
10.3%) insertion following genomic position 7575 (All except VBD44/15, VBD29/14, VBD55/13,
VBD47/12, VBD39/12, VBD37/11, VBD34/11, VBD23,10, VBD74/09, VBD15/10, and VBD34/08).
Only isolates with a C or CC insertion following genomic position 7575 had no nucleotide substitution
at genomic position 7413.
The maximum number of nucleotide insertions in an isolate was 28 in VBD28/14 in five genomic
regions, followed by 27 in VBD34/08 in four genomic regions. Insertions in VBD28/14 included a
GCACGC insertion following genomic position 7527, a C insertion following genomic position 7575,
a GGCGCCA insertion following genomic position 7715, a TGGGTTG insertion following genomic
position 7744, and a TTATCTC insertion following genomic position 7746. Insertions in VBD34/08
included a GCACGC insertion following genomic position 7527, a GGCGCCA insertion following
40
Table 7: Sequence comparison of nucleotide variances among 68 human papillomavirus type 11 (HPV11) isolates upper
regulatory region (URR) against the HPV11 prototype with GenBank accession number M14119.1.
For the URR, a corrected sequence with a 2bp insertion at nt 7717-7718 was used when determining genomic variants and
genomic positions (Maver et al., 2011). Residues that differed to those of HPV11 URR prototype are indicated by colour
differences. White depicts nucleotide substitutions, yellow depicts nucleotide insertions and green depicts nucleotide deletions.
Samples excluded from analysis are indicated in dark grey.
URR Genomic region
Base A A G G C G T T A C T G T-G T C A A-C A A T T GCCC AAGTAT T T-G A-T G-T T G C C G C
Isolate
VBD
Number
12/18 C T Del C C
23/17 C T Del C C
68/16 C T Del C C
48/16 C T Del C C
33/16 A C T Del C C
68/15 A C T Del C C
49/15 C T Del C C
41/15 A C T Del C C
44/15 T Del C
16/15 C T Del C C
14/15 C T Del C C
41/14 A C T Del C C
28/14 G C T C C C G T GCACGC C T C C G C A Del Del Del Del GGCGCCA TGGGTTG TTATCTC G T T T T T
17/14 C T Del C CC
22/14 C T Del C CC
29/14 T Del C
55/13 T Del C
29/13 C T Del C C
08/13 C T Del C C
48/12 C T Del C C
47/12 T Del C
39/12 T Del C
37/12 C T Del C C
09/12 C T Del C C
61/11 C T Del C C
62/11 C T Del C C
58/11 C T Del C C
52/11 C T Del C C
41/11 C T Del C C
37/11 T Del C
35/11 A C T Del C C
49/12 C T Del C C
30/11 C T Del C C
34/11 T Del C
14/11 C T Del C C
04/11
48/10 C T Del C C
33/10 C T Del C C
63/11 C T Del C C
02/12 C T Del C C
04/12 C T Del C C
26/10 C T Del C C
21/10 C T Del C C
23/10 T Del C
01/10 C T Del C CC
07/10 C T Del C CC
17/10 T Del C CC
34/10 C T Del C CC
79/09 C T Del C C
74/09 T Del C
15/10 T Del C
59/09 A C T Del C C
63/09 A C T Del C C
69/15
55/08 C T Del C C
34/08 G C T C C C G T GCACGC C T C G C A Del Del Del Del GGCGCCA TGGGTTG TTATCTC G T T T T T
37/08 C T Del C C
08/09 C T Del C C
14/09 C T Del C C
41/12 C T Del C C
17/09 C T Del C C
49/08 C T Del C C
59/08 C T Del C C
69/09 C T Del C C
16/10 C T Del C C
16/09 A C T Del C C
52/09 A C T Del C C
37/10 A C T Del C C
33/11 C T Del C CC
45/08
13/10 C T Del C C
54/08
genomic position 7715, a TGGGTTG insertion following genomic position 7744, and a TTATCTC
insertion following genomic position 7746.
41
50
29
7928
7880
7780
7775
7746-7747
7744-7745
7715-7716
7699
7692-7697
7684-7687
7682
7646
7626
7591
7575-7576
7570
7536
7547
7527-7528
7519
7506
7479
7413
7411
7359
7349
7333
7329
7319
7302
7289
There were 90 nucleotide deletions observed in the 68 isolates; 66 of the 68 (97%) isolates had a single
nucleotide deletion at genomic position 7506 and isolates VBD28/14 and VBD34/08 each had 12
nucleotide deletions at genomic positions 7682, 7684-7687, 7692-7697 and 7699. No isolates were
identical to the prototype (GenBank accession number M14119.1).
2.3.1 E5a/b ORF amplification and sequencing
PCR amplification of the E5a/b ORF were performed on all available isolates with primers summarised
in Table 1. One isolate, VBD55/08, had insufficient DNA for downstream analysis.
Samples were purified, and 5μl aliquots of the purified HPV11 DNA were separated with gel
electrophoresis and visualised. Sixty-seven isolates had a band at the predicted size confirming the
presence of HPV11 E5a/b DNA. The DNA concentration of the isolates ranged from 28.7 ng/µl–229.3
ng/µl. All isolates DNA concentration are included in Table 5.
Nucleotide sequences of each amplicon within the predicted size range were determined with Sanger
sequencing using the same primers as for the initial PCR (Table 1). By comparing sequence data of
isolates sequenced in the current study and the A1 prototype sequence E5a/b sequence, 166 nucleotide
substitutions could be observed in 15 genomic regions (Table 8). These included A3891C in one isolate
(VBD23/10), A3952T and G3991C in all 67 isolates, A4142C in 11 isolates, G4325T in one isolate
(VBD48/10), and A4344G in one isolate (VBD68/16). Additionally, isolates VBD34/08 and VBD28/14
had C3902G, A3978G, A3993T, T4048C, C4166G, C4227T, A4274C, C4312A, and C4333T
nucleotide substitutions.
The maximum number of nucleotide substitutions was 11 in isolates VBD28/14 and VBD34/08, which
is 2.2% of the complete E5a/b ORF, followed by four nucleotide substitutions in VBD23/10, which is
0.8% of the complete E5a/b ORF. All isolates had nucleotide substitutions at genomic position 3952
and 3991. No nucleotide insertions or nucleotide deletions were observed compared to GenBank
accession number M14119.
42
Table 8: Sequence comparison of nucleotide variances among 67 human papillomavirus type 11 (HPV11) isolates E5a/b
region against the HPV11 prototype with GenBank accession number M14119.1.
Residues that differed to those of HPV11 E5a/b prototype are indicated by a white colour difference and the substituted base.
Samples excluded from analysis are indicated in dark grey.
E5a/b Genomic region
3 3 3 3 3 3 4 4 4 4 4 4 4 4 4
8 9 9 9 9 9 0 1 1 2 2 3 3 3 3
9 0 5 7 9 9 4 4 6 2 7 1 2 3 4
1 2 2 8 1 3 8 2 6 7 4 2 5 3 4
Base A C A A G A T A C C A C G C A
Isolate
VBD
Number
12/18 T C
23/17 T C
68/16 T C G
48/16 T C
33/16 T C
68/15 T C
49/15 T C
41/15 T C
44/15 T C C
16/15 T C
14/15 T C
41/14 T C
28/14 G T G C T C G T C A T
17/14 T C
22/14 T C
29/14 T C C
55/13 T C C
29/13 T C
08/13 T C
48/12 T C
47/12 T C C
39/12 T C C
37/12 T C
09/12 T C
61/11 T C
62/11 T C
58/11 T C
52/11 T C
41/11 T C C
37/11 T C C
35/11 T C
49/12 T C
30/11 T C
34/11 T C C
14/11 T C
04/11
48/10 T C T
33/10 T C
63/11 T C
02/12 T C
04/12 T C
26/10 T C
21/10 T C
23/10 C T C C
01/10 T C
07/10 T C
17/10 T C
34/10 T C
79/09 T C
74/09 T C C
15/10 T C C
59/09 T C
63/09 T C
69/15
55/08
34/08 G T G C T C G T C A T
37/08 T C
08/09 T C
14/09 T C
41/12 T C
17/09 T C
49/08 T C
59/08 T C
69/09 T C
16/10 T C
16/09 T C
52/09 T C
37/10 T C
33/11 T C
45/08
13/10 T C
54/08
43
2.3.2 E2 ORF amplification and sequencing
PCR amplification of the E2 ORF segment was performed on all available isolates with designed
primers (Table 1). The 67 available samples were purified, and a 5μl aliquot of the purified HPV11
DNA was separated with gel electrophoresis. All the isolates had a band at the predicted size confirming
the presence of target DNA. The DNA concentration ranged from 36.6 ng/µl – 117.5 ng/µl. DNA
concentrations of all isolates are included in Table 5.
Nucleotide sequences of each amplicon were determined with Sanger sequencing using the same
primers as for the initial PCR (Table 1). Among the 67 isolates, 144 nucleotide substitutions were
observed by comparing sequence data of isolates sequenced in the current study and the E2 segment of
the A1 prototype. Nucleotide substitutions are summarised in Table 9. Nucleotide substitutions were
observed in 11 genomic regions. These included A3633C in 65 isolates (all except VBD34/08 and
VBD28/14), A3652G in all 67 isolates, T3672C in one isolate (VBD62/11), C3705A in one isolate
(VBD61/11), A3738T in one isolate (VBD14/11), G3758C in one isolate (VBD29/14), A3779C is one
isolate (VBD74/09), and A3827G in one isolate (VBD34/08). Isolates VBD34/08 and VBD28/14 had
G3638C, C3788T, A3821G nucleotide substitutions.
The maximum number of nucleotide substitutions in the 208bp gene segment of the E2 was five (5/208,
2.4%) nucleotide substitutions in VBD34/08 and four (4/208, 1.9%) nucleotide substitutions in
VBD28/14. No nucleotide insertions or nucleotide deletions were observed. No isolates were identical
to the prototype (GenBank accession number M14119.1).
2.3.3 Analysis of amino acid variations
Analysis of L1 ORF sequences
The L1 gene contains 501 amino acids. By comparing the amino acid of isolates sequenced in the
current study and the A1 prototype, six nucleotide substitutions were identified as synonymous, and
five as non-synonymous. Non-synonymous nucleotide substitutions resulted in four amino acid
substitutions.
Nucleotide substitutions changed the amino acid sequence of two isolates, namely VBD28/14 and
VBD34/08. Both isolates contained T6296A and T6298C nucleotide substitutions causing a serine (S)
to a threonine (T) amino acid substitution and a C7197T nucleotide substitution causing an alanine (A)
to a valine amino acid substitution and only VBD34/08 contained a C6831T nucleotide substitution
causing a S to a leucine (L) amino acid substitution and a A7131C nucleotide substitution causing a
glutamic acid (E) to an A.
44
Table 9: Sequence comparison of nucleotide variances among 67 human papillomavirus type 11 (HPV11) isolates E2 segment
against the HPV11 prototype with GenBank accession number M14119.1.
Residues that differed to those of HPV11 E2 segment prototype are indicated by a white colour difference and the substituted
base.
E2 Genomic region
3 3 3 3 3 3 3 3 3 3 3
6 6 6 6 7 7 7 7 7 8 8
3 3 5 7 0 3 5 7 8 2 2
3 8 2 2 5 8 8 9 8 1 7
Base A G A T C A G A C A A
Isolate
VBD
Number
12/18 C G
23/17 C G
68/16 C G
48/16 C G
33/16 C G
68/15 C G
49/15 C G
41/15 C G
44/15 C G
16/15 C G
14/15 C G
41/14 C G
28/14 C G T G
17/14 C G
22/14 C G
29/14 C G C
55/13 C G
29/13 C G
08/13 C G
48/12 C G
47/12 C G
39/12 C G
37/12 C G
09/12 C G
61/11 C G A
62/11 C G C
58/11 C G
52/11 C G
41/11 C G
37/11 C G
35/11 C G
49/12 C G
30/11 C G
34/11 C G
14/11 C G T
04/11
48/10 C G
33/10 C G
63/11 C G
02/12 C G
04/12 C G
26/10 C G
21/10 C G
23/10 C G
01/10 C G
07/10 C G
17/10 C G
34/10 C G
79/09 C G
74/09 C G C
15/10 C G
59/09 C G
63/09 C G
69/15
55/08
34/08 C G T G G
37/08 C G
08/09 C G
14/09 C G
41/12 C G
17/09 C G
49/08 C G
59/08 C G
69/09 C G
16/10 C G
16/09 C G
52/09 C G
37/10 C G
33/11 C G
45/08
13/10 C G
54/08
45
Analysis of E5a/b ORF sequences
The E5a gene (nt 3871-4146) contains 91 amino acids and the E5b gene (nt 4146-4370) contains 73
amino acids. Three nucleotide substitutions were identified as synonymous and 12 substitutions were
identified as non-synonymous by comparing the amino acid of isolates sequenced in the current study
and the A1 prototype. Eleven amino acid substitutions were observed. Non-synonymous substitutions
included A3891C nucleotide substitution causing a glutamine (Q) to isoleucine (I) amino acid
substitution in one isolate (VBD23/10), A4142C nucleotide substitution causing Q to proline (P) amino
acid substitution in 11 isolates (VBD44/15, VBD29/14, VBD55/13, VBD47/12, VBD39/12,
VBD41/11, VBD37/11, VBD34/11, VBD23/10, VBD74/09 and VBD15/10), and A4344G nucleotide
substitution causing a N to aspartic acid (D) amino acid substitution in one isolate (VBD68/16). All
isolates contained a A3952T and G3991C nucleotide substitutions causing an I to phenylalanine (F) and
a valine to L amino acid substitution, respectively.
Isolates VBD28/14 and VBD34/08 had the most non-synonymous substitutions including C3902G
nucleotide substitution causing an A to glycine (G) amino acid substitution, A3952T nucleotide
substitution causing an I to F amino acid substitution, G3991C and A3993T nucleotide substitution
causing valine to L amino acid substitution, C4166G nucleotide substitution causing a histidine to Q
amino acid substitution, C4227T nucleotide substitution causing a L to F amino acid substitution,
A4274C nucleotide substitution causing a lysine (K) to asparagine (N) amino acid substitution, C4312A
nucleotide substitution causing a T to N amino acid substitution, and C4333T nucleotide substitution
causing a S to L amino acid substitution.
2.3.4 Phylogenetic analysis
Four phylogenetic trees were constructed to explore the phylogenetic relationship of the HPV11 isolates
sequenced in this study. Neighbor-Joining phylogenetic trees and maximum likelihood trees were
constructed with sequences obtained in this study and 28 sequences retrieved from GenBank of
geographically distinct HPV isolates belonging to different (sub)lineages.
Isolates with GenBank accession numbers M14119.1, LN833187.1, LN833185.1, LN833184.1,
LN833183.1, LN833169.1, LN833165.1, LN833161.1, KU298879.1, JQ773412.1, JQ773411.1,
JQ773409.1, JQ773408.1, JN644141.1, HE611263.1, HE574702.1, FR872717.1, FN907963.1,
FN907962.1, FN870021.1, EU918768.1, MN788368.1, MK463921.1, MK463916.1, MK463914.1,
MK313767.1, MK313765.1, and MK313763.1 were included. Lineage and sub-lineage representatives
are summarised in Table 10.
46
Table 10: GenBank accession numbers of human papillomavirus type 11 (HPV11) isolates and the (sub)lineage
representation.
GenBank Accession number Lineage and sub-lineage representatives
M14119.1 Sub-lineage A1
LN833161.1 Sub-lineage A2
Lineage A
LN833169.1 Sub-lineage A3
LN833187.1 Sub-lineage A4
LN833183.1 Lineage B
Maximum Likelihood method
The evolutionary history was deduced using the maximum likelihood method. The Tamura-Nei model
was determined as the best model for a reliable estimate maximum likelihood phylogeny with an
ultrafast bootstrap of 1000. The trees with the highest log-likelihood for concatenated L1-URR (-
10510,15) and E5a/b-L1-URR (-11642,10) are depicted in Figure 4 and Figure 5, respectively.
Analysis of concatenated L1-URR involved 96 nucleotide sequences, including 68 isolates sequenced
in this study and 28 sequences of geographically distinct isolates retrieved from GenBank. Results
revealed that analysis of concatenated L1-URR resolved into two lineages, namely the lineage A and
lineage B. The majority of isolates (49/68, 72.0%) clustered together with a 94% certainty in the
bootstrap test.
Analysis of concatenated E5a/b-L1-URR involved 95 nucleotide sequences. Sixty-seven sequences
obtained in this study, and 28 sequences retrieved from GenBank from geographically distinct isolates.
HPV11 concatenated E5a/b-L1-URR resolved into two lineages, namely lineage A and lineage B. The
majority of the isolates sequenced in this study (48/67, 71.6%) clustered with the A2 prototype
LN833161.1 and revealed a 97% certainty in the bootstrap test. Seven isolates from this study, clustered
with one isolate (retrieved from GenBank) previously identified as an A2 sub-lineage isolate with an
82% certainty in the bootstrap test (Jelen et al., 2016).
Three isolates previously identified as A1 sub-lineage isolates (EU918768.1, FN870021.1 and
FN907963.1), did not closely cluster together. No isolates sequenced in this study clustered with the A3
and A4 prototype strains.
47
Lineage A
Figure 4: Phylogenetic tree of human papillomavirus
type 11 (HPV11) based on alignment of 96 nucleotide
sequences of HPV11 concatenated L1-URR. This
analysis involves 68 isolates sequenced in this study
and 28 sequences of geographically distinct isolates
retrieved from GenBank.
M14119.1 (blue) was used as a representative of sub-
lineage A1, LN833161.1 (green) as a representative of
Lineage B sub-lineage A2, LN833169.1 (yellow) as a
representative of sub-lineage A3, LN833187.1 (red) as
a representative of sub-lineage A4. Lastly, LN833183.1
was used as a representative of lineage B. The
maximum likelihood tree was constructed using MEGA
X using the Maximum Likelihood method and Tamura-
Nei model.
48
Ten isolates from this study (VBD15/10, VBD29/14, VBD34/11, VBD37/11, VBD39/12, VBD44/15,
VBD47/12, VBD55/13, VBD23/10 and VBD74/09) clustered with two (MK463921.1 and
MK463916.1) sequences retrieved from GenBank with 99% and 100% certainty in the bootstrap test in
HPV11 concatenated L1-URR and E5a/b-L1-URR data sets, respectively. The sub-lineage
classification of these GenBank sequences are unknown.
Seven isolates from the current study (VBD01/10, VBD07/10, VBD17/10, VBD17/14, VBD22/14,
VBD33/11, and VBD34/10) clustered with isolates FN907963.1 and FN907962.1 with a 98% and 99%
certainty in the bootstrap test in HPV11 concatenated L1-URR and E5a/b-L1-URR data sets,
respectively. However, FN907963.1 has been previously identified as sub-lineage A1 and FN907962.1
as sub-lineage A2. One confirmed A1 sub-lineage isolate (EU918768.1) retrieved from GenBank shared
a node with confirmed A2 sub-lineage isolates with a 95% certainty in the bootstrap test. In both
concatenated L1-URR and E5a/b-L1-URR sequences, three lineage B isolates, one from GenBank
(LN833183.1) and two from the current study (VBD34/08 and VBD28/14), clustered together in the
bootstrap test with 100% certainty with one isolate being identical to the previously identified lineage
B isolate.
Pairwise analysis of percentage divergence of nucleotides
Analyses were performed with the use of the Maximum Composite Likelihood model and included 95
nucleotide sequences and used the concatenated E5a/b-L1-URR data set. This includes 67 sequences
obtained in this study, and 28 sequences retrieved from GenBank. All numbers were limited to three
decimals. The pairwise analysis of percentage divergence of nucleotides using the human
papillomavirus type 11 concatenated E5a/b-L1-URR data set is included in Appendix 10.
Divergence from the A1 prototype M14119.1 ranged from minimum 0,087 (EU918768.1 and
FN870021.1) to a maximum of 0,154 (VBD34/08, FN907962.1 and LN833183.1) with an average
divergence of 0,119. Divergence from the A2 prototype LN833161.1 ranged from 0,001 (VBD02/12,
VBD04/12, VBD08/09, VBD08/13, VBD09/12, VBD12/18, VBD13/10, VBD14/09, VBD14/11,
VBD14/15, VBD16/10, VBD16/15, VBD17/09, VBD21/10, VBD23/17, VBD26/10, VBD29/13,
VBD30/11, VBD33/10, VBD37/08, VBD37/12, VBD41/11, VBD41/12, VBD48/12, VBD48/16,
VBD49/08, VBD49/12, VBD49/15, VBD52/11, VBD58/11, VBD59/08, VBD61/11, VBD62/11,
VBD63/11, VBD69/09, HE611263.1, JN644141.1, JQ773409.1, MK313763.1, MN788368.1,
VBD79/09) to 0,166 (VBD34-08 and LN833183.1) with an average divergence of 0,037. Divergence
from the A3 prototype LN833169.1 ranged from 0,000 (LN833165.1) to 0,158 (LN833183.1) with an
average divergence of 0,058. Divergence from the A4 prototype LN833187.1 ranged from 0,005
(LN833165.1 and LN833169.1) to 0,159 (VBD34/08 and LN833183.1) with an average divergence of
0,060. Divergence from the lineage B prototype LN833183.1 ranged from 0,000 (VBD34/08) to 0,166
49
Sub-lineage A2
Sub-lineage A4
Sub-lineage A3
Sub-lineage A1
Figure 5: Phylogenetic tree of human papillomavirus
type 11 (HPV11) based on alignment of 95 nucleotide
sequences of HPV11 concatenated E5a/b-L1-URR. This
analysis involves 67 isolates sequenced in this study and
28 sequences of geographically distinct isolates
retrieved from GenBank.
M14119.1 (blue) was used as a representative of sub-
lineage A1, LN833161.1 (green) as a representative of
sub-lineage A2, LN833169.1 (yellow) as a
representative of sub-lineage A3, LN833187.1 (red) as a
Lineage B representative of sub-lineage A4. Lastly, LN833183.1
(orange) was used as a representative of lineage B. The
maximum likelihood tree was constructed using MEGA
X using the Maximum Likelihood method and Tamura-
Nei model.
50
(VBD16/09, VBD33/16, VBD35/11, VBD37/10, VBD41/14, VBD41/15, VBD48/10, VBD52/09,
VBD59/09, VBD63/09, VBD68/15, VBD68/16, FN870021.1, FR872717.1, JQ773408.1, JQ773409.1,
JQ773411.1, JQ773412.1, KU298879.1, LN833161.1, LN833165.1, LN833169.1, LN833183.1,
LN833184.1, LN833185.1, M14119.1, MK313763.1, MK313765.1, MK313767.1, MK463914.1) with
an average divergence of 0,161. VBD28/14 had a divergence of 0,120 from LN833183.1 and VBD34/08
(Appendix 10).
Analysis of E2 segment as representative of whole genome variation
The evolutionary history was inferred with the Neighbor-Joining method and the Maximum Composite
Likelihood method and involved 95 nucleotide sequences. There were a total of 208 nucleotide
positions in the final dataset. The phylogenetic trees are shown in Figure 6 and Figure 7.
HPV11 E2 segment sequences clearly resolved into two lineages, namely the lineage A and lineage B.
Sixty-five of the 67 isolates (97.0%) sequenced clustered together with the A2 prototype LN833161.1
and revealed a 52% and 56% certainty in the bootstrap test the Neighbor-Joining and maximum
likelihood phylogenetic trees, respectively. Lineage B isolates clustered together with a 95% certainty
in the bootstrap test in both the Neighbor-Joining and maximum likelihood phylogenetic trees.
Four isolates clustered with the A1 prototype M14119.1 with 54% and 59% confidence according to
the Neighbor-Joining and maximum likelihood methods, respectively. Analysis of estimates of
evolutionary divergence between sequences revealed the minimum divergence from the A1 prototype
M14119.1 was 0,000 (EU918768.1, FN870021.1, and FN907963.1), the maximum was 0,024
(VBD34/08 and LN833183.1) (Appendix 11).
Divergence from A2 prototype LN833161.1 ranged from 0,000 (VBD01/10, VBD02/12, VBD04/12,
VBD07/10, VBD08/09, VBD08/13, VBD09/12, VBD12/18, VBD13/10, VBD14/09, VBD14/15,
VBD15/10, VBD16/09, VBD16/10, VBD16/15, VBD17/09, VBD17/10, VBD17/14, VBD21/10,
VBD22/14, VBD23/10, VBD23/17, VBD26/10, VBD29/13, VBD30/11, VBD33/10, VBD33/11,
VBD33/16, VBD34/10, VBD34/11, VBD35/11, VBD37/08, VBD37/10, VBD37/11, VBD37/12,
VBD39/12, VBD41/11, VBD41/12, VBD41/14, VBD41/15, VBD44/15, VBD47/12, VBD48/10,
VBD48/12, VBD48/16, VBD49/08, VBD49/12, VBD49/15, VBD52/09, VBD52/11, VBD55/13,
VBD58/11, VBD59/08, 59/09, VBD63/09, VBD63/11, VBD68/15, VBD68/16, VBD69/09,
VBD69/15, VBD79/09, HE574702.1, HE611263.1, JN644141.1, JQ773409.1, KU298879.1,
LN833184.1, LN833185.1, MK313763.1, MK313765.1, MK313767.1, MN788368.1) to 0,020
(VBD34/08 and LN833183.1) (Appendix 11).
No isolates sequenced in the current study clustered with the A3 or A4 prototype sequences. Divergence
from the A3 prototype LN833169.1 ranged from 0,000 (LN833165.1) to 0,024 (VBD34/08 and
51
LN833183.1) (Appendix 11). The A4 prototype LN833187.1 contains a wobble nucleotide in the 208bp
region of the E2 ORF. One nucleotide was substituted with the degenerative nucleotide ‘K’, which can
be interpreted as either a G or a T. Therefore, the trees were constructed using both a G and a T and
allocated LN833187.1(1) and LN833187.1 (2), respectively (Appendix 4).
Divergence from LN833187.1_(1) ranged from 0,005 (VBD01/10, VBD02/12, VBD04/12, VBD07/10,
VBD08/09, VBD08/13, VBD09/12, VBD12/18, VBD13/10, VBD14/09, VBD14/15, VBD15/10,
VBD16/09, VBD16/10, VBD16/15, VBD17/09, VBD17/10, VBD17/14, VBD21/10, VBD22/14,
VBD23/10, VBD23/17, VBD26/10, VBD29/13, VBD30/11, VBD33/10, VBD33/11, VBD33/16,
VBD34/10, VBD34/11, VBD35/11, VBD37/08, VBD37/10, VBD37/11, VBD37/12, VBD39/12,
VBD41/11, VBD41/12, VBD41/14, VBD41/15, VBD44/15, VBD47/12, VBD48/10, VBD48/12,
VBD48/16, VBD49/08, VBD49/12, VBD49/15, VBD52/09, VBD52/11, VBD55/13, VBD/11,
VBD59/08, VBD59/09, VBD63/09, VBD63/11, VBD68/15, VBD68/16, VBD69/09, VBD69/15,
VBD79/09, EU918768.1, FN870021.1, FN907963.1, HE574702.1, HE611263.1, JN644141.1,
JQ773409.1 KU298879.1, LN833161.1, LN833165.1, LN833169.1, LN833184.1, LN833185.1,
LN833187.1_(2), M14119.1, MK313763.1, MK313765.1, MK313767.1, MN788368.1) to 0,020
(VBD34/08 and LN833183.1) (Appendix 11).
Divergence from LN833187.1_(2) ranged from 0,005 (LN833187.1_(1)) to 0,024 (VBD34/08 and
LN833183.1) with an average divergence of 0,011. Divergence from LN833183.1 ranged from 0,000
(VBD34/08) to 0,034 (FR872717.1, JQ773408.1, JQ773411.1, JQ773412.1, and MK463914.1)
(Appendix 11).
Three lineage B isolates (VBD34/08, VBD28/14 and, LN833183.1) clustered together in the bootstrap
test with 95% certainty according to the bootstrap test in both the Neighbor-Joining and maximum
likelihood phylogenetic trees. Divergence from ranged from 0,000 (VBD34/08) to 0,034 (FR872717.1,
JQ773408.1, JQ773411.1, JQ773412.1 (Appendix 11).
52
Lineage A
Lineage B
Figure 6: Phylogenetic tree of human papillomavirus type 11 (HPV11) based on 95 nucleotide sequence alignments of
HPV11 E2 segment. This analysis involves 67 isolates sequenced in this study and 28 sequences of geographically distinct
isolates retrieved from GenBank.
M14119.1 (blue) was used as a representative of sub-lineage A1, LN833161.1 (green) as a representative of sub-lineage
A2, LN833169.1 (yellow) as a representative of sub-lineage A3, LN833187.1 (red) as a representative of sub-lineage A4.
Lastly, LN833183.1 (orange) was used as a representative of lineage B. A single, double peak was denoted as a wobble
nucleotide in the LN833187.1 genome sequence submitted to GenBank, and therefore for readability, the trees were
constructed using both a G and a T and allocated LN833187.1(1) and LN833187.1 (2), respectively. The maximum
likelihood tree was constructed using MEGA X using the Maximum Likelihood method and Tamura-Nei model.
53
Lineage A
Lineage B
Figure 7: Phylogenetic tree of human papillomavirus type 11 (HPV11) based on 95 nucleotide sequence alignments of
HPV11 E2 segment. This analysis involves 67 isolates sequenced in this study and 28 sequences of geographically distinct
isolates retrieved from GenBank.
M14119.1 (blue) was used as a representative of sub-lineage A1, LN833161.1 (green) as a representative of sub-lineage
A2, LN833169 (yellow) as a representative of sub-lineage A3, LN833187.1 (red) as a representative of sub-lineage A4.
Lastly, LN833183.1 (orange) was used as a representative of lineage B. A double peak was denoted as a wobble nucleotide
in the LN833187.1 genome sequence submitted to GenBank, hence, the tree was constructed using both a G and a T and
allocated LN833187.1(1) and LN833187.1 (2), respectively. The Neighbor-Joining tree was constructed using MEGA X.
54
2.4. Discussion
During the course of this study, genome sequencing was used to identify unique HPV11 variants
circulating in patients diagnosed with RRP treated at the Universitas Academic Hospital. HPV11 has a
prevalence of 1% to 4% in cervical samples in various South African regions (Giuliano et al., 2015;
Mbulawa et al., 2018). Several studies have reported HPV prevalence from different parts of South
Africa (Giuliano et al., 2015; Mbulawa et al., 2018). However, very few studies have focused on the
prevalence of HPV11 genetic variants in South Africa. Therefore, in the current study, we aimed to
investigated HPV11 variants circulating in the study population by means of genomic sequencing of
the L1, E5a/b and URR genes, which have been postulated to produce phylogenetic trees able to
distinguish between lineages and sub-lineages (Jelen et al., 2016). In addition, the hypothesis that a
208bp segment of the E2 genome generates tree topology which represents whole-genome variation
was also challenged (Jelen et al., 2016). HPV11 variant investigation is essential, as it is unknown if
genome variations may result in divergent infectivity rates, affects vaccine effectiveness, and disease
prognosis.
A previous study aiming to identify HPV6 variants in patients diagnosed with RRP in South Africa
identified three novel HPV6 variants (Combrinck et al., 2012). Subsequent studies focusing on the
molecular characterisation of HPV6 suggested that HPV6 sub-lineage B1 is associated with an
increased risk of developing genital warts (Flores-Díaz et al., 2017). HPV11 variants circulating in
South Africa are currently largely unknown and the common occurrence of HPV6 variants in patients
diagnosed with RRP in South Africa, suggest that similar patterns may be observed with HPV11.
Therefore, further investigative studies with regards to HPV 11 are warranted.
Between 2008 and 2018, 94 HPV11 isolates were identified in patients diagnosed with RRP at the
Universitas Academic Hospital in the Free State province of South Africa. Previously, a novel lineage
B isolate was identified, necessitating further investigation (Makatsa, 2012). However, this isolate was
not available for characterisation in the current study. In this study, the concatenated L1-URR and
E5a/b-L1-URR genes were characterised genetically to discriminate between different (sub)lineages.
Despite the proof-reading abilities of host cell polymerases, many SNPs were identified in isolates
characterised in this study which likely reflect a difference in the HPV11 genome.
Within the L1 ORF, which is generally considered a conserved region of the HPV genome, sequence
variation was low, likely due to the slow genomic evolutionary rate, which is estimated to be 10−8
nucleotide substitutions per site per year (Chen et al., 2009, 2011). The highest percentage of divergence
from the A1 prototype M14119.1 was 0.6%, implying a variant sub-lineage as this region is often used
to represent whole-genome variation (Burk et al., 2011). Ninety-five per cent (69/72) of sequences
incorporating a C6028T substitution and only one isolate was identical to the A1 prototype M14119.1.
55
However, the L1 gene is generally only used to identify the type of HPV, and additional sequencing is
required to identify variant lineages and sub-lineages reliably (Burk et al., 2011).
Although nucleic acid changes may imply a variant lineage or sub-lineage, nucleic acid variances do
not necessarily impact the function of the expressed protein. DNA alterations in an ORF may lead to
alterations in the gene product if non-synonymous substitutions are present (Lebeuf-Taylor et al., 2019).
Analysis of genome variances in the L1 ORF identified four non-synonymous substitutions in two
isolates, namely VBD28/14 and VBD34/08. The L1 major capsid protein is highly immunogenic and
has formed the basis of successful vaccines (Chabeda et al., 2018; Dadar et al., 2018). The L1 also
mediates vital functions for virus survival, such as encapsulation of the papillomavirus genome,
interaction with the host cell for infectious entry, and releasing the viral DNA into a new host cell (Barra
et al., 2019; Benedict & Derkay, 2021). Consequently, amino acid changes may impact virus survival
and the effectiveness of HPV11 targeted vaccines.
Typically, evaluation of vaccine antibody responses is studied using serological assays. Sites of
variation for the L1 region of HPV11 isolates analysed in this study were confined to a limited number
of internal residues implying that the influence of these variances on recognition by L1-specific
antibodies is likely to be insignificant, as indicated in a study on HPV16 (Pastrana et al., 2004). The
consequences of these polymorphisms in the novel lineage B and sub-lineage A3 and A4 and antibody
recognition following natural infection or vaccination by the present-day vaccines requires additional
investigation.
For HPV11 sub-lineage classification, sequencing of additional gene segments is required. Genomic
sequencing of the E5a/b revealed a 2.2% divergence from the A1 prototype M14119.1 in VBD28/14
and VBD34/08, and a 0.8% divergence from the prototype in VBD23/10. Most of the nucleotide
substitutions were non-synonymous and resulted in amino acid alterations, which may interfere with
the function of this protein (Lebeuf-Taylor et al., 2019). Proteins expressed by the E5a/b are each made
up of 83 amino acids and are involved in many processes, including immune evasion, and regulating
apoptosis. The E5a/b proteins also play a role in cell cycle pathways and influence cellular gene
expression (Chagas et al., 2011; Zhang et al., 2018).
Few studies have reported on gene mutations of E5 in HPV11, but studies on various HPV types have
reported that non-synonymous mutations might lead to changes in polarity, hydropathic potential, and
the amino acid side chain, which may alter protein folding (Halavaty et al., 2014; Hemmat & Baghi,
2018; Venuti et al., 2011; Zhang et al., 2018). Point-mutations in E5 may inhibit cell transformation,
hinder activation of the cyclin A pathway, and disrupt the acidification of endosomes. The E5 protein
of HPV has also been proven to increases the expression level of hepatocyte growth factor receptors,
which promotes transformed cell invasiveness and regulates the proliferation of infected cells (Halavaty
56
et al., 2014; Hemmat & Baghi, 2018; Venuti et al., 2011; Zhang et al., 2018). Conversely, studies have
also demonstrated that the E5 ORF can tolerate many mutations and maintain the hydrophobic nature
of E5, and that the conserved residues are sufficient to confer transforming activity (Mattoon et al.,
2001; Venuti et al., 2011). As E5 is necessary for many pathways, minor sequence variations between
E5 proteins from different HPV11 lineages and sub-lineages may significantly impact the function of
this protein. Thus, functional analysis of E5 in HPV11 may further decipher the effects of E5 mutations
on the HPV virus life cycle.
The URR is the most variable region within the HPV genome capable of accumulating and tolerating
more nucleotide mutations as it does not encode proteins. The URR interacts with many cellular and
viral factors and is involved with functions such as virus replication, gene expression, and transcription
(Fang et al., 2020; Ribeiro et al., 2018). In the current study, the URR of 68 isolates was successfully
sequenced, and genome analysis revealed 237 nucleotide substitutions, 118 nucleotide insertions and
90 nucleotide deletions. Isolates VBD34/08 and VBD28/14 had the most nucleotide differences
compared to the A1 prototype M14119.1, with both containing 20 nucleotide substitutions and 12
nucleotide deletions. In addition, isolate VBD28/14 had the maximum number of nucleotide insertions
of all isolates, 28, followed by 27 in VBD34/08.
A correlation between the transcriptional activity of HPV11 variants and RRP disease severity have
previously been reported (Gáll et al., 2013). HPV11 presents E2 binding sites distributed along the URR
regions and various promotors. These promotors are regulated differently, which indicates that
independent regulation of early proteins is essential for the viral life cycle (DiLorenzo & Steinberg,
1995; Dollard et al., 1992; Stoler et al., 1989). Duplication in the early viral promoter sequence of
HPV11 has been associated with a higher degree of disease severity and the genome alterations T7904A
and thymine at position 7546, as in the A1 prototype sequence M14119.1, may have enhancer effects
(Gáll et al., 2013). Due to the frequency of mutations in isolates from the current study, the functionality
of binding sites in the URR may be impacted, affecting many cellular and viral factors. Although the
URR is highly variable, the E2 transcription binding sites are reported to be highly conserved, and
mutations are rarely found in these regions (Fang et al., 2020; Ribeiro et al., 2018).
Analysis of the E2 genome segment sequenced in this study revealed that most isolates (60/67) had only
two point-mutations, and only a few (7/67) isolates had other mutations. The maximum number of
point-mutations was five in VBD34/08 (2,4%), followed by four in VBD28/14 (1,9%). The E2 protein
regulates essential factors during the viral life cycle, such as replication, transcription, and viral genome
partitioning. Therefore, the binding of the E2 protein to corresponding binding sites is necessary for
viral survival (Graham & Faizo, 2017; Kardani & Bolhassani, 2018a; Wallace & Galloway, 2014).
57
Phylogenetic trees are a practical way to present the evolutionary relationship between organisms as the
genetic sequence of contemporary sequences evolved from a common ancestor. Specific HPV11
genomic regions have previously been used to identify genetic variants as they provide sufficient
genetic variation to differentiate between HPV11 variants. In 2011, Maver and colleagues identified
two clades based on the topology of the maximum likelihood phylogenetic tree, namely the prototypic
and non-prototypic clades (Maver et al., 2011). Subsequently, Burk and colleagues defined a new HPV
sub-lineage as a 0.5% to 1.0% variation from the complete genome (Burk et al., 2011). This study
revealed two clades and that the maximum pairwise difference between variants is approximately 0.4%.
They designated the two clades as sub-lineage A1 (previously termed prototypic variant group) and
sub-lineage A2 (previously termed non-prototypic variant group) (Burk et al., 2011). Further
comparative phylogenetic sequence analysis of isolates collected from Australia, Slovenia, China,
Hungary, and Thailand revealed that the A2 sub-lineage predominates (Danielewski et al., 2013; Jelen
et al., 2016). Results were further supported by a generally high degree of sequence conservation
observed between diverse geographical regions throughout the world, suggesting that geographically
specific variants were uncommon for HPV11 (Danielewski et al., 2013; Jelen et al., 2016; Maver et al.,
2011). However, the discovery of new (sub)lineages in geographically restricted areas contradicts
previous assumptions that intratypic variants of HPV11 are not geographically restricted (Jelen et al.,
2016).
An extensive study on the genomic diversity of HPV11 from six continents revealed an additional
lineage B and two additional sub-lineages A3 and A4, for the first time (Jelen et al., 2016). Lineage B
had a 1.3% maximal pairwise distance, sub-lineage A4 had a complete nucleotide pairwise difference
of above 0.5%, and the two sub-lineage A3 isolates had a total nucleotide pairwise difference in the
range of 0.5% (Jelen et al., 2016). Results were visualised with a heatmap, and the specific complete
nucleotide pairwise difference was not mentioned. A single, double peak was denoted as a wobble
nucleotide in the LN833187 genome sequence submitted to GenBank, and therefore, the exact sequence
is unknown (Appendix 4). As in the current study, phylogenetic analysis of E5a/b ORF, the L1 ORF
and the URR using the maximum likelihood method identified sub-lineage A2 as the predominant sub-
lineage globally (Jelen et al., 2016). For increased accuracy of phylogenetic analysis in the current
study, 28 geographically distinct HPV11 representatives were included. Nonetheless, the tree topology
generated in the present study did not correlate completely with that of the study published by Jelen and
colleagues (Jelen et al., 2016).
Phylogenetic analysis of the concatenated L1-URR and E5a/b-L1-URR sequences identified in this
study and sequences retrieved from GenBank correctly resolved into two lineages: Lineage A and
lineage B. However, sub-lineage classification was frequently unclear. For example, isolates previously
identified as sub-lineage A1 (FN907963.1) and A2 (FN907962.1), did not cluster closely with the
58
corresponding prototypes and instead, clustered together. Seven isolates from the current study
clustered with isolates FN907963.1 and FN907962.1 with great certainty in the bootstrap test, therefore,
phylogenetic analysis of the concatenated L1-URR and E5a/b-L1-URR is unable to reliably classify
these isolates.
Neighbor-Joining and maximum likelihood trees were constructed using E2 segment data to determine
whether E2 tree topology would be identical to phylogenetic trees constructed using concatenated
E5a/b-L1-URR or L1-URR sequence data, and to determine whether the Neighbor-Joining tree and
maximum likelihood tree based on E2 segment sequence data would generate similar results. Maximum
likelihood tree construction is computationally intensive, therefore, using the Neighbor-Joining method
would be advantageous to generate results fast when using large data sets and for bootstrap analysis. In
this study, the E2 segment tree topology was not identical to the phylogenetic trees using either the
concatenated E5a/b-L1-URR or L1-URR. For instance, slight but essential variations were observed in
the maximum likelihood tree constructed using the E2 segment. EU918768.1, FN870021.1 and
FN907963.1 clustered correctly with M14119.1 when using the E2 segment, but not the concatenated
E5a/b-L1-URR or L1-URR segments. However, in the study published by Jelen and colleagues (2016),
analysis of the complete and partial genomes revealed EU918768.1, FN870021.1 and FN907963.1
clustered appropriately with the A1 prototype M14119.1.
In this study, we used phylogenetic software to calculate evolutionary distances between sequences and
identify correspondence regions by computing the proportion of nucleotide differences between each
pair of sequences. Analysis revealed isolates highly analogous to reference strains LN833169,
LN833187.1, LN833165.1 and LN833183.1 appropriately clustered, and the results collated with those
observed in the phylogenetic tree topology. Further analysis revealed that lineage B isolates VBD34/08
and LN833183.1 had the greatest divergence from the A1 prototype M14119.1, which is anticipated.
Analysis of concatenated E5a/b-L1-URR revealed FN907962.1 as highly dissimilar, which is
anomalous because FN907962.1 belongs to sub-lineage A2. LN833183.1 and VBD34/08 were the most
different to LN833169.1 and LN833187.1, irrespective of the wobble nucleotide in the E2 segment.
Analysis of estimates of evolutionary divergence between sequences revealed sub-lineage A2 isolates
FR872717.1, JQ773408.1, JQ773411.1, JQ773412.1, and MK463914.1 were the most dissimilar to
lineage B using concatenated E5a/b-L1-URR sequences and the E2 segment. However, the E2 segment
also identified several other isolates belonging to various sub-lineages to be equally dissimilar.
Discrepancies observed in this study may be due to the large number of additional isolates included in
this study or differences in software used to analyse the sequence data. Jelen and colleagues (2016) used
RAxML and Bayesian trees to produce phylogenetic trees with complete genome data, and RAxML
HPC2 (version 8.1.11) and MEGA (version 5) software to identify the most informative areas for whole-
59
genome-based phylogenetic clustering (Jelen et al., 2016). In the current study, MEGA X was used to
analyse the most informative regions determined by Jelen and colleagues (2016).
Results obtained in the current study supports the notion of whole-genome sequencing for HPV11
classification below lineage level as a standard. Whole-genome sequencing generates more information
regarding genetic variants and eliminates discrepancies seen in the current study. Concatenated L1-
URR, E5a/b-L1-URR sequences and the E2 segment had equivalent discriminatory power for lineage
identification. However, concatenated E5a/b-L1-URR provides unclear results regarding sub-lineage
classification. E2 Neighbor-Joining and maximum likelihood tree generated in this study generated a
similar tree topology. Due to the inconsistencies and discrepancies observed in the phylogenetic tree
topology using concatenated E5a/b-L1-URR, the sub-lineage classification of several isolates remained
undetermined and therefore it is unclear whether tree topology using the E2 segment can reliably
distinguish between sub-lineages. Investigations with a greater sample size are necessary to validate
these findings.
It is essential to characterise the current strains circulating amongst the community to compare the
various lineages and sub-lineages with disease severity in future studies, monitor the impact of the
vaccination campaign on the circulating HPV types, and guide vaccine development. In addition,
baseline data on circulating HPV genotypes may effectively monitor the impact of the vaccination
program on the community. Also, determining the genotype of HPV responsible for RRP in patients
may have prognostic implications.
60
CHAPTER 3- Characterisation of novel human papillomavirus type 11
isolates
3.1. Introduction
The identification and characterisation of potentially novel HPV lineages and sub-lineages has great
public health significance (Sridhar et al., 2015). Little is known about the correlation between HPV
lineages and sub-lineages and disease persistence, severity, prognosis, and malignant capability. The
relevance of HPV variant classification has increased as HPV type and intratypic variants may possibly
influence the course of disease (Gáll et al., 2013).
In the last two decades, NGS has been the most frequently used technique for identifying novel HPV
types and variants. NGS refers to high-throughput sequencing and is capable of massive parallel
sequencing of various DNA molecules (Ambulos et al., 2016; Kocjan et al., 2015). This relatively new
technology is quickly becoming an indispensable tool in laboratories for both the detection and
characterisation of clinically important pathogens such as HPV. NGS surpasses more conventional
detection methods such as Sanger sequencing by faster turnaround time and the improved depth of
knowledge regarding serotyping (Ambulos et al., 2016; Kocjan et al., 2015; Parker & Chen, 2017).
High-throughput sequencing is a powerful method that has been used to identify novel lineages and
sub-lineages of HPV11 using a WGS approach. Complete genome sequencing using NGS and
subsequent determination of percentage homology with known viral nucleotide sequences may aid in
identifying novel lineages and sub-lineages (Sridhar et al., 2015).
The classification of HPV into genera, species and types based on the L1 ORF is well established. A
L1 sequence with >10% dissimilarity to any other deposited sequence is defined as a novel HPV type.
A nucleotide variance of 0.5%-1.0% across the whole genome defines a variant sub-lineage, and a
nucleotide variance of 1%-10% defines a variant lineage (Burk et al., 2011; Maver et al., 2011).
However, the existence of a novel HPV variant is only confirmed when its complete genome is
sequenced and deposited with the International HPV Reference Centre
(https://www.hpvcenter.se/human_reference_clones), and the complete genome meets the
classification requirements set by Burk and colleagues (2011).
Recently, one new lineage and two new sub-lineages of HPV11 were described, warranting further
investigation into the genetics and distribution of these lineages (Jelen et al., 2016). Previous reports
suggested that concatenated E5a/b-L1-URR sequences, as well as a 208bp gene segment in the E2
reportedly represent whole-genome variation and may be used to construct phylogenetic trees
representative of the different lineages and sub-lineages (Jelen et al., 2016). Numerous isolates with
61
unique SNPs, amino acid changes and indels within the E5a/b, L1 and URR, as well as potential lineage
B isolates were identified using phylogenetic investigations. However sub-lineages could not be
conclusively defined, suggesting that a novel lineage and sub-lineage requires a complete genome
according to the classification and nomenclature system proposed previously (Bernard et al., 2010;
Burk et al., 2013; Chen et al., 2011; de Villiers et al., 2004; Siqueira et al., 2016). Therefore, in the
current chapter, whole genome sequences of selected isolates were determined using NGS and the
genetic relationships analysed. In addition, the hypothesis that the partial genome sequences, as
obtained in Chapter 2, generates tree topology which represents whole-genome variation is challenged
(Jelen et al., 2016). Data on novel lineages and sub-lineages of HPV may contribute to future
evolutionary- and vaccine studies, aid in vaccine development, and give insight into the pathogenicity
of HPV11.
3.2. Materials and methods
3.2.1. Study samples
Sample selection
Four isolates with unique SNPs, amino acid changes or indels within the E5a/b, L1 and URR were
selected for whole-genome sequencing based on results from the phylogenetic analysis described in
Chapter 2. Isolates which did not cluster closely with sub-lineage prototype sequences, as well as a
possible lineage B isolate were selected for whole genome sequencing. Isolates included VBD28/14,
VBD15/10, VBD74/09 and VBD01/10. Two isolates, VBD74/09 and VBD15/10, were isolated from
one patient collected approximately five months apart to determine whether HPV11 experienced any
nucleotide mutations between sample collection dates. Information on the selected isolates, including
the date of birth, date of diagnosis, date of sample collection and sex, is summarised in Table 13.
Informed consent for collection and storage of biopsies was obtained from each patient by Professor
Seedat from the Department of Otorhinolaryngology, Faculty of the Health Sciences, UFS.
HPV11 genome sequences retrieved from GenBank
Complete genome sequence data for 28 geographically distinct representatives of each HPV11, sub-
lineages A1, A2, A3 and A4, and lineage B, were retrieved from GenBank (Appendix 2). This included
the following isolates with GenBank accession numbers: M14119.1, LN833187.1, LN833185.1,
LN833184.1, LN833183.1, LN833169.1, LN833165.1, LN833161.1, KU298879.1, JQ773412.1,
JQ773411.1, JQ773409.1,JQ773408.1, JN644141.1, HE611263.1, HE574702.1, FR872717.1,
FN907963.1, FN907962.1, FN870021.1, EU918768.1, MN788368.1, MK463921.1, MK463916.1,
MK463914.1, MK313767.1, MK313765.1, and MK313763.1. Isolate M14119.1 was used as the sub-
lineage A1 representative, LN833161.1 was used as the sub-lineage A2 representative, LN833169 was
62
used as the sub-lineage A3 representative, LN833187 was used as the sub-lineage A4 representative,
and LN833183 was used as the lineage B representative. Sequence length ranged from 7932bp to
7949bp and is summarised in Appendix 2.
Ethical clearance was obtained from the Health Sciences Research Ethics Committee of the UFS (UFS-
HSD2019/1109/2708) (Appendix 3).
3.2.2. Full‐genome amplification and sequencing
The complete genome was determined by amplifying two overlapping DNA fragments representing the
complete HPV11 genome. Primers used for amplification are summarised in Table 11 (Jelen et al.,
2016). Visual representation of overlapping fragments is depicted in Figure 8. For both overlapping
PCR fragments, a Phusion ® High-Fidelity PCR kit (Thermo Scientific, Illinois, USA) was utilised
according to the manufacturer's instructions. Contents of the reaction mixture are summarised in Table
12. The cycling conditions were carried out as follows: 98°C for two minutes, followed by 40 cycles of
98°C for 10 seconds, 53.5°C for 30 seconds, and 72°C for three minutes. The final extension step was
carried out at 72°C for eight minutes before cooling to 4°C.
Table 11: List of primers used for sequencing full-length human papillomavirus type 11 (HPV11) genome in two overlapping
fragments.
Primer sequence GC Primer binding Amplicon
Region Primer name c
c Tm b d
content site size
5’ → 3’
3′ end of the E2 a TTACAACAAGC
HPV-11-S2F 38.10% 60.1℃ nt 3529 to 3549
ORF, the E5a/b ACCAAAGAAG
ORF, the L2 ORF,
4851bp
the L1 ORF, the
URR, and the 5′ TTCTATTTCACA
a
end of the E6 ORF HPV-11-S2R 40.00% 60.7℃ nt 427 to 446CAACGGCT
HPV-11-
3′ end of the L1 GGATATGAGTTT
a 42.86% 60.7℃ nt 7084 to 7104
ORF, the URR, the S1MS-F TTGGGAGGT
E6 ORF, the E7
4,521bp
ORF, the E1 ORF,
and a segment of HPV-11- ATGCCACGTTGA
a 45.00% 62.9℃ nt 3660 to 3679
the E2 ORF S1MS-R AGATGCTA
a
Primers reported by Jelen et al. , 2016.
b
Positions of the nucleotides determined with respect to the prototype human papillomavirus type 11 genome
(GenBank acc. no. M14119).
c
Melting temperature (Tm) and GC content determined by Thermo Fisher Scientific Tm calculator (Allawi &
SantaLucia, 1997).
d
Size determined with respect to the prototype human papillomavirus type 11 genome (GenBank acc. no. M14119).
63
Table 12: Polymerase chain reaction (PCR) components for amplifying human papillomavirus type 11 full-length genome in
two overlapping fragments (E1 to L1 genes; L1 to E1 genes).
Component Volume Final concentration
Phusion DNA polymerase 0,5µl 1,0 units
5 x Phusion HF Buffer 10µl 1X
10µM Forward primer 2,5µl 0,5µM
10µM Reverse primer 2,5µl 0,5µM
10mM dNTPs 1µl 200µM
Template DNA 5µl N/A
Nuclease free water 27,5µl N/A
Total 50µl N/A
M14119.1
7933bp
Figure 8: Genome organisation of a low-risk human papillomavirus type 11 (HPV11).
E 1-E7 early genes, L1-L2 late genes, URR Upper regulatory region, AE early polyA signal, and P97 and P742 promotors are
i ndicated. The figure is drawn based on HPV11 A1 prototype (GenBank accession number M14119.1). The yellow arrow line
indicates genome fragment amplified by primer pair HPV-11-S2F and HPV-11-S2R, and the black arrow line indicates genome
fragment amplified by the HPV-11-S1MS-F and HPV-11-S1MS-R primer pair.
64
3.2.3. Agarose gel electrophoresis
PCR amplicons were separated and visualised by electrophoresis using a 1% agarose gel. Briefly, a 1%
gel was prepared using 1g Seakem® LE agarose powder, and 100ml of 1x TAE buffer at pH 8.0 and
the gel was electrophoresed at 90V for 45 minutes. Tables summarising the preparation of TAE and 1%
agarose gels is included in Appendix 6 and 7. The DNA was stained using a GelRed nucleic acid gel
stain (Thermo Fisher Scientific, USA) (Appendix 8) and visualised using the BioRad Molecular Imager
Gel Doc™ XR+ with Image Lab™ Software (BioRad, California, USA) to determine the fragment sizes
according to the known DNA size marker. The O’GeneRuler™ 100bp DNA ladder SM0333
(Fermentas, Illinois, USA) containing DNA fragments from 100bp to 10 000bp was used to estimate
the size of the amplicons.
3.2.4. Purification of PCR product
The Wizard® SV Gel and PCR Clean-up System (Promega, Madison, USA) was used to purify target
DNA from a 1% agarose gel according to manufacturer’s instructions. Membrane Binding Solution was
added at a ratio of 10μl of solution per 10mg of agarose gel and the gel heated at 60°C until completely
dissolved. The contents were transferred to the SV Minicolumn and washed twice by adding 700μl and
500μl of Membrane Wash Solution. The DNA was eluted in 50μl of nuclease-free water and stored at
4°C until further use.
3.2.5. Determination of DNA concentration
DNA concentration was measured using the Qubit fluorometer and the Qubit dsDNA BR assay kit,
according to the manufacturer’s instructions (Thermo Fisher Scientific, Illinois, USA). A working
solution was made by diluting dsDNA BR reagent and BR buffer (1:200) in microcentrifuge tubes.
Then, 190μl working solution and 10μl of each standard were loaded in microcentrifuge tubes. In
separate microcentrifuge tubes, 198μl working solution and 2μl sample were loaded, vortexed and
incubated for two minutes at room temperature. The Qubit was calibrated, and DNA concentrations of
all samples were measured.
3.2.6. MiSeq library preparation and sequencing
The Nextera XT DNA Library Preparation kit (Illumina, California, USA) was used to convert the
purified DNA product to a short, fragmented DNA library, followed by size selection using AMpure
XP beads (Beckman Coulter, California, USA). The multiplexed libraries were analysed on an Illumina
MiSeq (Illumina, California, USA) with the MiSeq reagent kit v3 (300 cycles) (Illumina, California,
USA) at the UFS Next Generation Sequencing Unit.
65
3.2.7. Next-generation sequencing data analysis
FASTQ sequences were imported into Geneious 2021 software (https://www.geneious.com) as paired
ends with an insert size of 500. Reads were trimmed with an error probability limit of 0.05 and filtered.
Reads were assembled and aligned to the A1 prototype M14119.1 using a medium sensitivity, and a
consensus sequence was generated using the highest quality threshold. Using Geneious 2021 software
(https://www.geneious.com), the consensus sequence was inspected to identify ambiguities and
degenerative bases and exported as text documents. An assembly report was generated and the length
of the genome, CG content and pairwise identity was determined using Geneious 2021 software.
Analysis of nucleotide and amino acid variation
Genomic variants and genomic positions were identified by comparing the sequence data with the
prototype HPV11 genome (GenBank accession number M14119.1). A corrected sequence with a 2bp
insertion at genomic position 7717-7718 was used when determining genomic variants and genomic
positions (Maver et al., 2011). For the isolates sequenced in this study, nucleotide substitutions,
nucleotide insertions, and nucleotide deletions were identified with respect to the A1 prototype strain
(GenBank accession number M14119.1) using MEGA X software (Kumar et al., 2018).
The predicted amino acid sequences for each coding region were determined with the use of a codon
chart included in Appendix 9. Non-synonymous substitutions resulting in amino acid alterations were
identified by comparing amino acid data with the HPV11 A1 prototype (GenBank accession number
M14119.1).
Percentage variation in coding regions
To estimate the percentage of variation within the coding regions, the number of nucleotide- and amino
acid variations compared to the A1 prototype sequence M14119.1 were converted into a percentage
using the following equations:
𝑁𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑛𝑢𝑐𝑙𝑒𝑜𝑡𝑖𝑑𝑒 𝑣𝑎𝑟𝑖𝑎𝑛𝑐𝑒𝑠 𝑤𝑖𝑡ℎ𝑖𝑛 𝑎 𝑠𝑝𝑒𝑠𝑖𝑓𝑖𝑐 𝑐𝑜𝑑𝑖𝑛𝑔 𝑟𝑒𝑔𝑖𝑜𝑛 𝑐𝑜𝑚𝑝𝑎𝑟𝑒𝑑 𝑡𝑜 𝑀14119.1
× 100
𝑁𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑛𝑢𝑐𝑙𝑒𝑜𝑡𝑖𝑑𝑒𝑠 𝑤𝑖𝑡ℎ𝑖𝑛 𝑠𝑝𝑒𝑠𝑖𝑓𝑖𝑐 𝑐𝑜𝑑𝑖𝑛𝑔 𝑟𝑒𝑔𝑖𝑜𝑛 𝑜𝑓 𝑀14119.1
and
𝑁𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑎𝑚𝑖𝑛𝑜 𝑎𝑐𝑖𝑑 𝑣𝑎𝑟𝑖𝑎𝑛𝑐𝑒𝑠 𝑤𝑖𝑡ℎ𝑖𝑛 𝑎 𝑠𝑝𝑒𝑠𝑖𝑓𝑖𝑐 𝑐𝑜𝑑𝑖𝑛𝑔 𝑟𝑒𝑔𝑖𝑜𝑛 𝑐𝑜𝑚𝑝𝑎𝑟𝑒𝑑 𝑡𝑜 𝑀1411.1
× 100
𝑁𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑎𝑚𝑖𝑛𝑜 𝑎𝑐𝑖𝑑𝑠 𝑤𝑖𝑡ℎ𝑖𝑛 𝑠𝑝𝑒𝑠𝑖𝑓𝑖𝑐 𝑐𝑜𝑑𝑖𝑛𝑔 𝑟𝑒𝑔𝑖𝑜𝑛 𝑜𝑓 𝑀14119.1
Degree of divergence
The degree of divergence was calculated using pairwise analysis of percentage divergence of
nucleotides with MEGA X software using the Maximum Composite Likelihood model. Codon positions
66
included were 1st, 2nd, 3rd, and non-coding. All positions with less than 95% site coverage were removed
using the partial deletion option. The degree of divergence was calculated for isolates sequenced in the
current chapter, as well as the 28 complete genomes retrieved from GenBank and divergence from
lineage A and B prototypes were described. The degree of divergence between isolates sequenced in
the current chapter was also calculated.
Maximum likelihood method
The evolutionary history was inferred using the maximum likelihood method and the Tamura-Nei
model with a bootstrap value of 1000 software using sequence data determined in the current chapter,
and 28 complete sequences retrieved from GenBank. Analysis was performed using MEGA X software.
Initial tree(s) for the heuristic search were obtained automatically by applying Neighbor-Join and BioNJ
algorithms to a matrix of pairwise distances estimated using the Tamura-Nei model, then selecting the
topology with the higher log-likelihood value. The 1st, 2nd, 3rd, and non-coding codon positions were
included in the analysis.
Isolates with known (sub)lineage classification based on phylogenetic analysis were aligned with the
corresponding reference strain using MEGA X software to identify nucleotide- and amino-acid
variabilities (Kumar et al., 2018).
3.3. Results
3.3.1. Selection of isolates for whole genome sequencing
Four isolates were selected for complete genome determination. Two isolates (VBD15/10 and
VBD74/09) from one patient collected approximately five months apart were selected because the
concatenated E5a/b-L1-URR nucleotide sequence profiles were highly similar and therefore considered
representative of two isolates retrieved from GenBank for which compete genome data was available
and eight isolates from the current study of RRP patients. VBD01/10 was selected because the
concatenated E5a/b-L1-URR nucleotide sequence profiles were highly similar to an isolate retrieved
from GenBank for which compete genome data was available and representative of six isolates from
the current study. These isolates did not cluster closely with any prototype sequence. VBD34/08 and
VBD28/14 were identified as potential lineage B isolates based on the tree topology however, there was
insufficient DNA available to obtain further sequence data for VBD34/08. Thus, overall, the complete
genome of four HPV11 isolates, VBD01/10, VBD15/10, VBD28/14 and VBD74/09, from three patients
were sequenced in this study.
Information regarding patient sex, date of birth, date of diagnosis and sample collection is included in
Table 13. The DNA sequencing reads were mapped to the A1 prototype M14119.1 and a consensus
67
sequence was generated for each isolate. The number of assembled reads is summarised in Table 14.
The minimum and maximum lengths of the consensus sequences, the CG content and the pairwise
identity are summarised in Table 15.
Additional genome sequence data for 28 geographically distinct representatives of each HPV11 lineage,
including lineage A, sub-lineages A1 to A4, and lineage B, were retrieved from GenBank and used for
downstream analysis (Appendix 2).
Table 13: Patient information.
Isolate Sex Date of birth Date of diagnosis Date of sample collection
VBD01/10 Male 2007/01/04 2010/01/06 2009/11/03
VBD28/14 Male 2009/03/29 2014/06/26 2010/01/06
VBD74/09 Male 2009/11/03
2009/10/13 2009/10/23
VBD15/10 Male 2014/06/26
Table 14: Next-generation sequencing assembly report for four complete human papillomavirus type 11 (HPV11) mapped to
the A1 prototype M14119.1 using Geneious Prime software (https://www.geneious.com).
Assembly report
Isolate Total reads Assembled reads Not assembled reads
VBD01/10 1,033,592 1,022,153 11,439 reads
VBD15/10 1,652,362 1,636,151 16,211 reads
VBD28/14 1,333,518 1,306,323 27,195 reads
VBD74/09 1,903,068 1,879,877 23,191 reads
Table 15: The lengths of the complete human papillomavirus type 11 (HPV11) consensus sequence, the CG content and the
pairwise identity of four HPV11 isolates mapped to the A1 prototype M14119.1 using Geneious Prime software
(https://www.geneious.com).
The corrected M14119.1 with a two base-pair (bp) insertion at genomic position 7717-7718 was used (Maver et al., 2011).
.
Isolate Sequence length CG content Pairwise identity
VBD01/10 7934bp 41.1% 99.3%
VBD15/10 7932bp 41.1% 99.4%
VBD28/14 7949bp 41.1% 99.3%
VBD74/09 7932bp 41.1% 99.4%
68
3.3.2. Nucleotide and amino acid variances across the human papillomavirus type 11
genome
The nucleotide sequence variabilities for the four complete HPV11 genomes are summarised in Table
16. Briefly, VBD01/10 contained 25 nucleotide substitutions, a nucleotide deletion and two nucleotide
insertions compared to the A1 prototype M14119.1, VBD15/10 and VBD74/09 contained 26 nucleotide
substitutions and one nucleotide deletion compared to the A1 prototype M14119.1, and VBD28/14
contained 94 nucleotide substitutions, 12 nucleotide deletions and 28 nucleotide insertions compared to
the A1 prototype M14119.1. In total, 45 indels were identified among four isolates in the URR.
Nucleotide insertions include an ACGCGC insertion following genomic position 7529 in VBD28/14, a
C and CC insertion following genomic position 7584 in VBD28/14 and VBD01/10, respectively, a
CGCCAGG insertion following genomic position 7723 in VBD28/14, and a GGTTGTGTTATCTC
insertion following genomic position 7746 in VBD28/14. Nucleotide deletions include a T deletion at
genomic position 7509 in VBD01/10, VBD15/10 and VBD74/09, and a GTATCTTGCCAA deletion
from genomic position 7694 to 7705 in VBD28/14. All isolates contained nucleotide substitutions
T137C, C1783G, G1784C, A2580G, C2884T, T2888C, G3436A, A3645G, A3832G, A3952T,
G3991C, C4647T, C4887A, and T7547C. VBD15/10 and VBD74/09 had identical nucleotide
substitutions, insertions, and deletions.
The amino acid sequence variabilities for the four complete HPV11 genomes are summarised in Table
17. By aligning the four isolates sequenced in the current study and the A1 prototype sequence
(GenBank accession number M14119.1), 69 non-synonymous substitutions resulting in changed amino
acids were identified. Briefly, VBD01/10 had ten non-synonymous substitutions, VBD15/10 and
VBD74/09 had 12 non-synonymous substitutions, and VBD28/14 had 35 non-synonymous
substitutions. All isolates contained an arginine (R) to A substitution in the E1 ORF, a K to R non-
synonymous substitution in the E2 ORF, a G to E non-synonymous substitution in the E4 ORF, and a
valine to F and T to L in the E5a ORF. VBD15/10 and VBD74/09 had identical amino acid sequences.
As previously mentioned, a nucleotide variance of 0.5%-1.0% across the whole genome defines a
variant sub-lineage and the nucleotide sequence of the L1 ORF is often used to represent the whole
genome variation due to the high degree of conservation in this region (Burk et al., 2011, 2013).
Subsequently, to estimate the degree of variation in coding regions, the percentage difference between
isolates in the current study and the A1 prototype sequence M14119.1 was determined. In brief, the
nucleotide sequences of coding regions varied from 0.46% in the L1 ORF to 2.22% in the E5b ORF.
The amino acid sequences variability ranged from 0.39% in the L1 ORF to 6.75% in the E5b ORF.
69
Table 16: Sequence comparison of nucleotide variances among four human papillomavirus type 11 (HPV11) isolates against
the HPV11 prototype with GenBank accession number M14119.1.
Residues that differed from those of the HPV11 prototype are indicated by a white (nucleotide substitutions), green (nucleotide
deletions) and yellow (nucleotide insertions) colour change and the substituted-, deleted- or inserted nucleotides. The
corrected M14119.1 with the 2bp insertion at genomic position 7717-7718 was used (Maver et al., 2011).
IsolateGene Genomic region Base VBD01/10 VBD15/10 VBD24/18 VBD74/09
137 T C C C C
180 C T
E6 102..554 295 C T
380 C T T T
466 G A
563 G T
E7 530..826 566 C T
662 G T T T
1107 A C
1130 A C C
1352 T C
1381 G A
1426 C T
1488 A C
1623 T C C
1783 C G G G G
1784 G C C C C
E1 2100 T A
832..2781 2169 T C
2172 A T
2256 C A
2262 A G
2328 T G
2358 C T
2385 A G
2412 A C
2523 G A
2580 A G G G G
2854 C T
2884 C T T T T
2888 T C C C
3166 A C
3391 A G
3436 G A A A A
3462 A C C
3487 C T T T
E2
3492 C T
2723..3826 3511 C A
^E4 3516 T G
3255..3581 3537 A C
3566 G A
3570 C A
3626 A C C C
3631 G C
3645 A G G G G
3772 A C C
3781 C T
3814 A G
3830 T G
3832 A G G G G
3902 C G
3952 A T T T T
E5a 3978 A G
3991 G C C C C
3871..4146
3993 A T
4048 T C
4142 A C C
4166 C G
E5b 4227 C T
4274 A C
4146..4370
4312 C A
4333 C T
4380 T G G
4452 A G
4584 A G
4647 C T T T T
4740 G A
4752 T C
4815 T G
4821 G A
4887 C A A A A
L2 4989 C A
5059 G A
4417..5784
5157 T G
5235 A G
5290 C A
5389 G A
5416 C A
5470 T C
5518 C T
5523 C T
5706 C T
5929 A G
6028 C T T T
6296 T A
L1 6298 T C
5771..7276 6949 T A
7045 G A
7197 C T
7237 T C
7289 A G
7302 A C
7319 G T
7329 G C
7349 G C
7359 T C
7411 T G
7413 A C
7479 C T T T
7509 T DEL DEL DEL
7519 G T
7529-7530 ACGCGC
URR 7547 T C C C C
7563 C T
7277..7933,
7570 A C
1..101 7584-7585 CC C
7591 A G
7626 A C
7646 T A
7694-7705 GTATCTTGCCA DEL
7724-7725 CGCCAGG
7746-7747 GGTTGTGTTATC
7775 T G
7780 G T
7880 C T
7928 C T
27 G T
48 C T
70
Table 17: Comparison of amino acid variances among four human papillomavirus type 11 (HPV11) isolates against the
HPV11 prototype with GenBank accession number M14119.1.
The total number of amino acids in each gene and the amino acid number where a non-synonymous is observed is indicated.
Residues that differed from those of the HPV11 prototype are indicated by a white colour difference and the substituted amino
acid.
Isolate
Amino Amino
Gene VBD01/10 VBD15/10 VBD28/14 VBD74/09
acid no. acid*
E6 65 A V
1..150 122 G E
E7 12 V L
1..98 45 A S S S
100 N T T
174 A A
E1
184 D N
1..649
318 R A A A A
527 Q H
247 N T T
257 S F
263 N K
265 I S
E2
272 K T
1..367
282 A T
283 A D
308 K R R R R
350 K N N
46 Q R
61 G E E E E
70 T P P
78 T L L L
E4
80 D S
1..108
86 T K
88 S A
95 S R
106 L I
11 I G
E5A 28 V F F F F
1..91 41 T L L L L
91 Q P P
7 H Q
28 L F
E5B
43 K N
1..74
56 T N
63 S L
191 D E
215 A T
292 L I
L2
325 V I
1..455
334 L M
351 F L
368 L F
L1 176 S T
1..501 476 A V
*A – alanine, D – aspartic acid, E – glutamic acid, F – phenylalanine, G – glycine, H – histidine,
I – isoleucine, K – lysine, L – leucine, M – methionine, N – asparagine, P – proline, Q –
glutamine, R – arginine, S – serine, T – threonine, V – valine
71
3.3.3. Pairwise nucleotide difference between complete human papillomavirus type
11 genomes
Pairwise nucleotide difference between complete genomes obtained in this study and sequence data
retrieved from GenBank is summarised in Appendix 12. There were a total of 7920 positions in the
final dataset. Pairwise nucleotide difference between the four complete genomes obtained in this study
and 28 complete genome sequences obtained from GenBank varied from 0.000 to 0,01275 (Appendix
12). Divergence from the sub-lineage A1 prototype M14119.1 ranged from 0,00038 (FN907963.1) to
0,01224 (LN833183.1) with an average divergence of 0,00378. Divergence from the sub-lineage A2
prototype LN833161.1 ranged 0,00025 (HE611263.1 and JQ773409.1) to 0,01224 (LN833183.1) with
an average divergence of 0,00241. Divergence from the sub-lineage A3 prototype LN833169.1 ranged
from 0,00139 (LN833165.1) to 0,01275 (LN833183.1) with an average divergence of 0,00578.
Divergence from the sub-lineage A4 prototype LN833187.1 ranged from 0,00507 (LN833165.1) to
0,01275 (LN833183.1) with an average divergence of 0,00624. Divergence from the lineage B
prototype LN833183.1 ranged from 0,00051 (VBD28/14) to 0,01275 (LN833169.1, LN833187.1) with
an average divergence of 0,01186 (Appendix 12).
Pairwise nucleotide difference between the four complete genomes obtained in this study is summarised
in Table 18. There were a total of 7920 positions in the final dataset. VBD15/10 and VBD74/09 had an
evolutionary divergence of zero.
Table 18: Estimates of evolutionary divergence between four human papillomavirus type 11 (HPV11) sequences using MEGA
X software.
The corrected M14119.1 with the 2bp insertion at genomic position 7717-7718 was used (Maver et al., 2011).
VBD01/10 VBD15/10 VBD28/14 VBD74/09
VBD01/10
VBD15/10 0,00139
VBD28/14 0,01184 0,01197
VBD74/09 0,00139 0 0,011973
72
3.3.4. Maximum Likelihood method
The evolutionary history was inferred by using the maximum likelihood method and Tamura-Nei
model. The tree with the highest log likelihood (-12751.59) is depicted. The analysis involved 32
nucleotide sequences and 7972 positions were included in the final dataset. Phylogenetic analysis of the
HPV11 complete genome resolved into two lineages, namely lineage A and lineage B.
Two isolates, VBD28/14 and LN833183, (2/32) were identified as lineage B and clustered together with
a 100% certainty in the bootstrap test (Figure 9). The majority of isolates (23/28, 82%), including
VBD01/10, VBD15/10, and VBD74/09, clustered with the A2 reference LN833161.1. VBD15/10 and
VBD74/09 clustered separately from the other sub-lineage A2 isolates (Figure 9).
3.3.5. Genome heterogeneities with respect to the corresponding prototype
Inferred by phylogenetic analysis, VBD28/14 was identified as a lineage B isolate. There were three
nucleotide differences identified between VBD28/14 and the lineage B prototype LN833183.1.
Nucleotide heterogeneities are summarised in Table 19. Based on the alignment of VBD28/14 with the
lineage B prototype LN833183.1, one non-synonymous substitution could be identified. A C6831T
nucleotide substitution resulted in an A to E amino acid substitution (Table 19).
Table 19: Genomic variations between human papillomavirus type 11 (HPV11) lineage B isolates VBD28/14 sequenced in
this study and lineage B prototype LN833183.1 retrieved from GenBank.
The corrected M14119.1 with the 2bp insertion at genomic position 7717-7718 was used (Maver et al., 2011).
Genomic position
Isolate 48 3820 6831
VBD28/14 T A C
LN833183.1 C G T
VBD01/10, VBD15/10 and VBD74/09 were identified as sub-lineage A2 isolates through phylogenetic
analysis. Based on the alignment of VBD01/10, VBD15/10 and VBD74/09 with the A2 prototype
LN833161.1, a total of 14 variable nucleotide positions and nine variable amino acid positions were
identified. Nucleotide heterogeneities are summarised in Table 20.
73
Table 20: Genomic variations between human papillomavirus type 11 (HPV11) sub-lineage A2 isolates sequenced in this study
and sub-lineage A2 prototype LN833161.1 retrieved from GenBank.
The corrected M14119.1 with the 2bp insertion at genomic position 7717-7718 was used (Maver et al., 2011).
Genomic position
Isolate 1107 1130 1623 2358 3391 3462 3492 3772 4142 4380 7252 7272 7413 7585-7586
LN833161.1 C A T T G A C A A T G C C C insertion
VBD01/10 T A A CC insertion
VBD15/10 A C C C A C C C G A A A No insertion
VBD74/09 A C C C A C C C G A A A No insertion
One non-synonymous substitution was identified in the E1 ORF of two isolates. An A1130C nucleotide
substitution resulted in a N to T in VBD15/10 and VBD74/09 at amino acid 100. Two non-synonymous
substitutions were identified in the E2 ORF in both VBD15/10 and VBD74/09. An A3462C nucleotide
substitution resulted in a N to T amino acid substitution at amino acid 247, and an A3772C nucleotide
substitution resulted in a K to N amino acid substitution at amino acid 350. One non-synonymous
substitution was identified in the E2 ORF in VBD01/10. A C3492T nucleotide substitution resulted in
a S to F amino acid substitution at amino acid 257. VBD15/10 and VBD74/09 both contained two non-
synonymous substitutions in the E4 ORF. A G3391A nucleotide substitution resulted in a R to Q amino
acid substitution at amino acid 46 and an A3462C nucleotide substitution resulted in a T to P amino
acid substitution at amino acid 70. One non-synonymous substitution was identified in the E4 ORF in
VBD01/10. A C3492T nucleotide substitution resulted in a P to S amino acid substitution at amino acid
80. One non-synonymous substitution was observed in VBD15/10 and VBD74/09 in the E5a ORF. An
A4142C nucleotide substitution resulted in a Q to P amino acid substitution at amino acid 91. One non-
synonymous substitution was identified in three isolates in the L1 ORF. A C7272A nucleotide
substitution resulted in a T to K amino acid substitution at amino acid 501 in VBD01/10, VBD15/10
and VBD74/09 (Table 20).
74
Lineage B Lineage A
Figure 9: Phylogenetic tree of human papillomavirus type 11 (HPV11) based 32 nucleotide sequence alignments of HPV11
including four isolates sequenced in this study and 28 sequences of geographically distinct isolates retrieved from GenBank.
M14119.1 (blue) was used as a representative of sub-lineage A1, LN833161.1 (green) as a representative of sub-lineage A2,
LN833169.1 (yellow) as a representative of sub-lineage A3, LN833187.1 (red) as a representative of sub-lineage A4 and
LN833183.1 (orange) was used as a representative of lineage B. The maximum likelihood tree was constructed using MEGA
X using the maximum likelihood method and Tamura-Nei model. The corrected M14119.1 with the 2bp insertion at genomic
position 7717-7718 was used (Maver et al., 2011).
75
3.4. Discussion
HPV classification below types is increasingly relevant since within-type variants (lineages and sub-
lineages) are associated with differential prognosis and outcomes concerning virus persistence,
development of lesions and progression to malignancy (Flores-Díaz et al., 2017). The advent of
metagenomics using high-throughput Illumina sequencing has enabled the discovery of numerous novel
viruses in various bio-niches at a reasonable cost and unprecedented speed. Additionally, genome
characterisation assists in studying viral diversity and the association of virus and disease (Ambulos et
al., 2016; Kocjan et al., 2015; Parker & Chen, 2017; Tuna & Amos, 2017).
The HPV nomenclature is well established. However, while classification into species and types is
based on unique genes, variant lineage and sub-lineage classification are based on the complete genome
of the virus. HPV variants are considered novel only after the complete genome has been cloned and
deposited with the International HPV Reference Centre (Bernard et al., 2010; Burk et al., 2011;
Bzhalava et al., 2015; de Villiers et al., 2004; Kocjan et al., 2015)
(https://www.hpvcenter.se/human_reference_clones). Phylogenetic analysis of concatenated E5a/b-L1-
URR sequences suggested a supposed novel HPV11 lineage or sub-lineage and isolates of interest using
a previously published PCR protocol (Jelen et al., 2016; Maver et al., 2011). However, due to the
discrepancies observed in the phylogenetic tree topology using concatenated E5a/b-L1-URR, the sub-
lineage classification of several isolates remained undetermined in Chapter 2. It was also unclear
whether the E2 segment can reliably distinguish between sub-lineages. Therefore, the complete HPV11
viral genome of selected isolates was amplified using long-range PCR and sequenced using high-
throughput sequencing.
Isolates characterised in this study were initially isolated from respiratory tract papilloma samples
obtained from patients with a clinical diagnosis of RRP. We assembled four HPV11 complete genomes
from three different patients. VBD28/14 had the most nucleotide substitutions and indels, followed by
VBD01/10. VBD15/10 and VBD74/09, isolated from one patient, were identical and indicated no signs
of evolution within the host. Comparison of the ORFs of each of the four isolates identified non-
synonymous substitutions that changed the amino acid, which may alter protein function and
expression, which underlie disease phenotype (Lebeuf-Taylor et al., 2019). Also, interference in gene
expression may lead to cancer progression and failure of treatment options. Therefore, future studies
regarding viral gene expression are vital.
Phylogenetic analysis confirmed that VBD28/14 clusters with another lineage B isolate which was
recently identified (Jelen et al., 2016), and analysis of the genome organisation of VBD28/14 revealed
many similarities and some differences, including three nucleotide substitutions and one non-
synonymous substitution in the L1 capsid protein. VBD15/10 and VBD74/09 was identified as sub-
76
lineage A2 but clustered separately from all other sub-lineage A2 isolates. VBD15/10 and VBD74/09
had seven non-synonymous heterogeneities in the E1-, E2-, E4-, E5a- and L1 ORF compared to the A2
prototype used. VBD01/10 was the most similar to the A2 prototype and only had three nucleotide
substitutions, one nucleotide insertion and three non-synonymous substitutions in the E2, E4 and L1
ORFs.
Conformational changes in the E1 and E2 proteins may affect replication of the HPV11 circular dsDNA
genome. Furthermore, E2 possesses an initiation codon for E4, and protein changes in either may affect
virion release and apoptosis in terminally differentiated cells (Graham & Faizo, 2017; Kardani &
Bolhassani, 2018a; Wallace & Galloway, 2014). Point-mutations in E5 may inhibit cell transformation,
hinder activation of various pathways, disrupt the acidification of endosomes, promote transformed cell
invasiveness, and misregulate the proliferation of infected cells. As E5 is necessary for many pathways,
sequence variations may significantly impact the function of this protein (Halavaty et al., 2014; Hemmat
& Baghi, 2018; Venuti et al., 2011; Zhang et al., 2018). Functional analysis of early proteins in HPV11
variants may further decipher the effects of mutations on the HPV virus life cycle.
Within the L1 ORF, sequence variation was low, likely due to the slow genomic evolutionary rate (Chen
et al., 2009, 2011). However, genomic- and amino acid variances within the L1 were identified in all
samples. In order to determine whether non-synonymous nucleotide substitutions have an effect on the
biological function of vaccines, functional analysis is required. Amino acid- and protein alterations that
occur within the conserved epitopes of L1, have the ability to influence vaccine efficacy. However, to
date, there have been no reports suggesting that the biological function of HPV type specific vaccines
is influenced by antigenic differences between intratypic variants (Barra et al., 2019; Benedict &
Derkay, 2021; Buck et al., 2013; Chabeda et al., 2018; Dadar et al., 2018). The consequences of
polymorphisms in the novel lineage B, isolates of interest VBD15/10 and VBD74/09, and sub-lineage
A3 and A4 on antibody recognition following natural infection or vaccination by the present-day
vaccines and effects on the viral life cycle requires additional investigation.
In conclusion, during this study the complete genome of four isolates of HPV11 were successfully
identified and characterised from three patients. Additionally, this is also the second report of a lineage
B isolate identified. Both lineage B isolates identified thus far have been isolated in South Africa,
suggesting that this lineage B may be geographically restricted or have a low infectivity rate. Further
research on sub-lineage A2 isolates, VBD15/10 and VBD74/09, is necessary to characterise these
isolates fully. Research with different sample types and larger sample sizes are needed to clarify the
prevalence and disease association and the impact of those newly characterised variants on HPV
persistence and RRP development.
77
Chapter 4 - Concluding remarks
Papillomaviruses have a well-documented history with human hosts, and most humans are infected at
some point in their lives. Papillomaviruses have a slow evolutionary rate of approximately 10−8
nucleotide substitutions per site per year and experience little recombination during their evolution;
hence nucleotide polymorphisms mainly transpire due to arbitrary mutation time (Bernard, 1994; Chen
et al., 2009, 2011; van Doorslaer, 2013). HPV was discovered in 1949 (Strauss et al., 1949), and
evidence suggesting the involvement of HPV11 in RRP was first reported in 1982 (Gissmann et al.,
1982). To date, over 220 HPV types are recognised globally
(https://www.hpvcenter.se/human_reference_clones).
Despite phylogenetic relatedness, knowledge regarding intratypic variants of HPV11 and association
with the clinical outcomes of infection are limited. Genetic traits resulting in various lineages and sub-
lineages of the virus are all encoded within the relatively small eight kilobyte circular dsDNA genome.
HPV11 is ordinarily associated with a more severe disease outcome in patients with RRP and earlier
age at diagnosis compared to HPV6, suggesting increased virulence and pathogenesis (El Achkar et al.,
2020; Intakorn & Sonsuwan, 2014; Omland et al., 2014; Seedat, 2020).
According to previous South African surveillance studies, between 64% and 71% of the population are
infected with HPV, and between 1% and 4% are infected with HPV11. However, majority of these
studies targeted specific populations, older age groups, restricted sampling areas, and used only cervical
samples (Giuliano et al., 2015; Mbulawa et al., 2017, 2018; Taku et al., 2020). Furthermore, no large-
scale South African studies or Free State based studies report on HPV11 intratypic variant distribution
in patients with RRP, and no current information regarding variants circulating in the Free State exists.
Therefore, this study aimed to genetically characterise HPV11 isolates from patients diagnosed with
RRP treated at Universitas Academic Hospital.
In a previous unpublished study performed in our laboratory, a novel HPV11 lineage B isolate was
identified (Makatsa, 2012). Although not included in this study, this motivated investigation of further
isolates for clarification of currently circulating lineages and sub-lineages. Therefore, HPV11 genetic
variants were characterised based on the nucleotide sequence of the L1, URR, E5a/b and a segment of
E2 genomic regions isolates from patients with RRP in the Free State. According to previous studies,
these gene segments encompass sufficient SNPs to discriminate between viral lineages and sub-
lineages, as the concatenated L1-URR-E5a sequence reportedly contains adequate information for
differentiation between A1 and A2 sub-lineages. The E5a/b ORF also has a faster evolutionary rate
compared to other genes and contains several (sub)lineage specific SNPs. Furthermore, it has been
reported that the E2 gene segment can reliably distinguish all known HPV11 variants (Bravo & Alonso,
2004; Burk et al., 2011; Godínez et al., 2014; Jelen et al., 2016; Maver et al., 2011). A previous study
78
published by Jelen and colleagues in 2014 suggested that concatenated E5a/b-L1-URR sequences can
be used as a surrogate for the phylogenetic clustering of HPV6 variants, a LR-HPV type (Jelen et al.,
2014). However, results obtained in the current study uncovered various classification errors using
concatenated E5a/b-L1-URR sequence data and does not support the use of these gene segments for
HPV11 variant classification.
In Chapter 2, the nucleotide- and amino acid composition of the L1-, E5a/b-, and a segment of the E2
ORF, as well as the nucleotide sequence of the URR were determined using Sanger sequencing.
Although nucleotide changes may imply a variant lineage or sub-lineage, these variances do not
necessarily impact the expressed protein function. However, DNA alterations in an ORF may lead to
gene product alterations if non-synonymous substitutions are present (Lebeuf-Taylor et al., 2019). Few
non-synonymous substitutions were reported in the L1 ORF of sequenced isolates, unlike the E5a/b
ORF, where nearly all nucleotide substitutions were non-synonymous. The URR is a variable region
within the HPV genome capable of accumulating and tolerating many nucleotide mutations as it does
not encode proteins. Nevertheless, variations in the URR may still impact virus survival as this region
contains crucial binding sites and promoters (Fang et al., 2020; Ribeiro et al., 2018). Amino acid
changes may affect many factors such as the effectiveness of HPV11 targeted vaccines, protein folding,
cell transformation, the viral life cycle, and disease severity. However, to fully understand the impact
if these changes functional analysis of HPV11 genes is required to decipher the effects of mutations on
the virus. Consequences of these polymorphisms in the novel lineage B and sub-lineage A3 and A4
warrants further investigation.
Analysis of the results obtained in the current study included constructing a maximum likelihood tree
to confirm the differentiation of the HPV variants based on tree topology and to determine the presence
of potentially novel intratypic variants. HPV11 concatenated L1-URR and E5a/b-L1-URR resolved into
two lineages, namely lineage A and lineage B and the majority of the isolates sequenced (48/67)
clustered together with the globally predominant A2 prototype isolates as anticipated. Two lineage B
isolates were identified in the current study, VBD34/08 and VBD28/14, with VBD34/08 appearing
identical to the previously identified lineage B isolate (GenBank accession number LN833183.1) (Jelen
et al., 2016). The concatenated L1-URR, E5a/b-L1-URR sequences, and the E2 segment provided
equivalent discriminatory power to distinguish between HPV-lineages; however, tree topology based
on E5a/b-L1-URR sequences were incoherent for sub-lineage classification.
Previous studies of intratypic evolution of HPV11 variants rarely dealt with whole-genome sequences.
Instead, partial regions of the viral genome such as L1, URR, and an ORF gene were used (de Matos et
al., 2013; Danielewski et al., 2013), leading to classification errors, as HPV genome variants are very
closely related and only require a 0.5% to 1% variance in the complete genome (Burk et al., 2011).
Therefore, the complete genome of selected novel HPV11 variants and variants of interest were
79
determined in Chapter 3 using NGS technology. Analysis of the whole genome ensures accurate
differentiation into sub-lineages and detection of any heterogeneity of HPV11 (Burk et al., 2011).
Based on complete sequence data, the majority of nucleotide substitutions and indels were identified in
VBD28/14. Non-synonymous substitutions were identified in ORFs of each of the four isolates which
may alter protein function (Lebeuf-Taylor et al., 2019), which underlie disease phenotype. Likewise,
altered protein function may lead to cancer progression and failure of treatment options (Barra et al.,
2019; Brotherton, 2019; Cornall et al., 2013; Gerein et al., 2005; Huebbers et al., 2013). Therefore,
future studies regarding viral gene expression are vital. Phylogenetic analysis of complete HPV11
genomes identified VBD28/14 as a lineage B variant, and VBD01/10, VBD15/10 and VBD74/09 as
sub-lineage A2 variants with great certainty. However, VBD15/10 and VBD74/09 contained numerous
genomic- and amino acid variances compared to the A2 prototype. The slow evolutionary rate of
papillomaviruses was reinforced as VBD15/10 and VBD74/09 isolated five months from each other
from a single patient were identical (Bernard, 1994; Chen et al., 2009, 2011; van Doorslaer, 2013).
Many inconsistencies were observed between the maximum likelihood trees constructed using the
concatenated E5a/b-L1-URR sequences and WGS. Phylogenetic analysis using the concatenated E5a/b-
L1-URR sequences revealed that VBD01/10 did not cluster closely with the A2 prototype sequence as
was the case using WGS. Also, analysis using the concatenated E5a/b-L1-URR sequences revealed
VBD01/10, FN90762.1, and FN907963.1 clustered together with great certainty, However,
FN907963.1 is a confirmed sub-lineage A1 isolate according to WGS. Furthermore, phylogenetic
analysis revealed VBD15/10, VBD74/09, MK463916.1 and MK463921.1 clustered closely using the
concatenated sequences, but this was not the case using complete HPV11 genomes. Discrepancies in
the location of clustering of sub-lineage A2 isolates JQ773408.1, JQ773411.1, JQJQ773412.1,
MK463914.1, and FR872717.1 was also revealed. Unlike using concatenated sequences, these
sequences did not cluster closely with the A2 prototype using the complete HPV11 genome. An
important discrepancy observed between using specific genes versus the complete genome was that A1
isolates did not cluster with the corresponding prototype using the concatenated sequences. Therefore,
using concatenated E5a/b-L1-URR sequences cannot reliably distinguish between HPV sub-lineages as
previously suggested (Jelen et al., 2016).
The 208bp region located at the 3′ end of the E2 ORF and the 5′ end of NCR2 provided more accurate
results for HPV11 whole-genome tree reconstruction compared to the concatenated E5a/b-L1-URR
sequences. All isolates were classified into appropriate sub-lineages as determined by WGS of the
complete HPV11 sequence. This is the second report of a lineage B isolate identified in South Africa
(Jelen et al., 2016). Two isolates of interest were identified in the current study. These isolates had
numerous amino acid heterogeneities and the consequences of these polymorphisms requires additional
investigation. The 208bp E2 segment could reliably classify all isolates in this study, suggesting that
80
this gene segment contains stable sub-lineage specific SNP’s and may serve in sub-lineage
identification when complete genome sequences cannot be obtained.
Although this study successfully completed the aim and objectives set out, limitations in the study were
identified. The Sanger sequencing approach was used to sequence selected genes of the isolates;
however, Sanger sequencing has a significantly lower sensitivity for genomic variant detection
compared to NGS which may hinder detection of genomic variants present at low copy numbers (Parker
& Chen, 2017; Sanger et al., 1977). Moreover, research with larger sample sizes will contribute towards
clarification of the prevalence, disease association and impact of newly characterised variants on HPV
persistence and RRP development. Functional analysis of genes would add valuable information
regarding protein interactions and functioning.
This study provides the most comprehensive data on the genomic diversity of HPV11 from patients
with RRP in the Free State to date. As within type variants may have an association with disease
outcome, it is essential to characterise the current strains circulating amongst the community. Baseline
data on circulating HPV variants may significantly contribute to future evolutionary, epidemiological,
vaccination, and molecular assay development studies, as well as to studies on the pathogenesis of
HPV11 and other Alphapapillomaviruses.
81
References
Abreu, AL., Souza, RP., Gimenes, F. & Consolaro, ME. (2012). A review of methods for detect human
papillomavirus infection. Virology Journal, 9, 262. DOI: 10.1186/1743-422X-9-262
Allawi, HT. & SantaLucia, J. (1997). Thermodynamics and NMR of internal G-T mismatches in
DNA. Biochemistry, 36(34), 10581-10594. DOI: 10.1021/bi962590c
Altamura, G., Power, K., Martano, M., Degli Uberti, B., Galiero, G., De Luca, G., Maiolino, P. &
Borzacchiello, G. (2018). Felis catus papillomavirus type-2 E6 binds to E6AP, promotes E6AP/p53
binding and enhances p53 proteasomal degradation. Scientific Reports, 8(1), 17529. DOI:
10.1038/s41598-018-35723-7
Altamura, G., Tommasino, M. & Borzacchiello, G. (2020). Cutaneous vs. Mucosal Tropism: The
papillomavirus paradigm comes to an "and". Frontiers in Microbiology, 11, 588663. DOI:
10.3389/fmicb.2020.588663
Ambulos, N., Schumaker, L., Mathias, TJ., White, R., Troyer, J., Wells, D. & Cullen, K. (2016). Next-
generation sequencing-based HPV genotyping assay validated in formalin-fixed, paraffin-embedded
oropharyngeal and cervical cancer specimens. Journal of Biomolecular Techniques, 27(2),46-52. DOI:
10.7171/jbt.16-2702-004
Balaban, M., Moshiri, N., Mai, U., Jia, X. & Mirarab, S. (2019). TreeCluster: Clustering biological
sequences using phylogenetic trees. PLOS One, 14(8), e0221068. DOI: 10.1371/journal.pone.0221068
Barra, F., Leone Roberti Maggiore, U., Bogani, G., Ditto, A., Signorelli, M., Martinelli, F., Chiappa,
V., Lorusso, D., Raspagliesi, F. & Ferrero, S. (2019). New prophylactics human papilloma virus (HPV)
vaccines against cervical cancer. Journal of Obstetrics and Gynaecology, 39(1), 1–10. DOI:
10.1080/01443615.2018.1493441
Bedell, MA., Hudson, JB., Golub, TR., Turyk, ME., Hosken, M., Wilbanks, GD. & Laimins, LA.
(1991). Amplification of human papillomavirus genomes in vitro is dependent on epithelial
differentiation. Journal of Virology, 65(5), 2254-2260. DOI: 10.1128/JVI.65.5.2254-2260.1991
Benedict, JJ. & Derkay, CS. (2021). Recurrent respiratory papillomatosis: A 2020 perspective.
Laryngoscope Investigative Otolaryngology, 6(2), 340-345. DOI: 10.1002/lio2.545
Bernard, HU. (1994). Coevolution of papillomaviruses with human populations. Trends in
Microbiology, 2(4), 140-3. DOI: 10.1016/0966-842x(94)90602-5
82
Bernard, HU., Burk, RD., Chen, Z., van Doorslaer, K., zur Hausen, H. & de Villiers, EM. (2010).
Classification of papillomaviruses (PVs) based on 189 PV types and proposal of taxonomic
amendments. Virology, 401(1), 70-79. DOI: 10.1016/j. virol.2010.02.002
Bernard, HU., Calleja-Macias IE. & Dunn, ST. (2006). Genome variation of human papillomavirus
types: Phylogenetic and medical implications. International Journal of Cancer, 118(5), 1071–1076.
DOI: 10.1002/ijc.21655
Bonagura, VR., Du, Z., Ashouri, E., Luo, L., Hatam, LJ., DeVoti, JA., Rosenthal, DW., Steinberg, BM.,
Abramson, AL., Gjertson, DW., Reed, EF. & Rajalingam, R. (2010). Activating killer cell
immunoglobulin-like receptors 3DS1 and 2DS1 protect against developing the severe form of recurrent
respiratory papillomatosis. Human Immunology, 71(2), 212-219. DOI: 10.1016/j.humimm.2009.10.009
Boshart, M., Gissmann, L., Ikenberg, H., Kleinheinz, A., Scheurlen, W. & zur Hausen, H. (1984). A
new type of papillomavirus DNA, its presence in genital cancer biopsies and in cell lines derived from
cervical cancer. The EMBO Journal, 3(5), 1151–1157. DOI: 10.1002/j.1460-2075.1984.tb01944.x
Bravo, IG. & Alonso, A. (2004). Mucosal human papillomaviruses encode four different E5 proteins
whose chemistry and phylogeny correlate with malignant or benign growth. Journal of Virology,
78(24), 13613–13626. DOI: 10.1128/JVI.78.24.13613-13626.2004
Brink, AA., Snijders, PJ. & Meijer, CJ. (2007). HPV detection methods. Disease Markers, 23(4), 273–
281. DOI: 10.1155/2007/147429
Brotherton, JM. (2019). Impact of HPV vaccination: Achievements and future challenges.
Papillomavirus Research, 7, 138-140. DOI: 10.1016/j.pvr.2019.04.004
Buchinsky, FJ., Ruszkay, N., Valentino, W., Derkay, CS., McClay, JE., Bastian, RW., Myer, CM.,
Lollar, KW. & Guris, D. (2020). In RRP, serologic response to HPV is frequently absent and slow to
develop. PLOS One, 15(3), e0230106. DOI: 10.1371/journal.pone.0230106
Buck, CB., Day, PM. & Trus, BL. (2013). The papillomavirus major capsid protein
L1. Virology, 445(1-2), 169–174. DOI: 10.1016/j.virol.2013.05.038
Burd, EM. (2016). Human papillomavirus laboratory testing: The changing paradigm. Clinical
Microbiology Reviews, 29(2), 291-319. DOI: 10.1128/cmr.00013-15
Burk, RD., Chen, Z., Harari, A., Smith, B. C., Kocjan, B. J., Maver, P. J. & Poljak, M. (2011).
Classification and nomenclature system for human Alphapapillomavirus variants: General features,
nucleotide landmarks and assignment of HPV6 and HPV11 isolates to variant lineages. Acta
83
dermatovenerologica Alpina, Pannonica, et Adriatica, 20(3), 113–123. Retrieved from
https://www.acta-apa.org/
Burk, RD., Harari, A. & Chen, Z. (2013). Human papillomavirus genome variants. Virology, 445(1-2),
232–243. DOI: 10.1016/j.virol.2013.07.018
Bzhalava, D., Eklund, C. & Dillner, J. (2015). International standardization and classification of human
papillomavirus types. Virology, 476, 341–344. DOI: 10.1016/j.virol.2014.12.028
Chabeda, A., Yanez, R., Lamprecht, R., Meyers, AE., Rybicki, EP. & Hitzeroth, II. (2018). Therapeutic
vaccines for high-risk HPV-associated diseases. Papillomavirus Research, 5, 46–58. DOI:
10.1016/j.pvr.2017.12.006
Chagas, BS., Batista, MV., Guimarães, V., Balbino, VQ., Crovella, S. & Freitas, AC. (2011). New
variants of E6 and E7 oncogenes of human papillomavirus type 31 identified in Northeastern Brazil.
Gynecologic Oncology, 123(2), 284–288. DOI: 10.1016/j.ygyno.2011.07.008
Charleston, M. (2013). Phylogeny, in Maloy, S., Hughes, K (eds) Brenner's Encyclopaedia of Genetics.
2nd edition. Academic Press, 324,325. DOI: 10.1016/B978-0-12-374984-0.01160-8
Chen, Z., DeSalle, R., Schiffman, M., Herrero, R. & Burk, RD. (2009). Evolutionary dynamics of
variant genomes of human papillomavirus types 18, 45, and 97. Journal of Virology, 83(3), 1443–1455.
DOI: 10.1128/jvi.02068-08
Chen, Z., Schiffman, M., Herrero, R., Desalle, R., Anastos, K., Segondy, M., Sahasrabuddhe, VV.,
Gravitt, P. E., Hsing, A. W. & Burk, R. D. (2011). Evolution and taxonomic classification of human
papillomavirus 16 (HPV16)-related variant genomes: HPV31, HPV33, HPV35, HPV52, HPV58 and
HPV67. PLOS One, 6(5), e20183. DOI: 10.1371/journal.pone.0020183
Chowdhury, B., Garai, A. & Garai, G. (2017). An optimized approach for annotation of large eukaryotic
genomic sequences using genetic algorithm. BMC bioinformatics, 18(1), 460. DOI: 10.1186/s12859-
017-1874-7
Combrinck, CE., Seedat, RY., Randall, C., Roodt, Y. & Burt, FJ. (2012). Novel HPV-6 variants of
human papillomavirus causing recurrent respiratory papillomatosis in southern Africa. Epidemiology
and Infection, 140(6), 1095–1101. DOI: 10.1017/s0950268811001580
Conway, MJ. & Meyers, C. (2009). Replication and assembly of human papillomaviruses. Journal of
Dental Research, 88(4), 307–317. DOI: 10.1177/0022034509333446
84
Cornall, AM., Roberts, JM., Garland, SM., Hillman, RJ., Grulich, AE. & Tabrizi, SN. (2013). Anal and
perianal squamous carcinomas and high-grade intraepithelial lesions exclusively associated with "low-
risk" HPV genotypes 6 and 11. International Journal of Cancer, 133(9), 2253–2258. DOI:
10.1002/ijc.28228
Cornelissen, MT., Bots, T., Briët, MA., Jebbink, M. F., Struyk, AP., van den Tweel, JG., Greer, CE.,
Smits, HL. & ter Schegget, J. (1992). Detection of human papillomavirus types by the polymerase chain
reaction and the differentiation between high-risk and low-risk cervical lesions. Virchows Archiv, 62(3),
167–171. DOI: 10.1007/bf02899679
Crawford, LV. (1965). A study of human papilloma virus DNA. Journal of Molecular Biology, 13(2),
362-372. DOI: 10.1016/s0022-2836(65)80103-6
D'Abramo, CM. & Archambault, J. (2011). Small molecule inhibitors of human papillomavirus protein
- protein interactions. The Open Virology Journal, 5, 80–95. DOI: 10.2174/1874357901105010080
Dadar, M., Chakraborty, S., Dhama, K., Prasad, M., Khandia, R., Hassan, S., Munjal, A., Tiwari, R.,
Karthik, K., Kumar, D., Iqbal, H. & Chaicumpa, W. (2018). Advances in designing and developing
vaccines, drugs and therapeutic approaches to counter human papillomavirus. Frontiers in Immunology,
9, 2478. DOI: 10.3389/fimmu.2018.02478
Danielewski, JA., Garland, SM., McCloskey, J., Hillman, RJ. & Tabrizi, SN. (2013). Human
papillomavirus type 6 and 11 genetic variants found in 71 oral and anogenital epithelial samples from
Australia. PLOS One, 8(5), e63892. DOI: 10.1371/journal.pone.0063892
Dartmann, K., Schwarz, E., Gissmann, L. & zur Hausen, H. (1986). The nucleotide sequence and
genome organization of human papilloma virus type 11. Virology, 151(1), 124–130. DOI:
10.1016/0042-6822(86)90110-8
de Matos, RP., Sichero, L., Mansur, IM., do Bonfim, CM., Bittar, C., Nogueira, RL., Küpper, DS.,
Valera, FC., Nogueira, ML., Villa, LL., Calmon, MF. & Rahal, P. (2013). Nucleotide and phylogenetic
analysis of human papillomavirus types 6 and 11 isolated from recurrent respiratory papillomatosis in
Brazil. Infection, Genetics and Evolution, 16, 282–289. DOI: 10.1016/j.meegid.2012.12.033
de Roda Husman, AM., Walboomers, JM., van den Brule, AJ., Meijer, CJ. & Snijders, PJ. (1995). The
use of general primers GP5 and GP6 elongated at their 3′ ends with adjacent highly conserved sequences
improves human papillomavirus detection by PCR. Journal of General Virology, 76(4), 1057-1062.
DOI: 10.1099/0022-1317-76-4-1057
85
de Villiers, EM. (2013). Cross-roads in the classification of papillomaviruses. Virology, 445(1-2), 2-10.
DOI: 10.1016/j.virol.2013.04.023
de Villiers, EM., Fauquet, C., Broker, TR., Bernard, HU. & zur Hausen, H. (2004). Classification of
papillomaviruses. Virology, 324(1), 17-27. DOI: 10.1016/j.virol.2010.02.002
Delany-Moretlwe, S., Kelley, KF., James, S., Scorgie, F., Subedar, H., Dlamini, NR., Pillay, Y., Naidoo,
N., Chikandiwa, A. & Rees, H. (2018). Human papillomavirus vaccine introduction in South Africa:
Implementation lessons from an evaluation of the national school-based vaccination campaign. Global
Health: Science and Practice, 6(3), 425-438. DOI: 10.9745/ghsp-d-18-00090
DiLorenzo, TP. & Steinberg, BM. (1995). Differential regulation of human papillomavirus type 6 and
11 early promoters in cultured cells derived from laryngeal papillomas. Journal of Virology, 69(11),
6865–6872. DOI: 10.1128/jvi.69.11.6865-6872.1995
Dixit, R., Bhavsar, C. & Marfatia, YS. (2011). Laboratory diagnosis of human papillomavirus infection
in female genital tract. Indian Journal of Sexually Transmitted Diseases and AIDS, 32(1), 50. DOI:
10.4103/2589-0557.81257
Dollard, SC., Wilson, JL., Demeter, LM., Bonnez, W., Reichman, RC., Broker, TR. & Chow, LT.
(1992). Production of human papillomavirus and modulation of the infectious program in epithelial raft
cultures. Genes and Development, 6(7), 1131–1142. DOI: 10.1101/gad.6.7.1131
Doorbar, J., Quint, W., Banks, L., Bravo, IG., Stoler, M., Broker, TR. & Stanley, MA. (2012). The
biology and life-cycle of human papillomaviruses. Vaccine, 30(5), F55–F70. DOI:
10.1016/j.vaccine.2012.06.083
Dürst, M, Gissmann, L., Ikenberg, H & zur Hausen, H. (1983). A papillomavirus DNA from a cervical
carcinoma and its prevalence in cancer biopsy samples from different geographic regions. Proceedings
of the National Academy of Sciences of the United States of America, 3812–3815. DOI:
10.1073/pnas.80.12.3812
Egawa, N. & Doorbar, J. (2017). The low-risk papillomaviruses. Virus Research, 231, 119–127. DOI:
10.1016/j.virusres.2016.12.017
Egawa, N., Egawa, K., Griffin, H. & Doorbar, J. (2015). Human papillomaviruses; epithelial tropisms,
and the development of neoplasia. Viruses, 7(7), 3863–3890. DOI: 10.3390/v7072802
El Achkar, VNR., Costa, V., Sichero, L., Ribeiro-Silva, A., Carlos, R., Villa, LL. & Kaminagakura, E.
(2020). Inflammatory cells and HPV type related to aggressiveness in laryngeal papillomatosis. Oral
86
Surgery, Oral Medicine, Oral Pathology and Oral Radiology, 30(3) e275. DOI:
10.1016/j.oooo.2020.04.752
Fang, L., Lin, X., Yang, Y., Song, Z., Ding, X., Tan, L. & Gao, P. (2020). Genetic variability, phylogeny
and functional implication of the long control region in human papillomavirus type 16, 18 and 58 in
Chengdu, China. Virology Journal, 17(1), 1-15. DOI: 10.1186/s12985-020-01349-3
Fauquet, CM., Mayo, MA., Maniloff, J., Desselberger, U. & Ball, LA. (2005). VIIIth report of the
International Committee on Taxonomy of Viruses, in CM, Fauquet; MA, Mayo; J, Maniloff; U,
Desselberger & LA, Ball (eds). Virus Taxonomy. Academic Press, (83), 988-992
Felsenstein, J. (1981). Evolutionary trees from DNA sequences: A maximum likelihood approach.
Journal of Molecular Evolution, 17(6), 368–376. DOI: 10.1007/bf01734359
Flores-Díaz, E., Sereday, KA., Ferreira, S., Sirak, B., Sobrinho, JS., Baggio, ML., Galan, L., Silva, RC.,
Lazcano-Ponce, E., Giuliano, AR., Villa, LL., and Sichero, L. (2017). HPV-6 molecular variants
association with the development of genital warts in men: The HIM Study, The Journal of Infectious
Diseases, 215(4), 559–565. DOI: 10.1093/infdis/jiw600
Fusconi, M., Grasso, M., Greco, A., Gallo, A., Campo, F., Remacle, M., Turchetta, R., Pagliuca, G. &
DE Vincentiis, M. (2014). Recurrent respiratory papillomatosis by HPV: Review of the literature and
update on the use of cidofovir. Acta otorhinolaryngologica Italica, 34(6), 375–381. Retrieved from
https://www.actaitalica.it/index
Gáll, T., Kis, A., Tatár, TZ., Kardos, G., Gergely, L. & Szarka, K. (2013). Genomic differences in the
background of different severity in juvenile-onset respiratory papillomatoses associated with human
papillomavirus type 11. Medical Microbiology and Immunology. 202, 353–363. DOI: 10.1007/s00430-
013-0297-y
Gaylis, B. & Hayden, RE. (1991). Recurrent respiratory papillomatosis: Progression to invasion and
malignancy. American Journal of Otolaryngology, 12(2), 104-112. DOI: 10.1016/0196-
0709(91)90045-h
Gazia, F., Galletti, B., Freni, F., Bruno, R., Sireci, F., Galletti, C., Meduri, F. & Galletti, F. (2020). Use
of intralesional cidofovir in the recurrent respiratory papillomatosis: A review of the literature.
European Review for Medical and Pharmacological Sciences, 24(2), 956-962. DOI:
10.26355/eurrev_202001_20081
Georgescu, SR., Mitran, CI., Mitran, MI., Caruntu, C., Sarbu, MI., Matei, C., Nicolae, I., Tocut, SM.,
Popa, MI. & Tampa, M. (2018). New insights in the pathogenesis of HPV infection and the associated
87
carcinogenic processes: The role of chronic inflammation and oxidative stress. Journal of Immunology
Research, 2018, 1-10. DOI: 10.1155/2018/5315816
Gerein, V., Rastorguev, E., Gerein, J., Draf, W. & Schirren, J. (2005). Incidence, age at onset, and
potential reasons of malignant transformation in recurrent respiratory papillomatosis patients: 20 years’
experience. Otolaryngology—Head and Neck Surgery, 132(3), 392-394. DOI:
10.1016/j.otohns.2004.09.035
Gissmann, L., Diehl, V., Schultz‐Coulon, HJ & zur Hausen, H. (1982). Molecular cloning and
characterization of human papilloma virus DNA derived from laryngeal papilloma. Journal of Virology,
44, 393–400. DOI: 10.1128/jvi.44.1.393-400.1982
Gissmann, L., Pfister, H. & zur Hausen, H. (1977). Human papilloma viruses (HPV): Characterization
of four different isolates. Virology, 76(2), 569-580. DOI: 10.1016/0042-6822(77)90239-2
Gissmann, L., Wolnik, L., Ikenberg, H., Koldovsky, U., Schnürch, HG. & zur Hausen, H. (1983).
Human papillomavirus types 6 and 11 DNA sequences in genital and laryngeal papillomas and in some
cervical cancers. Proceedings of the National Academies of Science, 80, 560–563. DOI:
10.1073/pnas.80.2.560
Giuliano, AR., Botha, MH., Zeier, M., Abrahamsen, ME., Glashoff, RH., van der Laan, LE., Papenfuss,
M., Engelbrecht, S., Schim van der Loeff, MF., Sudenga, SL., Torres, BN., Kipping, S. & Taylor, D.
(2015). High HIV, HPV, and STI prevalence among young Western Cape, South African women: EVRI
HIV prevention preparedness trial. Journal of Acquired Immune Deficiency Syndromes, 68(2), 227-35.
DOI: 10.1097/qai.0000000000000425
Godínez, JM., Nicolás-Párraga, S., Pimenoff, VN., Mengual-Chuliá, B., Muñoz, N., Bosch, FX.,
Sánchez, GI., McCloskey, J. & Bravo IG. (2014). Phylogenetically related, clinically different: Human
papillomaviruses 6 and 11 variants distribution in genital warts and in laryngeal papillomatosis. Clinical
Microbiology and Infection, 20(6), O406–O413. DOI: 10.1111/1469-0691.12420
Graham, SV. & Faizo, A. (2017). Control of human papillomavirus gene expression by alternative
splicing. Virus Research, 231, 83-95. DOI: 10.1016/j.virusres.2016.11.016
Grasso, M., Remacle, M., Bachy, V., van der Vorst, S. & Lawson, G. (2014). Use of cidofovir in HPV
patients with recurrent respiratory papillomatosis. European Archives of Oto-Rhino-Laryngology,
271(11), 2983-2990. DOI: 10.1007/s00405-014-3055-x
88
Graupp, M., Gugatschka, M., Kiesler, K., Reckenzaun, E., Hammer, G. & Friedrich, G. (2013).
Experience of 11 years use of cidofovir in recurrent respiratory papillomatosis. European Archives of
Oto-Rhino-Laryngology, 270(2), 641-646. DOI: 10.1007/s00405-012-2221-2
Hahn, HS., Kee, MK., Kim, HJ., Kim, MY., Kang, YS., Park, JS., Kim, TJ. (2013). Distribution of
maternal and infant human papillomavirus: Risk factors associated with vertical transmission. European
Journal of Obstetrics & Gynecology and Reproductive Biology, 169(2), 202–206. DOI:
10.1016/j.ejogrb.2013.02.024
Halavaty, KK., Regan, J., Mehta, K. & Laimins, L. (2014). Human papillomavirus E5 oncoproteins
bind the A4 endoplasmic reticulum protein to regulate proliferative ability upon differentiation.
Virology. 452, 223–30. DOI: 10.1016/j.virol.2014.01.013
Hall, BG. (2013). Building phylogenetic trees from molecular data with MEGA. Molecular Biology
and Evolution, 30(5), 1229–1235. DOI: 10.1093/molbev/mst012
Harari, A., Chen, Z. & Burk, RD. (2014). Human papillomavirus genomics: Past, present and future.
Current Problems in Dermatology, 45, 1-18. DOI: 10.1159/000355952
Hariri, S., Schuler, MS., Naleway, AL., Daley, MF., Weinmann, S., Crane, B., Newcomer, SR., Tolsma,
D. & Markowitz, LE. (2018). Human papillomavirus vaccine effectiveness against incident genital
warts among female health-plan enrollees, United States. American Journal of Epidemiology, 187(2),
298-305. DOI: 10.1093/aje/kwx253
Heinzel, PA., Chan, SY., Ho, L., O’Connor, M., Balaram, P., Campo, MS., Fujinaga, K., Kiviat, N.,
Kuypers, J., Pfister, H., Steinberg, BM., Tay, SK., Villa, LL. & Bernard, HU. (1995). Variation of
human papillomavirus type 6 (HPV-6) and HPV-11 genomes sampled throughout the world. Journal
of Clinical Microbiology, 33, 1746–1754. DOI: 10.1128/jcm.33.7.1746-1754.1995
Hemmat, N. & Baghi, HB. (2018). Human papillomavirus E5 protein, the undercover culprit of
tumorigenesis. Infectious Agents and Cancer, 13, 31. DOI: 10.1186/s13027-018-0208-3
Hirose, Y., Onuki, M., Tenjimbayashi, Y., Mori, S., Ishii, Y., Takeuchi, T., Tasaka, N., Satoh, T.,
Morisada, T., Iwata, T., Miyamoto, S., Matsumoto, K., Sekizawa, A. & Kukimoto, I. (2018). Within-
host variations of human papillomavirus reveal APOBEC signature mutagenesis in the viral
genome. Journal of Virology, 92(12), e00017-18. DOI: 10.1128/jvi.00017-18
Hoesli, RC., Wingo, ML., Richardson, BE. & Bastian, RW. (2020). Identification of 11 different HPV
subtypes in adult patients with recurrent respiratory papillomatosis. Otolaryngology—Head and Neck
Surgery, 163(4), 785-790. DOI: 10.1177/0194599820931817
89
Huebbers, CU., Preuss, SF., Kolligs, J., Vent, J., Stenner, M., Wieland, U., Silling, S., Drebber, U.,
Speel, EJM. & Klussmann, JP. (2013). Integration of HPV6 & downregulation of AKR1C3 expression
mark malignant transformation in a patient with juvenile-onset laryngeal papillomatosis. PLOS One,
8(2), e57207. DOI: 10.1371/journal.pone.0057207
Intakorn, P. & Sonsuwan, N. (2014). Human papillomatosis genotyping and severity in patients with
recurrent respiratory papillomatosis. Journal of the Medical Association of Thailand, 97 (6), S136–
S141
International Committee on Taxonomy of Viruses. (2021). Taxonomy, viewed on 22 July 2021,
Ivancic, R., Iqbal, H., deSilva, B., Pan, Q. & Matrka, L. (2018). Current and future management of
recurrent respiratory papillomatosis. Laryngoscope Investigative Otolaryngology, 3(1), 22-34. DOI:
10.1002/lio2.132
Ivancic, R., Iqbal, H., deSilva, B., Pan, Q. & Matrka, L. (2020). Immunological tolerance of low‐risk
HPV in recurrent respiratory papillomatosis. Clinical and Experimental Immunology, 199(2), 131-142.
DOI: 10.1111/cei.13387
Jelen, MM., Chen, Z., Kocjan, BJ., Burt, FJ., Chan, PK., Chouhy, D., Combrinck, CE., Coutlée, F.,
Estrade, C., Ferenczy, A., Fiander, A., Franco, EL., Garland, SM., Giri, AA., González, JV., Gröning,
A., Heidrich, K., Hibbitts, S., Hošnjak, L., Luk, TN., Marinic, K., Matsukura, T., Neumann, A.,
Oštrbenk, A., Picconi, MA., Richardson, H., Sagadin, M., Sahli, R., Seedat, RY., Seme, K., Severini,
A., Sinchi, JL., Smahelova, J., Tabrizi, SN., Tachezy, R., Tohme, S., Uloza, V., Vitkauskiene, A.,
Wong, YW., Zidovec Lepej, S., Burk, RD. & Poljak, M. (2014). Global genomic diversity of human
papillomavirus 6 based on 724 isolates and 190 complete genome sequences. Journal of Virology, 88,
7307–7316. DOI: 10.1128/jvi.00621-14
Jelen, MM., Chen, Z., Kocjan, BJ., Hošnjak, L., Burt, FJ., Chan, PK., Chouhy, D., Combrinck, CE.,
Estrade, C., Fiander, A., Garland, SM., Giri, AA., González, JV., Gröning, A., Hibbitts, S., Luk, TNM.,
Marinic, K., Matsukura, T., Neumann, A., Oštrbenk, A., Picconi, MA., Sagadin, M., Sahli, R., Seedat,
RY., Seme, K., Severini, A., Sinchi, JL., Smahelova, J., Tabrizi, SN., Tachezy, R., Tohme Faybush, S.,
Uloza, V., Uloziene, I., Wong, YW., Židovec Lepej, S., Burk, RD. & Poljak, M. (2016). Global genomic
diversity of human papillomavirus 11 based on 433 isolates and 78 complete genome sequences.
Journal of Virology, 90(11), 5503-5513. DOI: 10.1128/jvi.03149-15
90
Kardani, K. & Bolhassani, A. (2018a). HPV proteins and their functions, in A, Bolhassani. HPV
Infections: Diagnosis, Prevention and Treatment. Bentham Science Publishers, 8-20. DOI:
10.2174/97816810861701180101
Kardani, K. & Bolhassani, A. (2018b). Types of benign or malignant diseases associated with hpv
infection, in A, Bolhassani. HPV Infections: Diagnosis, Prevention and Treatment. Bentham Science
Publishers, 30-40. DOI: 10.2174/97816810861701180101
Kardani, K., Naderi, N. & Bolhassani, A. (2018). The life cycle and transmission of hpv types, in N,
Naderi; K, Kardani & Bolhassani. HPV Infections: Diagnosis, Prevention, and Treatment. Bentham
Science Publishers, 44-52. DOI: 10.2174/97816810861701180101
Klingelhutz, AJ. & Roman, A. (2012). Cellular transformation by human papillomaviruses: Lessons
learned by comparing high-and low-risk viruses. Virology, 424(2), 77-98. DOI:
10.1016/j.virol.2011.12.018
Klug, A. & Finch, JT. (1965). Structure of viruses of the papilloma-polyoma type: I. Human wart virus.
Journal of Molecular Biology, 11(2), 403-423. DOI: 10.1016/s0022-2836(65)80066-3
Kocjan, BJ., Bzhalava, D., Forslund, O., Dillner, J. & Poljak, M. (2015). Molecular methods for
identification and characterization of novel papillomaviruses. Clinical Microbiology and
Infection, 21(9), 808–816. DOI: 10.1016/j.cmi.2015.05.011
Kuhner, MK. & Felsenstein, J. (1994). A simulation comparison of phylogeny algorithms under equal
and unequal evolutionary rates. Molecular Biology and Evolution, 11(3), 459-68. DOI:
10.1093/oxfordjournals.molbev.a040126
Kumar, S., Stecher, G., Li, M., Knyaz, C. & Tamura, K. (2018). MEGA X: Molecular evolutionary
genetics analysis across computing platforms. Molecular Biology and Evolution, 35(6), 1547–1549.
DOI: 10.1093/molbev/msy096
Larkin, MA., Blackshields, G., Brown, NP., Chenna, R., McGettigan, PA., McWilliam, H., Valentin,
F., Wallace, IM., Wilm, A., Lopez, R., Thompson, JD., Gibson, TJ. & Higgins, DG. (2007). Clustal W
and Clustal X version 2.0. Bioinformatics, 23, 2947-2948. DOI: 10.1093/bioinformatics/btm404
Lebeuf-Taylor, E., McCloskey, N., Bailey, SF., Hinz, A. & Kassen, R. (2019). The distribution of
fitness effects among synonymous mutations in a gene under directional selection. eLife, 8, e45952.
DOI: 10.7554/eLife.45952
91
Liu, X., Roberts, J., Dakic, A., Zhang, Y. & Schlegel, R. (2008). HPV E7 contributes to the telomerase
activity of immortalized and tumorigenic cells and augments E6-induced hTERT promoter function.
Virology, 375(2), 611-623. DOI: 10.1016/j.virol.2008.02.025
Longworth, MS. & Laimins, LA. (2004). Pathogenesis of human papillomaviruses in differentiating
epithelia. Microbiology and Molecular Biology Reviews, 68(2), 362-372. DOI:
10.1128/mmbr.68.2.362-372.2004
Madison, KC. (2003). Barrier function of the skin: “la raison d'etre” of the epidermis. Journal of
Investigative Dermatology, 121(2), 231-241. DOI: 10.1046/j.1523-1747.2003.12359.x
Makatsa, S. (2012). Detection of HPV-11 variants associated with recurrent laryngeal papillomatosis
based on LCR-E6 sequence data, Southern Africa. BMedSc Hons mini dissertation. University of the
Free State, Bloemfontein
Mattoon, D., Gupta, K., Doyon, J., Loll, PJ. & DiMaio, D. (2001). Identification of the transmembrane
dimer interface of the bovine papillomavirus E5 protein. Oncogene, 20(29) 3824-3834. DOI:
10.1038/sj.onc.1204523
Maver, PJ., Kocjan, BJ., Seme, K., Potocˇnik, M., Gale, N. & Poljak, M. (2011). Prevaccination
genomic diversity of human papillomavirus genotype 11: A study on 63 clinical isolates and 10 full-
length genome sequences. Journal of Medical Virology, 83, 461–470. DOI: 10.1002/jmv.21994.
Mbulawa, Z., Kularatne, R., Kufa-Chakezha, T. & Williamson, AL. (2017). Sentinel surveillance of
human papillomavirus genotypes among patients attending public healthcare facilities in South Africa.
16(2), 1-8. Retrieved from http://nicd.ac.za
Mbulawa, ZZ., Van Schalkwyk, C., Hu, NC., Meiring, TL., Barnabas, S., Dabee, S., Heather Jaspan,
H., Kriek, Z. Jaumdally, SZ., Muller, E., Bekker, LG., Lewis, DA., Dietrich, J., Gray, G., Passmore,
JAS. & Williamson, AL. (2018). High human papillomavirus (HPV) prevalence in South African
adolescents and young women encourages expanded HPV vaccination campaigns. PLOS One, 13(1),
e0190166. DOI: 10.1371/journal.pone.0190166
McBride, AA. (2017). Mechanisms and strategies of papillomavirus replication. Biological Chemistry,
398(8), 919-927. DOI: 10.1515/hsz-2017-0113
Miguel Rocha, M. & Ferreira, PG. (2018). Phylogenetic Analysis, in M, Miguel Rocha & PG, Ferreira.
Bioinformatics Algorithms. Academic Press, 199-220
Murahwa, AT., Nindo, F., Onywera, H., Meiring, TL., Martin, DP. & Williamson, AL. (2019).
Evolutionary dynamics of ten novel Gamma-PVs: Insights from phylogenetic incongruence,
92
recombination and phylodynamic analyses. BMC Genomics, 20(1), 368. DOI: 10.1186/s12864-019-
5735-9
Novakovic, D., Cheng, ATL., Zurynski, Y., Booy, R., Walker, PJ., Berkowitz, R., Harrison, H., Black,
R., Perry, C., Vijayasekaran, S., Wabnitz, D., Burns, H., Tabrizi, SN., Garland, SM., Elliott, E. &
Brotherton, JML. (2018). A prospective study of the incidence of juvenile-onset recurrent respiratory
papillomatosis after implementation of a national HPV vaccination program. The Journal of Infectious
Diseases, 217(2), 208-212. DOI: 10.1093/infdis/jix498
Okonechnikov, K., Golosova, O., Fursov, M. & the UGENE team. (2012). Unipro UGENE: A unified
bioinformatics toolkit. Bioinformatics, 28, 1166-1167. DOI: 10.1093/bioinformatics/bts091
Omland, T., Akre, H., Lie, KA., Jebsen, P., Sandvik, L. & Brøndbo, K. (2014). Risk factors for
aggressive recurrent respiratory papillomatosis in adults and juveniles. PLOS One, 9(11), e113584.
DOI: 10.1371/journal.pone.0113584
Pal, A. & Kundu, R. (2020). Human papillomavirus E6 and E7: The cervical cancer hallmarks and
targets for therapy. Frontiers in Microbiology, 10, 3116. DOI:
https://doi.org/10.3389/fmicb.2019.03116
Parker, J. & Chen, J. (2017). Application of next generation sequencing for the detection of human viral
pathogens in clinical specimens. Journal of Clinical Virology, 86, 20–26. DOI:
10.1016/j.jcv.2016.11.010
Pastrana, DV., Buck, CB., Pang, YY., Thompson, CD., Castle, PE., FitzGerald, PC., Krüger Kjaer, S.,
Lowy, DR. & Schiller, JT. (2004). Reactivity of human sera in a sensitive, high-throughput
pseudovirus-based papillomavirus neutralization assay for HPV16 and HPV18. Virology, 321(2), 205–
216. DOI: 10.1016/j.virol.2003.12.027
Peñaloza-Plascencia, M., Montoya-Fuentes, H., Flores-Martínez, SE., Fierro-Velasco, FJ., Peñaloza-
González, JM. & Sánchez-Corona, J. (2000). Molecular identification of 7 human papillomavirus types
in recurrent respiratory papillomatosis. Archives of Otolaryngology–Head and Neck Surgery, 126(9),
1119–1123. DOI: 10.1001/archotol.126.9.1119
Pinidis, P., Tsikouras, P., Iatrakis, G., Zervoudis, S., Koukouli, Z., Bothou, A., Galazios, G. &
Vladareanu, S. (2016). Human papilloma virus’ life cycle and carcinogenesis. Maedica, 11(1), 48-54.
Retrieved from https://www.maedica.ro/
93
Reidy, PM., Dedo, HH., Rabah, R., Field, JB., Mathog, RH., Gregoire, L. & Lancaster, WD. (2004).
Integration of human papillomavirus type 11 in recurrent respiratory papilloma-associated cancer. The
Laryngoscope, 114(11), 1906–1909. DOI: 10.1097/01.mlg.0000147918.81733.49
Ribeiro, AL., Caodaglio, AS. & Sichero, L. (2018). Regulation of HPV transcription. Clinics, 73(suppl
1), e486s. DOI: 10.6061/clinics/2018/e486s
Rintala, MA., Grénman, SE., Puranen, MH., Isolauri, E., Ekblad, U., Kero, PO. & Syrjänen, SM.
(2005). Transmission of high-risk human papillomavirus (HPV) between parents and infant: A
prospective study of HPV in families in Finland. Journal of Clinical Microbiology, 43(1), 376-381.
DOI: 10.1128/jcm.43.1.376-381.2005
Rodier, C., Lapointe, A., Coutlée, F., Mayrand, M. H., Dal Soglio, D., Roger, M. & Trottier, H. (2013).
Juvenile respiratory papillomatosis: Risk factors for severity. Journal of Medical Virology, 85(8),
1447–1458. DOI: 10.1002/jmv.23615
Rodríguez-Álvarez, MI., Gómez-Urquiza, JL., Husein-El Ahmed, H., Albendín-García, L., Gómez-
Salgado, J. & Cañadas-De la Fuente, GA. (2018). Prevalence and risk factors of human papillomavirus
in male patients: A systematic review and meta-analysis. International Journal of Environmental
Research and Public Health, 15(10), 2210. DOI: 10.3390/ijerph15102210
Rosenberg, T., Philipsen, BB., Mehlum, CS., Dyrvig, AK., Wehberg, S., Chirilǎ, M. & Godballe, C.
(2019). Therapeutic use of the human papillomavirus vaccine on recurrent respiratory papillomatosis:
A systematic review and meta-analysis. The Journal of Infectious Diseases, 219(7), 1016–1025. DOI:
10.1093/infdis/jiy616
Rous, P. & Beard, JW. (1935). The progression to carcinoma of virus-induced rabbit papillomas
(Shope). The Journal of Experimental Medicine, 62(4), 523-48. DOI: 10.1084/jem.62.4.523
Rusinko, J. & McPartlon, M. (2017). Species tree estimation using Neighbor Joining. Journal of
Theoretical Biology, 414, 5–7. DOI: 10.1016/j.jtbi.2016.11.005
Sabeena, S., Bhat, P., Kamath, V. & Arunkumar, G. (2017). Possible non‐sexual modes of transmission
of human papilloma virus. Journal of Obstetrics and Gynaecology Research, 43(3), 429–435. DOI:
10.1111/jog.13248
Saitou, N. & Nei, M. (1987). The Neighbor-Joining method: A new method for reconstructing
phylogenetic trees. Molecular Biology and Evolution, 4(4), 406-425. DOI:
10.1093/oxfordjournals.molbev.a040454
94
Sanger, F., Nicklen, S. & Coulson, AR. (1977). DNA sequencing with chain-terminating inhibitors.
Proceedings of the National Academy of Sciences of the United States of America, 74(12), 5463–5467.
DOI: 10.1073/pnas.74.12.5463
Seedat, RY. & Schall, R. (2018). Age of diagnosis, incidence and prevalence of recurrent respiratory
papillomatosis—A South African perspective. Clinical Otolaryngology, 43, 533– 537. DOI:
10.1111/coa.13016
Seedat, RY. (2020). Juvenile-onset recurrent respiratory papillomatosis diagnosis and management - A
developing country review. Pediatric Health, Medicine and Therapeutics, 11, 39–46. DOI:
10.2147/phmt.s200186
Seedat, RY., Combrinck, CE. & Burt, FJ. (2013). HPV associated with recurrent respiratory
papillomatosis. Future Virology, 8(5), 477–492. DOI: 10.2217/fvl.13.31
Serdoz, S., Egri-Nagy, A., Sumner, J., Holland, BR., Jarvis, PD., Tanaka, MM. & Francis, AR. (2017).
Maximum likelihood estimates of pairwise rearrangement distances. Journal of Theoretical
Biology, 423, 31–40. DOI: 10.1016/j.jtbi.2017.04.015
Shope, RE. & Hurst, EW. (1933). Infectious papillomatosis of rabbits: With a note on the
histopathology. The Journal of Experimental Medicine, 58(5), 607-24. DOI: 10.1084/jem.58.5.607
Siqueira, JD., Alves, BM., Prellwitz, IM., Furtado, C., Meyrelles, ÂR., Machado, ES., Seuánez, HN.,
Soares, MA. & Soares, EA. (2016). Identification of novel human papillomavirus lineages and
sublineages in HIV/HPV-coinfected pregnant women by next-generation sequencing. Virology, 493,
202–208. DOI: 10.1016/j.virol.2016.03.027
Smith, EM., Parker, MA., Rubenstein, LM., Haugen, TH., Hamsikova, E. & Turek, LP. (2010).
Evidence for vertical transmission of HPV from mothers to infants. Infectious Diseases in Obstetrics
and Gynecology, 2010, 326369. DOI: 10.1155/2010/326369
So, KA., Lee, IH., Kim, TJ. & Lee, KH. (2019). Risk factors of persistent HPV infection after treatment
for high-grade squamous intraepithelial lesion. Archives of Gynecology and Obstetrics, 299, 223–227.
DOI: 10.1007/s00404-018-4936-9
Sridhar, S., To, KK., Chan, JF., Lau, SK., Woo, PC. & Yuen, KY. (2015). A systematic approach to
novel virus discovery in emerging infectious disease outbreaks. The Journal of Molecular
Diagnostics, 17(3), 230–241. DOI: 10.1016/j.jmoldx.2014.12.002
95
Stanley, MA. (2020). Epithelial cell responses to infection with human papillomavirus. Clinical
Microbiology Reviews, 25(2), 215–222. DOI: 10.1128/cmr.05028-11
Stenlund, A. (2003). Initiation of DNA replication: Lessons from viral initiator proteins. Nature
Reviews Molecular Cell Biology, 4, 777–785. DOI: 10.1038/nrm1226
Sterling, JC., Handfield‐Jones, S. & Hudson, PM. (2001). Guidelines for the management of cutaneous
warts. British Journal of Dermatology, 144(1), 4-11. DOI: 10.1046/j.1365-2133.2001.04066.x
Stoler, MH., Wolinsky, SM., Whitbeck, A., Broker, TR. & Chow, LT. (1989). Differentiation-linked
human papillomavirus types 6 and 11 transcription in genital condylomata revealed by in situ
hybridization with message-specific RNA probes. Virology, 172(1), 331–340. DOI: 10.1016/0042-
6822(89)90135-9
Strauss, MJ., Shaw, EW., Bunting, H. & Melnick, JL. (1949). “Crystalline” virus-like particles from
skin papillomas characterized by intranuclear inclusion bodies. Sage Journals, 72(1), 46-50. DOI:
10.3181/00379727-72-17328
Swain, SK., Das, A., Sahoo, L., Debta, P. (2020). Pediatric laryngeal papillomatosis: A review. Annals
of Indian Academy of Otorhinolaryngology Head and Neck Surgery, 4(2), 19-23. DOI:
10.4103/aiao.aiao_11_19
Taku, O., Businge, CB., Mdaka, ML., Phohlo, K., Basera, W., Garcia-Jardon, M., Meiring, TL.,
Gyllensten, U., Williamson, AL. & Mbulawa, Z. (2020). Human papillomavirus prevalence and risk
factors among HIV-negative and HIV-positive women residing in rural Eastern Cape, South
Africa. International Journal of Infectious Diseases, 95, 176–182. DOI: 10.1016/j.ijid.2020.02.051
The International HPV Reference Centre. (2021). Human Reference clones, viewed on 22 July 2021,
Tjon Pian Gi, RE., Dietz, A., Djukic, V., Eckel, HE., Friedrich, G., Golusinski, W., Hantzakos, A.,
Lawson, G., Remacle, M., Rihkanen, H. & Dikkers, FG. (2012). Treatment of recurrent respiratory
papillomatosis and adverse reactions following off-label use of cidofovir (Vistide®). European
archives of Otorhino-Laryngology, 269(2), 361–362. DOI: 10.1007/s00405-011-1804-7
Trifinopoulos, J., Nguyen, LT., von Haeseler, A. & Minh, BQ. (2016) W-IQ-TREE: A fast online
phylogenetic tool for maximum likelihood analysis. Nucleic Acids Research, 44, W232-W235. DOI:
10.1093/nar/gkw256
96
Tseng, CJ., Liang, CC., Soong, YK. & Pao, CC. (1998). Perinatal transmission of human papillomavirus
in infants: Relationship between infection rate and mode of delivery. Obstetrics & Gynecology, 91(1),
92-96. DOI: 10.1016/S0029-7844(97)00594-2
Tseng, CJ., Lin, CY., Wang, RL., Chen, LJ., Chang, YL., Hsieh, TT. & Pao, CC. (1992). Possible
transplacental transmission of human papillomaviruses. American Journal of Obstetrics and
Gynecology, 166(1 Pt 1), 35–40. DOI: 10.1016/0002-9378(92)91825-u
Tuna, M. & Amos, CI. (2017). Next generation sequencing and its applications in HPV-associated
cancers. Oncotarget, 8(5), 8877–8889. DOI: 10.18632/oncotarget.12830
van Doorslaer, K. (2013). Evolution of the Papillomaviridae. Virology, 445,11–20. DOI:
10.1016/j.virol.2013.05.012
van Regenmortel, MHV., Fauquet, CM., Bishop, DHL., Calisher, CH., Carsten, EB., Estes, MK.,
Lemon, SM., Maniloff, J., Mayo, MA., McGeoch, DJ., Pringle, CR. & Wickner, RB. (2002). Virus
Taxonomy. Seventh Report of the International Committee for the Taxonomy of Viruses. Academic
Press, New-York
Venuti, A., Paolini, F., Nasir, L., Corteggio, A., Roperto, S., Campo, MS. & Borzacchiello, G. (2011).
Papillomavirus E5: the smallest oncoprotein with many functions. Molecular Cancer, 10, 140. DOI:
10.1186/1476-4598-10-140
Wallace, NA. & Galloway, DA. (2014). Manipulation of cellular DNA damage repair machinery
facilitates propagation of human papillomaviruses. Seminars in Cancer Biology, 26, 30–42. DOI:
10.1016/j.semcancer.2013.12.003
Wang, JW. & Roden, RB. (2013). L2, the minor capsid protein of papillomavirus. Virology, 445(1-2),
175–186. DOI: 10.1016/j.virol.2013.04.017
Watts, DH., Koutsky, LA., Holmes, KK., Goldman, D., Kuypers, J., Kiviat, NB. & Galloway, DA.
(1998). Low risk of perinatal transmission of human papillomavirus: Results from a prospective cohort
study. American Journal of Obstetrics and Gynecology, 178(2), 365–373. DOI: 10.1016/s0002-
9378(98)80027-6
Welschmeyer, A. & Berke, G. (2021). An updated review of the epidemiological factors associated with
recurrent respiratory papillomatosis. Laryngoscope Investigative Otolaryngology. 6. DOI:
10.1002/lio2.521.
97
Wilcox, LJ., Hull, BP., Baldassari, CM. & Derkay, CS. (2014). Diagnosis and management of recurrent
respiratory papillomatosis. The Pediatric Infectious Disease Journal, 33(12), 1283–1284. DOI:
10.1097/inf.0000000000000551
Wu, X., Zhang, C., Feng, S., Liu, C., Li, Y., Yang, Y., Gao, J., Li, H., Meng, S., Li, L., Zhang, Y., Hu,
X., Wu, X., Lin, L., Li, X. & Wang, Y. (2009). Detection of HPV types and neutralizing antibodies in
Gansu province, China. Journal of Medical Virology, 81, 693–702. DOI: 10.1002/jmv.21435
Yajid, AI., Zakariah, MA., Mat Zin, AA. & Othman, NH. (2017). Potential role of E4 protein in human
papillomavirus screening: A review. Asian Pacific Journal of Cancer Prevention, 18(2), 315–319. DOI:
10.22034/apjcp.2017.18.2.315
Zahreddine, M., Mayrand, MH., Therrien, C., Trevisan, A., Dagenais, C., Monnier, P., Laporte, L.,
Niyibizi, J., Deshaies, C., Carceller, AM., Fraser, W., Brassard, P., Lacroix, J., Bédard, MJ., Girard, I.,
Audibert, F., Coutlée, F. & Trottier, H. (2020). Antibodies to human papillomavirus types 6, 11, 16 and
18: Vertical transmission and clearance in children up to two years of age. EClinicalMedicine, 21,
100334. DOI: 10.1016/j.eclinm.2020.100334
Zhang, J., Zhang, S., Wang, M., Ding, X., Wen, Q., Chen, Z., Cao, M., Jing, Y. & Zhang, S. (2018).
Genetic variability in E5, E6, E7 and L1 genes of human papillomavirus type 31. Molecular Medicine
Reports, 17(4), 5498–5507. DOI: 10.3892/mmr.2018.8500
Zouridis, A., Kalampokas, T., Panoulis, K., Salakos, N. & Deligeoroglou, E. (2018). Intrauterine HPV
transmission: A systematic review of the literature. Archives of Gynecology and Obstetrics, 298(1), 35-
44. DOI: 10.1007/s00404-018-4787-4
zur Hausen, H. (2002). Papillomaviruses and cancer: From basic studies to clinical application. Nature
Reviews Cancer, 2, 342–350. DOI: 10.1038/nrc798
zur Hausen, H., Gissmann, L., Steiner, W., Dippold, W. & Dreger, I. (1975). Human papilloma viruses
and cancer. Bibliotheca Haematologica. 43, 569-71. DOI: 0.1159/000399220
98
Appendices
Appendix 1: Human papillomavirus type 11 positive patient sample information
Date of Birth Sex Diagnosis VBD number
2014/06/19 F 2018/03/14 114/18
2014/01/07 F 2018/02/27 12/18*
2014/07/03 F 2018/02/15 113/18
2013/03/31 M 2017/07/26 23/17*
1970/03/03 M 2017/06/21 47/17
1985/08/22 M 2017/06/13 21/17
1985/05/16 M 2016/12/05 68/16*
2012/03/13 M 2016/10/28 48/16*
2009/07/23 M 2016/08/17 33/16*
2002/11/12 M 2016/01/24 17/16
2012/01/08 F 2015/11/18 68/15*
1992/03/28 F 2015/10/23 70/15
2007/06/04 F 2015/09/16 49/15*
2009/03/04 M 2015/09/16 50/15
1990/05/18 M 2015/09/08 41/15*
2013/04/23 F 2015/09/01 44/15*
2011/09/01 M 2015/08/19 43/15
2013/10/28 M 2015/05/06 16/15*
1949/01/05 M 2015/04/15 08/15
2013/07/24 F 2015/04/03 14/15*
2011/05/29 M 2014/08/02 41/14*
2010/01/28 M 2014/07/25 38/14
1971/03/08 F 2014/07/23 36/14
2009/03/29 M 2014/06/26 28/14*
2011/03/17 M 2014/05/27 17/14*
2000/07/05 M 2014/05/26 22/14*
2007/11/12 M 2014/03/18 29/14*
2006/01/01 F 2013/12/09 55/13*
2011/09/07 F 2013/10/19 37/13
2011/04/24 M 2013/08/02 29/13*
Continued...
I
Date of Birth Sex Diagnosis VBD number
2010/06/11 F 2013/01/25 08/13*
2010/08/07 M 2012/10/07 48/12*
2008/01/27 M 2012/09/26 47/12*
2011/07/08 M 2012/07/31 39/12*
2003/10/21 F 2012/06/12 37/12*
14/14
2008/12/25 M 2012/01/03 09/12*
1992/03/25 F 2011/11/30 61/11*
62/11*
1999/08/08 M 2011/11/02 58/11*
2011/01/30 M 2011/09/19 52/11*
1968/12/21 M 2011/08/15 41/11*
2009/11/11 M 2011/07/25 37/11*
2010/04/15 F 2011/07/04 35/11*
2008/02/20 M 2011/06/14 49/12*
2008/05/16 M 2011/05/23 30/11*
2008/06/06 M 2011/03/07 34/11*
2010/08/11 F 2011/02/26 14/11*
18/14
2008/08/30 F 2010/12/29 04/11*
2000/10/29 F 2010/12/10 20/16
2008/10/26 M 2010/10/13 48/10*
2009/05/06 M 2010/08/18 33/10*
63/11*
01/12
02/12*
03/12
04/12*
1999/03/30 F 2010/07/26 26/10*
2008/04/23 F 2010/06/30 21/10*
2003/02/11 M 2010/06/11 23/10*
2007/01/04 M 2010/01/06 01/10*
07/10*
Continued...
II
Date of Birth Sex Diagnosis VBD number
17/10*
34/10*
1975/06/26 M 2009/11/23 79/09*
2007/06/27 M 2009/10/23 74/09*
15/10*
1967/12/22 M 2009/06/09 59/09*
63/09*
69/15*
2000/07/28 M 2008/08/05 55/08*
2000/10/14 M 2008/01/14 34/08*
1981/04/23 F 2006/12/13 01/08
2004/07/01 F 2005/11/28 04/08
37/08*
08/09*
14/09*
2003/03/22 F 2005/10/27 41/12*
2001/11/12 F 2003/10/01 17/09*
1997/09/29 M 2001/10/09 49/08*
59/08*
69/09*
16/10*
1997/02/23 M 2000/06/19 16/09*
52/09*
37/10*
1964/06/08 F 1997/06/30 33/11*
1993/09/13 M 1997/04/29 05/08
45/08*
13/10*
1977/05/01 F 1990/07/15 69/16
1979/08/04 F 1988/08/05 54/08*
1979/11/18 F 1984/11/18 77/09
*Human papillomavirus type 11 samples included in this study
III
Appendix 2: Human papillomavirus type 11 isolates retrieved from GenBank and
accession numbers
Isolate name on GenBank GenBank Length of
Accession number complete
genome
M14119.1 Human papillomavirus type 11 (HPV-11) complete M14119.1 7934bp
genome
LN833187.1 Human papillomavirus type 11 complete genome, LN833187.1 7934bp
isolate 27
LN833185.1 Human papillomavirus type 11 complete genome, LN833185.1 7934bp
isolate 25
LN833184.1 Human papillomavirus type 11 complete genome, LN833184.1 7934bp
isolate 24
LN833183.1 Human papillomavirus type 11 complete genome, LN833183.1 7948bp
isolate 23
LN833169.1 Human papillomavirus type 11 complete genome, LN833169.1 7934bp
isolate 9
LN833165.1 Human papillomavirus type 11 complete genome, LN833165.1 7934bp
isolate 5
LN833161.1 Human papillomavirus type 11 complete genome, LN833161.1 7934bp
isolate 1
KU298879.1 Human papillomavirus type 11 isolate 83A.11, KU298879.1 7932bp
complete genome
JQ773412.1 Human papillomavirus type 11 strain CU20, complete JQ773412.1 7934bp
genome
JQ773411.1 Human papillomavirus type 11 strain CU19, complete JQ773411.1 7934bp
genome
JQ773409.1 Human papillomavirus type 11 strain CU17, complete JQ773409.1 7934bp
genome
JQ773408.1 Human papillomavirus type 11 strain CU16, complete JQ773408.1 7934bp
genome
JN644141.1 Human papillomavirus type 11 isolate GUMC-AJ, JN644141.1 7934bp
complete genome
HE611263.1 Human papillomavirus type 11 complete genome, HE611263.1 7934bp
isolate LP220
HE574702.1 Human papillomavirus type 11 complete genome, HE574702.1 7934bp
isolate JO-RRP_2
IV
FR872717.1 Human papillomavirus type 11 complete genome FR872717.1 7934bp
Continued…
Isolate name on GenBank GenBank Length of
Accession number complete
genome
FN907963.1 Human papillomavirus type 11 complete genome, FN907963.1 7943bp
isolate CS20
FN907962.1 Human papillomavirus type 11 complete genome, FN907962.1 7934bp
isolate CAC86
FN870021.1 Human papillomavirus type 11 complete genome, FN870021.1 7934bp
isolate A86
EU918768.1 Human papillomavirus type 11 isolate LZod45-11, EU918768.1 7934bp
complete genome
MN788368.1 Human papillomavirus type 11 isolate JO-RRP10, MN788368.1 7934bp
complete genome
MK463921.1 Human papillomavirus type 11 isolate HPV11-gw- MK463921.1 7932bp
1110, complete genome
MK463916.1 Human papillomavirus type 11 isolate HPV11-gw- MK463916.1 7932bp
1105, complete genome
MK463914.1 Human papillomavirus type 11 isolate HPV11-gw- MK463914.1 7934bp
1103, complete genome
MK313767.1 Human papillomavirus type 11 isolate MK313767.1 7934bp
CAC1/HPV11, complete genome
MK313765.1 Human papillomavirus type 11 isolate JO- MK313765.1 7934bp
RRP8/HPV11, complete genome
MK313763.1 Human papillomavirus type 11 isolate JO- MK313763.1 7934bp
RRP2/HPV11, complete genome
V
Appendix 3: Health Sciences Research Ethics Committee approval document
VI
Appendix 4: Design of E2 segment primers
Forward primer 5’-ACAGTGCAGCTACGCCTATA-3’ (Blue)
Reverse primer 5’-TTGTACAGGCACTACCTCCATAC-3’ (Green)
>M14119.1 Human papillomavirus type 11 (HPV-11) complete genome
ACAGTGCAGCTACGCCTATAGTGCAACTGCAAGGTGATTCCAATTGTTTAAAATGTTTTAGATATAGACTGA
ATGACAAATATAAACATTTGTTTGAATTAGCATCTTCAACGTGGCATTGGGCCTCACCTGAGGCACCACATAA
AAATGCAATTGTAACATTAACATATAGCAGTGAGGAACAACGTCAGCAATTTTTAAACAGTGTAAAAATACCA
CCCACCATTAGGCATAAGGTGGGGTTTATGTCATTACATTTATTGTAACCATTACACCTGTATATATGTATATG
TGTACATAACATACGTGTATGGAGGTAGTGCCTGTACAA
>JN644141.1 Human papillomavirus type 11 isolate GUMC-AJ, complete genome
ACAGTGCAGCTACGCCTATAGTGCAACTGCAAGGTGATTCCAATTGTTTAAAATGTTTTAGATATCGACTGA
ATGACAAATATAGACATTTGTTTGAATTAGCATCTTCAACGTGGCATTGGGCCTCACCTGAGGCACCACATAA
AAATGCAATTGTAACATTAACATATAGCAGTGAGGAACAACGTCAGCAATTTTTAAACAGTGTAAAAATACCA
CCCACCATTAGGCATAAGGTGGGGTTTATGTCATTACATTTATTGTAACCATTGCACCTGTATATATGTATATG
TGTACATAACATACGTGTATGGAGGTAGTGCCTGTACAA
>JQ773408.1 Human papillomavirus type 11 strain CU16, complete genome
ACAGTGCAGCTACGCCTATAGTGCAACTGCAAGGTGATTCCAATTGTTTAAAATGTTTTAGATATCGACTGA
ATGACAAATATAAACATTTGTTTGAATTAGCATCTTCAACGTGGCATTGGGCCTCACCTGAGGCACCACATAA
AAATGCAATTGTAACATTAACCTATAGCAGTGAGGAACAACGTCAGCAATTTTTAAACAGTGTAAAAATACCA
CCCACCATTAGGCATAAGGTGGGGTTTATGTCATTACATTTATTGTAACCATTGCACCTGTATATATGTATATG
TGTACATAACATACGTGTATGGAGGTAGTGCCTGTACAA
>MN788368.1 Human papillomavirus type 11 isolate JO-RRP10, complete genome
ACAGTGCAGCTACGCCTATAGTGCAACTGCAAGGTGATTCCAATTGTTTAAAATGTTTTAGATATCGACTGA
ATGACAAATATAGACATTTGTTTGAATTAGCATCTTCAACGTGGCATTGGGCCTCACCTGAGGCACCACATAA
AAATGCAATTGTAACATTAACATATAGCAGTGAGGAACAACGTCAGCAATTTTTAAACAGTGTAAAAATACCA
CCCACCTTAGGCATAAGGTGGGGTTTATGTCATTACATTTATTGTAACCATTGCACCTGTATATATGTATATGT
GTACATAACATACGTGTATGGAGGTAGTGCCTGTACAA
>LN833187.1 Human papillomavirus type 11 complete genome, isolate 27
ACAGTGCAGCTACGCCTATAGTGCAACTGCAAGGTGATTCCAATTGTTTAAAATGTTTTAGATATAGACTGA
ATGACAAATATAGACATTTGTTTGAATTAGCATCTTCAACGTGGCATTGGGCCTCACCTGAGGCACCACATAA
AAATGCAATTGTAACATTAACATATAGCAGTGAKGAACAACGTCAGCAATTTTTAAACAGTGTAAAAATACC
ACCCACCATTAGGCATAAGGTGGGGTTTATGTCATTACATTTATTGTAACCATTGTACCTGTATATATGTATAT
GTGTACATAACATACGTGTATGGAGGTAGTGCCTGTACAA
VII
Appendix 5: Specificity of E2 segment primers
VIII
Appendix 6: 1x TAE preparation
TAE Buffer 50x Stock Recipe
▪ 242 g tris base in double-distilled H2O
▪ 57.1 ml glacial acetic acid
▪ 100 ml 0.5 M EDTA solution (pH 8.0)
To make the 1x TAE working buffer, add 49 parts of dH2O to 1 part of 50x TAE buffer
Appendix 7: Agarose gel preparation
1% Agarose gel
40ml 1%TAE 0.4g agarose powder
50ml 1%TAE 0.5g agarose powder
100ml 1%TAE 1.0g agarose powder
200ml 1%TAE 2.0g agarose powder
Appendix 8: GelRed stain preparation
To make 50ml GelRed stain:
▪ 10ul 10 000x GelRed® Nucleic Acid Gel Stain (Biotum) + 45ml dH20 + 5ml 0.1M NaCl
▪ Only use 2 – 3 times before preparing a new GelRed stain
▪ Store GelRed stain in the dark
IX
Appendix 9: Amino acid codon chart
SECOND BASE IN CODON
T C A G
T TTT Phenylalanine F TCT Serine S TAT Tyrosine Y TG Cysteine C T
T
TTC TCC TAC TG C
C
TTA Leucine L TCA TAA STOP TG STOP A
A
TTG TCG TAG TG Tryptophan W G
G
C CTT Leucine L CCT Proline P CAT Histidine H CG Arginine R T
T
CTC CCC CAC CG C
C
CT CC CA Glutamine Q CG A
A A A A
CT CC CA CG G
G G G G
A ATT Isoleucine I ACT Threonine T AAT Asparagine N AG Serine S T
T
AT AC AA AG C
C C C C
AT AC AA Lysine K AG Arginine R A
A A A A
AT Methionine M AC AA AG G
G G G G
G GTT Valine V GCT Alanine A GAT Aspartic D GG Glycine G T
acid T
GT GC GA GG C
C C C C
GT GC GA Glutamic E GG A
A A A acid A
GT GC GA GG G
G G G G
X
FIRST BASE IN CODON THIRD BASE IN CODON
Appendix 10: Pairwise analysis of percentage divergence of nucleotides using the
human papillomavirus type 11 concatenated E5a/b-L1-URR data set
XI
VBD01-10 VBD02-12 VBD04-12 VBD07-10 VBD08-09 VBD08-13 VBD09-12 VBD12-18 VBD13-10 VBD14-09 VBD14-11 VBD14-15 VBD15-10 VBD16-09 VBD16-10 VBD16-15 VBD17-09 VBD17-10 VBD17-14 VBD21-10 VBD22-14 VBD23-10 VBD23-17 VBD26-10 VBD28-14 VBD29-13
VBD01-10
VBD02-12 0,118
VBD04-12 0,118 0,000
VBD07-10 0,000 0,118 0,118
VBD08-09 0,118 0,000 0,000 0,118
VBD08-13 0,118 0,000 0,000 0,118 0,000
VBD09-12 0,118 0,000 0,000 0,118 0,000 0,000
VBD12-18 0,118 0,000 0,000 0,118 0,000 0,000 0,000
VBD13-10 0,118 0,000 0,000 0,118 0,000 0,000 0,000 0,000
VBD14-09 0,118 0,000 0,000 0,118 0,000 0,000 0,000 0,000 0,000
VBD14-11 0,118 0,000 0,000 0,118 0,000 0,000 0,000 0,000 0,000 0,000
VBD14-15 0,118 0,000 0,000 0,118 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD15-10 0,128 0,119 0,119 0,128 0,119 0,119 0,119 0,119 0,119 0,119 0,119 0,119
VBD16-09 0,119 0,000 0,000 0,119 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,120
VBD16-10 0,118 0,000 0,000 0,118 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,119 0,000
VBD16-15 0,118 0,000 0,000 0,118 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,119 0,000 0,000
VBD17-09 0,118 0,000 0,000 0,118 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,119 0,000 0,000 0,000
VBD17-10 0,000 0,118 0,118 0,000 0,118 0,118 0,118 0,118 0,118 0,118 0,118 0,118 0,128 0,119 0,118 0,118 0,118
VBD17-14 0,000 0,118 0,118 0,000 0,118 0,118 0,118 0,118 0,118 0,118 0,118 0,118 0,128 0,119 0,118 0,118 0,118 0,000
VBD21-10 0,118 0,000 0,000 0,118 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,119 0,000 0,000 0,000 0,000 0,118 0,118
VBD22-14 0,000 0,118 0,118 0,000 0,118 0,118 0,118 0,118 0,118 0,118 0,118 0,118 0,128 0,119 0,118 0,118 0,118 0,000 0,000 0,118
VBD23-10 0,128 0,120 0,120 0,128 0,120 0,120 0,120 0,120 0,120 0,120 0,120 0,120 0,001 0,120 0,120 0,120 0,120 0,128 0,128 0,120 0,128
VBD23-17 0,118 0,000 0,000 0,118 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,119 0,000 0,000 0,000 0,000 0,118 0,118 0,000 0,118 0,120
VBD26-10 0,118 0,000 0,000 0,118 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,119 0,000 0,000 0,000 0,000 0,118 0,118 0,000 0,118 0,120 0,000
VBD28-14 0,164 0,161 0,161 0,164 0,161 0,161 0,161 0,161 0,161 0,161 0,161 0,161 0,164 0,161 0,161 0,161 0,161 0,164 0,164 0,161 0,164 0,165 0,161 0,161
VBD29-13 0,118 0,000 0,000 0,118 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,119 0,000 0,000 0,000 0,000 0,118 0,118 0,000 0,118 0,120 0,000 0,000 0,161
VBD29-14 0,128 0,119 0,119 0,128 0,119 0,119 0,119 0,119 0,119 0,119 0,119 0,119 0,000 0,120 0,119 0,119 0,119 0,128 0,128 0,119 0,128 0,001 0,119 0,119 0,164 0,119
VBD30-11 0,118 0,000 0,000 0,118 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,119 0,000 0,000 0,000 0,000 0,118 0,118 0,000 0,118 0,120 0,000 0,000 0,161 0,000
VBD33-10 0,118 0,000 0,000 0,118 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,119 0,000 0,000 0,000 0,000 0,118 0,118 0,000 0,118 0,120 0,000 0,000 0,161 0,000
VBD33-11 0,000 0,118 0,118 0,000 0,118 0,118 0,118 0,118 0,118 0,118 0,118 0,118 0,128 0,119 0,118 0,118 0,118 0,000 0,000 0,118 0,000 0,128 0,118 0,118 0,164 0,118
VBD33-16 0,119 0,000 0,000 0,119 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,120 0,000 0,000 0,000 0,000 0,119 0,119 0,000 0,119 0,120 0,000 0,000 0,161 0,000
VBD34-08 0,156 0,165 0,165 0,156 0,165 0,165 0,165 0,165 0,165 0,165 0,165 0,165 0,162 0,166 0,165 0,165 0,165 0,156 0,156 0,165 0,156 0,163 0,165 0,165 0,116 0,165
VBD34-10 0,000 0,118 0,118 0,000 0,118 0,118 0,118 0,118 0,118 0,118 0,118 0,118 0,128 0,119 0,118 0,118 0,118 0,000 0,000 0,118 0,000 0,128 0,118 0,118 0,164 0,118
VBD34-11 0,128 0,119 0,119 0,128 0,119 0,119 0,119 0,119 0,119 0,119 0,119 0,119 0,000 0,120 0,119 0,119 0,119 0,128 0,128 0,119 0,128 0,001 0,119 0,119 0,164 0,119
VBD35-11 0,119 0,000 0,000 0,119 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,120 0,000 0,000 0,000 0,000 0,119 0,119 0,000 0,119 0,120 0,000 0,000 0,161 0,000
VBD37-08 0,118 0,000 0,000 0,118 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,119 0,000 0,000 0,000 0,000 0,118 0,118 0,000 0,118 0,120 0,000 0,000 0,161 0,000
VBD37-10 0,119 0,000 0,000 0,119 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,120 0,000 0,000 0,000 0,000 0,119 0,119 0,000 0,119 0,120 0,000 0,000 0,161 0,000
VBD37-11 0,128 0,119 0,119 0,128 0,119 0,119 0,119 0,119 0,119 0,119 0,119 0,119 0,000 0,120 0,119 0,119 0,119 0,128 0,128 0,119 0,128 0,001 0,119 0,119 0,164 0,119
VBD37-12 0,118 0,000 0,000 0,118 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,119 0,000 0,000 0,000 0,000 0,118 0,118 0,000 0,118 0,120 0,000 0,000 0,161 0,000
VBD39-12 0,128 0,119 0,119 0,128 0,119 0,119 0,119 0,119 0,119 0,119 0,119 0,119 0,000 0,120 0,119 0,119 0,119 0,128 0,128 0,119 0,128 0,001 0,119 0,119 0,164 0,119
VBD41-11 0,118 0,000 0,000 0,118 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,119 0,000 0,000 0,000 0,000 0,118 0,118 0,000 0,118 0,120 0,000 0,000 0,161 0,000
VBD41-12 0,118 0,000 0,000 0,118 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,119 0,000 0,000 0,000 0,000 0,118 0,118 0,000 0,118 0,120 0,000 0,000 0,161 0,000
VBD41-14 0,119 0,000 0,000 0,119 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,120 0,000 0,000 0,000 0,000 0,119 0,119 0,000 0,119 0,120 0,000 0,000 0,161 0,000
VBD41-15 0,119 0,000 0,000 0,119 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,120 0,000 0,000 0,000 0,000 0,119 0,119 0,000 0,119 0,120 0,000 0,000 0,161 0,000
VBD44-15 0,128 0,119 0,119 0,128 0,119 0,119 0,119 0,119 0,119 0,119 0,119 0,119 0,000 0,120 0,119 0,119 0,119 0,128 0,128 0,119 0,128 0,001 0,119 0,119 0,164 0,119
VBD47-12 0,128 0,119 0,119 0,128 0,119 0,119 0,119 0,119 0,119 0,119 0,119 0,119 0,000 0,120 0,119 0,119 0,119 0,128 0,128 0,119 0,128 0,001 0,119 0,119 0,164 0,119
VBD48-10 0,119 0,000 0,000 0,119 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,120 0,001 0,000 0,000 0,000 0,119 0,119 0,000 0,119 0,120 0,000 0,000 0,161 0,000
VBD48-12 0,118 0,000 0,000 0,118 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,119 0,000 0,000 0,000 0,000 0,118 0,118 0,000 0,118 0,120 0,000 0,000 0,161 0,000
VBD48-16 0,118 0,000 0,000 0,118 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,119 0,000 0,000 0,000 0,000 0,118 0,118 0,000 0,118 0,120 0,000 0,000 0,161 0,000
VBD49-08 0,118 0,000 0,000 0,118 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,119 0,000 0,000 0,000 0,000 0,118 0,118 0,000 0,118 0,120 0,000 0,000 0,161 0,000
VBD49-12 0,118 0,000 0,000 0,118 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,119 0,000 0,000 0,000 0,000 0,118 0,118 0,000 0,118 0,120 0,000 0,000 0,161 0,000
VBD49-15 0,118 0,000 0,000 0,118 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,119 0,000 0,000 0,000 0,000 0,118 0,118 0,000 0,118 0,120 0,000 0,000 0,161 0,000
VBD52-09 0,119 0,000 0,000 0,119 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,120 0,000 0,000 0,000 0,000 0,119 0,119 0,000 0,119 0,120 0,000 0,000 0,161 0,000
VBD52-11 0,118 0,000 0,000 0,118 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,119 0,000 0,000 0,000 0,000 0,118 0,118 0,000 0,118 0,120 0,000 0,000 0,161 0,000
VBD55-13 0,128 0,119 0,119 0,128 0,119 0,119 0,119 0,119 0,119 0,119 0,119 0,119 0,000 0,120 0,119 0,119 0,119 0,128 0,128 0,119 0,128 0,001 0,119 0,119 0,164 0,119
VBD58-11 0,118 0,000 0,000 0,118 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,119 0,000 0,000 0,000 0,000 0,118 0,118 0,000 0,118 0,120 0,000 0,000 0,161 0,000
VBD59-08 0,118 0,000 0,000 0,118 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,119 0,000 0,000 0,000 0,000 0,118 0,118 0,000 0,118 0,120 0,000 0,000 0,161 0,000
VBD59-09 0,119 0,000 0,000 0,119 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,120 0,000 0,000 0,000 0,000 0,119 0,119 0,000 0,119 0,120 0,000 0,000 0,161 0,000
VBD61-11 0,118 0,000 0,000 0,118 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,119 0,000 0,000 0,000 0,000 0,118 0,118 0,000 0,118 0,120 0,000 0,000 0,161 0,000
VBD62-11 0,118 0,000 0,000 0,118 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,119 0,000 0,000 0,000 0,000 0,118 0,118 0,000 0,118 0,120 0,000 0,000 0,161 0,000
VBD63-09 0,119 0,000 0,000 0,119 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,120 0,000 0,000 0,000 0,000 0,119 0,119 0,000 0,119 0,120 0,000 0,000 0,161 0,000
VBD63-11 0,118 0,000 0,000 0,118 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,119 0,000 0,000 0,000 0,000 0,118 0,118 0,000 0,118 0,120 0,000 0,000 0,161 0,000
VBD68-15 0,119 0,000 0,000 0,119 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,120 0,000 0,000 0,000 0,000 0,119 0,119 0,000 0,119 0,120 0,000 0,000 0,161 0,000
VBD68-16 0,119 0,000 0,000 0,119 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,120 0,001 0,000 0,000 0,000 0,119 0,119 0,000 0,119 0,120 0,000 0,000 0,161 0,000
VBD69-09 0,118 0,000 0,000 0,118 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,119 0,000 0,000 0,000 0,000 0,118 0,118 0,000 0,118 0,120 0,000 0,000 0,161 0,000
VBD74-09 0,128 0,119 0,119 0,128 0,119 0,119 0,119 0,119 0,119 0,119 0,119 0,119 0,000 0,120 0,119 0,119 0,119 0,128 0,128 0,119 0,128 0,001 0,119 0,119 0,164 0,119
LN833187.1 0,148 0,028 0,028 0,148 0,028 0,028 0,028 0,028 0,028 0,028 0,028 0,028 0,148 0,028 0,028 0,028 0,028 0,148 0,148 0,028 0,148 0,149 0,028 0,028 0,156 0,028
EU918768.1 0,145 0,023 0,023 0,145 0,023 0,023 0,023 0,023 0,023 0,023 0,023 0,023 0,145 0,023 0,023 0,023 0,023 0,145 0,145 0,023 0,145 0,146 0,023 0,023 0,154 0,023
FN870021.1 0,153 0,051 0,051 0,153 0,051 0,051 0,051 0,051 0,051 0,051 0,051 0,051 0,150 0,052 0,051 0,051 0,051 0,153 0,153 0,051 0,153 0,151 0,051 0,051 0,158 0,051
FN907962.1 0,001 0,120 0,120 0,001 0,120 0,120 0,120 0,120 0,120 0,120 0,120 0,120 0,129 0,120 0,120 0,120 0,120 0,001 0,001 0,120 0,001 0,129 0,120 0,120 0,165 0,120
FN907963.1 0,022 0,144 0,144 0,022 0,144 0,144 0,144 0,144 0,144 0,144 0,144 0,144 0,152 0,144 0,144 0,144 0,144 0,022 0,022 0,144 0,022 0,153 0,144 0,144 0,156 0,144
FR872717.1 0,120 0,002 0,002 0,120 0,002 0,002 0,002 0,002 0,002 0,002 0,002 0,002 0,120 0,002 0,002 0,002 0,002 0,120 0,120 0,002 0,120 0,120 0,002 0,002 0,162 0,002
HE574702.1 0,119 0,001 0,001 0,119 0,001 0,001 0,001 0,001 0,001 0,001 0,001 0,001 0,120 0,001 0,001 0,001 0,001 0,119 0,119 0,001 0,119 0,121 0,001 0,001 0,161 0,001
HE611263.1 0,118 0,000 0,000 0,118 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,119 0,000 0,000 0,000 0,000 0,118 0,118 0,000 0,118 0,120 0,000 0,000 0,161 0,000
JN644141.1 0,118 0,000 0,000 0,118 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,119 0,000 0,000 0,000 0,000 0,118 0,118 0,000 0,118 0,120 0,000 0,000 0,161 0,000
JQ773408.1 0,120 0,001 0,001 0,120 0,001 0,001 0,001 0,001 0,001 0,001 0,001 0,001 0,120 0,002 0,001 0,001 0,001 0,120 0,120 0,001 0,120 0,121 0,001 0,001 0,161 0,001
JQ773409.1 0,118 0,000 0,000 0,118 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,119 0,000 0,000 0,000 0,000 0,118 0,118 0,000 0,118 0,120 0,000 0,000 0,161 0,000
JQ773411.1 0,120 0,001 0,001 0,120 0,001 0,001 0,001 0,001 0,001 0,001 0,001 0,001 0,119 0,002 0,001 0,001 0,001 0,120 0,120 0,001 0,120 0,120 0,001 0,001 0,162 0,001
JQ773412.1 0,120 0,001 0,001 0,120 0,001 0,001 0,001 0,001 0,001 0,001 0,001 0,001 0,119 0,002 0,001 0,001 0,001 0,120 0,120 0,001 0,120 0,120 0,001 0,001 0,162 0,001
KU298879.1 0,129 0,123 0,123 0,129 0,123 0,123 0,123 0,123 0,123 0,123 0,123 0,123 0,027 0,124 0,123 0,123 0,123 0,129 0,129 0,123 0,129 0,028 0,123 0,123 0,163 0,123
LN833161.1 0,119 0,001 0,001 0,119 0,001 0,001 0,001 0,001 0,001 0,001 0,001 0,001 0,120 0,001 0,001 0,001 0,001 0,119 0,119 0,001 0,119 0,121 0,001 0,001 0,162 0,001
LN833165.1 0,146 0,025 0,025 0,146 0,025 0,025 0,025 0,025 0,025 0,025 0,025 0,025 0,147 0,025 0,025 0,025 0,025 0,146 0,146 0,025 0,146 0,148 0,025 0,025 0,154 0,025
LN833169.1 0,145 0,025 0,025 0,145 0,025 0,025 0,025 0,025 0,025 0,025 0,025 0,025 0,146 0,025 0,025 0,025 0,025 0,145 0,145 0,025 0,145 0,147 0,025 0,025 0,153 0,025
LN833183.1 0,156 0,165 0,165 0,156 0,165 0,165 0,165 0,165 0,165 0,165 0,165 0,165 0,162 0,166 0,165 0,165 0,165 0,156 0,156 0,165 0,156 0,163 0,165 0,165 0,116 0,165
LN833184.1 0,119 0,000 0,000 0,119 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,120 0,000 0,000 0,000 0,000 0,119 0,119 0,000 0,119 0,120 0,000 0,000 0,161 0,000
LN833185.1 0,119 0,000 0,000 0,119 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,120 0,001 0,000 0,000 0,000 0,119 0,119 0,000 0,119 0,120 0,000 0,000 0,161 0,000
M14119.1 0,153 0,111 0,111 0,153 0,111 0,111 0,111 0,111 0,111 0,111 0,111 0,111 0,143 0,111 0,111 0,111 0,111 0,153 0,153 0,111 0,153 0,144 0,111 0,111 0,152 0,111
MK313763.1 0,118 0,000 0,000 0,118 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,119 0,000 0,000 0,000 0,000 0,118 0,118 0,000 0,118 0,120 0,000 0,000 0,161 0,000
MK313765.1 0,119 0,000 0,000 0,119 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,120 0,001 0,000 0,000 0,000 0,119 0,119 0,000 0,119 0,120 0,000 0,000 0,161 0,000
MK313767.1 0,119 0,000 0,000 0,119 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,120 0,001 0,000 0,000 0,000 0,119 0,119 0,000 0,119 0,120 0,000 0,000 0,161 0,000
MK463914.1 0,120 0,001 0,001 0,120 0,001 0,001 0,001 0,001 0,001 0,001 0,001 0,001 0,120 0,001 0,001 0,001 0,001 0,120 0,120 0,001 0,120 0,120 0,001 0,001 0,161 0,001
MK463916.1 0,129 0,120 0,120 0,129 0,120 0,120 0,120 0,120 0,120 0,120 0,120 0,120 0,002 0,120 0,120 0,120 0,120 0,129 0,129 0,120 0,129 0,003 0,120 0,120 0,165 0,120
MK463921.1 0,128 0,120 0,120 0,128 0,120 0,120 0,120 0,120 0,120 0,120 0,120 0,120 0,002 0,120 0,120 0,120 0,120 0,128 0,128 0,120 0,128 0,003 0,120 0,120 0,165 0,120
MN788368.1 0,118 0,000 0,000 0,118 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,119 0,000 0,000 0,000 0,000 0,118 0,118 0,000 0,118 0,120 0,000 0,000 0,161 0,000
VBD79-09 0,118 0,000 0,000 0,118 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,119 0,000 0,000 0,000 0,000 0,118 0,118 0,000 0,118 0,120 0,000 0,000 0,161 0,000
Continued...
XII
VBD29-14 VBD30-11 VBD33-10 VBD33-11 VBD33-16 VBD34-08 VBD34-10 VBD34-11 VBD35-11 VBD37-08 VBD37-10 VBD37-11 VBD37-12 VBD39-12 VBD41-11 VBD41-12 VBD41-14 VBD41-15 VBD44-15 VBD47-12 VBD48-10 VBD48-12 VBD48-16 VBD49-08 VBD49-12 VBD49-15 VBD52-09
VBD01-10
VBD02-12
VBD04-12
VBD07-10
VBD08-09
VBD08-13
VBD09-12
VBD12-18
VBD13-10
VBD14-09
VBD14-11
VBD14-15
VBD15-10
VBD16-09
VBD16-10
VBD16-15
VBD17-09
VBD17-10
VBD17-14
VBD21-10
VBD22-14
VBD23-10
VBD23-17
VBD26-10
VBD28-14
VBD29-13
VBD29-14
VBD30-11 0,119
VBD33-10 0,119 0,000
VBD33-11 0,128 0,118 0,118
VBD33-16 0,120 0,000 0,000 0,119
VBD34-08 0,162 0,165 0,165 0,156 0,166
VBD34-10 0,128 0,118 0,118 0,000 0,119 0,156
VBD34-11 0,000 0,119 0,119 0,128 0,120 0,162 0,128
VBD35-11 0,120 0,000 0,000 0,119 0,000 0,166 0,119 0,120
VBD37-08 0,119 0,000 0,000 0,118 0,000 0,165 0,118 0,119 0,000
VBD37-10 0,120 0,000 0,000 0,119 0,000 0,166 0,119 0,120 0,000 0,000
VBD37-11 0,000 0,119 0,119 0,128 0,120 0,162 0,128 0,000 0,120 0,119 0,120
VBD37-12 0,119 0,000 0,000 0,118 0,000 0,165 0,118 0,119 0,000 0,000 0,000 0,119
VBD39-12 0,000 0,119 0,119 0,128 0,120 0,162 0,128 0,000 0,120 0,119 0,120 0,000 0,119
VBD41-11 0,119 0,000 0,000 0,118 0,000 0,165 0,118 0,119 0,000 0,000 0,000 0,119 0,000 0,119
VBD41-12 0,119 0,000 0,000 0,118 0,000 0,165 0,118 0,119 0,000 0,000 0,000 0,119 0,000 0,119 0,000
VBD41-14 0,120 0,000 0,000 0,119 0,000 0,166 0,119 0,120 0,000 0,000 0,000 0,120 0,000 0,120 0,000 0,000
VBD41-15 0,120 0,000 0,000 0,119 0,000 0,166 0,119 0,120 0,000 0,000 0,000 0,120 0,000 0,120 0,000 0,000 0,000
VBD44-15 0,000 0,119 0,119 0,128 0,120 0,162 0,128 0,000 0,120 0,119 0,120 0,000 0,119 0,000 0,119 0,119 0,120 0,120
VBD47-12 0,000 0,119 0,119 0,128 0,120 0,162 0,128 0,000 0,120 0,119 0,120 0,000 0,119 0,000 0,119 0,119 0,120 0,120 0,000
VBD48-10 0,120 0,000 0,000 0,119 0,001 0,166 0,119 0,120 0,001 0,000 0,001 0,120 0,000 0,120 0,000 0,000 0,001 0,001 0,120 0,120
VBD48-12 0,119 0,000 0,000 0,118 0,000 0,165 0,118 0,119 0,000 0,000 0,000 0,119 0,000 0,119 0,000 0,000 0,000 0,000 0,119 0,119 0,000
VBD48-16 0,119 0,000 0,000 0,118 0,000 0,165 0,118 0,119 0,000 0,000 0,000 0,119 0,000 0,119 0,000 0,000 0,000 0,000 0,119 0,119 0,000 0,000
VBD49-08 0,119 0,000 0,000 0,118 0,000 0,165 0,118 0,119 0,000 0,000 0,000 0,119 0,000 0,119 0,000 0,000 0,000 0,000 0,119 0,119 0,000 0,000 0,000
VBD49-12 0,119 0,000 0,000 0,118 0,000 0,165 0,118 0,119 0,000 0,000 0,000 0,119 0,000 0,119 0,000 0,000 0,000 0,000 0,119 0,119 0,000 0,000 0,000 0,000
VBD49-15 0,119 0,000 0,000 0,118 0,000 0,165 0,118 0,119 0,000 0,000 0,000 0,119 0,000 0,119 0,000 0,000 0,000 0,000 0,119 0,119 0,000 0,000 0,000 0,000 0,000
VBD52-09 0,120 0,000 0,000 0,119 0,000 0,166 0,119 0,120 0,000 0,000 0,000 0,120 0,000 0,120 0,000 0,000 0,000 0,000 0,120 0,120 0,001 0,000 0,000 0,000 0,000 0,000
VBD52-11 0,119 0,000 0,000 0,118 0,000 0,165 0,118 0,119 0,000 0,000 0,000 0,119 0,000 0,119 0,000 0,000 0,000 0,000 0,119 0,119 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD55-13 0,000 0,119 0,119 0,128 0,120 0,162 0,128 0,000 0,120 0,119 0,120 0,000 0,119 0,000 0,119 0,119 0,120 0,120 0,000 0,000 0,120 0,119 0,119 0,119 0,119 0,119 0,120
VBD58-11 0,119 0,000 0,000 0,118 0,000 0,165 0,118 0,119 0,000 0,000 0,000 0,119 0,000 0,119 0,000 0,000 0,000 0,000 0,119 0,119 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD59-08 0,119 0,000 0,000 0,118 0,000 0,165 0,118 0,119 0,000 0,000 0,000 0,119 0,000 0,119 0,000 0,000 0,000 0,000 0,119 0,119 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD59-09 0,120 0,000 0,000 0,119 0,000 0,166 0,119 0,120 0,000 0,000 0,000 0,120 0,000 0,120 0,000 0,000 0,000 0,000 0,120 0,120 0,001 0,000 0,000 0,000 0,000 0,000 0,000
VBD61-11 0,119 0,000 0,000 0,118 0,000 0,165 0,118 0,119 0,000 0,000 0,000 0,119 0,000 0,119 0,000 0,000 0,000 0,000 0,119 0,119 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD62-11 0,119 0,000 0,000 0,118 0,000 0,165 0,118 0,119 0,000 0,000 0,000 0,119 0,000 0,119 0,000 0,000 0,000 0,000 0,119 0,119 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD63-09 0,120 0,000 0,000 0,119 0,000 0,166 0,119 0,120 0,000 0,000 0,000 0,120 0,000 0,120 0,000 0,000 0,000 0,000 0,120 0,120 0,001 0,000 0,000 0,000 0,000 0,000 0,000
VBD63-11 0,119 0,000 0,000 0,118 0,000 0,165 0,118 0,119 0,000 0,000 0,000 0,119 0,000 0,119 0,000 0,000 0,000 0,000 0,119 0,119 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD68-15 0,120 0,000 0,000 0,119 0,000 0,166 0,119 0,120 0,000 0,000 0,000 0,120 0,000 0,120 0,000 0,000 0,000 0,000 0,120 0,120 0,001 0,000 0,000 0,000 0,000 0,000 0,000
VBD68-16 0,120 0,000 0,000 0,119 0,001 0,166 0,119 0,120 0,001 0,000 0,001 0,120 0,000 0,120 0,000 0,000 0,001 0,001 0,120 0,120 0,001 0,000 0,000 0,000 0,000 0,000 0,001
VBD69-09 0,119 0,000 0,000 0,118 0,000 0,165 0,118 0,119 0,000 0,000 0,000 0,119 0,000 0,119 0,000 0,000 0,000 0,000 0,119 0,119 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD74-09 0,000 0,119 0,119 0,128 0,120 0,162 0,128 0,000 0,120 0,119 0,120 0,000 0,119 0,000 0,119 0,119 0,120 0,120 0,000 0,000 0,120 0,119 0,119 0,119 0,119 0,119 0,120
LN833187.1 0,148 0,028 0,028 0,148 0,028 0,159 0,148 0,148 0,028 0,028 0,028 0,148 0,028 0,148 0,028 0,028 0,028 0,028 0,148 0,148 0,028 0,028 0,028 0,028 0,028 0,028 0,028
EU918768.1 0,145 0,023 0,023 0,145 0,023 0,158 0,145 0,145 0,023 0,023 0,023 0,145 0,023 0,145 0,023 0,023 0,023 0,023 0,145 0,145 0,023 0,023 0,023 0,023 0,023 0,023 0,023
FN870021.1 0,150 0,051 0,051 0,153 0,052 0,162 0,153 0,150 0,052 0,051 0,052 0,150 0,051 0,150 0,051 0,051 0,052 0,052 0,150 0,150 0,052 0,051 0,051 0,051 0,051 0,051 0,052
FN907962.1 0,129 0,120 0,120 0,001 0,120 0,157 0,001 0,129 0,120 0,120 0,120 0,129 0,120 0,129 0,120 0,120 0,120 0,120 0,129 0,129 0,120 0,120 0,120 0,120 0,120 0,120 0,120
FN907963.1 0,152 0,144 0,144 0,022 0,144 0,148 0,022 0,152 0,144 0,144 0,144 0,152 0,144 0,152 0,144 0,144 0,144 0,144 0,152 0,152 0,144 0,144 0,144 0,144 0,144 0,144 0,144
FR872717.1 0,120 0,002 0,002 0,120 0,002 0,166 0,120 0,120 0,002 0,002 0,002 0,120 0,002 0,120 0,002 0,002 0,002 0,002 0,120 0,120 0,002 0,002 0,002 0,002 0,002 0,002 0,002
HE574702.1 0,120 0,001 0,001 0,119 0,001 0,165 0,119 0,120 0,001 0,001 0,001 0,120 0,001 0,120 0,001 0,001 0,001 0,001 0,120 0,120 0,001 0,001 0,001 0,001 0,001 0,001 0,001
HE611263.1 0,119 0,000 0,000 0,118 0,000 0,165 0,118 0,119 0,000 0,000 0,000 0,119 0,000 0,119 0,000 0,000 0,000 0,000 0,119 0,119 0,000 0,000 0,000 0,000 0,000 0,000 0,000
JN644141.1 0,119 0,000 0,000 0,118 0,000 0,165 0,118 0,119 0,000 0,000 0,000 0,119 0,000 0,119 0,000 0,000 0,000 0,000 0,119 0,119 0,000 0,000 0,000 0,000 0,000 0,000 0,000
JQ773408.1 0,120 0,001 0,001 0,120 0,002 0,166 0,120 0,120 0,002 0,001 0,002 0,120 0,001 0,120 0,001 0,001 0,002 0,002 0,120 0,120 0,002 0,001 0,001 0,001 0,001 0,001 0,002
JQ773409.1 0,119 0,000 0,000 0,118 0,000 0,165 0,118 0,119 0,000 0,000 0,000 0,119 0,000 0,119 0,000 0,000 0,000 0,000 0,119 0,119 0,000 0,000 0,000 0,000 0,000 0,000 0,000
JQ773411.1 0,119 0,001 0,001 0,120 0,002 0,166 0,120 0,119 0,002 0,001 0,002 0,119 0,001 0,119 0,001 0,001 0,002 0,002 0,119 0,119 0,002 0,001 0,001 0,001 0,001 0,001 0,002
JQ773412.1 0,119 0,001 0,001 0,120 0,002 0,166 0,120 0,119 0,002 0,001 0,002 0,119 0,001 0,119 0,001 0,001 0,002 0,002 0,119 0,119 0,002 0,001 0,001 0,001 0,001 0,001 0,002
KU298879.1 0,027 0,123 0,123 0,129 0,124 0,164 0,129 0,027 0,124 0,123 0,124 0,027 0,123 0,027 0,123 0,123 0,124 0,124 0,027 0,027 0,124 0,123 0,123 0,123 0,123 0,123 0,124
LN833161.1 0,120 0,001 0,001 0,119 0,001 0,166 0,119 0,120 0,001 0,001 0,001 0,120 0,001 0,120 0,001 0,001 0,001 0,001 0,120 0,120 0,001 0,001 0,001 0,001 0,001 0,001 0,001
LN833165.1 0,147 0,025 0,025 0,146 0,025 0,158 0,146 0,147 0,025 0,025 0,025 0,147 0,025 0,147 0,025 0,025 0,025 0,025 0,147 0,147 0,026 0,025 0,025 0,025 0,025 0,025 0,025
LN833169.1 0,146 0,025 0,025 0,145 0,025 0,158 0,145 0,146 0,025 0,025 0,025 0,146 0,025 0,146 0,025 0,025 0,025 0,025 0,146 0,146 0,025 0,025 0,025 0,025 0,025 0,025 0,025
LN833183.1 0,162 0,165 0,165 0,156 0,166 0,000 0,156 0,162 0,166 0,165 0,166 0,162 0,165 0,162 0,165 0,165 0,166 0,166 0,162 0,162 0,166 0,165 0,165 0,165 0,165 0,165 0,166
LN833184.1 0,120 0,000 0,000 0,119 0,000 0,166 0,119 0,120 0,000 0,000 0,000 0,120 0,000 0,120 0,000 0,000 0,000 0,000 0,120 0,120 0,001 0,000 0,000 0,000 0,000 0,000 0,000
LN833185.1 0,120 0,000 0,000 0,119 0,001 0,166 0,119 0,120 0,001 0,000 0,001 0,120 0,000 0,120 0,000 0,000 0,001 0,001 0,120 0,120 0,000 0,000 0,000 0,000 0,000 0,000 0,001
M14119.1 0,143 0,111 0,111 0,153 0,111 0,154 0,153 0,143 0,111 0,111 0,111 0,143 0,111 0,143 0,111 0,111 0,111 0,111 0,143 0,143 0,111 0,111 0,111 0,111 0,111 0,111 0,111
MK313763.1 0,119 0,000 0,000 0,118 0,000 0,165 0,118 0,119 0,000 0,000 0,000 0,119 0,000 0,119 0,000 0,000 0,000 0,000 0,119 0,119 0,000 0,000 0,000 0,000 0,000 0,000 0,000
MK313765.1 0,120 0,000 0,000 0,119 0,001 0,166 0,119 0,120 0,001 0,000 0,001 0,120 0,000 0,120 0,000 0,000 0,001 0,001 0,120 0,120 0,001 0,000 0,000 0,000 0,000 0,000 0,001
MK313767.1 0,120 0,000 0,000 0,119 0,001 0,166 0,119 0,120 0,001 0,000 0,001 0,120 0,000 0,120 0,000 0,000 0,001 0,001 0,120 0,120 0,001 0,000 0,000 0,000 0,000 0,000 0,001
MK463914.1 0,120 0,001 0,001 0,120 0,001 0,166 0,120 0,120 0,001 0,001 0,001 0,120 0,001 0,120 0,001 0,001 0,001 0,001 0,120 0,120 0,001 0,001 0,001 0,001 0,001 0,001 0,001
MK463916.1 0,002 0,120 0,120 0,129 0,120 0,163 0,129 0,002 0,120 0,120 0,120 0,002 0,120 0,002 0,120 0,120 0,120 0,120 0,002 0,002 0,120 0,120 0,120 0,120 0,120 0,120 0,120
MK463921.1 0,002 0,120 0,120 0,128 0,120 0,162 0,128 0,002 0,120 0,120 0,120 0,002 0,120 0,002 0,120 0,120 0,120 0,120 0,002 0,002 0,120 0,120 0,120 0,120 0,120 0,120 0,120
MN788368.1 0,119 0,000 0,000 0,118 0,000 0,165 0,118 0,119 0,000 0,000 0,000 0,119 0,000 0,119 0,000 0,000 0,000 0,000 0,119 0,119 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD79-09 0,119 0,000 0,000 0,118 0,000 0,165 0,118 0,119 0,000 0,000 0,000 0,119 0,000 0,119 0,000 0,000 0,000 0,000 0,119 0,119 0,000 0,000 0,000 0,000 0,000 0,000 0,000
Continued...
XIII
VBD52-11 VBD55-13 VBD58-11 VBD59-08 VBD59-09 VBD61-11 VBD62-11 VBD63-09 VBD63-11 VBD68-15 VBD68-16 VBD69-09 VBD74-09 LN833187.1 EU918768.1 FN870021.1 FN907962.1 FN907963.1 FR872717.1 HE574702.1 HE611263.1 JN644141.1 JQ773408.1 JQ773409.1 JQ773411.1 JQ773412.1 KU298879.1
VBD01-10
VBD02-12
VBD04-12
VBD07-10
VBD08-09
VBD08-13
VBD09-12
VBD12-18
VBD13-10
VBD14-09
VBD14-11
VBD14-15
VBD15-10
VBD16-09
VBD16-10
VBD16-15
VBD17-09
VBD17-10
VBD17-14
VBD21-10
VBD22-14
VBD23-10
VBD23-17
VBD26-10
VBD28-14
VBD29-13
VBD29-14
VBD30-11
VBD33-10
VBD33-11
VBD33-16
VBD34-08
VBD34-10
VBD34-11
VBD35-11
VBD37-08
VBD37-10
VBD37-11
VBD37-12
VBD39-12
VBD41-11
VBD41-12
VBD41-14
VBD41-15
VBD44-15
VBD47-12
VBD48-10
VBD48-12
VBD48-16
VBD49-08
VBD49-12
VBD49-15
VBD52-09
VBD52-11
VBD55-13 0,119
VBD58-11 0,000 0,119
VBD59-08 0,000 0,119 0,000
VBD59-09 0,000 0,120 0,000 0,000
VBD61-11 0,000 0,119 0,000 0,000 0,000
VBD62-11 0,000 0,119 0,000 0,000 0,000 0,000
VBD63-09 0,000 0,120 0,000 0,000 0,000 0,000 0,000
VBD63-11 0,000 0,119 0,000 0,000 0,000 0,000 0,000 0,000
VBD68-15 0,000 0,120 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD68-16 0,000 0,120 0,000 0,000 0,001 0,000 0,000 0,001 0,000 0,001
VBD69-09 0,000 0,119 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD74-09 0,119 0,000 0,119 0,119 0,120 0,119 0,119 0,120 0,119 0,120 0,120 0,119
LN833187.1 0,028 0,148 0,028 0,028 0,028 0,028 0,028 0,028 0,028 0,028 0,028 0,028 0,148
EU918768.1 0,023 0,145 0,023 0,023 0,023 0,023 0,023 0,023 0,023 0,023 0,023 0,023 0,145 0,009
FN870021.1 0,051 0,150 0,051 0,051 0,052 0,051 0,051 0,052 0,051 0,052 0,052 0,051 0,150 0,037 0,030
FN907962.1 0,120 0,129 0,120 0,120 0,120 0,120 0,120 0,120 0,120 0,120 0,121 0,120 0,129 0,149 0,146 0,154
FN907963.1 0,144 0,152 0,144 0,144 0,144 0,144 0,144 0,144 0,144 0,144 0,144 0,144 0,152 0,125 0,120 0,127 0,022
FR872717.1 0,002 0,120 0,002 0,002 0,002 0,002 0,002 0,002 0,002 0,002 0,002 0,002 0,120 0,029 0,024 0,053 0,121 0,145
HE574702.1 0,001 0,120 0,001 0,001 0,001 0,001 0,001 0,001 0,001 0,001 0,001 0,001 0,120 0,029 0,024 0,052 0,121 0,145 0,003
HE611263.1 0,000 0,119 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,119 0,028 0,023 0,051 0,120 0,144 0,002 0,001
JN644141.1 0,000 0,119 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,119 0,028 0,023 0,051 0,120 0,144 0,002 0,001 0,000
JQ773408.1 0,001 0,120 0,001 0,001 0,002 0,001 0,001 0,002 0,001 0,002 0,002 0,001 0,120 0,028 0,023 0,052 0,120 0,144 0,002 0,002 0,001 0,001
JQ773409.1 0,000 0,119 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,119 0,028 0,023 0,051 0,120 0,144 0,002 0,001 0,000 0,000 0,001
JQ773411.1 0,001 0,119 0,001 0,001 0,002 0,001 0,001 0,002 0,001 0,002 0,002 0,001 0,119 0,028 0,024 0,052 0,120 0,145 0,000 0,002 0,001 0,001 0,001 0,001
JQ773412.1 0,001 0,119 0,001 0,001 0,002 0,001 0,001 0,002 0,001 0,002 0,002 0,001 0,119 0,028 0,024 0,052 0,120 0,145 0,000 0,002 0,001 0,001 0,001 0,001 0,000
KU298879.1 0,123 0,027 0,123 0,123 0,124 0,123 0,123 0,124 0,123 0,124 0,124 0,123 0,027 0,154 0,150 0,149 0,131 0,155 0,125 0,124 0,123 0,123 0,125 0,123 0,124 0,124
LN833161.1 0,001 0,120 0,001 0,001 0,001 0,001 0,001 0,001 0,001 0,001 0,001 0,001 0,120 0,029 0,024 0,052 0,121 0,145 0,003 0,001 0,001 0,001 0,002 0,001 0,002 0,002 0,124
LN833165.1 0,025 0,147 0,025 0,025 0,025 0,025 0,025 0,025 0,025 0,025 0,026 0,025 0,147 0,005 0,006 0,034 0,147 0,122 0,026 0,026 0,025 0,025 0,026 0,025 0,026 0,026 0,152
LN833169.1 0,025 0,146 0,025 0,025 0,025 0,025 0,025 0,025 0,025 0,025 0,025 0,025 0,146 0,005 0,005 0,033 0,146 0,121 0,026 0,026 0,025 0,025 0,025 0,025 0,025 0,025 0,152
LN833183.1 0,165 0,162 0,165 0,165 0,166 0,165 0,165 0,166 0,165 0,166 0,166 0,165 0,162 0,159 0,158 0,162 0,157 0,148 0,166 0,165 0,165 0,165 0,166 0,165 0,166 0,166 0,164
LN833184.1 0,000 0,120 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,001 0,000 0,120 0,028 0,023 0,052 0,120 0,144 0,002 0,001 0,000 0,000 0,002 0,000 0,002 0,002 0,124
LN833185.1 0,000 0,120 0,000 0,000 0,001 0,000 0,000 0,001 0,000 0,001 0,001 0,000 0,120 0,028 0,023 0,052 0,120 0,144 0,002 0,001 0,000 0,000 0,002 0,000 0,002 0,002 0,124
M14119.1 0,111 0,143 0,111 0,111 0,111 0,111 0,111 0,111 0,111 0,111 0,111 0,111 0,143 0,092 0,088 0,087 0,154 0,127 0,112 0,112 0,111 0,111 0,111 0,111 0,111 0,111 0,149
MK313763.1 0,000 0,119 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,119 0,028 0,023 0,051 0,120 0,144 0,002 0,001 0,000 0,000 0,001 0,000 0,001 0,001 0,123
MK313765.1 0,000 0,120 0,000 0,000 0,001 0,000 0,000 0,001 0,000 0,001 0,001 0,000 0,120 0,028 0,023 0,052 0,120 0,144 0,002 0,001 0,000 0,000 0,002 0,000 0,002 0,002 0,124
MK313767.1 0,000 0,120 0,000 0,000 0,001 0,000 0,000 0,001 0,000 0,001 0,001 0,000 0,120 0,028 0,023 0,052 0,120 0,144 0,002 0,001 0,000 0,000 0,002 0,000 0,002 0,002 0,124
MK463914.1 0,001 0,120 0,001 0,001 0,001 0,001 0,001 0,001 0,001 0,001 0,001 0,001 0,120 0,028 0,023 0,052 0,120 0,144 0,001 0,002 0,001 0,001 0,001 0,001 0,001 0,001 0,125
MK463916.1 0,120 0,002 0,120 0,120 0,120 0,120 0,120 0,120 0,120 0,120 0,120 0,120 0,002 0,149 0,146 0,151 0,129 0,153 0,120 0,121 0,120 0,120 0,120 0,120 0,119 0,119 0,029
MK463921.1 0,120 0,002 0,120 0,120 0,120 0,120 0,120 0,120 0,120 0,120 0,120 0,120 0,002 0,149 0,145 0,150 0,128 0,153 0,119 0,120 0,120 0,120 0,120 0,120 0,119 0,119 0,028
MN788368.1 0,000 0,119 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,119 0,028 0,023 0,051 0,120 0,144 0,002 0,001 0,000 0,000 0,001 0,000 0,001 0,001 0,123
VBD79-09 0,000 0,119 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,119 0,028 0,023 0,051 0,120 0,144 0,002 0,001 0,000 0,000 0,001 0,000 0,001 0,001 0,123
Continued...
XIV
LN833161.1 LN833165.1 LN833169.1 LN833183.1 LN833184.1 LN833185.1 M14119.1 MK313763.1 MK313765.1 MK313767.1 MK463914.1 MK463916.1 MK463921.1 MN788368.1 VBD79-09
VBD01-10
VBD02-12
VBD04-12
VBD07-10
VBD08-09
VBD08-13
VBD09-12
VBD12-18
VBD13-10
VBD14-09
VBD14-11
VBD14-15
VBD15-10
VBD16-09
VBD16-10
VBD16-15
VBD17-09
VBD17-10
VBD17-14
VBD21-10
VBD22-14
VBD23-10
VBD23-17
VBD26-10
VBD28-14
VBD29-13
VBD29-14
VBD30-11
VBD33-10
VBD33-11
VBD33-16
VBD34-08
VBD34-10
VBD34-11
VBD35-11
VBD37-08
VBD37-10
VBD37-11
VBD37-12
VBD39-12
VBD41-11
VBD41-12
VBD41-14
VBD41-15
VBD44-15
VBD47-12
VBD48-10
VBD48-12
VBD48-16
VBD49-08
VBD49-12
VBD49-15
VBD52-09
VBD52-11
VBD55-13
VBD58-11
VBD59-08
VBD59-09
VBD61-11
VBD62-11
VBD63-09
VBD63-11
VBD68-15
VBD68-16
VBD69-09
VBD74-09
LN833187.1
EU918768.1
FN870021.1
FN907962.1
FN907963.1
FR872717.1
HE574702.1
HE611263.1
JN644141.1
JQ773408.1
JQ773409.1
JQ773411.1
JQ773412.1
KU298879.1
LN833161.1
LN833165.1 0,026
LN833169.1 0,026 0,000
LN833183.1 0,166 0,158 0,158
LN833184.1 0,001 0,025 0,025 0,166
LN833185.1 0,001 0,026 0,025 0,166 0,001
M14119.1 0,112 0,090 0,090 0,154 0,111 0,111
MK313763.1 0,001 0,025 0,025 0,165 0,000 0,000 0,111
MK313765.1 0,001 0,026 0,025 0,166 0,001 0,001 0,111 0,000
MK313767.1 0,001 0,026 0,025 0,166 0,001 0,001 0,111 0,000 0,000
MK463914.1 0,002 0,026 0,025 0,166 0,001 0,001 0,111 0,001 0,001 0,001
MK463916.1 0,121 0,148 0,147 0,163 0,120 0,120 0,145 0,120 0,120 0,120 0,120
MK463921.1 0,120 0,147 0,147 0,162 0,120 0,120 0,144 0,120 0,120 0,120 0,119 0,000
MN788368.1 0,001 0,025 0,025 0,165 0,000 0,000 0,111 0,000 0,000 0,000 0,001 0,120 0,120
VBD79-09 0,001 0,025 0,025 0,165 0,000 0,000 0,111 0,000 0,000 0,000 0,001 0,120 0,120 0,000
Appendix 11: Pairwise analysis of percentage divergence of nucleotides using the
208bp human papillomavirus type 11 E2 segment data set
XV
VBD01-10 VBD02-12 VBD04-12 VBD07-10 VBD08-09 VBD08-13 VBD09-12 VBD12-18 VBD13-10 VBD14-09 VBD14-11 VBD14-15 VBD15-10 VBD16-09 VBD16-10 VBD16-15 VBD17-09 VBD17-10 VBD17-14 VBD21-10 VBD22-14 VBD23-10
VBD01-10
VBD02-12 0,000
VBD04-12 0,000 0,000
VBD07-10 0,000 0,000 0,000
VBD08-09 0,000 0,000 0,000 0,000
VBD08-13 0,000 0,000 0,000 0,000 0,000
VBD09-12 0,000 0,000 0,000 0,000 0,000 0,000
VBD12-18 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD13-10 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD14-09 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD14-11 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005
VBD14-15 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005
VBD15-10 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000
VBD16-09 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000
VBD16-10 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000
VBD16-15 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000
VBD17-09 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000
VBD17-10 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000 0,000
VBD17-14 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD21-10 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD22-14 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD23-10 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD23-17 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD26-10 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD28-14 0,020 0,020 0,020 0,020 0,020 0,020 0,020 0,020 0,020 0,020 0,025 0,020 0,020 0,020 0,020 0,020 0,020 0,020 0,020 0,020 0,020 0,020
VBD29-13 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD29-14 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,010 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005
VBD30-11 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD33-10 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD33-11 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD33-16 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD34-08 0,024 0,024 0,024 0,024 0,024 0,024 0,024 0,024 0,024 0,024 0,029 0,024 0,024 0,024 0,024 0,024 0,024 0,024 0,024 0,024 0,024 0,024
VBD34-10 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD34-11 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD35-11 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD37-08 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD37-10 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD37-11 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD37-12 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD39-12 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD41-11 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD41-12 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD41-14 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD41-15 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD44-15 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD47-12 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD48-10 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD48-12 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD48-16 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD49-08 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD49-12 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD49-15 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD52-09 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD52-11 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD55-13 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD58-11 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD59-08 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD59-09 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD61-11 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,010 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005
VBD62-11 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,010 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005
VBD63-09 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD63-11 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD68-15 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD68-16 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD69-09 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD69-15 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD74-09 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,010 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005
VBD79-09 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
EU918768.1 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,015 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010
FN870021.1 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,015 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010
FN907962.1 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,010 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005
FN907963.1 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,015 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010
FR872717.1 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,015 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010
HE574702.1 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
HE611263.1 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
JN644141.1 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
JQ773408.1 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,015 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010
JQ773409.1 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
JQ773411.1 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,015 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010
JQ773412.1 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,015 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010
KU298879.1 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
LN833161.1 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
LN833165.1 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,015 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010
LN833169.1 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,015 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010
LN833183.1 0,024 0,024 0,024 0,024 0,024 0,024 0,024 0,024 0,024 0,024 0,029 0,024 0,024 0,024 0,024 0,024 0,024 0,024 0,024 0,024 0,024 0,024
LN833184.1 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
LN833185.1 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
LN833187.1_(1) 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,010 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005
LN833187.1_(2) 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,015 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010
M14119.1 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,015 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010
MK313763.1 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
MK313765.1 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
MK313767.1 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
MK463914.1 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,015 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010
MK463916.1 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,010 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005
MK463921.1 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,010 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005
MN788368.1 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
Continued...
XVI
VBD23-17 VBD26-10 VBD28-14 VBD29-13 VBD29-14 VBD30-11 VBD33-10 VBD33-11 VBD33-16 VBD34-08 VBD34-10 VBD34-11 VBD35-11 VBD37-08 VBD37-10 VBD37-11 VBD37-12 VBD39-12 VBD41-11 VBD41-12 VBD41-14 VBD41-15
VBD01-10
VBD02-12
VBD04-12
VBD07-10
VBD08-09
VBD08-13
VBD09-12
VBD12-18
VBD13-10
VBD14-09
VBD14-11
VBD14-15
VBD15-10
VBD16-09
VBD16-10
VBD16-15
VBD17-09
VBD17-10
VBD17-14
VBD21-10
VBD22-14
VBD23-10
VBD23-17
VBD26-10 0,000
VBD28-14 0,020 0,020
VBD29-13 0,000 0,000 0,020
VBD29-14 0,005 0,005 0,025 0,005
VBD30-11 0,000 0,000 0,020 0,000 0,005
VBD33-10 0,000 0,000 0,020 0,000 0,005 0,000
VBD33-11 0,000 0,000 0,020 0,000 0,005 0,000 0,000
VBD33-16 0,000 0,000 0,020 0,000 0,005 0,000 0,000 0,000
VBD34-08 0,024 0,024 0,005 0,024 0,029 0,024 0,024 0,024 0,024
VBD34-10 0,000 0,000 0,020 0,000 0,005 0,000 0,000 0,000 0,000 0,024
VBD34-11 0,000 0,000 0,020 0,000 0,005 0,000 0,000 0,000 0,000 0,024 0,000
VBD35-11 0,000 0,000 0,020 0,000 0,005 0,000 0,000 0,000 0,000 0,024 0,000 0,000
VBD37-08 0,000 0,000 0,020 0,000 0,005 0,000 0,000 0,000 0,000 0,024 0,000 0,000 0,000
VBD37-10 0,000 0,000 0,020 0,000 0,005 0,000 0,000 0,000 0,000 0,024 0,000 0,000 0,000 0,000
VBD37-11 0,000 0,000 0,020 0,000 0,005 0,000 0,000 0,000 0,000 0,024 0,000 0,000 0,000 0,000 0,000
VBD37-12 0,000 0,000 0,020 0,000 0,005 0,000 0,000 0,000 0,000 0,024 0,000 0,000 0,000 0,000 0,000 0,000
VBD39-12 0,000 0,000 0,020 0,000 0,005 0,000 0,000 0,000 0,000 0,024 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD41-11 0,000 0,000 0,020 0,000 0,005 0,000 0,000 0,000 0,000 0,024 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD41-12 0,000 0,000 0,020 0,000 0,005 0,000 0,000 0,000 0,000 0,024 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD41-14 0,000 0,000 0,020 0,000 0,005 0,000 0,000 0,000 0,000 0,024 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD41-15 0,000 0,000 0,020 0,000 0,005 0,000 0,000 0,000 0,000 0,024 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD44-15 0,000 0,000 0,020 0,000 0,005 0,000 0,000 0,000 0,000 0,024 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD47-12 0,000 0,000 0,020 0,000 0,005 0,000 0,000 0,000 0,000 0,024 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD48-10 0,000 0,000 0,020 0,000 0,005 0,000 0,000 0,000 0,000 0,024 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD48-12 0,000 0,000 0,020 0,000 0,005 0,000 0,000 0,000 0,000 0,024 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD48-16 0,000 0,000 0,020 0,000 0,005 0,000 0,000 0,000 0,000 0,024 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD49-08 0,000 0,000 0,020 0,000 0,005 0,000 0,000 0,000 0,000 0,024 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD49-12 0,000 0,000 0,020 0,000 0,005 0,000 0,000 0,000 0,000 0,024 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD49-15 0,000 0,000 0,020 0,000 0,005 0,000 0,000 0,000 0,000 0,024 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD52-09 0,000 0,000 0,020 0,000 0,005 0,000 0,000 0,000 0,000 0,024 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD52-11 0,000 0,000 0,020 0,000 0,005 0,000 0,000 0,000 0,000 0,024 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD55-13 0,000 0,000 0,020 0,000 0,005 0,000 0,000 0,000 0,000 0,024 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD58-11 0,000 0,000 0,020 0,000 0,005 0,000 0,000 0,000 0,000 0,024 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD59-08 0,000 0,000 0,020 0,000 0,005 0,000 0,000 0,000 0,000 0,024 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD59-09 0,000 0,000 0,020 0,000 0,005 0,000 0,000 0,000 0,000 0,024 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD61-11 0,005 0,005 0,025 0,005 0,010 0,005 0,005 0,005 0,005 0,029 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005
VBD62-11 0,005 0,005 0,025 0,005 0,010 0,005 0,005 0,005 0,005 0,029 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005
VBD63-09 0,000 0,000 0,020 0,000 0,005 0,000 0,000 0,000 0,000 0,024 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD63-11 0,000 0,000 0,020 0,000 0,005 0,000 0,000 0,000 0,000 0,024 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD68-15 0,000 0,000 0,020 0,000 0,005 0,000 0,000 0,000 0,000 0,024 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD68-16 0,000 0,000 0,020 0,000 0,005 0,000 0,000 0,000 0,000 0,024 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD69-09 0,000 0,000 0,020 0,000 0,005 0,000 0,000 0,000 0,000 0,024 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD69-15 0,000 0,000 0,020 0,000 0,005 0,000 0,000 0,000 0,000 0,024 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD74-09 0,005 0,005 0,025 0,005 0,010 0,005 0,005 0,005 0,005 0,029 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005
VBD79-09 0,000 0,000 0,020 0,000 0,005 0,000 0,000 0,000 0,000 0,024 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
EU918768.1 0,010 0,010 0,020 0,010 0,015 0,010 0,010 0,010 0,010 0,024 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010
FN870021.1 0,010 0,010 0,020 0,010 0,015 0,010 0,010 0,010 0,010 0,024 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010
FN907962.1 0,005 0,005 0,025 0,005 0,010 0,005 0,005 0,005 0,005 0,029 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005
FN907963.1 0,010 0,010 0,020 0,010 0,015 0,010 0,010 0,010 0,010 0,024 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010
FR872717.1 0,010 0,010 0,029 0,010 0,015 0,010 0,010 0,010 0,010 0,034 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010
HE574702.1 0,000 0,000 0,020 0,000 0,005 0,000 0,000 0,000 0,000 0,024 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
HE611263.1 0,000 0,000 0,020 0,000 0,005 0,000 0,000 0,000 0,000 0,024 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
JN644141.1 0,000 0,000 0,020 0,000 0,005 0,000 0,000 0,000 0,000 0,024 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
JQ773408.1 0,010 0,010 0,029 0,010 0,015 0,010 0,010 0,010 0,010 0,034 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010
JQ773409.1 0,000 0,000 0,020 0,000 0,005 0,000 0,000 0,000 0,000 0,024 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
JQ773411.1 0,010 0,010 0,029 0,010 0,015 0,010 0,010 0,010 0,010 0,034 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010
JQ773412.1 0,010 0,010 0,029 0,010 0,015 0,010 0,010 0,010 0,010 0,034 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010
KU298879.1 0,000 0,000 0,020 0,000 0,005 0,000 0,000 0,000 0,000 0,024 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
LN833161.1 0,000 0,000 0,020 0,000 0,005 0,000 0,000 0,000 0,000 0,024 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
LN833165.1 0,010 0,010 0,020 0,010 0,015 0,010 0,010 0,010 0,010 0,024 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010
LN833169.1 0,010 0,010 0,020 0,010 0,015 0,010 0,010 0,010 0,010 0,024 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010
LN833183.1 0,024 0,024 0,005 0,024 0,029 0,024 0,024 0,024 0,024 0,000 0,024 0,024 0,024 0,024 0,024 0,024 0,024 0,024 0,024 0,024 0,024 0,024
LN833184.1 0,000 0,000 0,020 0,000 0,005 0,000 0,000 0,000 0,000 0,024 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
LN833185.1 0,000 0,000 0,020 0,000 0,005 0,000 0,000 0,000 0,000 0,024 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
LN833187.1_(1) 0,005 0,005 0,015 0,005 0,010 0,005 0,005 0,005 0,005 0,020 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005
LN833187.1_(2) 0,010 0,010 0,020 0,010 0,015 0,010 0,010 0,010 0,010 0,024 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010
M14119.1 0,010 0,010 0,020 0,010 0,015 0,010 0,010 0,010 0,010 0,024 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010
MK313763.1 0,000 0,000 0,020 0,000 0,005 0,000 0,000 0,000 0,000 0,024 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
MK313765.1 0,000 0,000 0,020 0,000 0,005 0,000 0,000 0,000 0,000 0,024 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
MK313767.1 0,000 0,000 0,020 0,000 0,005 0,000 0,000 0,000 0,000 0,024 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
MK463914.1 0,010 0,010 0,029 0,010 0,015 0,010 0,010 0,010 0,010 0,034 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010
MK463916.1 0,005 0,005 0,025 0,005 0,010 0,005 0,005 0,005 0,005 0,029 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005
MK463921.1 0,005 0,005 0,025 0,005 0,010 0,005 0,005 0,005 0,005 0,029 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005
MN788368.1 0,000 0,000 0,020 0,000 0,005 0,000 0,000 0,000 0,000 0,024 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
Continued...
XVII
VBD44-15 VBD47-12 VBD48-10 VBD48-12 VBD48-16 VBD49-08 VBD49-12 VBD49-15 VBD52-09 VBD52-11 VBD55-13 VBD58-11 VBD59-08 VBD59-09 VBD61-11 VBD62-11 VBD63-09 VBD63-11 VBD68-15 VBD68-16 VBD69-09 VBD69-15
VBD01-10
VBD02-12
VBD04-12
VBD07-10
VBD08-09
VBD08-13
VBD09-12
VBD12-18
VBD13-10
VBD14-09
VBD14-11
VBD14-15
VBD15-10
VBD16-09
VBD16-10
VBD16-15
VBD17-09
VBD17-10
VBD17-14
VBD21-10
VBD22-14
VBD23-10
VBD23-17
VBD26-10
VBD28-14
VBD29-13
VBD29-14
VBD30-11
VBD33-10
VBD33-11
VBD33-16
VBD34-08
VBD34-10
VBD34-11
VBD35-11
VBD37-08
VBD37-10
VBD37-11
VBD37-12
VBD39-12
VBD41-11
VBD41-12
VBD41-14
VBD41-15
VBD44-15
VBD47-12 0,000
VBD48-10 0,000 0,000
VBD48-12 0,000 0,000 0,000
VBD48-16 0,000 0,000 0,000 0,000
VBD49-08 0,000 0,000 0,000 0,000 0,000
VBD49-12 0,000 0,000 0,000 0,000 0,000 0,000
VBD49-15 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD52-09 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD52-11 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD55-13 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD58-11 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD59-08 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD59-09 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000
VBD61-11 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005
VBD62-11 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,000
VBD63-09 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,005
VBD63-11 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,005 0,000
VBD68-15 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,005 0,000 0,000
VBD68-16 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,005 0,000 0,000 0,000
VBD69-09 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,005 0,000 0,000 0,000 0,000
VBD69-15 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,005 0,000 0,000 0,000 0,000 0,000
VBD74-09 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,010 0,010 0,005 0,005 0,005 0,005 0,005 0,005
VBD79-09 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,005 0,000 0,000 0,000 0,000 0,000 0,000
EU918768.1 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,015 0,015 0,010 0,010 0,010 0,010 0,010 0,010
FN870021.1 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,015 0,015 0,010 0,010 0,010 0,010 0,010 0,010
FN907962.1 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,010 0,010 0,005 0,005 0,005 0,005 0,005 0,005
FN907963.1 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,015 0,015 0,010 0,010 0,010 0,010 0,010 0,010
FR872717.1 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,015 0,015 0,010 0,010 0,010 0,010 0,010 0,010
HE574702.1 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,005 0,000 0,000 0,000 0,000 0,000 0,000
HE611263.1 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,005 0,000 0,000 0,000 0,000 0,000 0,000
JN644141.1 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,005 0,000 0,000 0,000 0,000 0,000 0,000
JQ773408.1 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,015 0,015 0,010 0,010 0,010 0,010 0,010 0,010
JQ773409.1 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,005 0,000 0,000 0,000 0,000 0,000 0,000
JQ773411.1 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,015 0,015 0,010 0,010 0,010 0,010 0,010 0,010
JQ773412.1 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,015 0,015 0,010 0,010 0,010 0,010 0,010 0,010
KU298879.1 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,005 0,000 0,000 0,000 0,000 0,000 0,000
LN833161.1 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,005 0,000 0,000 0,000 0,000 0,000 0,000
LN833165.1 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,015 0,015 0,010 0,010 0,010 0,010 0,010 0,010
LN833169.1 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,015 0,015 0,010 0,010 0,010 0,010 0,010 0,010
LN833183.1 0,024 0,024 0,024 0,024 0,024 0,024 0,024 0,024 0,024 0,024 0,024 0,024 0,024 0,024 0,029 0,029 0,024 0,024 0,024 0,024 0,024 0,024
LN833184.1 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,005 0,000 0,000 0,000 0,000 0,000 0,000
LN833185.1 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,005 0,000 0,000 0,000 0,000 0,000 0,000
LN833187.1_(1) 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,010 0,010 0,005 0,005 0,005 0,005 0,005 0,005
LN833187.1_(2) 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,015 0,015 0,010 0,010 0,010 0,010 0,010 0,010
M14119.1 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,015 0,015 0,010 0,010 0,010 0,010 0,010 0,010
MK313763.1 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,005 0,000 0,000 0,000 0,000 0,000 0,000
MK313765.1 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,005 0,000 0,000 0,000 0,000 0,000 0,000
MK313767.1 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,005 0,000 0,000 0,000 0,000 0,000 0,000
MK463914.1 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,015 0,015 0,010 0,010 0,010 0,010 0,010 0,010
MK463916.1 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,010 0,010 0,005 0,005 0,005 0,005 0,005 0,005
MK463921.1 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,010 0,010 0,005 0,005 0,005 0,005 0,005 0,005
MN788368.1 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,000 0,005 0,005 0,000 0,000 0,000 0,000 0,000 0,000
Continued...
XVIII
VBD74-09 VBD79-09 EU918768.1 FN870021.1 FN907962.1 FN907963.1 FR872717.1 HE574702.1 HE611263.1 JN644141.1 JQ773408.1 JQ773409.1 JQ773411.1 JQ773412.1 KU298879.1 LN833161.1 LN833165.1 LN833169.1 LN833183.1 LN833184.1 LN833185.1 LN833187.1_(1)
VBD01-10
VBD02-12
VBD04-12
VBD07-10
VBD08-09
VBD08-13
VBD09-12
VBD12-18
VBD13-10
VBD14-09
VBD14-11
VBD14-15
VBD15-10
VBD16-09
VBD16-10
VBD16-15
VBD17-09
VBD17-10
VBD17-14
VBD21-10
VBD22-14
VBD23-10
VBD23-17
VBD26-10
VBD28-14
VBD29-13
VBD29-14
VBD30-11
VBD33-10
VBD33-11
VBD33-16
VBD34-08
VBD34-10
VBD34-11
VBD35-11
VBD37-08
VBD37-10
VBD37-11
VBD37-12
VBD39-12
VBD41-11
VBD41-12
VBD41-14
VBD41-15
VBD44-15
VBD47-12
VBD48-10
VBD48-12
VBD48-16
VBD49-08
VBD49-12
VBD49-15
VBD52-09
VBD52-11
VBD55-13
VBD58-11
VBD59-08
VBD59-09
VBD61-11
VBD62-11
VBD63-09
VBD63-11
VBD68-15
VBD68-16
VBD69-09
VBD69-15
VBD74-09
VBD79-09 0,005
EU918768.1 0,015 0,010
FN870021.1 0,015 0,010 0,000
FN907962.1 0,010 0,005 0,015 0,015
FN907963.1 0,015 0,010 0,000 0,000 0,015
FR872717.1 0,015 0,010 0,010 0,010 0,005 0,010
HE574702.1 0,005 0,000 0,010 0,010 0,005 0,010 0,010
HE611263.1 0,005 0,000 0,010 0,010 0,005 0,010 0,010 0,000
JN644141.1 0,005 0,000 0,010 0,010 0,005 0,010 0,010 0,000 0,000
JQ773408.1 0,015 0,010 0,010 0,010 0,005 0,010 0,000 0,010 0,010 0,010
JQ773409.1 0,005 0,000 0,010 0,010 0,005 0,010 0,010 0,000 0,000 0,000 0,010
JQ773411.1 0,015 0,010 0,010 0,010 0,005 0,010 0,000 0,010 0,010 0,010 0,000 0,010
JQ773412.1 0,015 0,010 0,010 0,010 0,005 0,010 0,000 0,010 0,010 0,010 0,000 0,010 0,000
KU298879.1 0,005 0,000 0,010 0,010 0,005 0,010 0,010 0,000 0,000 0,000 0,010 0,000 0,010 0,010
LN833161.1 0,005 0,000 0,010 0,010 0,005 0,010 0,010 0,000 0,000 0,000 0,010 0,000 0,010 0,010 0,000
LN833165.1 0,015 0,010 0,010 0,010 0,015 0,010 0,020 0,010 0,010 0,010 0,020 0,010 0,020 0,020 0,010 0,010
LN833169.1 0,015 0,010 0,010 0,010 0,015 0,010 0,020 0,010 0,010 0,010 0,020 0,010 0,020 0,020 0,010 0,010 0,000
LN833183.1 0,029 0,024 0,024 0,024 0,029 0,024 0,034 0,024 0,024 0,024 0,034 0,024 0,034 0,034 0,024 0,024 0,024 0,024
LN833184.1 0,005 0,000 0,010 0,010 0,005 0,010 0,010 0,000 0,000 0,000 0,010 0,000 0,010 0,010 0,000 0,000 0,010 0,010 0,024
LN833185.1 0,005 0,000 0,010 0,010 0,005 0,010 0,010 0,000 0,000 0,000 0,010 0,000 0,010 0,010 0,000 0,000 0,010 0,010 0,024 0,000
LN833187.1_(1) 0,010 0,005 0,005 0,005 0,010 0,005 0,015 0,005 0,005 0,005 0,015 0,005 0,015 0,015 0,005 0,005 0,005 0,005 0,020 0,005 0,005
LN833187.1_(2) 0,015 0,010 0,010 0,010 0,015 0,010 0,020 0,010 0,010 0,010 0,020 0,010 0,020 0,020 0,010 0,010 0,010 0,010 0,024 0,010 0,010 0,005
M14119.1 0,015 0,010 0,000 0,000 0,015 0,000 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,010 0,024 0,010 0,010 0,005
MK313763.1 0,005 0,000 0,010 0,010 0,005 0,010 0,010 0,000 0,000 0,000 0,010 0,000 0,010 0,010 0,000 0,000 0,010 0,010 0,024 0,000 0,000 0,005
MK313765.1 0,005 0,000 0,010 0,010 0,005 0,010 0,010 0,000 0,000 0,000 0,010 0,000 0,010 0,010 0,000 0,000 0,010 0,010 0,024 0,000 0,000 0,005
MK313767.1 0,005 0,000 0,010 0,010 0,005 0,010 0,010 0,000 0,000 0,000 0,010 0,000 0,010 0,010 0,000 0,000 0,010 0,010 0,024 0,000 0,000 0,005
MK463914.1 0,015 0,010 0,010 0,010 0,005 0,010 0,000 0,010 0,010 0,010 0,000 0,010 0,000 0,000 0,010 0,010 0,020 0,020 0,034 0,010 0,010 0,015
MK463916.1 0,010 0,005 0,015 0,015 0,000 0,015 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,015 0,015 0,029 0,005 0,005 0,010
MK463921.1 0,010 0,005 0,015 0,015 0,000 0,015 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,005 0,015 0,015 0,029 0,005 0,005 0,010
MN788368.1 0,005 0,000 0,010 0,010 0,005 0,010 0,010 0,000 0,000 0,000 0,010 0,000 0,010 0,010 0,000 0,000 0,010 0,010 0,024 0,000 0,000 0,005
Continued...
XIX
LN833187.1_(2)M14119.1 MK313763.1 MK313765.1 MK313767.1 MK463914.1 MK463916.1 MK463921.1 MN788368.1
VBD01-10
VBD02-12
VBD04-12
VBD07-10
VBD08-09
VBD08-13
VBD09-12
VBD12-18
VBD13-10
VBD14-09
VBD14-11
VBD14-15
VBD15-10
VBD16-09
VBD16-10
VBD16-15
VBD17-09
VBD17-10
VBD17-14
VBD21-10
VBD22-14
VBD23-10
VBD23-17
VBD26-10
VBD28-14
VBD29-13
VBD29-14
VBD30-11
VBD33-10
VBD33-11
VBD33-16
VBD34-08
VBD34-10
VBD34-11
VBD35-11
VBD37-08
VBD37-10
VBD37-11
VBD37-12
VBD39-12
VBD41-11
VBD41-12
VBD41-14
VBD41-15
VBD44-15
VBD47-12
VBD48-10
VBD48-12
VBD48-16
VBD49-08
VBD49-12
VBD49-15
VBD52-09
VBD52-11
VBD55-13
VBD58-11
VBD59-08
VBD59-09
VBD61-11
VBD62-11
VBD63-09
VBD63-11
VBD68-15
VBD68-16
VBD69-09
VBD69-15
VBD74-09
VBD79-09
EU918768.1
FN870021.1
FN907962.1
FN907963.1
FR872717.1
HE574702.1
HE611263.1
JN644141.1
JQ773408.1
JQ773409.1
JQ773411.1
JQ773412.1
KU298879.1
LN833161.1
LN833165.1
LN833169.1
LN833183.1
LN833184.1
LN833185.1
LN833187.1_(1)
LN833187.1_(2)
M14119.1 0,010
MK313763.1 0,010 0,010
MK313765.1 0,010 0,010 0,000
MK313767.1 0,010 0,010 0,000 0,000
MK463914.1 0,020 0,010 0,010 0,010 0,010
MK463916.1 0,015 0,015 0,005 0,005 0,005 0,005
MK463921.1 0,015 0,015 0,005 0,005 0,005 0,005 0,000
MN788368.1 0,010 0,010 0,000 0,000 0,000 0,010 0,005 0,005
Appendix 12: Pairwise analysis of percentage divergence of nucleotides using the
human papillomavirus type 11 complete genomes
XX
M14119.1 01-10 15-10 28-14 74-09 LN833183.1 FN870021.1 FR872717.1 FN907962.1 JN644141.1 JQ773408.1 LN833169.1 LN833187.1 EU918768.1 HE574702.1
M14119.1
01-10 0,00316
15-10 0,00329 0,00139
28-14 0,01198 0,01185 0,01198
74-09 0,00329 0,00139 0,00000 0,01198
LN833183.1 0,01224 0,01211 0,01224 0,00051 0,01224
FN870021.1 0,00076 0,00291 0,00304 0,01172 0,00304 0,01198
FR872717.1 0,00304 0,00139 0,00152 0,01198 0,00152 0,01223 0,00278
FN907962.1 0,00393 0,00202 0,00215 0,01236 0,00215 0,01262 0,00367 0,00164
JN644141.1 0,00316 0,00025 0,00139 0,01185 0,00139 0,01211 0,00291 0,00139 0,00202
JQ773408.1 0,00329 0,00164 0,00177 0,01223 0,00177 0,01249 0,00304 0,00101 0,00190 0,00164
LN833169.1 0,00558 0,00520 0,00558 0,01249 0,00558 0,01275 0,00532 0,00545 0,00622 0,00520 0,00583
LN833187.1 0,00596 0,00571 0,00584 0,01249 0,00584 0,01275 0,00571 0,00571 0,00622 0,00571 0,00609 0,00584
EU918768.1 0,00139 0,00354 0,00367 0,01210 0,00367 0,01236 0,00114 0,00342 0,00431 0,00354 0,00367 0,00596 0,00634
HE574702.1 0,00329 0,00063 0,00177 0,01198 0,00177 0,01224 0,00304 0,00177 0,00240 0,00063 0,00202 0,00558 0,00609 0,00367
HE611263.1 0,00304 0,00013 0,00126 0,01172 0,00126 0,01198 0,00278 0,00126 0,00190 0,00013 0,00152 0,00507 0,00558 0,00342 0,00051
JQ773409.1 0,00304 0,00013 0,00126 0,01172 0,00126 0,01198 0,00278 0,00126 0,00190 0,00013 0,00152 0,00507 0,00558 0,00342 0,00051
JQ773411.1 0,00316 0,00152 0,00164 0,01210 0,00164 0,01236 0,00291 0,00013 0,00177 0,00152 0,00114 0,00558 0,00583 0,00354 0,00190
JQ773412.1 0,00316 0,00152 0,00164 0,01210 0,00164 0,01236 0,00291 0,00013 0,00177 0,00152 0,00114 0,00558 0,00583 0,00354 0,00190
KU298879.1 0,00304 0,00038 0,00126 0,01172 0,00126 0,01198 0,00278 0,00126 0,00190 0,00038 0,00152 0,00507 0,00558 0,00342 0,00076
LN833161.1 0,00329 0,00038 0,00152 0,01198 0,00152 0,01224 0,00304 0,00152 0,00215 0,00038 0,00177 0,00533 0,00583 0,00367 0,00076
LN833165.1 0,00494 0,00482 0,00494 0,01185 0,00494 0,01211 0,00469 0,00482 0,00558 0,00482 0,00520 0,00139 0,00507 0,00532 0,00520
LN833184.1 0,00316 0,00025 0,00139 0,01185 0,00139 0,01211 0,00291 0,00139 0,00202 0,00025 0,00164 0,00520 0,00558 0,00354 0,00063
LN833185.1 0,00329 0,00038 0,00152 0,01198 0,00152 0,01224 0,00304 0,00152 0,00215 0,00038 0,00177 0,00533 0,00584 0,00367 0,00051
MK313763.1 0,00304 0,00038 0,00152 0,01198 0,00152 0,01224 0,00278 0,00152 0,00215 0,00038 0,00177 0,00533 0,00583 0,00342 0,00025
MK313765.1 0,00316 0,00025 0,00139 0,01185 0,00139 0,01211 0,00291 0,00139 0,00202 0,00025 0,00164 0,00520 0,00571 0,00354 0,00063
MK313767.1 0,00316 0,00025 0,00139 0,01185 0,00139 0,01211 0,00291 0,00139 0,00202 0,00025 0,00164 0,00520 0,00571 0,00354 0,00063
MK463914.1 0,00291 0,00126 0,00139 0,01185 0,00139 0,01211 0,00266 0,00063 0,00152 0,00126 0,00088 0,00520 0,00571 0,00329 0,00164
MK463916.1 0,00354 0,00164 0,00177 0,01224 0,00177 0,01249 0,00329 0,00126 0,00190 0,00164 0,00152 0,00584 0,00609 0,00392 0,00202
MK463921.1 0,00342 0,00152 0,00164 0,01211 0,00164 0,01236 0,00316 0,00114 0,00177 0,00152 0,00139 0,00571 0,00596 0,00380 0,00190
MN788368.1 0,00316 0,00025 0,00139 0,01185 0,00139 0,01211 0,00291 0,00139 0,00202 0,00025 0,00164 0,00520 0,00571 0,00354 0,00063
FN907963.1 0,00038 0,00278 0,00291 0,01159 0,00291 0,01185 0,00038 0,00266 0,00354 0,00278 0,00291 0,00520 0,00558 0,00101 0,00291
Continued...
XXI
MK313763. MK313765. MK313767. MK463914. MK463916. MK463921.
HE611263.1 JQ773409.1 JQ773411.1 JQ773412.1 KU298879.1 LN833161.1 LN833165.1 LN833184.1 LN833185.1 1 1 1 1 1 1 MN788368.1 FN907963.1
0,00000
0,00139 0,00139
0,00139 0,00139 0,00000
0,00025 0,00025 0,00139 0,00139
0,00025 0,00025 0,00164 0,00164 0,00051
0,00469 0,00469 0,00494 0,00494 0,00469 0,00494
0,00013 0,00013 0,00152 0,00152 0,00038 0,00038 0,00469
0,00025 0,00025 0,00164 0,00164 0,00051 0,00051 0,00494 0,00038
0,00025 0,00025 0,00164 0,00164 0,00051 0,00051 0,00494 0,00038 0,00025
0,00013 0,00013 0,00152 0,00152 0,00038 0,00038 0,00482 0,00025 0,00038 0,00038
0,00013 0,00013 0,00152 0,00152 0,00038 0,00038 0,00482 0,00025 0,00038 0,00038 0,00000
0,00114 0,00114 0,00076 0,00076 0,00114 0,00139 0,00456 0,00126 0,00139 0,00139 0,00126 0,00126
0,00152 0,00152 0,00139 0,00139 0,00152 0,00177 0,00520 0,00164 0,00177 0,00177 0,00164 0,00164 0,00114
0,00139 0,00139 0,00126 0,00126 0,00139 0,00164 0,00507 0,00152 0,00164 0,00164 0,00152 0,00152 0,00101 0,00063
0,00013 0,00013 0,00152 0,00152 0,00038 0,00038 0,00482 0,00025 0,00038 0,00038 0,00025 0,00025 0,00126 0,00164 0,00152
0,00266 0,00266 0,00278 0,00278 0,00266 0,00291 0,00456 0,00278 0,00291 0,00266 0,00278 0,00278 0,00253 0,00316 0,00304 0,00278