gi|315619884|ref|ADUM00000000.1| Escherichia coli 3431 gec3431.assembly.100, whole genome shotgun .5223419, gc%: 50.92%
REGION4
REGION_LENGTH19.4Kb
COMPLETENESS(score)questionable(70)
SPECIFIC_KEYWORDtransposase,integrase
REGION_POSITION683397-702824
TRNA_NUM0
TOTAL_PROTEIN_NUM28
PHAGE_HIT_PROTEIN_NUM21
HYPOTHETICAL_PROTEIN_NUM3
PHAGE+HYPO_PROTEIN_PERCENTAGE85.7%
BACTERIAL_PROTEIN_NUM4
ATT_SITE_SHOWUPno
PHAGE_SPECIES_NUM12
MOST_COMMON_PHAGE_NAMEPHAGE_Entero_4795
MOST_COMMON_PHAGE_NUM6
MOST_COMMON_PHAGE_PERCENTAGE21.4%
GC_PERCENTAGE48.35%
Legend:
REGION: the number assigned to the region
REGION_LENGTH: the length of the sequence of that region (in bp)
PREDICT_INTACT_OR_INCOMPLETE (score): a prediction of whether the region contains an intact or incomplete prophage based on the above criteria (with score in brackets)
SPECIFIC_KEYWORD: the specific phage-related keyword(s) found in protein name(s) in the region
REGION_POSITION: the start and end positions of the region on the bacterial chromosome
TRNA_NUM: the number of tRNA genes present in the region
TOTAL_PROTEIN_NUM: the number of ORFs present in the region
PHAGE_HIT_PROTEIN_NUM: the number of proteins in the region with matches in the phage protein database
HYPOTHETICAL_PROTEIN_NUM: the number of hypothetical proteins in the region without a match in the database
PHAGE+HYPO_PROTEIN_PERCENTAGE: the combined percentage of phage proteins and hypothetical proteins in the region
BACTERIAL_PROTEIN_NUM: the number of proteins in the region with matches in the nrfilt database
ATT_SITE_SHOWUP: the putative phage attachment site
PHAGE_SPECIES_NUM: the number of different phages that have similar proteins to those in the region
MOST_COMMON_PHAGE_NAME: the phage with the highest number of proteins most similar to those in the region
MOST_COMMON_PHAGE_NUM: the number of phages with the highest number of proteins most similar to those in the region
MOST_COMMON_PHAGE_PERCENTAGE: the percentage of proteins in PHAGE_HIT_PROTEIN_NUM that are most similar to MOST_COMMON_PHAGE_NAME proteins
GC_PERCENTAGE: the percentage of gc nucleotides of the region