tra
nsc
rip
tion
al r
eg
ula
tor,
Lys
R fa
mily
CD
Sp
ha
ge
inte
gra
se
ph
ag
e p
rote
in
ph
ag
e r
ep
ress
or
ph
ag
e tr
an
scri
ptio
na
l re
gu
lato
r, C
ro-l
ike
pro
tein
Pu
tativ
e r
eg
ula
tor
for
pro
ph
ag
ep
ha
ge
pro
tein
ph
ag
e C
pro
tein
ph
ag
e r
ep
lica
tion
initi
atio
n p
rote
in A
ph
ag
e p
rote
inp
ha
ge
ca
psi
d p
ort
al p
rote
in
ph
ag
e te
rmin
ase
AT
Pa
se s
ub
un
it
ph
ag
e c
ap
sid
sca
ffold
ing
pro
tein
ph
ag
e m
ajo
r ca
psi
d p
rote
in, P
2 fa
mily
ph
ag
e s
ma
ll te
rmin
ase
su
bu
nit
ph
ag
e h
ea
d c
om
ple
tion
pro
tein
ph
ag
e p
rote
inp
ha
ge
pro
tein
ph
ag
e p
rote
in
ph
ag
e p
rote
inp
ha
ge
ho
linp
ha
ge
lyso
zym
ep
ha
ge
pro
tein
ph
ag
e p
rote
in
ph
ag
e ta
il le
ng
th d
ete
rmin
ato
r
ph
ag
e p
rote
in
ph
ag
e p
rote
in
ph
ag
e p
rote
in
ph
ag
e ta
il fib
er-
like
pro
tein
ph
ag
e ta
il fib
re a
sse
mb
ly p
rote
inp
ha
ge
pro
tein
ph
ag
e p
rote
in
ph
ag
e p
rote
in
CD
S
CD
S
tra
nsp
osa
se
tra
nsc
rip
tion
al r
eg
ula
tor,
XR
E fa
mily
2000 4000 6000 8000 10000 12000 14000 16000 18000 20000 22000 24000 26000 28000 30000 32000 34000 36000 38000
Aeromonas salmonicida subsp. salmonicida A449-Pro1 38,386 bp
Aeromonas salmonicida subsp. salmonicida A449-Pro 2-18,624 bp
ph
ag
e in
teg
rase
ph
ag
e p
rote
in
ph
ag
e p
rote
inp
ha
ge
pro
tein
tra
nsp
osa
se s
ub
un
it 1
tra
nsp
osa
se s
ub
un
it 2
ph
ag
e p
rote
inp
ha
ge
pro
tein
2000 4000 6000 8000 10000 12000 14000 16000 18000
IS6
30
-fa
mily
tra
nsp
osa
se
ph
ag
e in
teg
rase
ph
ag
e in
teg
rase
pro
ph
ag
e tr
an
scri
ptio
na
l re
gu
lato
rb
act
eri
op
ha
ge
pro
tein
ph
ag
e p
rote
inp
ha
ge
tra
nsc
rip
tion
al p
rote
in
ph
ag
e p
rote
in
ph
ag
e p
rote
inp
ha
ge
ca
psi
d p
ort
al p
rote
in
ph
ag
e te
rmin
ase
AT
Pa
se s
ub
un
it
ph
ag
e c
ap
sid
sca
ffold
ing
pro
tein
Ph
ag
e m
ajo
r ca
psi
d p
rote
in, P
2
ph
ag
e te
rmin
ase
su
bu
nit
ph
ag
e h
ea
d c
om
ple
tion
pro
tein
ph
ag
e p
rote
inp
ha
ge
pro
tein
ph
ag
e p
rote
in
ph
ag
e p
rote
inp
ha
ge
pro
tein
ph
ag
e h
olin
ph
ag
e ly
sin
ph
ag
e p
rote
inp
ha
ge
pro
tein
ph
ag
e p
rote
inp
ha
ge
tail
len
gth
de
term
ina
tor
ph
ag
e p
rote
inp
ha
ge
pro
tein
ph
ag
e p
rote
in
ph
ag
e ta
il fib
er-
like
pro
tein
ph
ag
e ta
il fib
er
ass
em
bly
pro
tein
ph
ag
e p
rote
inp
ha
ge
pro
tein
ph
ag
e p
rote
in
2000 4000 6000 8000 10000 12000 14000 16000 18000 20000 22000 24000 26000 28000 30000 32000 34000 36000 38000
Aeromonas salmonicida subsp. salmonicida A449-Pro3-38,675 bp
Aurantiomonas GTA-14,997 bp
ph
ag
e te
rmin
ase
larg
e s
ub
un
it
pu
tativ
e N
-ace
tyltr
an
sfe
rase
ph
ag
e p
ort
al p
rote
in
pu
tativ
e p
ha
ge
pro
tea
se
ph
ag
e c
ap
sid
pro
tein
pu
tativ
e s
ecr
ete
d p
rote
in
pu
tativ
e p
ha
ge
tail
pro
tein
pu
tativ
e g
en
e tr
an
sfe
r a
ge
nt
pu
tativ
e p
ha
ge
pro
tein
pu
tativ
e p
ha
ge
pro
tein
pu
tativ
e p
ha
ge
ho
st s
pe
cific
ity p
rote
in
pu
tativ
e tr
an
spo
sase
2000 4000 6000 8000 10000 12000 14000 16000
inte
gra
se
tra
nsc
rip
tion
al r
eg
ula
tor
tra
nsc
rip
tion
al r
eg
ula
tor
ph
ag
e r
ep
lica
tion
pro
tein
ER
CC
4-t
ype
nu
cle
ase
ph
ag
e a
ntir
ep
ress
or
tra
nsc
rip
tion
al r
eg
ula
tor
hyp
oth
etic
al p
rote
in
chlo
rog
lyci
ne
hyd
rola
setr
an
scri
ptio
na
l re
gu
lato
r
2000 4000 6000 8000 10000 12000 14000 16000 18000
Bacillus sp. B14905-Prophage 1-18,188 bp
Bacillus sp. B14905-Prophage 3-23,390 bpin
teg
rase
ph
ag
e r
ela
ted
pro
tein
rep
ress
or
ph
ag
e a
ntir
ep
ress
or
ph
ag
e r
ep
lica
tion
pro
tein
rep
lica
tion
pro
tein
ph
ag
e te
rmin
ase
larg
e s
ub
un
it
po
rta
l co
nse
rve
d d
om
ain
ph
ag
e ly
sis
pro
tien
Xly
B
CD
S
2000 4000 6000 8000 10000 12000 14000 16000 18000 20000 22000 24000 26000
Erythrobacter sp. NAP1-13,519 bp-GTA?
Ph
ag
e D
NA
Pa
cka
gin
g P
rote
in
Ph
ag
e p
ort
al p
rote
in
ph
ag
e p
roh
ea
d p
rote
ase
, HK
97
fam
ily p
rote
in
CO
G4
65
3 P
red
icte
d p
ha
ge
ph
i-C
31
gp
36
ma
jor
cap
sid
-lik
e p
rote
in
CO
G0
54
2 A
TP
ase
s w
ith c
ha
pe
ron
e a
ctiv
ity, A
TP
-bin
din
g s
ub
un
itC
OG
07
91
Ce
ll w
all-
ass
oci
ate
d h
ydro
lase
s (i
nva
sio
n-a
sso
cia
ted
pro
tein
s)
2000 4000 6000 8000 10000 12000 14000
Fulvimarina pelagi HTCC2506-66,477 ntG
lyco
syl t
ran
sfe
rase
, fam
ily 5
1:P
eni
cilli
n-b
ind
ing
pro
tein
, tra
nspe
ptid
ase
do
ma
in
Ph
age
DN
A P
ack
agi
ng
Pro
tein
phag
e he
ad
port
al p
rote
in
prop
hag
e M
uS
o1, t
rans
crip
tion
al r
egul
ato
r, C
ro/C
I fam
ily p
rote
in
CO
G3
423
Pre
dic
ted
tran
scrip
tiona
l reg
ula
tor
CO
G1
475
Pre
dic
ted
tran
scrip
tiona
l reg
ula
tors
Tra
nspo
sase
an
d in
act
ivat
ed
der
iva
tive
CO
G2
842
Un
char
act
eriz
ed
AT
Pas
e, p
uta
tive
tran
spos
ase
CO
G0
640
Pre
dic
ted
tran
scrip
tiona
l reg
ula
tors
Mu-
like
pro
pha
ge p
rote
in g
p16
N-a
cety
lmu
ram
oyl-L
-ala
nin
e a
mid
ase
Mu-
like
pha
ge g
p27
puta
tive
pha
ge-r
elat
ed
pro
tein
CO
G4
383
Mu
-like
pro
phag
e p
rote
in g
p29
prop
hag
e M
uS
o2, F
pro
tein
, pu
tativ
e
CO
G4
388
Mu
-like
pro
phag
e I p
rote
in
CO
G4
387
Mu
-like
pro
phag
e p
rote
in g
p36
CO
G5
005
Mu
-like
pro
phag
e p
rote
in g
pG
CO
G4
540
Pha
ge
P2
bas
epla
te a
ssem
bly
pro
tein
gpV
prop
hag
e La
mb
daW
5, b
asep
late
ass
em
bly
pro
tein
W, p
uta
tive
Ba
sepl
ate
J-l
ike
pro
tein
Ph
age
tail
pro
tein
I
Vir
ule
nce-
asso
ciat
ed
pro
tein
CO
G0
840
Me
thyl
-acc
eptin
g ch
em
ota
xis
pro
tein
phag
e-re
late
d c
ontr
actil
e ta
il sh
eat
h p
rote
in
prop
hag
e M
uM
c02
, ta
il ta
pe
mea
sure
pro
tein
, TP
901
fam
ily
CO
G5
004
P2-
like
pro
pha
ge ta
il p
rote
in X
phag
e-re
late
d ta
il pr
ote
in
Ad
enin
e s
peci
fic D
NA
met
hyl
tran
sfer
ase
, D12
cla
ssC
OG
152
5 M
icro
cocc
al n
ucl
ease
(th
erm
onuc
leas
e) h
omol
ogs
Ph
age
por
tal p
rote
in, H
K9
7 fa
mily
:Ph
age
por
tal p
rote
inC
OG
374
0 P
hag
e he
ad m
atu
ratio
n p
rote
ase
maj
or c
apsi
d p
rote
in, H
K9
7 fa
mily
CO
G5
281
Pha
ge-
rela
ted
min
or ta
il pr
ote
in
CO
G0
791
Ce
ll w
all-a
ssoc
iate
d h
ydro
lase
s (in
vasi
on-a
sso
ciat
ed
pro
tein
s)
Res
pon
se r
egul
ato
r re
ceiv
er:
Tra
nscr
iptio
nal
reg
ulat
ory
pro
tein
,
sen
sor
hist
idin
e k
ina
se
2000 4000 6000 8000 10000 12000 14000 16000 18000 20000 22000 24000 26000 28000 30000 32000 34000 36000 38000 40000 42000 44000 46000 48000 50000 52000 54000 56000 58000 60000 62000 64000 66000 68000
Gly
cosy
l tra
nsf
era
se, f
am
ily 5
1:P
en
icill
in-b
ind
ing
pro
tein
, tra
nsp
ept
ida
se d
om
ain
Ph
ag
e D
NA
Pa
cka
gin
g P
rote
in
ph
ag
e h
ea
d p
ort
al p
rote
in
pro
ph
age
Mu
So
1, t
ran
scri
ptio
na
l re
gu
lato
r, C
ro/C
I fa
mily
pro
tein
CO
G3
42
3 P
red
icte
d tr
an
scri
ptio
na
l re
gu
lato
r
CO
G1
47
5 P
red
icte
d tr
an
scri
ptio
na
l re
gu
lato
rs
Tra
nsp
osa
se a
nd
ina
ctiv
ate
d d
eri
vativ
e
CO
G2
84
2 U
nch
ara
cte
rize
d A
TP
ase
, pu
tativ
e tr
an
spo
sase
CO
G0
64
0 P
red
icte
d tr
an
scri
ptio
na
l re
gu
lato
rs
Mu
-lik
e p
rop
ha
ge
pro
tein
gp
16
N-a
cety
lmu
ram
oyl
-L-a
lan
ine
am
ida
se
Mu
-lik
e p
ha
ge
gp
27
pu
tativ
e p
ha
ge
-re
late
d p
rote
in
CO
G4
38
3 M
u-l
ike
pro
ph
ag
e p
rote
in g
p2
9
pro
ph
age
Mu
So
2, F
pro
tein
, pu
tativ
e
CO
G4
38
8 M
u-l
ike
pro
ph
ag
e I
pro
tein
CO
G4
38
7 M
u-l
ike
pro
ph
ag
e p
rote
in g
p3
6C
OG
50
05
Mu
-lik
e p
rop
ha
ge
pro
tein
gp
G
CO
G4
54
0 P
ha
ge
P2
ba
sep
late
ass
em
bly
pro
tein
gp
V
pro
ph
age
La
mb
da
W5
, ba
sep
late
ass
em
bly
pro
tein
W, p
uta
tive
Ba
sepl
ate
J-l
ike
pro
tein
Ph
ag
e ta
il p
rote
in I
Vir
ule
nce
-ass
oci
ate
d p
rote
in
CO
G0
84
0 M
eth
yl-a
cce
ptin
g c
hem
ota
xis
pro
tein
ph
ag
e-re
late
d c
on
tra
ctile
tail
she
ath
pro
tein
pro
ph
age
Mu
Mc0
2, t
ail
tap
e m
ea
sure
pro
tein
, TP
90
1 fa
mily
CO
G5
00
4 P
2-l
ike
pro
ph
ag
e ta
il p
rote
in X
ph
ag
e-re
late
d ta
il p
rote
in
Ad
en
ine
sp
eci
fic D
NA
me
thyl
tra
nsf
era
se, D
12
cla
ssC
OG
15
25
Mic
roco
cca
l nu
cle
ase
(th
erm
on
ucl
ea
se)
ho
mo
log
s
Ph
ag
e p
ort
al p
rote
in, H
K9
7 fa
mily
:Ph
ag
e p
ort
al p
rote
inC
OG
37
40
Ph
ag
e h
ea
d m
atu
ratio
n p
rote
ase
ma
jor
cap
sid
pro
tein
, HK
97
fam
ily
CO
G5
28
1 P
ha
ge
-re
late
d m
inor
tail
pro
tein
CO
G0
79
1 C
ell
wal
l-a
sso
cia
ted
hyd
rola
ses
(in
vasi
on
-ass
oci
ate
d p
rote
ins)
Re
spon
se r
eg
ula
tor
rece
ive
r:T
ran
scri
ptio
na
l re
gu
lato
ry p
rote
in,
sen
sor
his
tidin
e k
ina
se
2000 4000 6000 8000 10000 12000 14000 16000 18000 20000 22000 24000 26000 28000 30000 32000 34000 36000 38000 40000 42000 44000 46000 48000 50000 52000 54000 56000 58000 60000 62000 64000 66000 68000
Geobacillus kaustophilus Prophage-33,152 bp
seri
ne
pro
tein
kin
ase
site
-sp
eci
fic r
eco
mb
ina
se
ph
ag
e-l
ike
ele
me
nt P
BS
X p
rote
in
tra
nsc
rip
tion
al r
ep
ress
or
of P
BS
X g
en
es
tra
nsc
rip
tion
al r
eg
ula
tor
tra
nsc
rip
tion
al r
ep
ress
or
of P
BS
X g
en
es
tra
nsc
rip
tion
al r
eg
ula
tor
ba
cte
rio
ph
ag
e-r
ela
ted
pro
tein
ba
cte
rio
ph
ag
e-r
ela
ted
pro
tein
ph
ag
e-l
ike
ele
me
nt P
BS
X p
rote
in
ph
ag
e a
sso
cia
ted
-an
tire
pre
sso
r
site
-sp
eci
fic r
eco
mb
ina
se (
ph
ag
e in
teg
rase
fam
ily)
ph
ag
e-t
erm
ina
se la
rge
su
bu
nit
ph
ag
e-r
ela
ted
he
ad
po
rta
l pro
tein
seri
ne
pro
tea
se (
ph
ag
e r
ela
ted
-pro
tein
, Clp
P fa
mily
)
ba
cte
rio
ph
ag
e-r
ela
ted
pro
tein
ph
ag
e-r
ela
ted
pro
tein
(ta
il p
rote
in)
CD
S
2000 4000 6000 8000 10000 12000 14000 16000 18000 20000 22000 24000 26000 28000 30000 32000 34000 36000 38000
Gramella forsetii KT0803-49,512 nt
Me
rR fa
mily
tra
nsc
rip
tion
al r
eg
ula
tor
pro
tein
ph
ag
e in
teg
rase
fam
ily p
rote
in
HN
H e
nd
on
ucl
ea
se fa
mily
pro
tein
secr
ete
d p
rote
in
Lu
xR fa
mily
tra
nsc
rip
tion
al r
eg
ula
tor
pro
tein
Re
cT fa
mily
pro
tein
me
tallo
-be
ta-l
act
am
ase
do
ma
in p
rote
in
con
serv
ed
hyp
oth
etic
al p
rote
in-p
oss
ibly
DN
A-m
e th
yltr
an
sfe
rase
ph
ag
e D
NA
mo
difi
catio
n m
eth
yla
se
me
mb
ran
e p
rote
in
me
mb
ran
e p
rote
in
ph
ag
e s
ma
ll te
rmin
ase
ph
ag
e te
rmin
ase
larg
e s
ub
un
it
ph
ag
e p
ort
al p
rote
in
HN
H e
nd
on
ucl
ea
se fa
mily
pro
tein
ph
ag
e ta
il ta
pe
me
asu
re p
rote
in
me
mb
ran
e p
rote
inm
em
bra
ne
pro
tein
2000 4000 6000 8000 10000 12000 14000 16000 18000 20000 22000 24000 26000 28000 30000 32000 34000 36000 38000 40000 42000 44000 46000 48000 50000
Hahella chejuensis Prophage 1-27,588 nt
Hahella chejuensis Prophage 3-28,446 nt
Inte
gra
se
pu
tativ
e e
xon
ucl
ea
se
ph
ag
e p
ort
al p
rote
in, P
BS
X fa
mily
Mu
-lik
e p
rop
ha
ge
Flu
Mu
pro
tein
gp
28
Fla
ge
llar
bio
syn
the
sis/
typ
e II
I se
cre
tory
pat
hw
ay
pro
tein
ph
ag
e m
ajo
r ca
psi
d p
rote
in, P
2 fa
mily
pro
ba
ble
ph
age
sm
all
term
ina
se s
ub
un
itp
rob
ab
le p
ha
ge h
ea
d c
om
ple
teio
n p
rote
in
pro
ba
ble
ph
age
pro
tein
pro
ba
ble
ph
age
pro
tein
pro
ba
ble
ph
age
pro
tein
Dn
aK
su
pp
ress
or
pro
tein
pro
ba
ble
ph
age
pro
tein
pro
ba
ble
ph
age
pro
tein
pro
ba
ble
ph
age
pro
tein
un
cha
ract
eri
zed
ph
ag
e M
u p
rote
in g
p4
7-l
ike
pro
tein
pro
ba
ble
ph
age
pro
tein
pro
ba
ble
ph
age
pro
tein
2000 4000 6000 8000 10000 12000 14000 16000 18000 20000 22000 24000 26000 28000
Hahella chejuensis Prophage 2-18,002
inte
gra
tion
ho
st fa
cto
r, a
lph
a s
ub
un
itp
red
icte
d tr
an
scri
ptio
na
l re
gu
lato
rIn
teg
rase
Tra
nsp
osa
se a
nd
ina
ctiv
ate
d d
eri
vativ
es
Tra
nsp
osa
se a
nd
ina
ctiv
ate
d d
eri
vativ
es
gro
up
II in
tro
n-e
nco
din
g m
atu
rase
pro
ba
ble
ph
ag
e p
rote
in
Re
stri
ctio
n e
nd
on
ucl
ea
seH
isto
ne
ace
tyltr
an
sfe
rase
HP
A2
/re
late
d a
cety
ltra
nsf
era
sep
red
icte
d tr
an
scri
ptio
na
l re
gu
lato
r co
nta
inin
g th
e C
op
G/A
rc/M
etJ
DN
A-b
ind
ing
do
ma
inP
lasm
id s
tab
iliza
tion
sys
tem
pro
tein
ph
ag
e te
rmin
ase
, sm
all
sub
un
it, p
uta
tive
, P2
7 fa
mily
Ph
ag
e te
rmin
ase
-lik
e p
rote
in, l
arg
e s
ub
un
it
ph
ag
e p
ort
al p
rote
in, H
K9
7 fa
mily
ph
ag
e p
roh
ea
d p
rote
ase
, HK
97
fam
ily
ph
ag
e m
ajo
r ca
psi
d p
rote
in, H
K9
7 fa
mily
Ba
cte
rio
ph
ag
e h
ea
d-t
ail
ad
ap
tor
ba
cte
rio
ph
ag
e T
P9
01
-1 O
RF
40
-lik
e p
rote
in
2000 4000 6000 8000 10000 12000 14000 16000 18000
CD
S
Inte
gra
se
pro
ba
ble
ph
ag
e p
rote
in
pu
tativ
e e
xon
ucl
ea
se
ph
ag
e p
ort
al p
rote
in, P
BS
X fa
mily
Mu
-lik
e p
rop
ha
ge
Flu
Mu
pro
tein
gp
28
pro
ba
ble
ph
ag
e c
ap
sid
sca
ffold
ing
pro
tein
ph
ag
e m
ajo
r ca
psi
d p
rote
in, P
2 fa
mily
pro
ba
ble
ph
ag
e s
ma
ll te
rmin
ase
su
bu
nit
pro
ba
ble
ph
ag
e h
ea
d c
om
ple
teio
n p
rote
in
pro
ba
ble
ph
ag
e p
rote
in
pro
ba
ble
ph
ag
e p
rote
in
pro
ba
ble
ph
ag
e p
rote
in
Ph
ag
e-r
ela
ted
tail
pro
tein
pro
ba
ble
ph
ag
e p
rote
in
un
cha
ract
eri
zed
ph
age
Mu
pro
tein
gp
47
-lik
e p
rote
in
pro
ba
ble
ph
ag
e p
rote
in
pro
ba
ble
ph
ag
e p
rote
in
2000 4000 6000 8000 10000 12000 14000 16000 18000 20000 22000 24000 26000 28000 30000 32000
Hahella chejuensis Prophage 3
term
ina
se la
rge
su
bu
nit
po
rta
l pro
tein
ma
jor
cap
sid
ba
sep
late
ass
em
bly
pro
tein
Vb
ase
pla
te a
sse
mb
ly p
rote
inb
ase
pla
te a
sse
mb
ly p
rote
inp
ha
ge
tail
pro
tein
ph
ag
e ta
il p
rote
in
tail
fibe
r p
rote
in
tail
she
alth
pro
tein
tail
tub
e p
rote
in
ph
ag
e ta
il ta
pe
-me
asu
re p
rote
in
ph
ag
e ta
il p
rote
in
ph
ag
e ta
il p
rote
in
DN
A a
de
nin
e m
eth
yla
se
pa
rA
pro
telo
me
rase
rep
lica
tion
pro
tein
ph
ag
e r
ep
ress
or
pro
ph
ag
e a
ntir
ep
ress
or
ph
ag
e a
ntit
erm
ina
tor
2000 4000 6000 8000 10000 12000 14000 16000 18000 20000 22000 24000 26000 28000 30000 32000 34000 36000 38000
Halomonas aquamarina prophage HAP-1-39,245 nt
Loktanella vestfoldensis SKA53 GTA-14,202 bp
3-o
xoa
cyl-
(acy
l ca
rrie
r p
rote
in)
syn
tha
se
pu
tativ
e la
rge
term
ina
se
pu
tativ
e p
ort
al p
rote
in
ph
ag
e p
roh
ea
d p
rote
ase
, HK
97
fam
ilym
ajo
r ca
psi
d p
rote
in, H
K9
7 fa
mily
pu
tativ
e p
ha
ge
tail-
he
ad
ad
ap
tor
ma
jor
tail
pro
tein
, TP
90
1-1
fam
ily
pu
tativ
e p
ha
ge
tail
min
or
pro
tein
CO
G0
79
1 C
ell
wa
ll-a
sso
cia
ted
hyd
rola
ses
(in
vasi
on
-ass
oci
ate
d p
rote
ins)
seri
ne
ace
tyltr
an
sfe
rase
2000 4000 6000 8000 10000 12000 14000 16000
Maricaulis maris MCS10 GTA-16,193 bp
pe
nic
illin
-bin
din
g p
rote
in, 1
A fa
mily
Te
rmin
ase
ph
ag
e p
ort
al p
rote
in, H
K9
7 fa
mily
ph
ag
e p
roh
ea
d p
rote
ase
, HK
97
fam
ily
ph
ag
e m
ajo
r ca
psi
d p
rote
in, H
K9
7 fa
mily
ge
ne
tra
nsf
er
ag
en
t-lik
e p
rote
inp
ha
ge
ma
jor
tail
pro
tein
, TP
90
1-1
fam
ilyK
EG
G: r
pb
:RP
B_
34
64
ge
ne
tra
nsf
er
ag
en
t (G
TA
) o
rfg
10
rh
od
ob
act
er
cap
sula
tus
ge
ne
tra
nsf
er
ag
en
t (G
TA
) o
rfg
13
pu
tativ
e p
ha
ge
ce
ll w
all
pe
ptid
ase
, Nlp
C/P
60
fam
ily
ge
ne
tra
nsf
er
ag
en
t (G
TA
) o
rfg
15
cha
pe
ron
e D
na
J d
om
ain
pro
tein
Pro
pe
ptid
e, P
ep
SY
am
d p
ep
tida
se M
4tw
o c
om
po
ne
nt t
ran
scri
ptio
na
l re
gu
lato
r, w
ing
ed
he
lix fa
mily
pe
rip
lasm
ic s
en
sor
sig
na
l tra
nsd
uct
ion
his
tidin
e k
ina
se
Ccm
E/C
ycJ
pro
tein
2000 4000 6000 8000 10000 12000 14000 16000 18000 20000 22000
ph
osp
ho
glu
cosa
min
e m
uta
se [R
ose
ova
riu
s sp
. TM
10
35
]-lik
e
Ace
tate
--C
oA
lig
ase
[Ro
seo
vari
us
sp. T
M1
03
5]-
like
TR
AP
tra
nsp
ort
er
Pu
tativ
e m
em
bra
ne
tra
nsp
ort
er
Alc
oh
ol d
eh
ydro
ge
na
se (
gro
ES
)
pu
tativ
e S
-fo
rmyl
glu
tath
ion
e h
ydro
lase
pro
tein
(ca
rbo
xyl e
ste
rase
??
)
Lys
R-f
am
ily tr
an
scri
ptio
na
l re
gu
lato
r [S
tap
pia
ag
gre
ga
ta IA
M 1
26
14
]-lik
e
Rp
lI, r
ibo
som
al p
rote
in L
9 [P
arv
iba
culu
m la
vam
en
tivo
ran
s D
S-1
]-lik
eR
ibo
som
al p
rote
in S
6 [a
lph
a p
rote
ob
act
eri
um
HT
CC
22
55
]-lik
e
de
hyd
rog
en
ase
with
diff
ere
nt s
pe
cific
itie
s [P
arv
iba
culu
m la
vam
en
tivo
ran
s D
S-1
]-lik
e
be
ta-k
eto
acy
l syn
tha
se [R
ho
do
ba
cte
r sp
ha
ero
ide
s A
TC
C 1
70
29
]-lik
ea
min
od
eo
xych
ori
sma
te ly
ase
[Din
oro
seo
ba
cte
r sh
iba
e D
FL
12
]-lik
e
Te
rmin
ase
Po
rta
l pro
tein
, HK
97
fam
ily p
rote
in
ma
jor
cap
sid
pro
tein
, HK
97
fam
ily p
rote
in
he
ad
-ta
il a
da
pto
r, p
uta
tive
Alc
oh
ol d
eh
ydro
ge
na
se G
roE
S d
om
ain
pro
tein
2000 4000 6000 8000 10000 12000 14000 16000 18000 20000 22000 24000 26000
Sargasso Sea BAC-27,775 bp-GTA-like
Mu
tT/N
UD
IX fa
mily
pro
tein
me
thyl
-acc
ep
ting
ch
em
ota
xis
pro
tein
pu
tativ
e te
lluri
te r
esi
sta
nce
pro
tein
-re
late
d p
rote
in
dig
ua
nyl
ate
cyc
lase
(G
GD
EF
do
ma
in)
pro
ph
ag
e M
uS
o2
, tra
nsc
rip
tion
al r
eg
ula
tor,
Cro
/CI f
am
ily p
rote
in
tra
nsp
osa
se, p
uta
tive
pro
ph
ag
e M
uS
o2
, DN
A tr
an
spo
sitio
n p
rote
in, p
uta
tive
CO
G4
38
2 M
u-l
ike
pro
ph
ag
e p
rote
in g
p1
6
pro
ph
ag
e M
uS
o2
, po
sitiv
e r
eg
ula
tor
of l
ate
tra
nsc
rip
tion
, pu
tativ
e
C4
-typ
e z
inc
fing
er
pro
tein
, Dks
A/T
raR
fam
ily p
rote
in
pro
ph
ag
e M
uS
o2
, po
rta
l pro
tein
, pu
tativ
e
CO
G4
38
3 M
u-l
ike
pro
ph
ag
e p
rote
in g
p2
9
pro
ph
ag
e M
uS
o2
, F p
rote
in, p
uta
tive
pro
ph
ag
e M
uS
o2
, pro
tein
Gp
32
, pu
tativ
eC
DS
pro
ph
ag
e M
uS
o2
, ma
jor
he
ad
su
bu
nit,
pu
tativ
e
CO
G4
38
7 M
u-l
ike
pro
ph
ag
e p
rote
in g
p3
6p
rop
ha
ge
Mu
So
2, v
irio
n m
orp
ho
ge
ne
sis
pro
tein
, pu
tativ
e
pro
ph
ag
e M
uS
o2
, ta
il sh
ea
th p
rote
in, p
uta
tive
CO
G4
51
8 M
u-l
ike
pro
ph
ag
e F
luM
u p
rote
in g
p4
1
CO
G3
94
1 M
u-l
ike
pro
ph
ag
e p
rote
in
pro
ph
ag
e M
uS
o2
, DN
A c
ircu
latio
n p
rote
in, p
uta
tive
pro
ph
ag
e M
uS
o2
, 43
kD
a ta
il p
rote
in, p
uta
tive
pro
ph
ag
e M
uS
o2
, ba
sep
late
ass
em
bly
pro
tein
VC
OG
43
81
Mu
-lik
e p
rop
ha
ge
pro
tein
gp
46
CO
G3
29
9 U
nch
ara
cte
rize
d h
om
olo
g o
f ph
ag
e M
u p
rote
in g
p4
7
pro
ph
ag
e M
uS
o2
, ta
il fib
er
pro
tein
, pu
tativ
e
sen
sory
bo
x se
nso
r/G
GD
EF
/EA
L d
om
ain
pro
tein
2000 4000 6000 8000 10000 12000 14000 16000 18000 20000 22000 24000 26000 28000 30000 32000 34000 36000 38000
Marinomonas Mu-like prophage-37,598 nt
Enterobacteriophage Mu-36,717 bp
am
ino
pe
ptid
ase
, pu
tativ
e
tra
nsc
rip
tion
al r
eg
ula
tor
tail
fibe
r p
rote
in H
, pu
tativ
e
N-a
cety
lmu
ram
oyl
-L-a
lan
ine
am
ida
se, p
uta
tive
pro
ph
ag
e P
SP
PH
06
, pu
tativ
e ta
il tu
be
pro
tein
pro
ph
ag
e P
SP
PH
06
, pu
tativ
e ta
il sh
ea
th p
rote
in
pu
tativ
e e
ste
rase
pu
tativ
e p
ha
ge
pro
tein
Pu
tativ
e r
ep
lica
tion
pro
tein
pro
ba
ble
tra
nsc
rip
tion
al r
eg
ula
tor
two
-co
mp
on
en
t re
spo
nse
re
gu
lato
rC
DS
2000 4000 6000 8000 10000 12000 14000 16000 18000 20000 22000 24000 26000
Marinomonas sp. MED 121-Fragment-23,996 bp
Mariprofundus ferrooxydans PV-1-Mu-like-23,277 nt
CO
G2
25
2 P
erm
ea
ses
CO
G2
93
2 P
red
icte
d tr
ansc
rip
tion
al r
eg
ula
tor
CO
G3
42
3 P
red
icte
d tr
ansc
rip
tion
al r
eg
ula
tor
Tra
nsp
osa
se a
nd
ina
ctiv
ate
d d
eri
vativ
e
pro
ph
ag
e M
uS
o1
, DN
A tr
an
spo
sitio
n p
rote
in, p
uta
tive
Mu
-lik
e p
rop
ha
ge
Flu
Mu
ho
st-n
ucl
ea
se in
hib
itor
pro
tein
Ga
m
lyso
zym
e, p
uta
tive
Mu
-lik
e p
ha
ge
gp
27
CO
G4
37
3 M
u-l
ike
pro
pha
ge
Flu
Mu
pro
tein
gp
28
CO
G4
38
3 M
u-l
ike
pro
pha
ge
pro
tein
gp
29
ba
cte
rio
ph
ag
e M
u G
P30
-lik
e p
rote
in
po
ssib
le b
act
eri
op
ha
ge
Mu
G-l
ike
pro
tein
pu
tativ
e I
pro
tein
Mu
-lik
e p
rop
ha
ge
ma
jor
he
ad
su
bu
nit
gp
T
2000 4000 6000 8000 10000 12000 14000 16000 18000 20000 22000
Sim
ilar
to s
ulfu
r o
xid
atio
n p
rote
in S
oxH
site
-sp
eci
fic r
eco
mb
ina
se, p
ha
ge
inte
gra
se fa
mily
ph
ag
e-r
ela
ted
DN
A-b
ind
ing
pro
tein
ph
ag
e te
rmin
ase
, sm
all
sub
un
it
ph
ag
e te
rmin
ase
, la
rge
su
bu
nit
Mo
aA
/NifB
/Pq
qE
fam
ily p
rote
in
2000 4000 6000 8000 10000 12000 14000 16000 18000 20000 22000 24000 26000 28000 30000 32000 34000 36000 38000 40000
Nitratiruptor sp. SB155-2-37,734 bp
Nitrobacter sp. Nb-311A- prophage 1-20,638
resp
on
se r
eg
ula
tor
rece
ive
r
ph
ag
e-r
ela
ted
inte
gra
se
CO
G1
75
8 D
NA
-dir
ect
ed
RN
A p
oly
me
rase
, su
bu
nit
K/o
me
ga
AT
P-b
ind
ing
re
gio
n, A
TP
ase
-lik
e
Ph
ag
e te
rmin
ase
Gp
A
CO
G4
22
0 P
ha
ge
DN
A p
ack
ag
ing
pro
tein
, Nu
1 s
ub
un
it o
f te
rmin
ase
CO
G5
28
3 P
ha
ge
-re
late
d ta
il p
rote
in
CO
G5
51
1 B
act
eri
op
ha
ge
ca
psi
d p
rote
inC
OG
01
50
Ph
osp
ho
rib
osy
lam
ino
imid
azo
le (
AIR
) sy
nth
eta
se
reso
lva
sep
uta
tive
ba
cte
rio
ph
ag
e-r
ela
ted
pro
tein
CO
G0
64
2 S
ign
al t
ran
sdu
ctio
n h
istid
ine
kin
ase
2000 4000 6000 8000 10000 12000 14000 16000 18000 20000 22000
Nitrobacter sp. Nb-311A- prophage 2-GTA-like
Ph
ag
e m
ajo
r ta
il p
rote
ing
en
e tr
an
sfe
r a
ge
nt (
GT
A)
like
pro
tein
ge
ne
tra
nsf
er
ag
en
t (G
TA
) lik
e p
rote
in
pu
tativ
e p
ha
ge
ce
ll w
all
pe
ptid
ase
ge
ne
tra
nsf
er
ag
en
t (G
TA
) o
rfg
15
, lik
e p
rote
in
po
ssib
le g
lyco
hyd
rola
se
2000 4000 6000 8000 10000
Nitrobacter sp. Nb-311A- prophage 3-GTA-likeT
ran
scri
ptio
na
l re
gu
lato
ry p
rote
in, L
uxR
fam
ily p
rote
in
pro
ph
ag
e L
am
bd
aS
o, h
olin
, pu
tativ
e
ph
ag
e te
rmin
ase
, la
rge
su
bu
nit,
pu
tativ
e
po
rta
l pro
tein
, H
K9
7 fa
mily
, pu
tativ
e
hyp
oth
etic
al p
rote
ase
ma
jor
cap
sid
pro
tein
, HK
97
fam
ily p
rote
in
ge
ne
tra
nsf
er
ag
en
t-lik
e p
rote
in
tail
ass
em
bly
pro
tein
, pu
tativ
e
tail
fibe
r p
rote
in, p
uta
tive
pu
tativ
e m
em
bra
ne
-an
cho
red
ce
ll su
rfa
ce p
rote
in
1000 2000 3000 4000 5000 6000 7000 8000 9000 10000 11000 12000 13000 14000 15000 16000 17000 18000 19000 20000 21000
Nitrosococcus oceani ATCC 19707-bacteriocin? C
DS
CD
S
pu
tativ
e tr
an
spo
sase
He
lix-t
urn
-he
lix, F
is-t
ype
AA
A A
TP
ase
Om
pA
/Mo
tB
Ph
ag
e ta
il sh
ea
th p
rote
in
Co
nse
rve
d h
ypo
the
tica
l ph
ag
e ta
il re
gio
n p
rote
in
Pe
ptid
og
lyca
n-b
ind
ing
Lys
M
Ph
ag
e b
ase
pla
te a
sse
mb
ly p
rote
in W
Ph
ag
e ta
il p
rote
in
2000 4000 6000 8000 10000 12000 14000 16000 18000 20000 22000 24000 26000 28000 30000 32000 34000 36000
catio
n c
ha
nn
el f
am
ily p
rote
in
po
ssib
le p
en
icill
in-b
ind
ing
pro
tein
Ph
ag
e D
NA
Pa
cka
gin
g P
rote
in
pu
tativ
e p
ort
al p
rote
in
ph
ag
e p
roh
ea
d p
rote
ase
, HK
97
fam
ily p
rote
in
Ph
ag
e m
ajo
r ca
psi
d p
rote
in,
ge
ne
tra
nsf
er
ag
en
t (G
TA
) lik
e p
rote
in
CO
G5
28
1 P
ha
ge
-re
late
d m
ino
r ta
il p
rote
inR
ho
do
ba
cte
r ca
psu
latu
s g
en
e tr
an
sfe
r a
ge
nt (
GT
A)o
rfg
12
Ce
ll w
all-
ass
oci
ate
d h
ydro
lase
s (i
nva
sio
n-a
sso
cia
ted
pro
tein
s)
pu
tativ
e D
na
J/C
bp
A-t
ype
pro
tein
sen
sor
his
tidin
e k
ina
se
2000 4000 6000 8000 10000 12000 14000 16000 18000 20000 22000
Oceanicaulis alexandrii HTCC2633-GTA
Pu
tativ
e la
rge
term
ina
se
ma
jor
cap
sid
pro
tein
, HK
97
fam
ily p
rote
in
he
ad
-ta
il a
da
pto
r, p
uta
tive
ma
jor
tail
pro
tein
, TP
90
1-1
fam
ily p
rote
in
pu
tativ
e p
ha
ge
tail
min
or
pro
tein
CO
G0
79
1 C
ell
wa
ll-a
sso
cia
ted
hyd
rola
ses
(in
vasi
on
-ass
oci
ate
d p
rote
ins)
seri
ne
ace
tyltr
an
sfe
rase
2000 4000 6000 8000 10000
Oceanicola granulosus HTCC2516-GTA-like
Oceanobacillus iheyensis HTE831??-12,912 bp
ep
ide
rma
l su
rfa
ce a
ntig
en
ph
ag
e-l
ike
ele
me
nt P
BS
X p
rote
in X
KD
Atr
an
scri
ptio
na
l re
pre
sso
r o
f PB
SX
ge
ne
s
RN
A p
oly
me
rase
sig
ma
-70
fact
or
ba
cte
rio
ph
ag
e r
ela
ted
pro
tein
ba
cte
rio
ph
ag
e r
ela
ted
pro
tein
ba
cte
rio
ph
ag
e m
ino
r ta
il su
bu
nit
ba
cte
rio
ph
ag
e r
ela
ted
pro
tein
N-a
cety
lmu
ram
oyl
-L-a
lan
ine
am
ida
se
ad
he
sio
n p
rote
in A
dp
2000 4000 6000 8000 10000 12000 14000
Photobacterium profundum 3TCK-26,131 nt
CO
G1
40
3 R
est
rict
ion
en
do
nu
cle
ase
hyp
oth
etic
al t
erm
ina
se e
nco
de
d b
y p
rop
ha
ge
Hyp
oth
etic
al p
ha
ge
po
rta
l pro
tein
hyp
oth
etic
al C
lpP
fam
ily s
eri
ne
pro
tea
se, p
oss
ible
ph
ag
e r
ela
ted
hyp
oth
etic
al g
p3
6 m
ajo
r ca
psi
d-l
ike
pro
tein
CO
G5
61
4 B
act
eri
op
ha
ge
he
ad
-ta
il a
da
pto
r
CO
G1
19
6 C
hro
mo
som
e s
eg
reg
atio
n A
TP
ase
s
CO
G3
20
3 O
ute
r m
em
bra
ne
pro
tein
(p
ori
n)
Hyp
oth
etic
al b
act
eri
op
ha
ge
re
plic
atio
n p
rote
in P
Hyp
oth
etic
al t
ran
scri
ptio
na
l re
gu
lato
r
2000 4000 6000 8000 10000 12000 14000 16000 18000 20000 22000 24000 26000
Photobacterium profundum SS9 chromosome 2-42,963 nt
hyp
oth
etic
al p
ha
ge
inte
gra
se fa
mily
hyp
oth
etic
al g
p3
6 m
ajo
r ca
psi
d-l
ike
pro
tein
hyp
oth
etic
al C
lpP
fam
ily s
eri
ne
pro
tea
se, p
oss
ible
ph
ag
e r
ela
ted
hyp
oth
etic
al p
ha
ge
po
rta
l pro
tein
hyp
oth
etic
al t
erm
ina
se la
rge
su
bu
nit
hyp
oth
etic
al t
ran
spo
sase
hyp
oth
etic
al t
ran
scri
ptio
na
l re
gu
lato
r, L
ysR
fam
ily
hyp
oth
etic
al t
ran
spo
sase
hyp
oth
etic
al D
NA
he
lica
se
hyp
oth
etic
al t
ran
scri
ptio
na
l re
gu
lato
r
2000 4000 6000 8000 10000 12000 14000 16000 18000 20000 22000 24000 26000 28000 30000 32000 34000 36000 38000 40000 42000 44000
Photobacterium profundum SS9 chromosome 2-40,505 nt
pu
tativ
e T
rkA
fam
ily p
rote
in
inte
gra
se
hyp
ote
thic
al t
ran
scri
ptio
na
l re
gu
lato
r
hyp
oth
etic
al G
ifsy-
1 p
rop
ha
ge
pu
tativ
e e
xod
eo
xyri
bo
nu
cle
ase
VIII
term
ina
se s
ma
ll su
bu
nit
term
ina
se la
rge
su
bu
nit
pu
tativ
e h
ea
d-t
ail
con
ne
cto
r p
rote
in
pu
tativ
e e
nd
op
rote
ase
pu
tativ
e ta
il fib
er
pro
tein
pu
tativ
e to
xin
se
cre
tion
AT
P-b
ind
ing
pro
tein
2000 4000 6000 8000 10000 12000 14000 16000 18000 20000 22000 24000 26000 28000 30000 32000 34000 36000 38000 40000 42000
Pseudoalteromonas tunicata D2-36,166 nt T
ran
scri
ptio
na
l re
gu
lato
rC
OG
13
96
Pre
dic
ted
tra
nsc
rip
tion
al r
eg
ula
tors
CO
G4
19
7 U
nch
ara
cte
rize
d p
rote
in c
on
serv
ed
in b
act
eri
a, p
rop
ha
ge
-re
late
dC
OG
08
27
Ad
en
ine
-sp
eci
fic D
NA
me
thyl
ase
AT
P-d
ep
en
de
nt 2
6S
pro
tea
som
e r
eg
ula
tory
su
bu
nit
pu
tativ
e p
ha
ge
pro
tein
ph
ag
e p
ort
al p
rote
in, P
BS
X fa
mily
pro
tein
Mu
-lik
e p
rop
ha
ge
Flu
Mu
pro
tein
gp
28
pro
ba
ble
ca
psi
d s
caffo
ldin
g p
rote
in
ph
ag
e m
ajo
r ca
psi
d p
rote
in, P
2 fa
mily
pro
tein
pro
ba
ble
ph
ag
e s
ma
ll te
rmin
ase
su
bu
nit
pro
ba
ble
ph
ag
e p
rote
in
pro
ba
ble
ph
ag
e p
rote
in
pro
ba
ble
ph
ag
e p
rote
in
pro
ba
ble
ph
ag
e p
rote
in
CO
G5
28
3 P
ha
ge
-re
late
d ta
il p
rote
in
pro
ba
ble
ph
ag
e p
rote
inu
nch
ara
cte
rize
d p
ha
ge
Mu
pro
tein
gp
47
-lik
e p
rote
in
pro
ba
ble
ph
ag
e p
rote
in
pro
ba
ble
ph
ag
e p
rote
in
site
-sp
eci
fic r
eco
mb
ina
se, p
ha
ge
inte
gra
se fa
mily
pro
tein
2000 4000 6000 8000 10000 12000 14000 16000 18000 20000 22000 24000 26000 28000 30000 32000 34000 36000
Rhodobacterales HTCC2150 GTA-14,569 nt
am
ino
de
oxy
cho
rism
ate
lya
se
Pu
tativ
e la
rge
term
ina
se
po
rta
l pro
tein
, HK
97
fam
ily
ph
ag
e p
roh
ea
d p
rote
ase
, HK
97
fam
ily p
rote
in
ph
ag
e m
ajo
r ca
psi
d p
rote
in, H
K9
7
he
ad
-ta
il a
da
pto
r, p
uta
tive
ma
jor
tail
pro
tein
, TP
90
1-1
fam
ily
Ph
ag
e-r
ela
ted
min
or
tail
pro
tein
-lik
e
CO
G0
79
1 C
ell
wa
ll-a
sso
cia
ted
hyd
rola
ses
(in
vasi
on
-ass
oci
ate
d p
rote
ins)
seri
ne
ace
tyltr
an
sfe
rase
2000 4000 6000 8000 10000 12000 14000 16000
Rhodobacterales HTCC2654-31,840 nt
Site
-sp
eci
fic r
eco
mb
ina
se X
erD
-lik
eD
NA
-bin
din
g p
rote
in, p
uta
tive
pro
ba
ble
ph
ag
e-r
ela
ted
lyso
zym
e
tail
fibe
r p
rote
in, p
uta
tive
tail
ass
em
bly
pro
tein
, pu
tativ
e
Pe
ptid
ase
U7
CO
G5
51
1 B
act
eri
op
ha
ge
ca
psi
d p
rote
in
pu
tativ
e p
ha
ge
term
ina
se la
rge
su
bu
nit
pu
tativ
e p
ha
ge
term
ina
se la
rge
su
bu
nit
CO
G4
22
0 P
ha
ge
DN
A p
ack
ag
ing
pro
tein
, Nu
1 s
ub
un
it o
f te
rmin
ase
CO
G0
35
8 D
NA
pri
ma
se (
ba
cte
ria
l typ
e)
ad
en
ine
-sp
eci
fic D
NA
me
thyl
tra
nsf
era
se
2000 4000 6000 8000 10000 12000 14000 16000 18000 20000 22000 24000 26000 28000 30000 32000
Rhodobacterales HTCC2654-GTA 16,708 nt
Pu
tativ
e la
rge
term
ina
se
pu
tativ
e p
ort
al p
rote
in
ph
ag
e p
roh
ea
d p
rote
ase
, HK
97
fam
ily p
rote
in
ma
jor
cap
sid
pro
tein
, HK
97
fam
ily p
rote
in
he
ad
-ta
il ad
ap
tor,
pu
tativ
em
ajo
r ta
il p
rote
in, T
P9
01
-1 fa
mily
pro
tein
pu
tativ
e p
ha
ge
tail
min
or
pro
tein
CO
G0
79
1 C
ell
wa
ll-a
sso
cia
ted
hyd
rola
ses
(in
vasi
on
-ass
oci
ate
d p
rote
ins)
po
ssib
le c
yto
chro
me
P4
50
hyd
roxy
lase
su
pe
rfam
ily p
rote
ins
seri
ne
O-a
cety
ltra
nsf
era
se
2000 4000 6000 8000 10000 12000 14000 16000
CD
SC
OG
15
59
Pre
dic
ted
pe
rip
lasm
ic s
olu
te-b
ind
ing
pro
tein
CO
G0
46
4 A
TP
ase
s o
f th
e A
AA
+ c
lass
Pu
tativ
e la
rge
term
ina
se
ph
ag
e p
ort
al p
rote
in, p
uta
tive
ph
ag
e p
roh
ea
d p
rote
ase
, HK
97
fam
ily p
rote
in
ph
ag
e m
ajo
r ca
psi
d p
rote
in, H
K9
7
he
ad
-ta
il a
da
pto
r, p
uta
tive
ma
jor
tail
pro
tein
, TP
90
1-1
fam
ily
CO
G5
28
1 P
ha
ge
-re
late
d m
ino
r ta
il p
rote
in
CO
G0
79
1 C
ell
wa
ll-a
sso
cia
ted
hyd
rola
ses
(in
vasi
on
-ass
oci
ate
d p
rote
ins)
seri
ne
O-a
cety
ltra
nsf
era
se
2000 4000 6000 8000 10000 12000 14000 16000
Roseobacter CCS2 GTA
Roseobacter denitrificans Och114-GTA-13,783 nt
3-o
xoa
cyl-
(acy
l ca
rrie
r p
rote
in)
syn
tha
se II
am
ino
de
oxy
cho
rism
ate
lya
se
Te
rmin
ase
, lg
su
bu
nit
ph
ag
e p
roh
ea
d p
rote
ase
, pu
tativ
e
ph
ag
e m
ajo
r ca
psi
d p
rote
in, p
uta
tive
ph
ag
e h
ea
d-t
ail
ad
ap
tor,
pu
tativ
e
ma
jor
tail
pro
tein
, TP
90
1-1
fam
ily p
rote
in
Pu
tativ
e p
ha
ge
tail
min
or
pro
tein
seri
ne
ace
tyltr
an
sfe
rase
2000 4000 6000 8000 10000 12000 14000 16000
Roseobacter MED 193-47,496 ntC
OG
05
53
Su
pe
rfa
mily
II D
NA
/RN
A h
elic
ase
s, S
NF
2 fa
mily
CO
G3
18
3 P
red
icte
d r
est
rict
ion
en
do
nu
cle
ase
very
sh
ort
pa
tch
re
pa
ir p
rote
in (
DN
A m
ism
atc
h e
nd
on
ucl
ea
se)
CO
G0
27
0 S
ite-s
pe
cific
DN
A m
eth
yla
se
pu
tativ
e b
act
eri
op
ha
ge
-re
late
d p
rote
in
reso
lva
se
CO
G0
47
4 C
atio
n tr
an
spo
rt A
TP
ase
CO
G1
59
5 D
NA
-dir
ect
ed
RN
A p
oly
me
rase
sp
eci
aliz
ed
sig
ma
su
bu
nit,
sig
ma
24
ho
mo
log
hyp
oth
etic
al p
rote
inh
ypo
the
tica
l pro
tein
pu
tativ
e D
EA
D b
ox
fam
ily h
elic
ase
, ph
ag
e a
sso
cia
ted
rest
rict
ion
en
do
nu
cle
ase
DN
A m
eth
yla
se N
-4/N
-6
CO
G5
52
5 B
act
eri
op
ha
ge
tail
ass
em
bly
pro
tein
CO
G5
51
1 B
act
eri
op
ha
ge
ca
psi
d p
rote
in
pu
tativ
e p
rote
ase
/sca
ffold
pro
tein
pro
ba
ble
ba
cte
rio
ph
ag
e-r
ela
ted
pro
tein
Po
ten
tial p
ha
ge
tail
tap
e m
ea
sure
pro
tein
ho
st s
pe
cific
ity p
rote
in J
, in
tern
al d
ele
tion
SE
C-C
mo
tif p
rote
in
pro
ph
ag
e M
uM
c02
, he
ad
de
cora
tion
pro
tein
, pu
tativ
e
CD
S
2000 4000 6000 8000 10000 12000 14000 16000 18000 20000 22000 24000 26000 28000 30000 32000 34000 36000 38000 40000 42000 44000 46000 48000
CD
S
term
ina
se, l
arg
e s
ub
un
it, p
uta
tive
po
rta
l pro
tein
, HK
97
fam
ily p
rote
in
ph
ag
e p
roh
ea
d p
rote
ase
, HK
97
fam
ily p
rote
in
ma
jor
cap
sid
pro
tein
, HK
97
fam
ily p
rote
in
he
ad
-ta
il a
da
pto
r, p
uta
tive
ma
jor
tail
pro
tein
, TP
90
1-1
fam
ily p
rote
in
pu
tativ
e p
ha
ge
tail
min
or
pro
tein
CO
G0
79
1 C
ell
wa
ll-a
sso
cia
ted
hyd
rola
ses
(in
vasi
on
-ass
oci
ate
d p
rote
ins)
seri
ne
O-a
cety
ltra
nsf
era
se
2000 4000 6000 8000 10000 12000 14000 16000
Roseobacter MED 193-14,668 nt-GTA
Roseobacter sp. SK209-2-6-41,094 bpp
uta
tive
tra
nsp
ort
pro
tein
Inte
gra
se
ace
tyltr
an
sfe
rase
, pu
tativ
e
pu
tativ
e r
ep
lica
tion
pro
tein
tra
nsc
rip
tion
an
tite
rmin
atio
n p
rote
in N
usG
pro
ph
ag
e L
am
bd
aS
o, h
olin
, pu
tativ
e
hyp
oth
etic
al p
ha
ge
term
ina
se la
rge
su
bu
nit
po
rta
l pro
tein
, HK
97
fam
ily, p
uta
tive
Pe
rip
lasm
ic s
eri
ne
pro
tea
se
ma
jor
cap
sid
pro
tein
, HK
97
fam
ily
tail
ass
em
bly
pro
tein
, pu
tativ
e
tail
fibe
r p
rote
in, p
uta
tive
N6
ad
en
ine
-sp
eci
fic D
NA
me
thyl
tra
nsf
era
se, N
12
cla
ss
2000 4000 6000 8000 10000 12000 14000 16000 18000 20000 22000 24000 26000 28000 30000 32000 34000 36000 38000 40000 42000 44000
oxi
do
red
uct
ase
, FA
D-b
ind
ing
pro
tein
Pu
tativ
e la
rge
term
ina
se
po
rta
l pro
tein
, HK
97
fam
ily p
rote
in
CO
G1
19
6 C
hro
mo
som
e s
eg
reg
atio
n A
TP
ase
sp
ha
ge
pro
he
ad
pro
tea
se, H
K9
7 fa
mily
pro
tein
ma
jor
cap
sid
pro
tein
, HK
97
fam
ily p
rote
in
he
ad
-ta
il a
da
pto
r, p
uta
tive
ma
jor
tail
pro
tein
, TP
90
1-1
fam
ily p
rote
in
pu
tativ
e p
ha
ge
tail
min
or
pro
tein
CO
G0
79
1 C
ell
wa
ll-a
sso
cia
ted
hyd
rola
ses
(in
vasi
on
-ass
oci
ate
d p
rote
ins)
seri
ne
O-a
cety
ltra
nsf
era
se
2000 4000 6000 8000 10000 12000 14000 16000
Roseovarius nubinhibens ISM-GTA-13,917 bp
carb
on
mo
no
xid
e d
eh
ydro
ge
na
se o
pe
ron
C p
rote
in
po
rta
l pro
tein
, HK
97
fam
ily p
rote
in
ph
ag
e p
roh
ea
d p
rote
ase
, HK
97
fam
ily p
rote
in
ma
jor
cap
sid
pro
tein
, HK
97
fam
ily p
rote
in
he
ad
-ta
il a
da
pto
r, p
uta
tive
ma
jor
tail
pro
tein
, TP
90
1-1
fam
ily p
rote
in
pu
tativ
e p
ha
ge
tail
min
or
pro
tein
CO
G0
79
1 C
ell
wa
ll-a
sso
cia
ted
hyd
rola
ses
(in
vasi
on
-ass
oci
ate
d p
rote
ins)
ad
en
ylo
succ
ina
te ly
ase
ad
en
ylo
succ
ina
te ly
ase
typ
e I
secr
etio
n ta
rge
t re
pe
at p
rote
in
typ
e I
secr
etio
n ta
rge
t re
pe
at p
rote
in
2000 4000 6000 8000 10000 12000 14000 16000 18000
Roseovarius sp 217 Defective GTA-12,339 bp
au
toin
du
cer-
bin
din
g tr
an
scri
ptio
na
l re
gu
lato
r L
uxR
au
toin
du
cer
syn
the
sis
pro
tein
trig
ge
r fa
cto
r
Ph
ag
e p
ort
al p
rote
in
pe
ptid
ase
U3
5, p
ha
ge
pro
he
ad
Ph
ag
e m
ajo
r ca
psi
d p
rote
in
HN
H n
ucl
ea
se /
Pro
ba
ble
ph
ag
e P
HI-
10
5 h
olin
-lik
e p
rote
inU
nch
ara
cte
rize
d p
ha
ge
pro
tein
(p
oss
ible
DN
A p
ack
ag
ing
)P
ha
ge
pro
tein
, HK
97
ph
ag
e h
ea
d-t
ail
ad
ap
tor,
pu
tativ
ep
ha
ge
term
ina
se, s
ma
ll su
bu
nit,
pu
tativ
e, P
27
ph
ag
e T
erm
ina
se
CO
G1
13
1 A
BC
-typ
e m
ulti
dru
g tr
an
spo
rt s
yste
m, A
TP
ase
co
mp
on
en
t
2000 4000 6000 8000 10000 12000 14000 16000
Roseovarius sp 217 Second GTA rearranged-17,038 bp
Roseovarius sp. HTCC2601-36,890 bp a
de
nin
e-s
pe
cific
DN
A m
eth
yltr
an
sfe
rase
Pro
ph
ag
e in
teg
rase
CO
G1
39
6 P
red
icte
d tr
an
scri
ptio
na
l re
gu
lato
rs
CO
G0
35
8 D
NA
pri
ma
se (
ba
cte
ria
l typ
e)
pro
ph
ag
e L
am
bd
aW
1, t
erm
ina
se la
rge
su
bu
nit,
pu
tativ
e
Ba
cte
rio
ph
ag
e c
ap
sid
str
uct
ura
l pro
tein
Pe
rip
lasm
ic s
eri
ne
pro
tea
se
ph
ag
e-r
ela
ted
tail
pro
tein
tail
ass
em
bly
pro
tein
, pu
tativ
e
tail
fibe
r p
rote
in, p
uta
tive
CD
S
Ph
ag
e-r
ela
ted
lyso
zym
e
2000 4000 6000 8000 10000 12000 14000 16000 18000 20000 22000 24000 26000 28000 30000 32000 34000 36000 38000
NA
D(P
)+ tr
an
shyd
rog
en
ase
, be
ta s
ub
un
it
term
ina
se, l
arg
e s
ub
un
it, p
uta
tive
GC
N5
-re
late
d N
-ace
tyltr
an
sfe
rase
po
rta
l pro
tein
, HK
97
fam
ily
ph
ag
e p
roh
ea
d p
rote
ase
, HK
97
fam
ily p
rote
in
ma
jor
cap
sid
pro
tein
, HK
97
fam
ily
pu
tativ
e p
ha
ge
tail-
he
ad
ad
ap
tor
ma
jor
tail
pro
tein
, TP
90
1-1
fam
ily
pu
tativ
e p
ha
ge
tail
min
or
pro
tein
CO
G0
79
1 C
ell
wa
ll-a
sso
cia
ted
hyd
rola
ses
(in
vasi
on
-ass
oci
ate
d p
rote
ins)
seri
ne
O-a
cety
ltra
nsf
era
se
1000 2000 3000 4000 5000 6000 7000 8000 9000 10000 11000 12000 13000 14000 15000 16000 17000
Roseovarius sp. HTCC2601-GTA 15,873 bp
Roseovarius sp. HTCC2601-3rd prophage-23,647 bp
tRN
A (
5-m
eth
yla
min
om
eth
yl-2
-th
iou
rid
yla
te)-
me
thyl
tra
nsf
era
se
DN
A-b
ind
ing
re
spo
nse
re
gu
lato
r C
trA
Inte
gra
se
Ba
cte
rio
ph
ag
e p
hi 1
.45
pro
tein
-lik
e p
rote
in
CO
G0
18
8 T
ype
IIA
top
ois
om
era
se (
DN
A g
yra
se/to
po
II, t
op
ois
om
era
se IV
), A
su
bu
nit
CO
G2
93
2 P
red
icte
d tr
an
scri
ptio
na
l re
gu
lato
r
CO
G0
18
7 T
ype
IIA
top
ois
om
era
se (
DN
A g
yra
se/to
po
II, t
op
ois
om
era
se IV
), B
su
bu
nit
pro
ph
ag
e L
am
bd
aW
4, t
erm
ina
se la
rge
su
bu
nit,
pu
tativ
e
Ba
cte
rio
ph
ag
e c
ap
sid
str
uct
ura
l pro
tein
Pe
rip
lasm
ic s
eri
ne
pro
tea
se
2000 4000 6000 8000 10000 12000 14000 16000 18000 20000 22000 24000
Sagittula stellata E-37-GTA 14,778 nt
asp
art
ate
am
ino
tra
nsf
era
se
term
ina
se, l
arg
e s
ub
un
it, p
uta
tive
po
rta
l pro
tein
, HK
97
fam
ily
ph
ag
e p
roh
ea
d p
rote
ase
, HK
97
fam
ily p
rote
in
Pre
dic
ted
ph
ag
e p
hi-
C3
1 g
p3
6 m
ajo
r ca
psi
d-l
ike
pro
tein
he
ad
-ta
il a
da
pto
r, p
uta
tive
ma
jor
tail
pro
tein
, TP
90
1-1
fam
ily
Ph
ag
e-r
ela
ted
min
or
tail
pro
tein
-lik
e
CO
G0
79
1 C
ell
wa
ll-a
sso
cia
ted
hyd
rola
ses
(in
vasi
on
-ass
oci
ate
d p
rote
ins)
dn
aK
su
pp
ress
or
pro
tein
, pu
tativ
ese
rin
e a
cety
ltra
nsf
era
se
2000 4000 6000 8000 10000 12000 14000 16000
Silicibacter pomeroy GTA-13,968 bp
term
ina
se, l
arg
e s
ub
un
it, p
uta
tive
po
rta
l pro
tein
, HK
97
fam
ily
ph
ag
e p
roh
ea
d p
rote
ase
, HK
97
fam
ily
ma
jor
cap
sid
pro
tein
, HK
97
fam
ily
he
ad
-ta
il a
da
pto
r, p
uta
tive
ma
jor
tail
pro
tein
, TP
90
1-1
fam
ily
pu
tativ
e p
ha
ge
tail
min
or
pro
tein
dn
aK
su
pp
ress
or
pro
tein
, pu
tativ
e
cysE
2000 4000 6000 8000 10000 12000 14000 16000
Silicibacter TMS1040-Prophage 1-75,835 nt
Ph
ag
e c
on
serv
ed
hyp
oth
etic
al p
rote
in, p
hiE
12
5 g
p8
pe
ptid
ase
S4
9
Ph
ag
e p
ort
al p
rote
in, H
K9
7
tra
nsc
rip
tion
al r
eg
ula
tor,
XR
E fa
mily
Pa
rB-l
ike
nu
cle
ase
KE
GG
: nw
i:Nw
i_1
14
0 p
ha
ge
inte
gra
se, e
v=1
e-2
4, 2
9%
ide
ntit
y
Re
solv
ase
-lik
e
ph
ag
e m
ajo
r ca
psi
d p
rote
in, H
K9
7
pe
ptid
ase
U3
5, p
ha
ge
pro
he
ad
HK
97
Ph
ag
e p
ort
al p
rote
in, H
K9
7
ph
ag
e T
erm
ina
se
sin
gle
-str
an
d b
ind
ing
pro
tein
en
do
de
oxy
rib
on
ucl
ea
se R
usA
ad
en
ine
-sp
eci
fic
DN
A m
eth
yltr
an
sfe
rase
HN
H n
ucl
ea
se
C-5
cyt
osi
ne
-sp
eci
fic D
NA
me
thyl
ase
ph
ag
e in
teg
rase
2000 4000 6000 8000 10000 12000 14000 16000 18000 20000 22000 24000 26000 28000 30000 32000 34000 36000 38000 40000 42000 44000 46000 48000 50000 52000 54000 56000 58000 60000 62000 64000 66000 68000 70000 72000 74000
Ph
ag
e c
on
serv
ed
hyp
oth
etic
al p
rote
in, p
hiE
12
5 g
p8
pe
ptid
ase
S4
9
Ph
ag
e p
ort
al p
rote
in, H
K9
7
tra
nsc
rip
tion
al r
eg
ula
tor,
XR
E fa
mily
Pa
rB-l
ike
nu
cle
ase
KE
GG
: nw
i:Nw
i_1
14
0 p
ha
ge
inte
gra
se, e
v=1
e-2
4, 2
9%
ide
ntit
y
Re
solv
ase
-lik
e
ph
ag
e m
ajo
r ca
psi
d p
rote
in, H
K9
7
pe
ptid
ase
U3
5, p
ha
ge
pro
he
ad
HK
97
Ph
ag
e p
ort
al p
rote
in, H
K9
7
ph
ag
e T
erm
ina
se
sin
gle
-str
an
d b
ind
ing
pro
tein
en
do
de
oxy
rib
on
ucl
ea
se R
usA
ad
en
ine
-sp
eci
fic
DN
A m
eth
yltr
an
sfe
rase
HN
H n
ucl
ea
se
C-5
cyt
osi
ne
-sp
eci
fic D
NA
me
thyl
ase
ph
ag
e in
teg
rase
2000 4000 6000 8000 10000 12000 14000 16000 18000 20000 22000 24000 26000 28000 30000 32000 34000 36000 38000 40000 42000 44000 46000 48000 50000 52000 54000 56000 58000 60000 62000 64000 66000 68000 70000 72000 74000
ph
ag
e in
teg
rase
tra
nsp
osa
se IS
3/IS
91
1In
teg
rase
, ca
taly
tic r
eg
ion
typ
e II
I re
stri
ctio
n e
nzy
me
, re
s su
bu
nit
rest
rict
ion
mo
difi
catio
n s
yste
m D
NA
sp
eci
ficity
do
ma
in
N-6
DN
A m
eth
yla
se
ph
ag
e in
teg
rase
pu
tativ
e tr
an
scrip
tion
al r
eg
ula
tor
BR
O-l
ike
typ
e I
site
-sp
eci
fic d
eo
xyri
bo
nu
cle
ase
, Hsd
R fa
mily
rest
rict
ion
mo
difi
catio
n s
yste
m D
NA
sp
eci
ficity
do
ma
in
typ
e I
rest
rict
ion
-mo
difi
catio
n s
yste
m, M
su
bu
nit
BR
O-l
ike
ph
ag
e in
teg
rase
2000 4000 6000 8000 10000 12000 14000 16000 18000 20000 22000 24000 26000 28000
Silicibacter TMS1040-Prophage 2-29,886 nt
ph
ag
e in
teg
rase
An
kyri
n
pu
tativ
e p
ha
ge r
ep
ress
or
gly
cosi
de
hyd
rola
se, f
am
ily 2
4
ph
ag
e D
NA
pa
cka
gin
g N
u1
ph
ag
e te
rmin
ase
Gp
A
ph
ag
e p
ort
al p
rote
in, l
am
bd
a
pe
ptid
ase
U3
5, p
ha
ge
pro
he
ad
HK
97
YP
_6
13
29
1.1
YP
_6
13
28
9.1
PA
AR
Ba
sep
late
J-l
ike
pro
tein
Ph
ag
e ta
il p
rote
in I
pu
tativ
e v
ari
ab
le ta
il fib
re p
rote
in
pu
tativ
e s
ecr
ete
d s
eri
ne
pro
tea
se
ph
ag
e ta
il sh
ea
th p
rote
in
ph
ag
e m
ajo
r ta
il tu
be
pro
tein
CD
SC
DS
ph
ag
e la
te c
ontr
ol D
2000 4000 6000 8000 10000 12000 14000 16000 18000 20000 22000 24000 26000 28000 30000 32000 34000 36000 38000 40000
Silicibacter TMS1040-Prophage 3-40,151 nt
AT
P-d
ep
en
de
nt p
rote
ase
La
ph
ag
e in
teg
rase
exo
nu
cle
ase
ER
F
pu
tativ
e p
ha
ge
re
pre
sso
r
en
do
de
oxy
rib
on
ucl
ea
se R
usA
sin
gle
-str
an
d b
ind
ing
pro
tein
Te
rmin
ase
sm
all
sub
un
it
Te
rmin
ase
larg
e s
ub
un
it
ph
ag
e p
uta
tive
he
ad
mo
rph
og
en
esi
s p
rote
in, S
PP
1 g
p7
tra
nsc
rip
tion
al r
eg
ula
tor,
XR
E fa
mily
Ta
pe
me
asu
re d
om
ain
two
co
mp
on
en
t tra
nsc
rip
tion
al r
eg
ula
tor,
Lu
xR fa
mily
resp
on
se r
eg
ula
tor
rece
ive
r p
rote
in
PA
S/P
AC
se
nso
r si
gn
al t
ran
sdu
ctio
n h
istid
ine
kin
ase
tra
nsc
rip
tion
al r
eg
ula
tor,
Lys
R fa
mily
2000 4000 6000 8000 10000 12000 14000 16000 18000 20000 22000 24000 26000 28000 30000 32000 34000 36000 38000 40000 42000 44000
Silicibacter TMS1040-Prophage 4-41,671 nt
Te
rmin
ase
larg
e s
ub
un
it
Ph
ag
e p
ort
al p
rote
in, H
K9
7
pe
ptid
ase
U3
5, p
ha
ge
pro
he
ad
HK
97
ph
ag
e m
ajo
r ca
psi
d p
rote
in, H
K97
He
ad
to ta
il a
da
pto
r
Ph
ag
e m
ajo
r ta
il p
rote
in, T
P9
01
-1
CD
S
Pu
tativ
e p
ha
ge
ce
ll w
all
pe
ptid
ase
, Nlp
C/P
60
cold
-sh
ock
DN
A-b
ind
ing
do
ma
in p
rote
in
seri
ne
O-a
cety
ltra
nsf
era
se
2000 4000 6000 8000 10000 12000 14000 16000
Silicibacter TMS1040-Prophage 6-16,000 nt-GTA
leu
cin
e a
min
op
ep
tida
se-r
ela
ted
pro
tein
Te
rmin
ase
larg
e s
ub
un
it
Ph
ag
e p
ort
al p
rote
in, H
K9
7
pe
ptid
ase
U3
5, p
ha
ge
pro
he
ad
HK
97
ph
ag
e m
ajo
r ca
psi
d p
rote
in, H
K9
7
CO
G5
28
1 P
ha
ge
-re
late
d m
ino
r ta
il p
rote
in
CO
G0
79
1 C
ell
wa
ll-a
sso
cia
ted
hyd
rola
ses
(in
vasi
on
-ass
oci
ate
d p
rote
ins)
Om
pA
/Mo
tB
2000 4000 6000 8000 10000 12000 14000 16000 18000 20000
Sphingomonas sp. SKA58-19,061 bp-GTA?
Stappia aggregata IAM 12614-32,622 bpD
NA
me
thyl
ase
N-4
/N-6
DN
A p
oly
me
rase
III s
ub
un
it b
eta
DN
A p
rim
ase
DN
A p
rim
ase
CO
G5
52
5 B
act
eri
op
ha
ge
tail
ass
em
bly
pro
tein
Pu
tativ
e c
ap
sid
pro
tein
of p
rop
ha
ge
pro
ba
ble
Clp
pro
tea
se p
rote
in
CO
G4
54
0 P
ha
ge
P2
ba
sep
late
ass
em
bly
pro
tein
gp
VC
OG
36
28
Ph
ag
e b
ase
pla
te a
sse
mb
ly p
rote
in W
CO
G3
94
8 P
ha
ge
-re
late
d b
ase
pla
te a
sse
mb
ly p
rote
in
Ph
ag
e p
rote
in g
p2
7
Vir
ule
nce
-ass
oci
ate
d p
rote
in
pu
tativ
e ta
il fib
er-
rela
ted
pro
tein
ph
ag
e-r
ela
ted
co
ntr
act
ile ta
il sh
ea
th p
rote
in
CO
G3
49
8 P
ha
ge
tail
tub
e p
rote
in F
II
ph
ag
e-r
ela
ted
tail
pro
tein
pu
tativ
e b
act
eri
op
ha
ge
tail
pro
tein
CO
G5
00
4 P
2-l
ike
pro
ph
ag
e ta
il p
rote
in X
SIA
M6
14
_1
31
53
pu
tativ
e e
nd
on
ucl
ea
se
2000 4000 6000 8000 10000 12000 14000 16000 18000 20000 22000 24000 26000 28000 30000 32000
Sulfitobacter sp. NAS-14 GTA-14,257 bp
3-o
xoa
cyl-
(acy
l ca
rrie
r p
rote
in)
syn
tha
se
term
ina
se, l
arg
e s
ub
un
it, p
uta
tive
pu
tativ
e p
ort
al p
rote
in
ph
ag
e p
roh
ea
d p
rote
ase
, HK
97
fam
ily p
rote
in
ma
jor
cap
sid
pro
tein
, HK
97
fam
ily p
rote
in
he
ad
-ta
il a
da
pto
r, p
uta
tive
ma
jor
tail
pro
tein
, TP
90
1-1
fam
ily p
rote
in
pu
tativ
e p
ha
ge
tail
min
or
pro
tein
CO
G0
79
1 C
ell
wa
ll-a
sso
cia
ted
hyd
rola
ses
(in
vasi
on
-ass
oci
ate
d p
rote
ins)
seri
ne
O-a
cety
ltra
nsf
era
se
2000 4000 6000 8000 10000 12000 14000 16000
site
-sp
eci
fic in
teg
rase
/re
com
bin
ase
-lik
e
CO
G1
39
6 P
red
icte
d tr
an
scri
ptio
na
l re
gu
lato
rs
Pu
tativ
e P
ha
ge
-re
late
d te
rmin
ase
pu
tativ
e p
ha
ge
po
rta
l pro
tein
, HK
97
fam
ily p
rote
in
pu
tativ
e C
lpP
-lik
e p
rote
ase
pu
tativ
e p
rop
ha
ge
La
mb
da
So
, ma
jor
cap
sid
pro
tein
, HK
97
fam
ily p
rote
in
CO
G0
84
0 M
eth
yl-a
cce
ptin
g c
he
mo
taxi
s p
rote
in
CO
G3
21
0 L
arg
e e
xop
rote
ins
invo
lve
d in
he
me
util
iza
tion
or
ad
he
sio
n
ph
ag
e-r
ela
ted
en
do
lysi
n
2000 4000 6000 8000 10000 12000 14000 16000 18000 20000 22000 24000 26000 28000
Sulfitobacter sp. NAS-14 Phage? 28,880 bp
Synechococcus sp. 5701-21,831 bpp
rop
ha
ge
CP
4-l
ike
inte
gra
se
DN
A-b
ind
ing
pro
tein
, pu
tativ
eC
OG
36
17
Pro
ph
ag
e a
ntir
ep
ress
or
Te
rmin
ase
sig
ma
fact
or
Sig
F
pu
tativ
e tr
an
spo
sase
tra
nsp
osa
se
pu
tativ
e b
act
eri
op
ha
ge
lyso
zym
e
Tra
nsp
osa
se (
cla
ss II
)
2000 4000 6000 8000 10000 12000 14000 16000 18000 20000 22000
Rlu
D Int
Cyt
C5
DN
A M
eth
yl
DN
A L
iga
se
CI r
ep
ress
or
Pri
ma
se
exo
IIIC
DS
Hyp
oth
etic
al p
rote
in
Te
rmin
ase
Po
rta
l
Min
or
cap
sid
Min
or
tail
pro
tein
Ba
sep
late
ass
em
bly
GP
25
Ba
sep
late
JT
ail
pro
tein
Ta
il fib
er
Ta
il sh
ea
th
Ph
ag
e m
ajo
r ta
il tu
be
Ta
pe
Me
asu
re
Gp
UP
ha
ge
tail
XG
pD
Tyr
osi
ne
pho
sph
ata
se
RN
asE
Rlu
D
2000 4000 6000 8000 10000 12000 14000 16000 18000 20000 22000 24000 26000 28000 30000 32000 34000 36000 38000 40000 42000
Thioimicrospira crunogena XCL-2-38,090 bp
Vibrio alginolyticus 12G01-36,811 nt p
uta
tive
inte
gra
se
pu
tativ
e tr
an
spo
sase
CP
S-5
3 (
Kp
LE
1)
pro
ph
ag
e
CO
G0
64
2 S
ign
al t
ran
sdu
ctio
n h
istid
ine
kin
ase
pu
tativ
e c
I pro
ph
ag
e r
ep
ress
or
pro
tein
pro
ph
ag
e L
am
bd
aS
o, t
ran
scri
ptio
na
l re
gu
lato
r, C
ro/C
I fa
mily
pro
tein
Un
kno
wn
pro
tein
en
cod
ed
by
cryp
tic p
rop
ha
ge
pu
tativ
e D
NA
-bin
din
g p
rote
in R
oi
tra
nsc
rip
tion
al r
eg
ula
tor,
XR
E fa
mily
pro
tein
pu
tativ
e ly
sozy
me
CO
G0
20
2 D
NA
-dir
ect
ed
RN
A p
oly
me
rase
, alp
ha
su
bu
nit/
40
kD
su
bu
nit
pu
tativ
e D
NA
pa
cka
gin
g p
rote
in o
f pro
ph
ag
e
pu
tativ
e h
ea
d-t
ail
join
ing
pro
tein
of p
rop
ha
ge
pu
tativ
e p
ort
al p
rote
in
Pu
tativ
e c
ap
sid
pro
tein
of p
rop
ha
ge
Pu
tativ
e h
ea
d-D
NA
sta
bili
zatio
n p
rote
in o
f pro
ph
ag
e
pu
tativ
e m
ajo
r ca
psi
d p
rote
in
min
or
tail
pro
tein
Pu
tativ
e ta
il fib
er
com
po
ne
nt V
of p
rop
ha
ge
tail
ass
em
bly
pro
tein
, pu
tativ
e
pu
tativ
e la
rge
se
cre
ted
pro
tein
2000 4000 6000 8000 10000 12000 14000 16000 18000 20000 22000 24000 26000 28000 30000 32000 34000 36000 38000
V. fischeri prophage – 29,712 bp
me
thyl
-acc
ep
ting
ch
em
ota
xis
pro
tein
DN
A in
teg
ratio
n/r
eco
mb
ina
tion
/inve
rtio
n p
rote
in
tra
nsc
rip
tion
al r
eg
ula
tor,
Cro
/CI f
am
ily
po
ssib
le p
ha
ge
re
gu
lato
ry p
rote
in (
CII)
hyp
oth
etic
al b
act
eri
op
ha
ge
pro
tein
exo
de
oxy
rib
on
ucl
ea
se V
III
pu
tativ
e p
ha
ge
tail
pro
tein
pu
tativ
e p
ha
ge
ba
sep
late
ass
em
bly
pro
tein
ph
ag
e p
rote
in
ph
ag
e p
rote
in
pu
tativ
e p
ha
ge
tail
pro
tein
L-a
lan
yl-D
-glu
tam
ate
pe
ptid
ase
pro
ba
ble
ca
psi
d p
ort
al p
rote
in
term
ina
se, A
TP
ase
su
bu
nit
pro
ba
ble
ca
psi
d s
caffo
ldin
g p
rote
in
ma
jor
cap
sid
pro
tein
pre
curs
or
tail
she
ath
pro
tein
pu
tativ
e p
ha
ge
R p
rote
inp
uta
tive
ph
ag
e ta
il p
rote
in
2000 4000 6000 8000 10000 12000 14000 16000 18000 20000 22000 24000 26000 28000 30000 32000 34000
pu
tativ
e p
ha
ge
term
ina
se la
rge
su
bu
nit
Po
rta
l pro
tein
ma
jor
cap
sid
pro
ph
ag
e L
am
bd
aW
5, m
ino
r ta
il p
rote
in Z
ph
ag
e b
ase
pla
te a
sse
mb
ly p
rote
in V
pu
tativ
e b
ase
pla
te a
sse
mb
ly p
rote
in
ph
ag
e ta
il p
rote
in I
tail
fibe
r p
rote
in
ph
ag
e ta
il sh
ea
th p
rote
in F
I-lik
e
pu
tativ
e ta
il tu
be
pro
tein
ph
ag
e-r
ela
ted
tail
pro
tein
ph
ag
e p
rote
in U
-lik
ep
ha
ge
tail
XY
P_
00
10
39
84
6.1
DN
A a
de
nin
e m
eth
yla
se
pu
tativ
e e
xon
ucl
ea
se
pa
rA p
rote
in
pro
telo
me
rase
tra
nsc
rip
tion
al r
eg
ula
tor
pu
tativ
e r
ep
lica
tion
pro
tein
Re
pA
pu
tativ
e r
ep
ress
or
pro
tein
an
tite
rmin
atio
n p
rote
in Q
2000 4000 6000 8000 10000 12000 14000 16000 18000 20000 22000 24000 26000 28000 30000 32000 34000 36000 38000
Vibrio parahaemolyticus phage VP882-38,197 bp
pu
tativ
e p
rote
lom
era
se
pu
tativ
e D
NA
pri
ma
se
cI r
ep
ress
or
of p
rop
ha
ge
CP
-93
3V
ad
en
ine
me
thyl
tra
nsf
era
se
ad
en
ine
me
thyl
tra
nsf
era
se
term
ina
se s
ma
ll su
bu
nit
pu
tativ
e te
rmin
ase
larg
e s
ub
un
it
pu
tativ
e p
ort
al p
rote
in
pu
tativ
e c
ap
sid
pro
tein
ba
sep
late
sp
ike
pro
tein
ba
sep
late
ass
em
bly
pro
tein
ba
sep
late
ass
em
bly
pro
tein
ba
sep
late
ass
em
bly
pro
tein
tail
fibre
pro
tein
reve
rse
tra
nsc
rip
tase
/ma
tura
se/tr
an
spo
sase
tail
con
tra
ctile
sh
ea
th p
rote
in
tail
tub
e p
rote
inta
il p
rote
in
tail
len
gth
de
term
ina
tor
pro
tein
tail
pro
tein
tail
pro
tein
tail
pro
tein
an
tire
pre
sso
r
pu
tativ
e e
xon
ucl
ea
se p
rote
in
pu
tativ
e p
art
itio
n p
rote
in
pu
tativ
e p
rote
lom
era
se
pu
tativ
e D
NA
pri
ma
se
cI r
ep
ress
or
of p
rop
ha
ge
CP
-93
3V
ad
en
ine
me
thyl
tra
nsf
era
se
ad
en
ine
me
thyl
tra
nsf
era
se
term
ina
se s
ma
ll su
bu
nit
pu
tativ
e te
rmin
ase
larg
e s
ub
un
it
pu
tativ
e p
ort
al p
rote
in
pu
tativ
e c
ap
sid
pro
tein
ba
sep
late
sp
ike
pro
tein
ba
sep
late
ass
em
bly
pro
tein
ba
sep
late
ass
em
bly
pro
tein
ba
sep
late
ass
em
bly
pro
tein
tail
fibre
pro
tein
reve
rse
tra
nsc
rip
tase
/ma
tura
se/tr
an
spo
sase
tail
con
tra
ctile
sh
ea
th p
rote
in
tail
tub
e p
rote
inta
il p
rote
in
tail
len
gth
de
term
ina
tor
pro
tein
tail
pro
tein
tail
pro
tein
tail
pro
tein
an
tire
pre
sso
r
pu
tativ
e e
xon
ucl
ea
se p
rote
in
pu
tativ
e p
art
itio
n p
rote
in
Vibrio harveyi Phage VHML-43,198 nt
CD
S
sen
sor
his
tidin
e k
ina
se/r
esp
on
se r
eg
ula
tor
ba
cte
rio
ph
ag
e f2
37
OR
F1
0
ba
cte
rio
ph
ag
e f2
37
OR
F1
ba
cte
rio
ph
ag
e f2
37
OR
F2
ba
cte
rio
ph
ag
e f2
37
OR
F3
ba
cte
rio
ph
ag
e f2
37
OR
F4
ba
cte
rio
ph
ag
e f2
37
OR
F5
ba
cte
rio
ph
ag
e f2
37
OR
F6
ba
cte
rio
ph
ag
e f2
37
OR
F7
ba
cte
rio
ph
ag
e f2
37
OR
F8
ba
cte
rio
ph
ag
e f2
37
OR
F9
pu
tativ
e s
tru
ctu
ral p
rote
in P
5 (
Alte
rom
on
as
ph
ag
e P
M2
)
pu
tativ
e p
ha
ge
-re
late
d p
rote
in
pu
tativ
e m
ajo
r p
ha
ge
ca
psi
d p
rote
in
hyp
oth
etic
al p
rote
in (
Alte
rom
on
as
ph
ag
e P
M2
)
pu
tativ
e p
ha
ge
re
plic
atio
n in
itia
tion
pro
tein
(A
ltero
mo
na
s p
ha
ge
PM
2)
pu
tativ
e p
ha
ge
pro
tein
, Vp
f14
8 [b
act
eri
op
ha
ge
VfO
3K
6]
CD
S3
-hyd
roxy
de
can
oyl
-AC
P d
eh
ydra
tase
rib
oso
me
mo
du
latio
n fa
cto
r
AB
C tr
an
spo
rte
r, A
TP
-bin
din
g p
rote
in
pu
tativ
e N
6-a
de
nin
e-s
pe
cific
DN
A m
eth
yla
se
dih
ydro
oro
tate
de
hyd
rog
en
ase
pu
tativ
e N
AD
-glu
tam
ate
de
hyd
rog
en
ase
CD
S
am
ino
pe
ptid
ase
N
CD
S
tail-
spe
cific
pro
tea
se
pu
tativ
e s
olu
te/D
NA
co
mp
ete
nce
effe
cto
rC
DS
2000 4000 6000 8000 10000 12000 14000 16000 18000 20000 22000 24000 26000 28000 30000 32000 34000 36000 38000 40000 42000 44000 46000 48000
V. Parahaemolyticus RMID2210633 Cr. I-47,981 nt
Note-like bacteriophage f237
pu
tativ
e p
ha
ge
str
uct
ura
l pro
tein
(A
ltero
mo
na
s P
M2
)
pu
tativ
e m
ajo
r p
ha
ge
ca
psi
d p
rote
in P
2
pu
tativ
e r
ep
lica
tion
initi
atio
n p
rote
in, p
ha
ge
-re
late
d
pu
tativ
e p
ha
ge
-re
late
d p
rote
in
pu
tativ
e tr
an
scri
ptio
na
l re
gu
lato
r
pu
tativ
e u
niv
ers
al s
tre
ss p
rote
in A
tra
nsc
rip
tion
al r
eg
ula
tor
sen
sor
his
tidin
e k
ina
se
2000 4000 6000 8000 10000 12000 14000 16000 18000 20000
V. Parahaemolyticus RMID2210633 Cr.II-19,424 nt
Vibrio splendidus 12B01-First prophage-35,675 ntin
teg
rase
, pu
tativ
e
CO
G0
49
7 A
TP
ase
invo
lve
d in
DN
A r
ep
air
CO
G2
18
3 T
ran
scri
ptio
na
l acc
ess
ory
pro
tein
Pu
tativ
e r
ep
ress
or
pro
tein
of p
rop
ha
ge
CO
G0
54
1 S
ign
al r
eco
gni
tion
pa
rtic
le G
TP
ase
pu
tativ
e p
ha
ge
lyso
zym
e
Te
rmin
ase
larg
e s
ub
un
it
pu
tativ
e p
ort
al p
rote
in
AT
P-d
ep
en
de
nt p
rote
ase
pro
ph
ag
e L
am
bd
aW
5, m
ino
r ta
il p
rote
in Z
, pu
tativ
e
Ph
ag
e b
ase
pla
te a
sse
mbl
y p
rote
in V
CO
G3
62
8 P
ha
ge
ba
sep
late
ass
em
bly
pro
tein
WB
ase
pla
te J
-lik
e p
rote
inP
ha
ge
tail
pro
tein
Ip
rob
ab
le p
yoci
n R
2_
PP
, ta
il fib
er
pro
tein
Ph
ag
e ta
il sh
ea
th p
rote
in F
I-lik
e
tail
tap
e m
ea
sure
pro
tein
Ph
ag
e p
rote
in U
-lik
e
Ph
ag
e p
rote
in D
-lik
e
ph
ag
e in
teg
rase
2000 4000 6000 8000 10000 12000 14000 16000 18000 20000 22000 24000 26000 28000 30000 32000 34000
Vibrio splendidus 12B01-Second prophageIn
teg
rase
Pre
dic
ted
tra
nsc
rip
tion
al r
eg
ula
tor
hyp
oth
etic
al b
act
eri
op
ha
ge
pro
tein
pu
tativ
e p
ha
ge
ge
ne
CO
G5
51
8 B
act
eri
op
ha
ge
ca
psi
d p
ort
al p
rote
in
pu
tativ
e p
ha
ge
ge
ne
pu
tativ
e p
ha
ge
ge
ne
pro
ph
ag
e P
SP
PH
06
, pu
tativ
e h
ea
d c
om
ple
tion
/sta
bili
zatio
n p
rote
inp
uta
tive
ph
ag
e g
en
ep
rop
ha
ge
PS
PP
H0
6, v
irio
n m
orp
ho
ge
ne
sis
pro
tein
pro
ph
ag
e P
SP
PH
06
, pu
tativ
e ta
il sh
ea
th p
rote
in
pro
ph
ag
e P
SP
PH
06
, pu
tativ
e ta
il tu
be
pro
tein
CO
G1
73
4 D
na
K s
up
pre
sso
r p
rote
inp
uta
tive
ph
ag
e ly
sozy
me
CO
G0
64
2 S
ign
al t
ran
sdu
ctio
n h
istid
ine
kin
ase
pro
ph
ag
e P
SP
PH
06
, ta
il ta
pe
me
asu
re p
rote
in, T
P9
01
fam
ily
pu
tativ
e p
ha
ge
ge
ne
CO
G3
29
9 U
nch
ara
cte
rize
d h
om
olo
g o
f ph
ag
e M
u p
rote
in g
p4
7
pu
tativ
e b
act
eri
op
ha
ge
pro
tein
.
Ph
ag
e-r
ela
ted
tail
fibe
r p
rote
in
A/G
-sp
eci
fic D
NA
gly
cosy
lase
2000 4000 6000 8000 10000 12000 14000 16000 18000 20000 22000 24000 26000 28000 30000 32000
Site
-sp
eci
fic r
eco
mb
ina
se X
erD
Co
-ch
ap
ero
nin
Gro
ES
Un
cha
ract
eri
zed
ph
ag
e-a
sso
cia
ted
pro
tein
AT
Pa
se c
om
po
ne
nt o
f AB
C tr
an
spo
rte
r
Ch
rom
oso
me
se
gre
ga
tion
AT
Pa
se
Ph
ag
e P
2 b
ase
pla
te a
sse
mb
ly p
rote
in g
pV
dT
DP
-glu
cose
pyr
op
ho
sph
ory
lase
Ph
ag
e-r
ela
ted
ba
sep
late
ass
em
bly
pro
tein
Ba
cte
rio
ph
ag
e P
2-r
ela
ted
tail
form
atio
n p
rote
in
Ph
ag
e-r
ela
ted
tail
fibe
r p
rote
in
A/G
-sp
eci
fic D
NA
gly
cosy
lase
Ph
ag
e ta
il sh
ea
th p
rote
in F
I
Ph
ag
e ta
il tu
be
pro
tein
FII
Ph
ag
e p
rote
in U
P2
-lik
e p
rop
ha
ge
tail
pro
tein
XP
ha
ge
pro
tein
D
Pre
dic
ted
tra
nsc
rip
tion
al r
eg
ula
tor
Pre
dic
ted
tra
nsc
rip
tion
al r
eg
ula
tor
Ba
cte
rio
ph
ag
e p
hi g
p5
5-l
ike
pro
tein
Site
-sp
eci
fic r
eco
mb
ina
se X
erC
AT
P-d
ep
en
de
nt 2
6S
pro
tea
som
e r
eg
ula
tory
su
bu
nit
pro
ba
ble
DN
A-d
ire
cte
d R
NA
po
lym
era
se, b
eta
' su
bu
nit/
16
0 k
D s
ub
un
it; C
OG
00
86
Ba
cte
rio
ph
ag
e p
hi 1
.45
pro
tein
-lik
e p
rote
inP
red
icte
d tr
an
scri
ptio
na
l re
gu
lato
r
2000 4000 6000 8000 10000 12000 14000 16000 18000 20000 22000 24000 26000 28000 30000 32000 34000 36000 38000
Vibrio vulnificus CMCP6 chromosome I-39,325 bp