microBioRust 0.1.2

Microbiology friendly bioinformatics Rust functions
Documentation
ID   AM236082; SV 1; linear; genomic DNA; STD; PRO; 6666 BP.
XX
AC   AM236082;
XX
PR   Project:PRJNA344;
XX
DT   04-MAY-2006 (Rel. 87, Created)
DT   06-FEB-2015 (Rel. 123, Last updated, Version 9)
XX
DE   Rhizobium leguminosarum bv. viciae plasmid pRL8 complete genome, strain
DE   3841
XX
KW   complete genome.
XX
OS   Rhizobium leguminosarum bv. viciae 3841
OC   Bacteria; Proteobacteria; Alphaproteobacteria; Rhizobiales; Rhizobiaceae;
OC   Rhizobium/Agrobacterium group; Rhizobium.
OG   Plasmid pRL8
XX
RN   [1]
RP   1-147463
RA   Crossman L.C.;
RT   ;
RL   Submitted (21-FEB-2006) to the INSDC.
RL   Crossman L.C., Pathogen Sequencing Unit, The Wellcome Trust Sanger
RL   Institute, Hinxton, Cambridge, Cambridgeshire, CB10 1SA, UNITED KINGDOM.
XX
RN   [2]
RX   DOI; 10.1186/gb-2006-7-4-r34.
RX   PUBMED; 16640791.
RA   Young J.W., Crossman L.C., Johnston A.W.B., Thomson N.R., Ghazoui Z.F.,
RA   Hull K.H., Wexler M., Curson A.R.J., Todd J.D., Poole P.S., Mauchline T.H.,
RA   East A.K., Quail M.A., Churcher C., Arrowsmith C., Cherevach A.,
RA   Chillingworth T., Clarke K., Cronin A., Davis P., Fraser A., Hance Z.,
RA   Hauser H., Jagels K., Moule S., Mungall K., Norbertczak H.,
RA   Rabbinowitsch E., Sanders M., Simmonds M., Whitehead S., Parkhill J.;
RT   "The genome of Rhizobium leguminosarum has recognizable core and accessory
RT   components";
RL   Genome Biol. 7(4):R34-R34(2006).
XX
DR   MD5; 8fe097fb2b9f874c5d043fe59cea066c.
DR   BioSample; SAMEA1705944.
DR   EnsemblGenomes-Gn; EBG00001182864.
DR   EnsemblGenomes-Gn; pRL80017.
DR   EnsemblGenomes-Gn; pRL80039.
DR   EnsemblGenomes-Gn; pRL80039A.
DR   EnsemblGenomes-Gn; pRL80050.
DR   EnsemblGenomes-Gn; pRL80051.
DR   EnsemblGenomes-Gn; pRL80055.
DR   EnsemblGenomes-Gn; pRL80058.
DR   EnsemblGenomes-Gn; pRL80089.
DR   EnsemblGenomes-Gn; pRL80091.
DR   EnsemblGenomes-Gn; pRL80106.
DR   EnsemblGenomes-Tr; EBT00001761573.
DR   EnsemblGenomes-Tr; pRL80017.
DR   EnsemblGenomes-Tr; pRL80039.
DR   EnsemblGenomes-Tr; pRL80039A.
DR   EnsemblGenomes-Tr; pRL80050.
DR   EnsemblGenomes-Tr; pRL80051.
DR   EnsemblGenomes-Tr; pRL80055.
DR   EnsemblGenomes-Tr; pRL80058.
DR   EnsemblGenomes-Tr; pRL80089.
DR   EnsemblGenomes-Tr; pRL80091.
DR   EnsemblGenomes-Tr; pRL80106.
DR   RFAM; RF00490; S-element.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..>6666
FT                   /organism="Rhizobium leguminosarum bv. viciae 3841"
FT                   /plasmid="pRL8"
FT                   /strain="3841"
FT                   /mol_type="genomic DNA"
FT                   /country="United Kingdom"
FT                   /db_xref="taxon:216596"
FT   CDS             1..1197
FT                   /transl_table=11
FT                   /gene="repAp8"
FT                   /locus_tag="pRL80001"
FT                   /product="replication protein RepA"
FT                   /db_xref="EnsemblGenomes-Gn:pRL80001"
FT                   /db_xref="EnsemblGenomes-Tr:CAK02801"
FT                   /db_xref="GOA:Q1M9K5"
FT                   /db_xref="InterPro:IPR000551"
FT                   /db_xref="InterPro:IPR017818"
FT                   /db_xref="InterPro:IPR025669"
FT                   /db_xref="InterPro:IPR027417"
FT                   /db_xref="UniProtKB/TrEMBL:Q1M9K5"
FT                   /protein_id="CAK02801.1"
FT                   /translation="MENPAQLQKAIHKLIAAHARDLSGALHEHRVKLYPPEARKTLRSF
FT                   SSIEAAKLIGVNDGYLRHLSLEGKGPQPEIGNNNRRSYSVETIQALREYLDENGKGDRR
FT                   YSPRRSGREHLQVITAVNFKGGSGKTTTAAHLAQYLALNGYRVLAIDLDPQASMSALHG
FT                   FQPEFDVGDNETLYGAVRYDEERRPLKDIIKKTYFANLDLVPGNLELMEFEHDTAKVLG
FT                   SNDRKNIFFTRMDDAIASVADDYDVVVVDCPPQLGFLTISALCAATAVLVTVHPQMLDV
FT                   MSMCQFLLMTSELLSVVADAGGSMNYDWMRYLVTRYEPGDGPQNQMVSFMRTMFGDHVL
FT                   NHPMLKSTAISDAGITKQTLYEVSRDQFTRATYDRAMESLDNVNSEIEQLIQSSWGRK"
FT   misc_feature    1..6666
FT                   /colour=12
FT   CDS             1321..2280
FT                   /transl_table=11
FT                   /gene="repBp8"
FT                   /locus_tag="pRL80002"
FT                   /product="replication protein RepB"
FT                   /db_xref="EnsemblGenomes-Gn:pRL80002"
FT                   /db_xref="EnsemblGenomes-Tr:CAK02802"
FT                   /db_xref="GOA:Q1M9K4"
FT                   /db_xref="InterPro:IPR003115"
FT                   /db_xref="InterPro:IPR004437"
FT                   /db_xref="InterPro:IPR011111"
FT                   /db_xref="InterPro:IPR017819"
FT                   /db_xref="InterPro:IPR036086"
FT                   /db_xref="InterPro:IPR037972"
FT                   /db_xref="UniProtKB/TrEMBL:Q1M9K4"
FT                   /protein_id="CAK02802.1"
FT                   /translation="MARKHLLSDLKAPASSSTEFDEARAADVPTPQYAPRGAIGAVSRS
FT                   IEALKSQGLSELDPELIDAPSVTDRLDEDGAQFEEFARNIRENGQQVPILVRPHPTVEG
FT                   RYQIAYGRRRLRAVKAAGLKVKAAIRNLTDDELVLAQGQENSARQDLSFIERALYAAQL
FT                   EASGYQRPVIMAALAVDKSNLSRLIQAATQLPDDVIRLIGAAPKTGRDRWYELSSRLAA
FT                   EGAAEKARALLSTSEVGSLGSDERFVRVFDAVAPKKSKKEKVQADVWQADDGVKAASFR
FT                   QDKRTLTLMIDKKAAPEFGEYLMSALPEIYASFKKSKQ"
FT   CDS             2455..3672
FT                   /transl_table=11
FT                   /gene="repCp8"
FT                   /locus_tag="pRL80003"
FT                   /product="replication RepC protein"
FT                   /db_xref="EnsemblGenomes-Gn:pRL80003"
FT                   /db_xref="EnsemblGenomes-Tr:CAK02803"
FT                   /db_xref="InterPro:IPR005090"
FT                   /db_xref="InterPro:IPR021760"
FT                   /db_xref="UniProtKB/TrEMBL:Q1M9K3"
FT                   /protein_id="CAK02803.1"
FT                   /translation="METGYITTPFGRRPMTLALVKRQVKTEQAIADGSVDKWRVFRDIS
FT                   DARSRLGLQDRALAVLNALLTFFPVAELSNERNLVVFPSNAQLSARTNGIAGTTLRKCL
FT                   GSLVEAGVIIRKDSPNGKRYARKGKEGNIEDAYGFSLAPLLARAGEFASLAQDVAAEQR
FT                   RFRITKDRLTIVRRDVRKLITVGMEENLAGDWIAAETCFVEIVGRFVRHPTLQDLISSL
FT                   DEMSLLHEEVSRMLEIKEETAKSDGNAIPDGCHIQNSNTESCHELEPRSEKKQGEKSEP
FT                   NKKTERKDEPEAFPLSMVLRACPEINAFGPGGSIGSWREMMSAAVTVRSMLGVSPSAYQ
FT                   EACEVMGQAGAAIAIACIYQRGGHINSAGGYLRDLTGKARRGEFSLGPMLFTQLRANSG
FT                   TVKASA"
FT   CDS             3811..6666
FT                   /transl_table=11
FT                   /locus_tag="pRL80004"
FT                   /product="hypothetical protein"
FT                   /note="no significant database hits"
FT                   /db_xref="EnsemblGenomes-Gn:pRL80004"
FT                   /db_xref="EnsemblGenomes-Tr:CAK02804"
FT                   /db_xref="InterPro:IPR003593"
FT                   /db_xref="InterPro:IPR027417"
FT                   /db_xref="UniProtKB/TrEMBL:Q1M9K2"
FT                   /protein_id="CAK02804.1"
FT                   /translation="MTEIVLPTENTIIAAAKKLDAAASQLVAETFFAIRHGMSINPIGR
FT                   NPDGQTIKGYPDITGRVPGEKKYLIEVTKDDWRTHLQSDLSKLSRLQKGAYAGFLLLCF
FT                   RKSESELTQSNRKKARETVQQAESRIEKLLGVQAGQVEFVFLGEFAREVRSAKYHRVLL
FT                   ALGLELVPAPFYTDLRFVQGLADFVPTAEEYEAESVVPRDEVSRTYERVFKNRLTLIEG
FT                   EGGSGKTSLALAVATEHRKQGEIFLFLDASVADWKSGSERARLVDVAAMFAESNVLIIL
FT                   DNVHLGDASGISELITNVQASGYDFRFLMTTRSSDEVEQWKRLGNIELLRRVPSGADVN
FT                   SAYHRLLTQKFPGSSFNDIPPAVTTRWSNQIPNLVILTLALEGLTKRGGYDRDWAIKVE
FT                   DAGTYLQAKFISKLSSDDVKQVGKIAALSLLEIPTSLRSLDHRVPKSAVDLGFVRLNSS
FT                   STTQRYELVHHELGKLITSFKDPDIKARLGEVMSADPFQATYIGLKLIGNGEASLAKEL
FT                   LSSVLSQSLTLSPDFSMGNSGGVFGILVQSNVTTYPEIERILLPDIGAFFDTKPDIVTG
FT                   LSSFLGAASENMERVYNAIVEKLAEQETIRRIEELLPSVGPTTFATLYRCANSRNLPFL
FT                   STLRKYLNRGKRIDSFAYRCRSESPSKVEICWGLIDEFFPHHKARFEVVLRSALAEGYI
FT                   ERLIPEELIESRSSRAVQTAIRCANSEVFKRYITFRDCSDATLLLLAHTMHDMGRNDLS
FT                   EVAADRVAGRTTSSIWYHRRTGGRALLTILRRASISAEGDVQKILMRLEAEGKMRAIVN
FT                   GMRPYRLANFIFVIWDRHEQFTSFISKTDLQEITNRRFKARAAEFSEERQASIYIAGIY
FT                   ALVGLDIPRDEWSAVDVTEDDFIGNQNNPVFWIGLKALEENGMIRLAHRSRFPTSVAAL
FT                   DTHSENTSRIMNDLKNWAATR"
SQ   Sequence 6666 BP; 1576 A; 1743 C; 1876 G; 1471 T; 0 other;
     gtggagaatc ccgctcagct tcagaaggct attcataaac tgatagcggc ccacgcgcga        60
     gatctctcgg gcgcgcttca cgagcatcgt gtgaagcttt atccgcctga agctcgaaag       120
     acgcttcggt cattttcgtc gatagaggct gcgaagctca ttggcgtcaa cgatggctat       180
     ctccgccatc tttcgctcga gggtaagggg ccgcagcctg agatcggaaa taacaatcgc       240
     cgttcgtatt cggtcgagac tattcaggcg ctccgcgagt atctcgacga gaacggcaag       300
     ggtgaccgtc ggtactcacc acgccggagc ggtcgtgagc atttgcaggt tataaccgca       360
     gtgaacttca agggaggcag cggtaagacc acgacggctg ctcatcttgc tcagtatctt       420
     gcgcttaatg gataccgggt tcttgcgatt gatcttgatc cgcaggccag catgtccgct       480
     ttgcacggat tccagcctga gtttgacgtt ggcgacaacg aaacgctcta cggcgccgtt       540
     cgttatgatg aagagcggcg cccgctgaag gatataatca agaaaaccta ctttgcgaac       600
     cttgatctcg ttccgggcaa cctcgagctt atggaattcg agcacgacac cgctaaagtg       660
     ctcggctcta acgaccgcaa gaacatcttc ttcacgcgaa tggatgacgc aatcgcgtca       720
     gtggcggacg actatgacgt tgtcgtcgtc gactgccctc cccagctcgg ctttctgacg       780
     atctcggctc tatgcgcggc aaccgccgtt cttgttactg tacatcctca gatgctcgat       840
     gtgatgtcga tgtgccagtt tctgctgatg acctcagaac ttctgagcgt cgttgcggat       900
     gctggcggga gcatgaacta cgattggatg cgttatctcg ttacgcgcta cgagccggga       960
     gacggaccgc aaaaccagat ggtgtcgttc atgcgcacga tgtttggcga ccatgtcctg      1020
     aaccacccga tgctcaagag cacagccatt tcagacgcgg ggattactaa gcagactctc      1080
     tatgaggtga gccgcgacca gttcacgcga gcaacatacg accgagccat ggaatcgctc      1140
     gacaacgtga acagcgaaat cgaacaactc attcaatcat cttggggtcg caaatgatgg      1200
     ctctagagat ctcagaaaac gcgacattga tggagaagtt gccagccgga aacttttcgg      1260
     aatttgcact ctctatgtcg aggaatccgg cttgtcacga gtacctcagg ggaaagcaag      1320
     atggctagaa aacacctcct ttcagatttg aaagctcctg cttcatcatc tacggagttc      1380
     gatgaagcta gggctgcaga cgtccctact ccgcagtatg cgcctcgagg tgcaatcggt      1440
     gccgtctcgc gatcgattga agctttgaag tcgcagggac tgagtgaact cgatcccgaa      1500
     ctgatagatg cgccgtccgt tactgatcgc cttgatgagg atggggctca gtttgaggag      1560
     ttcgctcgca acatccgtga gaatgggcag caggttccga ttcttgtccg gcctcacccg      1620
     accgtggaag gacggtatca gattgcctac ggccggagac ggttgagagc ggtcaaggcg      1680
     gccggcctca aggtcaaagc cgcaatcaga aatctgacag atgacgagct tgtactggcg      1740
     caaggtcagg aaaacagcgc gcgtcaggat ctgtcgttta tcgagcgggc gctctatgca      1800
     gcccagctcg aagcgagtgg ctaccagcgt cccgtcatca tggcagcgct ggctgtcgac      1860
     aaaagtaacc tttcgcggtt gattcaggct gcgacccaat tgccggacga cgtcatccga      1920
     ctaattggtg ctgcgcctaa gaccggccgt gatcgctggt acgagctatc atcgcggttg      1980
     gctgcagaag gtgctgcgga gaaggcgcgc gctcttcttt cgactagcga ggttggctcc      2040
     ctgggttctg atgagcgatt tgttcgcgtt ttcgacgcgg ttgcgccgaa gaaatctaag      2100
     aaggaaaaag ttcaggcgga tgtctggcaa gctgacgatg gggtcaaggc tgcgagtttc      2160
     cgccaggaca aacgaacact gacattgatg atcgacaaga aggcagcgcc ggaattcggt      2220
     gagtacctga tgtcggctct ccccgagatc tacgcttcgt tcaagaagtc gaagcaatag      2280
     atgagtcgta acgaagaaag gtgccgatag cgcaaagaaa aagccctccg aaacggtgtt      2340
     ccagaaggcc tctctcagtt tggtcgctta gagaatcgca tttcccggaa tcacagtcaa      2400
     gagtcaacgc cacaccggcg tagccttttc tttgccttgc gaaaggtgaa ggacatggaa      2460
     acgggttata tcacgacgcc ctttgggcgg cggccgatga cgcttgctct ggtgaagcgt      2520
     caggttaaga ccgagcaggc aatagcggat ggctcggtcg acaagtggcg cgtgtttcgc      2580
     gacataagcg acgcccgctc acgccttggc cttcaagatc gagccttggc ggtcttgaat      2640
     gcacttttaa cattcttccc agttgctgaa ctcagcaatg agaggaacct ggtcgtcttt      2700
     ccatcaaatg ctcagctatc agcccgcaca aacggtatcg ctgggacaac tctgcgcaag      2760
     tgcctcggtt cgctggtgga ggccggtgta atcatccgca aggatagccc taacggtaag      2820
     cgatatgctc gaaaaggcaa agaaggaaac atagaggacg cctacggctt cagtctggca      2880
     ccgcttcttg cgcgcgccgg cgagtttgct agcctcgccc aagacgtggc tgctgaacag      2940
     cgccgcttcc gcatcacgaa agaccgcctc acgatcgttc ggcgagatgt ccgcaagctg      3000
     atcaccgtcg ggatggaaga gaaccttgcc ggcgattgga ttgccgcgga aacgtgcttt      3060
     gtcgagattg tgggaaggtt cgttcggcac ccgacgctcc aggacctgat ttcgagcctc      3120
     gacgagatga gccttcttca cgaagaagtc tccaggatgc tggaaattaa agaagaaacc      3180
     gcaaaaagtg atggcaatgc catcccggac ggatgccaca tacagaattc aaataccgaa      3240
     tcctgccatg aacttgaacc ccgctccgaa aagaagcagg gcgaaaagtc cgagccaaac      3300
     aagaaaacgg agcggaaaga cgaaccggaa gcgtttccgt tgtccatggt gttgcgtgcc      3360
     tgcccggaga tcaacgcatt tggccctggt ggatcgattg gaagctggcg cgaaatgatg      3420
     tcagcggcgg taacggttcg gtccatgctt ggcgtcagcc cctctgccta tcaggaggca      3480
     tgcgaggtga tggggcaggc cggagcggcg atagcaatag cttgcattta ccagcgtggc      3540
     gggcacatca actcggcggg gggatatctt cgggatctaa cggggaaggc gcggcgaggg      3600
     gagttttcac ttgggccaat gctgtttacg caattgcggg cgaactcggg caccgtcaag      3660
     gcgtcagcgt aggtcaaagt atcatgattg tttagcctaa ccggttgaac taattaacct      3720
     attttgacta gtttccggct ggcaacttta tctcgatcta aagcgtcgag tgaatggcag      3780
     aagataatct tcctgatggg cgtccgtata atgaccgaaa ttgtgcttcc gaccgaaaac      3840
     acgatcatcg cggcagccaa aaaacttgac gcggccgcat cgcagctggt ggcagagacg      3900
     ttctttgcca ttcggcatgg gatgtcaatc aatccaattg gtcgcaaccc ggatgggcag      3960
     accatcaagg gataccctga cattactggg cgggtgccgg gtgagaagaa gtacctgatc      4020
     gaagtcacga aggacgactg gcgcacacat cttcagagcg atctatcaaa actgtcccgc      4080
     ctgcagaaag gagcctacgc gggtttccta cttctctgct tccgaaagtc cgagtccgaa      4140
     ctcactcaaa gcaacaggaa gaaggcacgg gaaaccgtcc agcaggccga gagccggatt      4200
     gaaaagcttt tgggtgtcca ggcaggacag gtagaattcg tctttcttgg cgagttcgcg      4260
     cgtgaggtca gatcggcgaa ataccaccgc gtattgctgg ctctgggtct cgagcttgtg      4320
     ccagcgccat tctacacgga tttgcgcttc gtgcagggct tagccgattt cgtaccgacc      4380
     gctgaggaat atgaggctga gagtgttgtt cctcgcgatg aggtaagccg gacctatgag      4440
     cgggtcttca aaaacagact aacgttgatc gaaggcgagg gcggtagcgg caaaacaagc      4500
     ctggccctag ccgttgcgac ggagcatcgg aagcaaggcg agatctttct gttcttagac      4560
     gcctctgtcg ctgactggaa gagcggttcg gagcgagctc gcctcgttga cgtagcggcg      4620
     atgttcgcgg aatcgaatgt cctgattata ttggacaacg tacatctggg cgatgcgtcc      4680
     ggcatttctg aactgattac aaatgtccag gcgtccggtt atgatttccg ctttttgatg      4740
     acgacgcgca gcagcgacga agttgaacaa tggaagcgcc tgggaaatat cgagcttctc      4800
     cgcagagttc cgtctggagc cgatgtcaac tctgcctatc accgcctgct cactcaaaag      4860
     tttcccggaa gcagtttcaa cgatattccc ccagcggtga ccacacgatg gtcaaatcaa      4920
     attcccaatc tggttattct cacgcttgct cttgaaggtc tcacaaagag aggcggctat      4980
     gatcgcgatt gggcgatcaa ggttgaggac gcaggcacat accttcaagc taagttcatc      5040
     tcgaagctgt cgtccgacga cgtcaaacag gtgggcaaga tcgctgcgct ctcacttctg      5100
     gaaattccca cctcgctcag gtcgctcgac caccgggttc caaagtctgc tgtggatctg      5160
     ggcttcgttc gtctgaactc gagttcaaca actcagcgat atgagctcgt tcaccacgaa      5220
     ctgggcaagc tgatcacgtc cttcaaagat ccggatatca aggcgcggct gggagaggtg      5280
     atgtccgctg atcccttcca ggcaacatat atcgggctga agcttatcgg aaacggagaa      5340
     gccagcctgg caaaggaatt gttgtcgtca gtcctttctc aatcactcac actctcgcca      5400
     gatttctcga tgggaaactc cggcggagtc ttcggtatcc tggtccagtc caacgtgact      5460
     acctatcccg aaattgagcg tatccttctt cctgatatcg gcgccttttt cgatacaaag      5520
     ccggatattg taaccggcct tagctccttc ctcggggctg cctccgaaaa catggagcgc      5580
     gtatacaatg ccattgtgga aaaacttgcc gaacaggaaa cgattcgacg gatcgaagag      5640
     cttctcccat ccgtcggccc gacgactttc gcgacacttt accgatgcgc gaactcacgg      5700
     aacctcccgt ttctttcaac gcttcgaaaa tatctcaaca gagggaagcg tatagattcc      5760
     tttgcctatc gatgcaggtc tgaaagtccg agtaaggtcg agatctgctg gggcctgatt      5820
     gatgagttct ttccacacca caaggcccgg tttgaagttg tgcttcgctc tgccctcgcc      5880
     gagggataca tcgagcgcct tatcccggaa gagcttattg agtctcgctc ttcaagggct      5940
     gttcagacgg cgatccgatg cgcaaatagc gaagttttca aacggtacat cacgttccgt      6000
     gactgcagcg acgcgacgct gttgcttctg gcccacacga tgcacgacat gggcaggaat      6060
     gatctctcgg aggtcgcagc tgaccgagtt gcaggcagga cgacctcttc aatctggtat      6120
     catcgtcgca ccggtggcag ggcgttgctg actattttgc ggagagcatc gatatctgca      6180
     gaaggagatg ttcagaaaat tctgatgcgg cttgaggctg aaggaaaaat gagggccatt      6240
     gtgaatggaa tgcggcctta tcgcctagcg aattttattt tcgtgatctg ggatcggcac      6300
     gagcaattta cttcattcat ctcgaagaca gatcttcagg aaattacaaa ccgccggttc      6360
     aaagcgcgag cggcagagtt ctctgaagag cgacaagcgt ccatctacat tgcaggaatc      6420
     tatgcgctgg taggcctcga cataccgcgg gacgagtgga gcgcggtcga cgtcactgaa      6480
     gacgatttca ttggaaacca gaacaacccg gtcttctgga tcggtctcaa ggctctggaa      6540
     gaaaatggca tgatacgcct tgcccatcga agcagatttc cgacatctgt cgcggcgcta      6600
     gatactcatt cggaaaacac cagccggatc atgaacgatt tgaaaaactg ggctgcgacc      6660
     aggtaa                                                                 6666
//