NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F021936


Overview

Basic Information
Family ID F021936
Family Type Metagenome / Metatranscriptome
Number of Sequences 216
Average Sequence Length 130 residues
Representative Sequence VSTVVLAALLICLGAGAARAQVQTRSVDSDLRVEWTGSEDRRGRPVVSGYVYNQRAGSYAVSVRLRVEALDGSGQVAGSTIGYVLGEVPPSNRSYFEIKAPAKAASYRVTIESFAWRAYGAGGG
Number of Associated Samples 184
Number of Associated Scaffolds 216

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 71.16 %
% of genes near scaffold ends (potentially truncated) 33.33 %
% of genes from short scaffolds (< 2000 bps) 68.06 %
Associated GOLD sequencing projects 173
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.70

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Bacteria (57.407 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil
(14.352 % of family members)
Environment Ontology (ENVO) Unclassified
(37.037 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(40.741 % of family members)
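
The percentages above are stated as shares of family members, so they can be converted back into member counts. The sketch below assumes that each percentage uses the 216 sequences from Basic Information as its denominator; the helper name `members_from_percent` is hypothetical and not part of NMPFamsDB.

```python
FAMILY_SIZE = 216  # "Number of Sequences" from Basic Information


def members_from_percent(percent: float, total: int = FAMILY_SIZE) -> int:
    """Convert a reported percentage back to a whole number of members."""
    return round(percent / 100 * total)


# Percentages copied from the overview; the labels are shortened for display.
reported = {
    "Taxonomy: Bacteria": 57.407,
    "GOLD: Soil": 14.352,
    "ENVO: Unclassified": 37.037,
    "EMPO: Plant rhizosphere": 40.741,
}

counts = {label: members_from_percent(pct) for label, pct in reported.items()}

for label, n in counts.items():
    print(f"{label}: {n} of {FAMILY_SIZE}")
```

Rounding to the nearest integer is a design choice here: the page reports three decimal places, which is enough to recover a whole count for a family of this size.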




Multiple Sequence Alignments




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: Yes
Secondary Structure distribution: α-helix: 11.18%, β-sheet: 40.13%, Coil/Unstructured: 48.68%

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT) bands: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.70
Powered by PDBe Molstar
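
The four pLDDT bands listed for the structure viewer can be applied to individual per-residue scores. The following is a minimal sketch, not the viewer's own code: the function name `plddt_band` is hypothetical, and it assumes that a fractional score just above a boundary (for example 50.4) belongs to the higher band, which the page does not specify.

```python
# Upper bound of each band paired with the label shown on the page.
PLDDT_BANDS = [(50, "0-50"), (70, "51-70"), (90, "71-90"), (100, "91-100")]


def plddt_band(score: float) -> str:
    """Return the label of the pLDDT band that contains the given score."""
    if not 0 <= score <= 100:
        raise ValueError(f"pLDDT must be between 0 and 100, got {score}")
    for upper, label in PLDDT_BANDS:
        if score <= upper:
            return label
    raise AssertionError("unreachable: score is at most 100")


for value in (42.0, 65.3, 88.0, 97.5):
    print(value, "->", plddt_band(value))
```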

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 216 Family Scaffolds
PF06727 | DUF1207 | 31.02
PF02537 | CRCB | 11.11
PF01425 | Amidase | 8.80
PF02641 | DUF190 | 6.48
PF00582 | Usp | 1.85
PF06172 | Cupin_5 | 1.39
PF02627 | CMD | 0.93
PF03641 | Lysine_decarbox | 0.93
PF00561 | Abhydrolase_1 | 0.46
PF03098 | An_peroxidase | 0.46
PF00005 | ABC_tran | 0.46
PF00793 | DAHP_synth_1 | 0.46
PF00483 | NTP_transferase | 0.46
PF14870 | PSII_BNR | 0.46
PF12704 | MacB_PCD | 0.46
PF00027 | cNMP_binding | 0.46
PF08811 | DUF1800 | 0.46
PF02687 | FtsX | 0.46

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 216 Family Scaffolds
COG0239 | Fluoride ion exporter CrcB/FEX, affects chromosome condensation | Cell cycle control, cell division, chromosome partitioning [D] | 11.11
COG0154 | Asp-tRNAAsn/Glu-tRNAGln amidotransferase A subunit or related amidase | Translation, ribosomal structure and biogenesis [J] | 8.80
COG1993 | PII-like signaling protein | Signal transduction mechanisms [T] | 6.48
COG3542 | Predicted sugar epimerase, cupin superfamily | General function prediction only [R] | 1.39
COG0599 | Uncharacterized conserved protein YurZ, alkylhydroperoxidase/carboxymuconolactone decarboxylase family | General function prediction only [R] | 0.93
COG1611 | Nucleotide monophosphate nucleosidase PpnN/YdgH, Lonely Guy (LOG) family | Nucleotide transport and metabolism [F] | 0.93
COG2128 | Alkylhydroperoxidase family enzyme, contains CxxC motif | Inorganic ion transport and metabolism [P] | 0.93
COG5267 | Uncharacterized conserved protein, DUF1800 family | Function unknown [S] | 0.46



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 57.41 %
Unclassified | root | N/A | 42.59 %


Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
2162886012|MBSR1b_contig_6046737All Organisms → cellular organisms → Bacteria1956Open in IMG/M
3300000881|JGI10215J12807_1220390All Organisms → cellular organisms → Bacteria → Proteobacteria1543Open in IMG/M
3300000891|JGI10214J12806_11810723Not Available598Open in IMG/M
3300002245|JGIcombinedJ26739_101065251Not Available695Open in IMG/M
3300002886|JGI25612J43240_1056709Not Available590Open in IMG/M
3300003324|soilH2_10088653All Organisms → cellular organisms → Bacteria10719Open in IMG/M
3300003503|JGI26141J51220_1013166Not Available555Open in IMG/M
3300004009|Ga0055437_10070595All Organisms → cellular organisms → Bacteria978Open in IMG/M
3300004022|Ga0055432_10015299All Organisms → cellular organisms → Bacteria1520Open in IMG/M
3300004058|Ga0055498_10057740Not Available702Open in IMG/M
3300004114|Ga0062593_100008779All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria4552Open in IMG/M
3300004156|Ga0062589_100115314All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla1753Open in IMG/M
3300004463|Ga0063356_100238559All Organisms → cellular organisms → Bacteria → Proteobacteria2190Open in IMG/M
3300004643|Ga0062591_100507995Not Available1037Open in IMG/M
3300004643|Ga0062591_102020028Not Available595Open in IMG/M
3300005183|Ga0068993_10003431All Organisms → cellular organisms → Bacteria3004Open in IMG/M
3300005294|Ga0065705_10624724Not Available692Open in IMG/M
3300005295|Ga0065707_10222884All Organisms → cellular organisms → Bacteria1218Open in IMG/M
3300005295|Ga0065707_11064663Not Available523Open in IMG/M
3300005328|Ga0070676_10898668Not Available659Open in IMG/M
3300005336|Ga0070680_100078107All Organisms → cellular organisms → Bacteria → Proteobacteria2727Open in IMG/M
3300005341|Ga0070691_10180733All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla1098Open in IMG/M
3300005406|Ga0070703_10007204All Organisms → cellular organisms → Bacteria3122Open in IMG/M
3300005441|Ga0070700_100204183All Organisms → cellular organisms → Bacteria1390Open in IMG/M
3300005444|Ga0070694_100012768All Organisms → cellular organisms → Bacteria → Proteobacteria5233Open in IMG/M
3300005444|Ga0070694_100086757All Organisms → cellular organisms → Bacteria2188Open in IMG/M
3300005445|Ga0070708_100294715All Organisms → cellular organisms → Bacteria → Proteobacteria1527Open in IMG/M
3300005445|Ga0070708_100743449All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium922Open in IMG/M
3300005459|Ga0068867_100094333All Organisms → cellular organisms → Bacteria2275Open in IMG/M
3300005467|Ga0070706_100392157All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla1292Open in IMG/M
3300005546|Ga0070696_101923160Not Available513Open in IMG/M
3300005578|Ga0068854_101191201Not Available682Open in IMG/M
3300005834|Ga0068851_10052875All Organisms → cellular organisms → Bacteria2066Open in IMG/M
3300006806|Ga0079220_12132272Not Available502Open in IMG/M
3300006854|Ga0075425_100156186All Organisms → cellular organisms → Bacteria → Proteobacteria2615Open in IMG/M
3300006914|Ga0075436_100250268All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla1262Open in IMG/M
3300006954|Ga0079219_10382689All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria922Open in IMG/M
3300007265|Ga0099794_10048549All Organisms → cellular organisms → Bacteria2037Open in IMG/M
3300007265|Ga0099794_10052513All Organisms → cellular organisms → Bacteria → Proteobacteria1965Open in IMG/M
3300009078|Ga0105106_10674766Not Available739Open in IMG/M
3300009089|Ga0099828_10046857All Organisms → cellular organisms → Bacteria3569Open in IMG/M
3300009098|Ga0105245_12894070Not Available532Open in IMG/M
3300009147|Ga0114129_10024971All Organisms → cellular organisms → Bacteria8472Open in IMG/M
3300009147|Ga0114129_10129380All Organisms → cellular organisms → Bacteria → Proteobacteria3470Open in IMG/M
3300009148|Ga0105243_12285758Not Available578Open in IMG/M
3300009174|Ga0105241_10000922All Organisms → cellular organisms → Bacteria22252Open in IMG/M
3300009176|Ga0105242_12915862Not Available529Open in IMG/M
3300009553|Ga0105249_10049643All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria3826Open in IMG/M
3300009553|Ga0105249_12490567Not Available590Open in IMG/M
3300009804|Ga0105063_1030014Not Available695Open in IMG/M
3300009806|Ga0105081_1047756Not Available617Open in IMG/M
3300009810|Ga0105088_1106428Not Available523Open in IMG/M
3300010375|Ga0105239_10519246All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla1355Open in IMG/M
3300010397|Ga0134124_10081380All Organisms → cellular organisms → Bacteria2783Open in IMG/M
3300010399|Ga0134127_10033819All Organisms → cellular organisms → Bacteria → Proteobacteria4111Open in IMG/M
3300010401|Ga0134121_10080575All Organisms → cellular organisms → Bacteria → Proteobacteria2707Open in IMG/M
3300011270|Ga0137391_11259385Not Available587Open in IMG/M
3300011271|Ga0137393_10010404All Organisms → cellular organisms → Bacteria6399Open in IMG/M
3300011419|Ga0137446_1051924Not Available918Open in IMG/M
3300011438|Ga0137451_1119509Not Available813Open in IMG/M
3300012096|Ga0137389_10029198All Organisms → cellular organisms → Bacteria → Proteobacteria3981Open in IMG/M
3300012096|Ga0137389_10081848All Organisms → cellular organisms → Bacteria2524Open in IMG/M
3300012202|Ga0137363_10226585All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria1511Open in IMG/M
3300012203|Ga0137399_10232858All Organisms → cellular organisms → Bacteria1506Open in IMG/M
3300012205|Ga0137362_10819165Not Available797Open in IMG/M
3300012225|Ga0137434_1004362Not Available1378Open in IMG/M
3300012360|Ga0137375_10043544All Organisms → cellular organisms → Bacteria4960Open in IMG/M
3300012361|Ga0137360_10214213All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla1566Open in IMG/M
3300012362|Ga0137361_10357869All Organisms → cellular organisms → Bacteria1339Open in IMG/M
3300012902|Ga0157291_10310203Not Available551Open in IMG/M
3300012917|Ga0137395_10565327Not Available821Open in IMG/M
3300012922|Ga0137394_10017820All Organisms → cellular organisms → Bacteria → Proteobacteria5651Open in IMG/M
3300012923|Ga0137359_10061657All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria3268Open in IMG/M
3300012929|Ga0137404_10283969All Organisms → cellular organisms → Bacteria1431Open in IMG/M
3300012930|Ga0137407_10113409All Organisms → cellular organisms → Bacteria → Proteobacteria2343Open in IMG/M
3300012944|Ga0137410_10377631Not Available1138Open in IMG/M
3300012958|Ga0164299_11381031Not Available543Open in IMG/M
3300012986|Ga0164304_10525538Not Available869Open in IMG/M
3300013296|Ga0157374_12026828Not Available602Open in IMG/M
3300013307|Ga0157372_10172928All Organisms → cellular organisms → Bacteria2499Open in IMG/M
3300014262|Ga0075301_1069455Not Available714Open in IMG/M
3300014325|Ga0163163_11187698Not Available826Open in IMG/M
3300014745|Ga0157377_10029192All Organisms → cellular organisms → Bacteria → Proteobacteria2975Open in IMG/M
3300014873|Ga0180066_1070055Not Available710Open in IMG/M
3300014873|Ga0180066_1126142Not Available527Open in IMG/M
3300014881|Ga0180094_1065697Not Available795Open in IMG/M
3300014884|Ga0180104_1140352Not Available708Open in IMG/M
3300014885|Ga0180063_1019238All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla1859Open in IMG/M
3300015259|Ga0180085_1114021Not Available801Open in IMG/M
3300015262|Ga0182007_10162479Not Available764Open in IMG/M
3300015264|Ga0137403_10166227All Organisms → cellular organisms → Bacteria2161Open in IMG/M
3300015264|Ga0137403_11166336Not Available616Open in IMG/M
3300015371|Ga0132258_13675864All Organisms → cellular organisms → Bacteria1047Open in IMG/M
3300015373|Ga0132257_103678225Not Available558Open in IMG/M
3300017939|Ga0187775_10004093All Organisms → cellular organisms → Bacteria3475Open in IMG/M
3300017997|Ga0184610_1017313All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1899Open in IMG/M
3300018000|Ga0184604_10007616All Organisms → cellular organisms → Bacteria2144Open in IMG/M
3300018028|Ga0184608_10261502Not Available761Open in IMG/M
3300018031|Ga0184634_10258243Not Available799Open in IMG/M
3300018031|Ga0184634_10535472Not Available520Open in IMG/M
3300018052|Ga0184638_1257791Not Available598Open in IMG/M
3300018053|Ga0184626_10082780All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla1356Open in IMG/M
3300018063|Ga0184637_10031223All Organisms → cellular organisms → Bacteria3227Open in IMG/M
3300018063|Ga0184637_10197054All Organisms → cellular organisms → Bacteria1236Open in IMG/M
3300018074|Ga0184640_10209350Not Available881Open in IMG/M
3300018074|Ga0184640_10525721Not Available518Open in IMG/M
3300018075|Ga0184632_10094608All Organisms → cellular organisms → Bacteria1308Open in IMG/M
3300018076|Ga0184609_10264001Not Available805Open in IMG/M
3300018077|Ga0184633_10221112Not Available976Open in IMG/M
3300018084|Ga0184629_10059602All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla1770Open in IMG/M
3300018422|Ga0190265_10061772All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria3333Open in IMG/M
3300018422|Ga0190265_10213258All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla1952Open in IMG/M
3300018422|Ga0190265_10224887All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla1907Open in IMG/M
3300018429|Ga0190272_10061590All Organisms → cellular organisms → Bacteria2229Open in IMG/M
3300018429|Ga0190272_10087765All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla1952Open in IMG/M
3300019360|Ga0187894_10033039All Organisms → cellular organisms → Bacteria3296Open in IMG/M
3300019377|Ga0190264_11597412Not Available573Open in IMG/M
3300019458|Ga0187892_10017045All Organisms → cellular organisms → Bacteria7554Open in IMG/M
3300019458|Ga0187892_10024187All Organisms → cellular organisms → Bacteria5456Open in IMG/M
3300019458|Ga0187892_10045791All Organisms → cellular organisms → Bacteria → Proteobacteria3118Open in IMG/M
3300019487|Ga0187893_10012567All Organisms → cellular organisms → Bacteria11976Open in IMG/M
3300019487|Ga0187893_10061793All Organisms → cellular organisms → Bacteria3649Open in IMG/M
3300019487|Ga0187893_10448996Not Available857Open in IMG/M
3300019879|Ga0193723_1011141All Organisms → cellular organisms → Bacteria2870Open in IMG/M
3300019881|Ga0193707_1058328All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria1218Open in IMG/M
3300019882|Ga0193713_1013243All Organisms → cellular organisms → Bacteria2486Open in IMG/M
3300019883|Ga0193725_1021162All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1778Open in IMG/M
3300019883|Ga0193725_1052487Not Available1039Open in IMG/M
3300019885|Ga0193747_1147591Not Available537Open in IMG/M
3300019890|Ga0193728_1087148All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla1462Open in IMG/M
3300019997|Ga0193711_1012545All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla1067Open in IMG/M
3300020002|Ga0193730_1032502All Organisms → cellular organisms → Bacteria1512Open in IMG/M
3300020003|Ga0193739_1056992All Organisms → cellular organisms → Bacteria1002Open in IMG/M
3300020004|Ga0193755_1099928Not Available919Open in IMG/M
3300020006|Ga0193735_1168763Not Available550Open in IMG/M
3300020021|Ga0193726_1040099All Organisms → cellular organisms → Bacteria → Proteobacteria2278Open in IMG/M
3300020070|Ga0206356_11837515All Organisms → cellular organisms → Bacteria713Open in IMG/M
3300020081|Ga0206354_10563096Not Available523Open in IMG/M
3300021073|Ga0210378_10137767Not Available944Open in IMG/M
3300021073|Ga0210378_10179715Not Available811Open in IMG/M
3300021081|Ga0210379_10158724All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla964Open in IMG/M
3300021090|Ga0210377_10012080All Organisms → cellular organisms → Bacteria6610Open in IMG/M
3300022534|Ga0224452_1174936Not Available661Open in IMG/M
3300025324|Ga0209640_10126491All Organisms → cellular organisms → Bacteria2195Open in IMG/M
3300025560|Ga0210108_1070040Not Available684Open in IMG/M
3300025580|Ga0210138_1009736Not Available1864Open in IMG/M
3300025904|Ga0207647_10356950Not Available827Open in IMG/M
3300025907|Ga0207645_10040902All Organisms → cellular organisms → Bacteria2969Open in IMG/M
3300025910|Ga0207684_10028592All Organisms → cellular organisms → Bacteria4750Open in IMG/M
3300025910|Ga0207684_10257005All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla1507Open in IMG/M
3300025910|Ga0207684_10597707All Organisms → cellular organisms → Bacteria942Open in IMG/M
3300025917|Ga0207660_10216626All Organisms → cellular organisms → Bacteria1501Open in IMG/M
3300025917|Ga0207660_11521052Not Available540Open in IMG/M
3300025933|Ga0207706_10096717All Organisms → cellular organisms → Bacteria → Proteobacteria2597Open in IMG/M
3300025934|Ga0207686_11816382Not Available505Open in IMG/M
3300025942|Ga0207689_10001698All Organisms → cellular organisms → Bacteria20909Open in IMG/M
3300025949|Ga0207667_11727652Not Available592Open in IMG/M
3300025960|Ga0207651_10876936Not Available798Open in IMG/M
3300025961|Ga0207712_10232710All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla1480Open in IMG/M
3300026075|Ga0207708_10177154All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium1691Open in IMG/M
3300026118|Ga0207675_100325774All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla1501Open in IMG/M
3300026285|Ga0209438_1002036All Organisms → cellular organisms → Bacteria6739Open in IMG/M
3300026285|Ga0209438_1017239All Organisms → cellular organisms → Bacteria2416Open in IMG/M
3300026320|Ga0209131_1002906All Organisms → cellular organisms → Bacteria11502Open in IMG/M
3300026340|Ga0257162_1005746All Organisms → cellular organisms → Bacteria1416Open in IMG/M
3300026356|Ga0257150_1068381Not Available536Open in IMG/M
3300026358|Ga0257166_1045375Not Available621Open in IMG/M
3300026359|Ga0257163_1004525All Organisms → cellular organisms → Bacteria1957Open in IMG/M
3300026376|Ga0257167_1010478All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla1229Open in IMG/M
3300026377|Ga0257171_1000091All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria6768Open in IMG/M
3300026475|Ga0257147_1042108Not Available673Open in IMG/M
3300026480|Ga0257177_1030528Not Available794Open in IMG/M
3300026482|Ga0257172_1058265Not Available708Open in IMG/M
3300026496|Ga0257157_1003935All Organisms → cellular organisms → Bacteria2216Open in IMG/M
3300026499|Ga0257181_1006632All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla1423Open in IMG/M
3300026508|Ga0257161_1003524All Organisms → cellular organisms → Bacteria2686Open in IMG/M
3300026514|Ga0257168_1001707All Organisms → cellular organisms → Bacteria → Proteobacteria3013Open in IMG/M
3300026514|Ga0257168_1142956Not Available533Open in IMG/M
3300026551|Ga0209648_10004215All Organisms → cellular organisms → Bacteria12180Open in IMG/M
3300026551|Ga0209648_10574824Not Available625Open in IMG/M
3300027378|Ga0209981_1043939Not Available671Open in IMG/M
3300027471|Ga0209995_1001389All Organisms → cellular organisms → Bacteria → Proteobacteria3731Open in IMG/M
3300027543|Ga0209999_1021715All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla1179Open in IMG/M
3300027725|Ga0209178_1441119Not Available500Open in IMG/M
3300027765|Ga0209073_10176824Not Available801Open in IMG/M
3300027775|Ga0209177_10187304Not Available728Open in IMG/M
3300027815|Ga0209726_10055993All Organisms → cellular organisms → Bacteria → Proteobacteria2684Open in IMG/M
3300027903|Ga0209488_10124602All Organisms → cellular organisms → Bacteria → Proteobacteria1940Open in IMG/M
3300027903|Ga0209488_10706134Not Available722Open in IMG/M
3300027954|Ga0209859_1068633Not Available565Open in IMG/M
3300028047|Ga0209526_10007081All Organisms → cellular organisms → Bacteria7697Open in IMG/M
3300028380|Ga0268265_10048123All Organisms → cellular organisms → Bacteria3199Open in IMG/M
3300028673|Ga0257175_1016222All Organisms → cellular organisms → Bacteria1193Open in IMG/M
3300028792|Ga0307504_10330895Not Available582Open in IMG/M
3300028803|Ga0307281_10016555All Organisms → cellular organisms → Bacteria2092Open in IMG/M
3300028803|Ga0307281_10029937All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla1628Open in IMG/M
3300028824|Ga0307310_10749293Not Available503Open in IMG/M
3300028828|Ga0307312_10269125All Organisms → cellular organisms → Bacteria → Proteobacteria1106Open in IMG/M
3300028885|Ga0307304_10218595All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria821Open in IMG/M
3300030606|Ga0299906_11067181Not Available588Open in IMG/M
(restricted) 3300031150|Ga0255311_1005863All Organisms → cellular organisms → Bacteria2408Open in IMG/M
3300031152|Ga0307501_10223212Not Available549Open in IMG/M
(restricted) 3300031197|Ga0255310_10001863All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria5406Open in IMG/M
3300031455|Ga0307505_10557896Not Available555Open in IMG/M
3300031720|Ga0307469_10318364All Organisms → cellular organisms → Bacteria1290Open in IMG/M
3300031720|Ga0307469_11073091Not Available755Open in IMG/M
3300031949|Ga0214473_10002493All Organisms → cellular organisms → Bacteria22609Open in IMG/M
3300032174|Ga0307470_10111624All Organisms → cellular organisms → Bacteria1592Open in IMG/M
3300032180|Ga0307471_103559429Not Available551Open in IMG/M
3300033432|Ga0326729_1050008Not Available645Open in IMG/M
3300033433|Ga0326726_10015393All Organisms → cellular organisms → Bacteria6672Open in IMG/M
3300033550|Ga0247829_11474509Not Available562Open in IMG/M
3300033551|Ga0247830_10644255Not Available840Open in IMG/M
3300033811|Ga0364924_122406Not Available601Open in IMG/M
3300034090|Ga0326723_0036231All Organisms → cellular organisms → Bacteria2061Open in IMG/M

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
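
Each scaffold identifier listed above appears to have the form `<Taxon OID>|<scaffold name>`, optionally preceded by a `(restricted)` marker, and the part before the `|` matches the Taxon OID column of the Associated Samples table (for example `2162886012`). The sketch below splits an identifier on that reading, which is inferred from this page rather than from any NMPFamsDB or IMG/M specification; `ScaffoldRef` and `parse_scaffold_id` are hypothetical names.

```python
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    taxon_oid: str       # numeric dataset identifier before the "|"
    scaffold_name: str   # scaffold name after the "|"
    restricted: bool     # True if the entry carried a "(restricted)" marker


def parse_scaffold_id(text: str) -> ScaffoldRef:
    """Split '<Taxon OID>|<scaffold name>' into its parts."""
    marker = "(restricted)"
    text = text.strip()
    restricted = text.startswith(marker)
    if restricted:
        text = text[len(marker):].strip()
    taxon_oid, sep, scaffold_name = text.partition("|")
    if not sep or not taxon_oid.isdigit() or not scaffold_name:
        raise ValueError(f"not a scaffold identifier: {text!r}")
    return ScaffoldRef(taxon_oid, scaffold_name, restricted)


ref = parse_scaffold_id("(restricted) 3300031150|Ga0255311_1005863")
print(ref.taxon_oid, ref.scaffold_name, ref.restricted)
```

Keeping the Taxon OID as a string avoids any assumption about its numeric range and makes it directly comparable with the sample table entries.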




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 14.35%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 10.65%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 7.41%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 6.94%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 6.02%
Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 4.17%
Natural And Restored Wetlands | Environmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands | 2.78%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 2.78%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 2.31%
Arabidopsis Thaliana Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere | 2.31%
Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 1.85%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 1.85%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 1.85%
Groundwater Sand | Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand | 1.85%
Microbial Mat On Rocks | Environmental → Terrestrial → Cave → Unclassified → Unclassified → Microbial Mat On Rocks | 1.85%
Corn Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere | 1.85%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 1.85%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 1.85%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 1.39%
Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Switchgrass Rhizosphere | 1.39%
Corn Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere | 1.39%
Peat Soil | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Peat Soil | 1.39%
Bio-Ooze | Environmental → Terrestrial → Cave → Unclassified → Unclassified → Bio-Ooze | 1.39%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Switchgrass Rhizosphere | 1.39%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Corn, Switchgrass And Miscanthus Rhizosphere | 0.93%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 0.93%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 0.93%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 0.93%
Sandy Soil | Environmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil | 0.93%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 0.93%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 0.93%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere | 0.93%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere | 0.93%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 0.93%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Miscanthus Rhizosphere | 0.93%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Freshwater Sediment | 0.46%
Groundwater | Environmental → Aquatic → Freshwater → Groundwater → Contaminated → Groundwater | 0.46%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 0.46%
Sugarcane Root And Bulk Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Sugarcane Root And Bulk Soil | 0.46%
Soil | Environmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil | 0.46%
Natural And Restored Wetlands | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Natural And Restored Wetlands | 0.46%
Tropical Peatland | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland | 0.46%
Soil | Environmental → Terrestrial → Soil → Loam → Unclassified → Soil | 0.46%
Sediment | Environmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment | 0.46%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Miscanthus Rhizosphere | 0.46%
Switchgrass Rhizosphere | Host-Associated → Plants → Roots → Rhizosphere → Soil → Switchgrass Rhizosphere | 0.46%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere | 0.46%
Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Rhizosphere | 0.46%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere | 0.46%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere | 0.46%




Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OIDSample NameHabitat TypeIMG/M Link
2162886012Miscanthus rhizosphere microbial communities from Kellogg Biological Station, MSU, sample Bulk Soil Replicate 1 : eDNA_1Host-AssociatedOpen in IMG/M
3300000881Soil microbial communities from Great Prairies - Wisconsin Restored Prairie soilEnvironmentalOpen in IMG/M
3300000891Soil microbial communities from Great Prairies - Wisconsin, Continuous corn soilEnvironmentalOpen in IMG/M
3300002245Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027)EnvironmentalOpen in IMG/M
3300002886Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_20cmEnvironmentalOpen in IMG/M
3300003324Sugarcane bulk soil Sample H2EnvironmentalOpen in IMG/M
3300003503Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Inoculated plant M3 S AMHost-AssociatedOpen in IMG/M
3300004009Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqC_D2EnvironmentalOpen in IMG/M
3300004022Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqA_D1EnvironmentalOpen in IMG/M
3300004058Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqB_D1EnvironmentalOpen in IMG/M
3300004114Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 5EnvironmentalOpen in IMG/M
3300004156Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 1EnvironmentalOpen in IMG/M
3300004463Combined assembly of Arabidopsis thaliana microbial communitiesHost-AssociatedOpen in IMG/M
3300004643Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Combined assembly of AARS Block 3EnvironmentalOpen in IMG/M
3300005183Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqC_D1EnvironmentalOpen in IMG/M
3300005294Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL2 Bulk SoilEnvironmentalOpen in IMG/M
3300005295Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL3EnvironmentalOpen in IMG/M
3300005328Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-3 metaGHost-AssociatedOpen in IMG/M
3300005336Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaGEnvironmentalOpen in IMG/M
3300005341Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-10-1 metaGEnvironmentalOpen in IMG/M
3300005406Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-1 metaGEnvironmentalOpen in IMG/M
3300005441Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-10-1 metaGEnvironmentalOpen in IMG/M
3300005444Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-1 metaGEnvironmentalOpen in IMG/M
3300005445Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaGEnvironmentalOpen in IMG/M
3300005459Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M3-2Host-AssociatedOpen in IMG/M
3300005467Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaGEnvironmentalOpen in IMG/M
3300005546Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaGEnvironmentalOpen in IMG/M
3300005578Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C4-2Host-AssociatedOpen in IMG/M
3300005834Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C1-2Host-AssociatedOpen in IMG/M
3300006806Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100EnvironmentalOpen in IMG/M
3300006854Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4Host-AssociatedOpen in IMG/M
3300006914Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD5Host-AssociatedOpen in IMG/M
3300006954Agricultural soil microbial communities from Georgia to study Nitrogen management - GA ControlEnvironmentalOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300009078Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (1) Depth 10-12cm September2015EnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300009098Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-4 metaGHost-AssociatedOpen in IMG/M
3300009147Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2)Host-AssociatedOpen in IMG/M
3300009148Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M3-4 metaGHost-AssociatedOpen in IMG/M
3300009174Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-4 metaGHost-AssociatedOpen in IMG/M
3300009176Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-4 metaGHost-AssociatedOpen in IMG/M
3300009553Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-4 metaGHost-AssociatedOpen in IMG/M
3300009804Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_30_40EnvironmentalOpen in IMG/M
3300009806Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S1_50_60EnvironmentalOpen in IMG/M
3300009810Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_20_30EnvironmentalOpen in IMG/M
3300010375Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C4-4 metaGHost-AssociatedOpen in IMG/M
3300010397Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-4EnvironmentalOpen in IMG/M
3300010399Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-3EnvironmentalOpen in IMG/M
3300010401Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1EnvironmentalOpen in IMG/M
3300011270Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaGEnvironmentalOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300011419Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT357_2EnvironmentalOpen in IMG/M
3300011438Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT500_2EnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012225Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT860_2EnvironmentalOpen in IMG/M
3300012360Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_113_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012902Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-S169-409C-1EnvironmentalOpen in IMG/M
3300012917Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaGEnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012923Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaGEnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012930Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012958Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_221_MGEnvironmentalOpen in IMG/M
3300012986Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_217_MGEnvironmentalOpen in IMG/M
3300013296Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M2-5 metaGHost-AssociatedOpen in IMG/M
3300013307Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - C5-5 metaGHost-AssociatedOpen in IMG/M
3300014262Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_TuleA_D1EnvironmentalOpen in IMG/M
3300014325Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S6-5 metaGHost-AssociatedOpen in IMG/M
3300014745Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M5-5 metaGHost-AssociatedOpen in IMG/M
3300014873Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT200B_16_10DEnvironmentalOpen in IMG/M
3300014881Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLIBT47_16_1DaEnvironmentalOpen in IMG/M
3300014884Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT730_16_1DaEnvironmentalOpen in IMG/M
3300014885Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLIBT47_16_10DEnvironmentalOpen in IMG/M
3300015259Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT730_16_10DEnvironmentalOpen in IMG/M
3300015262Rhizosphere microbial communities from Sorghum bicolor, Mead, Nebraska, USA - 072115-113_1 MetaGHost-AssociatedOpen in IMG/M
3300015264Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015371Combined assembly of cpr5 and col0 rhizosphere and soilHost-AssociatedOpen in IMG/M
3300015373Combined assembly of cpr5 rhizosphereHost-AssociatedOpen in IMG/M
3300017939Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_BV02_MP12_10_MGEnvironmentalOpen in IMG/M
3300017997Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_coexEnvironmentalOpen in IMG/M
3300018000Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_5_coexEnvironmentalOpen in IMG/M
3300018028Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_30_coexEnvironmentalOpen in IMG/M
3300018031Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_200_b1EnvironmentalOpen in IMG/M
3300018052Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b2EnvironmentalOpen in IMG/M
3300018053Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_60_b1EnvironmentalOpen in IMG/M
3300018063Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b2EnvironmentalOpen in IMG/M
3300018074Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_200_b2EnvironmentalOpen in IMG/M
3300018075Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b1EnvironmentalOpen in IMG/M
3300018076Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_coexEnvironmentalOpen in IMG/M
3300018077Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_170_b1EnvironmentalOpen in IMG/M
3300018084Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32_b1EnvironmentalOpen in IMG/M
3300018422Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 124 TEnvironmentalOpen in IMG/M
3300018429Populus adjacent soil microbial communities from riparian zone of Shoshone River, Wyoming, USA - 504 TEnvironmentalOpen in IMG/M
3300019360White microbial mat communities from a lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - GBC170108-1 metaGEnvironmentalOpen in IMG/M
3300019377Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 112 TEnvironmentalOpen in IMG/M
3300019458Bio-ooze microbial communities from a basaltic lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - MA170107-3 metaGEnvironmentalOpen in IMG/M
3300019487White microbial mat communities from a basaltic lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - MA170107-4 metaGEnvironmentalOpen in IMG/M
3300019879Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2m2EnvironmentalOpen in IMG/M
3300019881Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U3c2EnvironmentalOpen in IMG/M
3300019882Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H3a2EnvironmentalOpen in IMG/M
3300019883Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2a2EnvironmentalOpen in IMG/M
3300019885Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L1m2EnvironmentalOpen in IMG/M
3300019890Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1c1EnvironmentalOpen in IMG/M
3300019997Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H3m2EnvironmentalOpen in IMG/M
3300020002Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1a1EnvironmentalOpen in IMG/M
3300020003Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L2a2EnvironmentalOpen in IMG/M
3300020004Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H1a2EnvironmentalOpen in IMG/M
3300020006Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1m2EnvironmentalOpen in IMG/M
3300020021Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2c1EnvironmentalOpen in IMG/M
3300020070Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - Diel MetaT C5am-1 (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300020081Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - Diel MetaT C5am-3 (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300021073Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_b1 redoEnvironmentalOpen in IMG/M
3300021081Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32_coex redoEnvironmentalOpen in IMG/M
3300021090Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_65_b1 redoEnvironmentalOpen in IMG/M
3300022534Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM2_30_b1EnvironmentalOpen in IMG/M
3300025324Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 10_1 (SPAdes)EnvironmentalOpen in IMG/M
3300025560Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqB_D2 (SPAdes)EnvironmentalOpen in IMG/M
3300025580Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqC_D1 (SPAdes)EnvironmentalOpen in IMG/M
3300025904Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - C2 (SPAdes)Host-AssociatedOpen in IMG/M
3300025907Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025910Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025917Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025933Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025934Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025942Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M5-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300025949Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C5-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300025960Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-3 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300025961Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S4-4 metaG (SPAdes)Host-AssociatedOpen in IMG/M
3300026075Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-10-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026118Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S4-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300026285Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026320Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026340Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-04-AEnvironmentalOpen in IMG/M
3300026356Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-13-AEnvironmentalOpen in IMG/M
3300026358Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-14-BEnvironmentalOpen in IMG/M
3300026359Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-06-AEnvironmentalOpen in IMG/M
3300026376Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-02-BEnvironmentalOpen in IMG/M
3300026377Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-10-BEnvironmentalOpen in IMG/M
3300026475Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-12-AEnvironmentalOpen in IMG/M
3300026480Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-07-BEnvironmentalOpen in IMG/M
3300026482Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-16-BEnvironmentalOpen in IMG/M
3300026496Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-69-AEnvironmentalOpen in IMG/M
3300026499Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-06-BEnvironmentalOpen in IMG/M
3300026508Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-01-AEnvironmentalOpen in IMG/M
3300026514Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-13-BEnvironmentalOpen in IMG/M
3300026551Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes)EnvironmentalOpen in IMG/M
3300027378Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Inoculated plant M1 PM (SPAdes)Host-AssociatedOpen in IMG/M
3300027471Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Inoculated plant M3 AM (SPAdes)Host-AssociatedOpen in IMG/M
3300027543Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Inoculated plant M1 AM (SPAdes)Host-AssociatedOpen in IMG/M
3300027725Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200 (SPAdes)EnvironmentalOpen in IMG/M
3300027765Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100 (SPAdes)EnvironmentalOpen in IMG/M
3300027775Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Control (SPAdes)EnvironmentalOpen in IMG/M
3300027815Subsurface groundwater microbial communities from S. Glens Falls, New York, USA - GMW46 contaminated, 5.4 m (SPAdes)EnvironmentalOpen in IMG/M
3300027903Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes)EnvironmentalOpen in IMG/M
3300027954Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S3_50_60 (SPAdes)EnvironmentalOpen in IMG/M
3300028047Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300028380Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S5-2 (SPAdes)Host-AssociatedOpen in IMG/M
3300028673Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-69-BEnvironmentalOpen in IMG/M
3300028792Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 19_SEnvironmentalOpen in IMG/M
3300028803Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_120EnvironmentalOpen in IMG/M
3300028824Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_197EnvironmentalOpen in IMG/M
3300028828Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_202EnvironmentalOpen in IMG/M
3300028885Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_185EnvironmentalOpen in IMG/M
3300030606Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT145D125EnvironmentalOpen in IMG/M
3300031150 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH4_T0_E4EnvironmentalOpen in IMG/M
3300031152Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 15_SEnvironmentalOpen in IMG/M
3300031197 (restricted)Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH1_T0_E1EnvironmentalOpen in IMG/M
3300031455Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 23_SEnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031949Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT98D197EnvironmentalOpen in IMG/M
3300032174Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_05EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300033432Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF6AY SIP fractionEnvironmentalOpen in IMG/M
3300033433Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF15MNEnvironmentalOpen in IMG/M
3300033550Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day4EnvironmentalOpen in IMG/M
3300033551Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day5EnvironmentalOpen in IMG/M
3300033811Sediment microbial communities from East River floodplain, Colorado, United States - 28_j17EnvironmentalOpen in IMG/M
3300034090Peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF00NEnvironmentalOpen in IMG/M


Family Sequences

Note: Some of these sequences are restricted under the data usage policy of the Joint Genome Institute (JGI). Using any of the restricted sequences or their features listed below requires obtaining a license from the corresponding author(s) of the respective datasets.

Protein ID, Sample Taxon ID, Habitat, Sequence
MBSR1b_0392.000018902162886012Miscanthus RhizosphereMAGGPEEGAAMTRSEIRRAAVTIALLALVAAAGPAGAQVLGRPADGDLRLEWTAAEDRRGRPIVSGYIYNLRGGTYATSVRLLVEALDASGATVGSTSGFVFGDVPPSDRSYFEIKAPPRAASYRVSIQTFSWRSYGAGGG
JGI10215J12807_122039023300000881SoilMRQTVDVVGTRRSIASAGSIAMLLALLIAGPVGAQITGRALAPDLRVEWSAEEDRRGRTVVSGYVYNERAGSYATGMRLRVEALDGSGQTVGSTTGYVLGDVPPSNRSYFEVKAPAKAAAYRVTVQSYTWRGYGAGGG*
JGI10214J12806_1181072313300000891SoilAVRAQIQTRSVDSDLRVEWTGSEDRRGRPIVSGYVYNQRAGGYAVSVRLQVEALDGSGQVAGSTIGYVLGEVPPSNRAYFEIKAPAKAASYRVTIESYAWRAYGAGGG*
JGIcombinedJ26739_10106525123300002245Forest SoilMAGVMAARSIEPAIRRAVALVTLLLGSAFGIVGGPVEAQVSGRPTDQDLRLEWTAGEDRRGRPIVSGYVYNQRAGLYATSMRLQVEALDASGQAVGSTTGFVFGDVPPSDRSYFEIKAPARAASYRVTIQTFAWRAYGAGGG*
JGI25612J43240_105670913300002886Grasslands SoilVTLLICSTAGVAGAQVSGRSVDEDLRLEWTAAEDRRGRPIVSGYVYNQRSGTYATSMRLQVEALDASGQAVGSTTGFVFGDVPPSGRSYFEIKAPAKAASYRVTVQGFSWRGYGAGGG*
soilH2_1008865333300003324Sugarcane Root And Bulk SoilMTRPDRRRAVVTIVLLALLAGAGGRPAGAQSFGRPTDGDLRLEWTATEDRRGRPIVSGYIYNLRGGTYATSVRLLVEALDASGVTVGSTSGFVFGDVPPSDRSYFEIKAPPAAASYRVTIQTVSWRSYGAGGG*
JGI26141J51220_101316613300003503Arabidopsis Thaliana RhizosphereMRQTVDVVGTRRSIASAGSIAMLLALLIAGPVGAQITGRALAPDLRVEWSAEEDRRGRTVVSGYVYNERAGSYATGVRLRVETLDGSGQTVGSTTGYVLGDVPPSNRSYFEVKAPAKAAAYRVTVQSYTWRGYGAGGG*
Ga0055437_1007059533300004009Natural And Restored WetlandsMRRLARGVSTTVLAALLMCLGAGAARAQVQARGVDGDLRVDWAGSEDRRGRPVVSGYVYNQRAGSYAVSVRLRVEALDGSGQVVGSTIGYVLGEVPPSNRSYFEIKAPAKAASYRVAIESFAWRAYGAGGG*
Ga0055432_1001529943300004022Natural And Restored WetlandsMRRLARGVSTTVLAALLMCLGAGAARAQVQARGVDGDLRVDWAGSEDRRGRPVVSGYVYNQRAGSYAVSVRLRVEALDGSGQVVGSTIGYVLGEVPPSNRSYFEIKAP
Ga0055498_1005774013300004058Natural And Restored WetlandsMRRLARGMSTTVLAALLMGLGAGAARAQVQTRPVDGDLRVEWAGSEDRRGRPVVSGYVYNQRAGSYAVSVRLRVEALDGSGQVVGSTIGYVLGEVPPSNRSYFEIKAPAKAASYRV
Ga0062593_10000877933300004114SoilMTRSEIRRAAVTIALLALVAAAGPAGAQVLGRPADGDLRLEWTAAEDRRGRPIVSGYIYNLRGGTYATSVRLLVEALDASGATVGSTSGFVFGDVPPSDRSYFEIKAPPRAASYRVSIQTFSWRSYGAGGG*
Ga0062589_10011531423300004156SoilMRRLARGMSTAVLAGLLMCLGGGAARAQVQTRSVESDLRVEWTGSEDRRGRSVVSGYVYNQRAGSYAVSVRLLVEALDGSGQVAGSTTGYVFGDVPPSNRSYFEIKAPAKAASYRVTIESFAWRAYGAGGG*
Ga0063356_10023855933300004463Arabidopsis Thaliana RhizosphereAVRAQIQTRSVDSDLRVEWTGSEDRRGHPIVSGYVYNQRAGGYAVSVRLRVEALDGSGQVAGSTIGYVLGEVPPSNRAYFEIKAPAKAASYRVTIESYAWRAYGAGGG*
Ga0062591_10050799523300004643SoilMRRLARGMSTAVLAGLLMCLGGGAARAQVQTRSAESDLRVEWTGSEDRRGRSVVSGYVYNQRAGSYAVSVRLLVEALDGSGQVAGSTTGYVFGDVPPSNRSYFEIKAPAKAASYRVTIESFAWRAYGAGGG*
Ga0062591_10202002823300004643SoilMRQTVDVVGNRHSIASAGSIAMLLALLIAGPVGAQITGRALAPDLRVEWSAEEDRRGRTVVSGYVYNERAGSYATGVRLRVETLDGSGQTVGSTTGYVLGDVPPSNRSYFEVKAPAKAAAYRVTVQSYTWRGYGAGGG*
Ga0068993_1000343113300005183Natural And Restored WetlandsMRRLARGVSTTVLAALLMCLGAGAARAQVQARGVDGDLRVDWAGSEDRRGRPVVSGYVYNQRAGSYAVSVRLRVEALDGSGQVVGSTIGYVLGEVPPSNRSYFEIKAPAKAASYRVAIESFAWR
Ga0065705_1062472423300005294Switchgrass RhizosphereMSTAVLVTVLVGLGAGGARAQVQTRSVDSDLRVESSDSEDRRGRPVVSGYVYNQRAGSYAISVRLRAEALDGSGQVVGSTIGYVLGDVPPSNRSYFEIKAPTRAASYRVTIESFAWRAYGAGGG*
Ga0065707_1022288433300005295Switchgrass RhizosphereMSRLARGLSTAVLAALLIGLGAGQTRAQVQARSVDSDLRVEGTDSEDRRGRPVVSGYVYNQRAGSYAVSVRLRVEALDGSGQVTGSTVGYVLGEVPPSNRSYFEIKAPAKAASYRVTIESFAWRAYGAGGG*
Ga0065707_1106466313300005295Switchgrass RhizosphereMRRLARGMSTAVLVTVLVGLGAGGARAQVQTRSVDSDLRVESSDSEDRRGRPVVSGYVYNQRAGSYAISVRLRAEALDGSGQVVGSTIGYVLGDVPPSNRSYFEIKAPTRAASYRVTIESFAWRAYGAGGG*
Ga0070676_1089866823300005328Miscanthus RhizosphereMRRLARGMSTAVLAGLLMCLGGGAARAQVQTRSVESDLRVEWTGSEDRRGRSVVSGYVYNQRAGSYAVSVRLLVEALDGSGQVAGSTTGYVFGDVPPSNRSYFEIKAPAKAA
Ga0070680_10007810743300005336Corn RhizosphereMRRLVRGVSTAVLAMLLLCLGAGAVRAQIQTRSVDSDLRVEWTGSEDRRGRPIVSGYVYNQRAGGYAVSVRLRVEALDGSGQVAGSTIGYVLGEVPPSNRAYFEIKAPAKAASYRVTIESYAWRAYGAGGG*
Ga0070691_1018073323300005341Corn, Switchgrass And Miscanthus RhizosphereVDVVGTRHSIASAGSIAMLLALLIAGPVGAQITGRALAPDLRVEWSAEEDRRGRTVVSGYVYNERAGSYATGMRLRVEALDGSGQTVGSTTGYVLGDVPPSNRSYFEVKAPAKAAAYRVTVQSYTWRGYGAGGG*
Ga0070703_1000720453300005406Corn, Switchgrass And Miscanthus RhizosphereMTRPEIRRAAVTIALLALVAAAGPASAQVLGRPADGDLRLEWTAAEDRRGRPIVSGYIYNLRGGTYATSVRLLVEALDASGATVGSTSGFVFGDVPPSDRSYFEIKAPPRAASYRVSIQTFSWRSYGAGGG*
Ga0070700_10020418313300005441Corn, Switchgrass And Miscanthus RhizosphereMTRSEIRRAAVTIALLALVAAAGPAGAQVLGRPADGDLRLEWTAAEDRRGRPIVSGYIYNLRGGTYATSVRLLVEALDASGATVGSTSGFVFGDVPPSDRSYFEIKAPPRAASYRVS
Ga0070694_10001276843300005444Corn, Switchgrass And Miscanthus RhizosphereMRRLARGMSTAVLAGLLMCLGGGAARAQVQTRSAESDLRVEWTGSEDRRGRSVVSGYVYNQRAGSYAVSVRLLVEALDGSGQVAGSTTGYVLGDVPPSNRSYFEIKAPAKAASYRVTIESFAWRAYGAGGG*
Ga0070694_10008675723300005444Corn, Switchgrass And Miscanthus RhizosphereMRQTVDVVGTRHSIASAGSIAMLLALLIAGPVGAQITGRALAPDLRVEWSAEEDRRGRTVVSGYVYNERAGSYATGMRLRVEALDGSGQTVGSTTGYVLGDVPPSNRSYFEVKAPAKAAAYRVTVQSYTWRGYGAGGG*
Ga0070708_10029471513300005445Corn, Switchgrass And Miscanthus RhizosphereMEGVGATGSIEPAIRRAVALVTLLICSTAGVAGAQVSGRAVDEDLRLEWTAAEDRRGRPIVSGYVYNQRAGTYATSMRLQVEALDASGQAVGSTTGFVFGDVPPSGRSYFEIKAPAKAASYRVTVQGFSWRGYGAGGG*
Ga0070708_10074344913300005445Corn, Switchgrass And Miscanthus RhizosphereMSRLARTMSTREIARRAGILAALLLLMGAGSVAAQGFGRPADADLRLEWAGAEDRRGRPLVSGYVYNQRPGSYATSMRLLVEALDASGQVVGSTSGFVLGDVPPSSRSYFEIRAPAKAASYRVTIQSFSWRTYGAGAG*
Ga0068867_10009433323300005459Miscanthus RhizosphereMRRLASGMSTAVLAALLLCLGAGAARAQVQTRSVESDLRVESSGSEDRRGRPVVSGYVYNQRAGSYAVSVRLLVEALDGSGQVAGSTTGYVLGDVPPSNRSYFEIKAPAKAASYRVTIESFAWRAYGAGGG*
Ga0070706_10039215713300005467Corn, Switchgrass And Miscanthus RhizosphereAQVSGRAVDEDLRLEWTAAEDRRGRPIVSGYVYNQRAGTYATSMRLQVEALDASGQAVGSTTGFVFGDVPPSGRSYFEIKAPAKAASYRVTVQGFSWRGYGAGGG*
Ga0070696_10192316013300005546Corn, Switchgrass And Miscanthus RhizosphereMAGVMAARSIEPAIRRAVALVTLLLGSAFGIVGGPVEAQVSGRPTDQDLRLEWTAGEDRRGRPIVSGYVYNQRAGLYATSMRLQVEALDASGQAVGSTTGFVFGDVPPSGRSYFEIKAPAKAASYRVTIQGFSWRGYGAGGG*
Ga0068854_10119120113300005578Corn RhizosphereMRRLARGMSTAVLAGLLMCLGGGAARAQVQTRSAESDLRVEWTGSEDRRGRSVVSGYVYNQRAGSYAVSVRLLVEALDGSGQVAGSTTGYVLGDVPPSNRSYFEIKAPAKAASYRVTIESFAWRA
Ga0068851_1005287543300005834Corn RhizosphereMTRPEIRRAAVTIALLALVAAAGPAGAQVLGRPADGDLRLEWTAAEDRRGRPIVSGYIYNLRGGTYATSVRLLVEALDASGATVGSTSGFVFGDVPPSDRSYFEIKAPPRAASYRVSIQ
Ga0079220_1213227213300006806Agricultural SoilDRRRATAVTIVLLALLASAGGKPAGAQSFGRPTEGDLRLEWTATEDRRGRPIVSGYIYNLRGGTYATSVRLLVEALDASGVTVGSTSGFVFGDVPPSDRSYFEIKAPPAAASYRVTIQTLSWRSYGAGGG*
Ga0075425_10015618643300006854Populus RhizosphereMTRPEIRRAAVTIALLALVAAAGPAGAQVLGRPADGDLRLEWTAAEDRRGRPIVSGYIYNLRGGTYATSVRLLVEALDASGATVGSTSGFVFGDVPPSDRSYFEIKAPPRAASYRVSIQTFSWRSYGAGGG*
Ga0075436_10025026813300006914Populus RhizospherePEIRRAAVTIALLALVAAAGPASAQVLGRPADGDLRLEWTAAEDRRGRPIVSGYIYNLRGGTYATSVRLLVEALDASGATVGSTSGFVFGDVPPSDRSYFEIKAPPRAASYRVSIQTFSWRSYGAGGG*
Ga0079219_1038268923300006954Agricultural SoilMMRPEIRRAAVTIALLALVAAAGPAGAQVLGRPADGDLRLEWTAAEDRRGRPIVSGYIYNLRGGTYATSVRLLVEALDASGATVGSTSGFVFGDVPPSDRSYFEIKAPPRAASYRVSIQTFSWRSYGAGGG*
Ga0099794_1004854933300007265Vadose Zone SoilMEGAGATRSIEPAVRRAVALVTVLICSTAGVAGAQVSGRSVDQDLRLEWTAAEDRLGRPIVSGYVYNARAGTYATAMRLRVEALDASGQAVGATTGFVFGDVPPSGRSYFEIKAPAKAGSYRVTIQGFSWRAYGAGGG*
Ga0099794_1005251313300007265Vadose Zone SoilVALVTLLICSTAGVAGAQVSGRAVDEDLRLEWTAAEDRRGRPIVSGYVYNQRAGTYATSMRLQVEALDASGQAVGSTTGFVFGDVPPSGRSYFEIKAPAKAASYRVTVQGFSWRGYGAGGG*
Ga0105106_1067476623300009078Freshwater SedimentMRGLTAVLAALLIYAGAAIAQVQTRSVDSDLRVEGSGSEDRRGRPVVSGYVYNQRAGSYAVSVRLLVEALDGSGQVVGSTSGYVLGDVPPSNRSYFEIKAPAKAASYRITIASFAWRSYGAGGG*
Ga0099828_1004685743300009089Vadose Zone SoilVSTVVLAALLICLGAGAARAQVSTRSVDRDLRVEWTDSEDRRGRPVVSGYVYNQRAGSYAVSVRLRVEALDGSGQVAGSTIGYVLGEVPPSNRSYFEIKAPAKAASYRVTIESFAWRAYGAGGG*
Ga0105245_1289407013300009098Miscanthus RhizosphereVTIALLALVAAAGPAGAQVLGRPADGDLRLEWTAAEDRRGRPIVSGYIYNLRGGTYATSVRLLVEALDASGATVGSTSGFVFGDVPPSDRSYFEIKAPPRAASYRVSIQTFSWRSYGAGGG*
Ga0114129_1002497123300009147Populus RhizosphereMSRLARGLSTAVLATLLICLGAGQARPQVQTRSVDSDLRVEGTGSEDRRGRPVVSGYVYNQRAGSYAVSVRLRVDALDGSGQVTGSTVGYVLGEVPPSNRSYFEIKAPAKAASYRVTIESFAWRAYGAGGG*
Ga0114129_1012938023300009147Populus RhizosphereMARGPEEGAVMTRPEIRRAAVTIALLALVAAAGPASAQVLGRPADGDLRLEWTAAEDRRGRPIVSGYIYNLRGGTYATSVRLLVEALDASGATVGSTSGFVFGDVPPSDRSYFEIKAPPRAASYRVSIQTFSWRSYGAGGG*
Ga0105243_1228575813300009148Miscanthus RhizosphereMRRLASGMSTAVLAGLLMCLGGGAARAQVQTRSVESDLRVEWTGSEDRRGRSVVSGYVYNQRAGSYAVSVRLLVEALDGSGQVAGSTTGYVLGDVPPSNRSYFEIKAPAKAASYRVTIESFAWRAYGAGGG*
Ga0105241_10000922113300009174Corn RhizosphereMAGGPEEGAAMTRSEIRRAAVTIALLALVAAAGPAGAQVLGRPADGDLRLEWTAAEDRRGRPIVSGYIYNLRGGTYATSVRLLVEALDASGATVGSTSGFVFGDVPPSDRSYFEIKAPPRAASYRVSIQTFSWRSYGAGGG*
Ga0105242_1291586213300009176Miscanthus RhizosphereMEGVGATRSIEPAIRRAVGLVTLLICSTAGVAGAQVSGRAVDEDLRLEWTAAEDRRGRPIVSGYVYNQRAGTYATSMRLQVEALDASGQAVGSTTGFVFGDVPPSDRSYFEIKAPARAASYRVTIQTFAWRAYGAGGG*
Ga0105249_1004964353300009553Switchgrass RhizosphereMPRAGEFAGDALARGVTMAGGSTMRRLASGMSTAVLAGLLMCLGGGAARAQVQTRSVESDLRVEWTGSEDRRGRSVVSGYVYNQRAGSYAVSVRLLVEALDGSGQVAGSTTGYVFGDVPPSNRSYFEIKAPAKAASYRVTIESFAWRAYGAGGG*
Ga0105249_1249056713300009553Switchgrass RhizosphereVTIALLALVAAAGPASAQVLGRPADGDLRLEWTAAEDRRGRPIVSGYIYNLRGGTYATSVRLLVEALDASGATVGSTSGFVFGDVPPSDRSYFEIKAPPRAASYRVSIQTFSWRSYGAGGG*
Ga0105063_103001413300009804Groundwater SandMPRHAKAVSTRWIARRAGILAALLLFMGAGSVAAQGFSRSADADLRLERTGAEDRRGRPLVSGYVYNQRAGSYATSVQLLVEALDASGQVVGSTSGFVLGDVPPSSRSYFEARAPAKAASYRVTIQSFSWRTYGAGGL*
Ga0105081_104775613300009806Groundwater SandMPRHAKAVSTRGIARRAGILAALLLFMGAGPVAAQGFSRPADSDLRLEWAGAEDRRGRPLVSGYVYNQRAGSYATSVQLLVEAVDASGQVVGSTSGFVLGDVPPSSRAYFETRAPAKAASYRVTIQSFSW
Ga0105088_110642823300009810Groundwater SandLLMCLGAGAARAQVQTRAVDGDLRVEWAGSEDRRGRPVVSGYVYNQRAGSYAVSVRLRVEALDGSGQVVGSTIGYVLGEVPPSNRSYFEIKAPAKAASYRVTIESFAWRAYGAGGG*
Ga0105239_1051924613300010375Corn RhizosphereTIALLALVAAAGPAGAQVLGRPADGDLRLEWTAAEDRRGRPIVSGYIYNLRGGTYATSVRLLVEALDASGATVGSTSGFVFGDVPPSDRSYFEIKAPPRAASYRVSIQTFSWRSYGAGGG
Ga0134124_1008138043300010397Terrestrial SoilMRRLARGMSTAVLAGLLMCLGGGAARAQVQTRSVESDLRVEWTGSEDRRGRSVVSGYVYNQRAGSYAVSVRLLVEALDGSGQVAGSTTGYVLGDVPPSNRSYFEIKAPAKAASYRVTIESFAWRAYGAGGG*
Ga0134127_1003381983300010399Terrestrial SoilLCLGAGAVRAQIQTRSVDSDLRVEWTGSEDRRGRPIVSGYVYNQRAGGYAVSVRLRVEALDGSGQVAGSTIGYVLGEVPPSNRAYFEIKAPAKAASYRVTIESYAWRAYGAGGG*
Ga0134121_1008057543300010401Terrestrial SoilVTNALLALVAAAGPAGAQVLGRPADGDLRLEWTAAEDRRGRPIVSGYIYNLRGGTYATSVRLLVEALDASGATVGSTSGFVFGDVPPSDRSYFEIKAPPRAASYRVSIQTFSWRSYGAGGG*
Ga0137391_1125938513300011270Vadose Zone SoilMEGAGATRSIEPAVRRAVALVTVLICSTAGVAGAQVSGRSVDQDLRLEWTAAEDRRGRPIVSGYVYNARAGTYATAMRLRVEALDASGQAVGATTGFVFGDVPPSGRSYFEIKAPAKAGSYRVTIQGFSWRAYGAGG
Ga0137393_1001040423300011271Vadose Zone SoilMEGAGATRSIEPAVRRAVALVTVLICSTAGVAGAQVSGRSVDQDLRLEWTAAEDRRGRPIVSGYVYNARAGTYATAMRLRVEALDASGQAVGATTGFVFGDVPPSGRSYFEIKAPAKAGSYRVTIQGFSWRAYGAGGG*
Ga0137446_105192423300011419SoilVSTVVLAALLICLGAGAARAQVQTRSVDSDLRVEWTGSEDRRGRPVVSGYVYNQRAGSYAVSVRLRVEALDGSGQVAGSTIGYVLGEVPPSNRSYFEIKAPAKAASYRVTIESFAWRAYGAGGG*
Ga0137451_111950913300011438SoilICLGAGAARAQVQTRSVDRDLRVEWTGSEDRRGRPVVSGYVYNQRAGGYAVSVRLRVEALDGSGQVAGSTIGYVLGEVPPSNRSYFEIKAPAKAASYRVTIESFAWRAYGAGGG*
Ga0137389_1002919833300012096Vadose Zone SoilMMRRLARSVSTVVLAALLICLGAGAARAQVSTRSVDRDLRVEWTDSEDRRGRPVVSGYVYNQRAGSYAVSVRLRVEALDGSGQVAGSTIGYVLGEVPPSNRSYFEIKAPAKAASYRVTIESFAWRAYGAGGG*
Ga0137389_1008184833300012096Vadose Zone SoilMEGVGATGSIEPAIRRAVALVTLLICSTAGVAGAQVSGRAVDEDLRLEWTAAEDRRGRRIVSGYVYNQRAGTYATSMRLQAEALDASGQAVGSTTGFVFGDVPPSGRSYFEIKAPAKAASYRVTVQGFSWRGYGAGGG*
Ga0137363_1022658523300012202Vadose Zone SoilMEGVGATRSIEPAIRRAVALVTLLICSTAGVAGAQVSGRSVGEDLRLEWTAAEDRRGRPIVSGYVYNQRAGTYATSMRLQVEALDASGQAVGSTTGFVFGDVPPSGRSYFEIKAPAKAASYRVTIQGFSWRGYGAGGG*
Ga0137399_1023285823300012203Vadose Zone SoilMEGVGATRSIEPAIRRAVALVTLLICSTAGVAGAQVSGRSVDEDLRLEWTAAEDRRGRPIVSGYVYNQRAGTYATSMRLQVEALDASGQAVGSTTGFVFGDVPPSGRSYFEIKAPAKAASYRVTVQGFSWRGYGAGGG*
Ga0137362_1081916523300012205Vadose Zone SoilMEGVGATRSIEPAIRRAVALVTLLICSTAGVAGAQVSGRAVDEDLRLEWTAAEDRRGRPIVSGYVYNQRAGTYATSMRLQAEALDASGQAVGSTTGFVFGDVPPSGRSYFEIKAPAKAASYRVTIQGFSWRGYGAGGG*
Ga0137434_100436233300012225SoilVSTVVLAALLICLGAGAARAQVQTRSVDGDLRVEWTGSEDRRGRPVVSGYVYNQRAGGYAVSVRLRVEALDGSGQVAGSTIGYVLGEVPPSNRSYFEIKAPAKAASYRVTIESFAWRAYGAGGG*
Ga0137375_1004354423300012360Vadose Zone SoilMSTTVLAALLICLGAGPTRAQVQTRSVDSDLRVEGTGSEDRRGRPVVSGYVYNQRAGGYAVSVRLRVDALDGSGQVTGSSVGYVLGEVPPSNRAYFEIKAPAKAASYRVTIESFAWRAYGAGGG*
Ga0137360_1021421323300012361Vadose Zone SoilMEGVGATRSIEPAIRRAVALVTLLICSTAGVAGAQVSGRAVDEDLRLEWTAAEDRRGRPIVSGYVYNQRAGTYATSMRLQAEALDASGQAVGSTTGFVFGDVPPSGRSYFEIKAPAKAASYRVTVQGFSWRGYGAGGG*
Ga0137361_1035786933300012362Vadose Zone SoilMEGVGATGSIEPAIRRAVALVTLLICSTAGVAGAQVSGRAVDEDLRLEWTAAEDRRGRPIVSGYVYNQRAGTYATSMRLQVEALDASGQAVGSTTGFVFGDVPPSGRSYFEIKAPAKAASYRVTIQGFSWRGYGAGGG*
Ga0157291_1031020323300012902SoilMRQTVDVVGTRRSIASAGSIAMLLALLIAGPVGAQITGRALAPDLRVEWSAEEDRRGRTVVSGYVYNERAGSYATGVRLRVEALDGSGQTVGSTTGYVLGDVPPSNRSYFEVKAPAKAAAYRVTVQSYTWRGYGAGGG*
Ga0137395_1056532713300012917Vadose Zone SoilMEGVGATGSIEPAIRRAVALVTLLICSTAGVAGAQVSGRAVDEDLRLEWTAAEDRRGRPIVSGYVYNQRAGTYASSMRLQVEALDASGQAVGSTTGFVFGDVPPSGRSYFEIKAPAKAASYRVTIQGFSWRGYGAGGG*
Ga0137394_1001782063300012922Vadose Zone SoilMPRAGEFARDALAPGVTMSGGSTMRRLARGMSTAVLAGLLMCLGAGAARAQVQTRSVESDLRVEWTGSEDRRGRPVVSGYVYNQRAGSYAVSVRLLVEALDGSGQVAGSTTGYVLGDVPPSNRSYFEVKAPAKAASYRVTIESFAWRAYGAGGG*
Ga0137359_1006165743300012923Vadose Zone SoilMEGVGATRSIEPAIRRAVALVTLLICSTAGVAGAQVSGRAVDEDLRLEWTAAEDRRGRPIVSGYVYNQRAGTYATSMRLQVEALDASGQAVGSTTGFVFGDVPPSGRSYFEIKAPAKAASYRVTVQGFSWRGYGAGGG*
Ga0137404_1028396913300012929Vadose Zone SoilMPRAGEFARDALAPGVTMSGGSTMRRLARGMSTAILAGLLMCLGAGAARAQVQTRSVESDLRVEWTGSEDRRGRPVVSGYVYNQRAGSYAVSVRLLVEALDGSGQVAGSTTGYVLGDVPPSNRSYFEIKAPAKAASYRVTIESFAWRAYGAGGG*
Ga0137407_1011340913300012930Vadose Zone SoilMPRAGEFARDALAPGVTMSGGSTMRRLARGMSTAVLAGLLMCLGAGAARAQVQMRSVESDLRVEWTGSEDRRGRPVVSGYVYNQRAGSYAVSVRLLVEALDGSGQVAGSTTGYVLGDVPPSNRSYFEIKAPAKAASYRVTIESFAWRAYGAGGG*
Ga0137410_1037763133300012944Vadose Zone SoilMPRAGEFARDALAPGVTMSGGSTMRRLARGMSTAVLAGLLMCLGAGAARAQVQTRSVESDLRVEWTGSEDRRGRPVVSGYVYNQRAGSYAVSVRLLVEALDGSGQVAGSTTGYVLGDVPPSNRSYFEIKAPAKVASYRVTIESFAWRAYGAGGG*
Ga0164299_1138103113300012958SoilMEGVGATGSIEPAIRRAVALVTLLICSTAGVAGAQVSGRAVDEDLRLEWTAAEDRRGRPIVSGYVYNQRSGTYASSMRLQVEALDASGQAVGSTTGFVFGDVPPSGRSYFEIKAPAKAASYRVTIQGFSWRGYGAGGG*
Ga0164304_1052553823300012986SoilMEGVGATGSIEPAIRRAVALVTLLICSTAGVAGAQVSGRSVDEDLRLEWTAAEDRRGRPIVSGYVYNQRAGTYATSMRLQVEALDASGQAVGSTTGFVFGDVPPSGRSYFEIKAPAKAASYRVTVQGFFWRGYGAGGG*
Ga0157374_1202682813300013296Miscanthus RhizosphereGAQVLGRPADGDLRLEWTAAEDRRGRPIVSGYIYNLRGGPYATSVRLLVEALDASGATVGSTSGFAFGAVPPPARSYLELKAPPRAASYRVSIQTFSWRSYGAGGG*
Ga0157372_1017292833300013307Corn RhizosphereMRQTVDVVGTRHSIASAGSIAMLLALLIAGPVGAQITGRALAPDLRVEWSAEEDRRGRTVVSGYVYNERAGSYATGVRLRVETLDGSGQTVGSTTGYVLGDVPPSNRSYFEVKAPAKAAAYRVTVQSYTWRGYGAGGG*
Ga0075301_106945523300014262Natural And Restored WetlandsGAGAARAQVQARGVDGDLRVDWAGSEDRRGRPVVSGYVYNQRAGSYAVSVRLRVEALDGSGQVVGSTIGYVLGEVPPSNRSYFEIKAPAKAASYRVAIESFAWRAYGAGGG*
Ga0163163_1118769813300014325Switchgrass RhizosphereVTIALLALVAAAGPAGAQVLGRPADGDLRLEWTAAEDRRGRPIVSGYIYNLRGGTYATSVRLLVEALDASGATVGSTSGFVFGDVPPSDRSSFEIKAPPRAASYRVSIQTFSWRSYGAGGG*
Ga0157377_1002919213300014745Miscanthus RhizospherePEEGAVMTRPEIRRAAVTIALLALVAAAGPASAQVLGRPADGDLRLEWTAAEDRRGRPIVSGYIYNLRGGTYATSVRLLVEALDASGATVGSTSGFVFGDVPPSDRSYFEIKAPPRAASYRVSIQTFSWRSYGAGGG*
Ga0180066_107005523300014873SoilVVLVALRICLGAGAARAQVQTRSVDGDLRVEWIGSEDRRGRPVVSGYVYNQRAGGYAVSVRLRVEALDGSGQVAGSTIGYVLGEVPPSNRSYFEIKAPAKAASYRVTIESFAWGAYGAGGG*
Ga0180066_112614213300014873SoilVSTVALVVLLIGLGAGAARAQVQTRSVDGDLAVEWTGSEDRRGRPVVSGYVYNQRAGSYAESVRFRVEALDGSGQVVGSTTGYVLGDVPPSNRSYFEIKAPAKAASYRVTIESFVWRAYGAGGG*
Ga0180094_106569723300014881SoilVSTVALVVLLIGLGAGAARAQVQTRSVDGDLAVEWNGAEDRRGRPVVSGYVYNRRAGSYAERVRFRVEALDGSGQVVGSTTGYVLGDVPPSNRSYFEIKAPAKAASYRVTIESFVWRAYGAGGG*
Ga0180104_114035223300014884SoilVSTVALVVLLIGLGAGAARAQVQTRSVDGDLAVEWTGSEDRRGRPVVSGYVYNQRAGSYAESVRFRVEALDGSGQVVGSTTGYVLGDVPPSNRSYFEIKAPAKAASYRVTIKSFVWRAYGAGGG*
Ga0180063_101923823300014885SoilVSTVALVVLLIGLGAGAARAQVQTRSVDGDLAVEWTGSEDRRGRPVVSGYVYNQRAGSYAESVRFRVEALDGSGQAVGSTTGYVLGDVPPSNRSYFEIKAPAKAASYRVTIESFVWRAYGAGGG*
Ga0180085_111402113300015259SoilVSTVVLAALLICLGAGAARAQVQTRSLDSDLRVEWTGSEDRRGRPVVSGYVYNQRAGGYAVSVRLRVEALDGSGQVAGSTIGYVLGEVPPSNRSYFEIQAPAKAASYRVTIESFAWR
Ga0182007_1016247913300015262RhizosphereIALLALVAAAGPAGAQVLGRPADGDLRLEWTAAEDRRGRPIVSGYIYNLRGGTYATSVRLLVEALDASGATVGSTSGFVFGDVPPSDRSYFEIKAPPRAASYRVSIQTFSWRSYGAGGG*
Ga0137403_1016622723300015264Vadose Zone SoilMPRAGEFARDALAPGVTMSGGSTMRRLARGMSTAVLAGLLMCLGAGAARAQVQTRSVESDLRVEWTGSEDRRGRPVVSGYVYNQRAGSYAVSVRLLVEALDGSGQVAGSTTGYVLGDVPPSNRSYFEIKAPAKAASYRVTIESFAWRAYGAGGG*
Ga0137403_1116633613300015264Vadose Zone SoilMEGVGATRSIEPAIRRAVALVTLLICSTAGVAGAQVSGRSVDEDLRLEWTAAEDRRGRPIVSGYVYNQRAGTYASSMRLQVEALDASGQAVGSTTGFVFGDVPPSGRSYFEIKAPAKAASYRVTIQGFSWRGYGAGGG*
Ga0132258_1367586433300015371Arabidopsis RhizosphereMAAPATVQFTGSAIRRAVMLLTLLLASSAGAEVRGRSTEQDLRLDWTTAEDRRGRPVVAGYIYNQRAGSYATAVRVLVEALDASGQVAGSTSGLIVGDVPPSDRSYFEISAPARAASYRVTIQTFSWRTYGAGGG*
Ga0132257_10367822523300015373Arabidopsis RhizosphereMTRPEIRRAAVTIALLALVAAAGPAGAQVLGRPADGDLLLEWTAAEDRRGRPIVSGYIYNLRGGTYATSVRLLVEALDASGATVGSTSGFVFGDVPPSDRSYFEIKAPPRAASYRVSIQTFSWRSYGAGGG*
Ga0187775_1000409333300017939Tropical PeatlandMRESVAGPAGRRAVMRAATVFGMLMLLGGDLAVAQISGRPAGADLRVEWSAGADRRGRPVVSGYVYNERGGSYATAVRLRVDALDAGGQVIGSTTGYVLGDVPPSSRSYFEVGSPAPATDYRVTIESFAWRAYGAGGA
Ga0184610_101731313300017997Groundwater SedimentVWRSARSVSTVVLAALLSCLGAGAARAQVQTRSVDSDLRVEWTGSEDRRGRPVVSGYVYNQRAGGYAVSVRLRVEALDGSGQVAGSTIGYVLGEVPPSNRSYFEIKAPAKAASYRVTIESFAWRAYGAGGG
Ga0184604_1000761633300018000Groundwater SedimentMSRLARGMSTAVLAALLICLGAGPARPQVQTRSVDSDLRVEGTGSEDRHGRPVVSGYVYNQRAGGYAVSVRLRVDALDGSGQVTGSTVGYVLGEVPPSNRSYFEIKAPAKAASYRVTIESFAWRAYGAGGG
Ga0184608_1026150223300018028Groundwater SedimentMSRLARGMSTAVLAALLICLTAGPARAQVQTRSIDSDLRVEGTGSEDRRGRPVVSGYVYNQRAGGYAVSVRLRVEALDGSGQVAGSTIGYVLGEVPPSNRSYFEIKAPAKAASYRVTIESFAWRAYGAGGG
Ga0184634_1025824323300018031Groundwater SedimentVWRFARSVSTVVLAALLICLGAGAARAQVHTRSVDSDLRVEWTGSEDRRGRPVVSGYVYNQRAGGYAVSVRLRVEALDGSGQVAGSTIGYVLGEVPPSNRSYFEIKAPAKAASYRVTIESFAWRAYGAGGG
Ga0184634_1053547213300018031Groundwater SedimentMRRLARGVSTVVLAALLICLGAGAARAQVHTRSVDSDLRVEWTGSEDRRGRPVVSGYVYNQRAGGYAVSVRLRVEALDGSGQVVGSTIGYVLGEVPPSNRSYFEIKAPAKAASYRVTIESFAWRAYGAGGG
Ga0184638_125779113300018052Groundwater SedimentMRRLARGVSTVVLAALLICLAAGAAPAQVPTRSVDRDLRVEWTGSEDRRGRPVVSGYVYNQRAGGYAVSVRLRVEALDGSGQVAGSTIGYVLGEVPPSNRSYFEIKAPAKAASYRVTIESFAWRAYGAGGG
Ga0184626_1008278013300018053Groundwater SedimentCLGAGAARAQVQTRSVDSDLRVEWTGSEDRRGRPVVSGYVYNQRAGGYAVSVRLRVEALDGSGQVAGSTIGYVLGEVPPSNRSYFEIKAPAKAASYRVTIESFAWRAYGAGGG
Ga0184637_1003122353300018063Groundwater SedimentVWRSARSVSTVVLAALLSCLGAGAARAQVQTRSVDSDLRVEWTGSEDRRGRPVVSGYVYNQRAGGYAVSVRLRVEARDGSGQVAGSTIGYVLGEVPPSNRSYFEIKAPAKAASYRVTIESFAWRAYGAGGG
Ga0184637_1019705423300018063Groundwater SedimentMPRLAKAVSTRGIAGRAGILAALLLLMGAGSVAAQGFSRPADADLRLEWAGAEDRRGRPLVSGYVYNQRAGSYATSVQLLVEALDASGQVVGSTSGFVLGDVPPSNRAYFETRAPAKAASYRVTIQSFSWRTYGAGGL
Ga0184640_1020935013300018074Groundwater SedimentSTRGIARRAGILAALLLFLGAGSVGAQGFSRPADADLRLEWAGAEDRRGRPLVSGYVYNQRAGSYATSVRLLVEALDASGQVVGSTTGFVLGEVPPSSRSYFEIKAPAKAASYRVTIQSFSWRAYGAGGG
Ga0184640_1052572113300018074Groundwater SedimentMRRFARGMSTAVLAALLICLGAGAARAQVQTRSVDGDLRVEWTGSEDRRGRPVVSGYVYNQRAGGYAVSVRLRVEALDGSGQVAGSTIGYVLGEVPPSNRSYFEIKAPAKAASYRVMIESFAWRAYGAGGG
Ga0184632_1009460833300018075Groundwater SedimentVWRSARSVSTVVLAALLICLGAGAARAQVQTRSLDSDLRVEWTGSEDRRGRPVVSGYVYNQRAGGYAVSVRLRVEALDGSGQVAGSTIGYVLGEVPPSNRSYFEIKAPAKAASYRVTIESFAWRAYGAGGG
Ga0184609_1026400123300018076Groundwater SedimentVWRFARSVSTVVLAALLICLGAGAARAQVQTRSVDSDLRVEWTGSEDRRGRPVVSGYVYNQRAGGYAVSVRLRVEALDGSGQVAGSTIGYVLGEVPPSNRSYFEIKAPAKAASYRVTIESFAWRAYGAGGG
Ga0184633_1022111233300018077Groundwater SedimentMPRLAKAVSTRGIARRAGILAALLLFLGAGSVGAQGFSRPADADLRLEWAGAEDRRGRPLVSGYVYNQRAGSYATSVRLLVEALDASGQVVGSTTGFVLGEVPPSSRSYFEIKAPAKAASYRVTIQSFSWRAYGAGGG
Ga0184629_1005960213300018084Groundwater SedimentSVSTVVLAALLICLGAGAARAQVQTRSVDRDLRVEWTGSEDRRGRPVVSGYVYNQRAGSYAVSVRLRVEALDGSGQVAGSTIGYVLGEVPPSNRSYFEIKAPAKAASYRVTIESFAWRAYGAGGG
Ga0190265_1006177213300018422SoilMSDLARRTSTAVLAALLVCLGAHQARAQVQARSADSDLRVESIGSEDRRGRPFVSGYVYNQRAGSYAVSVRLRVDALDGSGQVTGSTVGYVLGDVPPSNRSYFEIKAPAKAASY
Ga0190265_1021325823300018422SoilMRRFAPGTAVLALLIFLGASVATAQVQTRSVDGDLRVEATGSEDRRGRPVVSGYVYNQRAGSYAVSVRLLVEALDGSGQVLGSTIGYVLGDVPPSNRSYFEIKAPAKAASYRVTIASFTWRAYGAGGG
Ga0190265_1022488723300018422SoilMRRTPAWGVSVGVLALLIGLGAGAAHAQFGTRSVGDDLHVESSASEDRRGRPVMSGYVYNRRAGVYAVGVRLRVEALDDAGRVIGSTTGYVMGDVPPSTRSYFEIKAPAKAASYRVTIDSFEWRGYGAGGG
Ga0190272_1006159043300018429SoilMTRRLAGGTSTMLAVLLIYAGSAIAQLQTRSVDSDLRVEATGSEDRRGRPVVSGYVYNQRAGGYAVSVRLLVEALDGSGQVVGSTSGYVLGDVPPSNRSYFELKAPAKAASYRVTIASFICRVYGAGRG
Ga0190272_1008776513300018429SoilVRRPARSVSTAVLAALLICLGAGAARAQVQTRSVDGDLRVEWTGSEDRRGRPVVSGYVYNQRAGGYAVSVRLRVEALDGSGQVAGSTIGYVLGEVPPSNRSYFEIKAPAKAASYRVTIESFAWRAYGAGGG
Ga0187894_1003303943300019360Microbial Mat On RocksMRRLARGVGTVVLAALLLGMGVGVARAQVQAGSVDGDLRVEWTGSEDRRGRPVVSGYVYNQRAGTYAVSVRLRVEALDGSGQVDGSTIGYVLGEVPPSNRSYFEIKAPAKGASYRVTVESFAWRAYGAGGG
Ga0190264_1159741213300019377SoilMRRRARGVSTAVLAVLLIGLGAGAARAQVQTRSVDSDLRVEWTGSEDRRGRPVVSGYVYNQRAGGYAVSVRLRVEALDGSGQVAGSTIGYVLGEVPPSNRSYFEIKAPAKAASYRVTIESFAWRAYGAGGG
Ga0187892_1001704563300019458Bio-OozeMRRFARGVSTVVLAALLLGIGVGVARAQVQAGSVDGDLRVEWIGSEDRRGRPVVSGYVYNQRAGAYAVSVRLRVEALDGSGQVAGSTIGYVLGEVPPSNRSYFEIKAPAKGTSYRVTIESFAWRAYGAGGG
Ga0187892_1002418763300019458Bio-OozeMPRLAKTVSTRGIARPAGFLAVLLLLMGAGSVAAQGFSRPADADLRLEWAGAEDRRGRPLVSGYVYNQRAGAYAASVQLLVEALDASGQVVGSTRGFVLGDVPPSSRTYFETRAPAKAASYRVTIQSVSFRAYGAGGGM
Ga0187892_1004579123300019458Bio-OozeMMRGAGRVVSMAVLAALLIGLSAGMARAQVQAGSIDANLSVEGIGSEDRRGRRIVSGYVYNRRAGTYAADVLLLVEALDGSGQVVGSTTAYVMGDVPPSNRSYFEVKAPAPAASYRVTVRSVAWRGYGAGGG
Ga0187893_1001256793300019487Microbial Mat On RocksMPRPAKTVSTRGIARPAGFLAVLLLLMGAGSVAAQGFSRPADADLRLEWAGAEDRRGRPLVSGYVYNQRAGAYAASVQLLVEALDASGQVVGSTRGFVLGDVPPSSRTYFETRAPAKAASYRVTIQSVSFRAYGAGGGM
Ga0187893_1006179333300019487Microbial Mat On RocksMRRFARGVSTVVLAALLLGMGVGVARAQVQAGSVDGDLRVEWIGSEDRRGRPVVSGYVYNQRAGAYAVSVRLRVEALDGSGQVAGSTIGYVLGEVPPSNRSYFEIKAPAKGTSYRVTIESFAWRAYGAGGG
Ga0187893_1044899613300019487Microbial Mat On RocksMRRLARGVSTVVLAALLLGIGVGVARAQVQAGSVDRDLRVEWTGSEDRRGRPVVSGYVYNQRAGAYAVSVRLRVEALDGSGQVAGSTIGYVLGEVPPSNRSYFEIKAPAKGASYRVTIESFAWRAYGAGGG
Ga0193723_101114123300019879SoilMRRLASGMSTAVLAALLLCLGAGAARAQVQTRSVESDLRVESSGSEDRRGRPVVSGYVYNQRAGSYAVGVRLLVEALDGSGQVLGSTTGYVVGDVPPSNRSYFEIKAPAKAASYRVTIESFAWRGYGAGGG
Ga0193707_105832823300019881SoilMAGVMAARSIEPAIRRAAALVTLLLGSAFGIVGGTVEAQVSGRPTDQDLRLEWTAGEDRRGRPIVSGYVYNQRAGSYATSMRLQVEALDASGQAVGSTTGFVFGDVPPSGRSYFEIKAPARAASYRVTIQTFAWRAYGAGGG
Ga0193713_101324323300019882SoilMRRLASGMSTAVLAALLLCLGAGAARAQVQTRSVESDLRVESSGSEDRRGRPVVSGYVYNQRAGSYAVGVRLLVEALDGSGQVLGSTTGYVLGDVPPSNRSYFEIKAPAKAASYRVTIESFAWRGYGAGGG
Ga0193725_102116213300019883SoilVWRSARSVSTVVLAALLSCLGAGAARAQVQTRSVDRDLRVEWTGSEDRRGRPVVSGYVYNQRAGSYAVSVRLRVEALDGSGQVAGSTIGYVLGEVPPSNRSYFEIKAPAKAASYRVTIESFAWRAYGAG
Ga0193725_105248723300019883SoilMRRLASGMSTAVLAALLLCLGAGAARAQVQTRSVESDLRVESSGSEDRRGRPVVSGYVYNQRAGSYAVGVRLLVEALDGSGQVLGSTIGYVVGDVPPSNRSYFEIKAPAKAASYRVTIESFAWRGYGAGGG
Ga0193747_114759123300019885SoilAFGIVGGPVEAQVSGRPTDQDLRLEWTAAEDRRGRPIVSGYVYNQRAGIYATSMRLQVEALDASGQAVGSTTGFVFGDVPPSDRSYFEIKAPARAASYRVTIQTFAWRAYGAGGG
Ga0193728_108714823300019890SoilLVTLLLGSAFGIVGGTVEAQVSGRPTDQDLRLEWTAGEDRRGRPIVSGYVYNQRAGSYATSMRLQVEALDASGQAVGSTTGFVFGDVPPSDRSYFEIKAPARAASYRVTIQTFAWRAYGARGG
Ga0193711_101254523300019997SoilAGAARAQVQTRSVESDLRVESSGSEDRRGRPVVSGYVYNQRAGSYAVGVRLLVEALDGSGQVLGSTIGYVVGDVPPSNRSYFEIKAPAKAASYRVTIESFAWRGYGAGGG
Ga0193730_103250233300020002SoilMAGVMAARSIEPAIRRAAALVTLLLGSAFGIVGGTVEAQVSGRPTDQDLRLEWTAAEDRRGRPIVSGYVYNQRAGSYASSMRLQVEALDASGQAVGSTTGFVFGDVPPSGRSYFEIKAPARAASYRVTIQTFTWRAYGAGGG
Ga0193739_105699213300020003SoilVWRSARSVSTVVLAALLSCLGAGAARAQVQTRSVDGDLRVEWTGSEDRRGRPVVSGYVYNQRAGGYAVSVRLRVEALDGSGQVAGSTIGYVLGEVPPSNRSYFEIKAPAKAASYRV
Ga0193755_109992813300020004SoilAGGALDPGVRMTGGSTVWRSARSVSTVVLAALLICLGAGAARAQVSTRSVDRDLRVEWTGSEDRRGRPVVSGYVYNQRAGSYAVSVRLRVEALDGSGQVAGSTIGYVLGEVPPSNRSYFEIKAPAKAASYRVTIESFAWRAYGAGGG
Ga0193735_116876323300020006SoilMAGVMAARSIGPAIRRAAALVTLLLGSAFGIVGGTVEAQVSGRPTDQDLRLEWTAGEDRRGRPIVSGYVYNQRAGSYATSMRLQVEALDASGQAVRSTTGFVFGDVPPSDRSYFEIKAPARAASYRVTIQTFAWRAYGAGGG
Ga0193726_104009943300020021SoilVSVAERGRRAVALVTLLIGSAFGIVAGTAEAEVSGRPAEQDLRLEWAAGEDRRGRPIVSGYVYNQRAGTYATSMRLQVEAVDASGQAVGSTTGFVFGDVPPSGRSYFEIKAPARAASYRVTIQTFAWRAYGAGGG
Ga0206356_1183751523300020070Corn, Switchgrass And Miscanthus RhizosphereMTRSEIRRAAVTIALLALVAAAGPAGAQVLGRPADGDLRLEWTAAEDRRGRPIVSGYIYNLRGGTYATSVRLLVEALDASGATVGSTSGFVFGDVPPSDRSYFEIKAPPRAASYRVSIQTFSWRSY
Ga0206354_1056309613300020081Corn, Switchgrass And Miscanthus RhizosphereMTRSEIRRAAVTIALLALVAAAGPAGAQVLGRPADGDLRLEWTAAEDRRGRPIVSGYIYNLRGGTYATSVRLLVEALDASGATVGSTSGFVFGDVPPSDRSYFEIKAPPRAASYRVSIQTFSWRSYGAGGG
Ga0210378_1013776723300021073Groundwater SedimentVRRPARDVSTAVLAALLICLGAGAARAQVQTRSVDGDLRVEWTGSEDRRGRPVVSGYVYNQRAGGYAVSVRLRVEALDGSGQVAGSTIGYVLGEVPPSNRSYFEIKAPAKAASYRVTIESFAWRAYGAGGG
Ga0210378_1017971523300021073Groundwater SedimentVWRFARSVSTVVLAALLICLGAGAARAQVQTRSVDSDLRVEWTGSEDRRGRPVVSGYVYNQRAGGYAVSVRLRVEALDGSGQVAGSTIGYVLGEVPPSNRSYFEINAPAKAASYRVTIESFAWRAYGAGGG
Ga0210379_1015872413300021081Groundwater SedimentAQVQTRSVDRDLRVEWTGSEDRRGRPVVSGYVYNQRAGSYAVSVRLRVEALDGSGQVAGSTIGYVLGEVPPSNRSYFEIKAPAKAASYRVTIESFAWRAYGAGGG
Ga0210377_1001208033300021090Groundwater SedimentMRKLARGVSTVVLAALLICLGAGAARAQVQTRSVDSDLRVEWTGSEDRRGRPVVSGYVYNQRAGSYAVSVRLRVEALDGSGQVAGSTIGYVLGEVPPSNRSYFEIKAPAKAASYRVSIESFAWRAYGAGGG
Ga0224452_117493623300022534Groundwater SedimentCLGAGAARAQVQTRSVDGDLRVEWTGSEDRRGRPVVSGYVYNQRAGGYAVSVRLRVEALDGSGQVAGSTIGYVLGEVPPSNRSYFEIKAPAKAASYRVTIESFAWRAYGAGGG
Ga0209640_1012649113300025324SoilMRRFARGVSTVALAVLLIGLGAGAARAQVQTRSVAGDLAVEWTGSEDRRGRPVVSGYVYNQRAGSYADSVLLRVEALDGSGQIVGSTTGYVLGDVPPSNRSYFEIKAPAKAASYRVTIESFAWRAYGAGGG
Ga0210108_107004023300025560Natural And Restored WetlandsMRRLARGVSTTVLAALLMCLGAGAARAQVQARGVDGDLRVDWAGSEDRRGRPVVSGYVYNQRAGSYAVSVRLRVEALDGSGQVVGSTIGYVLGEVPPSNRSYFEIKAPAKAASYRVAIESFAWRAYGAGGG
Ga0210138_100973623300025580Natural And Restored WetlandsMRRLARGVSTTVLAALLMCLGAGAARAQVQARGVDGDLRVDWAGSEDRRGRPVVSGYVYNQRAGSYAVSVRLRVEALDGSGQVVGSTIGYVLGEVPPSNRSYFEIKAPAKAASYRVAIESFAWRAYG
Ga0207647_1035695023300025904Corn RhizosphereEIRRAAVTIALLALVAAAGPAGAQVLGRPADGDLRLEWTAAEDRRGRPIVSGYIYNLRGGTYATSVRLLVEALDASGATVGSTSGFVFGDVPPSDRSYFEIKAPPRAASYRVSIQTFSWRSYGAGGG
Ga0207645_1004090213300025907Miscanthus RhizosphereMRRLARGMSTAVLAGLLMCLGGGAARAQVQTRSVESDLRVEWTGSEDRRGRSVVSGYVYNQRAGSYAVSVRLLVEALDGSGQVAGSTTGYVFGDVPPSNRSYFEIKAPAKAASYRVTIE
Ga0207684_1002859253300025910Corn, Switchgrass And Miscanthus RhizosphereMSRLARTMSTREIARRAGILAALLLLMGAGSVAAQGFGRPADADLRLEWAGAEDRRGRPLVSGYVYNQRPGSYATSMRLLVEALDASGQVVGSTSGFVLGDVPPSSRSYFEIRAPAKAASYRVTIQSFSWRTYGAGAG
Ga0207684_1025700523300025910Corn, Switchgrass And Miscanthus RhizosphereMEGVGATGSIEPAIRRAVALVTLLICSTAGVAGAQVSGRAVDEDLRLEWTAAEDRRGRPIVSGYVYNQRAGTYATSMRLQVEALDASGQAVGSTTGFVFGDVPPSGRSYFEIKAPAKAASYRVTVQGFSWRGYGAGGG
Ga0207684_1059770733300025910Corn, Switchgrass And Miscanthus RhizosphereMTRSEIRRAAVTIALLALVAAAGPAGAQVLGRPADGDLRLEWTAAEDRRGRPIVSGYIYNLRGGTYATSVRLLVEALDASGATVGSTSGFVFGDVPPSDRSYFEIKAPPRAASYRVSIQTFSWRSYGAG
Ga0207660_1021662623300025917Corn RhizosphereMRRLVRGVSTAVLAMLLLCLGAGAVRAQIQTRSVDSDLRVEWTGSEDRRGRPIVSGYVYNQRAGGYAVSVRLRVEALDGSGQVAGSTIGYVLGEVPPSNRAYFEIKAPAKAASYRVTIESYAWRAYGAGGG
Ga0207660_1152105213300025917Corn RhizosphereMRRLARGMSTAVLAGLLMCLGGGAARAQVQIRSVESDLRVEWTGSEDRRGRSVVSGYVYNQRAGSYAVSVRLLVEALDGSGQVAGSTTGYVLGDVPPSNRSYFEIKAPAKAASYRVTIESFAWRAYGAGGG
Ga0207706_1009671723300025933Corn RhizosphereMRQTVDAVGTRRSIASAGSIAMLLALLIAGPVGAQITGRALAPDLRVEWSAEEDRRGRTVVSGYVYNERAGSYATGMRLRVEALDGSGQTVGSTTGYVLGDVPPSNRSYFEVKAPAKAAAYRVTVQSYTWRGYGAGGG
Ga0207686_1181638223300025934Miscanthus RhizosphereRAVALVTLLICSTAGVAGAQVSGRAVDEDLRLEWTAAEDRRGRPIVSGYVYNQRAGTYATSMRLQVEALDASGQAVGSTTGFVFGDVPPSDRSYFEIKAPARAASYRVTIQTFAWRAYGAGGG
Ga0207689_10001698143300025942Miscanthus RhizosphereMRRLARGMSTAVLAGLLMCLGGGAARAQVQTRSVESDLRVEWTGSEDRRGRSVVSGYVYNQRAGSYAVSVRLLVEALDGSGQVAGSTTGYVFGDVPPSNRSYFEIKAPAKAASYRVTIESFAWRAYGAGGG
Ga0207667_1172765213300025949Corn RhizosphereMRQTVDVVGTRHSIASAGSIAMLLALLIAGPVGAQITGRALAPDLRVEWSAEEDRRGRTVVSGYVYNERAGSYATGMRLRVEALDGSGQTVGSTTGYVLGDVPPSNRSYFEVKAPAKAAAYRVTVQSYTWRGYGAGGG
Ga0207651_1087693613300025960Switchgrass RhizosphereIRRAAVTIALLALVAAAGPASAQVLGRPADGDLRLEWTAAEDRRGRPIVSGYIYNLRGGTYATSVRLLVEALDASGATVGSTSGFVFGDVPPSDRSYFEIKAPPRAASYRVSIQTFSWRSYGAGGG
Ga0207712_1023271013300025961Switchgrass RhizosphereAVLAGLLMCLGGGAARAQVQTRSVESDLRVEWTGSEDRRGRSVVSGYVYNQRAGSYAVSVRLLVEALDGSGQVAGSTTGYVFGDVPPSNRSYFEIKAPAKAASYRVTIESFAWRAYGAGG
Ga0207708_1017715443300026075Corn, Switchgrass And Miscanthus RhizosphereMRRLARGMSTAVLAGLLMCLGGGAARAQVQTRSAESDLRVEWTGSEDRRGRSVVSGYVYNQRAGSYAVSVRLLVEALDGSGQVAGSTTGYVFGDVPPSNRSYFEIKAPAKAATDSLERVIGRLQQLLLL
Ga0207675_10032577423300026118Switchgrass RhizosphereMRRLARGMSTAVLAGLLMCLGGGAARAQVQTRSAESDLRVEWTGSEDRRGRSVVSGYVYNQRAGSYAVSVRLLVEALDGSGQVAGSTTGYVLGDVPPSNRSYFEIKAPAKAASYRVTIESFAWRAYGAGGG
Ga0209438_100203653300026285Grasslands SoilMEGVGATGSIEPAIRRAVALVTLLICSTAGVAGAQVSGRAVDEDLRLEWTAAEDRRGRPIVSGYVYNQRSGTYATSMRLQVEALDASGQAVGSTTGFVFGDVPPSGRSYFEIKAPAKAASYRVTVQGFSWRGYGAGGG
Ga0209438_101723943300026285Grasslands SoilMRRLARGMSTAVLAGLLMCLGAGAARAQVQTRSVESDLRVEWTGSEDRRGRPVVSGYVYNQRAGSYAVSVRLLVEALDGSGQVAGSTTGYVLGDVPPSNRSYFEVKAPAKAASYRVTIESFAWRAYGAGGG
Ga0209131_100290623300026320Grasslands SoilMRRLARGMSTAILAGLLMCLGAGAARAQVQTRSVESDLRVEWTGSEDRRGRPVVSGYVYNQRAGSYAVSVRLLVEALDGSGQVAGSTTGYVLGDVPPSNRSYFEIKAPAKVASYRVTIESFAWRAYGAGGG
Ga0257162_100574623300026340SoilMEGVGATGSIEPAIRRAVALVTRLICSTAGVAGAQVSGRAVDEDLRLEWTAAEDRRGRPIVSGYVYNQRAGTYATSMRLQVEALDASGQAVGSTTGFVFGDVPPSGRSYFEIKAPAKAASYRVTVQGFSWRGYGAGGG
Ga0257150_106838113300026356SoilVALVTLLICSTAGVAGAQVSGRAVDEDLRLEWTAAEDRRGRPIVSGYVYNQRAGTYATSMRLQVEALDASGQAVGSTTGFVFGDVPPSGRSYFEIKAPAKAASYRVTVQGFSWRGYGAGG
Ga0257166_104537513300026358SoilMMRRLARSVSTVVLAALLICLGAGAARAQVSTRSVDRDLRVEWTDSEDRRGRPVVSGYVYNQRAGSYAVSVRLRVEALDGSGQVAGSTIGYVLGEVPPSNRSYFEIKAPAKAASYRVTIESFAWRAYGAGGG
Ga0257163_100452523300026359SoilMAGVMAARSIAPAIRRAVALVTLLLGSAFGIVDGTVEAQVSGRPTDQDLRLEWTAAEDRRGRPIVSGYVYNQRAGTYATSMRLQVEALDASGQAVGSTTGFVFGDVPPSGRSYFEIKAPAKAASYRVTVQGFSWRGYGAGGG
Ga0257167_101047813300026376SoilTVWRSARSVSTVVLAALLICLGAGAARAQVSTRSVDRDLRVEWTDSEDRRGRPVVSGYVYNQRAGSYAVSVRLRVEALDGSGQMAGSTIGYVLGEVPPSNRSYFEIKAPAKAASYRVTIESFAWRAYGAGGG
Ga0257171_100009193300026377SoilICSTAGVAGAQVSGRAVDEDLRLEWTAAEDRRGRPIVSGYVYNQRAGTYATSMRLQVEALDASGQAVGSTTGFVFGDVPPSGRSYFEIKAPAKAASYRVTVQGFSWRGYGAGGG
Ga0257171_103869513300026377SoilSTRSVDRDLRVEWTDSEDRRGRPVVSGYVYNQRAGSYAVSVRLRVEALDGSGQVAGSTIGYVLGEVPPSNRSYFEIKAPAKAASYRVTIESFAWRAYGAGGG
Ga0257147_104210823300026475SoilEPAIRRAVALVTLLICSTAGVAGAQVSGRAVDEDLRLEWTAAEDRRGRPIVSGYVYNQRAGTYATSMRLQVEALDASGQAVGSTTGFVFGDVPPSGRSYFEIKAPAKAASYRVTVQGFSWRGYGAGGG
Ga0257177_103052823300026480SoilMEGAGATRSIEPAVRRAVALVTVLICSTAGVAGAQVSGRSVDQDLRLEWTAAEDRRGRPIVSGYVYNQRAGTYATSMRLQVEALDASGQAVGSTTGFVFGDVPPSGRSYFEIKAPAKAGSYRVTIQGFSWRAYGAGGG
Ga0257172_105826513300026482SoilTVEAQVSGRPTDQDLRLEWTAAEDRRGRPIVSGYVYNQRAGTYATSMRLQVEALDASGQAVGSTTGFVFGDVPPSGRSYFEIKAPAKAASYRVTVQGFSWRGYGAGGG
Ga0257157_100393533300026496SoilMAGVMAARSIAPAIRRAVALVTLLLGSAFGIVDGTVEAQVSGRPTDQDLRLEWTAAEDRRGRPIVSGYVYNQRAGTYATSMRLQVEALGASGQAVGSTTGFVFGDVPPSGRSYFEIKAPARAASYRVTIQTFAWRAYGAGGG
Ga0257181_100663223300026499SoilVWRSARSVSTVVLAALLICLGAGAARAQVSTRSVDRDLRVEWTDSEDRRGRPVVSGYVYNQRAGSYAVSVRLRVEALDGSGQVAGSTIGYVLGEVPPSNRSYFEIKAPAKAASYRVTIESFAWRAYGAGGG
Ga0257161_100352443300026508SoilMAGVMAARSIAPAIRRAVALVTLLLGSAFGIVDGTVEAQVSGRPTDQDLRLEWTAAEDRRGRPIVSGYVYNQRAGTYATSMRLQVEALGASGQAVGSTTGFVFGDVPPSGRSYFEIKAPAKAASYRVTVQGFSWRGYGAGGG
Ga0257168_100170723300026514SoilMEGVGATRSIEPAIRRAVALVTLLICSTAGVAGAQVSGRSVDEDLRLEWTAAEDRRGRPIVSGYVYNQRSGTYATSMRLQVEALDASGQAVGSTTGFVFGDVPPSGRSYFEIKAPAKAASYRVTIQGFSWRGYGAGGG
Ga0257168_114295613300026514SoilMEGVGATRSIEPAIRRAVALVTVLICSTAGVAGAQVSGRSVDQDLRLEWTAAEDRLGRPIVSGYVYNARAGTYATAMRLRVEALDASGQAVGATTGFVFGDVPPSGRSYFEIKAPAKAGSYRVTIQGFSWRAYGAGGG
Ga0209648_10004215153300026551Grasslands SoilMEGAGATRSIEPAVRRAVALVTVLICSTAGVAGGQVSGRSVDQDLRLEWTAAEDRLGRPIVSGYVYNARAGTYATAMRLRVEALDASGQAVGATTGFVFGDVPPSGRSYFEIKAPAKAGSYRVTIQGFSWRAYGAGGG
Ga0209648_1057482413300026551Grasslands SoilTRSIEPAIRRAVALVTLLICSTAGVAGAQVSGRSVDEDLRLEWTAAEDRRGRPIVSGYVYNQRSGTYATSMRLQVEALDASGQAVGSTTGFVFGDVPPSGRSYFEIKAPAKAASYRVTIQGFSWRGYGAGGG
Ga0209981_104393913300027378Arabidopsis Thaliana RhizosphereSIASAGSIAMLLALLIAGPVGAQITGRALAPDLRVEWSAEEDRRGRTVVSGYVYNERAGSYATGMRLRVEALDGSGQTVGSTTGYVLGDVPPSNRSYFEVKAPAKAAAYRVTVQSYTWRGYGAGGG
Ga0209995_100138923300027471Arabidopsis Thaliana RhizosphereMRQTVDVVGTRRSIASAGSIAMLLALLIAGPVGAQITGRALAPDLRVEWSAEEDRRGRTVVSGYVYNERAGSYATGVRLRVETLDGSGQTVGSTTGYVLGDVPPSNRSYFEVKAPAKAAAYRVTVQSYTWRGYGAGGG
Ga0209999_102171523300027543Arabidopsis Thaliana RhizosphereRRSIASAGSIAMLLALLIAGPVGAQITGRALAPDLRVEWSAEEDRRGRTVVSGYVYNERAGSYATGMRLRVEALDGSGQTVGSTTGYVLGDVPPSNRSYFEVKAPAKAAAYRVTVQSYTWRGYGAGGG
Ga0209178_144111913300027725Agricultural SoilMTRSEIRRAAVTIALLALVAAAGPAGAQVLGRPADGDLRLEWTAAEDRRGRPIVSGYIYNLRGGTYATSVRLLVEALDASGATVGSTSGFVFGDVPPSDRSYFEIKAPPRAASYRVSIQTFSWRSYG
Ga0209073_1017682423300027765Agricultural SoilMTRPEIRRAAVTIALLALVAAAGPASAQVLGRPADGDLRLEWTAAEDRRGRPIVSGYIYNLRGGTYATSVRLLVEALDASGATVGSTSGFVFGDVPPSDRSYFEIKAPPRAASYRVSIQTFSWRSYGAGGG
Ga0209177_1018730413300027775Agricultural SoilMMRPEIRRAAVTIALLALVAAAGPAGAQVLGRPADGDLRLEWTAAEDRRGRPIVSGYIYNLRGGTYATSVRLLVEALDASGATVGSTSGFVFGDVPPSDRSYFEIKAPPRAASYRVSIQTFSWRSYGAGGG
Ga0209726_1005599323300027815GroundwaterMRRLTRGVSTAVLAALLLCLSAGAARAQVQARSVDSDLRVEWTGSEDRRGRPVVSGYVYNQRAGGYAVSVRLLVEALDGSGQVIGSTIGYVLGEVPPSNRSYFEIRAPAKAASYRVTIESFAWRAYGAGGG
Ga0209488_1012460223300027903Vadose Zone SoilMEGAGATRSIEPAVRRAVALVTVLICSTAGVAGAQVSGRSVDQDLRLEWTAAEDRLGRPIVSGYVYNARAGTYATAMRLRVEALDASGQAVGATTGFVFGDVPPSGRSYFEIKAPAKAGSYRVTIQGFSWRAYGAGGG
Ga0209488_1070613413300027903Vadose Zone SoilMEGVGATGSIEPAIRRAVALVTLLICSTAGVAGAQVSGRAVDEDLRLEWTAAEDRRGRPIVSGYVYNQRAGTYATSMRLQVEALDASGQAVGSTTGFVFGDVPPSGRSYFEIKAPAKAASYRVTIQGFSWRGYGAGGG
Ga0209859_106863323300027954Groundwater SandGILAALLLFMGAGPVAAQGFSRPADADLRLEWAGAEDRRGRPLVSGYVYNQRAGSYATSVQLLVEALDASGQVVGSTSGFVLGDVPPSSRSYFEARAPAKAASYRVTIQSFSWRTYGAGG
Ga0209526_1000708143300028047Forest SoilMAGVMAARSIEPAIRRAVALVTLLLGSAFGIVGGPVEAQVSGRPTDQDLRLEWTAGEDRRGRPIVSGYVYNQRAGLYATSMRLQVEALDASGQAVGSTTGFVFGDVPPSDRSYFEIKAPARAASYRVTIQTFAWRAYGAGGG
Ga0268265_1004812323300028380Switchgrass RhizosphereMRRLARGMSTAVLAGLLMCLGGGAARAQVQTRSAESDLRVEWTGSEDRRGRSVVSGYVYNQRAGSYAVSVRLLVEALDGSGQVAGSTTGYVFGDVPPSNRSYFEIKAPAKAASYRVTIESFAWRAYGAGGG
Ga0257175_101622243300028673SoilMEGVGATRSIEPAIRRAVALVTLLICSTAGVAGAQVSGRAVDEDLRLEWTAAEDRRGRPIVSGYVYNQRAGTYATSMRLQVEALDASGQAVGSTTGFVFGDVPPSGRSYFEIKAPAKAASYRVTVQGFSWRGYGA
Ga0307504_1033089523300028792SoilMPNIANVVSTSGAVLRAGALALLLIVVAAGGAGAQGFGRPAEADLRVEWTGSEDRGGRPVVSGYVYNQRAGSYAIAVRLLVEALDASGQVVGSTTGSLLGQVPPSGRAYFDIKAPAKAASYRVTIQSFSWRTYGAGG
Ga0307281_1001655523300028803SoilMRRFARGMSTGVLAALLICLGAGAGRAQVQTRSVDGDLRVEWTGSEDRRGRPVVSGYVYNQRAGSYAGSVRLRVEALDGSGQVVGSTIGYVLGEVPPSNRSYFEIKAPAKAASYRVTVESFAWRAYGAGGG
Ga0307281_1002993723300028803SoilVWRSARSVSTVVLAALLICLGAGAARAQVQTRSVDRDLRVEWTGSEDRRGRPVVSGYVYNQRAGSYAVSVRLRVEALDGSGQVAGSTIGYVLGEVPPSNRSYFEIKAPAKAASYRVTIESFAWRAYGAGGG
Ga0307310_1074929313300028824SoilSDHDRGALVSVAERGGRAVALGTLLIGSAFGIVAGTADAEVSGRPVDQDLRLEWTAEEDRRGRPIVSGYVYNQRAGTYATSMRLQVEAVDASGQAVGSTTGFVFGDVPPSGRSYFEIKAPVRAASYRVTIQTFAWRAYGAGGG
Ga0307312_1026912523300028828SoilVSVAERGGRAVALGTLLIGSAFGIVAGTADAEVSGRPVDQDLRLEWTAGEDRRGRPIVSGYVYNQRAGSYASSMRLQVEALDASGQAVGSTTGFVFGDVPPSGRSYFEIKAPARAASYRVTIQTFAWRAYGAGGG
Ga0307304_1021859513300028885SoilDAEVSGRQVDQDLRLEWTAGEDRRGRPIVSGYVYNQRAGTYATSMRLQVEAVDASGQAVGSTTGFVFGDVPPSGRSYFEIKAPARAASYRVTIQTFAWRAYGAGGG
Ga0299906_1106718113300030606SoilALDLGVRMTGGSTVWRSARSVNTVVLAALLICLGAGAARAQVHTRSVDSDLRVEWTGSEDRRGRPVVSGYVYNQRAGGYAASVRLRVEALDGSGQVAGSTIGYVLGEVPPSNRSYFEITAPAEAASHRVTIESFEWRSYGAGGG
(restricted) Ga0255311_100586333300031150Sandy SoilMRRLARGMSTAALVTLLICLGAGGARAQVQTRSVDSDLRLESTDSEDRRGRPVVSGYVYNQRAGSYAISVRIRVEALDGSGQVAGSTIGYVLGDVPPSNRSYFEIKAPAKAASYRVTIESFAWRAYGAGGG
Ga0307501_1022321213300031152SoilGALVSVAERGGRAVALGTLLIGSAFGIVAGTADAEVSGRPVDQDLRLEWTAGEDRRGRPIVSGYVYNQRAGTYATSMRLQVEAVDGSGQAVGSTTGFVFGDVPPSGRSYFEIKAPARAASYRVTIQTFAWRAYGAGGG
(restricted) Ga0255310_1000186373300031197Sandy SoilMRRLARGMSTAALVTLLICLGAGGARAQVQTRSVDSDLRLESTDSEDRRGRPVVSGYVYNQRAGSYAISVRIRVEALDGSGQVAGSTIGYVLGDVPPSNRSYFEIKAPARAASYRVTIESFAWRAYG
Ga0307505_1055789623300031455SoilGAAIAQMQTRSVDSDLRVEGTDSEDRRGRPVMSGYVYNQRAGSYAVSVRLLVEALDGSGQVVGSTTGYVLGDVPPSNRSYFEIKAPAKAASYRVTIASFAWRSYGAGGG
Ga0307469_1031836433300031720Hardwood Forest SoilMPRLAGGVSTAVLAALLLCLGAGAARAQVQTRSVESDLRVESSGSEDRRGRPVVSGYVYNQRAGSYAAGVRLLVEALDGSGQVLGSTTGYVLGDVPPSNRSYFEIKAPAKAASYRVTIESFAWRGYGAGGG
Ga0307469_1107309123300031720Hardwood Forest SoilMRRFARSVRTALLATLLICLGVGAGRAQVQTRSADSDLRVESTESEDRRGRPVVSGYVYNQRAGSYAVGVRLRVEALDGSGQVTDSTIGYVLGEVPPSNRSYFEIKAPAKAASYRVTIESFSWRAYGAGGG
Ga0214473_10002493243300031949SoilMPRLADAVSTRRFARRAGILAALLLFMGAGSVAAQGFSRSADADLRLEWAGAEDRRGRPLVSGYVYNQRAGSYATSVHLLVEALDASGQVVGSTSGFVLGDVPPSSRSYFEVKAPAKAASYRVTIRSFSWRTYGAGGL
Ga0307470_1011162423300032174Hardwood Forest SoilVNSRVARWRLGDNGETMRRLARGTSTVLAALLIWAGAAVAQVQTRSVDGDLRVEGTGSEDRRGRPVVSGYVYNQRAGTYAVSVRLLVEALDPAGQVVGSTIGYVFGDVPPSNRSYFEIKAPAKAANYRITIASFTWRAYGAGGG
Ga0307471_10355942913300032180Hardwood Forest SoilMRRFARSVRTALLATLLICLGVGAGRAQVQTRSADSDLRVESTESEDRRGRPVVSGYVYNQRAGSYAVGVRLRVEALDGSGQVTGSTIGYVLGEVPPSNRSYFEIKAPAKAASYR
Ga0326729_105000823300033432Peat SoilQTAVRSRGRTAIGGAAALLSLLIASSTGVVSAQVFGRSTEQDVRLEWTAREDRRGRPVVTGYLYNQRAGSYATSVRLLVETLDGSGQVAGSTTGFVFGDVPPSDRSYFEVKAPAKAASYRVTIQTLSWRTYGAGGG
Ga0326726_10015393123300033433Peat SoilMPGQTAVRSRGRTAIGGAAALLSLLIASSTGVVSAQVFGRSTEQDVRLEWTAREDRRGRPVVTGYLYNQRAGSYATSVRLLVETLDGSGQVAGSTTGFVFGDVPPSDRSYFEVKAPAKAASYRVTIQTLSWRTYGAGGG
Ga0247829_1147450913300033550SoilVHSRRRSPSAGPASAQVLGRPADGDLRLEWTAAEDRRGRPIVSGYIYNLRGGTYATSVRLLVEALDASGATVGSTSGFVFGDVPPSDRSYFEIKAPPRAASYRVSIQTFSWRSYGAGGG
Ga0247830_1064425523300033551SoilMRRLARGMSTAVLVTVLVGLGAGGARAQVQTRSVDSDLRVESSDSEDRRGRPVVSGYVYNQRAGSYAISVRLRAEALDGSGQVVGSTIGYVLGDVPPSNRSYFEIKAPTRAASYRVTIESFAWRAYGAGGG
Ga0364924_122406_7_3813300033811SedimentVSTVVLAALLICLGAGAARAQVQTRSVDSDLRVEWTGSEDRRGRPVVSGYVYNQRAGGYAVSVRLRVEALDGSGQVAGSTIGYVLGEVPPSNRSYFEIQAPAKAASYRVTIESFAWRAYGAGGG
Ga0326723_0036231_1357_17763300034090Peat SoilMPGQTAVRSRGRTAIGGAAALLSLLIASSTGVVSAQVFGRSTEQDVRLEWTAREDRRGRPVVTGYLYNQRAGSYATSVRLLVEALDGSGQVAGSTTGFVFGDVPPSDRSYFEVKAPAKAASYRVTIQTLSWRTYGAGGG




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice