NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F025461



Overview

Basic Information
Family ID F025461
Family Type Metagenome / Metatranscriptome
Number of Sequences 201
Average Sequence Length 73 residues
Representative Sequence MTPKRMALAAVVVLTVVFLSPVAYMAGQRASVWLTAPAAEPTGPWPSVDEAEREAGPTERSKPPVRPGFGDI
Number of Associated Samples 168
Number of Associated Scaffolds 201

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 84.00 %
% of genes near scaffold ends (potentially truncated) 23.38 %
% of genes from short scaffolds (< 2000 bps) 76.62 %
Associated GOLD sequencing projects 159
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.37

Hidden Markov Model (interactive logo, powered by Skylign)

Most Common Taxonomy
Group Bacteria (94.030 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil (12.935 % of family members)
Environment Ontology (ENVO) Unclassified (32.338 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere (32.338 % of family members)
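
As a quick sanity check, the "% of family members" figures above can be converted back into approximate member counts. The sketch below assumes each percentage is taken over the 201 sequences listed under Basic Information; that denominator is an inference from this page rather than something the database states, and `members_from_percent` is a hypothetical helper name.

```python
FAMILY_SIZE = 201  # "Number of Sequences" from Basic Information (assumed denominator)


def members_from_percent(percent: float, family_size: int = FAMILY_SIZE) -> int:
    """Convert a '% of family members' value to the nearest whole count."""
    return round(family_size * percent / 100)


# Percentages quoted in the Most Common Taxonomy / Most Common Ecosystem blocks.
quoted = [
    ("Group: Bacteria", 94.030),
    ("GOLD Ecosystem: Soil", 12.935),
    ("EMPO: Plant rhizosphere", 32.338),
]

for label, percent in quoted:
    count = members_from_percent(percent)
    print(f"{label}: about {count} of {FAMILY_SIZE} members")
```

Rounding to the nearest integer is a deliberate choice here, since the page reports percentages to only two or three decimal places.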




Multiple Sequence Alignments


Structure & Topology

Predicted Secondary Structure and Topology

Classification: Fibrous
Signal Peptide: Yes
Secondary Structure distribution: α-helix: 40.00%    β-sheet: 0.00%    Coil/Unstructured: 60.00%

Predicted 3D Structure

Structure Viewer (interactive, powered by PDBe Molstar)
Per-residue confidence (pLDDT) bands: 0-50    51-70    71-90    91-100
pTM-score: 0.37

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
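
The confidence rule quoted above can be sketched in a few lines. This is only an illustration, assuming the 0.7 pTM cutoff from the note and the four pLDDT bands from the viewer legend; the function names are hypothetical, and assigning fractional pLDDT values (for example 50.4) to the next band up is an assumption the page does not address.

```python
PTM_CUTOFF = 0.7  # cutoff quoted in the "Low Quality Model" note

# Upper bound and label of each pLDDT band, as listed in the viewer legend.
PLDDT_BANDS = [(50, "0-50"), (70, "51-70"), (90, "71-90"), (100, "91-100")]


def is_low_confidence(ptm: float, cutoff: float = PTM_CUTOFF) -> bool:
    """Return True when a model's pTM-score falls below the cutoff."""
    return ptm < cutoff


def plddt_band(plddt: float) -> str:
    """Return the legend band that a per-residue pLDDT value falls into."""
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    for upper, label in PLDDT_BANDS:
        if plddt <= upper:
            return label
    return PLDDT_BANDS[-1][1]  # not reached for values within 0..100


# F025461 reports a pTM-score of 0.37.
print(is_low_confidence(0.37))
print(plddt_band(85.0))
```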



Gene Neighborhood

Neighboring Pfam domains

Pfam ID    Name    % Frequency in 201 Family Scaffolds
PF12706    Lactamase_B_2    32.34
PF01208    URO-D    17.41
PF09335    SNARE_assoc    8.46
PF13193    AMP-binding_C    5.47
PF01541    GIY-YIG    2.49
PF00072    Response_reg    1.49
PF10518    TAT_signal    1.49
PF02518    HATPase_c    1.49
PF00672    HAMP    1.00
PF13416    SBP_bac_8    1.00
PF00496    SBP_bac_5    0.50
PF13432    TPR_16    0.50
PF01979    Amidohydro_1    0.50
PF00753    Lactamase_B    0.50
PF12399    BCA_ABC_TP_C    0.50
PF00383    dCMP_cyt_deam_1    0.50
PF01738    DLH    0.50

Neighboring Clusters of Orthologous Genes (COGs)

COG ID    Name    Functional Category    % Frequency in 201 Family Scaffolds
COG0407    Uroporphyrinogen-III decarboxylase HemE    Coenzyme transport and metabolism [H]    17.41
COG0398    Uncharacterized membrane protein YdjX, related to fungal oxalate transporter, TVP38/TMEM64 family    Function unknown [S]    8.46
COG0586    Membrane integrity protein DedA, putative transporter, DedA/Tvp38 family    Cell wall/membrane/envelope biogenesis [M]    8.46
COG1238    Uncharacterized membrane protein YqaA, VTT domain    Function unknown [S]    8.46



Phylogeny

NCBI Taxonomy

Name    Rank    Taxonomy    Distribution
All Organisms    root    All Organisms    94.03 %
Unclassified    root    N/A    5.97 %


Associated Scaffolds


Scaffold    Taxonomy    Length
3300002122|C687J26623_10215223    Not Available    532
3300002245|JGIcombinedJ26739_100609325    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    969
3300002886|JGI25612J43240_1011780    All Organisms → cellular organisms → Bacteria → Proteobacteria    1291
3300002907|JGI25613J43889_10197366    Not Available    538
3300002914|JGI25617J43924_10090016    All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria    1106
3300003994|Ga0055435_10122661    All Organisms → cellular organisms → Bacteria → Proteobacteria    705
3300003995|Ga0055438_10016923    All Organisms → cellular organisms → Bacteria    1596
3300004052|Ga0055490_10101759    All Organisms → cellular organisms → Bacteria    810
3300004058|Ga0055498_10082221    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    625
3300004114|Ga0062593_100145447    All Organisms → cellular organisms → Bacteria    1786
3300004463|Ga0063356_100883703    All Organisms → cellular organisms → Bacteria → Proteobacteria    1259
3300004463|Ga0063356_104832584    All Organisms → cellular organisms → Bacteria    579
3300005176|Ga0066679_10668062    All Organisms → cellular organisms → Bacteria    676
3300005294|Ga0065705_10700510    All Organisms → cellular organisms → Bacteria    652
3300005336|Ga0070680_100269151    All Organisms → cellular organisms → Bacteria → Proteobacteria    1442
3300005341|Ga0070691_10079604    All Organisms → cellular organisms → Bacteria    1602
3300005406|Ga0070703_10048807    All Organisms → cellular organisms → Bacteria → Proteobacteria    1346
3300005434|Ga0070709_10093923    All Organisms → cellular organisms → Bacteria    1984
3300005440|Ga0070705_100162490    All Organisms → cellular organisms → Bacteria    1494
3300005444|Ga0070694_101231718    All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium    628
3300005445|Ga0070708_100042245    All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria    4001
3300005445|Ga0070708_100283520    All Organisms → cellular organisms → Bacteria    1559
3300005458|Ga0070681_11069902    All Organisms → cellular organisms → Bacteria    727
3300005467|Ga0070706_100071244    All Organisms → cellular organisms → Bacteria    3214
3300005468|Ga0070707_100031819    All Organisms → cellular organisms → Bacteria    5025
3300005468|Ga0070707_100556809    All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium    1109
3300005536|Ga0070697_100023462    All Organisms → cellular organisms → Bacteria    4906
3300005542|Ga0070732_10849157    Not Available    557
3300005546|Ga0070696_100208933    All Organisms → cellular organisms → Bacteria    1460
3300005557|Ga0066704_10617991    All Organisms → cellular organisms → Bacteria    694
3300005880|Ga0075298_1015478    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    682
3300006041|Ga0075023_100036247    All Organisms → cellular organisms → Bacteria    1480
3300006047|Ga0075024_100490108    All Organisms → cellular organisms → Bacteria    642
3300006049|Ga0075417_10608135    All Organisms → cellular organisms → Bacteria    556
3300006804|Ga0079221_10064815    All Organisms → cellular organisms → Bacteria    1685
3300006806|Ga0079220_10063030    All Organisms → cellular organisms → Bacteria    1797
3300006847|Ga0075431_101546863    All Organisms → cellular organisms → Bacteria → Proteobacteria    621
3300006854|Ga0075425_100390441    All Organisms → cellular organisms → Bacteria    1603
3300007255|Ga0099791_10000142    All Organisms → cellular organisms → Bacteria    23859
3300007258|Ga0099793_10573575    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    564
3300009038|Ga0099829_10006031    All Organisms → cellular organisms → Bacteria → Proteobacteria    7439
3300009053|Ga0105095_10111560    All Organisms → cellular organisms → Bacteria    1485
3300009078|Ga0105106_11312980    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    513
3300009143|Ga0099792_10089101    All Organisms → cellular organisms → Bacteria    1597
3300009147|Ga0114129_10839554    All Organisms → cellular organisms → Bacteria    1169
3300009147|Ga0114129_12234177    All Organisms → cellular organisms → Bacteria    658
3300009162|Ga0075423_10000946    All Organisms → cellular organisms → Bacteria → Proteobacteria    24653
3300009171|Ga0105101_10289200    All Organisms → cellular organisms → Bacteria    792
3300009174|Ga0105241_10958427    All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium    798
3300009174|Ga0105241_12531648    All Organisms → cellular organisms → Bacteria    514
3300010400|Ga0134122_10518594    All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium    1083
3300010400|Ga0134122_12247828    All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium    590
3300010401|Ga0134121_12209637    All Organisms → cellular organisms → Bacteria    587
3300011003|Ga0138514_100040208    All Organisms → cellular organisms → Bacteria    917
3300011119|Ga0105246_11035410    All Organisms → cellular organisms → Bacteria    745
3300011119|Ga0105246_11591035    All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium    617
3300011395|Ga0137315_1021415    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    847
3300011433|Ga0137443_1186341    All Organisms → cellular organisms → Bacteria    619
3300011438|Ga0137451_1115923    All Organisms → cellular organisms → Bacteria    825
3300012034|Ga0137453_1089865    Not Available    599
3300012096|Ga0137389_10084074    All Organisms → cellular organisms → Bacteria    2494
3300012174|Ga0137338_1006742    All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium    2044
3300012202|Ga0137363_10031212    All Organisms → cellular organisms → Bacteria    3683
3300012202|Ga0137363_10217765    All Organisms → cellular organisms → Bacteria    1539
3300012225|Ga0137434_1014541    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    960
3300012226|Ga0137447_1073681    All Organisms → cellular organisms → Bacteria    655
3300012355|Ga0137369_10300134    All Organisms → cellular organisms → Bacteria → Proteobacteria    1192
3300012360|Ga0137375_10607583    All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium    909
3300012362|Ga0137361_10187331    All Organisms → cellular organisms → Bacteria → Proteobacteria    1867
3300012685|Ga0137397_10013995    All Organisms → cellular organisms → Bacteria → Proteobacteria    5615
3300012923|Ga0137359_11590559    All Organisms → cellular organisms → Bacteria    541
3300012931|Ga0153915_10345451    All Organisms → cellular organisms → Bacteria → Proteobacteria    1672
3300012931|Ga0153915_12404298    Not Available    616
3300012944|Ga0137410_10012865    All Organisms → cellular organisms → Bacteria    5686
3300012957|Ga0164303_10361540    All Organisms → cellular organisms → Bacteria    881
3300014262|Ga0075301_1082575    All Organisms → cellular organisms → Bacteria    671
3300014308|Ga0075354_1006256    All Organisms → cellular organisms → Bacteria    1601
3300014870|Ga0180080_1049100    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    693
3300014881|Ga0180094_1079088    All Organisms → cellular organisms → Bacteria    736
3300014882|Ga0180069_1068526    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    816
3300014884|Ga0180104_1004238    All Organisms → cellular organisms → Bacteria → Proteobacteria    3077
3300014885|Ga0180063_1038882    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    1346
3300015254|Ga0180089_1057019    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    778
3300015259|Ga0180085_1056796    All Organisms → cellular organisms → Bacteria    1123
3300015264|Ga0137403_10183955    All Organisms → cellular organisms → Bacteria → Proteobacteria    2033
3300015371|Ga0132258_13601877    All Organisms → cellular organisms → Bacteria    1059
3300015372|Ga0132256_103649832    All Organisms → cellular organisms → Bacteria    518
3300017927|Ga0187824_10001983    All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria    5287
3300017930|Ga0187825_10020075    All Organisms → cellular organisms → Bacteria → Proteobacteria    2240
3300017993|Ga0187823_10023215    All Organisms → cellular organisms → Bacteria → Proteobacteria    1575
3300017997|Ga0184610_1000637    All Organisms → cellular organisms → Bacteria → Proteobacteria    8087
3300018000|Ga0184604_10097018    All Organisms → cellular organisms → Bacteria    907
3300018027|Ga0184605_10328223    All Organisms → cellular organisms → Bacteria    691
3300018031|Ga0184634_10118750    All Organisms → cellular organisms → Bacteria → Proteobacteria    1168
3300018051|Ga0184620_10229802    All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium    621
3300018052|Ga0184638_1017690    All Organisms → cellular organisms → Bacteria    2494
3300018053|Ga0184626_10017861    All Organisms → cellular organisms → Bacteria → Proteobacteria    2853
3300018054|Ga0184621_10315071    Not Available    551
3300018059|Ga0184615_10278448    All Organisms → cellular organisms → Bacteria    935
3300018074|Ga0184640_10064464    All Organisms → cellular organisms → Bacteria    1540
3300018075|Ga0184632_10059605    All Organisms → cellular organisms → Bacteria    1654
3300018078|Ga0184612_10011609    All Organisms → cellular organisms → Bacteria → Proteobacteria    4463
3300018079|Ga0184627_10070523    All Organisms → cellular organisms → Bacteria    1827
3300018422|Ga0190265_10010529    All Organisms → cellular organisms → Bacteria    6922
3300018422|Ga0190265_10022005    All Organisms → cellular organisms → Bacteria → Proteobacteria    5113
3300018422|Ga0190265_10027640    All Organisms → cellular organisms → Bacteria → Proteobacteria    4654
3300018422|Ga0190265_10731780    All Organisms → cellular organisms → Bacteria → Proteobacteria    1111
3300018422|Ga0190265_10952326    All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium    980
3300018429|Ga0190272_10277451    All Organisms → cellular organisms → Bacteria    1280
3300018468|Ga0066662_11925894    All Organisms → cellular organisms → Bacteria    619
3300019259|Ga0184646_1612617    All Organisms → cellular organisms → Bacteria    638
3300019360|Ga0187894_10121827    All Organisms → cellular organisms → Bacteria    1362
3300019458|Ga0187892_10137238    All Organisms → cellular organisms → Bacteria    1391
3300019879|Ga0193723_1014158    All Organisms → cellular organisms → Bacteria    2498
3300019879|Ga0193723_1146564    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    636
3300019880|Ga0193712_1017410    All Organisms → cellular organisms → Bacteria    1502
3300019881|Ga0193707_1042454    All Organisms → cellular organisms → Bacteria    1466
3300019881|Ga0193707_1174067    All Organisms → cellular organisms → Bacteria    576
3300019883|Ga0193725_1031335    All Organisms → cellular organisms → Bacteria    1418
3300019883|Ga0193725_1033604    All Organisms → cellular organisms → Bacteria    1360
3300019886|Ga0193727_1069488    All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium    1091
3300020002|Ga0193730_1037558    All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium    1403
3300020004|Ga0193755_1027312    All Organisms → cellular organisms → Bacteria    1889
3300020021|Ga0193726_1020506    All Organisms → cellular organisms → Bacteria → Proteobacteria    3367
3300020060|Ga0193717_1169320    Not Available    623
3300020061|Ga0193716_1142872    Not Available    972
3300021086|Ga0179596_10205577    All Organisms → cellular organisms → Bacteria    960
3300021090|Ga0210377_10019069    All Organisms → cellular organisms → Bacteria    5059
3300021168|Ga0210406_11388975    All Organisms → cellular organisms → Bacteria    502
3300021432|Ga0210384_10004814    All Organisms → cellular organisms → Bacteria → Proteobacteria    14941
3300021432|Ga0210384_11700038    All Organisms → cellular organisms → Bacteria    536
3300021559|Ga0210409_10998925    All Organisms → cellular organisms → Bacteria    711
3300025165|Ga0209108_10030169    All Organisms → cellular organisms → Bacteria    3054
3300025324|Ga0209640_10064456    All Organisms → cellular organisms → Bacteria    3172
3300025535|Ga0207423_1003137    All Organisms → cellular organisms → Bacteria    2377
3300025904|Ga0207647_10573261    All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium    624
3300025910|Ga0207684_10092836    All Organisms → cellular organisms → Bacteria    2573
3300025910|Ga0207684_10150390    All Organisms → cellular organisms → Bacteria → Proteobacteria    2003
3300025910|Ga0207684_10551262    All Organisms → cellular organisms → Bacteria    986
3300025911|Ga0207654_10443133    All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium    909
3300025912|Ga0207707_10285022    All Organisms → cellular organisms → Bacteria    1430
3300025912|Ga0207707_10864562    All Organisms → cellular organisms → Bacteria    750
3300025922|Ga0207646_10241013    All Organisms → cellular organisms → Bacteria    1633
3300025922|Ga0207646_10345720    All Organisms → cellular organisms → Bacteria    1344
3300025934|Ga0207686_10324513    All Organisms → cellular organisms → Bacteria    1151
3300025965|Ga0210090_1019714    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    922
3300025971|Ga0210102_1071620    Not Available    748
3300025992|Ga0208775_1007662    All Organisms → cellular organisms → Bacteria    786
3300026015|Ga0208286_1003695    All Organisms → cellular organisms → Bacteria    975
3300026025|Ga0208778_1011011    Not Available    852
3300026285|Ga0209438_1000571    All Organisms → cellular organisms → Bacteria → Proteobacteria    11535
3300026285|Ga0209438_1013055    All Organisms → cellular organisms → Bacteria    2778
3300026358|Ga0257166_1055502    Not Available    567
3300026359|Ga0257163_1024256    All Organisms → cellular organisms → Bacteria    947
3300026371|Ga0257179_1035632    All Organisms → cellular organisms → Bacteria    620
3300026496|Ga0257157_1071031    All Organisms → cellular organisms → Bacteria    597
3300026515|Ga0257158_1004102    All Organisms → cellular organisms → Bacteria    1958
3300026535|Ga0256867_10020906    All Organisms → cellular organisms → Bacteria    2796
3300026555|Ga0179593_1203409    All Organisms → cellular organisms → Bacteria    1751
3300026557|Ga0179587_10260765    All Organisms → cellular organisms → Bacteria    1111
3300026557|Ga0179587_10735825    All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium    650
3300027650|Ga0256866_1066530    All Organisms → cellular organisms → Bacteria    961
3300027651|Ga0209217_1194246    All Organisms → cellular organisms → Bacteria    548
3300027765|Ga0209073_10064473    All Organisms → cellular organisms → Bacteria    1232
3300027765|Ga0209073_10471324    All Organisms → cellular organisms → Bacteria    525
(restricted) 3300027799|Ga0233416_10092611    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    1031
3300027815|Ga0209726_10029563    All Organisms → cellular organisms → Bacteria    4304
3300027862|Ga0209701_10046433    All Organisms → cellular organisms → Bacteria    2797
3300028047|Ga0209526_10584108    All Organisms → cellular organisms → Bacteria    718
3300028381|Ga0268264_10439721    All Organisms → cellular organisms → Bacteria    1261
3300028673|Ga0257175_1049395    All Organisms → cellular organisms → Bacteria    768
3300028812|Ga0247825_10420216    All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium    946
3300028819|Ga0307296_10210574    All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium    1056
3300028828|Ga0307312_11054824    All Organisms → cellular organisms → Bacteria    538
3300030006|Ga0299907_10191606    All Organisms → cellular organisms → Bacteria → Proteobacteria    1688
(restricted) 3300031150|Ga0255311_1006903    All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium    2253
(restricted) 3300031197|Ga0255310_10010920    All Organisms → cellular organisms → Bacteria    2332
(restricted) 3300031197|Ga0255310_10061440    All Organisms → cellular organisms → Bacteria    987
(restricted) 3300031197|Ga0255310_10170557    All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium    602
(restricted) 3300031248|Ga0255312_1074669    All Organisms → cellular organisms → Bacteria    819
(restricted) 3300031248|Ga0255312_1091482    All Organisms → cellular organisms → Bacteria    740
3300031720|Ga0307469_10023212    All Organisms → cellular organisms → Bacteria    3427
3300031720|Ga0307469_10155730    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    1718
3300031720|Ga0307469_10194507    All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium    1576
3300031720|Ga0307469_12165936    All Organisms → cellular organisms → Bacteria    541
3300031754|Ga0307475_10592324    All Organisms → cellular organisms → Bacteria    888
3300032174|Ga0307470_10590636    All Organisms → cellular organisms → Bacteria    828
3300032180|Ga0307471_100048343    All Organisms → cellular organisms → Bacteria    3485
3300032180|Ga0307471_100906997    All Organisms → cellular organisms → Bacteria    1048
3300032205|Ga0307472_100781920    All Organisms → cellular organisms → Bacteria    869
3300033412|Ga0310810_10002595    All Organisms → cellular organisms → Bacteria    19464
3300033432|Ga0326729_1001607    All Organisms → cellular organisms → Bacteria    4804
3300033433|Ga0326726_10634742    All Organisms → cellular organisms → Bacteria    1028
3300033502|Ga0326731_1021953    All Organisms → cellular organisms → Bacteria    1551
3300033513|Ga0316628_100099939    All Organisms → cellular organisms → Bacteria → Proteobacteria    3249
3300033513|Ga0316628_100130854    All Organisms → cellular organisms → Bacteria    2894
3300034090|Ga0326723_0001492    All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria    8561
3300034090|Ga0326723_0050418    All Organisms → cellular organisms → Bacteria    1759
3300034164|Ga0364940_0041260    All Organisms → cellular organisms → Bacteria    1224
3300034257|Ga0370495_0263504    All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium    565

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
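
The scaffold identifiers above appear to follow a `<Taxon OID>|<scaffold name>` pattern, optionally prefixed with `(restricted)`; for example, `3300002122` also appears as a Taxon OID under Associated Samples. The sketch below relies on that reading, which is an inference from this page rather than a documented format, and `ScaffoldRef` and `parse_scaffold_id` are hypothetical names introduced for illustration.

```python
from collections import Counter
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    taxon_oid: str       # assumed to match "Taxon OID" under Associated Samples
    scaffold_name: str
    restricted: bool     # True when the row carries the "(restricted)" marker


def parse_scaffold_id(raw: str) -> ScaffoldRef:
    """Split a scaffold identifier as shown in the Associated Scaffolds table."""
    text = raw.strip()
    restricted = text.startswith("(restricted)")
    if restricted:
        text = text[len("(restricted)"):].strip()
    taxon_oid, sep, scaffold_name = text.partition("|")
    if not sep or not taxon_oid or not scaffold_name:
        raise ValueError(f"unexpected scaffold identifier: {raw!r}")
    return ScaffoldRef(taxon_oid, scaffold_name, restricted)


# A few identifiers copied from the table above.
rows = [
    "3300004463|Ga0063356_100883703",
    "3300004463|Ga0063356_104832584",
    "(restricted) 3300027799|Ga0233416_10092611",
]
refs = [parse_scaffold_id(row) for row in rows]

# Count how many family scaffolds come from each sample.
scaffolds_per_sample = Counter(ref.taxon_oid for ref in refs)
print(scaffolds_per_sample)
print([ref.scaffold_name for ref in refs if ref.restricted])
```

Keeping the restricted flag alongside the parsed parts makes it easy to exclude those rows before any downstream use, in line with the JGI data usage note.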




Environmental Properties

Associated Habitat Types

Habitat    Taxonomy    Distribution
Soil    Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil    12.94%
Vadose Zone Soil    Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil    9.45%
Corn, Switchgrass And Miscanthus Rhizosphere    Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere    8.46%
Soil    Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil    6.97%
Groundwater Sediment    Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment    6.47%
Soil    Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil    4.98%
Hardwood Forest Soil    Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil    4.48%
Natural And Restored Wetlands    Environmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands    3.48%
Grasslands Soil    Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil    2.99%
Sandy Soil    Environmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil    2.99%
Populus Rhizosphere    Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere    2.99%
Peat Soil    Environmental → Terrestrial → Peat → Unclassified → Unclassified → Peat Soil    2.49%
Freshwater Sediment    Environmental → Aquatic → Freshwater → Sediment → Unclassified → Freshwater Sediment    1.49%
Terrestrial Soil    Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil    1.49%
Forest Soil    Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil    1.49%
Miscanthus Rhizosphere    Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere    1.49%
Corn Rhizosphere    Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere    1.49%
Freshwater Sediment    Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment    1.99%
Agricultural Soil    Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil    1.99%
Rice Paddy Soil    Environmental → Terrestrial → Soil → Wetlands → Unclassified → Rice Paddy Soil    1.99%
Corn Rhizosphere    Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere    1.99%
Sediment    Environmental → Aquatic → Freshwater → Lake → Sediment → Sediment    0.50%
Groundwater    Environmental → Aquatic → Freshwater → Groundwater → Contaminated → Groundwater    0.50%
Soil    Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil    0.50%
Surface Soil    Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil    0.50%
Switchgrass Rhizosphere    Environmental → Terrestrial → Soil → Unclassified → Unclassified → Switchgrass Rhizosphere    0.50%
Soil    Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil    0.50%
Soil    Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil    0.50%
Untreated Peat Soil    Environmental → Terrestrial → Soil → Wetlands → Unclassified → Untreated Peat Soil    0.50%
Microbial Mat On Rocks    Environmental → Terrestrial → Cave → Unclassified → Unclassified → Microbial Mat On Rocks    0.50%
Bio-Ooze    Environmental → Terrestrial → Cave → Unclassified → Unclassified → Bio-Ooze    0.50%
Sediment    Environmental → Terrestrial → Floodplain → Sediment → Unclassified → Sediment    0.50%
Soil    Environmental → Terrestrial → Agricultural Field → Unclassified → Unclassified → Soil    0.50%
Arabidopsis Rhizosphere    Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere    0.50%
Corn Rhizosphere    Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere    0.50%
Switchgrass Rhizosphere    Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere    0.50%
Arabidopsis Rhizosphere    Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Arabidopsis Rhizosphere    0.50%
Groundwater Sediment    Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment    1.00%
Freshwater Wetlands    Environmental → Aquatic → Freshwater → Wetlands → Unclassified → Freshwater Wetlands    1.00%
Watersheds    Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds    1.00%
Soil    Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil    1.00%
Soil    Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil    1.00%
Natural And Restored Wetlands    Environmental → Terrestrial → Soil → Wetlands → Unclassified → Natural And Restored Wetlands    1.00%
Soil    Environmental → Terrestrial → Soil → Loam → Unclassified → Soil    1.00%
Arabidopsis Thaliana Rhizosphere    Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere    1.00%




Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OID | Sample Name | Habitat Type
3300002122 | Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 10_2 | Environmental
3300002245 | Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027) | Environmental
3300002886 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_20cm | Environmental
3300002907 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_40cm | Environmental
3300002914 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm | Environmental
3300003994 | Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqA_D2 | Environmental
3300003995 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_TuleA_D2 | Environmental
3300004052 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - RushOxbow_ThreeSqC_D2 | Environmental
3300004058 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqB_D1 | Environmental
3300004114 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 5 | Environmental
3300004463 | Combined assembly of Arabidopsis thaliana microbial communities | Host-Associated
3300005176 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 | Environmental
3300005294 | Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL2 Bulk Soil | Environmental
3300005336 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaG | Environmental
3300005341 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-10-1 metaG | Environmental
3300005406 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-1 metaG | Environmental
3300005434 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaG | Environmental
3300005440 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-3 metaG | Environmental
3300005444 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-1 metaG | Environmental
3300005445 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG | Environmental
3300005458 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C8-3B metaG | Environmental
3300005467 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG | Environmental
3300005468 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG | Environmental
3300005536 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG | Environmental
3300005542 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1 | Environmental
3300005546 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaG | Environmental
3300005557 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 | Environmental
3300005880 | Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_80N_201 | Environmental
3300006041 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2014 | Environmental
3300006047 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2013 | Environmental
3300006049 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD1 | Host-Associated
3300006804 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200 | Environmental
3300006806 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100 | Environmental
3300006847 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD5 | Host-Associated
3300006854 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4 | Host-Associated
3300007255 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 | Environmental
3300007258 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 | Environmental
3300009038 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG | Environmental
3300009053 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (3) Depth 19-21cm March2015 | Environmental
3300009078 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (1) Depth 10-12cm September2015 | Environmental
3300009143 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 | Environmental
3300009147 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2) | Host-Associated
3300009162 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2 | Host-Associated
3300009171 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P8 Core (1) Depth 19-21cm May2015 | Environmental
3300009174 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-4 metaG | Host-Associated
3300010400 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2 | Environmental
3300010401 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1 | Environmental
3300011003 | Agricultural soil microbial communities from Tamara ranch near Red Deer, Alberta, Canada - d1t9i015 | Environmental
3300011119 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M6-4 metaG | Host-Associated
3300011395 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT200_2 | Environmental
3300011433 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT300_2 | Environmental
3300011438 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT500_2 | Environmental
3300012034 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT526_2 | Environmental
3300012096 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG | Environmental
3300012174 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMLT366_2 | Environmental
3300012202 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG | Environmental
3300012225 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT860_2 | Environmental
3300012226 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT400_2 | Environmental
3300012355 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaG | Environmental
3300012360 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_113_16 metaG | Environmental
3300012362 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG | Environmental
3300012685 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaG | Environmental
3300012923 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG | Environmental
3300012931 | Freshwater wetland microbial communities from Ohio, USA - Open water 3 Core 3 Depth 3 metaG | Environmental
3300012944 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly) | Environmental
3300012957 | Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_207_MG | Environmental
3300014262 | Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_TuleA_D1 | Environmental
3300014308 | Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_TuleC_D1 | Environmental
3300014870 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT560_16_10D | Environmental
3300014881 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLIBT47_16_1Da | Environmental
3300014882 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT231B'_16_10D | Environmental
3300014884 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT730_16_1Da | Environmental
3300014885 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLIBT47_16_10D | Environmental
3300015254 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT860_16_10D | Environmental
3300015259 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT730_16_10D | Environmental
3300015264 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly) | Environmental
3300015371 | Combined assembly of cpr5 and col0 rhizosphere and soil | Host-Associated
3300015372 | Soil combined assembly | Host-Associated
3300017927 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_4 | Environmental
3300017930 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_5 | Environmental
3300017993 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_3 | Environmental
3300017994 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_2 | Environmental
3300017997 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_coex | Environmental
3300018000 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_5_coex | Environmental
3300018027 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_30_coex | Environmental
3300018031 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_200_b1 | Environmental
3300018051 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_5_b1 | Environmental
3300018052 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b2 | Environmental
3300018053 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_60_b1 | Environmental
3300018054 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_30_b1 | Environmental
3300018059 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_65_coex | Environmental
3300018074 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_200_b2 | Environmental
3300018075 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b1 | Environmental
3300018078 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_60_coex | Environmental
3300018079 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b1 | Environmental
3300018422 | Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 124 T | Environmental
3300018429 | Populus adjacent soil microbial communities from riparian zone of Shoshone River, Wyoming, USA - 504 T | Environmental
3300018468 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111 | Environmental
3300019259 | Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100 (Metagenome Metatranscriptome) | Environmental
3300019360 | White microbial mat communities from a lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - GBC170108-1 metaG | Environmental
3300019458 | Bio-ooze microbial communities from a basaltic lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - MA170107-3 metaG | Environmental
3300019879 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2m2 | Environmental
3300019880 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H3a1 | Environmental
3300019881 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U3c2 | Environmental
3300019883 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2a2 | Environmental
3300019886 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2c2 | Environmental
3300020002 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1a1 | Environmental
3300020004 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H1a2 | Environmental
3300020021 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H2c1 | Environmental
3300020060 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U2c2 | Environmental
3300020061 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U2c1 | Environmental
3300021086 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_1_08_16fungal (Illumina Assembly) | Environmental
3300021090 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_65_b1 redo | Environmental
3300021168 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-M | Environmental
3300021432 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M | Environmental
3300021559 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-M | Environmental
3300025165 | Soil microbial communities from Rifle, Colorado, USA - sediment 10ft 1 | Environmental
3300025324 | Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 10_1 (SPAdes) | Environmental
3300025535 | Wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_ThreeSqA_D1 (SPAdes) | Environmental
3300025904 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - C2 (SPAdes) | Host-Associated
3300025910 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes) | Environmental
3300025911 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-4 metaG (SPAdes) | Host-Associated
3300025912 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C8-3B metaG (SPAdes) | Environmental
3300025922 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes) | Environmental
3300025934 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-4 metaG (SPAdes) | Host-Associated
3300025965 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqA_D2 (SPAdes) | Environmental
3300025971 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - RushOxbow_ThreeSqC_D2 (SPAdes) | Environmental
3300025992 | Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_20C_0N_104 (SPAdes) | Environmental
3300026015 | Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_25C_80N_401 (SPAdes) | Environmental
3300026025 | Rice paddy soil microbial communities from Twitchell Island, California, USA - SF_Rice_20C_80N_103 (SPAdes) | Environmental
3300026285 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_20cm (SPAdes) | Environmental
3300026358 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-14-B | Environmental
3300026359 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-06-A | Environmental
3300026371 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-01-B | Environmental
3300026496 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-69-A | Environmental
3300026515 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-03-A | Environmental
3300026535 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT150D86 (HiSeq) | Environmental
3300026555 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (PacBio error correction) | Environmental
3300026557 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal | Environmental
3300027650 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT152D67 HiSeq | Environmental
3300027651 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM3H0_M2 (SPAdes) | Environmental
3300027765 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100 (SPAdes) | Environmental
3300027799 (restricted) | Sediment microbial communities from Lake Towuti, South Sulawesi, Indonesia - Sediment_Towuti_2014_0_MG | Environmental
3300027815 | Subsurface groundwater microbial communities from S. Glens Falls, New York, USA - GMW46 contaminated, 5.4 m (SPAdes) | Environmental
3300027862 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes) | Environmental
3300028047 | Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes) | Environmental
3300028381 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S3-2 (SPAdes) | Host-Associated
3300028673 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-69-B | Environmental
3300028812 | Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_Vanillin_Day48 | Environmental
3300028819 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_153 | Environmental
3300028828 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_202 | Environmental
3300030006 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT152D67 | Environmental
3300031150 (restricted) | Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH4_T0_E4 | Environmental
3300031197 (restricted) | Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH1_T0_E1 | Environmental
3300031248 (restricted) | Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH5_T0_E5 | Environmental
3300031720 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515 | Environmental
3300031754 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515 | Environmental
3300032174 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_05 | Environmental
3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental
3300032205 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05 | Environmental
3300033412 | Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - NC | Environmental
3300033432 | Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF6AY SIP fraction | Environmental
3300033433 | Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF15MN | Environmental
3300033502 | Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF9FY SIP fraction | Environmental
3300033513 | Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_M2_C1_D5_C | Environmental
3300034090 | Peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF00N | Environmental
3300034164 | Sediment microbial communities from East River floodplain, Colorado, United States - 14_s17 | Environmental
3300034257 | Peat soil microbial communities from wetlands in Alaska, United States - Frozen_pond_02D_17 | Environmental





Family Sequences

Note: Some of these sequences are restricted under the data usage policy of the Joint Genome Institute (JGI). Using any of their features below requires a license from the datasets' corresponding author(s).

Protein ID | Sample Taxon ID | Habitat | Sequence
C687J26623_102152232 | 3300002122 | Soil | MTVKRMALAAVVVLIAVFLSPLAYLAGQRASVWLTTPAAERTGPWPSQDEAEPESGAGDRFKPAVRPGFGDI*
JGIcombinedJ26739_1006093252 | 3300002245 | Forest Soil | MTPRRMALAALLVLTVVFLSPIAYFAGQRAGVWLTTPADDRSGPWPSVDDGERDSSAPAERSKPAVRPGFGEI*
JGI25612J43240_10117801 | 3300002886 | Grasslands Soil | MTPRRMALAAVLVLTVVFLSPIAYLAGQRAGVWLSMPAAERSGSWPSVDEGERESAAPAERAKLQARPGFGEI*
JGI25613J43889_101973662 | 3300002907 | Grasslands Soil | MTPKLMALAAVVVLTVVFLSPVAYMAGQRATVWLTAPAEESTGPWPSVDEAEREQGPTERAKPAVRPGFGDI*
JGI25617J43924_100900162 | 3300002914 | Grasslands Soil | MTPRRMALAAVLVLTVVFLSPIAYLAGQRAGVWLSTPAAERSGPWPSVDEGEREAAAPAERAKPLARPGFGEI*
Ga0055435_101226612 | 3300003994 | Natural And Restored Wetlands | MTPKLMAVAAIAVLTVVFLSPVAYMAGQQASVWLTAPAAEPAGPWPSVDEAEREGGPTERAKPPVRPGFGDI*
Ga0055438_100169232 | 3300003995 | Natural And Restored Wetlands | MTPKLMAVAAIAVLTVVFLSPVAYMAGQRASVWLTAPAAEPAGPWPSVDEAEREGGPTERAKPPVRPGFGDI*
Ga0055490_101017591 | 3300004052 | Natural And Restored Wetlands | MTPKLMALAAVVVLTVVFLSPVAYMAGQRATVWLTAPAGEPTGPWPSVDEAEREPGPIERSRP
Ga0055498_100822212 | 3300004058 | Natural And Restored Wetlands | MTPKLMAVAAIAVLTVVFLSPVVYMAGQRASVWLTAPAAEPAGPWPSVDEAEREGGPTERAKPPVRPGFGNI*
Ga0062593_1001454472 | 3300004114 | Soil | MTPKLMALAAVVVLTVVFLSPVAYMAGQRATVWLTAPAEESTGPWPSVDDAEREQGPTERAKPSVRPGFGDI*
Ga0063356_1008837033 | 3300004463 | Arabidopsis Thaliana Rhizosphere | MTPKLMMLAAVVVLTVVFLSPVAYMAGQRASLWLIAPAGEANGPWPSVDEAERDAAPTERAKPSVRPGFGDI*
Ga0063356_1048325841 | 3300004463 | Arabidopsis Thaliana Rhizosphere | MTPKLMALAAVVVLTVVFLSPVAYMAGQRATVWLTAPDGEPTGPWPSGDEAEREQGPMERAK
Ga0066679_106680622 | 3300005176 | Soil | MNLLEWRLLSGFAQEGSMTPRRMALAAVLVLTVVFLSPIAYFAGQRAGVWLTTPADDRSGPWPSVDDGERDSSAPAERSKPAVRPGFGEI*
Ga0065705_107005101 | 3300005294 | Switchgrass Rhizosphere | MTPKLMALAAVVVLTVVFLSPAAYMAGQRATVWLIAPAGEPTGPWPSVDEAEREPGPTEGSKPPVRPGFGDI*
Ga0070680_1002691513 | 3300005336 | Corn Rhizosphere | MTPKLMALAAVVVLTVVFLSPVAYMAGQRASLWLIAPAGEANGPWPSVDEAERDAAPTERAKPSVRPGFGDI*
Ga0070691_100796042 | 3300005341 | Corn, Switchgrass And Miscanthus Rhizosphere | MTPRRMTLAAVLALTVVFLSPIVYLAGQRAGVWLTTPAAERGPWPSVDEGERDAGDMPERSKPAVRPGFGQI*
Ga0070703_100488071 | 3300005406 | Corn, Switchgrass And Miscanthus Rhizosphere | MTPRRMALAAVLVLTVVFLSPIAYLAGQRAGVWLSTPAAERSGPWPSVDEGERESAAPAEPAKLQVRPGFGEI*
Ga0070709_100939232 | 3300005434 | Corn, Switchgrass And Miscanthus Rhizosphere | MTPRRMALAAVLVLTVVFLSPIAYLAGQRAGVWLTTPADDRSGPWPSMDDGERDSSAPAERSKPAVRPGFGEI*
Ga0070705_1001624901 | 3300005440 | Corn, Switchgrass And Miscanthus Rhizosphere | MTPRRMALAAVLVLTVVFLSPIAYLAGQRAGVWLSTPAAERSGPWPSVDEGEHESAGPAERAKLQARPGFG
Ga0070694_1012317182 | 3300005444 | Corn, Switchgrass And Miscanthus Rhizosphere | MTPKLMALAAVVVLTVVFLSPVAYMAGQRATVWLIAPAGEPTGPWPSMDEAEREPGPTEGSKPPIRPGFGDI*
Ga0070708_1000422453 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | MTPRRMALAAVLVLTVVFLSPIIYLAGQRAGVWLSTPAAERSGPWPSVDEGEREAAAPAERAKPLARPGFGEI*
Ga0070708_1002835203 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | LTVVFLSPIAYLAGQRAGVWLSTPAAERSGPWPSVDEGERESAAPAEPAKLQVRPGFGEI
Ga0070681_110699022 | 3300005458 | Corn Rhizosphere | MTPRRMALAAALVLTVVFLSPIAYLAGQRAGVWLSTQAAERTGPWPSVDEGDREPAAPAERTKPQVRPGFGEI*
Ga0070706_1000712442 | 3300005467 | Corn, Switchgrass And Miscanthus Rhizosphere | MTPRRMALAAVLVLTVVFLSPIAYLAGQRAGVWLTMPADDRSGPWPSVDDGERDSSAPAERSKPAVRPGFGEI*
Ga0070707_1000318196 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | MTPRRMALAAVLVLTVVFLSPIAYLAGQRAGVWLTTPADDRSGPWPSVDDGERDSSAPAERSKPAVRPGFGEI*
Ga0070707_1005568092 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | MTPKLMALAAVVVLTVVFLSPVAYMAGQRATVWLIAPAGEPTGPWPSVDEAEREPGPTEGSKPPVRPGFGDI*
Ga0070697_1000234623 | 3300005536 | Corn, Switchgrass And Miscanthus Rhizosphere | MTPRRMTLAAVLALTVVFLSPIVYLAGQRAGVWLTTPAAERGPWPSVDEGERDAGDMPERSRPAVRPGFGQI*
Ga0070732_108491571 | 3300005542 | Surface Soil | MTPRRMALATVLVLTMVFLSPIAYLAGQRAGAWLTTPATERSGPWPSVDEGERDSAAPAEHSKPAVRPGFGEI*
Ga0070696_1002089332 | 3300005546 | Corn, Switchgrass And Miscanthus Rhizosphere | MTPRRMALAAVLVLTVVFLSPIAYLAGQRAGVWLSTQAAERTGPWPSVDEGDREPAAPAERTKPQVRPGFGEI*
Ga0066704_106179911 | 3300005557 | Soil | MTPRRMALAAVLVLTVVFLSPIIYLAGQRAGVWLSTPAAERSGPWPSVDEGEREAAAPAERAKPLARPGFGE
Ga0075298_10154782 | 3300005880 | Rice Paddy Soil | MTPRRMTLAAVLALTVVFLSPIVYLAGQRAAVWLTTPAAERGPWPSVEEGERDAGDMPERSKPAVRPGFGQI*
Ga0075023_1000362472 | 3300006041 | Watersheds | MTPRRMALAAVLVLTVVFLSPIAYLAGQRAGVWLSTPAAERSGPWPSVDEGERESAAPAERAKPQVRPGFGEI*
Ga0075024_1004901082 | 3300006047 | Watersheds | MTPRRMALATVLVLTVVFLSPIAYLAGQRAGAWLTTPAAERSGPWPSVDEGERDSAAPAEHSKPTVRPGFGEI*
Ga0075417_106081351 | 3300006049 | Populus Rhizosphere | MTPRRMALAAVLVLTVVFLSPIAYLAGQRAGVWLTTPTEDRSGPWPSVDDGERDSSAPAERSKPAVRPGFGEI*
Ga0079221_100648153 | 3300006804 | Agricultural Soil | MTPRRMALAAVLVLTVVFLSPIAYLAGQRAGVWLTTPAEDRSGPWPSVDDGERDSSAPAELSKPAVRPGFGAI*
Ga0079220_100630303 | 3300006806 | Agricultural Soil | MTPRRMALAAVLVLTVVFLSPIAYLAGQRAGVWLTTPAEDRSGPWPSVDDGERDSSAPAERSKPAVRSGFGEI*
Ga0075431_1015468632 | 3300006847 | Populus Rhizosphere | MTPKLMALAAVVVLTVVFLSPVAYMAGQRATVWLIAPAGEPTGPWPSGDEAEREPGPTEGTKPPVRPGFGDI*
Ga0075425_1003904412 | 3300006854 | Populus Rhizosphere | MTPRRMALAAVLVLTVVFLSPIAYLAGQRAGVWLTTPAEDRSGPWPSVDDGERDSSAPAERSKPAVRPGFGEI*
Ga0099791_1000014214 | 3300007255 | Vadose Zone Soil | MALAAVLVLTVVFLSPIAYLAGQRAGVWLSMPAAERSGSWPSVDEGERESAAPAERAKLQARPGFGEI*
Ga0099793_105735751 | 3300007258 | Vadose Zone Soil | MALAAVLVLTVVFLSPIAYLAGQRAGVWLSTPAAERSGPWPSVDEGERESAAPAERAKPQVRPGFGEI*
Ga0099829_100060314 | 3300009038 | Vadose Zone Soil | MTPKRMTLAAVVVLTVVFLSPVAYMAGQRASVWLTAPAAEPTGPWPSVDEAEREAGPTERSKPPVRPGFGDI*
Ga0105095_101115603 | 3300009053 | Freshwater Sediment | MTPKLMALAAVVVLTVVFLSPVAYMAGQRATVWLTAPDGEPTGPWPSGDEAEREQGPMERAKPQVRPGFGDI*
Ga0105106_113129802 | 3300009078 | Freshwater Sediment | MALALTLGQEEGPMTPKLVALAAVVVLTVVFLSPVAYMAGQRATVWLTAPAGEPTGPWPSVDEAEREPGPTERSRPAVRPGFGDI*
Ga0099792_100891011 | 3300009143 | Vadose Zone Soil | MALAAVLVLTVVFLSPIAYLAGQRAGVWLSTPAAERSGPWPSVDEGERESAAPAERAKPQ
Ga0114129_108395542 | 3300009147 | Populus Rhizosphere | MALAAVLVLTVVFLSPIAYLAGQRAGVWLTTPAEDRSGPWPSVDDGERDSSAPAERSKPAVRPGFGEI*
Ga0114129_122341772 | 3300009147 | Populus Rhizosphere | MALAAVLVLTVVFLSPIAYLAGQRAGVWLTTPADDRSGPWPSVDDGERDSSAPAERSKPAVRPGFGEI*
Ga0075423_100009463 | 3300009162 | Populus Rhizosphere | MALAAVLVLTVVFLSPIAYLAGQRAGVWLTTPTEDRSGPWPSVDDGERDSSAPAERSKPAVRPGFGEI*
Ga0105101_102892001 | 3300009171 | Freshwater Sediment | MTPKLMALAAVVVLTVVFLSPVAYMAGQRATVWLTAPDGEPTGPWPSGDEAGREQCPMERAKPEVRPGFGDI*
Ga0105241_109584272 | 3300009174 | Corn Rhizosphere | MAFAVTRGQEEGAMTPKLMALAAVVVLTVVFLSPVAYMAGQRATVWLTAPAEESTGPWPSVDDAEREQGPTERAKPSVRPGFGDI*
Ga0105241_125316481 | 3300009174 | Corn Rhizosphere | MTPKLMALAAVVVLTVVFLSPVAYMAGQRATVWLTAPDGEPAGPWPSGDEAEREQGPTERAKPPVRPGFGDI*
Ga0134122_105185942 | 3300010400 | Terrestrial Soil | MTPKLMALAAVVVLTVVFLSPVAYMAGQRASLWLIAPAGEPTGPWPSVDEAERDVAPTERSKPPVRPGFGDI*
Ga0134122_122478281 | 3300010400 | Terrestrial Soil | LMMLAAVVVLTVVFLSPVAYMAGQRASLWLIAPAGEGNGPWPSVDEAERDAAPTERAKPPVRPGFGDI*
Ga0134121_122096372 | 3300010401 | Terrestrial Soil | MALAAVLVLTVVFLSPIAYLAGQRAGVWLSAPAAERSGPWPSVDEGERESAAPAERAKLQARPGFGEI*
Ga0138514_1000402082 | 3300011003 | Soil | MALAAVLVLTVVFLSPIAYLAGQRAGVWLSTPAAERSGPWPSVDEGERESAAPAERGKPQVRPGFGEI*
Ga0105246_110354102 | 3300011119 | Miscanthus Rhizosphere | MALAAVLVLAVVFLSPIAYLAGQRAGVWLSTPAAERSGPWPSVDEGERESSAAPAERAKPQVRPGFGEI*
Ga0105246_115910351 | 3300011119 | Miscanthus Rhizosphere | MAFAVTRGQEEGAMTPKLMALAAAVVLTVVFLSPIAYMAGQRTTVWLTAPAGGGAGPWPSVDDAEREQGPTERAKPSVRPGFGDI*
Ga0137315_10214152 | 3300011395 | Soil | MTPKRMALAAAVVLTVVFLSPVAYMAGQRASVWLAAPAAEPAGPWPSVDEAEREPGPTERSKPPVRPGFGDI*
Ga0137443_11863412 | 3300011433 | Soil | PDGMALALTPGQEEGPMTPKLMALAAVVVLTVVFLSPVAYMAGQRATIWLTAPAGENGPWPSVDEAEREQGPTERAKPPVRPGFGDI*
Ga0137451_11159232 | 3300011438 | Soil | MTPKRMTLAAVVVLTVVFLSPVAYMAGQRATVWLTAPAAEPTGPWPSVDEAEREAGPTERSKPPVRPGFGDI*
Ga0137453_10898651 | 3300012034 | Soil | MALALTPGQEEGPMTPKLMALAAVVVLTVVFLSPVAYMAGQRASVWLAAPAAEPAGPWPSVDEAEREAGPTERSKPPVRPGFGDI*
Ga0137389_100840742 | 3300012096 | Vadose Zone Soil | MALAAVLVLTVVFLSPIAYLAGRRAGVWLSTPAAERSGPWPSVDEGERESAAPAERAKPQVRPGFGEI*
Ga0137338_10067422 | 3300012174 | Soil | MTPKLMALAAVVVLTVVFLSPVAYMAGQRTTEWLTAPAGEPTGPWPSVDEAERDAGPTERSRPPVRPGFGDI*
Ga0137363_100312125 | 3300012202 | Vadose Zone Soil | MALAAVLVLTVVFLSPIAYLAGQRAGVWLSAPAAERSGPWPSVDEGEREAAAPAERAKLQARPGFGEI*
Ga0137363_102177652 | 3300012202 | Vadose Zone Soil | MALAAVLVLTVVFLSPIIYLAGQRAGVWLSTPAAERSGPWPSVDEGEREAAAPAERAKPLARPGFGEI*
Ga0137434_10145412 | 3300012225 | Soil | MTPKRMTLAAVVVLTVVFLSPVAYMAGQRATIWLTAPRGENGPWPSVDEAEREAGPTERSKPPVRPGFGDI*
Ga0137447_10736811 | 3300012226 | Soil | RDIPDGMALALTPGQEEGPMTPKLMALAAVVVLTVVFLSPVAYMAGQRASVWLTAPAAEPTGPWPSVDEAEREAGPTERSKPPVRPGFGDI*
Ga0137369_103001343 | 3300012355 | Vadose Zone Soil | LMALAAVVVLTVVFLSPVAYMAGQRATVWLTAPAGEPTGPWPSVDEAEREPSPTERSKPPVRPGFGDI*
Ga0137375_1060758313300012360Vadose Zone SoilMTPKLMALAAVVVLTVVFLSPVAYMAGQRATVWLTAPAGEPTGPWPSVDEAEREPSPTERSKPPVRPGFGDI*
Ga0137361_1018733123300012362Vadose Zone SoilMALAAVLVLTVVFLSPIAYLAGQRAGVWLSTPAAERSGPWPSVDEGEREAAAPAERAKPLARPGFGEI*
Ga0137397_1001399513300012685Vadose Zone SoilMALALIPGQEEGPMTPKLMALAAVVVLTVVFLSPVAYMAGQRATLWLIAPAGEPTGPWPSVDEAEREPGPTERSKPPVRPGFGDI*
Ga0137359_1159055913300012923Vadose Zone SoilMALAAVLVLTVVFLSPIAYLAGQRAGAWLSTPAAERSGPWPSVDEGEREAAAPAERAKLQARPG
Ga0153915_1034545113300012931Freshwater WetlandsMTLAAALALTVVFLSPIVYLAGQRAGVWLTTPAAERGPWPGVDEGERDAGDMPERSKPAVRPGFGQI*
Ga0153915_1240429823300012931Freshwater WetlandsMALAAVLVLTMVFLSPIVYLAGQRAGVWLTTPATERGPWASGDEGERDAGDMPEPSKPAVRPGFGQI*
Ga0137410_1001286573300012944Vadose Zone SoilMTPKLMALAAVVVLTVVFLSPVAYMAGQRATVWLTAPAEESTGPWPSVDEAEREQGPTERAKPVVRPGFGDI*
Ga0164303_1036154013300012957SoilMALAAVLVLTVVFLSPIAYLAGQRAGVWLSTPAAERSGPWPSVDEGEHESAGPAERAKLQARPGFGEI*
Ga0075301_108257513300014262Natural And Restored WetlandsMALALLPQQEEGPMTPKLMAVAAIAVLTVVFLSPVAYMAGQRASVWLTAPAAEPAGPWPSVDEAEREGGPTERAK
Ga0075354_100625623300014308Natural And Restored WetlandsMTPKLMAVAAIAVLTVVFLSPVAYMAGQRASVWLTAPAAEPAGPWPSVDEAEREGGPTERAKPPVRPGFGNI*
Ga0180080_104910023300014870SoilEWRLLGATGQEEGPMTPKLMALAAVVVLTVVFLSPVAYMAGQRASVWLAAPAAEPAGPWPSVDEAEREPGPTERSKPPVRPGFGDI*
Ga0180094_107908823300014881SoilMTPKLMALAAVVVLTVVFLSPVAYMAGQRASVWLTAPAAEPTGPWPSVDEAEREPGPTERSRPPVRPGFGDI*
Ga0180069_106852613300014882SoilMTSKRMALAAVVVLTVVFLSPVAYMAGQRASVWLAAPAAEPAGPWPSVDEAEREPGPTERSKPPVRPGFGDI*
Ga0180104_100423813300014884SoilMTPKLMALAAVVVLTVVFLSPVAYMAVQRATEWLTAPAGEPTGPWPSVDEAERDAGPTERSRPPVRPGFGDI*
Ga0180063_103888233300014885SoilMTPKLMALAAVVVLTVVFLSPVAYMAGQRATVWLTVPAGEPTGPWPSVDAAEREHGPMERVKPPVRSGFGDI*
Ga0180089_105701923300015254SoilMTPKRMTLAAVVVLTVVFLSPVAYMAGQRASVWLAAPAAEPAGPWPSVDEAEREPGPTERSKPPVRPGFGDI*
Ga0180085_105679623300015259SoilMTPKLMALAAVVVLTVVFLSPVAYMAGQRATEWLTAPAGEPTGPWPSVDEAERDAGPTERSRPPVRPGFGDI*
Ga0137403_1018395513300015264Vadose Zone SoilMTPKLMALAAVVVLTVVFLSPVAYMAGQRATVWLIAPAGEPTGPWPSVDEAEREPGPTEGSKPPVRPGF
Ga0132258_1360187723300015371Arabidopsis RhizosphereMTLAAVLALTVVFLSPIVYLAGQRAGVWLTTPAAERGPWPSVDEGERDAGDVPERSKPAVRPGFGQI*
Ga0132256_10364983213300015372Arabidopsis RhizosphereMTLAAVLALTVVFLSPIVYLAGQRAGVWLTTPAAERGPWPSVDEGERDAGDAPERSKPAVRPGFGQI*
Ga0187824_1000198333300017927Freshwater SedimentMTPRRMALAAVLVLTVVFLSPIAYLAGQRAGAWLTTPATERSGPWPSVDEGERDSAAPAEHSKPTARPGFGEI
Ga0187825_1002007533300017930Freshwater SedimentMTPRRMALAAVLVLTVVFLSPIAYLAGQRAGAWLTTPATERSGPWPSVDEGERDSAAPAEHSKPAARPGFGEI
Ga0187823_1002321533300017993Freshwater SedimentMTPRRMALAAVLVLTVVFLSLIAYLAGQRAGAWLTTPATERSGPWPSVDEGERDSAAPAEHSKPTARPGFGEI
Ga0187822_1003952523300017994Freshwater SedimentMTPRRMALAAVLVLTVVFLSPIAYLAGQRAGAWLTTPATERSGPWPSVDEGERDSAAPA
Ga0184610_100063763300017997Groundwater SedimentMTPKRMALAAVVVLTVVFLSPAAYMAGQRASVWLTAPAAEPTGPWPSVDEAEREAGPTERSKPPVRPGFGDI
Ga0184604_1009701833300018000Groundwater SedimentMTPKLMALAAVVVLTVVFLSPVAYMAGQRATLWLIAPAGEPTGPWPSVDEAEREPGPTEGSKAPVRPGFGDI
Ga0184605_1032822323300018027Groundwater SedimentMTPKLMALAAVVVLTVLFLSPVAYMAGQRATLWLIAPTGEPTGPWPSVDEAEREPDPTER
Ga0184634_1011875023300018031Groundwater SedimentMTPKRMALAAVVVLTVVFLTPVAYMVGQRASVWLTAPAAEPAGPWPSVDEAEREPGPTERSKPPVRPGFGDI
Ga0184620_1022980213300018051Groundwater SedimentMTPKLMALAAVVVLTVVFLSPVAYMAGHRATLWLIAPAGESTGPWPSVDEAEREPGPTERSKPPVRPGFGDI
Ga0184638_101769033300018052Groundwater SedimentMTPKRMTLAAVVVLTVVFLSPVAYMAGQRASVWLTAPAAEPTGPWPSVDEAEREAGPTERSKPPVRPGFGDI
Ga0184626_1001786133300018053Groundwater SedimentMTPKRMTLAAVVVLTVVFLSPAAYMAGQRASVWLTAPAAEPTGPWPSVDEAEREAGPTERSKPPVRPGFGDI
Ga0184621_1031507123300018054Groundwater SedimentMTPKRMALAAVVVLTVVFLSPVAYMAGQRASVWLTAPAAEPTGPWPSVDEAEREAGPTERSKPPVRPGFGDI
Ga0184615_1027844813300018059Groundwater SedimentEEGPMTPKRMTLAAVVVLTVVFLSPVAYLAGQRASVWLTAPAAEPAGPWPSVDEAEREPGPTERSKPPVRPGFGDI
Ga0184640_1006446433300018074Groundwater SedimentMTPKRMALAAVVVLTVVFLSPVAYMAGQRASVWLTAPAAEPTGPWPSVDEAEREPGPTERSKPPVRPGFGDI
Ga0184632_1005960533300018075Groundwater SedimentMTPKRMTLAAVVVLTVVFLSPVAYMAGQRASVWLTAPAAEPTGPWPSVDEAEREAGPTERSKPPVRPGFGDL
Ga0184612_1001160943300018078Groundwater SedimentMTPKRMTLAAVVVLTVVFLSPVAYMAGQRASVWLTAPAVEPTGPWPSVDEAEREAGPTERSKPPVRPGFGDI
Ga0184627_1007052323300018079Groundwater SedimentMTPKRMALAAVVVLTVVFLTPVAYMAGQSASVWLTAPAAEPAGPWPSVDEAEREPGPTERSKPPVRPGFGDI
Ga0190265_1001052983300018422SoilMTPKLMALAAAVVLTVVFLSPAAFMAGQRASVWLTSPGAEPTGPWPSVDEAEREPGPTERAKPPVRPGFGDI
Ga0190265_1002200533300018422SoilMTPKLVALAAVVVLTVVFLSPIAYLAGQRAGVWLTTPAVERTGPWPSLDEEPESAPGDRSRAPARPGFGDI
Ga0190265_1002764023300018422SoilMTPKRMALAAVVVLTVVFLSPVVYMAGQRATVWLTAPGAEPTGPWPSVDEAEREPAPTERAKPPVRPGFGDI
Ga0190265_1073178023300018422SoilMTPKLLALAAVVVLTVVFLSPVAYMAGQRTSLWLTAPHGEPAGPWPSVDDPEREQGPMERAKTPALPGVGDI
Ga0190265_1095232613300018422SoilLLRGVHTRKLLENMMIPDGMALALTRGQEEGPMTPKLMTLAAVVVLTVVFLSPVAFMAGQRATVWLTAPAGEGNGPWPSVDEAEREQGPTERAKPPVRPGFGDI
Ga0190272_1027745113300018429SoilMTPKRMTLAAVVVLTVVFLSPVAYMAGQRASVWLTAPAAEPTGPWPSVDEAEREAGPTERSKPPVRPGF
Ga0066662_1192589423300018468Grasslands SoilLAAVLVLTVVFLSPIIYLAGQRAGVWLSTPAAERSGPWPSVDEGEREAAAPAERAKPLARPGFGEI
Ga0184646_161261713300019259Groundwater SedimentNRAVSPGMALALLPGQEEGPMTPKRMTLAAVVVLTVVFLSPVAYMAGQRASVWLTAPAAEPTGPWPSVDEAEREAGPTERSKPPVRPGFGDI
Ga0187894_1012182723300019360Microbial Mat On RocksMTPKLMALAAVVVLTVVFLSPIAYMAGQRASVWLAAPAAEPAGPWPSVDEAEREPGPTERSKPPVRPGFGDI
Ga0187892_1013723813300019458Bio-OozeMTPKLMALAAAVVLTVVFLSPIAFLAGQRASAWLAEEMALRTGPVPSQEEVERGPGPADRFKPAVRPGFGDI
Ga0193723_101415823300019879SoilMAFALNPGQEEGPMTPKLMALAAVVVLTVVFLSPLAYVAGQRATVWLTAPAGESTGPWPSVDEAEREQGPTERAKPSVRPGFGDI
Ga0193723_114656413300019879SoilMTPRRMALAAVLVLTVVFLSPIAYLAGQRAGVWLSTPAAERSGPWPSVDEGERDSAAPAERAKPQVRPGFGEI
Ga0193712_101741013300019880SoilMAFALNPGQEEGPMTPKLMALAAVVVLTVVFLSPLAYMAGQRATVWLTAPAGESTGPWPSVDDAEREQGPTERAKPPVLAGFGDI
Ga0193707_104245423300019881SoilMTPKLIALAAVVVLTVVFLSPVAYMAGQRATLWLIAPPGESTGPWPSVDEAEREPGPTERSKPPVRPGFGDI
Ga0193707_117406713300019881SoilMTPRRMALAAVLVLTVVFLSPIAYLAGQRAGVWLSTPAAERSGPWPSVDEGERDSAAPAERAKPQVRPGFG
Ga0193725_103133533300019883SoilNPLENRAILPGMAFALNPGQEEGPMTPKLMALAAVVVLTVVFLSPLAYMAGQRATVWLTAPAGESTGPWPSVDDAEREQGPTERAKPPVLAGFGDI
Ga0193725_103360433300019883SoilDGMALALIPGQEEGPMTPKLMALAAVVVLTVVFLSPVAYMAGQRATLWLIAPAGESTGPWPSVDEAEREPGPTERSKPLVRPGFGDI
Ga0193727_106948823300019886SoilMTPKLMALAAVVVLTVVFLSPVAYMAGQRATLWLIAPAGESTGPWPSVDEAEREPGPTERSKPLVRPGFGDI
Ga0193730_103755823300020002SoilMAFALNPGQEEGPMTPKLMALAAVVVLTVVFLSPVAYMAGQRATLWLIAPAGESTGPWPSVDEAEREPGPTERSKPPVRPGFGDI
Ga0193755_102731223300020004SoilMTPKLMALAAVVVLTVVFLSPVAYMAGQRATLWLIAPAGESTGPWPSVDEAEREPGPTERSKPPVRPGFGDI
Ga0193726_102050623300020021SoilMTPRRMALAAVLVLTVVFLSPIAYLAGQRAGVWLSAPAAERSGPWPSVDEGERESAAPAERAKPQVRPGFGEI
Ga0193717_116932023300020060SoilMTPKLMALAAVVVLTVVFLSPVAYMAGQRATVWLTAPAGEPTGPWPSVDEAEREQGPTERIKPPVRPGFGDI
Ga0193716_114287223300020061SoilMTPKLMALAAVVVLTVVFLSPVAYMAGQRATSWLTQPAGEPTGPWPSVDEAEREQGPTERAKPPVRPGFGDI
Ga0179596_1020557723300021086Vadose Zone SoilMALAAVLVLTVVFLSPIAYLAGQRAGVWLSTPAAERSGPWPSVDEGERESAAPAERAKPQVRPGFGEI
Ga0210377_1001906963300021090Groundwater SedimentMTSKRMALAAVVVLTVVFLSPVAYMAGQRASVWLTAPAAEPAGPWPSVDEAEREPGPTERSKPPVRPGFGDI
Ga0210406_1138897523300021168SoilAVLVLTVVFLSPIAYFAGQRAGVWLTTPADDRSGPWPSVDDGERDSSAPAERSKPAVRPGFGEI
Ga0210384_1000481453300021432SoilMNLLEWRLLSGFAQEVSMTPRRMALAAVLVLTVVFLSPIAYFAGQRAGVWLTTPADDRSGPWPSVDDGERDSSAPAERSKPAVRPGFGEI
Ga0210384_1170003813300021432SoilGFAQEGSMTPRRMALAAVLVLTVVFLSPIAYFAGQRAGVWLTTPADDRSGPWPSVDDGERDSSAPAERSKPAVRPGFGEI
Ga0210409_1099892513300021559SoilMTPRRMALAAVLVLTVVFLSPIAYLAGQRAGVWLSTPAAERSGPWPSVDEGERESAAPAERAKLQARPGFGEI
Ga0209108_1003016923300025165SoilMTVKRMALAAVVVLIAVFLSPLAYLAGQRASVWLTTPAAERTGPWPSQDEAEPESGAGDRFKPAVRPGFGDI
Ga0209640_1006445633300025324SoilMTPKLMALAAVVVLTVVFLSPVAYMAGQRATVWLTAPAGEPTGPWPSVDEAERDAGPTERSRPPVRPGFGDI
Ga0207423_100313723300025535Natural And Restored WetlandsMTPKLMAVAAIAVLTVVFLSPVAYMAGQRASVWLTAPAAEPAGPWPSVDEAEREGGPTERAKPPVRPGFGDI
Ga0207647_1057326123300025904Corn RhizosphereMTPKLMALAAVVVLTVVFLSPVAYMAGQRATVWLTAPAEESTGPWPSVDDAEREQGPTERAKPSVRPGFGDI
Ga0207684_1009283623300025910Corn, Switchgrass And Miscanthus RhizosphereMTPRRMALAAVLVLTVVFLSPIAYLAGQRAGVWLTMPADDRSGPWPSVDDGERDSSAPAERSKPAVRPGFGEI
Ga0207684_1015039013300025910Corn, Switchgrass And Miscanthus RhizosphereMTPRRMALAAVLVLTVVFLSPIVYLAGQRAGVWLSTPAAERSGPWPSVDEGEREAAAPAERAKPLARPGFGEI
Ga0207684_1055126223300025910Corn, Switchgrass And Miscanthus RhizosphereVLVLTVVFLSPIAYLAGQRAGVWLSTPAAERSGPWPSVDEGEHESAGPAERAKLQARPGFGEI
Ga0207654_1044313323300025911Corn RhizosphereMTPKLMALAAVVVLTVVFLSPVAYMAGQRATVWLTAPDGEPAGPWPSGDEAEREQGPTERAKPPVRPGFGDI
Ga0207707_1028502213300025912Corn RhizosphereMTPKLMMLAAVVVLTVVFLSPVAYMAGQRASLWLIAPAGEANGPWPSVDEAERDAAPTERAKPSV
Ga0207707_1086456223300025912Corn RhizosphereMTPRRMALAAALVLTVVFLSPIAYLAGQRAGVWLSTQAAERTGPWPSVDEGDREPAAPAERTKPQVRPGFGEI
Ga0207646_1024101323300025922Corn, Switchgrass And Miscanthus RhizosphereMTPRRMALAAVLVLTVVFLSPIAYLAGQRAGVWLSTSAAERSGPWPSVDEGEREAAAPAERAKPLARPGFGEI
Ga0207646_1034572023300025922Corn, Switchgrass And Miscanthus RhizosphereMTPKLMALAAVVVLTVVFLSPVAYMAGQRATVWLIAPAGEPTGPWPSVDEAEREPGPTEGSKPPVRPGFGDI
Ga0207686_1032451323300025934Miscanthus RhizosphereMTPKLMALAAVVVLTVVFLSPVAYMAGQRATVWLTAPAEESTGPWPSVDDAEREQGPTERAKPSVRPGFGD
Ga0210090_101971423300025965Natural And Restored WetlandsMTPKLMAVAAIAVLTVVFLSPVVYMAGQRASVWLTAPAAEPAGPWPSVDEAEREGGPTERAKPPVRPGFGNI
Ga0210102_107162013300025971Natural And Restored WetlandsMALALTHGQEEGPMTPKLMALAAVVVLTVVFLSPVAYMAGQRATVWLTAPAGEPTGPWPSVDEAEREPGPIERSRP
Ga0208775_100766223300025992Rice Paddy SoilMTPRRMTLAAVLALTVVFLSPIVYLAGQRAGVWLTTPAAERGPWPSVDEGERDAGDMPERSKPAVRPGFGQI
Ga0208286_100369523300026015Rice Paddy SoilMTPRRMTLAAVLALTVVFLSPIVYLAGQRAGVWLTTPAAERGPWPSVDEGERDAGDMPERSRPAVRPGFGQI
Ga0208778_101101113300026025Rice Paddy SoilMTPRRMTLAAVLALTVVFLSPIVYLAGQRAGVWLTTPAAERGPWPSVDEGERDAGDMPERSKP
Ga0209438_1000571113300026285Grasslands SoilMTPRRMALAAVLVLTVVFLSPIAYLAGQRAGVWLSMPAAERSGSWPSVDEGERESAAPAERAKLQARPGFGEI
Ga0209438_101305533300026285Grasslands SoilMTPKLMALAAVVVLTVVFLSPVAYMAGQRATVWLTAPAEESTGPWPSVDEAEREQGPTERAKPAVRPGFGDI
Ga0257166_105550223300026358SoilMTPKRMTLAAVVVLTVVFLSPVAYMAGQRASVWLTAPAAEPTGPWPSVDEAEPEAGPTERSKPPVRPGFGDI
Ga0257163_102425623300026359SoilMTPRRMALAAVLVLTVVFLSPIAYLAGQRAGVWLSTPAAERSGPWPSVDEGERESAAPAERTKPQVRPGF
Ga0257179_103563213300026371SoilMTPRRMALAAVLVLTVVFLSPIAYLAGQRAGVWLSTPAAERSGPWPSVDEGERESAAPAERAKPQVRPGFGEI
Ga0257157_107103123300026496SoilMTPRRMALAAVLILTVVFLSPIAYLAGQRAGVWLSTPAAERSGPWPSVDDGERESAAPA
Ga0257158_100410223300026515SoilMTPRRMALAAVLVLTVVFLSPIAYLAGQRAGVWLSAPAAERSGPWPSVDEGERESAAPAERVKPQVRPGFGEI
Ga0256867_1002090633300026535SoilMERPLLSSEQEEGPMTPKLIALAAVAVLTVVFLSPVAYMAGQRASVWLTTPATERTGPWPSLDEAEREPTERSTPPVRPGFGDI
Ga0179593_120340923300026555Vadose Zone SoilMTPRRMALAAVLVLTVVFLSPIAYLAGQRAGVWLSAPAAERSGPWPSVDEGEREAAAPAERAKLQARPGFGEI
Ga0179587_1026076523300026557Vadose Zone SoilRMALAAVLVLTVVFLSPIAYLAGQRAGVWLSTPAAERSGPWPSVDDGERESAAPAERAKPQVRPGFGEI
Ga0179587_1073582513300026557Vadose Zone SoilMTPKLMALAAVVVLTVVFLSPVAYMAGQRATLWLIAPAGEPTGPWPSVDEAEREPGPTERSKPPVRPGFGDI
Ga0256866_106653023300027650SoilMERPLLSSEQEEGPMTPKLIALAAVAVLTVVFLSPVAYMAGQRASVWLTTPATERTGPWPSLDEAEREPSERSTPPVRPGFGDI
Ga0209217_119424613300027651Forest SoilMTPRRMALAALLVLTVVFLSPIAYFAGQRAGVWLTTPADDRSGPWPSVDDGERDSSAPAERSK
Ga0209073_1006447323300027765Agricultural SoilMTPRRMALAAVLVLTVVFLSPIAYLAGQRAGVWLTTPAEDRSGPWPSVDDGERDSSAPAERSKPAVRPGFGEI
Ga0209073_1047132413300027765Agricultural SoilMTPRRMALAAVLVLTVVFLSPIAYLAGQRAGAWLTTPATERSGPWPSVDEGERDSAAPAEHSK
(restricted) Ga0233416_1009261123300027799SedimentMTPKRMVLAAVLVLTVVFLSPLAYLAGQRASVWLTTLAAEQAGPLPSVDETERDPGPIDRSRPPVRPGVGDI
Ga0209726_1002956343300027815GroundwaterMTSKRMALAAVVVLTVVFLSPVAYMAGQRASVWLTAPAAEPTGPWPSVDEAEREPGPTERSKPPVRPGFGDI
Ga0209701_1004643313300027862Vadose Zone SoilMTLAAVVVLTVVFLSPVAYMAGQRASVWLTAPAAEPTGPWPSVDEAEREAGPTERSKPPVRPGFGDI
Ga0209526_1058410823300028047Forest SoilMTPRRMALAALLVLTVVFLSPIAYFAGQRAGVWLTTPADDRSGPWPSVDDGERDSSAPAERSKPAVRPGFGEI
Ga0268264_1043972113300028381Switchgrass RhizosphereTPGQEEGPMTPKLMALAAVVVLTVVFLSPVAYMAGQRATVWLTAPAEESTGPWPSVDDAEREQGPTERAKPSVRPGFGDI
Ga0257175_104939523300028673SoilMTPRRMALAAVLVLTVVFLSPIAYLAGQRAGVWLSMPAAERSGSWPSVDEGERESAAPAERAKLQARPGF
Ga0247825_1042021613300028812SoilMAFAVTRGQEEGAMTPKLMALAAAVVLTVVFLSPIAYMAGQRTTVWLTAPAGDGAGPWPSVDDAEREQGPTEGAKPLARPGFGDI
Ga0307296_1021057423300028819SoilMTPKLMALAAVVVLTVVFLSPVAYMAGQRATLWLIAPAGESTGPWPSVDEAEREPGPTEGSKPPVRPGFGDI
Ga0307312_1105482423300028828SoilMTPRRMALAAVLVLTVVFLSPIAYLAGQRAGVWLSTPAAERSGPWPSVDEGERESAAPADRAKPQV
Ga0299907_1019160633300030006SoilKSFPMERPLLSSEQEEGPMTPKLIALAAVAVLTVVFLSPVAYMAGQRASVWLTTPATERTGPWPSLDEAEREPSERSTPPVRPGFGDI
(restricted) Ga0255311_100690313300031150Sandy SoilMALALTLGQEEGPMTPKLMALAAVVVLTVVFLSPVAYMAGQRATVWLTAPAGEPTGPWPSVDEAEREPGPTERSRPAVRPGFGDI
(restricted) Ga0255310_1001092033300031197Sandy SoilMTPKLMALAAAVVLTVVFLSPIAYMAGQRTTGWLTAPAGEGTGPWPSVDEAEREQGPTERAKPPVRPGFGDI
(restricted) Ga0255310_1006144013300031197Sandy SoilMTPRRMALAAVLVLTVVFLSPIAYLAGQRAGAWLTTPAAERSGPWPSVDEGERDSAAPAEHSKPAVRPGFGEI
(restricted) Ga0255310_1017055713300031197Sandy SoilTPKLMALAAAVVLTVVFLSPIAYMAGQRTTVWLTAPAGEGTGPWPSVDEAEREQGPAEHAKPPVRPGFGDI
(restricted) Ga0255312_107466923300031248Sandy SoilMEWRLLGATGQEEGPMTPKLMALAAVVVLTVVFLSPVAYMAGQRASVWLTAPAAEPTGPWPSVDEAEREPGPTERSKPPVRPGFGDI
(restricted) Ga0255312_109148223300031248Sandy SoilMEWGLLSTPGQEEGPMTPKLLALAAVVVLTVVFLSPVAYMAGQRATVWLTEPAGEPNGPWPSVDEAEREQGPTEHAKPPVRPGFGDI
Ga0307469_1002321243300031720Hardwood Forest SoilMTPRRMALAAVLVLTVVFLSPIAYLAGQRAGVWLSTPAAERSGPWPSVDEGEHESAGPAERAKLQARPGFGEI
Ga0307469_1015573023300031720Hardwood Forest SoilMTPKLMALAAVVVLTVVFLSPVAYMAGQRASLWLIAPAGEPTGPWPSVDEAERDVAPTERSKPPVRPGFGDI
Ga0307469_1019450743300031720Hardwood Forest SoilMTPKLMALAAVAVLTVVFLSPVAYMAGQRATVWLTAPAGESTGPWPSVDDAEREQGPTERDKPPVRAGFGDI
Ga0307469_1216593613300031720Hardwood Forest SoilMTPKLMALAAVVVLTVVFLSPVAYMAGQRTSIWLTAPAEEPAGPWPSVDEAEREQGPTERAKPPVRPGFGDI
Ga0307475_1059232423300031754Hardwood Forest SoilMTPRRMALAAVLVLTVVFLSPIAYLAGQRAGVWLTMPADDGSGPWPSVDDGERDSSAPAERSKPAVRPGFGEI
Ga0307470_1059063613300032174Hardwood Forest SoilVLVLTVVFLSPIAYLAGQRAGVWLTTPADDRSGPWPSMDDGERDSSAPAERSKPAVRPGFGEI
Ga0307471_10004834343300032180Hardwood Forest SoilMTPKLMALAAVVVLTVVFLSPVAYMAGQRATVWLTAPAGESTGPWPSVDDAEREQGPTERDKPPVRAGFGDI
Ga0307471_10090699713300032180Hardwood Forest SoilLVLTVVFLSPIAYLAGQRAGVWLSTPAAERSGPWPSVDEGERESAAPAEPAKLQVRPGFGEI
Ga0307472_10078192013300032205Hardwood Forest SoilLLEWRLLSGFAQEGSMTPRRMALAAVLVLTVVFLSPIAYFAGQRAGVWLTTPADDRSGPWPSVDDGERDSSAPAERSKPAVRPGFGEI
Ga0310810_10002595163300033412SoilMTPKRMTLAAVLVLTVVFLSPIVYLAGQRAGIWLTTAAAERGPWPSVDEGERDAGDSPERAKPAVRPGFGLI
Ga0326729_100160743300033432Peat SoilMTPKRMTLAAVLILTVVFLSPIVYLAGQRAGIWLTTAAAERGPWPSVDEGERDAGDTPERSKPVVRPGFGQI
Ga0326726_1063474213300033433Peat SoilMTPKRMALAAVLVLTMVFLSPIVYLAGQRASVWLTTPATERGPWASGDEGERDAGDMPERSKPAVRPGFGQI
Ga0326731_102195323300033502Peat SoilMTPKRMALAAVLVLTMVFLSPIVYLAGQRAGVWLTTPATERGPWASGDEGERDAGDMPERSKPAVRPGFGQI
Ga0316628_10009993933300033513SoilMTPKRMALAAVLVLTMVFLSPIVYLAGQRAGVWLTTPATERGPWASGDEGERDAGDMPEPSKPAVRPGFGQI
Ga0316628_10013085413300033513SoilMTPRRMTLAAVLALTVVFLSPIVYLAGQRAGVWLTTPAAERGPWPGVDEGERDAGDMPERSKP
Ga0326723_0001492_2725_29283300034090Peat SoilMALAAVLVLTMVFLSPIVYLAGQRASVWLTTPATERGPWASGDEGERDAGDMPERSKPAVRPGFGQI
Ga0326723_0050418_1090_12933300034090Peat SoilMTLAAVLILTVVFLSPIVYLAGQRAGIWLTTAAAERGPWPSVDEGERDAGDTPERSKPVVRPGFGQI
Ga0364940_0041260_521_7393300034164SedimentMTPKRMTLAAVVVLTVVFLSPVAYLAGQRASVWLTAPAAEPTGPWPSVDEAEREAGPTERSKPPVRPGFGDI
Ga0370495_0263504_208_4623300034257Untreated Peat SoilMALALTPGQEEGPMTPKLMALAAVVVLTVVFLSPVAYMAGQRATIWLTAPAGENGPWPSVDEAEREQGPTERVKPPVRPGFGDI




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice