NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F020434

Overview

Basic Information
Family ID F020434
Family Type Metagenome / Metatranscriptome
Number of Sequences 224
Average Sequence Length 83 residues
Representative Sequence VDRLDAIRLLKALVAAGANSRAPMDSAWVHKIAVRDMSLQGTALASALAYAEGEGWLADSPRKGWISLTRMGEVVAKVN
Number of Associated Samples 97
Number of Associated Scaffolds 224

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 24.55 %
% of genes near scaffold ends (potentially truncated) 58.04 %
% of genes from short scaffolds (< 2000 bps) 91.96 %
Associated GOLD sequencing projects 81
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.86

[Figure: Hidden Markov Model logo, rendered with Skylign]

Most Common Taxonomy
Group Unclassified (83.036 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil (35.268 % of family members)
Environment Ontology (ENVO) Unclassified (59.821 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (62.500 % of family members)




Multiple Sequence Alignments


Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 44.86%, β-sheet: 10.28%, Coil/Unstructured: 44.86%

Predicted 3D Structure

[Figure: predicted 3D structure in the PDBe Molstar viewer, colored by per-residue confidence (pLDDT) in bins 0-50, 51-70, 71-90, 91-100]
pTM-score: 0.86

Structural matches with SCOPe domains

SCOP family | SCOP domain | Representative PDB | TM-score
e.11.1.0: automated matches | d3ilwa_ | 3ilw | 0.74059
a.4.5.48: F93-like | d1tbxa1 | 1tbx | 0.72945
a.4.5.48: F93-like | d2co5a1 | 2co5 | 0.70861
a.4.5.12: Restriction endonuclease FokI, N-terminal (recognition) domain | d2foka1 | 2fok | 0.70559
a.4.5.0: automated matches | d6pcoa_ | 6pco | 0.69817



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 224 Family Scaffolds
PF00589 | Phage_integrase | 3.12
PF04392 | ABC_sub_bind | 2.23
PF10263 | SprT-like | 0.89
PF01520 | Amidase_3 | 0.89
PF13847 | Methyltransf_31 | 0.89
PF00144 | Beta-lactamase | 0.45
PF06155 | GBBH-like_N | 0.45
PF00085 | Thioredoxin | 0.45
PF00216 | Bac_DNA_binding | 0.45
PF13827 | DUF4189 | 0.45
PF01391 | Collagen | 0.45
PF13426 | PAS_9 | 0.45
PF04851 | ResIII | 0.45
PF00239 | Resolvase | 0.45
PF00188 | CAP | 0.45
PF13481 | AAA_25 | 0.45
PF14659 | Phage_int_SAM_3 | 0.45
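
The Pfam frequencies above are given as percentages of the 224 family scaffolds. A minimal Python sketch follows for converting them back to approximate scaffold counts. It assumes each percentage is a scaffold count divided by 224 and reported to two decimals, which the page does not state explicitly. `percent_to_count` is a hypothetical helper, not part of NMPFamsDB.

```python
# Total taken from "Number of Associated Scaffolds" in Basic Information.
FAMILY_SCAFFOLDS = 224


def percent_to_count(percent: float, total: int = FAMILY_SCAFFOLDS) -> int:
    """Estimate how many scaffolds correspond to a reported percentage.

    Assumption: percent == 100 * count / total, rounded or truncated
    to two decimals by the website.
    """
    return round(percent / 100 * total)


# A few rows copied from the neighboring Pfam domain table.
pfam_freq = {
    "PF00589": 3.12,  # Phage_integrase
    "PF04392": 2.23,  # ABC_sub_bind
    "PF10263": 0.89,  # SprT-like
    "PF00144": 0.45,  # Beta-lactamase
}

counts = {pfam: percent_to_count(pct) for pfam, pct in pfam_freq.items()}
print(counts)  # {'PF00589': 7, 'PF04392': 5, 'PF10263': 2, 'PF00144': 1}
```

Under this assumption the most frequent neighbor, Phage_integrase, occurs on about 7 of the 224 scaffolds, and every 0.45 % entry corresponds to a single scaffold.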

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 224 Family Scaffolds
COG2984 | ABC-type uncharacterized transport system, periplasmic component | General function prediction only [R] | 2.23
COG0860 | N-acetylmuramoyl-L-alanine amidase | Cell wall/membrane/envelope biogenesis [M] | 0.89
COG0776 | Bacterial nucleoid DNA-binding protein IHF-alpha | Replication, recombination and repair [L] | 0.45
COG1680 | CubicO group peptidase, beta-lactamase class C family | Defense mechanisms [V] | 0.45
COG1686 | D-alanyl-D-alanine carboxypeptidase | Cell wall/membrane/envelope biogenesis [M] | 0.45
COG1961 | Site-specific DNA recombinase SpoIVCA/DNA invertase PinE | Replication, recombination and repair [L] | 0.45
COG2340 | Spore germination protein YkwD and related proteins with CAP (CSP/antigen 5/PR1) domain | Cell cycle control, cell division, chromosome partitioning [D] | 0.45
COG2367 | Beta-lactamase class A | Defense mechanisms [V] | 0.45
COG2452 | Predicted site-specific integrase-resolvase | Mobilome: prophages, transposons [X] | 0.45
COG3536 | Uncharacterized conserved protein, DUF971 family | Function unknown [S] | 0.45



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 83.04 %
All Organisms | root | All Organisms | 16.96 %


Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300000597|AF_2010_repII_A1DRAFT_10128107 | Not Available | 617
3300000597|AF_2010_repII_A1DRAFT_10133583 | Not Available | 602
3300003505|JGIcombinedJ51221_10463299 | Not Available | 512
3300004267|Ga0066396_10103278 | Not Available | 529
3300005332|Ga0066388_100464292 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 1906
3300005332|Ga0066388_101764946 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1098
3300005332|Ga0066388_102977667 | Not Available | 866
3300005332|Ga0066388_105138844 | Not Available | 664
3300005332|Ga0066388_107144543 | Not Available | 561
3300005437|Ga0070710_10890915 | Not Available | 642
3300005439|Ga0070711_100101933 | Not Available | 2090
3300005439|Ga0070711_100378710 | Not Available | 1144
3300005536|Ga0070697_100635816 | Not Available | 939
3300005764|Ga0066903_100225152 | All Organisms → cellular organisms → Bacteria | 2850
3300005764|Ga0066903_101463494 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1287
3300005764|Ga0066903_102872203 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 934
3300005764|Ga0066903_105860638 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae | 645
3300005764|Ga0066903_105901120 | Not Available | 642
3300005764|Ga0066903_107250672 | Not Available | 573
3300005764|Ga0066903_108691329 | Not Available | 516
3300005764|Ga0066903_108798313 | All Organisms → cellular organisms → Bacteria → Proteobacteria → unclassified Proteobacteria → Proteobacteria bacterium | 513
3300006047|Ga0075024_100307939 | Not Available | 779
3300006172|Ga0075018_10087298 | Not Available | 1364
3300006172|Ga0075018_10316717 | Not Available | 773
3300006174|Ga0075014_100328718 | Not Available | 814
3300006174|Ga0075014_100335799 | Not Available | 806
3300006175|Ga0070712_100070374 | Not Available | 2499
3300006755|Ga0079222_10783561 | Not Available | 774
3300006804|Ga0079221_10097855 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1435
3300006954|Ga0079219_12081874 | Not Available | 543
3300009792|Ga0126374_11084564 | Not Available | 634
3300010048|Ga0126373_10231556 | All Organisms → cellular organisms → Bacteria | 1808
3300010048|Ga0126373_10267283 | All Organisms → cellular organisms → Bacteria | 1691
3300010048|Ga0126373_10444397 | All Organisms → cellular organisms → Bacteria | 1330
3300010048|Ga0126373_12339201 | Not Available | 595
3300010048|Ga0126373_13104807 | Not Available | 517
3300010358|Ga0126370_11495654 | Not Available | 642
3300010358|Ga0126370_11898617 | Not Available | 579
3300010360|Ga0126372_11039071 | Not Available | 833
3300010360|Ga0126372_11903501 | Not Available | 640
3300010360|Ga0126372_12483490 | Not Available | 569
3300010361|Ga0126378_10221893 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1974
3300010361|Ga0126378_11419242 | Not Available | 786
3300010361|Ga0126378_11501549 | Not Available | 764
3300010361|Ga0126378_13457411 | Not Available | 501
3300010366|Ga0126379_10756182 | Not Available | 1069
3300010376|Ga0126381_102656233 | Not Available | 716
3300010376|Ga0126381_102706103 | Not Available | 709
3300010376|Ga0126381_104265996 | Not Available | 554
3300010376|Ga0126381_104404092 | Not Available | 544
3300010398|Ga0126383_11111774 | Not Available | 880
3300010398|Ga0126383_11160878 | Not Available | 862
3300010398|Ga0126383_11610967 | Not Available | 738
3300010398|Ga0126383_11707217 | All Organisms → cellular organisms → Bacteria | 719
3300012211|Ga0137377_11013837 | Not Available | 761
3300012971|Ga0126369_12940886 | Not Available | 558
3300016270|Ga0182036_10112875 | Not Available | 1870
3300016270|Ga0182036_10724054 | Not Available | 806
3300016270|Ga0182036_11054641 | Not Available | 672
3300016294|Ga0182041_10390877 | All Organisms → cellular organisms → Bacteria | 1181
3300016294|Ga0182041_10463548 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1091
3300016319|Ga0182033_10243829 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1456
3300016319|Ga0182033_10548931 | Not Available | 998
3300016319|Ga0182033_10883226 | Not Available | 791
3300016319|Ga0182033_11559517 | Not Available | 597
3300016319|Ga0182033_11729757 | Not Available | 567
3300016341|Ga0182035_10063583 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 2563
3300016341|Ga0182035_10441155 | Not Available | 1102
3300016341|Ga0182035_11847198 | Not Available | 547
3300016341|Ga0182035_12117880 | Not Available | 511
3300016371|Ga0182034_10422457 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1098
3300016371|Ga0182034_10492953 | Not Available | 1020
3300016371|Ga0182034_10777729 | Not Available | 818
3300016371|Ga0182034_10927650 | All Organisms → cellular organisms → Bacteria | 750
3300016371|Ga0182034_11173161 | Not Available | 667
3300016371|Ga0182034_11262034 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → unclassified Betaproteobacteria → Betaproteobacteria bacterium 13_1_20CM_3_63_8 | 644
3300016371|Ga0182034_11469685 | Not Available | 597
3300016387|Ga0182040_10619970 | Not Available | 877
3300016387|Ga0182040_11453958 | Not Available | 581
3300016387|Ga0182040_11640886 | Not Available | 548
3300016387|Ga0182040_11681622 | Not Available | 542
3300016404|Ga0182037_10143507 | Not Available | 1789
3300016404|Ga0182037_10249714 | Not Available | 1403
3300016404|Ga0182037_10436974 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria | 1085
3300016404|Ga0182037_10983648 | Not Available | 735
3300016404|Ga0182037_11417272 | Not Available | 615
3300016404|Ga0182037_11750875 | Not Available | 554
3300016422|Ga0182039_10328608 | Not Available | 1277
3300016422|Ga0182039_10563963 | Not Available | 991
3300016422|Ga0182039_11022280 | Not Available | 742
3300016422|Ga0182039_11369496 | Not Available | 642
3300016422|Ga0182039_11399244 | Not Available | 635
3300016422|Ga0182039_11630599 | Not Available | 589
3300016422|Ga0182039_12202796 | Not Available | 508
3300016445|Ga0182038_10249094 | Not Available | 1427
3300017970|Ga0187783_10126899 | Not Available | 1881
3300017974|Ga0187777_11249755 | Not Available | 544
3300018060|Ga0187765_10403275 | Not Available | 845
3300020150|Ga0187768_1154659 | Not Available | 529
3300021560|Ga0126371_10209150 | Not Available | 2050
3300021560|Ga0126371_10259854 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium → Bradyrhizobium barranii → Bradyrhizobium barranii subsp. barranii | 1853
3300021560|Ga0126371_10268602 | Not Available | 1825
3300021560|Ga0126371_10396562 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1521
3300021560|Ga0126371_11456823 | Not Available | 814
3300021560|Ga0126371_12394693 | Not Available | 638
3300021560|Ga0126371_12526063 | All Organisms → cellular organisms → Bacteria → Terrabacteria group | 622
3300021560|Ga0126371_13230849 | Not Available | 551
3300021560|Ga0126371_13507315 | Not Available | 529
3300025898|Ga0207692_10844877 | Not Available | 600
3300025915|Ga0207693_10099306 | Not Available | 2283
3300025916|Ga0207663_11079123 | Not Available | 645
3300026552|Ga0209577_10821836 | Not Available | 520
3300027874|Ga0209465_10301170 | Not Available | 803
3300027874|Ga0209465_10692128 | Not Available | 500
3300027898|Ga0209067_10166140 | Not Available | 1176
3300027915|Ga0209069_10061129 | Not Available | 1780
3300030916|Ga0075386_12000545 | Not Available | 568
3300031057|Ga0170834_112292836 | Not Available | 1232
3300031122|Ga0170822_11549637 | Not Available | 1161
3300031128|Ga0170823_12210666 | Not Available | 871
3300031231|Ga0170824_113162849 | Not Available | 560
3300031231|Ga0170824_119004938 | Not Available | 1056
3300031231|Ga0170824_119205698 | Not Available | 521
3300031446|Ga0170820_15851270 | Not Available | 2609
3300031474|Ga0170818_101358320 | Not Available | 1924
3300031543|Ga0318516_10333220 | Not Available | 876
3300031543|Ga0318516_10587920 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae | 636
3300031545|Ga0318541_10308027 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 883
3300031545|Ga0318541_10695585 | Not Available | 568
3300031572|Ga0318515_10754283 | Not Available | 514
3300031573|Ga0310915_10012085 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 5043
3300031573|Ga0310915_10146017 | Not Available | 1633
3300031573|Ga0310915_10793265 | Not Available | 666
3300031573|Ga0310915_11166516 | Not Available | 534
3300031679|Ga0318561_10730808 | Not Available | 544
3300031679|Ga0318561_10808159 | Not Available | 515
3300031681|Ga0318572_10483375 | Not Available | 738
3300031681|Ga0318572_10547082 | Not Available | 690
3300031719|Ga0306917_10280191 | Not Available | 1284
3300031719|Ga0306917_10314637 | Not Available | 1211
3300031719|Ga0306917_10912051 | Not Available | 687
3300031719|Ga0306917_10998058 | Not Available | 654
3300031719|Ga0306917_11007627 | Not Available | 650
3300031719|Ga0306917_11180062 | Not Available | 595
3300031719|Ga0306917_11423272 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 534
3300031724|Ga0318500_10672931 | Not Available | 527
3300031724|Ga0318500_10741731 | Not Available | 502
3300031736|Ga0318501_10327805 | All Organisms → cellular organisms → Bacteria | 820
3300031744|Ga0306918_10114779 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1941
3300031744|Ga0306918_10359203 | Not Available | 1131
3300031744|Ga0306918_10627850 | Not Available | 842
3300031744|Ga0306918_10630372 | Not Available | 840
3300031744|Ga0306918_11144764 | Not Available | 602
3300031747|Ga0318502_10191783 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1180
3300031748|Ga0318492_10759204 | Not Available | 521
3300031754|Ga0307475_10972936 | Not Available | 668
3300031768|Ga0318509_10463333 | Not Available | 709
3300031768|Ga0318509_10683734 | Not Available | 570
3300031771|Ga0318546_10341035 | Not Available | 1042
3300031771|Ga0318546_10369116 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia | 1000
3300031781|Ga0318547_10809083 | Not Available | 584
3300031797|Ga0318550_10126264 | Not Available | 1220
3300031821|Ga0318567_10635819 | Not Available | 606
3300031845|Ga0318511_10572619 | Not Available | 525
3300031846|Ga0318512_10656775 | Not Available | 536
3300031879|Ga0306919_10026409 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium 13_2_20CM_2_64_7 | 3599
3300031879|Ga0306919_10465857 | Not Available | 973
3300031890|Ga0306925_10590738 | Not Available | 1174
3300031890|Ga0306925_10743823 | Not Available | 1022
3300031896|Ga0318551_10830200 | Not Available | 538
3300031910|Ga0306923_10425577 | Not Available | 1509
3300031910|Ga0306923_11301284 | Not Available | 771
3300031910|Ga0306923_11870739 | Not Available | 614
3300031912|Ga0306921_10116939 | Not Available | 3104
3300031912|Ga0306921_10234214 | Not Available | 2151
3300031912|Ga0306921_10451296 | Not Available | 1498
3300031912|Ga0306921_10940142 | Not Available | 979
3300031912|Ga0306921_11116249 | Not Available | 883
3300031912|Ga0306921_11654453 | Not Available | 694
3300031941|Ga0310912_10705099 | Not Available | 782
3300031941|Ga0310912_11250370 | Not Available | 564
3300031945|Ga0310913_10736445 | Not Available | 696
3300031945|Ga0310913_10745118 | Not Available | 692
3300031945|Ga0310913_11009849 | Not Available | 583
3300031946|Ga0310910_10503112 | Not Available | 962
3300031946|Ga0310910_10755738 | Not Available | 767
3300031946|Ga0310910_10795036 | Not Available | 745
3300031946|Ga0310910_10998035 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 654
3300031946|Ga0310910_11296464 | Not Available | 563
3300031947|Ga0310909_11314721 | Not Available | 581
3300031947|Ga0310909_11452678 | Not Available | 547
3300031954|Ga0306926_10092952 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia | 3688
3300031981|Ga0318531_10440822 | Not Available | 590
3300032001|Ga0306922_10137217 | All Organisms → cellular organisms → Bacteria | 2609
3300032001|Ga0306922_12101433 | Not Available | 547
3300032010|Ga0318569_10606600 | Not Available | 510
3300032035|Ga0310911_10685332 | Not Available | 594
3300032042|Ga0318545_10197188 | Not Available | 720
3300032059|Ga0318533_10134927 | Not Available | 1736
3300032059|Ga0318533_10258451 | Not Available | 1258
3300032059|Ga0318533_10399211 | Not Available | 1004
3300032059|Ga0318533_11245310 | Not Available | 544
3300032060|Ga0318505_10087141 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiaceae → Rhodoplanes → unclassified Rhodoplanes → Rhodoplanes sp. Z2-YC6860 | 1398
3300032063|Ga0318504_10412876 | Not Available | 644
3300032076|Ga0306924_11109469 | Not Available | 862
3300032076|Ga0306924_11540699 | Not Available | 703
3300032076|Ga0306924_11669982 | Not Available | 669
3300032076|Ga0306924_11765573 | Not Available | 646
3300032076|Ga0306924_12297451 | Not Available | 547
3300032094|Ga0318540_10383549 | Not Available | 679
3300032180|Ga0307471_102792893 | Not Available | 620
3300032261|Ga0306920_101572256 | Not Available | 936
3300032261|Ga0306920_102198009 | Not Available | 767
3300032261|Ga0306920_102989368 | Not Available | 638
3300032261|Ga0306920_103523434 | Not Available | 578
3300032261|Ga0306920_104347514 | Not Available | 509
3300033289|Ga0310914_10139625 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 2127
3300033289|Ga0310914_10478114 | Not Available | 1127
3300033289|Ga0310914_10676384 | Not Available | 927
3300033289|Ga0310914_11468696 | Not Available | 585
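
Each scaffold identifier above has the form `<number>|<scaffold name>`, and the leading number appears to match a Taxon OID in the Associated Samples table. A minimal Python sketch follows for splitting identifiers and counting scaffolds per sample. The Taxon OID interpretation is an assumption drawn from the matching numbers on this page, and `parse_scaffold_id` and `scaffolds_per_sample` are hypothetical helper names, not part of any NMPFamsDB or IMG/M API.

```python
from collections import Counter
from typing import Iterable, NamedTuple


class ScaffoldId(NamedTuple):
    taxon_oid: str       # assumed to be the IMG/M Taxon OID of the sample
    scaffold_name: str   # scaffold name within that sample


def parse_scaffold_id(raw: str) -> ScaffoldId:
    """Split '<taxon_oid>|<scaffold_name>' into its two parts."""
    taxon_oid, sep, name = raw.strip().partition("|")
    if not sep or not taxon_oid.isdigit() or not name:
        raise ValueError(f"unexpected scaffold identifier: {raw!r}")
    return ScaffoldId(taxon_oid, name)


def scaffolds_per_sample(ids: Iterable[str]) -> Counter:
    """Count how many family scaffolds come from each sample."""
    return Counter(parse_scaffold_id(raw).taxon_oid for raw in ids)


# Three identifiers copied from the Associated Scaffolds table.
example_ids = [
    "3300000597|AF_2010_repII_A1DRAFT_10128107",
    "3300000597|AF_2010_repII_A1DRAFT_10133583",
    "3300003505|JGIcombinedJ51221_10463299",
]

per_sample = scaffolds_per_sample(example_ids)
print(per_sample["3300000597"])  # 2
print(per_sample["3300003505"])  # 1
```

Keeping the Taxon OID as a string avoids accidental numeric formatting when it is later used as a lookup key against the sample table.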




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 35.27%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 25.00%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 15.18%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 7.14%
Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil | 4.46%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 3.57%
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 3.12%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 1.79%
Tropical Peatland | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland | 1.79%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 1.34%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 0.45%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 0.45%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 0.45%




Associated Samples

Taxon OID | Sample Name | Habitat Type
3300000597 | Forest soil microbial communities from Amazon forest - 2010 replicate II A1 | Environmental
3300003505 | Forest soil microbial communities from Harvard Forest LTER, USA - Combined assembly of forest soil metaG samples (ASSEMBLY_DATE=20140924) | Environmental
3300004267 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 6 MoBio | Environmental
3300005332 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly) | Environmental
3300005437 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-2 metaG | Environmental
3300005439 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG | Environmental
3300005536 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG | Environmental
3300005764 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2) | Environmental
3300006047 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2013 | Environmental
3300006172 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2014 | Environmental
3300006174 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Alex Branch Run_MetaG_ABR_2014 | Environmental
3300006175 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaG | Environmental
3300006755 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter | Environmental
3300006804 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200 | Environmental
3300006954 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Control | Environmental
3300009792 | Tropical forest soil microbial communities from Panama - MetaG Plot_12 | Environmental
3300010048 | Tropical forest soil microbial communities from Panama - MetaG Plot_11 | Environmental
3300010358 | Tropical forest soil microbial communities from Panama - MetaG Plot_3 | Environmental
3300010360 | Tropical forest soil microbial communities from Panama - MetaG Plot_6 | Environmental
3300010361 | Tropical forest soil microbial communities from Panama - MetaG Plot_23 | Environmental
3300010366 | Tropical forest soil microbial communities from Panama - MetaG Plot_24 | Environmental
3300010376 | Tropical forest soil microbial communities from Panama - MetaG Plot_28 | Environmental
3300010398 | Tropical forest soil microbial communities from Panama - MetaG Plot_35 | Environmental
3300012211 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaG | Environmental
3300012971 | Tropical forest soil microbial communities from Panama - MetaG Plot_1 | Environmental
3300016270 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.080 | Environmental
3300016294 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.178 | Environmental
3300016319 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.00H | Environmental
3300016341 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170 | Environmental
3300016371 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.172 | Environmental
3300016387 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.176 | Environmental
3300016404 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.082 | Environmental
3300016422 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.111 | Environmental
3300016445 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.108 | Environmental
3300017970 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_SJ02_MP02_20_MG | Environmental
3300017974 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_Q2_SP5_10_MG | Environmental
3300018060 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_QUI02_MP05_10_MG | Environmental
3300020150 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_QUI02_MP10_20_MG | Environmental
3300021560 | Tropical forest soil microbial communities from Panama - MetaG Plot_4 | Environmental
3300025898 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-2 metaG (SPAdes) | Environmental
3300025915 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaG (SPAdes) | Environmental
3300025916 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG (SPAdes) | Environmental
3300026552 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 (SPAdes) | Environmental
3300027874 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 1 MoBio (SPAdes) | Environmental
3300027898 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2013 (SPAdes) | Environmental
3300027915 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2013 (SPAdes) | Environmental
3300030916 | Forest soil microbial communities from France for metatranscriptomics studies - Site 11 - Champenoux / Amance forest - FA12 EcM (Eukaryote Community Metatranscriptome) | Environmental
3300031057 | Oak Coassembly Site 11 - Champenoux / Amance forest | Environmental
3300031122 | Oak Spring Coassembly Site 11 - Champenoux / Amance forest | Environmental
3300031128 | Oak Summer Coassembly Site 11 - Champenoux / Amance forest | Environmental
3300031231 | Coassembly Site 11 (all samples) - Champenoux / Amance forest | Environmental
3300031446 | Fir Summer Coassembly Site 11 - Champenoux / Amance forest | Environmental
3300031474 | Fir Coassembly Site 11 - Champenoux / Amance forest | Environmental
3300031543 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.176b2f20 | Environmental
3300031545 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.166b4f26 | Environmental
3300031572 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.176b2f19 | Environmental
3300031573 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.AN111 | Environmental
3300031679 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.065b5f23 | Environmental
3300031681 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.089b5f20 | Environmental
3300031719 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.000 (v2) | Environmental
3300031724 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.174b1f20 | Environmental
3300031736 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.174b1f21 | Environmental
3300031744 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.00H (v2) | Environmental
3300031747 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.174b1f22 | Environmental
3300031748 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.108b1f22 | Environmental
3300031754 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515 | Environmental
3300031768 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.084b2f22 | Environmental
3300031771 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.169b2f19 | Environmental
3300031781 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.169b2f20 | Environmental
3300031797 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.169b2f23 | Environmental
3300031821 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.088b5f20 | Environmental
3300031845 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.171b2f18 | Environmental
3300031846 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.171b2f19 | Environmental
3300031879 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.172 (v2) | Environmental
3300031890 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.176 (v2) | Environmental
3300031896 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.082b2f19 | Environmental
3300031910 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.108 (v2) | Environmental
3300031912 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.080 (v2) | Environmental
3300031941 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.OX080 | Environmental
3300031945 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.OX082 | Environmental
3300031946 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.HF172 | Environmental
3300031947 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.T000H | Environmental
3300031954Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.178 (v2)EnvironmentalOpen in IMG/M
3300031981Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.053b4f25EnvironmentalOpen in IMG/M
3300032001Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.082 (v2)EnvironmentalOpen in IMG/M
3300032010Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.088b5f22EnvironmentalOpen in IMG/M
3300032035Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.HF170EnvironmentalOpen in IMG/M
3300032042Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.168b4f26EnvironmentalOpen in IMG/M
3300032059Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.053b4f27EnvironmentalOpen in IMG/M
3300032060Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.084b2f18EnvironmentalOpen in IMG/M
3300032063Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.084b2f17EnvironmentalOpen in IMG/M
3300032076Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.111 (v2)EnvironmentalOpen in IMG/M
3300032094Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.166b4f25EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M
3300032261Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170 (v2)EnvironmentalOpen in IMG/M
3300033289Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.AN108EnvironmentalOpen in IMG/M

Geographical Distribution
[Interactive map; powered by OpenStreetMap]



Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
AF_2010_repII_A1DRAFT_101281071 | 3300000597 | Forest Soil | VDRLDAIRLLKALVAAGAKPRAPMDRAWIDEIAVRDVSLHGTELASALAYAEGERWLADSRTRKDWIYLTRTGEILAKEGVI*
AF_2010_repII_A1DRAFT_101335832 | 3300000597 | Forest Soil | MDRLDAIRLLKALVAAGAKPRVPMDSAWIDKIAVRDVSLHGTQLASALAYAEGQRWLADSRTRKDWIYLTRTGXIVAKESVI*
JGIcombinedJ51221_104632991 | 3300003505 | Forest Soil | LDAIRLLEALVAAGANSQAPMDSAWVDEIAVRDVSLHGTELASALAYAEGEGWLADSPTKGWISLTRTGEIVARAK*
Ga0066396_101032781 | 3300004267 | Tropical Forest Soil | LGATPSTHRRSRNSRARVRKESAYNAWVDRLNAIRLLQALVAAGAKLRAPMANDWVHSIAVRDVSLEGTALASVIAYAEGEGWLADSPRTGWVSLTRAGEVIARVK*
Ga0066388_1004642923 | 3300005332 | Tropical Forest Soil | MDRLDAIRLLKALVAAGAKPRVPMDSAWIDKIAVRDVSLHGTQLASALAYAEGQRWLADSRTRKDWIYLTRTGEIVAKESVI*
Ga0066388_1017649461 | 3300005332 | Tropical Forest Soil | MHGLDRLDAIRLLKALVAAGADSRTPMASAWVDKIAVRDVALHGTALASAIAYAEGEGWLADIPTRKDCISLTRKGEV
Ga0066388_1029776671 | 3300005332 | Tropical Forest Soil | RLLQALVAAGASTREPMDSAWVDKIAVRYMALYGTALASAIAYAEAEGWLTDSRTRKGWIYLTRTGEVIAKLP*
Ga0066388_1051388441 | 3300005332 | Tropical Forest Soil | LDAIRLLQALVAAGADSRTPMASAWVHKIAMRDMGLQGTELASAIAYAEAEGWLADIPARKDCISLTRAGERAAKVK*
Ga0066388_1071445431 | 3300005332 | Tropical Forest Soil | VDRLNAILLLKALIAAGANTRAPIARAWVHEIAVRDTSLQGTELACAIAYAEAERWLADSRTREDWIYLTRWGQIIAKVN*
Ga0070710_108909152 | 3300005437 | Corn, Switchgrass And Miscanthus Rhizosphere | LNAIRLLQALVAAGAKLGVPMADTWVHDGTNLASALAYAEGAGWLVDSARKGWVSLTCAGEVVARLQ*
Ga0070711_1001019336 | 3300005439 | Corn, Switchgrass And Miscanthus Rhizosphere | LNAIRLLQALVAAGAKLGVPMANTWVHDIALRDMSLQGTNLASALAYAEGAGWLVDSARKGWVSLTCAGEVVARLQ*
Ga0070711_1003787102 | 3300005439 | Corn, Switchgrass And Miscanthus Rhizosphere | MRLLKALVAAGAKPGAPMDSAWIDEIAVRDVSLHGTQLASALAYAEGQRWLADSRTRKDWIYLTRTGEIVAKESVI*
Ga0070697_1006358162 | 3300005536 | Corn, Switchgrass And Miscanthus Rhizosphere | LNAIRLLEALVAAGAKLGVPMANTWVHDIALRDMSLQGTNLASALAYAEGAGWLVDSARKGWVSLTCAGEVVARLQ*
Ga0066903_1002251521 | 3300005764 | Tropical Forest Soil | RVDRLDAIRLLQALVAAGANTRAPMDSAWVDEIAVRDMALHGAQLAAAIAYAEGERWLADSPARKDWIYLTRLGQHVAKRSVK*
Ga0066903_1014634942 | 3300005764 | Tropical Forest Soil | MHGVDRLDAIRLLKALVAAGADSRTPMASAWVDKIAVRDVALHGTALASAIAYAEGEGWLADIPTQKRLHFPHSKR*
Ga0066903_1028722032 | 3300005764 | Tropical Forest Soil | MHQESAYNARVDRLDAIRLLQALVAAGANSRAPMDSAWVDKIAVRDMWLHGMQLASAIAYAEAEGWLTDSRTRKGWISLTRTGEAVAKLP*
Ga0066903_1058606381 | 3300005764 | Tropical Forest Soil | VDRLDAIRLLQALIAAGADSRTPMDSAWVDKIAVREMSLHGTGLASAIAYAEGEGWLADIPTRKGCVSLTRAGELVAKANR*
Ga0066903_1059011201 | 3300005764 | Tropical Forest Soil | PRTVYNARMDRLDAIRLLQALVAAGANTRAPMASAWVDEIAVRDMELHGTGLASAIAYAEGQRWLAHGRTREDWIYLTRLGQLVAKESVA*
Ga0066903_1072506722 | 3300005764 | Tropical Forest Soil | VDRLDAIRLLQALVAAGANSHAPMDKAWIDKIAVRDMALHGTALASAIAYAEAEGWLADIPTRKGWISLTSAGEVIAQGNDSNS*
Ga0066903_1086913291 | 3300005764 | Tropical Forest Soil | VDRLDAIRLLEALVTAGASSRAPMDSAWIDKIAVRDMWLHGTQLASAIAYAEGEGWLADIPTRKGCISLTRAGEVVARVK*
Ga0066903_1087983131 | 3300005764 | Tropical Forest Soil | SAYNARVDRLDAIRLLQALVAAGANSRTPIARDWVDKIAVRDMWLHGIQLASAIAYAEGEGWLADIPTRKGCISLTRAGELVAKAKLR*
Ga0075024_1003079392 | 3300006047 | Watersheds | LDAIRLLKALVAAGAKPRAPMDSAWIDKIAVRDVSLHGTQLASALAYAEGQRWLADSSTRKDWIYLTRTGEIVAKESVI*
Ga0075018_100872982 | 3300006172 | Watersheds | FPLVKTLGSTRAPLTPSMRRTGYNARVDRLDAIRLLKALVAAGAKPRAPMDSAWIDEIAVRDVSLHGTQLASALAYAEGEGWLVDSPRKGWVSLTRAGEAVARVK*
Ga0075018_103167171 | 3300006172 | Watersheds | LKALVAAGAKPRAPMDSAWLDEIAVRDVSLHGTELASALAYAEGQRWLADSRTRKDWIYLTRTGEIVAKESV*
Ga0075014_1003287182 | 3300006174 | Watersheds | LDAIRLLKALVATGAKPRAPMDSAWIDEIAVRDVSLHGTQLASALAYAEGEGWLVDSPRKGWVSLTRAGEVVARVK*
Ga0075014_1003357993 | 3300006174 | Watersheds | DAIRLLKALVAAGANSRAPMDSAWVHKIAVRDMSLQGTALASALAYAEGEGWLADSPRKGWISLTRMGEVVAKVN*
Ga0070712_1000703744 | 3300006175 | Corn, Switchgrass And Miscanthus Rhizosphere | LNAIRLLQALVAAGAKLGVPMADTWVHDIALRDMSLQGTNLASALAYAEGAGWLVDSARKGWVSLTCAGEVVARLQ*
Ga0079222_107835611 | 3300006755 | Agricultural Soil | MRRTGYNARVDRFDAMRLLKALVAAGAKPSAPMDRAWVDKIAVRDVSLHGTELASALAYAEGQRWLADSRTRKDWIYLTRTGEIVAKESV*
Ga0079221_100978552 | 3300006804 | Agricultural Soil | MRRTGYNARVDRFDAMRLLKALVAAGAKPSAPMDRAWVDKIAVRDVSLHGTELASALAYAEGQRWLADSRTRKDWIYLTRTGEIVAKESAI*
Ga0079219_120818741 | 3300006954 | Agricultural Soil | ALVAAGAKLGVPMANTWVHDIALRDMSLQGTNLASALAYAEGAGWLVDSARKGWVSLTCAGEVVARLK*
Ga0126374_110845641 | 3300009792 | Tropical Forest Soil | MARAASLRKQSTYNARVDRLDAIRLLKALVAAGAKPRAPMASAWIDEIAVRDVSLHGTQLASALAYAEGQRWLADSRTRKDWIYLTRTGEIVAKESVI*
Ga0126373_102315562 | 3300010048 | Tropical Forest Soil | MHGLDRLDAIRLLKALVAAGADSRTPMASAWVDKIAVRDVALHGTALASAIAYAEGEGWLADIPTRKDCISLTRKGEVVAKLP*
Ga0126373_102672832 | 3300010048 | Tropical Forest Soil | MDRLDAIRLLKALVAAGAKPRVPMDSAWIDKIAVRDVSLHGTQLASAPYAEGQRWLADSRTRKDWIYLTRTGEIVAKESVI*
Ga0126373_104443973 | 3300010048 | Tropical Forest Soil | AGVDRLNAIRLLEALVAAGAKARAPMASDWVHDIAARDMSLQGADLASAIAYAEGERWLADSRTRTGWIYLTRTGEVMAKTK*
Ga0126373_123392011 | 3300010048 | Tropical Forest Soil | LNAIRLLKALVAAGAKPRAPMASTWIHTIALRDVSLQDPALASALAYAEGVGWLVDSPRKGWVSVTRAGAFVARVK*
Ga0126373_131048071 | 3300010048 | Tropical Forest Soil | MRRTGYNARVDRLNAIRLLRALVAAGAKPRAPMDSAWMDKIALRDVSLHGTQLASALAYAEGEGWLVDSPRKGWVSLTRAGEIVARVK*
Ga0126370_114956541 | 3300010358 | Tropical Forest Soil | ARVDRLDAIRLLQALVAAGADYRTPMVSAWVHKIAMRDMGLQGTDLASAIAYAEAEGWLADIPTRKDCISLTRTGEVIAKAK*
Ga0126370_118986171 | 3300010358 | Tropical Forest Soil | SMRGTGYNARVDRFDAMRLLKALVAAGAKPSAPMDRAWVDKIAVRDVSLRGTELASALAYAEGQRWLADSRTRKDWIYLTRTGEIVAKGSVI*
Ga0126372_110390711 | 3300010360 | Tropical Forest Soil | LNAIRLLKALVAAGAKPRAPMASTWIHTIALRDVSLQDTALASALAYAEGVGWLVDSPRKGWVSVTRAGAFVARVK*
Ga0126372_119035011 | 3300010360 | Tropical Forest Soil | YNALVDRLDAIRLLKALVAAGANSRAPMDSAWVRKIAMKDMSLQGTELASALAYAEGEGWLADSPRKGWISLTRKGEVIAKAK*
Ga0126372_124834901 | 3300010360 | Tropical Forest Soil | MQLLTPAHPAYNAQVDRLDAIRLLQALVAAGANTRAPMASAWVDKIAVRDMELHGTGLASAIAYAEGQRWLAHGRTREDWIYLTRLGQLVAKESVI*
Ga0126378_102218933 | 3300010361 | Tropical Forest Soil | MYAWIAWTPYDFYRQLVAGGANTRAPMASAWVDEIAVRDMALHGAQLAAAIAYAEGERWLADSPARKDWIYLTRLGQHVAKQSVK*
Ga0126378_114192421 | 3300010361 | Tropical Forest Soil | MDRLDAIRLLQALVAAGAKLRPPMANDWIDKIAVRDMALHGTALASAIAYAKAEGWLTDSPREGWVSLTR
Ga0126378_115015492 | 3300010361 | Tropical Forest Soil | MDRLDAIRLLQALVAAGADSRTPMASAWVHKIAMRDMGLQGTDLASAIAYAEGEGWLADIPTRKDCISLTRTGEVIAKAK*
Ga0126378_134574111 | 3300010361 | Tropical Forest Soil | GLASLAHMQLLAPAHPFYNARVDRLDAIRLLQALIAAGAASRTPMASAWVHKIAMRDMGLQGMELASAIAYAEAEGWLADIPTRKGCISLTRAGEVVARVK*
Ga0126379_107561821 | 3300010366 | Tropical Forest Soil | VDRLNAIRLLEALVAAGAKPNAAMDSAWVDEIAVRDVSLHGTQLASALAYAEGQGWLVDSPRKGWVSLTRAGEVVARQNDSS*
Ga0126381_1026562331 | 3300010376 | Tropical Forest Soil | RLLQALVAAGADSRTPMASAWVHKIAMRDMGLQGMELASAIAYAEAEGWLADIPTRKDCISLTRTGEVIAKAK*
Ga0126381_1027061031 | 3300010376 | Tropical Forest Soil | QALVAAGAKLRAPMANDWIDKIAVRDMALHGTALASAIAYAKAEGWLTDSPREGWVSLTRAGEVIARKK*
Ga0126381_1042659961 | 3300010376 | Tropical Forest Soil | VDRLDAIRLLKALVAAGANSRVPMDSAWVRKIAMKDMSLQGTELASALAYVEGEGWLAASPRKGWISLTRKGEVIAKAK*
Ga0126381_1044040921 | 3300010376 | Tropical Forest Soil | MDRLDAIRLLQALVAAGADSRTPMANAWVHKIAMRDMGLQGTDLASAIAYAEAEGWLADIPTRKDCISLTRAGELAAKVKRL*
Ga0126383_111117741 | 3300010398 | Tropical Forest Soil | KALVAAGANTRAPMANIWVHSIGVTDLSLQGTELACAIAYAEAERWLADSRTREDWIYLTRWGQIIAKVN*
Ga0126383_111608781 | 3300010398 | Tropical Forest Soil | MDRLDAIRLLQALVAAGADSRTPMASAWVHKIAMRDMGLQGTDLASAIAYAEGEGWLADIPARKDCISLTRAGELAAKVK*
Ga0126383_116109671 | 3300010398 | Tropical Forest Soil | NARVDRVNAILLLKALIAAGANTRVPMARAWVHEIALRDTSLQATELAFAIAYAEAERWLADSRTREDSIYLTRWGQVIAKAN*
Ga0126383_117072172 | 3300010398 | Tropical Forest Soil | MARAASLRKQSTYNARVDRLDAIRLLKALVAAGAKPRAPMASAWIDEIAVRDVSLHGTQLASALAYAEGQRWLADSRTRKDWIYLTRTGEIVAKGSVI*
Ga0137377_110138372 | 3300012211 | Vadose Zone Soil | VAAGAKPRAPMDSAWIDEIAVRDVSLHGTQLASALAYAEGERWLADSRTRKDWIYLTRTGEIVAKQSV*
Ga0126369_129408861 | 3300012971 | Tropical Forest Soil | MRQELAVDHLNAIRLLKALVAAGAKTRAPMAGDWVDKIAVRDMALYGTALASAIACAERERWLAGSQRRQDWIYLTRAGEVVANLL*
Ga0182036_101128751 | 3300016270 | Soil | MRLLKALVAAGAKPREPMDRAWVDKIAVRDVSLHRTELASALGYAEGQRWLADSQTRKDWIYLTRTGEIVAKESVI
Ga0182036_107240542 | 3300016270 | Soil | QQALVAAGAKTRAPMDSAWVDKIAVRDMWLHGMQLASAIAYAEAEGWIVDSRTRKGWISLTRAGEVIAKLP
Ga0182036_110546412 | 3300016270 | Soil | YNARVDRLDAIRLLQALVAAGANTRVPMDSAWVGKIAVREMSLQCTELASAIAYAEAEGWLADSPRKGWISFTRAGEVIARVK
Ga0182041_103908771 | 3300016294 | Soil | RLLQALVAAGAKPRAPMASAWIDEIAVRDLSFDGTQLASALAYAEGQRWLADSRTRKDWIYLTRTGEIVAKGSVI
Ga0182041_104635483 | 3300016294 | Soil | LDAIRLLKALVTAGAKPRAPIDRAWIDEIAVRDVSLHGTELASALAYAEGERWLADSPARKDWIYLTRTGE
Ga0182033_102438293 | 3300016319 | Soil | LTRQESVYNPWVDRLNAIRLLQALVAAGANTRAPMDSAWVHNIAVRDVSLEGTALASAIAYAEAEWLADRPRKGWVSLTRPGEVIAKAK
Ga0182033_105489312 | 3300016319 | Soil | VDRLDAIRLLQALVAAGADSRTPMASAWVDKVAVRDMWLHGMQLASAIAYAEAEGWLTDTPTIKGCVSLTRAGEVVAKTAAIARGG
Ga0182033_108832262 | 3300016319 | Soil | IVKRTVMSVTVFPSQCRFDEPMDRLDAIRLLKALVAAGAKPRAPMDSAWIDKIAVRDVSLHGTQLASALAYAEGQRWLADSRTRKDWIYLTRTGEIVAKESVI
Ga0182033_115595171 | 3300016319 | Soil | APGIGLNAIRLLKVLVAAEASSRAPMARDWVDKIAVRDMSLHGMELASAIAYAEGERWLAGSQSRKDWIYLTLTGELIAKRP
Ga0182033_117297571 | 3300016319 | Soil | LDAIRLLRALVAAGAKPRAPMDSAWMDKIALRDVSLYSTQLASALAYAEGEGWLVDSQRKGWVSLTRAGEIVARVK
Ga0182035_100635835 | 3300016341 | Soil | KALVTAGAKPRAPMTSAWIDEIAVRDLSFHGTQLASALAYAERQRWLADSRTRDDWIYLTRTGEIVAKKSVI
Ga0182035_104411551 | 3300016341 | Soil | WVDRLNAIRLLQALVAAGASSRAPMNGDWVHEIALRDLSLQIPELASAIAYAEGEGWFADSPRRKDWICLTRSGEVIARVK
Ga0182035_118471981 | 3300016341 | Soil | MARAASLRKQSTYNARVDRLDAIRLLKALVAAGAKPRAPMTSAWIDEIAVREVSLHGTQLASALAYAEGQRWLAGSRTRKDWIYLTRTGEIVAK
Ga0182035_121178801 | 3300016341 | Soil | AGAPAHPAYNPWVDRLDAIRLLQALVAAGADSGTPMASAWVDKVAVRDMWLHGMQLASAIAYAEAEGWLTDTPTIKGCVSLTRAGEVVAKTAAIARGG
Ga0182034_104224572 | 3300016371 | Soil | DAIRLLRALVAAGAKPRAPMASAWIDEIAVRDVSLHGTQLASALAYAEGQRWLADSRTRKEWIYLTRTGEIVAKGSVI
Ga0182034_104929531 | 3300016371 | Soil | AIRLLQALVAAGAKTRAPMDSAWVDKIAVRDASLHGTGLASALAYAEAEGWLIDSPRKGWRSVTRAGEVIARAK
Ga0182034_107777291 | 3300016371 | Soil | IRLLKALVAAGANSRASIDSAWVHKIAMRDMSLQGTELASALAYAGGEGWLADSPRKGWISLTRTGAVIAKAK
Ga0182034_109276502 | 3300016371 | Soil | WRWLTRHRCTAPATPRVDRLNAIRLLKALVTAGAKPRAPMTSAWIDEIAVRDLSFHGTQLASALAYAEGQRWLADSRTRDDWIYLTRTGEIVAKRGVI
Ga0182034_111731611 | 3300016371 | Soil | PAHPAYNARVDRLDAIRLLQALVAAGANTRVPMDSAWVGKIAVREMSLQCTELASAIAYAEAEGWLADSPRKGCISLTRAGEVIARIK
Ga0182034_112620342 | 3300016371 | Soil | LVDRLNAIRLLKALVTAGAKPRAPMTSAWIDEIAVRDLSFHGTQLASALAYAERQRWLADSRTRDDWIYLTRTGEIVAKKSVI
Ga0182034_114696851 | 3300016371 | Soil | LNAIRLLKALVAAGAKLRAPMAKTWVHAIALRDLSLQSTDLASVVAYAEAEGWLVDSPKPGCLSLTRAGEAVARVK
Ga0182040_106199702 | 3300016387 | Soil | MQLLAPAHPAYNARVDRLDAIRLLKALVAAGADSRTPMASAWVHKIAMREMSLQGTELASAIAYAEGEGWLADIPTRKDCISLTRTGEVIAKARNDN
Ga0182040_114539581 | 3300016387 | Soil | DRLDAIRLLKALVAAGAKPRAPMTSAWIDEIAVREVSLHGTQLASALAYAEGQRWLADSRTRKDWIYLTRTGEIVAKGSVI
Ga0182040_116408861 | 3300016387 | Soil | LKALVAAGANARAPRASDWVHKMALRDMSLQGTDLASAIAYAEGEGWLADLPTRKGWIFLTRTGEVVAQANDSNSVRKTRYRISHR
Ga0182040_116816221 | 3300016387 | Soil | QLLAPAQPAYNARVDRLDAIRLLQALVAAGANTRVPMASAWVDKIAVRDMELHGTGLASAIAYAEGQRWLADSRTREDWIYLTRLGQRVAKESVI
Ga0182037_101435072 | 3300016404 | Soil | MRLLKALVAAGAKPREPMDRAWVDKIAVRDVSLRGTELASALAYAEGQRWLADSRTRKDWIYLTRTGEIVAKESVI
Ga0182037_102497142 | 3300016404 | Soil | VLTRQESVYNPWVDRLNAIRLLQALVAAGANTRAPMDSAWVHNIAVRDVSLEGTAPASAIAYAEAEWLADSPRKGWVSLTRPGKVIAKAK
Ga0182037_104369742 | 3300016404 | Soil | ARVDRVDAIRLLQALVAAGANSRAQMDSAWVDKIAVSDMWLHGMQLASAIAYAEAEGWLTDSRTRKGWISLTRTGEAVAKLR
Ga0182037_109836482 | 3300016404 | Soil | VDRLDAIRLLKALVAAGANTRASMDSAWVDKIAVRDMELHGTALASAIAYAEAEGWLADSARKGWVSLTRAGEVIARVK
Ga0182037_114172721 | 3300016404 | Soil | SAYNARVDRLDAIRLLKALVAAGANCRAPMDSAWVHKIAMRDMSLQGTELASALAYAEGEGWLADSPRKGWISLTRKGEVIARAK
Ga0182037_117508752 | 3300016404 | Soil | NPSLRTVQSGYNARVDRLNAIRLLGALVAAGAKLRAPMARAWVNNIAVRDISLWETELASAIAYAEGEGWLTDSPRKGWVSLTRAGEVVARAK
Ga0182039_103286081 | 3300016422 | Soil | QSGYNARVDRLNAIRLLKALVAAGANARTPRASDWVHKMALRDMSLQGTDLASAIAYAEGEGWLADIPTRKGCISLTRAGEVVARAK
Ga0182039_105639632 | 3300016422 | Soil | MRLLKALVAAGAKPREPMDRAWVDKIAVRDVSLHGTELASALAYAEGQRWLADSRTREDWIYLTR
Ga0182039_110222802 | 3300016422 | Soil | LNAIRRLKALVASGAKLRAPMAKTWVHAIALRDLSLQSTDLASVVAYAEAEGWLVDSPKPGCFSLTRAGEAVARVK
Ga0182039_113694961 | 3300016422 | Soil | LDAIRLLRALVAAGAKPRAPMDSAWMDKIALRDVSLYSTQLASALAYAEGEGWLVDSPRKGWVSLTRAGEIVARAK
Ga0182039_113992442 | 3300016422 | Soil | ALVAAGANSRTPMASAWVDKIAVREMSLHGTALASALAYAEGEGWLADIPTRKDCISLTRTGEVIAKARNNN
Ga0182039_116305991 | 3300016422 | Soil | MDRLDAIRLLQALVAAGANTRAPMNSAWVDEIAVRDMELHGTGLASAIAYAEGQRWLAHGRTREDWIYLTRLGQLV
Ga0182039_122027961 | 3300016422 | Soil | LKALVAAGADSRTPMASAWVHKIAMRDMSLQGTELASAIAYAEGEGWLADIPTRKDCISLTRTGEVIAKARNDN
Ga0182038_102490943 | 3300016445 | Soil | MRLLKALVAAGAKPREPMDRAWVDKIAVRDVSLHRTELASALAYAEGQRWLADSQTRKDWIYLTRT
Ga0187783_101268991 | 3300017970 | Tropical Peatland | LDAMRLLQALVAAGAKTRAPMDSAWVDKIAVRDASLDGTGLASALAYAEAEGWLIDSPRKGWRSLTHAGEVIARAK
Ga0187777_112497551 | 3300017974 | Tropical Peatland | MDRANAIRLLRALVAAGASSRAAMANAWVHDIAVRDLSLRGTGLASALAYAEGEGWLADGPKTGWALTRAGEVVGKGLK
Ga0187765_104032752 | 3300018060 | Tropical Peatland | MRRTGYNARVDRSNAMRLLKALVAAGAKPRAPMDSAWIDKIAVRDVSLHGTQLASALAYAEGEGWLVDSPRKGWVSHTRAGEVVAGVK
Ga0187768_11546592 | 3300020150 | Tropical Peatland | DRLNAIRLLGALVAAGANTRAPMASAWVNNIAVRDISLRETELASAIAYAEGEGWLTDSPRRGWVSLTRAGEVVARAK
Ga0126371_102091502 | 3300021560 | Tropical Forest Soil | MDRLDAIRLLKALVAAGAKPRVPMDSAWIDKIAVRDVSLHGTQLASALAYAEGQRWLADSRTRKDWIYLTRTGEIVAKESVI
Ga0126371_102598542 | 3300021560 | Tropical Forest Soil | LASAYNAAMDRLDAIRLLQALVAAGAKLRPPMANDWIDKIAVRDMALHGTALASAIAYAKAEGWLTDSPREGWVSLTRAGEGIARKK
Ga0126371_102686022 | 3300021560 | Tropical Forest Soil | MLRRDSAVAAGERFDGSSYSARVDRLDAIRLLQALVAAGANTRAPMDSAWVDEIAVRDMALHGAQLAAAIAYAEGERWLADSPARKDWIYLTRLGQHVAKRSVK
Ga0126371_103965622 | 3300021560 | Tropical Forest Soil | VDRLNAIRLLQALVAAGAKARAPMASEWVHNIAARDMSLQGVDLASAIAYAEAERWLADSRTRQDWVYLTRTGKLMAKAK
Ga0126371_114568231 | 3300021560 | Tropical Forest Soil | IRLLKALVAAGASTRAPMASAWVDKIAVRDMSLYGTALASAVACAERERWLAGSQRRQGWIYLTRTGEVVAKLP
Ga0126371_123946931 | 3300021560 | Tropical Forest Soil | MRQESAADHLNAIRLLQALVAAGASTREPMDSAWVDKIAVRNMALYGTALASAIACAERERWLAGSRRREDWIYLTRTGEVIAKLP
Ga0126371_125260632 | 3300021560 | Tropical Forest Soil | SAYNARVDRLDAIRLLQALVAAGADSRTPMDSAWVDKIAVREVSLHGTALASALAYAEGEGWLADIPTRKGCISLTRAGELVAKAKLR
Ga0126371_132308491 | 3300021560 | Tropical Forest Soil | APSMRRTGYNARVDRLDAIRLLRALVAAGAKPRAPMDSAWMDKIAVRDVSLYGTQLASALAYAEGEGWLVDSPRKGWVSPLVRARSSLG
Ga0126371_135073151 | 3300021560 | Tropical Forest Soil | KQSTYNARVDRLDAIRLLKALVAAGAKPRAPMASDWIDEIAVRDESLHGTQLASALAYAEGQRWLADSRTRKDWIYLTRTGEIVAKGSVI
Ga0207692_108448772 | 3300025898 | Corn, Switchgrass And Miscanthus Rhizosphere | LNAIRLLQALVAAGAKLGVPMANTLVHDIALRDMSLQGTNLASALAYAEGAGWLVDSARKGWVSLTCAGEVVARLQ
Ga0207693_100993064 | 3300025915 | Corn, Switchgrass And Miscanthus Rhizosphere | VDRLNAIRLLQALVAAGAKLGVPMADTWVHDIALRDMSLQGTNLASALAYAEGAGWLVDSARKGWVSLTCAGEVVARLQ
Ga0207663_110791231 | 3300025916 | Corn, Switchgrass And Miscanthus Rhizosphere | SRRRTGYNARVDRFDAMRLLKALVAAGAKPGAPMDSAWIDEIAVRDVSLHGTQLASALAYAEGQRWLADSRTRKDWIYLTRTGEIVAKESVI
Ga0209577_108218361 | 3300026552 | Soil | MRRTGYNARVDRLDAIRLLKALVAAGAKPRAPMDSAWIDEIAVRDVSLHGTQLASALAYAEGERWLADSRTRKDWIYLTRTGEIVARES
Ga0209465_103011702 | 3300027874 | Tropical Forest Soil | MDRLDAIRLLKALVAAGAKPRAPMASAWIDEIAVRDVSLHGTQLASALAYAEGQRWLADSRTRKDWIYLTRAGEVVAKGSVI
Ga0209465_106921282 | 3300027874 | Tropical Forest Soil | AYNARVDRLDAIRLLQALVAAGAKPRAPMDGAWIDEIAVKDVSLHGMELASALAYAEKERWLADSRTRKDWIYVTRIGEIIAKQSGI
Ga0209067_101661401 | 3300027898 | Watersheds | VDRLDAIRLLKALVAAGANSRAPMDSAWVHKIAVRDMSLQGTALASALAYAEGEGWLADSPRKGWISLTRMGEVVAKVN
Ga0209069_100611291 | 3300027915 | Watersheds | YNARVDRFDAMRLLKALVAAGAKPRAPMDSAWIDEIAVRDVSLHGTQLASALAYAEGEGWLVDSPRKGWVSLTRAGEVVARVK
Ga0075386_120005452 | 3300030916 | Soil | PSMRRTGYNARVDRLDAIRLLKALVAAGARPRAPMDSAWLDEIAVRDVSLHGTQLASALAYAEGQRWLADSRTRKDWIYLTRTGEIVAKESV
Ga0170834_1122928364 | 3300031057 | Forest Soil | ALVAAGAKPRAPMDSAWLDEIAVRDVSLHGTQLASALAYAEGQRWLADSRTRKDWIYLTRTGEIVAKESV
Ga0170822_115496371 | 3300031122 | Forest Soil | LDAIRLLKALVAAGAKPRAPMDSAWIDEIAVRDVSLHGTQLASALAYAEGEGWLVDSPRKGWVSLTRAGEVVVRVK
Ga0170823_122106661 | 3300031128 | Forest Soil | MRRTGYNARVDRFDAMRLLKALVAAGAKPRAPMDSAWLDEIAVRDVSLHGTQLASALAYAEGQRWLADSRTRKDWIYLTRTGEIVAKE
Ga0170824_1131628491 | 3300031231 | Forest Soil | LDAIRLLKALVAAGAKPRAPLDRAWVDRIAVRDVSLHGTQLASALAYAEGERWLTDSRTRGDWIYLTRRGEIVAKESV
Ga0170824_1190049382 | 3300031231 | Forest Soil | MRRTGYNARVDRLDAIRLLKALVAAGAKPRAPMDSAWIDEIAVRDVSLHGTQLASALAYAEGERWLADSRTRKDWIYLTRTGEIVVRES
Ga0170824_1192056981 | 3300031231 | Forest Soil | RVDRLDAMRLLKALVAAGAKPRAPMDKAWIDEIAVRDVSLHGMQLASALAYAEGQRWLADSRTRADWIYLTRTGEIVAKESVI
Ga0170820_158512703 | 3300031446 | Forest Soil | MRRTGYNARVDRLDAIRLLKALVAAGAKPRAPMDSAWIDEIAVRDVSLHGTQLASALAYAEGEGWLVDSPRKGWVSLTHAGEVVARVK
Ga0170818_1013583202 | 3300031474 | Forest Soil | MRRTGYNARVDRLDAIRLLKALVAAGAKPRAPMDSAWIDEIAVRDVSLHGTQLASALAYAEGEGWLVDSPRKGWVSLTRAGEVVARVK
Ga0318516_103332201 | 3300031543 | Soil | PAHPAYNARVDRLDAIRLLQALVAAGANTRAPMASAWVDKIAVRDMELHGTGLASAIAYAEGQRWLAHGRTREDWIYLTRLGQLVAKESVI
Ga0318516_105879202 | 3300031543 | Soil | IRLLKALVAAGAKLRVPVANTWVHDIAVRDMALEGTEIASVVAYAEAEGWLVDSSRPGCVSLTRAGEVVARLK
Ga0318541_103080272 | 3300031545 | Soil | VTAGAKPRAPMTSAWIDEIAVRDLSFHGTQLASALAYAERQRWLADSRTRDDWIYLTRTGEIVAKKSVI
Ga0318541_106955851 | 3300031545 | Soil | LDAIRLLKALVTAGAKPRAPIDRAWIDEIAVRDVSLHGTELASALAYAEGERWLADSPARKDWIYLTRTGEIVAKRSVI
Ga0318515_107542831 | 3300031572 | Soil | MRLLKALVAAGAKPREPMDRAWVDKIAVRDVSLHRTELASALAYAEGQRWLADSQTRKDWIYLTRTGEIVAKESVI
Ga0310915_100120853 | 3300031573 | Soil | VDRLDAIRLLQALVAKGANSRTPMDSAWVDKIAVRDMALQGTELASAIAYAEAEGWLVDTPTRKGCISLTRAGEIIARAK
Ga0310915_101460171 | 3300031573 | Soil | VDRLDAIRLLKALVAAGAKVRVPMANTWVHDIAMRDLSLQSTNLASAIAYAEGEGWLADCARKGWVSLTRAGEVVARVNDLETVRKNGRQQCRQN
Ga0310915_107932652 | 3300031573 | Soil | LVAAGANCRAPMDSAWVHKIAMRDMSLQGTELAPALAYAEGEGWLADSPRKGWISLTRKGEVIAKIKMISNCP
Ga0310915_111665161 | 3300031573 | Soil | AAAYNAWVDRLDAIRLLQALVAAGANSRAPMDSAWVDKIAVRDMWLHGMQLASAIAYAEAEGWLTDSRARKGWISLTRTGEAVAKLR
Ga0318561_107308081 | 3300031679 | Soil | TRYNARVDRLDAIRLLKALVTAGAKPRAPIDRAWIDEIAVRDVSLHGTELASALAYAEGERWLADSPARKDWIYLTRTGEIVAKRSVI
Ga0318561_108081591 | 3300031679 | Soil | VDRLDAIRLLRALVAAGADSRTPIASAWVDKIAMRDMALHGTQLASAIAYAEAEGWLTDVPARKDCISLTRAGELVAKAK
Ga0318572_104833753 | 3300031681 | Soil | VDRFDAIRLLEALVAAGAKPNAAMDSAWVDEIAVRDVSLHGTQLASALAYAEGQGWLVDNPRKGWVSLTRLGEAVAKLKRKELLQKLKVKLSV
Ga0318572_105470821 | 3300031681 | Soil | MQLLAPAHPAYNARVDRLDAIRLLQALVAAGADSRTPMASAWVHKIAMREMSLQGTELASAIAYAEGEGWLADIPTRKDCISLTRTGEVIAKARNDN
Ga0306917_102801911 | 3300031719 | Soil | YNARVDRLDAIRLLQALVAKGANSRTPMDSAWVDKIAVRDMALQGTELASAIAYAEAEGWLVDTPTRKGCISLTRAGEIIARAK
Ga0306917_103146371 | 3300031719 | Soil | LNAIRLLKALVAAGAKPRAPMASTWIHTIALRDVSLQDTALASALAYAEGVGWLVDSPRKGWVSVTRAGAFVARVK
Ga0306917_109120512 | 3300031719 | Soil | LNAIRLLKALVAAGAKLRAPMAKTWVHAIALRDLSLQSTDLASVVAYAEAEGWLVDSPKPGCFSLTRAGEAVARVK
Ga0306917_109980582 | 3300031719 | Soil | LNAIRLLRALVAAGAKPRAPMDSAWMDKIALRDVSLYSTQLASALAYAEGEGWLVDSQRKGWVSLTRAGEIVARVK
Ga0306917_110076271 | 3300031719 | Soil | VDRLDAIRLLKALVAAGANTRAPMDRAWVDKIAVREMSLQCTELASAIAYAEAEGWLADSPRKGWISFTRAGEVIARVK
Ga0306917_111800621 | 3300031719 | Soil | MQLLAPAHPAYNARVDRLDAIRLLKALVAAGADSRTPMASAWVHKIAMRDMSLQGTELASAIAYAEGEGWLADIPTRKDCISLTRTGEVI
Ga0306917_114232722 | 3300031719 | Soil | APATPRVDRLNAIRLLKALVTAGAKPRAPMTSAWIDEIAVRDLSFHGTQLASALAYAEGQRWLADSRTRDDWIYLTRTGEIVAKKSVI
Ga0318500_106729311 | 3300031724 | Soil | AYSARVDHLNAIRLLKALVAAGAKPRAPMASTWIHTIALRDVSLQDTALASALAYAEGVGWLVDSPRKGWVSVTRAGAFVARVK
Ga0318500_107417312 | 3300031724 | Soil | RSLSVHLFGVGTTRASNRAYSARVDRLNAIRLLKALVAAGAKLRAPMAKTWVHAIALRDLSLQSTDLASVVAYAEAEGWLVDSPKPGCFSLTRAGEAVARVK
Ga0318501_103278051 | 3300031736 | Soil | VRVDRLNAICLLNALVAAGAKVRVPMANTWVHDIAMRDLSLQSTNLASAIAYAEAEGWLVDSPRPGWVSLTRAGEVVARVK
Ga0306918_101147793 | 3300031744 | Soil | APATPRVDRLNAIRLLKALVTAGAKPRAPMTSAWIDEIAVRDLSFHGTQLASALAYAERQRWLADSRTRDDWIYLTRTGEIVAKKSVI
Ga0306918_103592033 | 3300031744 | Soil | MQLLAPAHPAYNARVDRLDAIRLLRALVAAGADSRTPMASAWVHKIAMREMSLQGTELASAIAYAEGEGWLADIPTRKDCISLTRTGEVIAKARNNN
Ga0306918_106278501 | 3300031744 | Soil | NAIRLLKALVAAGANARAPMASDWVHKMALRDMSLQGTDLASAIAYAEGEGWLADIPTRKGCISLTRAGEVVARAK
Ga0306918_106303722 | 3300031744 | Soil | VDRLDAIRLLQALVAAGADYRTPMGSAWVHKIAMRDMGLQGTDLASAIAYAEAEGWLADIPTRKDCISLTRAGVVIAKQK
Ga0306918_111447641 | 3300031744 | Soil | TPAHPAYNARVDRLDAIRLLQALVAAGADSRTPMASAWVHKIAIRDMSLQGTELASAIAYAEGEGWLADIPTRKDCISLTRTGEVIAKARNDN
Ga0318502_101917831 | 3300031747 | Soil | RLLKALVAAGANSRTPMASDWIHDMAVRDMSLEGTELASAIAYAEAEGWLADTPTRKGCISLTRTGEAMAKAK
Ga0318492_107592041 | 3300031748 | Soil | NVRVDRLDAIRLLQALVAAGANTRAPMASAWVDKIAVRDMELHGTGLASAIAYAEGQRWLAHGRTREDWIYLTRLGQLVAKESVI
Ga0307475_109729361 | 3300031754 | Hardwood Forest Soil | LDAIRLLKALVAAGAKPRAPMDSAWIDEIAVRDVSLHGTQLASALAYAEGQRWLADSRTRKDWIYLTRTGEIVAKESVI
Ga0318509_104633332 | 3300031768 | Soil | VDRLDAIRLLKALVAAGANCRAPMDSAWLHKIAMRDMSLQGTELASALAYAEGEGWLADSPRKGWISLTRKGEVIAKAK
Ga0318509_106837341 | 3300031768 | Soil | LDAIRLLQALVAAGANSRTPMASAWVDKIAVREMSLHGTALASALAYAEGEGWLADIPTRKDCISLTRAGEVIAKQK
Ga0318546_103410351 | 3300031771 | Soil | RVDRLDAIRLLKALVAAGANTRAPMDRAWVDKIAVREMSLQCTELASAIAYAEAEGWLADSPRKGWISFTRAGEVIARVK
Ga0318546_103691161 | 3300031771 | Soil | LDAIRLLKALVTAGAKPRAPIDRAWIDEIAVRDVSLHGTELASALAYAEGERWLADSPARKDWIYLTRTGEIVAKR
Ga0318547_108090831 | 3300031781 | Soil | AYNARVDRLDAIRLLKALVAAGANSRAPMDSAWVHKIAMRDMSLQGTELASALAYAEGEGWLADSPRKGWISLTRKGEVIAKAK
Ga0318550_101262641 | 3300031797 | Soil | IRLLKALVTAGAKPRAPIDRAWIDEIAVRDVSLHGTELASALAYAEGERWLADSPARKDWIYLTRTGEIVAKRSVI
Ga0318567_106358192 | 3300031821 | Soil | VGVDRLDAIRLLQALVAKGANSRTPMDSAWVDKIAVRDMALQGTELASAIAYAEAEGWLVDTPTRKGCISLTRAGEIIARAK
Ga0318511_105726191 | 3300031845 | Soil | DRLDAIRLLQALVAAGADSRTPMASAWVHKIAIRDMSLQGTELASAIAYAEGEGWLADIPTRKDCISLTRTGEVIAKARNDN
Ga0318512_106567751 | 3300031846 | Soil | VDRLNAIRLLKALVTAGAKPRAPMTSAWIDEIAVRDLSFHGTQLASALAYAERQRWLADSRTRDDWIYLTRTGEIVAKKSVI
Ga0306919_100264097 | 3300031879 | Soil | GLPVGPAYNARVDRLDAIRLLQALVAKGANSRTPMDSAWVDKIAVRDMALQGTELASAIAYAEAEGWLVDTPTRKGCISLTRAGEIIARAK
Ga0306919_104658571 | 3300031879 | Soil | ARVDRLNAIRLLKALVAAGAKLRAPMAKTWDHAIALRDLSLQSTDLASVVAYAEAEGWLVDSPKPGCFSLTRAGEAVARVK
Ga0306925_105907382 | 3300031890 | Soil | LNAIRLLKALVAAGAKLRVPMANTWVHDIAVRDMALEGTEIASVVAYAEAEGWLVDSSRPGCVSLTRAGEVVARLK
Ga0306925_107438232 | 3300031890 | Soil | VLLLRCSLRRIGYNARVDRFDAIRLLQALVTAGATPRAPMASAWIDKIAVRDVSLHGTQLASALAYAEGQRWLADSRTREDWIYLTRTGEIVAKKSVI
Ga0318551_108302001 | 3300031896 | Soil | LDAIRLLQALVTAGAKPRAPMASAWIDEIAVRDLSFDGTQLASALAYAEGQRWLADSRTRKDWIYLTRTGEIVAKGSVI
Ga0306923_104255771 | 3300031910 | Soil | ARVDRLNAIRLLKALVAAGANARAPMASDWVHKMALRDMSLQGTDLASAIAYAEGEGWLADIPTRKGCISLTRAGEVVARAK
Ga0306923_113012842 | 3300031910 | Soil | YNARVDRLDAIRLLQALVAAGADSRTPMASAWVHKIAIRDMSLQGTELASAIAYAEGEGWLADIPTRKDCISLTRTGEVIAKARNDN
Ga0306923_114749321 | 3300031910 | Soil | TIFGPRGWGSAGNSYARAVLLLRCSLRRIGYNARVDRFDAIRLLQALVTAGATPRAPMASAWIDKIAVRDVSLHGTQLASALAYAEGQRWLADSRTREDWIYLTRTGEIVAKKSVI
Ga0306923_118707391 | 3300031910 | Soil | VDRLNAIRLLGALVAAGAKLRAPMARAWVNNIAVRDISLWETELASAIAYAEGEGWLTDSPRKGWVSLTRAGEVVARAK
Ga0306921_101169395 | 3300031912 | Soil | MEIAQLAPAHTIASAYNARVDRLDAIRLLQALVAAGAKPRAPMATDWVHNIAVRNMSLDGTKLASALAYAQAEGWLIDSPRKGWISLTRAGEVIARVK
Ga0306921_102342144 | 3300031912 | Soil | VDRLNAIRLLKALVAAGANARAPMASDWVHKMALRDMSLQGTDLASAIAYAEGEGWLADIPTRKGCISLTRAGEVVARAK
Ga0306921_104512962 | 3300031912 | Soil | VLTRQESVYNPWVDRLNAIRLLQALVAAGANTRAPMDSAWVHNIAVRDVSLEGTAPASAIAYAEAEWLADSPRKGWVSLTRPGEVIAKAK
Ga0306921_109401422 | 3300031912 | Soil | GAIRLLQALVAAGAKTRAPMDSAWVDKIAVRDASLHGTGLASALAYAEAEGWLIDSPRKGWRSVTRAGEVIARAK
Ga0306921_110623112 | 3300031912 | Soil | SGLLKEISAASAYNPGVDRLDAIRLLQALVAAGANTRAPMASAWVDKIAVRHMELHGTGLASAIAYAEGQRWLAHGRTREDWIYLTRLGQLVAKESVI
Ga0306921_111162491 | 3300031912 | Soil | IRLLQALVAAGAKLRAPMASDWIDKIAVREMALHGTALASAIAYAKAEGWLTDSPREGWISLTHAGEVIARQK
Ga0306921_116544532 | 3300031912 | Soil | MQLLAPAHPAYNARVDRLDAIRLLKALVAAGADSRTPMASAWVHKIAMRDMSLQGTELASAIAYAEGEGWLADIPTRKDCISLTRTGEVIAKARNDN
Ga0310912_107050992 | 3300031941 | Soil | KALVAAGANCRAPMDSAWVHKIAMRDMSLQGTELAPALAYAEGEGWLADSPRKGWISLTRKGEVIAKIKMISNCP
Ga0310912_112503701 | 3300031941 | Soil | ASAQMQLLAPAHPAYNARVDRLDAIRLLRALVAAGADSRTPMASAWVHKIAMREMSLQGTELASAIAYAEGEGWLADIPTRKDCISLTRTGEVIAKARNDN
Ga0310913_107364451 | 3300031945 | Soil | LNAIRLLKALVAAGAKPRAPMASTWIHTIALRDVSLQDTALASALAYAEGVGWLVDSPRKGWVSVTRAGAFVATPAQPAPAPT
Ga0310913_107451181 | 3300031945 | Soil | ARVDRLNAIRLLKALVAAGAKLRVPMANTWVHDIAVRDMALEGTEIASVVAYAEAEGWLVDSSRPGCVSLTRAGEVVARLK
Ga0310913_110098492 | 3300031945 | Soil | STYNARVDRLDAIRLLKALVAAGAKPRAPMTSAWIDEIAVREVSLHGTQLASALAYAEGQRWLADSRTRKDWIYLTRTGEIVAKGSVI
Ga0310910_105031122 | 3300031946 | Soil | VDRLDAIRLLQALVTAGAKPRAPMASAWIDEIAVRDLSFDGTQLASALAYAEGQRWLADSRTRKDWIYLTRTGEIVAKGSVI
Ga0310910_107557382 | 3300031946 | Soil | VDRLDAIRLLQALVAAGANTRAPMDSAWVDEIAVRDLALHGTQLASAIAYAEGERWLADSRTRKDWLYLTRLGQAVAKQSVI
Ga0310910_107950361 | 3300031946 | Soil | VDRLNAIRLLKALVAAGANARAPMASDWVHKMALRDMSLQGTDLASAIAYAEGEGWLADLPTRKGWIFLTRTGEVVAQANDSNSVRKTRYRISHR
Ga0310910_109980351 | 3300031946 | Soil | LDAIRLLKALVAAGAKPRAPMTSAWIDEIAVREVSLHGTQLASALAYAEGQRWLADSRTRKDWIYLTRTGEIVAKGSV
Ga0310910_112964642 | 3300031946 | Soil | LDAIRLLQALVAAGANSRAPMDSAWVDKIAVRDMWLHGMQLASAIAYAEAEGWLTDSRTRKGWISLTRTGEAVAMLP
Ga0310909_113147211 | 3300031947 | Soil | LNAICLLNALVAAGAKVRVPMANTWVHDIAMRDLSLQSTNLASAIAYAEAEGWLVDSPRPGWVSLTRAGEVVARV
Ga0310909_114526781 | 3300031947 | Soil | AQVDRLGAIRLLQALVAAGAKTRAPMDSAWVDKIAVRDASLHGTGLASALAYAEAEGWLIDSPRKGWRSVTRAGEVIARAK
Ga0306926_100929524 | 3300031954 | Soil | VDRLDAIRLLKALVTAGAKPRAPIDRAWIDEIAVRDVSLHGTELASALAYAEGERWLADSPARKDWIYLTRTGEIVAKRSVI
Ga0318531_104408221 | 3300031981 | Soil | LNAIRLLKALVAAGAKPRAPMASAWIHTIALRDVSLQDTALASALAYAEGVGWLVDSPRKGWVSVTRAGAFVARVK
Ga0306922_101372177 | 3300032001 | Soil | YNARVDRLDAIRLLKALVAAGANSRTPMANTWVHDIAMRDLSLQSTNLASAIAYAEGEGWLADCARKGWVSLTRAGEVVARVNDLETVRKNGRQQCRQN
Ga0306922_121014331 | 3300032001 | Soil | VDRLNAIRLLGALVAAGAKLRAPMARAWVNNIAVRDISLWETELASAIAYAEGEGWLTDSPRRGWVSLTRAGEVVARAK
Ga0318569_106066001 | 3300032010 | Soil | SRARASIGISAAYNARVDRLDAIRLLQALVAAGANTRAPMASAWVDKIAVRDMELHGTGLASAIAYAEGQRWLAHGRTREDWIYLTRLGQLVAKESVI
Ga0310911_106853321 | 3300032035 | Soil | MSVTVFPSQCRFDEPMDRLDAIRLLKALVAAGAKPRAPMDSAWIDKIAVRDVSLHGTQLASALAYAEGQRWLADSRTREDWIYLTRTGEIVAKKSVI
Ga0318545_101971881 | 3300032042 | Soil | YNARVDRLDAIRLLKALVAAGANSRTPMASDWIHDMAVRDMSLEGTELASAIAYAEAEGWLADTPTRKGCISLTRTGEAMAKAK
Ga0318533_101349271 | 3300032059 | Soil | YNVRVDRLDAIRLLQALVAAGANTRAPMASAWVDKIAVRDMELHGTGLASAIAYAEGQRWLAHGRTREDWIYLTRLGQLVAKESVI
Ga0318533_102584514 | 3300032059 | Soil | PSNPSLRTVQSGYNTRVDRLNAIRLLGALVAAGANTRAPMASAWVNNIAVRDISLRETELASAIAYAEGEGWLTDSPRKGWVSLTRAGEVVARAK
Ga0318533_103992111 | 3300032059 | Soil | MDRLDAIRLLQALVAAGADSRTPMASAWVHKIAMRDMGLQGTDLASAIAYAEAEGWLADIPTRKDCISLTQTGEVIARRNDLN
Ga0318533_112453102 | 3300032059 | Soil | AYNARVDRLDAIRLLQALVAAGADSRTPMASAWVRKIAMRDMGLQGTELASAIAYAEAEGWLVDIPTRKDLISLTRTGEVIAKAK
Ga0318505_100871412 | 3300032060 | Soil | LDAIRLLKALVAAGANSRTPMANTWVHDMAVRDMSLEGTELASAIAYAEAEGWLADTPTRKGCISLTRTGEAMAKAK
Ga0318504_104128762 | 3300032063 | Soil | LDAIRLLKALVAAGANCRAPMDSAWVHKIAMRDMSLQGTELASALAYAEGEGWLADSPRKGWISLTRKGEVIAKAK
Ga0306924_111094692 | 3300032076 | Soil | YNARVDRLDAIRLLKALVAAGADSRTPMASAWVHKIAIRDMSLQGTELASAIAYAEGEGWLADIPTRKDCISLTRTGEVIAKARNDN
Ga0306924_115406991 | 3300032076 | Soil | VDRLDAIRLLQALVAAGAKLRAPMASDWIDKIAVREMALHGTALASAIAYAKAEGWLTDSPREGWISLTHAGEVIARQK
Ga0306924_116699821 | 3300032076 | Soil | RIGYNARVDRFDAIRLLQALVTAGATPRAPMASAWIDKIAVRDVSLHGTQLASALAYAEGQRWLADSRTREDWIYLTRTGEIVAKKSVI
Ga0306924_117655731 | 3300032076 | Soil | LNAICLLKALVAAGAKVRVPMANTWVHDIAMRDLSLQSTNLASAIAYAEGEGWLADCARKGWVSLTRAGEVVARVK
Ga0306924_122974511 | 3300032076 | Soil | LLVYNPLVDRLDAIRLLRALVAAGANTRAPMASAWIDEIAVRDVSLHGTELASAIAYAEGQRWLADSRTRDDWIYLTRT
Ga0318540_103835492 | 3300032094 | Soil | NAWVDRFDAMRLLKALVAAGAKPREPMDRAWVDKIAVRDVSLHRTELASALAYAEGQRWLADSQTRKDWIYLTRTGEIVAKESVI
Ga0307471_1000579071 | 3300032180 | Hardwood Forest Soil | VHGSPALPPPSRRRTGYNARVDRLNAIRLLQALVAAGAKPRAPMDSAWIDKIAVRDVSLHGTELASALAYAEGERWLADSRTRKDWIYLTRTGEIVAKESVT
Ga0307471_1027928932 | 3300032180 | Hardwood Forest Soil | TGYNARVDRFDAMRLLKALVAAGAKPSAPMDRAWVDKIAVRDVSLHGTELASALAYAEGQRWLADSRTRKDWIYLTRTGEIVAKESV
Ga0307472_1004573722 | 3300032205 | Hardwood Forest Soil | VHGSPALPPPSRRRTGYNARVDRLNAIRLLQALVAAGAKPRAPMDSAWIDKIAARDVSLHGTELASALAYAEGERWLADSRTRKDWIYLTRTGEIVAKESVT
Ga0306920_1015722561 | 3300032261 | Soil | MQLLAPAHPAYNARVDRLDAIRLLKALVAAGADSRTPMASAWVHKIAMRDMSLQGTELASAIAYAEGEGWLADIPTRKDCISLTRTGEVIAKARNNN
Ga0306920_1021980091 | 3300032261 | Soil | MSVTVFPSQCRFDEPMDRLDAIRLLKALVAAGAKPRAPMDSAWIDKIAVRDVSLHGTQLASALAYAEGQRWLADSRTRKDWLYFTRTGEIVAKESVI
Ga0306920_1029893682 | 3300032261 | Soil | AYNARVDRLDAIRLLKALVAAGANSRAPMDSAWVHKIAMRDMSLQGTELASALAYAEGEGWLADSPRKGWISLTRKGEVIARAK
Ga0306920_1035234341 | 3300032261 | Soil | MQLLAPAHPAYNARMDRLDAIRLLQALVAAGANSRTPMASAWVYKIAMRDMGLQGTDLASAIAYAEAEGWLADIPTRKDCISLT
Ga0306920_1043475141 | 3300032261 | Soil | AIRLLRALVAAGAKPRAPMDSAWMDKIALRDVSLHGTQLASALAYAEGEGWLVDSPRKGWVSLTRAGDIVAR
Ga0310914_101396254 | 3300033289 | Soil | RQHNQLGAVWLVYNPLVDRLNAIRLLKALVTAGAKPRAPMTSAWIDEIAVRDLSFHGTQLASALAYAERQRWLADSRTRDDWIYLTRTGEIVAKKSVI
Ga0310914_104781141 | 3300033289 | Soil | LNAIRLLKALVAAGAKPRAPMASAWIHTIALRDVSLQDTALASALAYAEGVGWLVDSPRKGWVSVTRAGAFV
Ga0310914_106763844 | 3300033289 | Soil | LLRALVAAGADSRTPIASAWVDKIAMRDMALHGTQLASAIAYAEAEGWLTDVPARKDCISLTRAGELVAKAK
Ga0310914_114686961 | 3300033289 | Soil | RLNAIRLLKALVAAGANARAPRASDWVHKMALRDMSLQGTDLASAIAYAEGEGWLADLPTRKGWIFLTRTGEVVAQANDSNSVRKTRYRISHR


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice