NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

Metagenome Family F078520



Overview

Basic Information
Family ID F078520
Family Type Metagenome
Number of Sequences 116
Average Sequence Length 134 residues
Representative Sequence MKPLRSICILAAFPLERSGQILNHARRRASLILAVSVVASVYLFWKCDTRAQSADDLPTALEPQLAEPGDWAADFNQAQMACYEGHMRACDSIWKSDRVLFDTWLFNYGRTCGGRVGLREIRRADLSCSDAFPGHE
Number of Associated Samples 97
Number of Associated Scaffolds 116

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 9.48 %
% of genes near scaffold ends (potentially truncated) 38.79 %
% of genes from short scaffolds (< 2000 bps) 87.07 %
Associated GOLD sequencing projects 87
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.30
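
The percentages above can be turned back into approximate gene counts. The following is a minimal sketch, assuming each percentage is taken over the 116 family sequences listed under Basic Information and that each sequence corresponds to one gene; the function name `percent_to_count` is hypothetical and not part of NMPFamsDB.

```python
# Assumption: percentages are computed over the 116 family members.
FAMILY_SIZE = 116  # "Number of Sequences" from Basic Information


def percent_to_count(percent: float, total: int = FAMILY_SIZE) -> int:
    """Convert a reported percentage to the nearest whole number of genes."""
    return round(percent / 100 * total)


# Values copied from the Quality Assessment block.
reported = {
    "valid RBS motif": 9.48,
    "near scaffold end": 38.79,
    "short scaffold (< 2000 bp)": 87.07,
}

for label, percent in reported.items():
    print(f"{label}: about {percent_to_count(percent)} of {FAMILY_SIZE} genes")
```

Under that assumption the three figures correspond to roughly 11, 45, and 101 genes respectively.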


Most Common Taxonomy
Group Bacteria (100.000 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil
(40.517 % of family members)
Environment Ontology (ENVO) Unclassified
(66.379 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(75.000 % of family members)




Structure & Topology

Predicted Topology & Secondary Structure

Classification: Transmembrane (alpha-helical)
Signal Peptide: Yes
Secondary Structure distribution:
α-helix: 59.15%
β-sheet: 0.00%
Coil/Unstructured: 40.85%

Predicted 3D Structure

pTM-score: 0.30

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
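
The cutoff in the note above can be expressed as a simple check. This is a minimal sketch, assuming the only criterion is the pTM < 0.7 threshold quoted in the note; the constant and function names are hypothetical.

```python
# Threshold quoted in the note: models with pTM below 0.7 are low confidence.
LOW_CONFIDENCE_PTM = 0.7


def is_low_confidence(ptm_score: float, threshold: float = LOW_CONFIDENCE_PTM) -> bool:
    """Return True when a model's pTM-score falls below the cutoff."""
    return ptm_score < threshold


# pTM-score reported for family F078520.
print(is_low_confidence(0.30))
```

For this family's pTM-score of 0.30 the check returns True, which is consistent with the model being flagged as low quality.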



Gene Neighborhood

Neighboring Pfam domains

Pfam ID    Name    % Frequency in 116 Family Scaffolds
PF00144    Beta-lactamase    10.34
PF03548    LolA    8.62
PF07045    DUF1330    3.45
PF02517    Rce1-like    1.72
PF04430    DUF498    1.72
PF07726    AAA_3    0.86
PF11992    TgpA_N    0.86
PF13302    Acetyltransf_3    0.86
PF13650    Asp_protease_2    0.86
PF09650    PHA_gran_rgn    0.86
PF01544    CorA    0.86
PF07676    PD40    0.86
PF03466    LysR_substrate    0.86
PF13517    FG-GAP_3    0.86
PF13619    KTSC    0.86

Neighboring Clusters of Orthologous Genes (COGs)

COG ID    Name    Functional Category    % Frequency in 116 Family Scaffolds
COG1680    CubicO group peptidase, beta-lactamase class C family    Defense mechanisms [V]    10.34
COG1686    D-alanyl-D-alanine carboxypeptidase    Cell wall/membrane/envelope biogenesis [M]    10.34
COG2367    Beta-lactamase class A    Defense mechanisms [V]    10.34
COG2834    Outer membrane lipoprotein-sorting protein    Cell wall/membrane/envelope biogenesis [M]    8.62
COG5470    Uncharacterized conserved protein, DUF1330 family    Function unknown [S]    3.45
COG1266    Membrane protease YdiL, CAAX protease family    Posttranslational modification, protein turnover, chaperones [O]    1.72
COG1504    Uncharacterized conserved protein, DUF498 domain    Function unknown [S]    1.72
COG3737    Uncharacterized protein, contains Mth938-like domain    Function unknown [S]    1.72
COG4449    Predicted protease, Abi (CAAX) family    General function prediction only [R]    1.72
COG0598    Mg2+ and Co2+ transporter CorA    Inorganic ion transport and metabolism [P]    0.86



Phylogeny

NCBI Taxonomy

Name    Rank    Taxonomy    Distribution
All Organisms    root    All Organisms    100.00 %
Unclassified    root    N/A    0.00 %

Associated Scaffolds


Scaffold    Taxonomy    Length (bp)
3300000363|ICChiseqgaiiFebDRAFT_11023676    All Organisms → cellular organisms → Bacteria    1068
3300000787|JGI11643J11755_11104256    All Organisms → cellular organisms → Bacteria    534
3300002911|JGI25390J43892_10161080    All Organisms → cellular organisms → Bacteria    525
3300004479|Ga0062595_102285842    All Organisms → cellular organisms → Bacteria    532
3300005166|Ga0066674_10021151    All Organisms → cellular organisms → Bacteria    2811
3300005167|Ga0066672_10117997    All Organisms → cellular organisms → Bacteria    1637
3300005171|Ga0066677_10200890    All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia    1115
3300005172|Ga0066683_10453558    All Organisms → cellular organisms → Bacteria    786
3300005175|Ga0066673_10638706    All Organisms → cellular organisms → Bacteria    617
3300005180|Ga0066685_10392022    All Organisms → cellular organisms → Bacteria    965
3300005186|Ga0066676_11015848    All Organisms → cellular organisms → Bacteria    551
3300005187|Ga0066675_10071216    All Organisms → cellular organisms → Bacteria    2211
3300005187|Ga0066675_10779240    All Organisms → cellular organisms → Bacteria    721
3300005446|Ga0066686_10295099    All Organisms → cellular organisms → Bacteria    1100
3300005450|Ga0066682_10357232    All Organisms → cellular organisms → Bacteria    940
3300005540|Ga0066697_10763644    All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Geodermatophilales → Geodermatophilaceae → Blastococcus → Blastococcus endophyticus    526
3300005555|Ga0066692_10320274    All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium    984
3300005556|Ga0066707_10179810    All Organisms → cellular organisms → Bacteria    1355
3300005556|Ga0066707_10573348    All Organisms → cellular organisms → Bacteria    725
3300005558|Ga0066698_10792906    All Organisms → cellular organisms → Bacteria    614
3300005560|Ga0066670_10936733    All Organisms → cellular organisms → Bacteria    528
3300005569|Ga0066705_10167254    All Organisms → cellular organisms → Bacteria    1360
3300005574|Ga0066694_10124228    All Organisms → cellular organisms → Bacteria    1216
3300005575|Ga0066702_10165049    All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia    1320
3300005576|Ga0066708_10408230    All Organisms → cellular organisms → Bacteria    872
3300005576|Ga0066708_10523234    All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium    764
3300005586|Ga0066691_10256929    All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium    1027
3300005587|Ga0066654_10292334    All Organisms → cellular organisms → Bacteria    871
3300005587|Ga0066654_10348151    All Organisms → cellular organisms → Bacteria    805
3300005598|Ga0066706_10431572    All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium    1049
3300005598|Ga0066706_10568652    All Organisms → cellular organisms → Bacteria    901
3300005764|Ga0066903_104483580    All Organisms → cellular organisms → Bacteria    745
3300005764|Ga0066903_108058931    All Organisms → cellular organisms → Bacteria    540
3300006031|Ga0066651_10491994    All Organisms → cellular organisms → Bacteria    650
3300006046|Ga0066652_100177417    All Organisms → cellular organisms → Bacteria    1813
3300006791|Ga0066653_10194927    All Organisms → cellular organisms → Bacteria    1023
3300006797|Ga0066659_11129620    All Organisms → cellular organisms → Bacteria    655
3300007255|Ga0099791_10467375    All Organisms → cellular organisms → Bacteria    611
3300007788|Ga0099795_10317580    All Organisms → cellular organisms → Bacteria    689
3300009012|Ga0066710_100199481    All Organisms → cellular organisms → Bacteria    2846
3300009012|Ga0066710_101477770    All Organisms → cellular organisms → Bacteria    1050
3300009012|Ga0066710_103495838    All Organisms → cellular organisms → Bacteria    594
3300009012|Ga0066710_104084342    All Organisms → cellular organisms → Bacteria    546
3300009038|Ga0099829_10167761    All Organisms → cellular organisms → Bacteria    1762
3300009090|Ga0099827_10244711    All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia    1505
3300009137|Ga0066709_100102525    All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria    3511
3300010048|Ga0126373_11633393    All Organisms → cellular organisms → Bacteria    709
3300010159|Ga0099796_10169435    All Organisms → cellular organisms → Bacteria    870
3300010304|Ga0134088_10061409    All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia    1736
3300010320|Ga0134109_10275415    All Organisms → cellular organisms → Bacteria    641
3300010321|Ga0134067_10105132    All Organisms → cellular organisms → Bacteria    970
3300010321|Ga0134067_10248287    All Organisms → cellular organisms → Bacteria    670
3300010325|Ga0134064_10213882    All Organisms → cellular organisms → Bacteria    697
3300010326|Ga0134065_10359258    All Organisms → cellular organisms → Bacteria    574
3300010329|Ga0134111_10034394    All Organisms → cellular organisms → Bacteria    1781
3300010333|Ga0134080_10177009    All Organisms → cellular organisms → Bacteria    915
3300010335|Ga0134063_10722337    All Organisms → cellular organisms → Bacteria    517
3300010337|Ga0134062_10009559    All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia    3545
3300012189|Ga0137388_10948171    All Organisms → cellular organisms → Bacteria    795
3300012198|Ga0137364_10019179    All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia    4165
3300012198|Ga0137364_10929132    All Organisms → cellular organisms → Bacteria    659
3300012202|Ga0137363_10667692    All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium    879
3300012202|Ga0137363_10911657    All Organisms → cellular organisms → Bacteria    746
3300012203|Ga0137399_10753753    All Organisms → cellular organisms → Bacteria    820
3300012205|Ga0137362_10038252    All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia    3849
3300012205|Ga0137362_10214738    All Organisms → cellular organisms → Bacteria    1658
3300012207|Ga0137381_10306672    All Organisms → cellular organisms → Bacteria    1383
3300012208|Ga0137376_10422670    All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium    1157
3300012285|Ga0137370_10065481    All Organisms → cellular organisms → Bacteria    1980
3300012351|Ga0137386_10342877    All Organisms → cellular organisms → Bacteria    1075
3300012354|Ga0137366_10510412    All Organisms → cellular organisms → Bacteria    868
3300012356|Ga0137371_11184904    All Organisms → cellular organisms → Bacteria    571
3300012918|Ga0137396_11230405    All Organisms → cellular organisms → Bacteria    525
3300012923|Ga0137359_10666201    All Organisms → cellular organisms → Bacteria    909
3300012924|Ga0137413_10318875    All Organisms → cellular organisms → Bacteria    1092
3300012927|Ga0137416_10375797    All Organisms → cellular organisms → Bacteria    1198
3300012929|Ga0137404_10488058    All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia    1097
3300012930|Ga0137407_11168793    All Organisms → cellular organisms → Bacteria    730
3300012948|Ga0126375_11316411    All Organisms → cellular organisms → Bacteria    608
3300012975|Ga0134110_10086109    All Organisms → cellular organisms → Bacteria    1261
3300012976|Ga0134076_10035028    All Organisms → cellular organisms → Bacteria    1845
3300014150|Ga0134081_10136573    All Organisms → cellular organisms → Bacteria    796
3300014157|Ga0134078_10181998    All Organisms → cellular organisms → Bacteria    846
3300014157|Ga0134078_10630389    All Organisms → cellular organisms → Bacteria    517
3300015359|Ga0134085_10216847    All Organisms → cellular organisms → Bacteria    827
3300015374|Ga0132255_104673785    All Organisms → cellular organisms → Bacteria    580
3300016404|Ga0182037_11446511    All Organisms → cellular organisms → Bacteria    609
3300017656|Ga0134112_10253363    All Organisms → cellular organisms → Bacteria    699
3300017659|Ga0134083_10089430    All Organisms → cellular organisms → Bacteria    1204
3300017659|Ga0134083_10366489    All Organisms → cellular organisms → Bacteria    623
3300018431|Ga0066655_10001269    All Organisms → cellular organisms → Bacteria    8999
3300018433|Ga0066667_10027488    All Organisms → cellular organisms → Bacteria    3144
3300018433|Ga0066667_12095999    All Organisms → cellular organisms → Bacteria    523
3300018468|Ga0066662_11006364    All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium    826
3300018482|Ga0066669_10193708    All Organisms → cellular organisms → Bacteria    1548
3300018482|Ga0066669_10303393    All Organisms → cellular organisms → Bacteria    1298
3300018482|Ga0066669_11812872    All Organisms → cellular organisms → Bacteria    564
3300020018|Ga0193721_1020594    All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium    1740
3300026306|Ga0209468_1005071    All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia    5160
3300026314|Ga0209268_1163241    All Organisms → cellular organisms → Bacteria    547
3300026322|Ga0209687_1001621    All Organisms → cellular organisms → Bacteria    7562
3300026324|Ga0209470_1320967    All Organisms → cellular organisms → Bacteria    564
3300026327|Ga0209266_1006647    All Organisms → cellular organisms → Bacteria    6893
3300026330|Ga0209473_1046147    All Organisms → cellular organisms → Bacteria    1915
3300026332|Ga0209803_1083762    All Organisms → cellular organisms → Bacteria    1330
3300026334|Ga0209377_1049965    All Organisms → cellular organisms → Bacteria    1875
3300026342|Ga0209057_1134758    All Organisms → cellular organisms → Bacteria    870
3300026528|Ga0209378_1208962    All Organisms → cellular organisms → Bacteria    636
3300026532|Ga0209160_1035201    All Organisms → cellular organisms → Bacteria    3094
3300026538|Ga0209056_10091005    All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia    2527
3300026547|Ga0209156_10298990    All Organisms → cellular organisms → Bacteria    727
3300026548|Ga0209161_10002752    All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia    14379
3300027748|Ga0209689_1255304    All Organisms → cellular organisms → Bacteria    713
3300027748|Ga0209689_1255305    All Organisms → cellular organisms → Bacteria    713
3300027903|Ga0209488_10204418    All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium    1487
3300032180|Ga0307471_100261526    All Organisms → cellular organisms → Bacteria    1791
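
Each scaffold identifier above has two parts joined by a pipe character. The sketch below splits one into its components, assuming the numeric prefix is the sample's Taxon OID (it matches the Taxon OID values listed under Associated Samples, for example 3300000363) and the remainder is the scaffold name. `ScaffoldRef` and `parse_scaffold_id` are hypothetical helper names, not part of any NMPFamsDB or IMG/M API.

```python
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    taxon_oid: str      # assumed: sample Taxon OID (numeric prefix)
    scaffold_name: str  # assumed: scaffold name within that sample


def parse_scaffold_id(scaffold_id: str) -> ScaffoldRef:
    """Split 'TAXON_OID|SCAFFOLD_NAME' into its two components."""
    taxon_oid, sep, name = scaffold_id.partition("|")
    if not sep or not taxon_oid.isdigit() or not name:
        raise ValueError(f"unexpected scaffold identifier: {scaffold_id!r}")
    return ScaffoldRef(taxon_oid, name)


ref = parse_scaffold_id("3300000363|ICChiseqgaiiFebDRAFT_11023676")
print(ref.taxon_oid, ref.scaffold_name)
```

Grouping the parsed references by `taxon_oid` would give the number of family scaffolds contributed by each sample, under the same assumption about the prefix.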




Environmental Properties

Associated Habitat Types

Habitat    Taxonomy    Distribution
Soil    Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil    40.52%
Vadose Zone Soil    Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil    22.41%
Grasslands Soil    Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil    16.38%
Grasslands Soil    Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil    11.21%
Tropical Forest Soil    Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil    1.72%
Soil    Environmental → Terrestrial → Soil → Loam → Grasslands → Soil    1.72%
Tropical Forest Soil    Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil    1.72%
Soil    Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil    0.86%
Soil    Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil    0.86%
Soil    Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil    0.86%
Hardwood Forest Soil    Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil    0.86%
Arabidopsis Rhizosphere    Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere    0.86%



Associated Samples

Taxon OID    Sample Name    Habitat Type
3300000363    Soil microbial communities from Great Prairies - Iowa, Continuous Corn soil    Environmental
3300000787    Soil microbial communities from Great Prairies - Iowa, Continuous Corn soil    Environmental
3300002911    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cm    Environmental
3300004479    Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAs    Environmental
3300005166    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123    Environmental
3300005167    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121    Environmental
3300005171    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126    Environmental
3300005172    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132    Environmental
3300005175    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122    Environmental
3300005180    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134    Environmental
3300005186    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125    Environmental
3300005187    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124    Environmental
3300005446    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135    Environmental
3300005450    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131    Environmental
3300005540    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146    Environmental
3300005555    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141    Environmental
3300005556    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156    Environmental
3300005558    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147    Environmental
3300005560    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_119    Environmental
3300005569    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154    Environmental
3300005574    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_143    Environmental
3300005575    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151    Environmental
3300005576    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157    Environmental
3300005586    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140    Environmental
3300005587    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_103    Environmental
3300005598    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155    Environmental
3300005764    Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2)    Environmental
3300006031    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Angelo_100    Environmental
3300006046    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101    Environmental
3300006791    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102    Environmental
3300006797    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108    Environmental
3300007255    Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1    Environmental
3300007788    Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_2    Environmental
3300009012    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159    Environmental
3300009038    Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG    Environmental
3300009090    Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG    Environmental
3300009137    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158    Environmental
3300010048    Tropical forest soil microbial communities from Panama - MetaG Plot_11    Environmental
3300010159    Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_3    Environmental
3300010304    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015    Environmental
3300010320    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_11112015    Environmental
3300010321    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09212015    Environmental
3300010325    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_0_1 metaG    Environmental
3300010326    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_24_1 metaG    Environmental
3300010329    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_11112015    Environmental
3300010333    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_2_24_1 metaG    Environmental
3300010335    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09082015    Environmental
3300010337    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09082015    Environmental
3300012189    Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG    Environmental
3300012198    Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaG    Environmental
3300012202    Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG    Environmental
3300012203    Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG    Environmental
3300012205    Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG    Environmental
3300012207    Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaG    Environmental
3300012208    Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaG    Environmental
3300012285    Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaG    Environmental
3300012351    Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaG    Environmental
3300012354    Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_60_16 metaG    Environmental
3300012356    Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaG    Environmental
3300012918    Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaG    Environmental
3300012923    Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG    Environmental
3300012924    Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Illumina Assembly)    Environmental
3300012927    Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)    Environmental
3300012929    Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)    Environmental
3300012930    Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)    Environmental
3300012948    Tropical forest soil microbial communities from Panama - MetaG Plot_14    Environmental
3300012975    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_11112015    Environmental
3300012976    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_0_1 metaG    Environmental
3300014150    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_5_24_1 metaG    Environmental
3300014157    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_2_24_1 metaG    Environmental
3300015359    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09182015    Environmental
3300015374    Col-0 rhizosphere combined assembly    Host-Associated
3300016404    Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.082    Environmental
3300017656    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_11112015    Environmental
3300017659    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaG    Environmental
3300018431    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104    Environmental
3300018433    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116    Environmental
3300018468    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111    Environmental
3300018482    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118    Environmental
3300020018    Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U2s2    Environmental
3300026306    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102 (SPAdes)    Environmental
3300026314    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_143 (SPAdes)    Environmental
3300026322    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_136 (SPAdes)    Environmental
3300026324    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 (SPAdes)    Environmental
3300026327    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 (SPAdes)    Environmental
3300026330    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133 (SPAdes)    Environmental
3300026332    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 (SPAdes)    Environmental
3300026334    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 (SPAdes)    Environmental
3300026342    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_146 (SPAdes)    Environmental
3300026528    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 (SPAdes)    Environmental
3300026532    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 (SPAdes)    Environmental
3300026538    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes)    Environmental
3300026547    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124 (SPAdes)    Environmental
3300026548    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 (SPAdes)    Environmental
3300027748    Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 (SPAdes)    Environmental
3300027903    Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes)    Environmental
3300032180    Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515    Environmental

Geographical Distribution
Zoom:     Powered by OpenStreetMap




Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
ICChiseqgaiiFebDRAFT_110236762 | 3300000363 | Soil | MYSPALPLEGRGQILNHARRCASLILAVSVVGSVYLFWRCDARAQSEEDLPTPQEPQLAEPGDWAADFNQLQIACYEGHMRACDSIWKSNRVLTDIFLYEYGRSCGGRVDRREISRAGLDCAEAFPGHE*
JGI11643J11755_111042561 | 3300000787 | Soil | MYSPALPLEGRGQILNHARRCASLILAVSVVGSVYLFWRCDARAQSEEDLPTPQEPQLAEPGDWAADFNQLQIACYEGHMRACDSIWKSNRVLTDIFLYEYGRSCGGRVD
JGI25390J43892_101610801 | 3300002911 | Grasslands Soil | RSICVLAAFPLERSGQILNHARRRASLILAVSVVASVYLSWKCDTRAQSADDLPTPLAPQLAEPGDWAADFNQAQTACYEGHMRACDSIWKSDRVLFDTWLFNYGRTCGGRVGLREIRRADLSCSDAFPGHE*
Ga0062595_1022858421 | 3300004479 | Soil | PNQNGSVKTLKNMYVLFAFPLRRRRQILNHLRPCGSLILAVSVVAGVCLFWKCDTRAQSEEDQDLPAPEEPQLAEEGDWAADFNQLQIACYQGHMSACDSLWKSNRVLTDTFLYNYGRSCGGRVNRREISRSGLDCNEAFPGHE*
Ga0066674_100211515 | 3300005166 | Soil | MKPLKNICILAALLMERRGQILNHARRRASLMLAVSVVASVYLFWKCDAGAQSDLPAPQEPQLAEAGDWAADFNQAQMACYEGHMRACDSIWKSDRVLFDTFLFNYGRTCGGRVDLRQIRRAGLSCTEAFPGHE*
Ga0066672_101179972 | 3300005167 | Soil | MKPLRSICILVAFLLERSGQSLNRARRRASLILAVSVVASVYLFWKCDTRAQSADDLPTALEPQLAEPGDWAADFNQAQMACYEGHMRACDSIWKSDRVLFDTWLFNYGRTCGGRVGLREIRRADLSCSDAFPGHE*
Ga0066677_102008901 | 3300005171 | Soil | MKPLRSICILVAFLLERSGQSLNRARRRASLILAVSVVASVYLFWKCDTRAQSADDLPTALEPQLAEPGDWAADFNQAQMACYEGHMRACDSIWKSDRVLFDTWLFNYGRTCGGRVG
Ga0066683_104535581 | 3300005172 | Soil | MKPLRNICILPAFPFGRRGQILNHARRRASLILAVSVVASVYLFWTCDTRAQSDLPDPQEQPRLAEPGNWDADFNQLQIACYEGHMGACDSLWKSNRVLTDTFLYDYGRSCGGRVDRRQISRSGLDCTEAFPGHE*
Ga0066673_106387061 | 3300005175 | Soil | ERRGQILNHARRRASLILAVSVVASVYLFWTCDTRAQSDLPDPQEQPRLAEPGNWDADFNQLQIACYEGHMRACDSLWKSNRVLTDTFLYDFGRSCGARVDRRQISRSGLDCTEAFPGHE
Ga0066685_103920222 | 3300005180 | Soil | MKPLRNICILPAFPFGRRGQILNHARRRASLILAVSVVASVYLFWKSDTRAQSDLPAPQEPQLAEPGDWAADFNQLQIACYEGHMGACDSLWKSNRVLTDTFLYDYGRSCGGRVDRRQISRSGLDCTEAFPGHE*
Ga0066676_110158482 | 3300005186 | Soil | QILNHARRRASLILAVSVVASVYLFWKSDTRAQSDLPAPQEPQLAEPGDWAADFNQLQIACYEGHMGACDSLWKSNRVLTDTFLYDYGRSCGGRVDRRQISRSGLDCTEAFPGHE*
Ga0066675_100712162 | 3300005187 | Soil | MICLACFLAPHQNRGMKPLRNICILPALPLERWVQILNHARRRASLILAVSVVASVYLFWTCDTRAQSDLPDPQEQPRLAEPGNWDADFNQLQIACYEGHMRACDSLWKSNRVLTDTFLYDFGRSCGARVDRRQISRSGLDCTEAFPGHE*
Ga0066675_107792401 | 3300005187 | Soil | ARRRASLILAVSVVASICLFWKCDTRAQSADDLPTALEPQLAEPGDWAADFNQAQMACYEGHMRACDSIWRSDRVLFDTWLFNYGRTCGGRVGLREIRRADLSCTEAFPGHE*
Ga0066686_102950992 | 3300005446 | Soil | MKPLRNICILPAFPFGRRGQILNHARRRASLILAVSVVASVYLFWKSDTRAQSDLPAPQEPQLAEPGDWAADFNQLQIACYEGHMRACDSLWKSNRVLTDTFLYDFGRSCGARVDRRQISRSGLDCTEAFPGHE*
Ga0066682_103572323 | 3300005450 | Soil | MLAVSVVASVYLFWKCDAGAQSDLPAPQEPQLAEAGDWAADFNQAQMACYEGHMRACDSIWKSDRVLFDTFLFNYGRTCGGRVDLRQIRRAGLSCTEAFPGHE*
Ga0066697_107636441 | 3300005540 | Soil | KPLKNICILAALLMERRGQILNHARRRASLMLAVSVVASVYLFWKCDAGAQSDLPAPQEPQLAEAGDWAADFNQAQMACYEGHMRACDSIWKSDRVLFDTFLFNYGRTCGGRVDLRQIRRAGLSCTEAFPGHE*
Ga0066692_103202741 | 3300005555 | Soil | MKPLRSICILVAFLLERSGQSLNRARRRASLILAVSVVASVYLFWKCDTRAQSADDLPTALEPQLAEPGDWAADFNQAQMACYEGHMRACDSIWKSDRVLFDTWLFNYGRTCGGRVGLREIRRADLSCSDAFPGHDRPTLINDALLQLQELWVAFLDVAF*
Ga0066707_101798103 | 3300005556 | Soil | LNHARRRASLILAVSVVASVYLFWKSDTRAQSDLPAPQEPQLAEPGDWAADFNQLQIACYEGHMGACDSLWKSNRVLTDTFLYDYGRSCGGRVDRRQISRSGLDCTEAFPGHE*
Ga0066707_105733481 | 3300005556 | Soil | MKPLRNICILTAFPLERWGQILNHARRRTSIILAVSVVASVYLFWKCDTRAQSAGNLPTAQEPQLAEPGDWAADFNQAQMACYEGSMKACDSIWKSDRVLFDTPLFNYGRTCGGRVDLREIRRADLTCTEAFPGHE*
Ga0066698_107929061 | 3300005558 | Soil | HARRRASLMLAVSVVASVYLFWKCDAGAQSDLPAPQEPQLAEAGDWAADFNQAQMACYEGHMRACDSIWKSDRVLFDTFLFNYGRTCGGRVDLRQIRRAGLSCTEAFPGHE*
Ga0066670_109367331 | 3300005560 | Soil | MKPLRNICILTAFPLERWGQILNHARRHASIILAVSVVASVYLFWKCDTRAQSDSNLPTAQEPQLAEPGDWAADFNQAQMACYEGHMRACDSIWKSDRVLFDTWLFNYGRTCGGRVGLREIRRADLSCSDAFPGHE*
Ga0066705_101672542 | 3300005569 | Soil | RRRASLILAVSVVASVYLSWKCDTRAQSADDLPTPLAPQLAEPGDWAADFNQAQTACYEGHMRACDSIWKSDRVLFDTWLFNYGRTCGGRVGLREIRRADLSCSDAFPGHE*
Ga0066694_101242281 | 3300005574 | Soil | RRGQILNHARRRASLILAVSVVASVYLFWTCDTRAQSDLPDPQEQPRLAEPGNWDADFNQLQIACYEGHMRACDSLWKSNRVLTDTFLYDFGRSCGARVDRRQISRSGLDCTEAFPGHE*
Ga0066702_101650492 | 3300005575 | Soil | MKPLRSICVLAAFPLERSGQILNHARRRASLILAVSVVASVYLSWKCDTRAQSADDLPTPLAPQLAEPGDWAADFNQAQTACYEGHMRACDSIWKSDRVLFDTWLFNYGRTCGGRVGLREIRRADLSCSDAFPGHE*
Ga0066708_104082302 | 3300005576 | Soil | MKPLRNICILTAFPLERWGQILNHARRHASIILAVSVVASVYLFWKCDTRAQSASNLPTAQEPQLAEPGDWAADFNQAQMACYEGSMKACDSIWKSDRVLFDTPLFNYGRTCGGRVDLREIRRADLTCTEAFPGHE*
Ga0066708_105232342 | 3300005576 | Soil | MKPLRSICVLAAFPLERSGQILNHARRRASLILAVSVVASVYLSWKCDTRAQSADDLPTPLAPQLAEPGDWAADFNQAQTACYEGHMRACDSIWKSDRVLFDTWLFNYGRTCGGRVG
Ga0066691_102569291 | 3300005586 | Soil | MKPLRNICILTAFPLERWGQILNHARRRASIILAVSVVASVYLFWKCDTRAQSASNLPTAQEPQLAEPGDWAADFNQAQMACYEGSMKACDSIWKSDRVLFDTWLFNYGRTCGGRADLREIRRADLTCTEAFPGHE*
Ga0066654_102923342 | 3300005587 | Soil | MICLACFLAPYQNRGMKPLRNICILPALRLERRGQILNHVRRRASLLLAVSVVASVYQFWTCDTRAQSDLPDPQEQPRLAEPGNWDADFNQLQIACYEGHMRACDSLWKSNRVLTDTFLYDFGRSCGARVDRRQISRSG
Ga0066654_103481511 | 3300005587 | Soil | FVLFVFLRLIELKHEAAKKYMRSRRFPAGKIGQILNHARRRASLILAVSVVASVYLSWKCDTRAQSADDLPTALEPQMAEPGDWAADFNQAQMACYEGHMRACDSIWKSDRVLFDTWLFNYGRTCGGRVGLREIRRADLSCSDAFPGHE*
Ga0066706_104315722 | 3300005598 | Soil | MKPLRSICILAAFPLERSGQILNHARRRASLILAVSVVASVYLFWKCDTRAQSADDLPTALEPQLAEPGDWAADFNQAQMACYEGHMRACDSIWKSDRVLFDTWLFNYGRTCGGRVGLREIRRADLSCSDAFPGHE*
Ga0066706_105686522 | 3300005598 | Soil | MKPLRNICILTAFPLERWGQILNHARRRASIILAVSVVASVYLFWKCDTRAQSASNLPTAQEPQLAEPGDWAADFNQAQMACYEGSMKACDSIWKSDRVLFDTPLFNYGRTCGGRVDLREIRRADLTCTEAFPGHE*
Ga0066903_1044835801 | 3300005764 | Tropical Forest Soil | LSPSRLQWVQLRGENKDDAKNECVPRRYEGICLACFPCHDQNRSMKPLRSICVLPAFPLQRWGQILNHAHRLAFVILAVSVIASVCLFWRCDARAESEDDLPTPQEPQLAEEGDWAADFNELQVACYEGHMSACDSICKSKRMIFDTFLYDYGRTCGGRVDLREIRRSDLDCTEAFPGHD
Ga0066903_1080589312 | 3300005764 | Tropical Forest Soil | KPLRNLCVLPALPLERWVQILNRARRCASLILAVSVVTGVCLFWRSDVRAQSGDDLPTPQEPQLAEEGDWAADFNELQVACYEGHMSACDSIWKSKRVLFDTFLFDYGRTCGGRVDRREIRRLDLDCTEAFPGHE*
Ga0066651_104919941 | 3300006031 | Soil | MICLACFLAPHQNRGMKPLRNICILPALPLERRGQILNHARRRASLILAVSVVASVYLFWTCDTRAQSDLPDPQEQPRLAEPGNWDADFNPLQIACYEGHMRACDSLWKSNRVLTDTFLYDFGRSCGARVDRRQISRSGLDCTEAFPGHE*
Ga0066652_1001774173 | 3300006046 | Soil | GQILNHARRRASLILAVSVVASVYLFWTCDTRAQSDLPDPQEQPRLAESGDWAADFNQLQIACYEGHMRACDSLWKSNRVLTDTFLYDFGRSCGARVDRRQISRSGLDCTEAFPGHE*
Ga0066653_101949272 | 3300006791 | Soil | MKPLKNICILAALLMERRGQILNHARRRASLMLAVSVVASVYLFWKCDAEAQSDLPAPQEPQLAEAGDWAADFNQSQLACYEGHMRACDSIWKSDRVLFDTFLFNYGRTCGGRVDLRQIRRAGLSCTEAFPGHE*
Ga0066659_111296202 | 3300006797 | Soil | MKPLRSIGILTAFPLERWRQILNHARRCAPLILAVSVVASVYLFWKSDTRAQSADDLPAPQEPQLAEAGDWAADFNQAQIACYEGHMRACDSIWRSDRVLFDTWLFNYGRTCGGRVGLREIRRADLTCTEAFPGHE*
Ga0099791_104673751 | 3300007255 | Vadose Zone Soil | MKPLRNTCILPAFPLGRRGQILNHARRRASLILAVSVVASVYLFWKGDTRAQSDLPAPQEPQLAEEGDWAADFNQLQIACYEGHMGACDSIWKSNRVLFDTFLYDYGRSCGGRVDRRQISRSGLDCAEAFPGH*
Ga0099795_103175801 | 3300007788 | Vadose Zone Soil | MKPLRNICILTAFPLERWGHILNHARRRASIILAVSVVGSVYLFWKCDTRAQSASNLPTAQEPQLAEPGDWATDFNQAQMACYEGSMKACDSIWKSDRVLFDTWLFNYGRTCGGRVDLREIRRADLTCTEAFPGHE*
Ga0066710_1001994811 | 3300009012 | Grasslands Soil | MKPSRNICVLAAFALERWGQILNHARRRASLILAVSVVASVYLFWKCDTRAQSAGNLPTAQEPQLAGPGDWAADFNPAQIACYEGSMRACDSIWKSDRVLFDTWLFNYGRTCGGRVDLREIRRADLTCTEAFPGHD
Ga0066710_1014777702 | 3300009012 | Grasslands Soil | MKPLRNICILPAFRLQRWGQILNHARRRASLILAVSVVASVYLFWTCDTRAQSDLPEPQEQPRLAEPGDWAADFNQLQIACYEGHMRACDSLWKSNRVLTDTFLYDFGRSCGARVDRRQISRSGLDCTEAFPGHE
Ga0066710_1034958382 | 3300009012 | Grasslands Soil | LNHARRRASLILAVSVVASVYLFWKSDTRAQSDLPAPQEPQLAEPGDWAADFNQLQIACYEGHMGACDSLWKSNRVLTDTFLYDYGRSCGGRVDRRQISRSGLDCTEAFPGHE
Ga0066710_1040843421 | 3300009012 | Grasslands Soil | ALLMERRGQILNHARRRASLILVVSVVASIYLFWTCDTRAQSDLPAPQEPQLAEEGDWAADFNQAQIACYEGHMRACDSIWKSDRVLFDTFLFNYGRTCGGRVDLRQIRRADLSCTEAFPGHE
Ga0099829_101677613 | 3300009038 | Vadose Zone Soil | MKPLRNICILAAFPLERWGQILNHARRRASVILTVSVVASVYVFWKCDTRAQSAGNLPTAQEPQVAEPGDWAADFNQAQMACYEGSMKACDSIWKSDRVLFDTWLFNYGRTCGGRVDLREIRRADLTCTEAFPGHE*
Ga0099827_102447112 | 3300009090 | Vadose Zone Soil | MKSLRNICILTAFPLERWGQILNHARRRASIILAVSVVASVYLFWKCDTRAQSASNLPTAQEPKLAEPGDWAADFNQAQMACYEGSMKACDSIWKSDRVLFDTWLFNYGRTCGGRVDLREIRRADLTCTEAFPGHE*
Ga0066709_1001025252 | 3300009137 | Grasslands Soil | MKPLKNICILPALLMERRGQILNHARRRASLILVVSVVASIYLFWTCDTRAQSDLPAPQEPQLAEEGDWAADFNQAQIACYEGHMRACDSIWKSDRVLFDTFLFNYGRTCGGRVDLRQIRRADLSCTEAFPGHE*
Ga0126373_116333931 | 3300010048 | Tropical Forest Soil | IKDLSCLLSCSSSESRHEAVKKYCVLPAFPLGRRGRVLNHARRRASLILAVSVLASVSLFWKCDTRAQSDLPEPQKPWLAEPGDWAADFNELQVACYEGHMSACDSMWKSDRVLFDTFLYEYGRSCGGRVDRRQISRSGLSSSEAFPGHD*
Ga0099796_101694351 | 3300010159 | Vadose Zone Soil | MKPLRNICILTAFPLERWGQILNHARRRASIILAVSVVASVYLFWKCDTRAQSASNLPTAQEPQLAEPGDWATDFNQAQMACYEGSMKACDSIWKSDRVLFDTWLFNYGRTCGGRVDLREIRRADLTCTEAFPGHE*
Ga0134088_100614092 | 3300010304 | Grasslands Soil | MKPLRSICILVAFLLERSGQSLNRARRRASLILAVSVVASVYLFWKCDTRAQSADDLPTALEPQLAEPGDWAADFNQAQMACYEGHMRACDSIWKSDRVLFDTFLFNYGRTCGGRVDLRQIRRAGLSCTEAFPGHE*
Ga0134109_102754151 | 3300010320 | Grasslands Soil | MICLACFLAPHQNRGMKPLRNICILPALPLERWVQILNHARRRASLILAVSVVASVYLFWTCDTRAQSDLPDPQEQPRLAESGDWAADFNQLQIACYEGHMRACDSLWKSNRVLTDTFLYDFGRSCGARVDRRQISRSGLDCTEAFPGHE*
Ga0134067_101051322 | 3300010321 | Grasslands Soil | MKPLRNICILTAFPLERWGQILNHARRHASIILAVSVVASVYLFWKCDTRAQSDSNLPTAQEPQLAEPGDWAADFNQAQIAWYEGSMKACDSIWKSDRVLFDTWLFNYGRTCGGRVDLREIRRADLTCAEAFPRHE*
Ga0134067_102482871 | 3300010321 | Grasslands Soil | LAVSVVASVYLFWKCDTRAQSADDLPTALEPQLAEPGDWAADFNQAQMACYEGHMRACDSIWKSDRVLFDTWLFNYGRTCGGRVGLREIRRADLSCSDAFPGHE*
Ga0134064_102138821 | 3300010325 | Grasslands Soil | MICLACFLAPHQNRGMKPLRNICILPALPLERWVQILNHARRRASLLLAVSVVASVYLFWTCDTRAQSDLPDPQEQPRLAEPGNWDADFNQLQIACYEGHMRACDSLWKSNRVLTDTFLYDFGRSCGARVDRRQISRSGLDCTEAFPGHE*
Ga0134065_103592581 | 3300010326 | Grasslands Soil | MICLACFLAPHQNGGMKPLRNIFSLPALPLERWVQILNHARRCASLILAVSVVASVYLFWTCNTRAQSDLPDPQEQPRLAEPGNWDADFNQLQIACYEGHMRACDSLWKSNRVLTDTFLYDFGRSCGARVDRRQISRSGLDCTEAFPGHE*
Ga0134111_100343942 | 3300010329 | Grasslands Soil | MKPLKNICILAALLMERRGQILNHARRRASLMLAVSVVASVYLFWKCDAGAQSDLPAPQEPQLAEAGDWAADFNQAQMACYEGHMRACDSIWKSDRVLFDTFLFNYGRTCGVRVDLRQIRRAGLSCTEAFPGHE*
Ga0134080_101770092 | 3300010333 | Grasslands Soil | MICLACFLAPHQNRGMKPLRNICILPALPLERWVQILNHAPRRASLILAVSVVASVYLFWTCDTRAQSDLPDPQEQPRLAEPGNWDADFNQLQIACYEGHMRACDSLWKSNRVLTDTFLYDFGRSCGARVDRRQISRSGLDCTEAFPGHE*
Ga0134063_107223371 | 3300010335 | Grasslands Soil | MKLLRSICILAAFPLERSGQILNQARRRASLILAVSVVASICLFWKCDTRAQSADDLSTALEPQLAEPGDWAADFNQAQMACYEGHMRACDSIWKSDRVLFDTWLFNYGRTCGGRVGLREIRRADLSCSDAFPGHDRPTLINDALLQLQELWVAFLDV
Ga0134062_100095593 | 3300010337 | Grasslands Soil | MKPLKNICILPALLMERRGQILNHARRRASLMLAVSVVASVYLFWKCDAGAQSDLPAPQEPQLAEAGDWAADFNQAQMACYEGHMRACDSIWKSDRVLFDTFLFNYGRTCGGRVDLRQIRRAGLSCTEAFPGHE*
Ga0137388_109481712 | 3300012189 | Vadose Zone Soil | MKPLRNICILTAFPLERWGQILNHARRRASVILAVSVVASVYLIWKCDTRAQSAGDLPTAQEPQLAEPGDWAADFNQAQMACYEGSMKACDSIWKSDRVLFDTWLFNYGRTCGGRVDLREIRRAGLDCAEAFPGH*
Ga0137364_100191791 | 3300012198 | Vadose Zone Soil | MKPLRNICILPALPLERWVQILNHARRRASLILAVSVVASVYLFWTCDTRAQSDLPDPQEQPRLAEPGNWDADFNQLQIACYEGHMRACDSLWKSNRVLTDTFLYDFGRSCGARVDRRQISRSGLDCTEAFPGHE*
Ga0137364_109291321 | 3300012198 | Vadose Zone Soil | MKPLRNICILTAFPLERWGQILNHARRRASIILAVSVVASVYLFWKCDTRAQSDSNLPTAQEPQLAEPGDWAADFNQAQMACYEGSMKACDSIWKSDRVLFDTPLFNYGRTCGGRVDLREIRRADLTCTEAFPGHE*
Ga0137363_106676921 | 3300012202 | Vadose Zone Soil | MKPLRNICILPAFRLQRWRQILNHARRRASLILAVSVVASVCLFWKCDTRAQSDLPAPQEPQLAEAGDWAADFNQLQIACYEGHMRACDSLWKSNRVLTDTFLYDFGRSCGARVDRRQI
Ga0137363_109116571 | 3300012202 | Vadose Zone Soil | SMKSLRSMCILAAFLLERSGQSLNRARGRASLILAVSVVASVYLFWKCDTRAQSADDLPTALEPQLAEPGDWAADFNQAQMACYEGHMRACDSIWKSDRVLFDTWLFNYGRTCGGRVGLREIRGADLSCSDAFPGHE*
Ga0137399_107537531 | 3300012203 | Vadose Zone Soil | MKPLRNICVLPAFPLGRRGQILNHARRRAFLILAVSVVASVYLFWKGDTRAQSDLPAAQEPQLAEEGDWAADFNQLQIACYEGHMGACDSLWKSNRVLFDTWLFNYGRTCGGRVDLREIRRADLTCTEAFPGH*
Ga0137362_100382521 | 3300012205 | Vadose Zone Soil | MICLACFLAPHQNRSMKPLRNICILPAFRLQRWRQILNHARRRASLILAVSVVASVYLFWTCDTRAQSDLPEPQEQPRLAEPGDWAADFNQLQIACYEGHMRACDSLWKSNRVLTDTFLYDFGRSCGARVDRRQISRSGLDCNEAFPGHE*
Ga0137362_102147382 | 3300012205 | Vadose Zone Soil | MCILAAFLLERSGQSLNRARRRASLILAVSVVASVYLFWKCDTRAQSDLPAPQEPQLAEAGDWAADFNQAQIACYEGHMRACDSIWKSDRVLFDTWLFNYGRTCGGRVGLREIRGADLSCSDAFPGHE*
Ga0137381_103066722 | 3300012207 | Vadose Zone Soil | MKPLRNICILTAFPLERWGQILNHARRRASIILAVSVVASVYLFWKCDTRAQSASNLPTAQEPQLAEPGDWAADFNQAQMACYEGSMKACDSIWKSDRVLFDTWLFNYGRTCGGRVDLREIRRADLTCTEAFPGHE*
Ga0137376_104226702 | 3300012208 | Vadose Zone Soil | MKPLRNICILTAFPLERWGQILNHARRRASIILAVSVVASVYLFWKCDTRAQSASNLPTAQEPQLAEPGDWAADFNQAQMACYEGSMKACDSIWKSDRVLFDTPLFNYGRTCGGRVDLREIRRADLTCTEAFPGNE*
Ga0137370_100654812 | 3300012285 | Vadose Zone Soil | MKPLRNICILPAFPLERWVQILNHARRRASLILAVSVVASVYLFWTCDTRAQSDLPDPQEQPRLAEPGNWDADFNQLQIACYEGHMRACDSLWKSNRVLTDTFLYDFGRSCGARVDRRQISRSGLDCTEAFPGHE*
Ga0137386_103428772 | 3300012351 | Vadose Zone Soil | MKPLRNICILTAFPLERWGQILNHARHRASIILAVSVVASVYLFWKCDTRAQSASNLPTAQEPQLAEPGDWAADFNQAQMACYEGSMKACDSIWKSDRVLFDTRLFNYGRTCGGRVDLREIRRADLTCTEAFPGHE*
Ga0137366_105104122 | 3300012354 | Vadose Zone Soil | GFVLFVFLPSSELKHEAVKSICILAAFPPERSGQILNHARYRASLILAVSVVASVYLFWKYDTRAQSADDLPTALEPQLAEPGDWAADFNQDQIACYEGHMRACDSIWRSDRVLFDTWLFNYGRTCGGRVGLREIRRADLTCTDAFPGHE*
Ga0137371_111849041 | 3300012356 | Vadose Zone Soil | PLERWGQILNQARRRASIILAVSVVASVYLFWKCDTRAQSASNLPTAQEPQLAEPGDWAADFNQAQMACYEGSMKACDSIWKSDRVLFDTPLFNYGRTCGGRVDLREIRRADLTCTEAFPGNE*
Ga0137396_112304051 | 3300012918 | Vadose Zone Soil | TAFPLERWGQILNHARRRASIILAVSVVASVYLFWKCDTRAQSASNLPTAQEPQLTEPGDWAADFNQAQMACYEGSMKACDSIWKSDRVLFDTWLFNYGRTCGGRVDLREIRRADLTCTEAFPGHE*
Ga0137359_106662011 | 3300012923 | Vadose Zone Soil | MKPLRNICILTAFPLERWGQILNHARRRASIILAVSVVASVYLFWKCDTREQFASNLPTAQEQQLDEPGDWDADFNQAQMACYEGSMKACDSIWKSDRVLFDTPLFNYGRTCGGRVDLREIRRADLTCTEAFPGHE*
Ga0137413_103188752 | 3300012924 | Vadose Zone Soil | MKPLRNICILTAFPLERWGQILNHARRRASIILAVSVVASVYLFWKCDTRAQSASNLPTAQEPQLAEPGDWATDFNQAQMACYEGSMKACDSIWKSDRVLFDTPLFNYGRTCGGRVDLREIRRADLTCTEAFPGHE*
Ga0137416_103757971 | 3300012927 | Vadose Zone Soil | MKPLKNICILPAFPLGRRGQILNHARRRAFLILAVSVVASVYLFWKGDTRAQSDLPAAQEPQLAEEGDWAADFNQLQIACYEGHMGACDSLWKSNRVLFDTWLFNYGRTCGGRVDLREIRRADLTCTEAFPGH*
Ga0137404_104880582 | 3300012929 | Vadose Zone Soil | MICLACFLAPHQNRSMKPLRNICILPAFRLQRWGQILNHARRRASLILAVSVVASVYLFWTCDTRAQSDLPEPQEQPRLAEPGDWAADFNQLQIACYEGHMRACASLWKSNRVLTDTFLYDFGRSCGARVDRRQISRSGLDCTDASPGHE*
Ga0137407_111687931 | 3300012930 | Vadose Zone Soil | HARRRASLILAVSVVASVYLSWKCDTRAQSADDLPTALEPQLAEPGDWAADFNQAQTACYEGHMRACDSIWKSDRVLFDTWLFNYGRTCGGRVDLREIRRADLTCTEAFPGHE*
Ga0126375_113164111 | 3300012948 | Tropical Forest Soil | MKTLKHICILPAFPLEGQEQILNHARRGASLILAVSVVTSLCLFWNGDTRAQSDDDLPAPQEPQLAEEGDWAADFNHLQIACYEGHMRACDSLWKSNRVITDTFLYNYGRSCGGRVDRRQISRSGLDYAEAFPGHD*
Ga0134110_100861091 | 3300012975 | Grasslands Soil | NCGMKPLRNICILPAFPFGRRGQILNHARRRASLILAVSVVASVYLFWKSDTRAQSDLPAPQEPQLAEPGDWAADFNQAQTACYEGHMRACDSIWKSDRVLFDTWLFNYGRTCGGRVGLREIRRADLSCSDAFPGHDRPTLINDALLQLQELWVAFLDVAF*
Ga0134076_100350282 | 3300012976 | Grasslands Soil | MKPLRNICILPAFPFGRRGQILNHARRLASLILAVSVVASVYLFWKSDTRAQSDLPAPQEPQLAEPGDWAADFNQLQIACYEGHMGACDSLWKSNRVLTDTFLYDYGRSCGGRVDRRQISRSGLDCTEAFPGHE*
Ga0134081_101365732 | 3300014150 | Grasslands Soil | MKPLRSICILVAFLLERSGQSLNRARRRASLILAVSVVASVYLFWKCDTRAQSADDLPTALEPQLAEPGDWAADFNQAQMACYEGHMRACDSIWKSDRVLFDTWLFNYGRTCGGRVGLREIRRVDLSCTEAFPGHE*
Ga0134078_101819982 | 3300014157 | Grasslands Soil | MKPLRSICILVAFLLERSGQSLNRARRRASLILAVSVVASVYLFWKCDTRAQSADDLPTALEPQMAEPGDWAADFNQAQMACYEGHMRACDSIWKSDRVLFDTWLFNYGRTCGGRVGLREIRRADLSCSDAFPGHDRPTLINDALLQLQELWVAFLDVAF*
Ga0134078_106303891 | 3300014157 | Grasslands Soil | KPLRNICILPALPLERWVQILNHARRRASLILAVSVVASVYLFWTCDTRAQSDLPDPQEQPRLAEPGNWDADFNQLQIACYEGHMRACDSLWKSNRVLTDTFLYDFGRSCGARVDRRQISRSGLDCTEAFPGHE*
Ga0134085_102168471 | 3300015359 | Grasslands Soil | LPLERWVQILNHARRRASLLLAVSVVASVYLFWTCDTRAQSDLPDPQEQPRLAEPGNWDADFNPLQIACYEGHMRACDSLWKSNRVLTDTFLYDFGRSCGARVDRRQISRSGLDCTEAFPGHE*
Ga0132255_1046737852 | 3300015374 | Arabidopsis Rhizosphere | MKTLNNILFGSALERGKPISNHACCRASLILLVVASACLFWKCDTSAQSEQSDLPAPKDPWLAEEGDWAADFNEAQIACYNGSMQACDSIWRNNRVLFDTWLYEYGRSCGGRVDR
Ga0182037_114465111 | 3300016404 | Soil | PLERWKPTSNHACRRASLILHVVASACLIWKCDTSAQSDLPAPKDPWLAEEGDWAADFNNAQIACYEGHMSACDSIWKSDRVLFDTFLYEYGRSCGGRVDRRQISRAGLDCTEAFPGYE
Ga0134112_102533632 | 3300017656 | Grasslands Soil | MKLLRSICILAAFPLERSGQILNQARRRASLILAVSVVASVYLFWKCDAGAQSDLPAPQEPQLAEAGDWAADFNQAQMACYEGHMRACDSIWKSDRVLFDTWLFNYGRTCGGRVGLREIRRAGLSCTEAFPGHE
Ga0134083_100894302 | 3300017659 | Grasslands Soil | HARRRASLMLAVSVVARVYLCWQCDAGAQSDLPAPHEPQLAEAGDWAADFNQAQMACYEGHMRACDSIWKSDRVLFDTFLFNYGRTCGGRVDLRQIRRAGLSCTEAFPGHE
Ga0134083_103664891 | 3300017659 | Grasslands Soil | MKPLRNICILPAFPFGRRGQILNHARRRASLILAVSVVASVYLFWKSDTRAQSDLPAPQEPQLAEPGDWAADFNQAQMACYEGHMRACDSLWKSNRVLTDTFLYDFGRSCGARVDRRQISRSGLDCTEAFPGHE
Ga0066655_100012698 | 3300018431 | Grasslands Soil | MERRGQILNHARRRASLMLAVSVVASVYLFWKCDAGAQSDLPAPQEPQLAEAGDWAADFNQAQMACYEGHMRACDSIWKSDRVLFDTFLFNYGRTCGGRVDLRQIRRAGLSCTEAFPGHE
Ga0066667_100274882 | 3300018433 | Grasslands Soil | MERRGQILKHARRRASLMLAVSVVASVYLFWKCDAGAQSDLPAPQEPQLAEAGDWAADFNQAQMACYEGHMRACDSIWKSDRVLFDTFLFNYGRTCGGRVDLRQIRRAGLSCTEAFPGHE
Ga0066667_120959991 | 3300018433 | Grasslands Soil | RNICILPAFRLQGRGQILNHARRRASLILAVSVVASVYLFWKCDTRAQSEGDLPSPQEPQLAEPGDWAADFNQAQIACYEGHMRACDSIWKSDRVLFDTFLYEYGRSCGGRVDRRQISRSGLDCTEAFPGHE
Ga0066662_110063642 | 3300018468 | Grasslands Soil | MKPLRSICILVAFLLERSGQSLNRARRRASLILAVSVVASVYLFWKCDTRAQSADDLPTALEPQLAEPGDWAADFNQAQMACYEGHMRACDSIWKSDRVLFDTWLFNYGRTCGGRVGLREIRRADLSCSDAFPGHDRPTLIN
Ga0066669_101937083 | 3300018482 | Grasslands Soil | LIICLSCFLAPHQKRGMKPLRNICILPALPLERWVQILNHARRRASLILAVSVVASVYLFWTCDTRAQSDLPDPQEQPRLAESGDWAADFNQLQIACYEGHMRACDSLWKSNRVLTDTFLYDFGRSCGARVDRRQISRSGLDCTEAFPGHE
Ga0066669_103033932 | 3300018482 | Grasslands Soil | MKPLKNICILAALLMERRGQILNHARRRASLMLAVSVVASVYLFWKCDAGAQSDLPAPQEPQLAEAGDWAADFNQAQMACYEGHMRACDSIWKSDRVLFDTFLFNYGRTCGGRVDLRQIRRAGLSCTEAFPGHE
Ga0066669_118128721 | 3300018482 | Grasslands Soil | MKPLRNICILTAFPLERWGQILNHARRHASIILAVSIVASVYLFLKYDTRAQSVSNLPTAQEPQLAEPGDWAADFNQAQMACYEGSMKACDSIWKSDRVLFDTWLFNYGRTCGGRVDL
Ga0193721_10205942 | 3300020018 | Soil | MICLACFLAPHQNRGMKPLRNICILPAFPLERRGQILNHVRRRASLLLAVSVVASVYLFWTCDTRAQSDLPDPQEQPRLAEPGNWDADFNQLQIACYEGHMRACDSLWKSNRVLTDTFLYDFGRSCGARVDRRQISRSGLDCTEAFPGHE
Ga0209468_100507112 | 3300026306 | Soil | MKPLKNICILAALLMERRGQILNHARRRASLMLAVSVVASVYLFWKCDAGAQSDLPAPQEPQLAEAGDWAADFNQAQMACYEGHMRACDSIWRSDRVLFDTFLFNYGRTCGGRVDLRQIRRAGLSCTEAFPGHE
Ga0209268_11632411 | 3300026314 | Soil | HARRRASLILAVSVVASVYLFWTCDTRAQSDLPDPQEQPRLAEPGNWDADFNQLQIACYEGHMRACDSLWKSNRVLTDTFLYDFGRSCGARVDRRQISRSGLDCTEAFPGHE
Ga0209687_10016212 | 3300026322 | Soil | MKPLRSICILVAFLLERSGQSLNRARRRASLILAVSVVASVYLFWKCDTRAQSADDLPTALEPQLAEPGDWAADFNHAQMACYEGHMRACDSIWKSDRVLFDTWLFNYGRTCGGRVGLREIRRADLSCSDAFPGHDRPTLINDALLQLQELWVAFLDVAF
Ga0209470_13209672 | 3300026324 | Soil | GRRGQILNHARRRASLILAVSVVASVYLFWKSDTRAQSDLPAPQEPQLAEPGDWAADFNQLQIACYEGHMGACDSLWKSNRVLTDTFLYDYGRSCGGRVDRRQISRSGLDCTEAFPGHE
Ga0209266_10066477 | 3300026327 | Soil | MKPLKNICILAALLMERRGQILKHARRRASLMLAVSVVASVYLFWKCDAGAQSDLPAPQEPQLAEAGDWAADFNQAQMACYEGHMRACDSIWKSDRVLFDTFLFNYGRTCGGRVDLRQIRRAGLSCTEAFPGHE
Ga0209473_10461472 | 3300026330 | Soil | MKPLRSICILVAFLLERSGQSLNRARRRASLILAVSVVASVYLFWKCDTRAQSADDLPTALEPQLAEPGDWAADFNQAQMACYEGHMRACDSIWKSDRVLFDTWLFNYGRTCGGRVGLREIRRADLSCSDAFPGHDRPTLINDALLQLQELWVAFLDVAF
Ga0209803_10837622 | 3300026332 | Soil | MKPLRSICILVAFLLERSGQSLNRARRRASLILAVSAVASVYLFWKCDTRAQSADDLPTALEPQMAEPGDWAADFNQAQMACYEGHMRACDSIWKSDRVLFDTWLFNYGRTCGGRVGLREIRRADLSCSDAFPGHDRPTLINDALLQLQELWVAFLDVAF
Ga0209377_10499652 | 3300026334 | Soil | KPLRSICILVAFLLERSGQSLNRARRRASLILAVSVVASVYLFWKCDTRAQSADDLPTALEPQLAEPGDWAADFNQAQMACYEGHMRACDSIWKSDRVLFDTWLFNYGRTCGGRVGLREIRRADLSCSDAFPGHDRPTLINDALLQLQELWVAFLDVAF
Ga0209057_11347582 | 3300026342 | Soil | MKPLRSICILVAFLLERSGQSLNRARRRASLILAVSVVASVYLFWKCDTRAQSADDLPTALEPQLAEPGDWAADFNQAQMACYEGHMRACDSIWKSDRVLFDTWLFNYGRTCGGRVGLREIRRADLSCSDAFPGHE
Ga0209378_12089621 | 3300026528 | Soil | MKPLRSICILAAFPLERSGQILNHARRRASLILAVSVVASVYLFWKCDTRAQSAGNLPTAQEPQLAEPGDWAADFNQAQMACYEGSMKACDSIWKSDRVLFDTPLFNYGRTCGGRVDLRE
Ga0209160_10352012 | 3300026532 | Soil | MKPLRSICVLAAFPLERSGQILNHARRRASLILAVSVVASVYLSWKCDTRAQSADDLPTPLAPQLAEPGDWAADFNQAQTACYEGHMRACDSIWKSDRVLFDTWLFNYGRTCGGRVGLREIRRADLSCSDAFPGHE
Ga0209056_100910054 | 3300026538 | Soil | MKLLRSICILAAFPLERSGQILNQARRRASLILAVSVVASICLFWKCDTRAQSADDLPTALEPQLAEPGDWAADFNQAQMACYEGHMRACDSIWRSDRVLFDTWLFNYGRTCGGRVGLREIRRADLSCT
Ga0209156_102989901 | 3300026547 | Soil | MKLLRSICILAAFPLERSGQILNQARRRASLILAVSVVASICLFWKCDTRAQSADDLPTALEPQLAEPGDWAADFNQAQMACYEGHMRACDSIWRSDRVLFDTWLFNYGRTCGGRVGLREIRRADLSCTEAFPGHE
Ga0209161_100027522 | 3300026548 | Soil | MKPLRSICILAAFPLERSGQILNHARRRASLILAVSVVASVYLFWKCDTRAQSADDLPTALEPQLAEPGDWAADFNQAQMACYEGHMRACDSIWKSDRVLFDTWLFNYGRTCGGRVGLREIRRADLSCSDAFPGHE
Ga0209689_12553042 | 3300027748 | Soil | MKPLRSICILVAFLLERSGQSLNRARRRASLILAVSVVASVYLFWKCDTRAQSADDLPTALEPQLAEPGDWAADFNQAQMACYEGHMRACDSIWKSDRVLFDTWLFNYGRTC
Ga0209689_12553052 | 3300027748 | Soil | MKPLRSICVLAAFPLERSGQILNHARRRASLILAVSVVASVYLSWKCDTRAQSADDLPTPLAPQLAEPGDWAADFNQAQTACYEGHMRACDSIWKSDRVLFDTWLFNYGRTC
Ga0209488_102044182 | 3300027903 | Vadose Zone Soil | MKPLRNICILTAFPLERWGQILNHARRRASIILAVSVVASVYLFWKCDTGAQSASNLPTAQEPQLAEPGDWATDFNQAQMACYEGSMKACDSIWKSDRVLFDTWLFNYGRTCGGRVDLREIRRADLTCTEAFPGHE
Ga0307471_1002615261 | 3300032180 | Hardwood Forest Soil | MRATSVRRITRYKVVAQYKGFLLLALLALIRIEAMKPFRNICRLPAVPLERCGQILNRAHRCASLLLAVSVVASVCLFWRCDTRAQSEDDLPTPQEPQLAEPGDWAADFNDLQVACYEGHMSACDSIWKSDRVLFDTFLYEYGRSCGGRVDRRQISRAGLDCTEAFPGHE




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice