NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F077077

Overview

Basic Information
Family ID: F077077
Family Type: Metagenome
Number of Sequences: 117
Average Sequence Length: 213 residues
Representative Sequence: MNAQLRRRLEMAGRVREFLRAHKMDGVGEGLGLAKLEELLTRAEVLASQQRSGVVATRSSTKRRKNLRRALQSKLLLYLRAVGAVAAKETAELGEQFQVPPSNASQQALLTMARGMLEKATXHKDVLVKRGMSEQLLDDLAXTLGEFEQTIEATRAAKREHIGASADLEAVAAEVAEQVRVLDGLVRYRFGDDAELMGAWASARNVLGPFKPKNEPGAGGTQTPKAA
Number of Associated Samples: 100
Number of Associated Scaffolds: 117

Quality Assessment
Transcriptomic Evidence: No
Most common taxonomic group: Bacteria
% of genes with valid RBS motifs: 60.68 %
% of genes near scaffold ends (potentially truncated): 47.86 %
% of genes from short scaffolds (< 2000 bps): 73.50 %
Associated GOLD sequencing projects: 84
AlphaFold2 3D model prediction: No


Most Common Taxonomy
Group: Bacteria (99.145 % of family members)
NCBI Taxonomy ID: 2
Taxonomy: All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem: Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (36.752 % of family members)
Environment Ontology (ENVO): Unclassified (44.444 % of family members)
Earth Microbiome Project Ontology (EMPO): Host-associated → Plant → Plant rhizosphere (49.573 % of family members)




Structure & Topology

Predicted Topology & Secondary Structure
Classification: Mixed
Signal Peptide: No
Secondary Structure distribution: α-helix: 77.09%, β-sheet: 0.00%, Coil/Unstructured: 22.91%



Gene Neighborhood

Neighboring Pfam domains

Pfam ID   Name              % Frequency in 117 Family Scaffolds
PF14499   DUF4437           5.13
PF13650   Asp_protease_2    3.42
PF05685   Uma2              2.56
PF00291   PALP              1.71
PF01402   RHH_1             0.85
PF08843   AbiEii            0.85
PF06769   YoeB_toxin        0.85
PF03699   UPF0182           0.85
PF00583   Acetyltransf_1    0.85
PF17210   SdrD_B            0.85
PF11954   DUF3471           0.85
PF01553   Acyltransferase   0.85
PF00144   Beta-lactamase    0.85
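
The percentages in the Pfam table can be converted back into approximate scaffold counts. The sketch below assumes that each value is the number of family scaffolds carrying the domain, divided by 117 and rounded to two decimals. That reading is inferred from the column header and is not stated on this page. The names `pfam_frequency` and `estimated_count` are illustrative only.

```python
# Assumption: "% Frequency" = (scaffolds with the domain / 117) * 100,
# rounded to two decimals. This is inferred from the column header.
FAMILY_SCAFFOLDS = 117

# Values copied from the "Neighboring Pfam domains" table (first five rows).
pfam_frequency = {
    "PF14499": 5.13,  # DUF4437
    "PF13650": 3.42,  # Asp_protease_2
    "PF05685": 2.56,  # Uma2
    "PF00291": 1.71,  # PALP
    "PF01402": 0.85,  # RHH_1
}


def estimated_count(percent, total=FAMILY_SCAFFOLDS):
    """Return the nearest whole number of scaffolds for a percentage."""
    return round(percent * total / 100)


counts = {pfam: estimated_count(pct) for pfam, pct in pfam_frequency.items()}

for pfam, n in counts.items():
    print(f"{pfam}: about {n} of {FAMILY_SCAFFOLDS} scaffolds")
```

Under this assumption, 5.13 % corresponds to 6 scaffolds and each 0.85 % entry to a single scaffold, so most of the listed neighboring domains were observed only once.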

Neighboring Clusters of Orthologous Genes (COGs)

COG ID   Name   Functional Category   % Frequency in 117 Family Scaffolds
COG4636   Endonuclease, Uma2 family (restriction endonuclease fold)   General function prediction only [R]   2.56
COG1615   Uncharacterized membrane protein, UPF0182 family   Function unknown [S]   0.85
COG1680   CubicO group peptidase, beta-lactamase class C family   Defense mechanisms [V]   0.85
COG1686   D-alanyl-D-alanine carboxypeptidase   Cell wall/membrane/envelope biogenesis [M]   0.85
COG2253   Predicted nucleotidyltransferase component of viral defense system   Defense mechanisms [V]   0.85
COG2367   Beta-lactamase class A   Defense mechanisms [V]   0.85
COG4115   Toxin component of the Txe-Axe toxin-antitoxin module, Txe/YoeB family   Defense mechanisms [V]   0.85



Phylogeny

NCBI Taxonomy

Name            Rank   Taxonomy        Distribution
All Organisms   root   All Organisms   99.15 %
Unclassified    root   N/A             0.85 %

Associated Scaffolds


Scaffold   Taxonomy   Length
3300002558|JGI25385J37094_10017298   All Organisms → cellular organisms → Bacteria   2582
3300002560|JGI25383J37093_10026519   All Organisms → cellular organisms → Bacteria   1943
3300002561|JGI25384J37096_10101753   All Organisms → cellular organisms → Bacteria   999
3300002562|JGI25382J37095_10051359   All Organisms → cellular organisms → Bacteria   1586
3300005166|Ga0066674_10342083   All Organisms → cellular organisms → Bacteria → Terrabacteria group → Deinococcus-Thermus → Deinococci → Deinococcales → Deinococcaceae → Deinococcus → Deinococcus apachensis   703
3300005166|Ga0066674_10439410   All Organisms → cellular organisms → Bacteria   598
3300005172|Ga0066683_10037858   All Organisms → cellular organisms → Bacteria   2808
3300005174|Ga0066680_10052194   All Organisms → cellular organisms → Bacteria   2404
3300005336|Ga0070680_100986723   All Organisms → cellular organisms → Bacteria   727
3300005440|Ga0070705_101891905   All Organisms → cellular organisms → Bacteria   507
3300005444|Ga0070694_100237118   All Organisms → cellular organisms → Bacteria   1374
3300005446|Ga0066686_10039232   All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes   2830
3300005447|Ga0066689_10466121   All Organisms → cellular organisms → Bacteria   794
3300005451|Ga0066681_10332665   All Organisms → cellular organisms → Bacteria   932
3300005451|Ga0066681_10599970   All Organisms → cellular organisms → Bacteria   678
3300005468|Ga0070707_101706540   All Organisms → cellular organisms → Bacteria   597
3300005518|Ga0070699_100262499   All Organisms → cellular organisms → Bacteria   1545
3300005526|Ga0073909_10469516   All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Micromonosporales → Micromonosporaceae → Actinoplanes → Actinoplanes missouriensis   604
3300005545|Ga0070695_100810846   All Organisms → cellular organisms → Bacteria   751
3300005546|Ga0070696_100362280   All Organisms → cellular organisms → Bacteria   1125
3300005549|Ga0070704_100185495   All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria   1667
3300005553|Ga0066695_10221680   All Organisms → cellular organisms → Bacteria   1190
3300005554|Ga0066661_10707205   All Organisms → cellular organisms → Bacteria   591
3300005555|Ga0066692_10707774   All Organisms → cellular organisms → Bacteria   624
3300005556|Ga0066707_10599520   All Organisms → cellular organisms → Bacteria   704
3300005558|Ga0066698_10032452   All Organisms → cellular organisms → Bacteria   3181
3300005559|Ga0066700_10340321   All Organisms → cellular organisms → Bacteria   1059
3300005568|Ga0066703_10306519   All Organisms → cellular organisms → Bacteria   960
3300005574|Ga0066694_10337839   All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Corynebacteriales → Mycobacteriaceae → Mycolicibacterium → Mycolicibacterium phlei   715
3300005586|Ga0066691_10121555   All Organisms → cellular organisms → Bacteria   1482
3300005586|Ga0066691_10626734   All Organisms → cellular organisms → Bacteria   639
3300006034|Ga0066656_10170040   All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium   1374
3300006034|Ga0066656_10430396   All Organisms → cellular organisms → Bacteria   857
3300006046|Ga0066652_102103445   Not Available   500
3300006797|Ga0066659_10077977   All Organisms → cellular organisms → Bacteria   2188
3300006852|Ga0075433_10142842   All Organisms → cellular organisms → Bacteria   2128
3300006871|Ga0075434_100559144   All Organisms → cellular organisms → Bacteria   1164
3300006903|Ga0075426_10033718   All Organisms → cellular organisms → Bacteria   3658
3300006904|Ga0075424_100301363   All Organisms → cellular organisms → Bacteria   1706
3300007265|Ga0099794_10353658   All Organisms → cellular organisms → Bacteria   764
3300007788|Ga0099795_10361347   All Organisms → cellular organisms → Bacteria   652
3300009012|Ga0066710_103310598   All Organisms → cellular organisms → Bacteria   615
3300009038|Ga0099829_10601142   All Organisms → cellular organisms → Bacteria   914
3300009090|Ga0099827_10626663   All Organisms → cellular organisms → Bacteria   928
3300009143|Ga0099792_10068965   All Organisms → cellular organisms → Bacteria   1778
3300009147|Ga0114129_11620307   All Organisms → cellular organisms → Bacteria   792
3300009162|Ga0075423_10489546   All Organisms → cellular organisms → Bacteria   1293
3300010304|Ga0134088_10027932   All Organisms → cellular organisms → Bacteria   2532
3300010326|Ga0134065_10170744   All Organisms → cellular organisms → Bacteria   771
3300010333|Ga0134080_10123173   All Organisms → cellular organisms → Bacteria   1082
3300010364|Ga0134066_10315142   All Organisms → cellular organisms → Bacteria   567
3300010399|Ga0134127_10466811   All Organisms → cellular organisms → Bacteria   1266
3300010403|Ga0134123_10442098   All Organisms → cellular organisms → Bacteria   1206
3300011443|Ga0137457_1010755   All Organisms → cellular organisms → Bacteria   2216
3300011443|Ga0137457_1039581   All Organisms → cellular organisms → Bacteria   1339
3300012096|Ga0137389_11021003   All Organisms → cellular organisms → Bacteria   709
3300012189|Ga0137388_10186289   All Organisms → cellular organisms → Bacteria   1863
3300012200|Ga0137382_10157351   All Organisms → cellular organisms → Bacteria   1544
3300012200|Ga0137382_10290544   All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes   1139
3300012201|Ga0137365_10116233   All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes   2012
3300012203|Ga0137399_10026703   All Organisms → cellular organisms → Bacteria   3902
3300012203|Ga0137399_10223849   All Organisms → cellular organisms → Bacteria   1535
3300012203|Ga0137399_10620303   All Organisms → cellular organisms → Bacteria   909
3300012208|Ga0137376_10065328   All Organisms → cellular organisms → Bacteria   3013
3300012208|Ga0137376_10211972   All Organisms → cellular organisms → Bacteria   1677
3300012210|Ga0137378_10112127   All Organisms → cellular organisms → Bacteria   2512
3300012211|Ga0137377_10237049   All Organisms → cellular organisms → Bacteria   1752
3300012211|Ga0137377_10944667   All Organisms → cellular organisms → Bacteria   793
3300012285|Ga0137370_10368286   All Organisms → cellular organisms → Bacteria   867
3300012356|Ga0137371_10053129   All Organisms → cellular organisms → Bacteria   3134
3300012359|Ga0137385_10720130   All Organisms → cellular organisms → Bacteria   833
3300012362|Ga0137361_10575521   All Organisms → cellular organisms → Bacteria   1033
3300012363|Ga0137390_10201831   All Organisms → cellular organisms → Bacteria   1975
3300012683|Ga0137398_10531314   All Organisms → cellular organisms → Bacteria   810
3300012685|Ga0137397_10715484   All Organisms → cellular organisms → Bacteria   744
3300012917|Ga0137395_10234844   All Organisms → cellular organisms → Bacteria   1284
3300012918|Ga0137396_10069864   All Organisms → cellular organisms → Bacteria → Proteobacteria   2452
3300012918|Ga0137396_10121644   All Organisms → cellular organisms → Bacteria   1882
3300012922|Ga0137394_10104168   All Organisms → cellular organisms → Bacteria   2392
3300012925|Ga0137419_10400930   All Organisms → cellular organisms → Bacteria   1072
3300012927|Ga0137416_10246407   All Organisms → cellular organisms → Bacteria   1458
3300012927|Ga0137416_10841956   All Organisms → cellular organisms → Bacteria   813
3300012930|Ga0137407_11089511   All Organisms → cellular organisms → Bacteria   757
3300012944|Ga0137410_10161169   All Organisms → cellular organisms → Bacteria   1716
3300012972|Ga0134077_10139048   All Organisms → cellular organisms → Bacteria   963
3300012972|Ga0134077_10355745   All Organisms → cellular organisms → Bacteria   625
3300014154|Ga0134075_10305297   All Organisms → cellular organisms → Bacteria   693
3300015052|Ga0137411_1155983   All Organisms → cellular organisms → Bacteria   640
3300015054|Ga0137420_1007406   All Organisms → cellular organisms → Bacteria   622
3300015241|Ga0137418_10152990   All Organisms → cellular organisms → Bacteria   2031
3300015245|Ga0137409_10037390   All Organisms → cellular organisms → Bacteria → Proteobacteria   4676
3300015245|Ga0137409_10213585   All Organisms → cellular organisms → Bacteria   1731
3300015359|Ga0134085_10077485   All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes   1359
3300017654|Ga0134069_1362166   All Organisms → cellular organisms → Bacteria   522
3300017657|Ga0134074_1118152   All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium   916
3300017657|Ga0134074_1161493   All Organisms → cellular organisms → Bacteria   786
3300018482|Ga0066669_10299215   All Organisms → cellular organisms → Bacteria   1305
3300019880|Ga0193712_1066186   All Organisms → cellular organisms → Bacteria   789
3300019997|Ga0193711_1012223   All Organisms → cellular organisms → Bacteria   1082
3300025917|Ga0207660_11103173   All Organisms → cellular organisms → Bacteria   647
3300026277|Ga0209350_1012192   All Organisms → cellular organisms → Bacteria   2748
3300026295|Ga0209234_1031650   All Organisms → cellular organisms → Bacteria   2010
3300026296|Ga0209235_1000668   All Organisms → cellular organisms → Bacteria   17667
3300026313|Ga0209761_1089044   All Organisms → cellular organisms → Bacteria   1574
3300026327|Ga0209266_1031792   All Organisms → cellular organisms → Bacteria   2786
3300026328|Ga0209802_1050919   All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium   2050
3300026333|Ga0209158_1166164   All Organisms → cellular organisms → Bacteria   800
3300026334|Ga0209377_1106433   All Organisms → cellular organisms → Bacteria   1154
3300026528|Ga0209378_1195871   All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium   679
3300026532|Ga0209160_1041227   All Organisms → cellular organisms → Bacteria   2778
3300026536|Ga0209058_1038384   All Organisms → cellular organisms → Bacteria   2858
3300026537|Ga0209157_1024884   All Organisms → cellular organisms → Bacteria   3597
3300026537|Ga0209157_1185705   All Organisms → cellular organisms → Bacteria   897
3300026555|Ga0179593_1086840   All Organisms → cellular organisms → Bacteria   2055
3300027903|Ga0209488_10003533   All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes → unclassified Gemmatimonadetes → Gemmatimonadetes bacterium   12502
3300027903|Ga0209488_10101628   All Organisms → cellular organisms → Bacteria   2155
3300028536|Ga0137415_10057261   All Organisms → cellular organisms → Bacteria   3764




Environmental Properties

Associated Habitat Types

Habitat   Taxonomy   Distribution
Vadose Zone Soil   Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil   36.75%
Soil   Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil   26.50%
Grasslands Soil   Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil   9.40%
Grasslands Soil   Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil   8.55%
Corn, Switchgrass And Miscanthus Rhizosphere   Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere   5.98%
Populus Rhizosphere   Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere   5.13%
Soil   Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil   1.71%
Soil   Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil   1.71%
Terrestrial Soil   Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil   1.71%
Corn Rhizosphere   Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere   1.71%
Surface Soil   Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil   0.85%



Associated Samples

Taxon OID   Sample Name   Habitat Type
3300002558   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm   Environmental
3300002560   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cm   Environmental
3300002561   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm   Environmental
3300002562   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cm   Environmental
3300005166   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123   Environmental
3300005172   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132   Environmental
3300005174   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129   Environmental
3300005336   Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaG   Environmental
3300005440   Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-3 metaG   Environmental
3300005444   Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-1 metaG   Environmental
3300005446   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135   Environmental
3300005447   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138   Environmental
3300005451   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130   Environmental
3300005468   Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG   Environmental
3300005518   Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaG   Environmental
3300005526   Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire. - Coalmine Soil_Cen17_06102014_R1   Environmental
3300005545   Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-2 metaG   Environmental
3300005546   Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-3 metaG   Environmental
3300005549   Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-2 metaG   Environmental
3300005553   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144   Environmental
3300005554   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110   Environmental
3300005555   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141   Environmental
3300005556   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156   Environmental
3300005558   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147   Environmental
3300005559   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149   Environmental
3300005568   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152   Environmental
3300005574   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_143   Environmental
3300005586   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140   Environmental
3300006034   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105   Environmental
3300006046   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101   Environmental
3300006797   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108   Environmental
3300006852   Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD2   Host-Associated
3300006871   Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD3   Host-Associated
3300006903   Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD5   Host-Associated
3300006904   Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3   Host-Associated
3300007265   Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1   Environmental
3300007788   Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_2   Environmental
3300009012   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159   Environmental
3300009038   Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG   Environmental
3300009090   Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG   Environmental
3300009143   Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2   Environmental
3300009147   Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. deltoides SRZDD1 (version 2)   Host-Associated
3300009162   Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD2   Host-Associated
3300010304   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015   Environmental
3300010326   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_24_1 metaG   Environmental
3300010333   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_2_24_1 metaG   Environmental
3300010364   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09212015   Environmental
3300010399   Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-3   Environmental
3300010403   Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-3   Environmental
3300011443   Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT630_2   Environmental
3300012096   Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG   Environmental
3300012189   Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG   Environmental
3300012200   Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaG   Environmental
3300012201   Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaG   Environmental
3300012203   Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG   Environmental
3300012208   Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaG   Environmental
3300012210   Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaG   Environmental
3300012211   Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaG   Environmental
3300012285   Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaG   Environmental
3300012356   Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaG   Environmental
3300012359   Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_80_16 metaG   Environmental
3300012362   Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG   Environmental
3300012363   Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaG   Environmental
3300012683   Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaG   Environmental
3300012685   Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaG   Environmental
3300012917   Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaG   Environmental
3300012918   Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaG   Environmental
3300012922   Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaG   Environmental
3300012925   Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)   Environmental
3300012927   Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)   Environmental
3300012930   Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)   Environmental
3300012944   Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)   Environmental
3300012972   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_24_1 metaG   Environmental
3300014154   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09212015   Environmental
3300015052   Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (PacBio error correction)   Environmental
3300015054   Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (PacBio error correction)   Environmental
3300015241   Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly)   Environmental
3300015245   Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly)   Environmental
3300015359   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09182015   Environmental
3300017654   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_2_24_1 metaG   Environmental
3300017657   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09212015   Environmental
3300018482   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118   Environmental
3300019880   Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H3a1   Environmental
3300019997   Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H3m2   Environmental
3300025917   Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7-3B metaG (SPAdes)   Environmental
3300026277   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_2_20cm (SPAdes)   Environmental
3300026295   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_20cm (SPAdes)   Environmental
3300026296   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cm (SPAdes)   Environmental
3300026313   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes)   Environmental
3300026327   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 (SPAdes)   Environmental
3300026328   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 (SPAdes)   Environmental
3300026333   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 (SPAdes)   Environmental
3300026334   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 (SPAdes)   Environmental
3300026528   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 (SPAdes)   Environmental
3300026532   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 (SPAdes)   Environmental
3300026536   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 (SPAdes)   Environmental
3300026537   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 (SPAdes)   Environmental
3300026555   Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (PacBio error correction)   Environmental
3300027903   Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes)   Environmental
3300028536   Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)   Environmental

Geographical Distribution
(Interactive map; powered by OpenStreetMap)

Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI25385J37094_1001729833300002558Grasslands SoilMNAQLRRRLEMAGRVREFLRAHKMDGVGEGLGLAKLEELLTRAEVLASQQRSGVVATRSSTKRRKNLRRALQSKLLLYLRAVGAVAAKETAELGEQFQVPPSNASQQALLTMARGMLEKATXHKDVLVKRGMSEQLLDDLAXTLGEFEQTIEATRAAKREHIGASADLEAVAAEVAEQVRVLDGLVRYRFGDDAELMGAWASARNVLGPFKPKNEPGAGGTQTPKAA*
JGI25383J37093_1002651943300002560Grasslands SoilEELLTRAEVLASQQRSGVVATRSSTKRRKNLRRALQSKLLLYLRAVGAVAAKETAELGEQFQVPPSNASQQALLTMARGMLEKATAHKDVLVKRGMSEQLLDDLAKTLGEFEQTIEATRAAKREHIGASADLQAVAAEVAEQVRVLDGLVRYRFGDDAELMGAWASARNVLGPFKPKNEPGAGGTQTPKAA*
JGI25384J37096_1010175313300002561Grasslands SoilLASQQRSGVVATRSSTKRRKNLRRALQSKLLLYLRAVGAVAAKETAELGEQFQVPPSNASQQALLTMARGMLEKATAHKDVLVKRGMSEQLXXDLAXTLGEFEQTIEATRAAKREHIGASADLEAVAAEVAEQVXVLDGLVRYRFGDDAELMGAWASARXVLGPFKPKXXXXXXAVLRRRRRPDGA*
JGI25382J37095_1005135913300002562Grasslands SoilMNAQLRRRLEMAGRVREFLRAHKMDGVGEGLGLAKLEELLTRAEVLASQQRSGVVATRSSTKRRKNLRRALQSKLLLYLRAVGAVAAKETAELGEQFQVPPSNASQQALLTMARGMLEKATXHKDVLVKRGMSEQLLDDLAXTLGEFEQTIEATRAAKREHIGASADLEAVAAEVAEQVKVLDGLVRYRFGDDAELMGAWASARNVLGPFKPKAGDQPKAA*
Ga0066674_1034208313300005166SoilEGLGLAKLEELIARAEALDAQQRAGIVTTRLSTKHREGLRRALQSKLLLYLRALGGLGDPENGEAAVQFQVPPSNASHLALLTAGRDMLEKATGQKDVLLSRGMPPTLLDDLAGALGGLEKTIEATRAGRRDHPSLRSGQDVGASADLQTVGAEIKKQVRALDGMVRYRFGDNAELMGAWRSARNVLGPFKTKNEPEAGGSQTPKAA*
Ga0066674_1043941013300005166SoilLVQRAEVLASQQRSGVVATRSSTKRRKNLRRALQSKLLLYLRAVGAVAAKETAELGEQFQVPPSNASQQALLTMARGMLEKATAHKDVLVKRGMSEQLLDDLAQTLGEFEQTIEATRAAKREHIGASADLEAVAAEVAEQVRVLDGLVRYRFGDDAELMGAWASARNVLGPFKPKTEPGTGGSETLREAQGRPPKAA*
Ga0066683_1003785813300005172SoilMNAQLRRRLEMAGRVREFLRAHKTDGVGEGLGLAKLEELMQRAELLASQQRAGLVAALSTSKHRKGLRRALQSKLLLYLRAVAAVAAKETAELAVQFQVPPSNASNQALLTMARGMLEKATAHKDVLVKRGMSEQLLDDLAGTLGEFEQTIEATRAAKREHIGASADLEAVAAEVAEQVRVLDGLVRYRFGDDAELMGAWASARNVLGPFKPKTEPGAGGSETLREAQGRPPKAA*
Ga0066680_1005219413300005174SoilMNAQLRRRLEMAGRVREFLRAHKTDGVGEGLGLAKLEELLTRAEVLASQQRSGVVATRSSTKRRKNLRRALQSKLLLYLRAVGAVAAKETAELGEQFQVPPSNASQQALLTMARGMLEKATAHKDVLVKRGMSEQLLDDLAKTLGEFEQTIEATRAAKREHIGASADLEAVAAEVAEQVKVLDGLVRYRFGDDAELMGAWASARNVLGPFKPKAGDQPKAA*
Ga0070680_10098672313300005336Corn RhizosphereLKSRSFAPAALRRRVREFLRAHKTDGVVEGLGFAKLEELVQRGEALASQQRSGIVERRSSTKRRNNLRRALQTKLLLYLRAVGAVAAKENAELAVQFQVPPSNASHEALLTMARGMLEKATLHKDVLVNRGMSEQLLGDLAGALGEFEQTIEATRAAQRQHIGASADLEAVAAEIAEQVRVLDGLVRYRFGDNAELMGAWASARNVLGPFKPKAGDQPKAA*
Ga0070705_10189190513300005440Corn, Switchgrass And Miscanthus RhizosphereELVQRGEALASQQRSGIVERRSSTKRRNNLRRALQTELLLYLRAVGAVAAKENAELAVQFQVPPSNASHEALLTMARGMLEKATLHKDVLVNRGMSEQLLGDLAGALGEFEQTIEATRAAQRQHIGASADLEAVAAAIAEQVRVLDGLVRYRFGDNAELMGAWASARNV
Ga0070694_10023711823300005444Corn, Switchgrass And Miscanthus RhizosphereMNAQLRRRLEMAGRVREFLRAHKTDGVGEGLGLAKLEELVQRGEVLASQQRSGIVERRSSTKRRNNLRRALQTKLLLYLRAVGAVAAKENAELAVQFQVPPSNASHEALLTMARGMLEKATLHKDVLVNRGMSEQLLGDLAGALGEFEQTIEATRAAQRQHIGASADLEAVAAEIAEQVRVLDGLVRYRFGDNAELMGAWASARNVLGPFKPKAGDQPKAA*
Ga0066686_1003923253300005446SoilMNAQLRRRLEMAGRVREFLRAHKTDGVGEGLGLAKLEELMQRAELLASQQRAGLVAALSTSKHRKGLRRALQSKLLLYLRAVAAVAAKETAELAVQFQVPPSNASNQALLTMARGMLEKATAHKDVLVKRGMSEQLLDDLAGTLGEFEQTIEATRASRREHIGATADLEAVAAEVAEQVRVLDGLVRYRFGDDAELMGAWASARNVLGPFKPKTEPGTGGSETLREAQGRPPKAA*
Ga0066689_1046612113300005447SoilMNAQLRRRLEMAGRVREFLRAHKTDGVGEGLGVAKLEELLTRAEVLASQQRSGVVATRSSTKRRKNLRRALQSKLLLYLRAVGAVAAKETAELGEQFQVPPSNASQQALLTMARGMLEKATAHKDVLVKRGMSEQLLDDLAKTLGEFEQTIEATRAAKREHIGASADLEAVAAEVAEQVRVLDGLVRYRFGDDAELMGAWASARNVLGPFKPKTEPGTSGSETPKAA*
Ga0066681_1033266513300005451SoilMNAQLRRRLEMAGRVREFLRAHKTDGVGEGLGLAKLEELVQRAEVLASQQREGVVAARSATTRRRKVRRALQSKLLLYLRAVGAVAAKETAELAEQFQVPPSNASHEALLTMARGMLEKATLHKDVLVKRGMSEQLLGDLALALGEFEQTIEATRAGKREHIGASADLEAVAAEIAEQVKVLDGLVRYRFGDDAELMGAWAGARNVLGPFKPKTEPGTGGSETPKAA*
Ga0066681_1059997013300005451SoilALLQRAEVLAAQQRAGLVAKRSATTWRKELRGTLHRKLLLYLRAVGAVAAKENAELAVEFQMPPSNASHKALITMARGKIEKATPHKELLVQRGMSEQLLDDLAKTVDQFEQTVEASQAAKRAHIGATADLWAVTAEITEQLKVLEGVVRYRFGDNAELMGAWNAARNVLGPFKTKNEPEAGGSQTPKAA*
Ga0070707_10170654013300005468Corn, Switchgrass And Miscanthus RhizosphereMNAQLRRRLEMAGRVREFLRAHKTDGVGEGLGLAKLEELVQRGEALASQQRSGIVERRSSTKRRNNLRRALQSKLLLYLRAVGAVAAKENAELAVQFQVPPSNASHEALLTMARGMLEKATLHKDVLVNRGMSEQLLGDLAGALGEFEQTIEATRAAQRQHIGASADLEAVAAAIAEQVRVLDGLVR
Ga0070699_10026249913300005518Corn, Switchgrass And Miscanthus RhizosphereMNAQLRRRLEMAGRVREFLRAHKTDGVGEGLGLAKLEELVQRGEVLASQQRSGIVERRSSTKRRNNLRRALQTKLLLYLRAVGAVAAKENAELAVQFQVPPSNASHEALLTMARGMLEKATLHKDVLVNRGMSEQLLGDLAGALGEFEQTIEATRAAQRQHIGASADLEAVAAAIAEQVRVLDGLVRYRFGDNAELMGAWASARNVLGPFKPKAGDQPKAA*
Ga0073909_1046951613300005526Surface SoilEVLAAQQRAGLVATRSATNRRSDLRRALQSKLLIYLRAVGAVAAKENAELAVQFHVAPSNASNQALVTMARGMLEKATLHKDVLVKGGMAEQLLDDLAKTIGEVEQTIEATRASRREHVGATADLEAVAAEIAEQVKVLDGLVRYRFGENAELMGAWASARNVLGPFKPKDKPEAGGSQTPKAA*
Ga0070695_10081084613300005545Corn, Switchgrass And Miscanthus RhizosphereLGLAKLEELVQRGEVLASQQRSGIVERRSSTKRRNNLRRALQTKLLLYLRAVGAVAAKENAELAVQFQVPPSNASHEALLTMARGMLEKATLHKDVLVNRGMSEQLLGDLAGALGEFEQTIEATRAAQRQHIGASADLEAVAAAIAEQVRVLDGLVRYRFGDNAELMGAWASARNVLGPFKPKAGDQPKAA*
Ga0070696_10036228013300005546Corn, Switchgrass And Miscanthus RhizosphereLRAHKTDGVGEGLGLAKLEELVQRGEALASQQRSGIVERRSSTKRRNNLRRALQTKLLLYLRAVGAVAAKENAELAVQFQVPPSNASHEALLTMARGMLEKATLHKDVLVNRGMSEQLLGDLAGALGEFEQTIEATRAAQRQHIGASADLEAVAAEIAEQVRVLDGLVRYRFGDNAELMGAWASARNVLGPFKPKAGDQPKAA*
Ga0070704_10018549513300005549Corn, Switchgrass And Miscanthus RhizosphereLRAHKTDGVGEGLGLAKLEELVQRGEALASQQRSGIVERRSSTKRRNNLRRALQTKLLLYLRAVGAVAAKENAELAVQFQVPPSNASHEALLTMARGMLEKATLHKDVLVNRGMSEQLLGDLAGALGEFEQTIEATRAAQRQHIGASADLEAVAAEIAEQVRVLDGLVRYRFGDNAELMGAWASARN
Ga0066695_1022168023300005553SoilMNAQLRRRLEMAGRVREFLRAHKTDGVGEGLGLAKLEELVQRAEVLASQQREGVVAARSATTRRRKVRRALQSKLLLYLRAVGALAAKETSELAEQFQVPPSNASHEALLTMARGMLEKATLHKDVLVKRGMSEQLLGDLAAALGEFEQTIEATRAGKREHIGASADLEAVAAEIAEQVKVLDGLVRYRFGDDAELMGAWASARNVLGPFKPKTEPGTGGSETPKAA*
Ga0066661_1070720513300005554SoilFLRAHKTDGVGEGLGLAKLEELVQRAEVLASQQREGVVAARSATTRRRKVRRALQSKLLLYLRAVGALAAKETAELAEQFQVPPSNASHEALLTMARGMLEKATLHKDVLVKRGMSEQLLGDLALALGELEQTIEATRAGKREHIGASADLEAVAAEIAEQVKVLDGLVRYRFGDNAELMGAWAGARNVLGPFKPKT
Ga0066692_1070777413300005555SoilMAGRVREFLRAHKMDGVGEGLGLAKLEELLTRAEVLASQQRSGVVATRSSTKRRKNLRRALQSKLLLYLRAVGAVAAKETAELGEQFQVPPSNASQQALLTMARGMLEKATAHKDVLVKRGMSEQLPDDLAQTLGEFEQTIEATRAAKREHIGASADLEAVAAEVAEQVRVLDGLVRYRFGDDAELMGAWASARNVLGPFKPKTEPGA
Ga0066707_1059952013300005556SoilMAGRVREFLRAHKMDGVGEGLGLAKLEELLTRAEVLASQQRSGVVATRSSTKRRKNLRRALQSKLLLYLRAVGAVAAKETAELGEQFQVPPSNASQQALLTMARGMLEKATAHKDVLVKRGMSEQLLDDLAKTLGEFEQTIEATRAAKREHIGASADLEAVAAEVAEQVRVLDGLVRYRFGDDAELMGAWASARNVLGPFKPKTEPGTGGSETPKAA*
Ga0066698_1003245253300005558SoilMNAQLRRRLEMAGRVRDFLRAHKTDGVGEGLGLAKLEELIARAEALDAQQRGGMVTTRLSTKHREGLRRALQSKLLLYLRALGGLGDPENGEAAVQFQVPPSNASHQALLTAGRDMLEKAKGQTDVLLSRGMPPTLLDDLAGALGGLEQTIESTRAGRRDHPSLRSGQVVGASADLQTVAAEIKKQVRALDGMVRYRFGDNTELMGAWRSARNVLGPFKTKNEPEAGGSQTPKAA*
Ga0066700_1034032113300005559SoilGGRMNAQLRRRLEMAGRVREFLRAHKMDGVGEGLGLAKLEELLTRAEVLASQQRSGVVATRSSTKRRKNLRRALQSKLLLYLRAVGAVAAKETAELGEQFQVPPSNASQQALLTMARGMLEKATAHKDVLVKRGMSEQLLDDLAKTLGEFEQTIEATRAAKREHIGASADLEAVAAEVAEQVRVLDGLVRYRFGDDAELMGAWASARNVLGPFKPKNEPGAGGTQTPKAA*
Ga0066703_1030651913300005568SoilMNAQLRRRLEMAGRVREFLRAHKMDGVGEGLGLAKLEELLTRAEVLASQQRSGVVATRSSTKRRKNLRRALQSKLLLYLRAVGAVAAKETAELGEQFQVPPSNASQQALLTMARGMLEKATAHKDVLVKRGMSEQLLDDLAKTLGEFEQTIEATRAAKREHIGASADLEAVAAEVAEQVRVLDGLVRYRFGDDAELMGAWASARNVLGPFKPKNEPGAGGTQTPK
Ga0066694_1033783913300005574SoilRAELLASQQRAGLVAALSTSKHRKGLRRALQSKLLLYLRAVAAVAAKETAELAVQFQVPPSNASNQALLTMARGMLEKATAHKDVLVKRGMSEQLLDDLAGTLGEFEQTIEATRAAKREHIGASADLEAVAAEVAEQVKVLDGLVRYRFGDDAELMGAWASARNVLGPFKPKTEPGTGGSETPKAA*
Ga0066691_1012155513300005586SoilEELLTRAEVLASQQRSGVVATRSSTKRRKNLRRALQSKLLLYLRAVGAVAAKETAELGEQFQVPPSNASQQALLTMARGMLEKATAHKDVLVKRGMSEQLLDDLAKTLGEFEQTIEATRAAKREHIGASADLEAVAAEVAEQVRVLDGLVRYRFGDDAELMGAWASARNVLGPFKPKNEPGAGGTQTPKAA*
Ga0066691_1062673413300005586SoilLRRRLEMAGRVREFLRAHKTDGVGEGLGLAKLEELVQRAEVLASQQREGVVAARSATTRRRKVRRALQSKLLLYLRAVGALAAKETAELAEQFQVPPSNASHEALLTMARGMLEKATLHKDVLVKRGMSEQLLGDLALALGELEQTIEATRAGKREHIGASADLEAVAAEIAEQVKVLDGLVRYRFGDNAELMGAWAGARNVLGPFKPKTEPG
Ga0066656_1017004033300006034SoilMNAQLRRRLEMAGRVRDFLRAHKTDGVGEGLGLAKLEELIARAEALDAQQRAGVVTTRLSTKHRKGIRRALQSKLLLYLRALGGLGDPENGEAAVQFQVPPSNASHLALLTAGRDMLEKAKGQTDVLLSRGMPPTLLDDLAGVLGGLEKTIEATRAGRRDHPSLRSGQVVGASADLQAVAAQIKSQVRALD
Ga0066656_1043039613300006034SoilGRMNAQLRRRLEMAGRVREFLRAHKMDGVGEGLGLAKLEELLTRAEVLASQQRSGVVATRSSTKRRKNLRRALQSKLLLYLRAVGAVAAKETAELGEQFQVPPSNASQQALLTMARGMLEKATAHKDVLVKRGMSEQLLDDLAKTLGEFEQTIEATRAAKREHIGASADLEAVAAEVAEQVRVLDGLVRYRFGDDAELMGAWASARNVLGPFKPKTEPGAGGSQTPKAA*
Ga0066652_10210344513300006046SoilLAKLEELIARAEVLDGQQRAGVVTTRLSTKHRKGIRRALQSKLLLYLRALGGLGDPENGEAAVQFEVPPSNASHQALLTAGRDMLEKATGQKDVLLSRGMPPTLLDDLAGALGGLEKTIEATRAGRRDHPSLRSGQDVGASADLQTVGAEIKKQVRALDGMVRYRF
Ga0066659_1007797713300006797SoilMNAQLRRRLEMAGRVREFLRAHKMDGVGEGLGLAKLEELLTRAEVLASQQRSGVVATRSSTKRRKNLRRALQSKLLLYLRAVGAVAAKETAELGEQFQVPPSNASQQALLTMARGMLEKATAHKDVLVKRGMSEQLLDDLAQTLGEFEQTIEATRAAKREHIGASADLEAVAAEVAEQVRVLDGLVRYRFGDDAELMGAWASARNVLGPFKPKAGDQPKAA*
Ga0075433_1014284233300006852Populus RhizosphereMNAQLRRRLEMAGRVREFLRAHKTDGVGEGLGLAKLEELLQRAEVLASQQRAGLVATRSATNRRSDLRRALQSKLLIYLRAVGAVAAKENAELAVQFHVPPSNASNQALVTMARGMLEKATLHKDVLVKGGMAEQLLDDLAKTIGEVEQTIEATRASRREHVGATADLEAVAAEIAEQVKVLDGLVRYRFGENAELMGAWASARNVLGPFKPKDKPEAGGSQTPKAA*
Ga0075434_10055914413300006871Populus RhizosphereMNAQLRRRLEMAGRVREFLRAHKTDGVGEGLGLAKLEELLQRAEVLAAQQRAGLVATRSATNRRSDLRRALQSKLLIYLRAVGAVAAKENAELAVQFHVPPSNASNQALVTMARGMLEKATLHKDVLVKGGMAEQLLDDLAKTIGEVEQTIEATRASRREHVGATADLEAVAAEIAEQVKVLDGLVRYRFGENAELMGAWASARNVLGPFKPKDKPEAGGSQTPKAA*
Ga0075426_1003371833300006903Populus RhizosphereMNAQLRRRLEMAGRVREFLRAHKTDGVGEGLGLAKLEELLQRAEVLAAQQRAGLVATRSATNRRSDLRRALQSKLLLYLRAVGAVAAKENAELAVQFHVPPSNASNQALVTMARGMLEKATLHKDALVKGGMAEQLLDDLAKTIGEVEQTIEATRASRREHIGATADLEAVAAEIAEQVKVLDGLVRYRFGENAELMGAWASARNVLGPFKPKDKPEAGGSQTPKAA*
Ga0075424_10030136343300006904Populus RhizosphereMNAQLRRRLEMAGRVREFLRAHKTDGVGEGLGLAKLEELLQRAEVLAAQQRAGLVATRSATNRRSDLRRALQSKLLIYLRAVGAVAAKENAELAVQFHVPPSNASNQALVTMARGMLEKATLHKDVLVKGGMAEQLLDDLAKTIGEVEQTIEATRASRREHIGATADLEAVAAEIAEQVKVLDGLVRYRFGENAELMGAWASARNVLGPFKPKDKPEAGGSQTPKAA*
Ga0099794_1035365813300007265Vadose Zone SoilMAERVREFLRAHKTDGVGEGLGLAKLEVLLQRAEVLAAQQRAGLVARRSATKRRKDLRGALHGKLLLYLRAVGAVAAKENAELAVQFQMPPSNASHQALITMARGMLEKATANKDVLVQRGMSEQLLDDLAETLGEFERTVEASQAARREHIGATADLWAVAAEITEQLKVLDGLVRYRFGDKAELMGAWGSARNVLGPFKPKTEPGVDGSQT
Ga0099795_1036134713300007788Vadose Zone SoilEFLRAHKTDGVGEGLGLAKLEELVQRGEVLASQQRSGIVERRSSTKRRQNLRRALQSKLLLYLRAVGAVAAKENAELAVQFQVPPSNASHEALLTMARGMLEKATLHKDVLVNRGMSEQLLGDLAGALGEFEQTIEATRAAQRQHIGASVDLEAVAAEIAEQVKVLDGLVRYRFGDNAELMGAWASARNVLGPFKPKAGDQPKAGDQPKAA*
Ga0066710_10331059813300009012Grasslands SoilMNAQLRRRLEMAGRVREFLRAHKTDGVGEGLGLAKLEELVQRAEVLASQQREGVVAARSATTRRRKVRRALQSKLLLYLRAVGALAAKETAELAEQFQVPPSNASHEALLTMARGMLEKATLHKDVLVKRGMSEQLLGDLALALGEFEQTIEATRAGKREHIGASADLEAVAAEVAEQVKVLDGLVRYRFGDDAELMGAWA
Ga0099829_1060114213300009038Vadose Zone SoilASQQRSGVVATRSSTKRRKNLRRALQSKLLLYLRAVGAVAAKETAELGEQFREPPSNASQQALLTMARGMLEKATAHKDVLVKRGMSEQLLDDLAQTLGEFEQTIEATRAAKREHIGASADLEAVAAEVAEQVRVLDGLVRYRFGDDAELMGAWASARNVLGPFKPKSEPNAGAGGSQTPKAA*
Ga0099827_1062666313300009090Vadose Zone SoilMAGRVREFLRAHKMDGVGEGLGLAKLEELLTRAEVLASQQRSGVVATRSSTKRRKNLRRALQSKLLLYLRAVGAVAAKETAELGEQFQVPPSNASQQALLTMARGMLEKATAHKDVLVKRGMSEQLLDDLAQTLGEFEQTIEATRAAKREHIGASADLEAVAAEVAEQVRVLDGLVRYRFGDDAELMGAWASARNVLGPFKPKTEPGTGGSETPKAA*
Ga0099792_1006896523300009143Vadose Zone SoilMNALLRRRLEAAARVRDFLRAHKTDAVGEGLGLGKLEELITRAEALDEQQRAGVVTARLSTKHRKGLRRALQSKLLLYLRALGGLGDPENGEAAIQFQVPPSNASHQALLTTGRDTLTKALGQKDVLLARGMPPALLDDLAAALGGLEQTMADTRTGRRDHVGASSDLAAVGVEIVKQVRALDGMVRYRFGDNAELMGAWRSARNVLGPFRSKNGPESSAGGSQTPKAA*
Ga0114129_1162030713300009147Populus RhizosphereMAGRVREFLRAHKTDGVGEGLGLAKLEELLQRAEVLASQQRAGLVATRSATNRRSDLRRALQSKLLLYLRAVGAVAAKENAELAVQFHVPPSNASNQALVTMARGMLEKATLHKDVLVKGGMAEQLLDDLAKTIGEVEQTIEATRASRREHIGATADLEAVAAEIAEQVKVLDGLVRYRFGENAELMGAWASARNVLGPFKPKDKPEAGGSQTPKAA*
Ga0075423_1048954613300009162Populus RhizosphereMAGRVREFLRAHKTDGVGEGLGLAKLEELLQRAEVLAAQQRAGLVATRSATNRRSDLRRALQSKLLIYLRAVGAVAAKENAELAVQFHVPPSNASNQALVTMARGMLEKATLHKDVLVKGGMAEQLLDDLAKTIGEVEQTIEATRASRREHIGATADLEAVAAEIAEQVKVLDGLVRYRFGENAELMGAWASARNVLGPFKPKDKPEAGGSQTPKAA*
Ga0134088_1002793253300010304Grasslands SoilMAGRVREFLRAHKTDGVGEGLGLAKLEELMQRAELLASQQRAGLVAALSTSKHRKGLRRALQSKLLLYLRAVAAVAAKETAELAVQFQVPPSNASNQALLTMARGMLEKATAHKDVLVKRGMSEQLLDDLAGTLGEFEQTIEATRAAKREHIGASADLEAVAAEVAEQVRVLDGLVRYRFGDDAELMGAWASARNVLGPFKPKTEPGAGGSQTPKAA*
Ga0134065_1017074413300010326Grasslands SoilMNAQLRRRLEMAGRVRDFLRAHKTDGVGEGLGLAKLEELIVRAEALDEQQRAGVVTTRLSTKHRKGIRRALQSKLLLYLRALGGLGDPENGEAAVQFQVPPSNASHLALLTAGRDMLEKATGQKDVLLSRGMPPTLLDDLAGALGGLEKTIESTRAGRRDHPSLRSGQVVGASADLVAVAAQIKSQVRALDGMVRYRFGDNAELMGAWRSARNVLGPFKTKNEPEAGGSETPKAA*
Ga0134080_1012317313300010333Grasslands SoilMAGRVREFLRAHKTDGVGEGLGLAKLEELMQRAELLASQQRAGLVAALSTSKHRKGLRRALQSKLLLYLRAVGAVAAKETAELGEQFQVPPSNASQQALLTMARGMLEKATAHKDVLVKRGMSEQLLDDLAGTLGEFEQTIEATRAAKREHIGASADLEAVAAEVAEQVRVLDGLVRYRFGDDAELMGAWASARNVLGPFKPKTEPGTGGSQTPKAA*
Ga0134066_1031514213300010364Grasslands SoilASQQREGVVAARSATTRRRKVRRALQSKLLLYLRAVGAVAAKETAELAEQFQVPPSNASHEALLTMARGMLEKATLHKDVLVKRGMSEQLLGDLALALGELEQTIEATRAGKREHIGASADLEAVAAEIAEQVKVLDGLVRYRFGDDAELMGAWVSARNVLGPFKPKTEPGAGGSETPKAA*
Ga0134127_1046681113300010399Terrestrial SoilMRVREFLRAHKTDGVGEGLGLAKLEELVQRGEVLASQQRSGIVERRSSTKRRNNLRRALQTKLLLYLRAVGAVAAKENAELAVQFQVPPSNASHEALLTMARGMLEKATLHKDVLVNRGMSEQLLGDLAGALGEFEQTIEATRAAQRQHIGASADLEAVAAAIAEQVRVLDGLVRYRFGDNAELMGAWASARNVLGPFKPKAGDQPKAA*
Ga0134123_1044209813300010403Terrestrial SoilMAGRVREFLRAHKTDGVGEGLGLAKLEELVQRGEALASQQRSGIVERRASTKRRRSLRRALQSKLLLYLRAVGAVAAKENAELAVQFQVPPSNASHEALLTMARGMLEKATLHKDVLVNRGMSEQLLGDLAGALGEFEQTIEATRAAQRQHIGASADLEAVAAAIAEQVRVLDGLVRYRFGDNAELMGAWASARNVLGPFKPKAGDQPKAA*
Ga0137457_101075533300011443SoilMAEGVREFLRAHKTDGVGEGLGLAKLEELLQRAEVLAAQQRAGLGARRAATNRRKDLRRALHGKLLLYLRAVGAVAAKENADLAEQFQMPPSNASHKALITMAHGMLEKATAHKELLVQRGMSGQLLDDLTATLGEFEQTVEASLAARRAHIGATTDLWAVAAEISEHLKVLEGLVRYRFGDNAELMGAWASARNVLGPFKPKDEPEAGGGETPKAA*
Ga0137457_103958113300011443SoilLLQRAEVLAAQQRAGLVAKRSATKRRRDLRSALHRTLLLYLRAVGAVAAKQNAELAVQFQMPPSNASHKALIEMARGMLEKATVHKDVLVQNGMSEQLLDDLSTTVNEFEQTVEASLAAKRAHIGATADLWAVTAEITEQLKVLEGLVRYRFGDNAELMGAWTAARNVLGSFKTKNEPEVGGSQTPKAA*
Ga0137389_1102100313300012096Vadose Zone SoilMAGRVREFLPAHKTDGVGEGLGLAKLEELLTRAEVLASQQRSGVVGTRSSTKRRKNLRRALQSKLLLYLRAVGLVAAKETAELGEQFQVPPSNASQQALLTMARGMLEKATAHKDVLVKRGMSEQLLDDLAKTLGEFEQTIEATRAAKREHIGASADLEAVAAEVAEQVKVLDGLVRYRFGDDAELMGAWASARNVLGPFK
Ga0137388_1018628923300012189Vadose Zone SoilMNAQLRRRLEMAGRVREFLRAHKTDGVGEGLGLAKLEELLTRAEVLASQQRSGVVGTRSSTKRRKNLRRALQSKLLLYLRAVGLVAAKETAELGEQFQVPPSNASQQALLTMARGMLEKATAHKDVLVKRGMSEQLLDDLAKTLGEFEQTIEATRAAKREHIGASADLEAVAAEVAEQVKVLDGLVRYRFGDDAELMGAWASARNVLGPFKPKSEPNAGAGGSQTPKAA*
Ga0137382_1015735113300012200Vadose Zone SoilMAGRVREFLRAHKTDGVGEGLGLAKLEELVQRAEVLASQQREGVVAARSATTRRRKVRRALQSKLLLYLRAVGALAAKETAELAEQFQVPPSNASHEALLTMARGMLEKATLHKDVLVKRGMSEQLLGDLALALGELEQTIEATRAGKREHIGASADLEAVAAEIAEQVKVLDGLVRYRFGDNAELMGAWASARNVLGPFKPKTEPGTGGSETPKAA*
Ga0137382_1029054413300012200Vadose Zone SoilMNAQLRRRLEMAGRVREFLRAHKTDGVGEGLGLAKLEELIQRAEVLASQQRAGLVTALSTSTHRKTLRRALQSKLLLYLRAVGAVAAKENAELAVQFEVPPTNASQQELVTMARGMLEKATLHKDVLVKGGMAEQLLGDLAGTIGEFEQTIEATRASRREHVGATADLEAVAAGIAEQVKVLDGLVRYRFGDNAELMGAWRSARNVLGPFKTKNEPEAGGSQTPKAA*
Ga0137365_1011623323300012201Vadose Zone SoilMAGRVRDFLRAHKTDAVGEGLGLAKLEELIARAEALDAQQRAGMVTTRLSTKHREGLRRALQSKLLLYLRALGGLGDPENGEAAVQFQVPPSNASHQALLTAGRDMLEKAKGQKDVLLSRGMPPTLLDDLAGVLGGLEKTIEATRAGRRDHPSLRSGQVVGASADLQAVAAEIKKQVRALDGMVRYRFGDNAELMGAWRSARNVLGPFKTKNEPEASGSQPKAA*
Ga0137399_1002670323300012203Vadose Zone SoilMDGVGEGLGLAKLEELVQRAEVLASQQRAGLVATRSTSKHRKTLRRALQSKLLLYLRAVGAVAAKETAELGEQFQVPPSNASQQALLTMARGMLEKATAHKDVLVKRGMSEQLLDDLAKTLGEFEQTIEATRAAKREHIGASADLEAVAAEVAEQVKVLDGLVRYRFGDDAELMGAWVSARNVLGPFKPKAEPGVGGSQTPKAA*
Ga0137399_1022384913300012203Vadose Zone SoilMNAQLRRRLEMAGRVREFLRAHKTDGVGEGLGLAKLEELVQRAEVLASQQRAGLVATRSTSKHRKALRRALQSKLLLYLRAVGVVAAKENTELATQFQVPPFNASNQALLTMARGMWEKATAHKDILVKRGMSEQLLDDLAKTLGEFEQTIEATRASRREHIGASADLEAVAAEIAEQIRVLDGLVRYRFGDNAELMGAWASARNVLGPFKPKAEPGAGGSQTPKAA*
Ga0137399_1062030313300012203Vadose Zone SoilMNAQLRRRLEMAGRVREFLRAHKTDGVGEGLGLAKLEELIQRAEVLASQQRAGLVTALSTSTHRKTLRRALQSKLLLYLRAVGAVAAKENAELAVQFQVPPTNASQQALVTMARGMLEKATLHKDVLVKGGMAEQLLGDLAGTIGEFEQTIEATRASRREHVGATADLGAVAAEIAEQVKVLDGLVRYRFGENAELMGAWASARNVLGPFKTKNAPGAGGSQTPKAA*
Ga0137376_1006532813300012208Vadose Zone SoilKLEELIQRAEVLASQQRAGLVTALSTSTHRKTLRRALQSKLLLYLRAVGAVAAKENAELAVQFQVPPTNASQQALVTMARGMLEKATLHKDVLVKGGMAEQLLGDLAGTIGEFEQTIEATRASRREHVGATADLGAVAAEIAEQVKVLDGLVRYRFGDNAELMGAWGSARNVLGPFKTKNAPEAGGGQTPKAA*
Ga0137376_1021197223300012208Vadose Zone SoilMAERVREFLRAHKTDGVGEGLGLAKLEVLLQRAEVLAAQQRAGLVARRSATKRRKDLRGALHGKLLLYLRAVGAVAAKENAELAVQFQMPPSNASHQALITMARGMLEKATANKDVLVQRGMSEQLLDDLAGTLGEFEQTVEASRAARREHIGATADLWAVAAEITEQLKVLDGLVRYRFGDSAELMGAWTSARNVLGPFKPKAGDQPKAA*
Ga0137378_1011212743300012210Vadose Zone SoilMAGRVRDFLRAHKTDAVGEGLGLAKLEELIARAEALDAQQRAGIVTTRLSTKHREGLRRALQSKLLLYLRALGGLGDPENGEAAVQFQMPPSNASHQALLTAGRDMLEKAKGQKDVLLSRGMPPTLLDDLAGVLGGLEKTIEATRAGRRDHPSLRSGQVVGASADLQAVAAQIKSQVRALDGMVRYRFGDNAELMGAWRNARNVLGPFKAKNEPEAGGSQPKAA*
Ga0137377_1023704923300012211Vadose Zone SoilMNAQLRRRLEMAGRVREFLRAHKTDGVGEGLGVAKLEELLTRAEVLASQQRSGVVATRSSTKRRKNLRRALQSKLLLYLRAVGAVAAKETAELGEQFQVPPSNASQQALLTMARGMLEKATAHKDVLVKRGMSEQLLDDLAKTLGEFEQTIEATRAAKREHIGASADLEAVAAEVAEQVRVLDGLVRYRFGDDAELMGAWASARNVLGPFKPKSEPGAGGSERPKAA*
Ga0137377_1094466713300012211Vadose Zone SoilMAERVREFLRAHKTDGVGEGLGLAKLEVLLQRAEVLAAQQRAGLVARRSATKRRKDLRAALHGKLLLYLRAVGAVAAKENAELAVQFQMPPSNASHQALITMARGMLEKATANKDVLVQRGMSEQLLDDLAGTLGEFEQTVEASRAARREHIGATADLWAVAAEITEQLKVLDGLVRYRFGDSAELMGAWTSARNVLGPFKPKAGDQPKAA*
Ga0137370_1036828613300012285Vadose Zone SoilLLQRAEVLAAQQRAGLVAKRSATTRRKELRGTLHRKLLLYLRAVGAVAAKENAELAVEFQMPPSNASHKALITMARGMIEKATPHKELLVQRGMSEQLLDDLAKTVDQFEQTVEASQAAKRAHIGATADLWAVTAEITEQLKVLEGAVRYRFGDNAELMGAWNAARNVLGPFKAKSEPEAGGSQTPKAA*
Ga0137371_1005312913300012356Vadose Zone SoilMNAQLRRRLEMAGRVREFLRAHKTDGVGEGLGLAKLEELMQRAELLASQQRAGLVAALSTSKHRKGLRRALQSKLLLYLRAVAAVAAKETAELAVQFQVPPSNASNQALLTMARGMLEKATAHKDVLVKRGMSEQLLDDLAGTLGEFEQTIEATRAAKREHIGASADLEAVAAEVAEQVKVLDGLVRYRFGDDAELMGAWASARNVLGPFKPKTEPGTGGSETPKAA*
Ga0137385_1072013013300012359Vadose Zone SoilMAGRVRDFLRAHKTDAVGEGLGLAKLEELIARAEALDAQQRAGIVTTHLSTKHREGLRRALQSKLLLYLRALGGLGDPENGEAAVQFQVPPSNASHQALLTAGRDMLEKAKGQKDVLLSRGMPPTLLDDLAGVLGGLEKTIEATRAGRRDHPSLRSGQVVGASADLQAVGAEIKKQVRALDGMVRYRFGDNAELMGAWRSARNVLGPFKTKNEPEAGG
Ga0137361_1057552113300012362Vadose Zone SoilMNAQLRRRLEMAGRVREFLRAHKTDGVGEGLGLAKLEELIQRAEVLASQQRAGLVTALSTSKHRKTLRRALQSKLLLYLRAVGAVAAKENAELAVQFQVPPTNASQQALVTMARGMLEKATLHKDVLVKGGMAEQLLGDLAGTIGEFEQTIEATRASRREHVGATADLLAVAAEIAEQVKVLDGLVRYRFGDSAELMGAWASTRNVLGPFKTKNAPEAGRSQTPKAA*
Ga0137390_1020183123300012363Vadose Zone SoilMNAQLRRRLEMAGRVREFLRAHKTDGVGEGLGLAKLEELVQRAEVLASQQRAGLVATRSTSNHRKTLRRALQSKLLLYLRAVGVVAPKENAELAMQFQVPPSNASNQALLTMARGMLEKATAHKDVLVKRGMSEQLLDDLAKTLGEFEQTIEATRAGRREHIGASADLEAVAAEIAEQVRLLDGLVRYRFGDDAELMGAWASARNVLGPFKPKSEPNAGAGGSQTPKAA*
Ga0137398_1053131413300012683Vadose Zone SoilMNAQLRRRLEMAGRVREFLRAHKTDGVGEGLGLAKLEELIQRAEVLASQQRAGLVTALSTSTHRKTLRRALQSKLLLYLRAVGAVAAKENAELAVQFQVPPTNASQQALVTMARGMLEKATLHKDVLVKGGMAEQLLGDLAGTIGEFEQTIEATRASRREHVGATADLDAVAAEIAEQVKVLDGLVRYRFGDSAELMGAWTSARNVLGFFKPKNEP
Ga0137397_1071548413300012685Vadose Zone SoilMNAQLRRRLEMAGRVREFLRAHKTDGVGEGLGLAKLEELIQRAEVLASQQRAGLVTALSTSTHRKTLRRALQSKLLLYLRAVGAVAAKENAELAVQFQVPPTNASQQALVTMARGMLEKATLHKDVLVKGGMAEQLLGDLAGTIGEFEQTIEATRASRREHIGATADLLAVAAGIAEQVKVLDGLVRYRFGDNAELMGAWGSARNVLGPFKTKNAPGAGGSQ
Ga0137395_1023484423300012917Vadose Zone SoilMNAQLRRRLEMAGRVREFLRAHKTDGVGEGLGLAKLEELIQRAEVLASQQRAGLVTALSTSKHRKTLRRALQSKLLLYLRAVGAVAAKENAELAVQFQVPPTNASQQALVTMARGMLEKATLHKDVLVKGGMAEQLLGDLAGTIGEFEQTIEATRASRREHVGATADLGAVAAEIAEQVKVLDGLVRYRFGDNAELMGAWASARNAPACEASWQCTPGRWWT*
Ga0137396_1006986433300012918Vadose Zone SoilMNAQLRRRLEMAGRVREFLRAHKTDGVGEGLGLAKLEELIQRAEVLASQQRAGLVTALSTSTHRKTLRRALQSKLLLYLRAVGAVAAKENAELAVQFQVPPTNASQQALVTMARGMLEKATLHKDVLVKGGMAEQLLGDLAGTIGEFEQTIEATRASRREHVGATADLDAVAAEIAEQVKVLDGLVRYRFGDSAELMGAWGSARNVLGPFKAKNAPDAGGSQTPKAA*
Ga0137396_1012164423300012918Vadose Zone SoilMDGVGEGLGLAKLEELVQRAEVLASQQRAGLVATRSTSKHRKTLRRALQSKLLLYLRAVGVVAAKENAELAMQFQVPPSNASNQALLTMARGMLEKATAHKDVLVKRGMSEQLLDDLAKTLGEFEQTIEATRASRREHIGASADLEAVAAEIAEQVKVLDGLVRYRFGDDAELMGAWASARNVLGPFKPKNAPEAGGTQTPKAA*
Ga0137394_1010416833300012922Vadose Zone SoilMNAQLRRRLEMAGRVREFLRAHKTDGVGEGLGLAKLEELIQRAEVLASQQRAGLVTALSTSTHRKTLRRALQSKLLLYLRAVGAVAAKENAELAVQFQVPPTNASQQALVTMARGMLDKATLHKDVLVKGGMAEQLLGDLAGTIGEFEQTIEATRASRREHVGATADLEAVAAEIAEQVKVLDGLVRYRFGDSAELMGAWSSARNVLGPFKTKNAPGAGGSQTPKAA*
Ga0137419_1040093013300012925Vadose Zone SoilMNAQLRRRLEMAGRVREFLRAHKTDGVGEGLGLAKLEELIQRAEVLASQQRAGLVTALSTSTHRKTLRRALQSKLLLYLRAVGAVAAKENAELAVQFQVPPTNASQQALVTMARGMLEKATLHKDVLVKGGMAEQLLGDLAGTIGEFEQTIEATRASRREHIGATADLLAVAAEIAEQVKVLDGLVRYRFGDSAELMGAWASARNVLGPFKTKNAPGAGGSQTPKAA*
Ga0137416_1024640713300012927Vadose Zone SoilMDGVGEGLGLAKLEELLTRAEVLASQQRAGLVATRSTSKHRKTLRRALQSKLLLYLRAVGAVAAKETAELGEQFQVPPSNASQQALLTMARGMLEKATAHKDVLVKRGMSEQLLDDLAKTLGEFEQTIEATRAAKREHIGASADLEAVAAEVAEQVKVLDGLVRYRFGDDAELMGAWASARNVLGPFKPKTEPGAGGSQTLREAQGRPP
Ga0137416_1084195613300012927Vadose Zone SoilEFFRAHKTDGVGEGLGLAKLEELIQRAEVLASQQRAGLVTALSTSTHRKTLRRALQSKLLLYLRAVGAVAAKENAELAVQFQVPPTNASQQALVTMARGMLEKATLHKDVLVKGGMAEQLLGDLAGTIGEFEQTIEATRASRREHIGATADLLAVAAEIAEQVKVLDGLVRYRFGDSAELMGAWSSARNVLGPFKTKNAPGAGGSQTPKAA*
Ga0137407_1108951113300012930Vadose Zone SoilMNAQLRRRLEMAGRVREFLRAHKTDGVGEGLGLAKLEELIQRAEVLASQQRAGLVTALSTSTHRKTLRRALQSKLLLYLRAVGAVAAKENAELAVQFQVPPTNASQQALVTMARGMLEKATLHKDVLVKGGMAEQLLGDLAGTIGEFEQTIEATRASRREHIGATADLLAVAAGIAEQVKVLDGLVRYRFGDSAELMGAWASARNVLGPFKTKNEPE
Ga0137410_1016116923300012944Vadose Zone SoilMNAQLRRRLEMAGRVREFLRAHKTDGVGEGLGLAKLEELIQRAEVLASQQRAGLVTALSTSTHRKTLRRALQSKLLLYLRAVGAVAAKENAELAVQFQVPPTNASQQALVTMARGMLEKATLHKDVLVKGGMAEQLLGDLAGTIGEFEQTIEATRASRREHIGATADLLAVAAEIAEQVKVLDGLVRYRFGDSAELMGAWSSARNVLGPFKTKNAPEAGGSQTPKAA*
Ga0134077_1013904813300012972Grasslands SoilMAGRVREFLRAHKTDGVGEGLGLAKLEELMQRAELLASQQRAGLVAALSTSKHRKGLRRALQSKLLLYLRAVAAVAAKETAELAVQFQVPPSNASNQALLTMARGMLEKATAHKDVLVKRGMSEQLLDDLAGTLGEFEQTIEATRASRREHIGATADLEAVAAEVAEQVRVLDGLVRYRFGDDAELMGAWA
Ga0134077_1035574513300012972Grasslands SoilMAGRVRDFLRAHKTDAVGEGLGLAKLEELIARAEALDAQQRGGMVTTRLSTKHREGLRRALQSKLLLYLRALGGLGDPENGEAAVQFQVPPSNASHQALLTAGRDMLEKAKGQTDVLLSRGMPPTLLDDLAGVLGGLEKTIEATRAGRRDHPSLRSGQVVGASADLVAVAAQIKSQVRALDGMVRYRFGDNAELMGAWRSARN
Ga0134075_1030529713300014154Grasslands SoilAEALDAQQRAGIVPSRLSTKHREGLRRELQRKLLLYLRALGGLGDPENGEAAVQFQVPPSNASHLALLTAGRDMLEKATGQKDVLLSRGMPPTLLDDLAGVLGGLEKTIEATRAGRRDHPSLRSGQVVGASADLQAVAAQIKSQVRALDGMVRYRFGDNAELMGAWRSARNVLGPFKAKNEPEAGGSQTPKAA*
Ga0137411_115598313300015052Vadose Zone SoilLIQRAEVLASQQRAGLVTALSTSTHRKTLRRALQSKLLLYLRAVGAVAAKENAELAVQFQVPPTNASQQALVTMARGMLEKATLHKDVLVKGGMAEQLLGDLAGTIGEFEQTIEATRASRREHIGATADLLAVAAGIAEQVKVLDGLVRYRFGENAELMGAWGSA
Ga0137420_100740613300015054Vadose Zone SoilMNAQLRRRLEMAGRVREFLRAHKTDGVGEGLGLAKLEELIQRAEVLASQQRAGLVTALSTSTHRKTLRRALQSKLLLYLRAVGAVAAKENAELAVQFQVPPTNASQQALVTMARGMLEKATLHKDVLVKGGMAEQLLGDLAGTIGEFEQTIEATRASRREHVGATADLGAVAAEIAEQVKVCSTGWCGIGLEITRS*
Ga0137418_1015299033300015241Vadose Zone SoilMNAQLRRRLEMAGRVREFLRAHKTDGVGEGLGLAKLEELIQRAEVLASQQRAGLVTALSTSTHRKTLRRALQSKLLLYLRAVGAVAAKENAELAVQFQVPPTNASQQALVTMARGMLEKATLHKDVLVKGGMAEQLLGDLAGTIGEFEQTIEATRASRREHVGATADLGAVAAEIAEQVKVLDGLVRYRFGDNAELMG
Ga0137409_1003739023300015245Vadose Zone SoilMNAQLRRRLEMAGRVREFLRAHKTDGVGEGLGLAKLEELIQRAEVLASQQRAGLVTALSTSTHRKTLRRALQSKLLLYLRAVGAVAAKENAELAVQFQVPPTNASQQALVTMARGMLEKATLHKDVLVKGGMAEQLLGDLAGTIGEFEQTIEATRASRREHIGATADLLAVAAEIAEQVKVLDGLVRYRFGDNAELMGAWRSARNVLGPFKTKNAPEAGGSQTPKAA*
Ga0137409_1021358513300015245Vadose Zone SoilMAARVREFLRAHKTDGVGEGLGLAKLEELVQRGEVLASQQRSGIVERRSSTKRRQNLRRALQSKLLLYLRAVGAVAAKENAELAVQFQVPPSNASHEALLTMARGMLEKATLHKDVLVKRGMSEQLLGDLAGALGEFEQTIEATRAAQRQHIGASADLAAVAAEIAEQVRVLDGLVRYRFGDNAELMGAWASARNVLGPFKPKAGDQPKAA*
Ga0134085_1007748513300015359Grasslands SoilMAGRVREFLRAHKTDGVGEGLGVAKLEELLTRAEVLASQQRSGVVATRSSTKRRKNLRRALQSKLLLYLRAVAAVAAKETAELAVQFQVPPSNASNQALLTMARGMLEKATAHKDVLVKRGMSEQLLDDLAGTLGEFEQTIEATRAAKREHIGASADLEAVAAEVAEQVRVLDGLVRYRFGDDAELMGAWASARNVLGPFKPKTEPGAGGSETPKAA*
Ga0134069_136216613300017654Grasslands SoilREFLRAHKTDGVGEGLGLAKLEELVQRAEVLASQQREGVVAARSATTRRRKVRRALQSKLLLYLRAVGAVAAKETAELAEQFQVPPSNASHEALLTMARGMLEKATLHKDVLVKRGMSEQLLGDLALALGELEQTIEATRAGKREHIGASADLEAVAAEIAEQVRVLDGLVRY
Ga0134074_111815213300017657Grasslands SoilTDGVGEGLGLAKLEELMQRAELLASQQRAGLVAALSTSKHRKGLRRALQSKLLLYLRAVAAVAAKETAELAVQFQVPPSNASNQALLTMARGMLEKATAHKDVLVKRGMSEQLLDDLAGTLGEFEQTIEATRAAKREHIGASADLEAVAAEVAEQVRVLDGLVRYRFGDDAELMGAWASARNVLGPFKPKTELGAGGSQTPKAA
Ga0134074_116149313300017657Grasslands SoilMNAQLRRRLEMAGRVREFLRAQKTDGVGEGLGLAKLEELVQRAEVLASQQREGVVAARSATTRRRKVRRALQSKLLLYLRAVGALAAKETSELAEQFQVPPSNASHEALLTMARGMLEKATLHKDVLVKRGMSEQLLGDLALALGELEQTIEATRAGKREHIGASADLEAVAAEIAEQVKVLDGLVRYRFGDDAELMGAWASARNVLGPFKPKTEPGTGGSETPKAA
Ga0066669_1029921513300018482Grasslands SoilMNAQLRRRLEMAGRVREFLRAHKTDGVGEGLGLAKLEELVQRAEVLASQQREGVVAARSATTRRRKVRRALQSKLLLYLRAVGAVAAKETAELAEQFQVPPSNASHEALLTMARGMLEKATLHKDVLVKRGMSEQLLGDLALALGELEQTIEATRAGKREHIGASADLEAVAAEIAEQVKVLDGLVRYRFGDNAELMGAWAGARNVLGPFKPKTEPGTGGSETTKAAGWKHQRL
Ga0193712_106618613300019880SoilPLSVPERGRSRSLRYAQGRLFAPAALRMRVREFLRAHKTDGVGEGLGLAKLEELVQRGEVLASQQRSGIVANRSSTKRRRNLRRALQSKLLLYLRAVGAVAAKENAELAVQFQVPPSNASHEALLTMARGMLEKATLHKDVLVNRGMSEQLLGDLAGALGEFELTIEATRAAQRQHIGASADLEAVAAEIAEQVRVLDGLVRYRFGDNAELMGAWASARNVLGPFKPKAGDQPKAA
Ga0193711_101222323300019997SoilMNAQLRRRLEMAGRVREFLRAHKTDGVGEGLGLAKLEELVQRGEVLASQQRSGIVEKRSSTKRRRNLRRALQSKLLLYLRAVGAVAAKENAELAVQFQVLPSNASHEALLTMARGMLEKATLHKDVLVNRGMSEQLLGDLAGALGKFELTIEATRAAQRQHIGASADLEAVAAEIAEQVRVLDGLVRYRFGDNA
Ga0207660_1110317313300025917Corn RhizosphereECLKSRSFAPAALRRRVREFLRAHKTDGVVEGLGFAKLEELVQRGEALASQQRSGIVERRSSTKRRNNLRRALQTKLLLYLRAVGAVAAKENAELAVQFQVPPSNASHEALLTMARGMLEKATLHKDVLVNRGMSEQLLGDLAGALGEFEQTIEATRAAQRQHIGASADLEAVAAEIAERVRLLDGLVRYRFGDDPALMQGWFSARDVLGPFRTK
Ga0209350_101219223300026277Grasslands SoilMNAQLRRRLEMAGRVREFLRAHKMDGVGEGLGLAKLEELLTRAEVLASQQRSGVVATRSSTKRRKNLRRALQSKLLLYLRAVGAVAAKETAELGEQFQVPPSNASQQALLTMARGMLEKATAHKDVLVKRGMSEQLLDDLAKTLGEFEQTIEATRAAKREHIGASADLEAVAAEVAEQVKVLDGLVRYRFGDDAELMGAWASARNVLGPFKPKAGDQPKAA
Ga0209234_103165023300026295Grasslands SoilMNAQLRRRLEMAGRVREFLRAHKMDGVGEGLGLAKLEELLTRAEVLASQQRSGVVATRSSTKRRKNLRRALQSKLLLYLRAVGAVAAKETAELGEQFQVPPSNASQQALLTMARGMLEKATAHKDVLVKRGMSEQLLDDLAKTLGEFEQTIEATRAAKREHIGASADLEAVAAEVAEQVRVLDGLVRYRFGDDAELMGAWASARNVLGPFKPKAGDQPKAA
Ga0209235_1000668193300026296Grasslands SoilVLASQQRSGVVATRSSTKRRKNLRRALQSKLLLYLRAVGAVAAKETAELGEQFQVPPSNASQQALLTMARGMLEKATTHKDVLVKRGMSEQLLDDLAQTLGEFEQTIEATRAAKREHIGASADLEAVAAEVAEQVRVLDGLVRYRFGDDAELMGAWASARNVLGPFKPKNEPGAGGTQTPKAA
Ga0209761_108904413300026313Grasslands SoilLASQQRSGVVATRSSTKRRKNLRRALQSKLLLYLRAVGAVAAKETAELGEQFQVPPSNASQQALLTMARGMLEKATAHKDVLVKRGMSEQLPDDLAQTLGEFEQTIEATRAAKREHIGASADLEAVAAEVAEQVRVLDGLVRYRFGDDAELMGAWASARNVLGPFKPKNEPGAGGTQTPKAA
Ga0209266_103179263300026327SoilMNAQLRRRLEMAGRVREFLRAHKTDGVGEGLGLAKLEELMQRAELLASQQRAGLVAALSTSKHRKGLRRALQSKLLLYLRAVAAVAAKETAELAVQFQVPPSNASNQALLTMARGMLEKATAHKDVLVKRGMSEQLLDDLAGTLGEFEQTIEATRAAKREHIGASADLEAVAAEVAEQVRVLDGLVRYRFGDDAELMGAWASARNVLGPFKPKTEPGTGGSETLREAQGRPPKAA
Ga0209802_105091923300026328SoilMNAQLRRRLEMAGRVREFLRAHKTDGVGEGLGLAKLEELLTRAEVLASQQRSGVVATRSSTKRRKNLRRALQSKLLLYLRAVGAVAAKETAELGEQFQVPPSNASQQALLTMARGMLEKATAHKDVLVKRGMSEQLLDDLAKTLGEFEQTIEATRAAKREHIGASADLEAVAAEVAEQVRVLDGLVRYRFGDDAELMGAWASARNVLGPFKPKGLFCSPRPRAALEQRTASPRYASNRPSQLSLRGCFPGAFGARHGRIDRPATSLLS
Ga0209158_116616413300026333SoilMNAQLRRRLEMAGRVREFLRAHKMDGVGEGLGLAKLEELLTRAEVLASQQRSGVVATRSSTKRRKNLRRALQSKLLLYLRAVGAVAAKETAELGEQFQVPPSNASQQALLTMARGMLEKATAHKDVLVKRGMSEQLLDDLAKTLGEFEQTIEATRAAKREHIGASADLEAVAAEVAEQVRVLDGLVRYRFGDDAE
Ga0209377_110643313300026334SoilVRARARVWWSDLGKGARASAMFTVTARTPTLCHNEGGRMNAQLRRRLEMAGRVREFLRAHKTDGVGEGLGLAKRAEVLASQQREGVVAARSATTRRRKVRRALQSKLLLYLRAVGALAAKETAELAEQFQVPPSNASHEALLTMARGMLEKATLHKDVLVKRGMSEQLLGDLALALGELEQTIEATRAGKREHIGASADLEAVAAEIAEQVKVLDGLVRYRFGDNAELMGAWAGARNVLGPFKPKGEPGAGDGQTPKAA
Ga0209378_119587113300026528SoilNAQLRRRLEMAGRVREFLRAHKTDGVGEGLGLAKLEELMQRAELLASQQRAGLVAALSTSKHRKGLRRALQSKLLLYLRAVAAVAAKETAELAVQFQVPPSNASNQALLTMARGMLEKATAHKDVLVKRGMSEQLLDDLAKTLGEFEQTIEATRAAKREHIGASADLEAVAAEVAEQVRVLDGLVRYRFGDDAELMGAWASARNVLGPFKPKGLFCSPRPRAALEQ
Ga0209160_104122723300026532SoilMNAQLRRRLEMAGRVREFLRAHKMDGVGEGLGLAKLEELLTRAEVLASQQRSGVVATRSSTKRRKNLRRALQSKLLLYLRAVGAVAAKETAELGEQFQVPPSNASQQALLTMARGMLEKATAHKDVLVKRGMSEQLLDDLAKTLGEFEQTIEATRAAKREHIGASADLEAVAAEVAEQVKVLDGLVRYRFGDDAS
Ga0209058_103838423300026536SoilMNAQLRRRLEMAGRVRDFLRAHKTDGVGEGLGLAKLEELIARAEALDAQQRGGMVTTRLSTKHREGLRRALQSKLLLYLRALGGLGDPENGEAAVQFQVPPSNASHQALLTAGRDMLEKAKGQTDVLLSRGMPPTLLDDLAGALGGLEQTIESTRAGRRDHPSLRSGQVVGASADLQTVAAEIKKQVRALDGMVRYRFGDNTELMGAWRSARNVLGPFKTKNEPEAGGSQTPKAA
Ga0209157_102488433300026537SoilMNAQLRRRLEMAGRVRDFLRAHKTDGVGEGLGLAKLEELIARAEALDAQQRAGMVTTRLSTKHREGLRRALQSKLLLYLRALGGLGDPENGEAAVQFQVPPSNASHQALLTAGRDMLEKAKGQTDVLLSRGMPPTLLDDLAGALGGLEQTIESTRAGRRDHPSLRSGQVVGASADLQTVAAEIKKQVRALDGMVRYRFGDNTELMGAWRSARNVLGPFKTKNEPEAGGSQTPKAA
Ga0209157_118570513300026537SoilGLAKLEELMQRAELLASQQRAGLVAALSTSKHRKGLRRALQSKLLLYLRAVAAVAAKETAELAVQFQVPPSNASNQALLTMARGMLEKATAHKDVLVKRGMSEQLLDDLAGTLGEFEQTIEATRASRREHIGATADLEAVAAEVAEQVRVLDGLVRYRFGDDAELMGAWASARNVLGPFKPKTEPGTGGSETLREAQGRPPKAA
Ga0179593_108684023300026555Vadose Zone SoilLRAHKTDGVGEGLGLAKLEQLIQRAEVLASQQRAGLVTALSTSTHRKTLRRALQSKLLLYLRAVGAVAAKENAELAVQFQVPPTNASQQALVTMARGMLEKATLHKDVLVKGGMAEQLLGDLAGTIGEFEQTIEATRASRREHVGATADLDAVAAEIAEQVKVLDGLVRLPVWGERGADGGVGQCPQRPRPVQNQERARGRW
Ga0209488_1000353363300027903Vadose Zone SoilMNALLRRRLEAAARVRDFLRAHKTDAVGEGLGLGKLEELITRAEALDEQQRAGVVTARLSTKHRKGLRRALQSKLLLYLRALGGLGDPENGEAAIQFQVPPSNASHQALLTTGRDTLTKALGQKDVLLARGMPPALLDDLAAALGGLEQTMADTRTGRRDHVGASSDLAAVGVEIVKQVRALDGMVRYRFGDNAELMGAWRSARNVLGPFRSKNGPESGAGGSQTPKAA
Ga0209488_1010162823300027903Vadose Zone SoilMNAQLRRRLEMAERVQEFLRIHRTDGVGEGLGLAKLEELLQRAVVLAAQQRAGLLARRSASKRRRTLRGELHGKLLLYLRAVGAAAAKENEQMATEFQAPPSNASHKALVTMARGMLEKATAHKELLVQRGMSEHLLDDLARTIDEFEQTVGASLAARREHIGATTDLWAVAAEITEQLKVLEGIVRYRFGENAELMGAWAAARNVLGPFKPKAEPGAGGQTPKAA
Ga0137415_1005726133300028536Vadose Zone SoilMNAQLRRRLEMAGRVREFLRAHKMDGVGEGLGLAKLEELVQRAEVLASQQRAGLVATRSTSKHRKTLRRALQSKLLLYLRAVGAVAAKETAELGEQFQVPPSNASQQALLTMARGMLEKATAHKDVLVKRGMSEQLLDDLAKTLGEFEQTIEATRAAKREHIGASADLEAVAAEVAEQVKVLDGLVRYRFGDDAELMGAWASARNVLGPFKPKAEPGVGGSQTPKAA




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice