NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

Metagenome / Metatranscriptome Family F071297



Overview

Basic Information
Family ID F071297
Family Type Metagenome / Metatranscriptome
Number of Sequences 122
Average Sequence Length 119 residues
Representative Sequence HTAIFIKNDLGQSQTYYALTARPESTDSDLYFTALIKHSAVPVHSGWKAWLHKRAVQRKAVETLTAHGIHGIKISLDDTSLAEDISTALKKGKLKKVFDNVSLLPFSKTSSRI
Number of Associated Samples 111
Number of Associated Scaffolds 122

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 0.82 %
% of genes near scaffold ends (potentially truncated) 98.36 %
% of genes from short scaffolds (< 2000 bps) 94.26 %
Associated GOLD sequencing projects 106
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.66

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
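
The percentages in the Quality Assessment block are fractions of the family's 122 members. As a rough check, the Python sketch below maps each reported percentage back to the member count it implies. The helper names `implied_count` and `as_percent` are hypothetical, and the counts are inferred from the rounded percentages; they are not values published by NMPFamsDB.

```python
# Assumption: every percentage is computed over the 122 family members
# ("Number of Sequences" in the Basic Information block).
FAMILY_SIZE = 122


def implied_count(percent: float, total: int = FAMILY_SIZE) -> int:
    """Nearest whole number of members that a reported percentage implies."""
    return round(percent / 100 * total)


def as_percent(count: int, total: int = FAMILY_SIZE, digits: int = 2) -> float:
    """Percentage of `total` represented by `count`, rounded like the page."""
    return round(100 * count / total, digits)


# Values copied from the Quality Assessment block.
reported = {
    "valid RBS motifs": 0.82,
    "near scaffold ends": 98.36,
    "short scaffolds (< 2000 bps)": 94.26,
}

# Inferred counts (an illustration, not data from the database).
counts = {label: implied_count(pct) for label, pct in reported.items()}

for label, pct in reported.items():
    n = counts[label]
    print(f"{label}: {pct} % ~ {n}/{FAMILY_SIZE} (recomputed {as_percent(n)} %)")
```

Under these assumptions the three values correspond to 1, 120 and 115 of the 122 members, and recomputing each percentage from its count reproduces the reported figure.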

Most Common Taxonomy
Group Bacteria (63.115 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (22.951 % of family members)
Environment Ontology (ENVO) Unclassified (42.623 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere (35.246 % of family members)




Multiple Sequence Alignments


Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 30.50%, β-sheet: 17.02%, Coil/Unstructured: 52.48%

Predicted 3D Structure

pTM-score: 0.66

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 122 Family Scaffolds
PF13659 | Obsolete Pfam Family | 16.39
PF04851 | ResIII | 3.28
PF05175 | MTS | 3.28
PF02683 | DsbD | 2.46
PF03104 | DNA_pol_B_exo1 | 0.82
PF00176 | SNF2-rel_dom | 0.82
PF02384 | N6_Mtase | 0.82
PF01464 | SLT | 0.82

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 122 Family Scaffolds
COG0417 | DNA polymerase B elongation subunit | Replication, recombination and repair [L] | 0.82



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 63.11 %
Unclassified | root | N/A | 36.89 %


Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
2067725003|GPWSG_F5G3JLY01CZ8BC | Not Available | 543
3300000364|INPhiseqgaiiFebDRAFT_101399732 | All Organisms → cellular organisms → Bacteria → Nitrospirae | 931
3300000956|JGI10216J12902_115228615 | All Organisms → cellular organisms → Bacteria → Nitrospirae | 680
3300004062|Ga0055500_10073720 | All Organisms → cellular organisms → Bacteria → Nitrospirae | 722
3300005167|Ga0066672_10417370 | All Organisms → cellular organisms → Bacteria → Nitrospirae | 878
3300005181|Ga0066678_10145175 | All Organisms → cellular organisms → Bacteria | 1480
3300005186|Ga0066676_10188565 | All Organisms → cellular organisms → Bacteria → Nitrospirae | 1319
3300005293|Ga0065715_10999715 | All Organisms → cellular organisms → Bacteria | 545
3300005294|Ga0065705_10412405 | All Organisms → cellular organisms → Bacteria → Nitrospirae | 868
3300005295|Ga0065707_10336188 | All Organisms → cellular organisms → Bacteria → Nitrospirae | 910
3300005332|Ga0066388_102840317 | All Organisms → cellular organisms → Bacteria → Nitrospirae | 885
3300005444|Ga0070694_100571393 | All Organisms → cellular organisms → Bacteria | 907
3300005446|Ga0066686_10631707 | Not Available | 728
3300005447|Ga0066689_10782460 | Not Available | 594
3300005536|Ga0070697_101799113 | All Organisms → cellular organisms → Bacteria | 548
3300005555|Ga0066692_10371971 | All Organisms → cellular organisms → Bacteria → Nitrospirae | 906
3300005598|Ga0066706_10262854 | All Organisms → cellular organisms → Bacteria → Nitrospirae | 1347
3300005983|Ga0081540_1079409 | All Organisms → cellular organisms → Bacteria | 1483
3300005983|Ga0081540_1231974 | All Organisms → cellular organisms → Bacteria → Nitrospirae | 651
3300006046|Ga0066652_100587632 | All Organisms → cellular organisms → Bacteria → Nitrospirae | 1048
3300006354|Ga0075021_10171291 | All Organisms → cellular organisms → Bacteria | 1318
3300006797|Ga0066659_11354572 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Anaerolineae → Anaerolineales → unclassified Anaerolineales → Anaerolineales bacterium | 594
3300006800|Ga0066660_11323830 | Not Available | 564
3300006852|Ga0075433_10706333 | All Organisms → cellular organisms → Bacteria | 883
3300006865|Ga0073934_10391532 | All Organisms → cellular organisms → Bacteria → Nitrospirae | 857
3300006871|Ga0075434_101007531 | All Organisms → cellular organisms → Bacteria → Nitrospirae | 847
3300006871|Ga0075434_101039149 | All Organisms → cellular organisms → Bacteria → Nitrospirae | 833
3300006969|Ga0075419_10219467 | Not Available | 1263
3300006969|Ga0075419_11399804 | Not Available | 522
3300009012|Ga0066710_104740065 | Not Available | 508
3300009088|Ga0099830_10641187 | All Organisms → cellular organisms → Bacteria → Nitrospirae | 873
3300009088|Ga0099830_11851132 | Not Available | 504
3300009137|Ga0066709_104517336 | Not Available | 508
3300009157|Ga0105092_10894924 | Not Available | 524
3300009168|Ga0105104_10551509 | All Organisms → cellular organisms → Bacteria → Nitrospirae | 652
3300009792|Ga0126374_10279569 | All Organisms → cellular organisms → Bacteria → Nitrospirae | 1107
3300009811|Ga0105084_1085048 | Not Available | 586
3300010043|Ga0126380_11858033 | Not Available | 546
3300010046|Ga0126384_10168172 | All Organisms → cellular organisms → Bacteria | 1705
3300010047|Ga0126382_11332020 | Not Available | 651
3300010047|Ga0126382_11415328 | Not Available | 635
3300010359|Ga0126376_10030252 | All Organisms → cellular organisms → Bacteria | 3651
3300011107|Ga0151490_1569549 | Not Available | 501
3300011269|Ga0137392_11170142 | Not Available | 627
3300011437|Ga0137429_1068637 | All Organisms → cellular organisms → Bacteria → Nitrospirae | 1051
3300011444|Ga0137463_1211456 | Not Available | 726
3300011445|Ga0137427_10175108 | Not Available | 888
3300012022|Ga0120191_10074489 | All Organisms → cellular organisms → Bacteria → Nitrospirae | 649
3300012035|Ga0137445_1091041 | Not Available | 617
3300012200|Ga0137382_11110246 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Anaerolineae → Anaerolineales → unclassified Anaerolineales → Anaerolineales bacterium | 565
3300012204|Ga0137374_10568774 | All Organisms → cellular organisms → Bacteria → Nitrospirae | 869
3300012204|Ga0137374_11141930 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Anaerolineae → Anaerolineales → unclassified Anaerolineales → Anaerolineales bacterium | 551
3300012206|Ga0137380_10551374 | All Organisms → cellular organisms → Bacteria | 1012
3300012207|Ga0137381_10297278 | All Organisms → cellular organisms → Bacteria | 1406
3300012208|Ga0137376_11486236 | Not Available | 568
3300012209|Ga0137379_10842430 | All Organisms → cellular organisms → Bacteria → Nitrospirae | 821
3300012228|Ga0137459_1018355 | All Organisms → cellular organisms → Bacteria | 1950
3300012350|Ga0137372_11088658 | Not Available | 550
3300012351|Ga0137386_10599243 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 793
3300012353|Ga0137367_10266101 | All Organisms → cellular organisms → Bacteria → Nitrospirae | 1231
3300012353|Ga0137367_10843366 | All Organisms → cellular organisms → Bacteria | 634
3300012353|Ga0137367_10972814 | All Organisms → cellular organisms → Bacteria → Nitrospirae | 581
3300012354|Ga0137366_10694707 | Not Available | 725
3300012355|Ga0137369_11074982 | All Organisms → cellular organisms → Bacteria → Nitrospirae | 528
3300012356|Ga0137371_10276489 | All Organisms → cellular organisms → Bacteria | 1310
3300012358|Ga0137368_10696651 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Anaerolineae → Anaerolineales → unclassified Anaerolineales → Anaerolineales bacterium | 639
3300012360|Ga0137375_10183681 | All Organisms → cellular organisms → Bacteria | 2001
3300012361|Ga0137360_10191687 | All Organisms → cellular organisms → Bacteria → Nitrospirae | 1649
3300012532|Ga0137373_10992056 | All Organisms → cellular organisms → Bacteria → Nitrospirae | 609
3300012924|Ga0137413_11356757 | Not Available | 573
3300012929|Ga0137404_11720597 | All Organisms → cellular organisms → Bacteria → Nitrospirae | 582
3300012930|Ga0137407_11335098 | All Organisms → cellular organisms → Bacteria → Nitrospirae | 681
3300012930|Ga0137407_11986187 | Not Available | 555
3300012961|Ga0164302_10771794 | All Organisms → cellular organisms → Bacteria → Nitrospirae | 722
3300012972|Ga0134077_10420291 | All Organisms → cellular organisms → Bacteria → Nitrospirae | 580
3300014154|Ga0134075_10182558 | All Organisms → cellular organisms → Bacteria → Nitrospirae | 901
3300014885|Ga0180063_1073969 | Not Available | 1011
3300015242|Ga0137412_10505272 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 925
3300015357|Ga0134072_10345545 | Not Available | 569
3300015374|Ga0132255_105684997 | Not Available | 528
3300018053|Ga0184626_10001939 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 7464
3300018054|Ga0184621_10260252 | Not Available | 617
3300018056|Ga0184623_10216164 | All Organisms → cellular organisms → Bacteria → Nitrospirae | 880
3300018072|Ga0184635_10074920 | All Organisms → cellular organisms → Bacteria | 1324
3300018076|Ga0184609_10240071 | All Organisms → cellular organisms → Bacteria → Nitrospirae | 847
3300018078|Ga0184612_10513506 | Not Available | 583
3300018079|Ga0184627_10406971 | Not Available | 708
3300018433|Ga0066667_10025092 | All Organisms → cellular organisms → Bacteria | 3250
3300019881|Ga0193707_1041450 | Not Available | 1487
3300019889|Ga0193743_1005860 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 7639
3300020003|Ga0193739_1055606 | All Organisms → cellular organisms → Bacteria → Nitrospirae | 1016
3300020004|Ga0193755_1180851 | Not Available | 618
3300020022|Ga0193733_1172088 | Not Available | 573
3300021073|Ga0210378_10177839 | All Organisms → cellular organisms → Bacteria → Nitrospirae | 817
3300021082|Ga0210380_10201602 | All Organisms → cellular organisms → Bacteria → Nitrospirae | 899
3300021344|Ga0193719_10354785 | Not Available | 610
3300025155|Ga0209320_10235492 | All Organisms → cellular organisms → Bacteria | 782
3300025289|Ga0209002_10359363 | Not Available | 840
3300025318|Ga0209519_10090432 | All Organisms → cellular organisms → Bacteria | 1791
3300025322|Ga0209641_11088274 | Not Available | 511
3300025324|Ga0209640_10024117 | All Organisms → cellular organisms → Bacteria | 5310
3300025324|Ga0209640_10464648 | All Organisms → cellular organisms → Bacteria | 1036
3300025325|Ga0209341_10350460 | All Organisms → cellular organisms → Bacteria | 1206
3300025326|Ga0209342_10431692 | All Organisms → cellular organisms → Bacteria | 1111
3300026313|Ga0209761_1338912 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Anaerolineae → Anaerolineales → unclassified Anaerolineales → Anaerolineales bacterium | 510
3300026327|Ga0209266_1294515 | Not Available | 511
3300026343|Ga0209159_1223443 | Not Available | 581
3300027722|Ga0209819_10192275 | Not Available | 712
3300027835|Ga0209515_10584616 | Not Available | 563
3300027862|Ga0209701_10347965 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 839
3300027873|Ga0209814_10016324 | All Organisms → cellular organisms → Bacteria | 2978
3300027880|Ga0209481_10361201 | Not Available | 742
3300027894|Ga0209068_10488204 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 709
3300030006|Ga0299907_10421892 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Anaerolineae → Anaerolineales → unclassified Anaerolineales → Anaerolineales bacterium | 1068
3300031576|Ga0247727_11085887 | Not Available | 545
3300031720|Ga0307469_10848367 | All Organisms → cellular organisms → Bacteria → Nitrospirae | 842
3300031740|Ga0307468_101330452 | Not Available | 656
3300031820|Ga0307473_10653880 | All Organisms → cellular organisms → Bacteria → Nitrospirae | 732
3300031879|Ga0306919_11291894 | All Organisms → cellular organisms → Bacteria → Nitrospirae | 553
3300031965|Ga0326597_10843764 | All Organisms → cellular organisms → Bacteria | 944
3300031965|Ga0326597_11417960 | Not Available | 672
3300033407|Ga0214472_11250992 | Not Available | 645




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 22.95%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 12.30%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 8.20%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 5.74%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 5.74%
Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 4.92%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 4.92%
Soil | Environmental → Terrestrial → Soil → Loam → Unclassified → Soil | 4.10%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 3.28%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 3.28%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Freshwater Sediment | 2.46%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 2.46%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 2.46%
Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 1.64%
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 1.64%
Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Switchgrass Rhizosphere | 1.64%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 1.64%
Tabebuia Heterophylla Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Tabebuia Heterophylla Rhizosphere | 1.64%
Groundwater | Environmental → Aquatic → Freshwater → Groundwater → Contaminated → Groundwater | 0.82%
Natural And Restored Wetlands | Environmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands | 0.82%
Hot Spring Sediment | Environmental → Aquatic → Thermal Springs → Sediment → Unclassified → Hot Spring Sediment | 0.82%
Terrestrial | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial | 0.82%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 0.82%
Soil | Environmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil | 0.82%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 0.82%
Groundwater Sand | Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand | 0.82%
Biofilm | Environmental → Terrestrial → Cave → Unclassified → Unclassified → Biofilm | 0.82%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 0.82%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Miscanthus Rhizosphere | 0.82%




Associated Samples

Taxon OID | Sample Name | Habitat Type
2067725003 | Soil microbial communities from Great Prairies - Wisconsin, Switchgrass soil | Environmental
3300000364 | Soil microbial communities from Great Prairies - Iowa, Native Prairie soil | Environmental
3300000956 | Soil microbial communities from Great Prairies - Kansas, Native Prairie soil | Environmental
3300004062 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Sandmound_ThreeSqA_D2 | Environmental
3300005167 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121 | Environmental
3300005181 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127 | Environmental
3300005186 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_125 | Environmental
3300005293 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, MSU, sample Bulk Soil Replicate 1 : eDNA_1 | Host-Associated
3300005294 | Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL2 Bulk Soil | Environmental
3300005295 | Switchgrass rhizosphere bacterial communities from Rose Lake, Michigan, USA - RL3 | Environmental
3300005332 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly) | Environmental
3300005444 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-1 metaG | Environmental
3300005446 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 | Environmental
3300005447 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 | Environmental
3300005536 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG | Environmental
3300005555 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 | Environmental
3300005598 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 | Environmental
3300005983 | Tabebuia heterophylla rhizosphere microbial communities from the University of Puerto Rico - S2T1R1 | Host-Associated
3300006046 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101 | Environmental
3300006354 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2012 | Environmental
3300006797 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108 | Environmental
3300006800 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 | Environmental
3300006852 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD2 | Host-Associated
3300006865 | Hot spring sediment bacterial and archeal communities from British Columbia, Canada, to study Microbial Dark Matter (Phase II) - Larsen N4 metaG | Environmental
3300006871 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD3 | Host-Associated
3300006969 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD3 | Host-Associated
3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental
3300009088 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG | Environmental
3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental
3300009157 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (1) Depth 19-21cm March2015 | Environmental
3300009168 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (6) Depth 19-21cm September2015 | Environmental
3300009792 | Tropical forest soil microbial communities from Panama - MetaG Plot_12 | Environmental
3300009811 | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW N3_20_30 | Environmental
3300010043 | Tropical forest soil microbial communities from Panama - MetaG Plot_26 | Environmental
3300010046 | Tropical forest soil microbial communities from Panama - MetaG Plot_36 | Environmental
3300010047 | Tropical forest soil microbial communities from Panama - MetaG Plot_30 | Environmental
3300010359 | Tropical forest soil microbial communities from Panama - MetaG Plot_15 | Environmental
3300011107 | Soil and rhizosphere microbial communities from Centre INRS-Institut Armand-Frappier, Laval, Canada - Soil microcosm metaTmtHAC (Metagenome Metatranscriptome) (v2) | Environmental
3300011269 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaG | Environmental
3300011437 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT736_2 | Environmental
3300011444 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT800_2 | Environmental
3300011445 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMZT700_2 | Environmental
3300012022 | Terrestrial microbial communites from a soil warming plot in Okalahoma, USA - C6 | Environmental
3300012035 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT338_2 | Environmental
3300012200 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaG | Environmental
3300012204 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_100_16 metaG | Environmental
3300012206 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaG | Environmental
3300012207 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaG | Environmental
3300012208 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaG | Environmental
3300012209 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaG | Environmental
3300012228 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT700_2 | Environmental
3300012350 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_60_16 metaG | Environmental
3300012351 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaG | Environmental
3300012353 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_80_16 metaG | Environmental
3300012354 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_60_16 metaG | Environmental
3300012355 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaG | Environmental
3300012356 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaG | Environmental
3300012358 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_100_16 metaG | Environmental
3300012360 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_113_16 metaG | Environmental
3300012361 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaG | Environmental
3300012532 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_80_16 metaG | Environmental
3300012924 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Illumina Assembly) | Environmental
3300012929 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly) | Environmental
3300012930 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly) | Environmental
3300012961 | Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_202_MG | Environmental
3300012972 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_24_1 metaG | Environmental
3300014154 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09212015 | Environmental
3300014885 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLIBT47_16_10D | Environmental
3300015242 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Hybrid Assembly) | Environmental
3300015357 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_5_0_1 metaG | Environmental
3300015374 | Col-0 rhizosphere combined assembly | Host-Associated
3300018053 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_60_b1 | Environmental
3300018054 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_30_b1 | Environmental
3300018056 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_b1 | Environmental
3300018072 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_30_b2 | Environmental
3300018076 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_coex | Environmental
3300018078 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_60_coex | Environmental
3300018079 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b1 | Environmental
3300018433 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116 | Environmental
3300019881 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U3c2 | Environmental
3300019889 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L2c2 | Environmental
3300020003 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L2a2 | Environmental
3300020004 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H1a2 | Environmental
3300020022 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1s2 | Environmental
3300021073 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_60_b1 redo | Environmental
3300021082 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_5_coex redo | Environmental
3300021344 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U2a2 | Environmental
3300025155 | Soil microbial communities from Rifle, Colorado, USA - sediment 13ft 4 | Environmental
3300025289 | Soil microbial communities from Rifle, Colorado, USA - sediment 16ft 2 | Environmental
3300025318 | Soil microbial communities from Rifle, Colorado, USA - sediment 13ft 1 | Environmental
3300025322 | Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 16_1 (SPAdes) | Environmental
3300025324 | Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 10_1 (SPAdes) | Environmental
3300025325 | Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 13_2 (SPAdes) | Environmental
3300025326 | Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 16_2 (SPAdes) | Environmental
3300026313 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes) | Environmental
3300026327 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 (SPAdes) | Environmental
3300026343 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144 (SPAdes) | Environmental
3300027722 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (1) Depth 19-21cm March2015 (SPAdes) | Environmental
3300027835 | Subsurface groundwater microbial communities from S. Glens Falls, New York, USA - GMW60B uncontaminated upgradient, 5.4 m (SPAdes) | Environmental
3300027862 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes) | Environmental
3300027873 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD1 (SPAdes) | Host-Associated
3300027880 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. deltoides SBSDD3 (SPAdes) | Host-Associated
3300027894 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2012 (SPAdes) | Environmental
3300030006 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT152D67 | Environmental
3300031576 | Biofilm microbial communities from Wishing Well Cave, Virginia, United States - WW16-25 | Environmental
3300031720 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515 | Environmental
3300031740 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05 | Environmental
3300031820 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515 | Environmental
3300031879 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.172 (v2) | Environmental
3300031965 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT100D185 | Environmental
3300033407 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT140D175 | Environmental

Geographical Distribution
[Interactive map; powered by OpenStreetMap]

Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
GPWSG_02926430 | 2067725003 | Soil | TNHWLDVGIHEICLLPKTKYGTNVKALPSGIVHTAVFMKNDVGQTQTYYALTARPEPTVSDLYFTALIKHSAVPVHSGWKAWLYKRAVRRKEVEALTAHGIHGIKISLDDMSLADDISTALKKGKLRKVFDNVSLLPFSKTSSRI
INPhiseqgaiiFebDRAFT_1013997322 | 3300000364 | Soil | AVPVHSGWKAWLHKRAVRKKEVEXLTAHGIHGIKVSLDDTSLSEDISTALRKGKLKKVLDAVSLLPFSETSSHI*
JGI10216J12902_115228615 | 3300000956 | Soil | HTAIFAKNDVGQPQTYYALTAKPDPTDFDLYFTALIKHSSVPVHCGWKGWLHKRAVRKKEVEPLTAHGIHGIKVSLDDTSLAEDISTALKKGKLKTIFDNVSLLPFSKTSPRI*
Ga0055500_100737202 | 3300004062 | Natural And Restored Wetlands | FAALTKHSAVPVHATWKTWLHKRAVQKKELEALTTHGIRGLKISLDDESLAEDISAALKKGKLRKAFDQATLLPFSKTSARI*
Ga0066672_104173701 | 3300005167 | Soil | SVKALPSGIVHTAIFIKNDLDQSQTYYALTARPEPTDSDLYFTALIKHSAVPVHFNWKAWLHKRAVQRKEVETLTVHGIHGIKVSLDDASLAEDISTALKKGKLTKVFDNVSLLPFSKTSPHI*
Ga0066678_101451752 | 3300005181 | Soil | EICLLPRTKYGTSVKALPSGIVHTAIFIKNDVNQAETHYALTARPDFTEADLYFTAIIKHSSVPVHPGWKTWLHKRAVKGREVETLTTHGIHGIKISLNEPSLAEDISTALKKGKLKKVFDNVSLLPFSKTSSRT*
Ga0066676_101885651 | 3300005186 | Soil | LLPRTKYSTGVKALPSGVIHSALFIKNDLGQPQTYYALTARPEHTDSDLYFTALIKHSALPVHPNWKAWLHKRAVQRKEVETLITHGIRGIKISLDDTSLAEDISTALKKGKLKKVFDNVSLLPFSKTSSRT*
Ga0065715_109997151 | 3300005293 | Miscanthus Rhizosphere | IKNDPGQSQTYYALTARPEPTVSDLYFTALIEHSAVPVHSGWKAWLYKRAVQRKEVEALTTQGIHGVKVILDDDTLAVDISVAIKKGNLRNVFDNDSLLPFPKKS*
Ga0065705_104124051 | 3300005294 | Switchgrass Rhizosphere | HEICLLPKTKYGTNVKALPSGIVHTAVFMKNDLGQSQTYYALTARPEPTVSDLYFTALIKHSAVPVHSNWKAWLHKRAVQRKEVEALTAHGIHGIKISLDDISLAEDISTALKKGKLRKIFDNVSLLPFSKTSSRI*
Ga0065707_103361882 | 3300005295 | Switchgrass Rhizosphere | KTDLGQSQTYYALTARPEPTVSDLYFTALIKHSAVPVHSNWKAWLHKRAVQRKEVETLTAHGIHGIKISLDDISLAEDISTALKKGKLRKIFDNVSLLPFSKTSSRI*
Ga0066388_1028403171 | 3300005332 | Tropical Forest Soil | PSGIVHTAIFIKNDLGQSQAYYALTAKPEPTDSDLYFTALITHSAVPVDSNWKTWLHKRAVQRKEVETLTTYGIRGIKISLDDTSLAKDISTALEKGKLRKVFDKISLLPFSKTS*
Ga0070694_1005713931 | 3300005444 | Corn, Switchgrass And Miscanthus Rhizosphere | KYGTNVKALPSGIIHTAIFIKNDPGQSQTYYALTARPEPTVSDLYFTALIEHSAVPVHSGWKAWLYKRAVQRKEVEALTTQGIHGVKVILDDDTLAVDISVAIKKGNLRNVFDNDSLLPFPKKS*
Ga0066686_106317071 | 3300005446 | Soil | AIFIKNDVNQAETHYALTARPEPTDSDLYFTALIKHSAVPVHSNWKAWLHKRAVQRKEVETLITHGIRGIKISLDDTSLAEDISTALKKGKLKKIFDNDSLLPFSKTSSRI*
Ga0066689_107824601 | 3300005447 | Soil | DAVKAIRATLLTNHWINVGIHEICLLPRTKYGTSVKALPSGIVHTAIFIKNDVGQSQTYYTLTAKPEATDSDLYFTALIKHSAVPVHSNWKAWLHKRAVQGKEVETLTAHGIHGIKISLDDTSLAEDISTALKKGKLKKVFDNVSSLPFSKTSSRF*
Ga0070697_1017991131 | 3300005536 | Corn, Switchgrass And Miscanthus Rhizosphere | DLGQSQTYYALTARPEPTVSDLYFTALIEHSAVPVHSGWKAWLYKRAVQRKEVEALTTQGIHGVKVILDDDTLAVDISVAIKKGNLRNVFDNDSLLPFPKKS*
Ga0066692_103719712 | 3300005555 | Soil | EPTDSDLYFTALIKHSAVPVHSNWKAWLHKRAVQRNEVEALTARGIHGIKISLDDTSLAEDISTALKKGKLKKVFDNVSLLPFSKTSSRI*
Ga0066706_102628541 | 3300005598 | Soil | RATLLTNHWINVGVHEICLLPRTKYSTGVKALPSGVIHSALFIKNDLGQLQIYYVLIARSEHMDSDLYFTALIKHSALPVHPNWKAWLHKRAVQRKEVETLITHGIRGIKISLDDTSLAEDISTALKKGKLKKVFDNVSLLPFSKTSSRT*
Ga0081540_10794093 | 3300005983 | Tabebuia Heterophylla Rhizosphere | VPVHSGWKTWLHKRAVQKNELEALTAHGIHGIKISLNDTTLAEDISTALKKGKLKQVFDNVSLLPFSKTSSRT*
Ga0081540_12319741 | 3300005983 | Tabebuia Heterophylla Rhizosphere | AKNDVGQSQTYYALTAKPESTDSDLYFTALIKHSAVPVHSGWKAWLHKRAVRKKKVEPLTAHGIHGMKVSLDDTSLAEDISAALKKGKLKTIFDNVSLLPFSKTSPRI*
Ga0066652_1005876322 | 3300006046 | Soil | FLSIAGHKDAVKAIRATLLTNHWINVGIHEICLLPRTKYSTGVKALPSGVIHSALFIKNDLGQPQTYYALTARPEHTDSDLYFTALIKHSALPVHPNWKAWLHKRAVQRKEVETLITHGIRGIKISLDDTSLAEDISTALKKGKLKKVFDNVSLLPFSKTSSRT*
Ga0075021_101712912 | 3300006354 | Watersheds | AETHYTLTARQDFTEADLYFTALIKHSAVPVHSGWKTWLHKRAVQGNEVETLTAHGIHGIKISLDDTSLAEDISTALKEGKLKTVFDNASLLPFSKKSSRI*
Ga0066659_113545721 | 3300006797 | Soil | IHEICLLPRTKYSTSVKALPSGIVHTTIFIKNDVGQSETYYALTAKPEPTDSDLYFTALIKHSAVPVHSNWKAWLHKRAVQRKEVEALTAHGIHGIKVSLDDTSLAEDISTALKKGKLKKIFDNVSLLPFSKISSRI*
Ga0066660_113238301 | 3300006800 | Soil | WVNVGIHEICLLPRTKYGTSVKALPSGIVHTAIFIKNDLGQSQTYYALTARPEPTDSDLYFTALIKHGAVPVHSNWKAWLHKRAVQRKEVETLITHGIRGIKISLDDTSLAEDISTALKKGKLKKIFDNVSLLPFSKISSRI*
Ga0075433_107063332 | 3300006852 | Populus Rhizosphere | SAVPVHSGWKAWLHRRAVRKKEVEPLTAHGIHGLKISLDDSSLAEDISTALKRGKLKTIFDNVSLLPFSKTSPRI*
Ga0073934_103915321 | 3300006865 | Hot Spring Sediment | HTAIFIKNDLGQSQTYYALTARPESTDSDLYFTALIKHSAVPVHSGWKAWLHKRAVQRKAVETLTAHGIHGIKISLDDTSLAEDISTALKKGKLKKVFDNVSLLPFSKTSSRI*
Ga0075434_1010075311 | 3300006871 | Populus Rhizosphere | ALPSGVIHSALFIKNDLGQAQTCYALTARPETTDPDLYLTALIKHSAVPLHFSWKAWLHKRAVQRKELETLITHGIRGIKISLDDTSLAEDISTALKKGKLKRVFDNVSLLPFSKTSSRT
Ga0075434_1010391492 | 3300006871 | Populus Rhizosphere | ICLLPRTKYGTSVKALPSGIVHTAIFIKNDLGQSQTYYALTAQPDPTDSDLYFTALITRSAVPVHSNWKTWLHKRAVQRKEVETLTTYGIRGIKISLDDASLAEDISTAIKKGKLRKVFDKVSLLPFSKTS*
Ga0075419_102194671 | 3300006969 | Populus Rhizosphere | ATVLTNHWINVGIHEICLLPRTKYGTSVKALPSGIVHTAIFAKNDVGQPQTYYALTAKPDPTDSDLYFTALIKHSAVPVHSGWKAWLHRRAVRKKEVEPLTAHGIHGLKISLDDSSLAEDISTALKRGKLKTIFDNVSLLPFSKTSPRI*
Ga0075419_113998041 | 3300006969 | Populus Rhizosphere | TANPGIHDRDLYFRTVITHSAVPVHCSWKPWLYKRAVKKKEIEPLTVHGIHGIKVSVDDESLAEDISAALKKGKLKNVLSTVSLLPFSKTPSRI*
Ga0066710_1047400651 | 3300009012 | Grasslands Soil | TLLTNHWIKVGIHEICLLPRTKYGTSVKALPSGIVHIAIFIKNDVGQLQTYYALTAKPEPMDSDLYFTALIKHSAVPVHSNWKAWLHKRAVQRKEVEALTAHGIHGIKIALDDTSLAEDISTALKKGKLKTVFDNVSFLPFSKTPSRT
Ga0099830_106411872 | 3300009088 | Vadose Zone Soil | FIKHDLDQSQTYYALTARPEPTDSDLYFTALIKHSAVPVHSNWKAWLHKRAVQRKEVETLTAHGIHGIKVSLDDTSLAEDISTALKKGKLKKVFDNVSLLPFSKTSTRI*
Ga0099830_118511322 | 3300009088 | Vadose Zone Soil | LPCGVTHTAFFIKNDFTQSQVYLLAAIPGIPESDLYFAALIKYGAVPVHFGWKAWLYKRAVQKKEVEPLTAHGIHGIKVTLDDEGLAEDVSAA
Ga0066709_1045173362 | 3300009137 | Grasslands Soil | DSDLYFTALIKHSAVPVHSNWKAWLHKRAVQRKEVETLTAHGIHGIKVSLDDTSLAEDISTALKKGKLTKVFDNVSLLPFSKTSSHI*
Ga0105092_108949241 | 3300009157 | Freshwater Sediment | ICLLPKTKYGTNVKALPSGIVHTAVFMKNDLGQSQTYYALTARPEPTVSDLYFTALIKHSAVPVHSGWKAWLYKRAVQRKEVEALTAHGIHGIKISLDDISLAEDISTALKKGKLRKVFDNVSLLPFSKTSSRI*
Ga0105104_105515091 | 3300009168 | Freshwater Sediment | KPEPTDSHLYFTGLIKHSAVPVHSNWKAWLHKRAVQRKEVETLTTHGIRGIKISLDDTSLAEDISTALKKGKLKKVFDNVSLLPFSKTSTRI*
Ga0126374_102795692 | 3300009792 | Tropical Forest Soil | PRTKYGTSAKALPSGIVHTAIFIKNDVGHSETHYALTARPGLTEADLYFAGLIKHSSVPVHSAWKTWLHKRAVQKNELEALTANGIHGIKISLDDTTLAEDISAALKKGKLKQVFDNVSLLPFSKTSSRT*
Ga0105084_10850482 | 3300009811 | Groundwater Sand | KAIRATLLTNHWINVGIHEICLLPRTKYGTSVRALPSGIVHTAIFIKNDLGQSQTYYALTAKPEPTDSDLYLTALIKHSAVPVHSNWKAWLHKRAVQRKEVETLTTHGIRGIKISLDDTSLAEDISTALKKGKLKKVFDNVSLLPFSKTPSRT*
Ga0126380_118580331 | 3300010043 | Tropical Forest Soil | HEICLLPRTKYGTSVKALPSGIVHTAIFAKNDVGQSQTYYALTAKPESADSDLYFTALIKHSAVPVHSGWKAWLHKRAVRKKKVEPLTAHGIHGMKVSLDDTSLAEDISAALKKGKLKTIFDNVSLLPFSKTSPRI*
Ga0126384_101681721 | 3300010046 | Tropical Forest Soil | DSDLYFTALIKHSAVPVHSNWKAWLHKRAVQRKEVETLTTHGIRGIKISLDDTSLAEDISTALKKGKLKNIFGNASLLPFSKTSSHI*
Ga0126382_113320201 | 3300010047 | Tropical Forest Soil | IHEICLLPRTKYGTSVKALPSGIVHTAIFIENDVGQSETHYALTARPGLTEADLYFTGLIKHSSVPVHSHWKTWLHKRAVQKNELEALTAHGIHGIKISLNDTTLAEDISTALKKGKLKQVFDNVSLLPFSKTSSRT*
Ga0126382_114153281 | 3300010047 | Tropical Forest Soil | TALIKHSAVPVHSGWKAWLHKRAVRKKKVEPLTAHGIHGMKVSLDDTSLAEDISAALKKGKLKTIFDNVSLLPFSKTSPRI*
Ga0126376_100302524 | 3300010359 | Tropical Forest Soil | HEICLLPRTKYGTSAKALPSGIVHTAIFIKNDVGHSETHYALTARPGLTEADLYFAGLIKHSSVPVHSAWKTWLHKRAVQKNELEALTANGIHGIKISLDDTTLAEDISAALKKGKLKQVFDNVSLLPFSKKSSRT*
Ga0151490_15695491 | 3300011107 | Soil | LLTNHWINVSSHEICLLPRTKYGTTVKALPSGVIHSALFIKNDLGQPQTCYALTARPETTDPDLYLTALIKHSAVPLHFSWKAWLHKRAVQRKEVETLITHGIHGIKISLDDKSLAEDISTALKKGKLKKVLDDISLLPFSKTSPRI*
Ga0137392_111701422 | 3300011269 | Vadose Zone Soil | KNDLGQSQTYYALTARPEPTVSDLYFTALIKHSAVPVHSNWKAWLHKRAVQRKEVEALTAHGIHGIKISLDDISLAEDISTALKKGRLKRVFNNSLLPFSKTFGISKLS*
Ga0137429_10686372 | 3300011437 | Soil | FTALIKHSAVPVHSGWKAWLYKRAVQRKEVEALTAHGIHGIKISLDDISLAEDISKALKKGKLRKVFDNVSLLPFSKTSSRI*
Ga0137463_12114562 | 3300011444 | Soil | KAIRARLLTNHWLNVGIHEICLLPKTKYGTNVKALPSGIVHTAVFMKNDLGQSQTYYALTARPEPSDSDLYFTALIKHSTVPVHSGWKAWLYKRAVQRKEVEALTAHGIHGIKISLDDISLAEDISTALKKGKLRKVFDNVSLLPFSKASSRI*
Ga0137427_101751081 | 3300011445 | Soil | MKNDLGQSQTYYALTARPEPTVSDLYFTALVKHSAVPVHSGWKAWLYKRAVQRKEVEALTAHGIHGIKISLDDVSLAEDISTALKKGKLRKFF
Ga0120191_100744892 | 3300012022 | Terrestrial | TSVKALPSGIVHTAIFIKNDLGQSQTYYALTARPEPTDSDLYFTALIKHSAVPVHSNWKAWLHKRAVQRKEVETLTTHGIRGIKISLDDTSLAEDISTALKKGKLKKVFDNVSLLPFSKISSRI*
Ga0137445_10910411 | 3300012035 | Soil | IRARLLTNHWLNVGIHEICLLPKTKYGTNVKALPSGIVHTAVFMKNDLGQSQTYYALTARPEPTVSDLYFTALMKHSAVPVHSGWKAWLYKRAVQRKEVEALTAHGIHGIKISLDDISLAEDISTALKKGKLKKVFDNVSLPFSKTSSRI*
Ga0137382_111102461 | 3300012200 | Vadose Zone Soil | GTGVRALPSGIVHTAIFVKNDLGQSQTYYALTARPEPSDSDLYFTALIKHSAVPVHSGWKAWLYKRAVQRKEVEALTARGIHGIKISLDDISLAEDISTALKKGKLRKVFDNVSLLPFSKTSSRI*
Ga0137374_105687741 | 3300012204 | Vadose Zone Soil | DSDLYFTALIKHSAVPVHSNWKAWLHKRAVQRKEVETLTVHGIHGIKVSLDDTSLAEDISTALKKGKLTKVFDNVSLLPFSKTSSHI*
Ga0137374_111419302 | 3300012204 | Vadose Zone Soil | VHTAVFIKNDVGQLQTYYALTAKPEPTDSDLYFTALIKHSAVPVHSNWKAWLHKRAVQRKEVETLTAHGIHGIKISLDDTSLAEDISTALKKGKLKKVFDSVCLLPFSKTPSRT*
Ga0137380_105513742 | 3300012206 | Vadose Zone Soil | ALIKHSTVPVHSGWKAWLHKRALKKKEMETLTGHGIHGIKVSLDDEGLAVDISAALKKGKLKKVFNTVSPLPLPETSSHI*
Ga0137381_102972781 | 3300012207 | Vadose Zone Soil | TAIFIKNDLGQSQTYYALTAKPESMDSDLYFTALIKHSAVPVHSNWKAWLHKRAVQRKEVETLTTHGIRGIKISLDDTSLAEDISTALKKGTLKEIFDNVSLLPFSKTSSRI*
Ga0137376_114862362 | 3300012208 | Vadose Zone Soil | VHTAVFMKTDLGQSQTYYALTARPEPTVSDLYFTSLIKHSAVPVHSGWKAWLYKRAVQRKEVEALTAHGIHGIKISLDDISLAEDISTALKKGKLRKVFDNVSLLPFSKTSSRI*
Ga0137379_108424301 | 3300012209 | Vadose Zone Soil | NVGIREICLLPRTKYGTGVKPLPSGVIHSALFIKNDLGQSQTYYALTAKPESMDSDLYFTALIKHSAVPVHSNWKAWLHKRAVQRKEVETLTTHGIRGIKISLDNTSLAEDISTALKKGKLKKVFDNVSLLPFSKTSSRI*
Ga0137459_10183551 | 3300012228 | Soil | IVHTAVFMKNDLGQSQTYYALTARPEPTVSDLYFTALVKHSAVPVHSGWKAWLYKRAVQRKEVEALTAHGIHGIKISLDDVSLAEDISTALKKGKLRKVFDNLSLLPFSKTSSRI*
Ga0137372_110886581 | 3300012350 | Vadose Zone Soil | GIREICLLPRTKYGTSVKALPSGIVHTAIFIKNDLGQSQTYYALTAKPEPTDSDLYFTALIKHSAVPVHSNWKAWLHKRAVQRKEVETLTTHGIRGIKISLDDTSLAEDISTALKKGKLEVFDNVSLLPFSKISSRI*
Ga0137386_105992431 | 3300012351 | Vadose Zone Soil | VGIHEIFLLLKTKYGTGVKALPSGIVHTAIFIKNDLDQSQTYYALTAKPDPTDSDLYFTALIKHSALPVHPNWKAWLHKRAVQRKEVETLTVHGIHGIKVSLDDTSLAEDISTALKKGRLKKVFNNSLLPFSKTFGISKLSVDRAWSK*
Ga0137367_102661012 | 3300012353 | Vadose Zone Soil | KNDVGQLQTYYALTAKPEPMDSDLYFTALIKHSAVPVHSNWKAWLHKRAVQRKEVEALTAHGIHGIKIALDDTSLAEDISTALKKGKLKKVFDVSLLPFSKTPSRT*
Ga0137367_108433661 | 3300012353 | Vadose Zone Soil | VKAIRATLLTNHWINVGIHEICLLPRTKYTTSVKALPSGIVHTAIFIKNDVGQSETHYALTAKPEPTDSDLYFTALIKHSAVPVHSNWKAWLHKRAVQRKEVETLTTHGIRGIKISLDDTSLAKDISTALKKGKLEVFDNVSLLPFSKTSVRI*
Ga0137367_109728141 | 3300012353 | Vadose Zone Soil | TSVKALPSGIVHTAIFIKNDVGQLQTYYALTARPEPTDSDLYFTALIKHSAVPVHFNWKAWLHKRAVQRKEVETLTVHGIHGIKVSLDDTSLAEDISTALKKGKLTKVFDNVSLLPFSKTSSHI*
Ga0137366_106947071 | 3300012354 | Vadose Zone Soil | DLYFTALIKHSAVPVHSNWKAWLHKRAVQRKEVETLTVHGIHGIKVSLDDTSLAEDISTALKKGKLTKVFDNVSLLPFSKTSSHI*
Ga0137369_110749821 | 3300012355 | Vadose Zone Soil | ESDLYFAALIKHSAVPVHSNWKAWLHKRAVQRKEVEALTAHGIHGIKISLDDTSLAEDISTAIKKGKLKKIFDNVSLLPFSKISSRI*
Ga0137371_102764892 | 3300012356 | Vadose Zone Soil | KDAVKAIRATLLTNHWINVGIHEICLLPRTKYSTGVKALPSGVIHSALFIKNDLGQPQTYYALTARPEHTDSDLYFTALIKHSALPVHPNWKAWLHKRAVQRKEVETLITHGIRGIKISLDDTSLAEDISTALKKGKLKKVFDNVSLLPFSKTSSRT*
Ga0137368_106966511 | 3300012358 | Vadose Zone Soil | NVGIHEICLLPRTQYGTSVKALPSGIVHTAIFIKNDLDQSQTYYALTAKPEPTDSDLYFTALIKHSAVPVHSNWKAWLHKRAVQRKEVEALTAHGIHGIKIALDDTSLAEDISTALKKGKLKKVFDNASFLPFSKTPSRT*
Ga0137375_101836813 | 3300012360 | Vadose Zone Soil | GIVHTAIFIKNDLDQSQTYYALTAKPEPTDSDLYLTALIKHSAVPVHFNWKAWLHKRAVQKKEVETLTVHGIHGIKVSLDDTSLAEDISTALKKGKLTKVFDNVSLLPFSKTSSHI*
Ga0137360_101916871 | 3300012361 | Vadose Zone Soil | KALPSGVIHSALFIKNDLGQPQTYYALTARPETTDSDLFLTALIKHSAVPVHPSWKAWLHKRAVQRKEVETLITHGIHGIKISLDDTSLAEDISTALKKGKLKKVFDNVSLLPFSKTSSRT*
Ga0137373_109920562 | 3300012532 | Vadose Zone Soil | KALPSGIVHTAIFIKNDLDQSQTYYALTARPEPTDSDLYFTALIKHSAVPVHFNWKAWLHKRAVQRKEVETLTVHGIHGIKVSLDDTSLAEDISTALKKGKLTKVFDNVSLLPFSKTSSHI*
Ga0137413_113567571 | 3300012924 | Vadose Zone Soil | VSDLYFTGLIKHSSVPVHSGWKAWLYKRAVQRKEVEALTAHGIHGIKISLDDISLAEDISTALKKGKLRKVFDNVSLLPFSKTSSRI*
Ga0137404_117205971 | 3300012929 | Vadose Zone Soil | SSVPVHSGWKAWLYKRAVQRKEVEALTAHGIHGIKISLDDISLAEDISTALKKGKLRKVFDNVSLLPFSKTSARI*
Ga0137407_113350981 | 3300012930 | Vadose Zone Soil | IKHSEVPVHSNWKAWLHKRAVQRKEVEALTAHGIHGIKIALDDTSLAEDISTALKKGKLKTVFDNVSFLPFSKTPSRT*
Ga0137407_119861872 | 3300012930 | Vadose Zone Soil | TARPEPSDSDLYFTALIKHSAVPVHSGWKAWLYKRAVQRKEVEALTAHGIHGIKISLDDISLAEDISTALKKGKLRKIFDNVSLLPFSKTSSRI*
Ga0164302_107717941 | 3300012961 | Soil | HSAVPVHPSWKAWLHKRAVQRKEVETLITHGIHGIKISLDDTSLAEDISTALKKGKLKKVFDNVSLLPFSKTSSRT*
Ga0134077_104202912 | 3300012972 | Grasslands Soil | LTARPEPTDSDLYFTALIKHSAVPVHFNWKAWLHKRAAQRKEVETLTVHGIHRIKVSLDDTSLAEDISTALKKGKLTKVFDNVSLLPFSKTSSHI*
Ga0134075_101825582 | 3300014154 | Grasslands Soil | VGIHEICLLPRTKYGTSVKALPSGIVHTAIFIKNDVGQLQTYYALTAKPEPTDSDLYFTALIKHSAVPVHSNWKAWLHKRAVQRKEVEALTAHGIHGIKIALDDTSLAEDISTALKKGKLKKVFDNVSLLPFSKTSSRT*
Ga0180063_10739691 | 3300014885 | Soil | KDAVKAIRATLLTDHWINVGINELCLLPRTKYGASVKALPSGIVHTAVFMKNDLGQSQTYYALTARPEPTVSDLYFTAIIKHSAVPVHSGWKAWLYKRAVQRKEVEALTAHGIHGIKISLDDISLAEDISTALKKGKLRKVFDNVSLLPFSKTSSRI*
Ga0137412_105052721 | 3300015242 | Vadose Zone Soil | QTYYALTARPEPTVSDLYFTGLIKHSSVPVHSGWKAWLYKRAVQRKEVEALTAHGIHGIKISLDDISLAEDISTALKKGKLRKVFDNVSLLPFSKTSSRI*
Ga0134072_103455451 | 3300015357 | Grasslands Soil | VGQSETHYALTARPGLTEADLYFTGLIKHSSVPVHSGWKTWLHKRAVQRKEVETLITHGIRGIKISLDDTSLAEDISTALKKGKLKKVFDNVSLLPFSKTSSRT*
Ga0132255_1056849971 | 3300015374 | Arabidopsis Rhizosphere | FFSIAGHKDAVKAIRARLLTNHWINVGIHEICLLPRAKYGTSVKPLPSGIVHTAIFIKNNVGQSETHYALTAKPEPTDSDLYFTALIKHSAVPVHSGWKAWLHKRAVRKKEVEPLTAHGIHGIKVSLDDTSLAEDISAALKKGKLKTIFDNVSLLPFSKTTPRI*
Ga0184626_100019391 | 3300018053 | Groundwater Sediment | LYLTALIKHSAVPVHSNWKAWLHKRAVQRKEVEALTAHGIHGIKISLDDTSLAEDISTALKKGKLKKVFDNVSLLPFSKTSTRI
Ga0184621_102602522 | 3300018054 | Groundwater Sediment | AGYKDAVKAIRARLLTNHWLNVGIHEICLLPKTKYGTGVKALPSGIVHTALFMKNDLGQSQTYYALTARPEPTVSDLYLTALVKHSAVPVHPGWKAWLYKRAVQRKEVEALTAHGIHGIKISLDDISLAEDISTALKKGKLRKVFDNVSLLPFSKTSSRI
Ga0184623_102161642 | 3300018056 | Groundwater Sediment | RARLLTNHWLNVGIHEICLLPKTKYGTNVKVLPSGIVHTAVFMKNDLGQSQTYYALTARPEPTVSDLYFTALIKHSAVPVHSGWKAWLYKRAVQRKEVEALTAHGIHGIKISLDDICLAEDISTALKKGKLRKVFDNVSLLPFSKTSSRI
Ga0184635_100749201 | 3300018072 | Groundwater Sediment | EPTDSDLYLTALIKHSAVPVHFNWKAWLHKRAVQRKEVENLTAHGIHGIKTSLDDTSLAEDISTALKKGKLRKVFDNVSLLPFSKTSTRI
Ga0184609_102400711 | 3300018076 | Groundwater Sediment | QTYYALTARPETTDLDLYLTALIKHSAVPVHSGWKAWLYKRAVQRKEVEALTAHGIHGIKISLDDISLAEDISTALKKGKLRKVFDNVSLLPFSKTSARI
Ga0184612_105135061 | 3300018078 | Groundwater Sediment | TLLTDHWINVGIHEICLLPRTKYGTSVKALPSGVVHTAIFIKNDVGEAETHYALTARPDFTDADLYFTALIKHSSVPVHSGWRAWLHKRAVRRKEVEVLTTHGIRGIKISLDDASLAEDISTALKKGKVKSAFDNASLLPFSKTSIHI
Ga0184627_104069711 | 3300018079 | Groundwater Sediment | AVKAIRARLLTNHWVNVGVHEICLLPRTKYGTSVKALPSGIVHTAIFIKNDLGQSQTYYALTARPEPTYSDLYFTALIKHSAVPVHSNWKAWLHKRAVQRKEVEALTAHGIHGIKIALDDTSLAEDISTALKKGKLKKVFDNVSLLPFSKTPSRT
Ga0066667_100250921 | 3300018433 | Grasslands Soil | DAVKAIRATLLTNHWINVGIHEICLLPRTKYGTSVKALPSGIVHTAIFIKNDLGQSQTYYALTARPEPTDSDLYFTALIKHSAVPVHSNWKAWLHKRAVQRKEVETLTTHGIRGIKISLDDTSLAEDISTALKKGKLKKIFDNISLLPFSKIYSRI
Ga0193707_10414504 | 3300019881 | Soil | LNVGIHEICLLPKLKYGTKVKALPSGIIHTAVFMKNDLGQSQTYYALTARPEPTVSDLYFTALVKHSAVPVHSGWKAWLHKRAVQRMEVEALTAHGIHGIKISLDDISLAEDISTALKKGKLRKVFDNVSLLPFSKISSRI
Ga0193743_10058608 | 3300019889 | Soil | KNDPGQSQTYYALTARPEPTVSDLYFTVLIKHSAVPVHSGWKAWLYKRAVQRKEVEALTTHGIHGVKVILDDDTLAVDISVAIKKGKLRNVFDNDSLLPFPKTS
Ga0193739_10556062 | 3300020003 | Soil | ALIKHSAVPVHSGWKAWLYKRAVQRKEVEALTAHGIHGIKISLDDISLAEDISTALKKGKLRKVFDNVSLLPFSKTSSRI
Ga0193755_11808511 | 3300020004 | Soil | FMKNDLGQSQTYYALTARPEPTVSDLYFTALIKHSAVPVHSGWKAWLHKRAVQRMEVEALTAHGIHGIKISLDDISLAEDISTALKKGKLRKVFDNVSLLPFSKISSRI
Ga0193733_11720882 | 3300020022 | Soil | NHWLNVGIHEICLLPKTKYGTNVKALPSGIVHTAVFMKNDPGQSQTYYALTARPEPTVSDLYFTALVKHSAVPVHSGWKAWLHKRAVQRMEVEALTAHGIHGIKISLDDISLAEDISTALKKGKLRKVFDNVSLLPFSKISSRI
Ga0210378_101778392 | 3300021073 | Groundwater Sediment | TYYALTARPEPTVSDLYFTALIKHSAVPVHSGWKAWLYKRAVQRKEVEALTAHGIHGIKISLDDISLAEDISTALKKGKLRKVFDNVSLLPFSKTSSRI
Ga0210380_102016021 | 3300021082 | Groundwater Sediment | AVKAIRARLLTNHWLNVGIHEICLLPKTKYGTNVKALPSGIVHTAVFMKNDLGQSQTYYALTARPEPTVSDLYFTALIKHSAVPVHSGWKAWLYKRAVQRKEVEALTAHGIHGIKISLDDISLAEDISTALKKGKLRKVFDNLSLLPFSKTSSRI
Ga0193719_103547852 | 3300021344 | Soil | RARLLTNHWLNVGIHEICLLPKTKYGTNVKALPSGIVHTAVFMKNDLGQSQTYYALTARPEPTVSDLYFTALVKHSAVPVHSGWKAWLYKRAVQRKEVEALTAHGIHGIKISLDDISLAEDISTALKKGKLRKVFDNVSLLPFSKTSSRI
Ga0209320_102354921 | 3300025155 | Soil | FIKNDLGQAETHYTLTARQDFTEADLYFTALIKHSAVPVHSGWKTWLHKRAVQGNEVKTLTAHGIHGIKISLDDTSLAEDISTALKKGKLKTVFDNASLLPFSKKSSRI
Ga0209002_103593632 | 3300025289 | Soil | LLTNNWLNVGIHEVCLLPRTKYRTSVKALPSGIVHTAIFIKNDLGQAETHYTLTARQDFTEADLYFTALIKHSAVPVHSGWKTWLHKRAVQGNEVETLTAHGIHGIKISLDDTSLAEDISTALKKGKLKTVFDNASLLPFSKKSSRI
Ga0209519_100904323 | 3300025318 | Soil | KDAVKAIRATLLTNHWLNVGIHEVCLLPRTKYGTSVKALPSGIVHTAIFIKNDLGQAETHYTLTARQDFTEADLYFTALIKHSAVPVHSGWKTWLHKRAVQGNEVKTLTAHGIHGIKISLDDTSLAEDISTALKKGKLKTVFDNASLLPFSKKSSRI
Ga0209641_110882741 | 3300025322 | Soil | TKYGTSVKALPSGIVHTAVFIKNDPGQSQTYYALTARPEPTDSDLYFTALIKHSAVPVHSGWKTWLHKRAVQGNEVETLTAHGIHGIKISLDDTSLAEDISTALKKGKLKKVFDNVSLLPFSKTSTRI
Ga0209640_100241177 | 3300025324 | Soil | DPGQSQTYYALTARPEPTDSDLYFTALIKHSAVPVHSDWKTWLHKRAVQGKEVETLTAHGIHGIKISLDDTSLAEDISTALKKGKLKKVFDNVSLLPFSKKSSRI
Ga0209640_104646482 | 3300025324 | Soil | LTNHWLNVGIHEVCLLPRTKYGTSVKALPSGIVHTAIFIKNDLGQAETHYTLTARQDFTEADLYFTALIKHSAVPVHSGWKTWLHKRAVQGNEVETLTAHGIHGIKISLDDTSLAEDISTALKKGKLKTVFDNASLLPFSKKSSRI
Ga0209341_103504602 | 3300025325 | Soil | PSGIVHTAIFIKNDLGQAETHYTLTARQDFTEADLYFTALIRHSAVPVHSGWKTWLHKRAVQGNEVETLTAHGIHGIKISLDDTSLAEDISTALKKGKLKTVFDNASLLPFSKKSSRI
Ga0209342_104316921 | 3300025326 | Soil | YFTALIRHSAVPVHSGWKTWLHKRAVQGNEVETLTAHGIHGIKISLDDTSLAEDISTALKKGKLKTVFDNASLLPFSKKSSRI
Ga0209761_13389121 | 3300026313 | Grasslands Soil | TYYALTARPEPTDSDLYFTALIKHSAVPVHFNWKAWLHKRAVQRKEVETLTVHGIHGIKVSLDDTSLAEDISTALKKGKLTKVFDNVSLLPFSKTSSHI
Ga0209266_12945151 | 3300026327 | Soil | HTAIFIKNDLSQSQTYYALTARPEPTDSDLYFTALIKHSALPVHPNWKAWLHKRAVQRKEVETLITHGIRGIKISLDDTSLAEDISTALKKGKLKKVFDNVSLLPFSKTSSRT
Ga0209159_12234432 | 3300026343 | Soil | HWVNVGIHEICLLPRTKYGTSVKALPSGIVHTAIFIKNDLGQSQTYYALTARSESTDSDLYFTALIKHSAVPVHSNWKAWLHKRAVQRNEVETLTTHGIRGIKISLDDTSLAEDISKALKKGKLKNVFDNVSLLPFSKISSRI
Ga0209819_101922751 | 3300027722 | Freshwater Sediment | FLSIAGHKDAVKAIRATLLTDHWINVGNHEICLLPRTRYGTSVKTLPSGIVHTAIFVKNELGQGDTHYALTASAEFTEADLYFTALIKDSSVPVHSSWKAWLHKRAVRQNEVETLTTHGIRGIKVSLDDASLAEDISTALKKGKLKSVFDSASLLPFSKTSIRI
Ga0209515_105846162 | 3300027835 | Groundwater | VYYPLTASPGTPESDLYFTALIRHSAVPVHPGWKAWLHKRAVKKKEVETLTVHGIHGLKVSLNDEGLAEDVSAALKNGKLKKHLNSVSLLP
Ga0209701_103479652 | 3300027862 | Vadose Zone Soil | TSVKALPSGIFHTAIFIKNDLGQSQTYYALTMRREPKDSDLYLTALIKHSAVPVHSGWKAWLYKRAVQRKEVEALTAHGIHGIKISLDDISLAEDISTALKKGRLKRVFNNSLLPFSKTFGISKLS
Ga0209814_100163243 | 3300027873 | Populus Rhizosphere | LIKHSAVPVHSGWKAWLHRKAVRKKEVEPLTAHGIHGLKISLDDSSLAEDISTALKRGKLKTIFDNVSLLPFSKTSPRI
Ga0209481_103612012 | 3300027880 | Populus Rhizosphere | VLTNHWINVGIHEICLLPRTKYGTSVKALPSGIVHTAIFAKNDVGQPQTYYALTAKPDPTDSDLYFTALIKHSAVPVHSGWKAWLHRRAVRKKEVEPLTAHGIHGLKISLDDSSLAEDISTALKRGKLKTIFDNVSLLPFSKTSPRI
Ga0209068_104882041 | 3300027894 | Watersheds | YFTALIKHSAVPVHSGWKTWLHKRAVQGNEVETLTAHGIHGIKISLDDTSLAEDISTALKEGKLKTVFDNASLLPFSKKSSRI
Ga0299907_104218922 | 3300030006 | Soil | MQIYYALTAKPDPTVSDLYFTALIKHSAVPVHSGWKAWLHKRAVRKNEVEPLTAHGIHGIKISLDDASLGEDISTALKKGKLKTIFDNVSLLPFSKTSSRI
Ga0247727_110858871 | 3300031576 | Biofilm | HTAIFIKNDLAQSHVNYPLAANPGTSESDLYFAVLIRHSAVPVHSEWKAWLHKRAVKKKEVERLTAHGIHGLKVSLDDGGLAEDISAALKKGKLKRVFNNVSLLPLPKPPSRI
Ga0307469_108483671 | 3300031720 | Hardwood Forest Soil | EICLLPRTKYGTTVKALPSGVIHSALFIKNDLGQPQTCYALTARPETTDPDLYLTALIKHSAVPLHFSWKAWLHKRAVQRKELETLITHGIRGVKISLDDTSLAEDISTALKKGKLKRVFDNVSLLPFSKTSSRT
Ga0307468_1013304522 | 3300031740 | Hardwood Forest Soil | IAGHKDAVKAIRATLLTNHWINVGTHEICLLPRTKYSTSVKALPSGIVHTAVFIKNDVVQLLTYYALTAKPDCTDSDLYFTALIKHSAVPVHSGWKAWLHKRAVRKKEVEPLTAHGIHGIKVSLDDTSLAEDISTALKKGKLKKVLDDVSLLPFSKTSPRI
Ga0307473_106538802 | 3300031820 | Hardwood Forest Soil | HDRDLYFRTVITHSAVPVHCSWKPWLYKRAVKKKEIEPLTVHGIHGIKVSVDDESLAEDISAALKKGKLKNVLSTVSLLPFSKTPSRI
Ga0306919_112918942 | 3300031879 | Soil | RTKYGTSVKALPSGIVHTAIFIENDVGQSETHYGLTARPGLTEADLYFTGLIKHSSVPVHSGWKTWLHKRAVQKNELEALTAHGIHGIKISLNDTTLAEDISTALKKGKLKQVFDNVSLLPFSKTSSRT
Ga0326597_108437641 | 3300031965 | Soil | TARQDFTEADLYFTALIKHSAVPVHSGWKTWLHKRAVQGNEVETLTAHGIHGIKISLDDTSLAEDISTALKKGKLKTVFDNASLLPFSKKSSRI
Ga0326597_114179601 | 3300031965 | Soil | SGIVHTAVFIKNDPGQSQTYYALTARPEPTDSDLYFTALIKHSAVPVHSDWKTWLHKRAVQGKEVETLTAHGIHGIKISLDDTSLAEDISTALKKGKLKKVFDNVSLLPFSKTSTRI
Ga0214472_112509922 | 3300033407 | Soil | AIFIKNDVGQAETHYALTARPDFTEADLYFTALIKHSAVPVHSGWKTWLHKRAVQGNEVKTLTAHGIHGIKISLDDTSLAEDISTALKKGKLKTVFDNASLLPFSKKSSRI




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice