NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F058017


Overview

Basic Information
Family ID F058017
Family Type Metagenome / Metatranscriptome
Number of Sequences 135
Average Sequence Length 107 residues
Representative Sequence MDEQSNVIVEYSDRQALEDGVLVSVPGEGKVNRVTRAVFDHFTKPMGSSPATGEVTDITPLQDAIRAMLKVKPDEDAWRTGTYEGKELWLVPNEVGGFTLLFPDDY
Number of Associated Samples 102
Number of Associated Scaffolds 135
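
To reuse the representative sequence in alignment or search tools, it can be written out as a FASTA record. The following is a minimal sketch; the function name `to_fasta`, the header text, and the 60-column wrap width are illustrative choices, not something NMPFamsDB prescribes.

```python
FAMILY_ID = "F058017"
# Representative sequence copied from the Basic Information block above.
REPRESENTATIVE = "MDEQSNVIVEYSDRQALEDGVLVSVPGEGKVNRVTRAVFDHFTKPMGSSPATGEVTDITPLQDAIRAMLKVKPDEDAWRTGTYEGKELWLVPNEVGGFTLLFPDDY"


def to_fasta(header: str, sequence: str, width: int = 60) -> str:
    """Return a FASTA record with the sequence wrapped at `width` columns."""
    chunks = [sequence[i:i + width] for i in range(0, len(sequence), width)]
    return ">" + header + "\n" + "\n".join(chunks) + "\n"


record = to_fasta(f"{FAMILY_ID} representative sequence", REPRESENTATIVE)
print(record)
```

Note that the representative sequence need not match the family's average length exactly, since the average is taken over all members.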

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 61.48 %
% of genes near scaffold ends (potentially truncated) 28.15 %
% of genes from short scaffolds (< 2000 bps) 76.30 %
Associated GOLD sequencing projects 89
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.76
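
The percentages in this block can be converted back into approximate gene counts. The sketch below assumes that each percentage is computed over all 135 family sequences; the page does not state the denominator explicitly, so treat the resulting counts as estimates. The helper name `implied_count` is hypothetical.

```python
N_MEMBERS = 135  # "Number of Sequences" reported for this family

# Percentages as reported in the Quality Assessment block.
reported_percentages = {
    "valid RBS motif": 61.48,
    "near scaffold end (potentially truncated)": 28.15,
    "short scaffold (< 2000 bp)": 76.30,
}


def implied_count(percent: float, total: int = N_MEMBERS) -> int:
    """Nearest whole number of genes matching a reported percentage (assumed denominator: total)."""
    return round(percent / 100 * total)


for label, pct in reported_percentages.items():
    n = implied_count(pct)
    # Recompute the percentage from the integer count to check it rounds back to the reported value.
    print(f"{label}: about {n} of {N_MEMBERS} ({100 * n / N_MEMBERS:.2f} %)")
```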


Most Common Taxonomy
Group Bacteria (53.333 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (47.407 % of family members)
Environment Ontology (ENVO) Unclassified (51.852 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere (52.593 % of family members)





Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 23.13%, β-sheet: 26.12%, Coil/Unstructured: 50.75%

Predicted 3D Structure

pTM-score: 0.76

Structural matches with SCOPe domains

SCOP family | SCOP domain | Representative PDB | TM-score
d.260.1.0: automated matches | d6lpha_ | 6lph | 0.52137



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 135 Family Scaffolds
PF13814 | Replic_Relax | 8.15
PF14659 | Phage_int_SAM_3 | 3.70
PF01555 | N6_N4_Mtase | 1.48
PF00188 | CAP | 0.74
PF01381 | HTH_3 | 0.74
PF05222 | AlaDh_PNT_N | 0.74
PF01909 | NTP_transf_2 | 0.74
PF12728 | HTH_17 | 0.74
PF13482 | RNase_H_2 | 0.74
PF01935 | DUF87 | 0.74
PF00589 | Phage_integrase | 0.74
PF13537 | GATase_7 | 0.74
PF01850 | PIN | 0.74
PF00899 | ThiF | 0.74

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 135 Family Scaffolds
COG0863 | DNA modification methylase | Replication, recombination and repair [L] | 1.48
COG1041 | tRNA G10 N-methylase Trm11 | Translation, ribosomal structure and biogenesis [J] | 1.48
COG2189 | Adenine specific DNA methylase Mod | Replication, recombination and repair [L] | 1.48
COG2340 | Spore germination protein YkwD and related proteins with CAP (CSP/antigen 5/PR1) domain | Cell cycle control, cell division, chromosome partitioning [D] | 0.74



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 53.33 %
Unclassified | root | N/A | 46.67 %


Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300001213|JGIcombinedJ13530_107211230 | Not Available | 763
3300005167|Ga0066672_10136003 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1531
3300005172|Ga0066683_10528043 | All Organisms → cellular organisms → Bacteria | 720
3300005172|Ga0066683_10725929 | Not Available | 586
3300005445|Ga0070708_101358070 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 663
3300005468|Ga0070707_100000058 | All Organisms → cellular organisms → Bacteria | 98373
3300005471|Ga0070698_100013091 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 8781
3300005518|Ga0070699_100916169 | Not Available | 803
3300005531|Ga0070738_10000039 | All Organisms → cellular organisms → Bacteria | 389528
3300005542|Ga0070732_10045165 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2538
3300005554|Ga0066661_10049775 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2395
3300005555|Ga0066692_11043959 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia | 500
3300005559|Ga0066700_10914738 | Not Available | 582
3300006034|Ga0066656_10733174 | Not Available | 634
3300006050|Ga0075028_100023213 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 2764
3300006796|Ga0066665_10432259 | Not Available | 1087
3300006796|Ga0066665_10432851 | Not Available | 1087
3300006796|Ga0066665_11370906 | All Organisms → cellular organisms → Bacteria | 546
3300006865|Ga0073934_10187918 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Patescibacteria group → Microgenomates group → Candidatus Woesebacteria → Candidatus Woesebacteria bacterium RBG_13_36_22 | 1420
3300007258|Ga0099793_10608221 | Not Available | 548
3300007265|Ga0099794_10071477 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1700
3300007265|Ga0099794_10475601 | Not Available | 656
3300009012|Ga0066710_100243643 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2595
3300009038|Ga0099829_10683960 | All Organisms → cellular organisms → Bacteria | 852
3300009038|Ga0099829_10875402 | Not Available | 745
3300009038|Ga0099829_11221082 | Not Available | 622
3300009088|Ga0099830_10964131 | Not Available | 706
3300009088|Ga0099830_11503944 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Streptomycetales → Streptomycetaceae → Streptomyces → unclassified Streptomyces → Streptomyces sp. NRRL F-6491 | 561
3300009089|Ga0099828_10589330 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1001
3300009089|Ga0099828_11769273 | Not Available | 543
3300009090|Ga0099827_10309104 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1338
3300009137|Ga0066709_101199395 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1117
3300009143|Ga0099792_10555773 | All Organisms → cellular organisms → Bacteria | 726
3300009157|Ga0105092_10386086 | All Organisms → cellular organisms → Bacteria → PVC group → Candidatus Omnitrophica → unclassified Candidatus Omnitrophica → Omnitrophica bacterium RIFCSPLOWO2_12_FULL_44_17 | 796
3300009444|Ga0114945_10004137 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 8108
3300009523|Ga0116221_1517657 | Not Available | 523
3300009777|Ga0105164_10085966 | Not Available | 1647
3300009824|Ga0116219_10679390 | Not Available | 564
3300010361|Ga0126378_11109326 | Not Available | 892
3300010379|Ga0136449_101705566 | Not Available | 949
3300010429|Ga0116241_10006554 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 14854
3300011269|Ga0137392_10535600 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 972
3300011269|Ga0137392_10734778 | All Organisms → cellular organisms → Bacteria | 816
3300011270|Ga0137391_11366558 | Not Available | 555
3300011270|Ga0137391_11486458 | Not Available | 522
3300011270|Ga0137391_11550272 | Not Available | 507
3300011271|Ga0137393_11076708 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 683
3300011444|Ga0137463_1160353 | Not Available | 846
3300012096|Ga0137389_10711398 | All Organisms → cellular organisms → Bacteria | 863
3300012096|Ga0137389_10818476 | Not Available | 800
3300012096|Ga0137389_10975208 | Not Available | 727
3300012096|Ga0137389_11463978 | Not Available | 579
3300012189|Ga0137388_10412035 | All Organisms → cellular organisms → Bacteria | 1252
3300012189|Ga0137388_10528954 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1095
3300012189|Ga0137388_10998318 | Not Available | 772
3300012202|Ga0137363_10025226 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 4042
3300012202|Ga0137363_10069880 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2580
3300012202|Ga0137363_10759690 | Not Available | 822
3300012202|Ga0137363_10884483 | Not Available | 758
3300012202|Ga0137363_11240525 | Not Available | 633
3300012203|Ga0137399_10257371 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1433
3300012205|Ga0137362_11592595 | Not Available | 540
3300012205|Ga0137362_11790747 | All Organisms → cellular organisms → Bacteria → Elusimicrobia → unclassified Elusimicrobiota → Elusimicrobia bacterium CG1_02_56_21 | 501
3300012209|Ga0137379_11744247 | Not Available | 518
3300012361|Ga0137360_10163254 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1777
3300012361|Ga0137360_10432813 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1113
3300012362|Ga0137361_10017145 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 5403
3300012362|Ga0137361_10274112 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1541
3300012363|Ga0137390_10537612 | All Organisms → cellular organisms → Bacteria | 1140
3300012363|Ga0137390_10809172 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → unclassified Terriglobia → Acidobacteriia bacterium | 895
3300012363|Ga0137390_11336838 | Not Available | 662
3300012582|Ga0137358_10392978 | Not Available | 938
3300012685|Ga0137397_10027617 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 4026
3300012685|Ga0137397_10081301 | All Organisms → cellular organisms → Bacteria | 2357
3300012918|Ga0137396_10545762 | Not Available | 859
3300012918|Ga0137396_10683205 | Not Available | 757
3300012923|Ga0137359_10002031 | All Organisms → cellular organisms → Bacteria | 16211
3300012923|Ga0137359_10149572 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2081
3300012925|Ga0137419_10428008 | Not Available | 1039
3300012927|Ga0137416_10697216 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 892
3300012929|Ga0137404_10121880 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2133
3300012929|Ga0137404_12337334 | Not Available | 500
3300012930|Ga0137407_10071279 | All Organisms → cellular organisms → Bacteria | 2894
3300012930|Ga0137407_10921196 | Not Available | 827
3300013297|Ga0157378_12931831 | Not Available | 529
3300015241|Ga0137418_10065272 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 3323
3300015259|Ga0180085_1000030 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 38053
3300015264|Ga0137403_10075760 | All Organisms → cellular organisms → Bacteria | 3407
3300015371|Ga0132258_13562926 | All Organisms → cellular organisms → Bacteria | 1065
3300017659|Ga0134083_10577650 | Not Available | 511
3300017943|Ga0187819_10084945 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1889
3300018006|Ga0187804_10186466 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 883
3300019487|Ga0187893_10723776 | Not Available | 613
3300020580|Ga0210403_10585762 | Not Available | 902
3300020583|Ga0210401_10009088 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 9809
3300021086|Ga0179596_10617836 | Not Available | 550
3300021406|Ga0210386_11297206 | Not Available | 613
3300021432|Ga0210384_10001400 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 32925
3300021476|Ga0187846_10424532 | Not Available | 546
3300021478|Ga0210402_10023953 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 5247
3300021861|Ga0213853_11035323 | Not Available | 692
3300022563|Ga0212128_10325047 | All Organisms → cellular organisms → Bacteria | 962
3300024182|Ga0247669_1058892 | Not Available | 639
3300024246|Ga0247680_1029255 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 797
3300025173|Ga0209824_10088392 | Not Available | 1152
3300025310|Ga0209172_10139810 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Patescibacteria group → Microgenomates group → Candidatus Woesebacteria → Candidatus Woesebacteria bacterium RBG_13_36_22 | 1338
3300025922|Ga0207646_10000015 | All Organisms → cellular organisms → Bacteria | 337243
3300026309|Ga0209055_1040697 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2055
3300026313|Ga0209761_1271028 | Not Available | 620
3300027671|Ga0209588_1053325 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1308
3300027706|Ga0209581_1000113 | All Organisms → cellular organisms → Bacteria | 260613
3300027815|Ga0209726_10089304 | Not Available | 1908
3300027862|Ga0209701_10072253 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2182
3300027875|Ga0209283_10610743 | Not Available | 690
3300027875|Ga0209283_10641711 | Not Available | 669
3300027882|Ga0209590_10151747 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1437
3300027894|Ga0209068_10582276 | Not Available | 650
3300027903|Ga0209488_10134438 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1865
3300028536|Ga0137415_10120397 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 2471
3300031236|Ga0302324_102489956 | Not Available | 632
3300031525|Ga0302326_11724903 | Not Available | 825
3300031718|Ga0307474_11299727 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 574
3300031754|Ga0307475_11297260 | Not Available | 564
3300031823|Ga0307478_10962095 | Not Available | 714
3300031962|Ga0307479_10173782 | All Organisms → cellular organisms → Bacteria → PVC group → Planctomycetes → unclassified Planctomycetota → Planctomycetes bacterium GWF2_39_10 | 2117
3300031962|Ga0307479_11884963 | Not Available | 548
3300032160|Ga0311301_12472178 | Not Available | 582
3300032770|Ga0335085_10096046 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 3845
3300032770|Ga0335085_10339037 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1766
3300032805|Ga0335078_12144433 | Not Available | 592
3300032892|Ga0335081_10035914 | All Organisms → cellular organisms → Bacteria | 7918
3300032897|Ga0335071_12085945 | Not Available | 510
3300032954|Ga0335083_10396338 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1181
3300033417|Ga0214471_10529288 | Not Available | 940
3300033513|Ga0316628_102398087 | Not Available | 698
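
Each scaffold identifier in the table above has the form `<number>|<scaffold name>`, and the numeric prefix appears to correspond to the Taxon OID column of the Associated Samples table (for example, 3300001213 occurs in both). A minimal sketch of splitting the identifiers and counting scaffolds per sample follows; the names `ScaffoldRef` and `parse_scaffold_id` are hypothetical, and the prefix-to-Taxon-OID mapping is an assumption read off this page rather than a documented rule.

```python
from collections import Counter
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    taxon_oid: str      # numeric prefix, assumed to be the sample's Taxon OID
    scaffold_name: str  # assembler-assigned scaffold name


def parse_scaffold_id(scaffold_id: str) -> ScaffoldRef:
    """Split '<taxon_oid>|<scaffold_name>' into its two parts."""
    taxon_oid, sep, scaffold_name = scaffold_id.strip().partition("|")
    if not sep or not taxon_oid.isdigit() or not scaffold_name:
        raise ValueError(f"unexpected scaffold identifier: {scaffold_id!r}")
    return ScaffoldRef(taxon_oid, scaffold_name)


# A few identifiers copied from the table above.
example_ids = [
    "3300006796|Ga0066665_10432259",
    "3300006796|Ga0066665_10432851",
    "3300005167|Ga0066672_10136003",
]

# Number of family scaffolds contributed by each sample in the example list.
scaffolds_per_sample = Counter(parse_scaffold_id(s).taxon_oid for s in example_ids)
print(scaffolds_per_sample)
```

Grouping by the prefix in this way would explain why the family has 135 scaffolds but only 102 associated samples: several samples contribute more than one scaffold.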




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 47.41%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 8.15%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 5.19%
Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 5.19%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 3.70%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 3.70%
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 2.22%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 2.22%
Peatlands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Peatlands Soil | 2.96%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment | 1.48%
Wastewater | Environmental → Aquatic → Freshwater → Drinking Water → Unchlorinated → Wastewater | 1.48%
Thermal Springs | Environmental → Aquatic → Thermal Springs → Hot (42-90C) → Unclassified → Thermal Springs | 1.48%
Hot Spring Sediment | Environmental → Aquatic → Thermal Springs → Sediment → Unclassified → Hot Spring Sediment | 1.48%
Soil | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Soil | 1.48%
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 1.48%
Palsa | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Palsa | 1.48%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Freshwater Sediment | 0.74%
Watersheds | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Watersheds | 0.74%
Groundwater | Environmental → Aquatic → Freshwater → Groundwater → Contaminated → Groundwater | 0.74%
Wetland | Environmental → Aquatic → Marine → Wetlands → Sediment → Wetland | 0.74%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 0.74%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 0.74%
Soil | Environmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil | 0.74%
Microbial Mat On Rocks | Environmental → Terrestrial → Cave → Unclassified → Unclassified → Microbial Mat On Rocks | 0.74%
Biofilm | Environmental → Terrestrial → Cave → Unclassified → Unclassified → Biofilm | 0.74%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 0.74%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 0.74%
Anaerobic Digestor Sludge | Engineered → Wastewater → Anaerobic Digestor → Unclassified → Unclassified → Anaerobic Digestor Sludge | 0.74%




Associated Samples

Taxon OID | Sample Name | Habitat Type
3300001213 | Combined assembly of wetland microbial communities from Twitchell Island in the Sacramento Delta (Jan 2013 JGI Velvet Assembly) | Environmental
3300005167 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121 | Environmental
3300005172 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 | Environmental
3300005445 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG | Environmental
3300005468 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG | Environmental
3300005471 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaG | Environmental
3300005518 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaG | Environmental
3300005531 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen12_06102014_R2 | Environmental
3300005542 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1 | Environmental
3300005554 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 | Environmental
3300005555 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 | Environmental
3300005559 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 | Environmental
3300006034 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105 | Environmental
3300006050 | Freshwater sediment microbial communities from Pennsylvania, USA - Little Laurel Run_MetaG_LLR_2014 | Environmental
3300006796 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 | Environmental
3300006865 | Hot spring sediment bacterial and archeal communities from British Columbia, Canada, to study Microbial Dark Matter (Phase II) - Larsen N4 metaG | Environmental
3300007258 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 | Environmental
3300007265 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 | Environmental
3300009012 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159 | Environmental
3300009038 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG | Environmental
3300009088 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG | Environmental
3300009089 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG | Environmental
3300009090 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG | Environmental
3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental
3300009143 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 | Environmental
3300009157 | Freshwater sediment microbial communities from Prairie Pothole Lake near Jamestown, North Dakota, USA - PPLs Lake P7 Core (1) Depth 19-21cm March2015 | Environmental
3300009444 | Hot spring microbial communities from Beatty, Nevada to study Microbial Dark Matter (Phase II) - OV2 TP3 | Environmental
3300009523 | Peat soil microbial communities from Weissenstadt, Germany - Sb_50d_8_FC metaG | Environmental
3300009777 | Wastewater microbial communities from Netherlands to study Microbial Dark Matter (Phase II) - VDW unchlorinated drinking water | Environmental
3300009824 | Peat soil microbial communities from Weissenstadt, Germany - Sb_50d_6_BS metaG | Environmental
3300010361 | Tropical forest soil microbial communities from Panama - MetaG Plot_23 | Environmental
3300010379 | Sb_50d combined assembly | Environmental
3300010429 | AD_USRAca | Engineered
3300011269 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaG | Environmental
3300011270 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaG | Environmental
3300011271 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaG | Environmental
3300011444 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River MetaG ERMGT800_2 | Environmental
3300012096 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG | Environmental
3300012189 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG | Environmental
3300012202 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG | Environmental
3300012203 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG | Environmental
3300012205 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG | Environmental
3300012209 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaG | Environmental
3300012361 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaG | Environmental
3300012362 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG | Environmental
3300012363 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaG | Environmental
3300012582 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaG | Environmental
3300012685 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaG | Environmental
3300012918 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaG | Environmental
3300012923 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG | Environmental
3300012925 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly) | Environmental
3300012927 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly) | Environmental
3300012929 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly) | Environmental
3300012930 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly) | Environmental
3300013297 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M6-5 metaG | Host-Associated
3300015241 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly) | Environmental
3300015259 | Soil microbial communities from meander-bound floodplain in the East River, Colorado, USA - East River metaG ERMLT730_16_10D | Environmental
3300015264 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly) | Environmental
3300015371 | Combined assembly of cpr5 and col0 rhizosphere and soil | Host-Associated
3300017659 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaG | Environmental
3300017943 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_4 | Environmental
3300018006 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - Control_4 | Environmental
3300019487 | White microbial mat communities from a basaltic lava cave in the Kipuka Kanohina Cave System on the Island of Hawaii, USA - MA170107-4 metaG | Environmental
3300020580 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-M | Environmental
3300020583 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-M | Environmental
3300021086 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_1_08_16fungal (Illumina Assembly) | Environmental
3300021406 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-O | Environmental
3300021432 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M | Environmental
3300021476 | Biofilm microbial communities from the roof of an iron ore cave, State of Minas Gerais, Brazil - TC_06 Biofilm (v2) | Environmental
3300021478 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-M | Environmental
3300021861 | Metatranscriptome of freshwater sediment microbial communities from post-fracked creek in Pennsylvania, United States - ABR_2016 (Metagenome Metatranscriptome) | Environmental
3300022563 | OV2_combined assembly | Environmental
3300024182 | Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK10 | Environmental
3300024246 | Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK21 | Environmental
3300025173 | Wastewater microbial communities from Netherlands to study Microbial Dark Matter (Phase II) - VDW unchlorinated drinking water (SPAdes) | Environmental
3300025310 | Hot spring sediment bacterial and archeal communities from British Columbia, Canada, to study Microbial Dark Matter (Phase II) - Larsen N4 metaG (SPAdes) | Environmental
3300025922 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes) | Environmental
3300026309 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 (SPAdes) | Environmental
3300026313 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes) | Environmental
3300027671 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 (SPAdes) | Environmental
3300027706 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen15_06102014_R2 (SPAdes) | Environmental
3300027815 | Subsurface groundwater microbial communities from S. Glens Falls, New York, USA - GMW46 contaminated, 5.4 m (SPAdes) | Environmental
3300027862 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes) | Environmental
3300027875 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes) | Environmental
3300027882 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes) | Environmental
3300027894 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2012 (SPAdes) | Environmental
3300027903 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 (SPAdes) | Environmental
3300028536 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly) | Environmental
3300031236 | Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - Palsa_T0_1 | Environmental
3300031525 | Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - Palsa_T0_3 | Environmental
3300031718 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_05 | Environmental
3300031754 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515 | Environmental
3300031823 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_05 | Environmental
3300031962 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515 | Environmental
3300032160 | Sb_50d combined assembly (MetaSPAdes) | Environmental
3300032770 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.5 | Environmental
3300032805 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.2 | Environmental
3300032892 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.5 | Environmental
3300032897Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_1.5EnvironmentalOpen in IMG/M
3300032954Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.2EnvironmentalOpen in IMG/M
3300033417Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT142D155EnvironmentalOpen in IMG/M
3300033513Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_M2_C1_D5_CEnvironmentalOpen in IMG/M

Geographical Distribution

Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGIcombinedJ13530_10721123023300001213WetlandMSIFGEVIYSYTDQQALEDGVLVAVPGEGKVNRVTNAVFYHFTKDVGSSPVSGKVTDISLVMEAIRAILTVPADEDGWRKLTYQERELWLLPNETGGLTLMFPTDY*
Ga0066672_1013600333300005167SoilMHKHPLVIVEYTDKQALEDGVLVSVPGEGRVNRVTRAVFDHFTKPMGSSPATGQVIDITPLQDAIRAMLKIEPDEDAWRTGTYEGKEFWLLPNEVGGFTLL
Ga0066683_1052804313300005172SoilTEEKNIISVYTDRQAVEDGVLVAVDGDAGVNRVTRAVFDHFTESMGTSALTGLVTNITPLMEEIRAILKVPPDEDGWRTFTYRGKELWLVPNEVRGLTLMFPDDY*
Ga0066683_1072592913300005172SoilTDAQAREDGVLVAVPGEGGVNRVSRAVFDHFTETMGNSPVTGPVTNIGPLMDAIRAMLKVEPDDGWRVGDYQGRELWLIPNEVGGLTLMFPEDY*
Ga0070708_10135807013300005445Corn, Switchgrass And Miscanthus RhizosphereMDEQSNVIVEYSDRQALEDGVLVSVPGEGKVNRVTRAVFDHFTKPVGSSPATGEVTDITPLQDAIRAMLKVEPDEDAWRTGTYEGKELWLVPNEVGGFTLLFPDDY*
Ga0070707_100000058583300005468Corn, Switchgrass And Miscanthus RhizosphereMDHQPPLIACYTDREALEDGVLVAFPGEGRVNRVTRAVFDHFTKRLGSSPLTGSVTDITLLQEAIRAMLKVESDEDGWRVGSYQEKELWLVPNEVGGFTLLFPDDY*
Ga0070698_100013091103300005471Corn, Switchgrass And Miscanthus RhizosphereMDEESNVIVEYTDRQALEDGVLVSVPGEGKVNRVTRAVFDHFTKPMGSSPQTGEVTDITPLQEAIRAMLKVKPDEDRWRTGTYEGKELWLLPNEVGGFTLLFPDDY*
Ga0070699_10091616923300005518Corn, Switchgrass And Miscanthus RhizosphereMDEESNVIVEYTDRQALEDGVLVSVPGEGKVNRVTRAVFDHFTKPMGSSPQTGEVTDITPLQEAIRAMLKVKPDEDRWRTGTYEGKELWLVPNEVGGFTLLFPDDY*
Ga0070738_10000039763300005531Surface SoilMRAGSFALNSSEGQNGEDGPVTMPMTEDFEVIDSYTDREAVEDGVLVPVSGEGGVNRVTRAVFDRFTKPMGSSPMTGPVTDIGPLMEAIRAMVKVPVDEDGWRTGAYDGEKIWVVPNEIGGLTMMFPEDY*
Ga0070732_1004516523300005542Surface SoilMHEESNVIVEYTDRQALEDGVLVSVPGEGKVNRVTRAVFDHFTKPMGSVPATGEAIDITPLQDAIRAMLKITPDEDKWRTGTYEGKELWLLPNEVGGFTLLFPDDY*
Ga0066661_1004977533300005554SoilVIVEYTDKQALEDGVLVSVPGEGRVNRVTRAVFDHFTKPMGSSPATGQVIDITPLQDAIRAMLKIEPDEDAWRTGTYEGKEFWLLPNEVGGFTLLFPEDY*
Ga0066692_1104395913300005555SoilVLVAVPGDGGVNRVTRAVFDRFTESLGTSPITGPVANIGPLMDAIRAMLAIEPDDGWRTGDYQGKRLWLIPNEVGGLTLMFPEDY*
Ga0066700_1091473813300005559SoilMDEQSNVIVEYSDKQALEDGVLVSVPGEGKVNRVTRAVFDHFTKPMGSSPATGEVTDITPLQDAIRAMLKVKPDEDAWRTGTYEGKELWLLPNEVGGFTLLFPEDC*
Ga0066656_1073317413300006034SoilMEQHDNVISQYTDRQAVEDGVLVAVSGPGGVNRVTRAVFDHFTQALGDSPITGPVTDITKLMDAIRAIVDIPPDADGWRTGAYQDKTLWLVPNEIGGLTLMFPEDY*
Ga0075028_10002321333300006050WatershedsMQEPFVIVEYSDKQALEDGVLVSVPGEGKVNRVTRAVFDHFTKPLGSTPATGQVTDITRLQDAIRTLLTVKPDGDAWRTGTYEGKDLWLIPNEVAGLTLLFPDDY*
Ga0066665_1043225923300006796SoilMDEQSNVIVEYSDKQALEDGVLVYVPGEGKVNRVTRAVFDHFTKPMGSSPATGEVTDITPLQDAIRAMLKIKPDEDIWRTGTYEGKELWLIPNEVGGLTLLFPDDY*
Ga0066665_1043285123300006796SoilMEQHENVISRYTDRQAVEDGVLVPVSGPGGVNRVTRAVFDHFTQQQLGDSPITGPVTDISKLMDAIRAMANIPPDADGWRTGAYQDKTLWLVPNEIGGRTLMFPEDY*
Ga0066665_1137090623300006796SoilGEPISTYTDQQALADGVLVAVPGDGGVNRVTRAMFDHFTESLGTSPITGTVTNIGPLMDAIRAMLAIEPDDGWRTGDYQGNCLWLIPNEVGGLTLMFPEDY*
Ga0073934_1018791823300006865Hot Spring SedimentVISTYTDSQALDDGVLVAVSGEGGVNRVTRAVFDHFAKPMGESPGTGPVIDIGPVMEAIRAMLKIAADQDGWRTGTYQEKMLWLVPNEVKGLTLMFPEDY*
Ga0099793_1060822113300007258Vadose Zone SoilLEDGVLGSVPGEGKVNRVTRAVFDHFTKPMGSSPATGEVTDITPLQDAIRAMLKVKPDEDIWRTGTYEGKELWLIPNEVGGLTLLFPDDY*
Ga0099794_1007147713300007265Vadose Zone SoilMDEQSNVIVEYSDKQALEDGVLVSVPGEGKVNRVTRAVFDHFTKPMGSSPATGEVTDITPLQDAIRAMLKIKPDEDIWRTGTYEGKELWLIPNEVGGLTLLFPDDY*
Ga0099794_1047560123300007265Vadose Zone SoilMQNDPIILSYTDREALADGVLVAFPGEGGVNRVTRAVFDHFTEPMGSSPVTGPVIDIGPLQDAIRAMLKLEPDADGWRVGTWQGKTLWLLPNEVGGLTLMFPDDY*
Ga0066710_10024364323300009012Grasslands SoilMHRALRWNEGGEGNPFLMDEQSNVIVEYSDKQALEDGVLVYVPGEGKVNRVTRAVFDHFTKPMGSSPATGEVTDITPLQDAIRAMLKIKPDEDIWRTGTYEGKELWLIPNEVGGLTLLFPDDY
Ga0099829_1068396013300009038Vadose Zone SoilPHALDAPCFEVERGAGSGARFYMDEQSNVIVEYSDKQALEDGVLVSVPGEGKVNRVTRAVFDHFTKPMGSSPATGEVTDITPLQDAIRAMLKVKPDEDIWRTGTYEGKELWLIPNEVGGLTLLFPDDY*
Ga0099829_1087540223300009038Vadose Zone SoilMDEQSNVIVEYSDKQALEDGVLVSVPGEGKVNRVTRAVFDHFTKPMGSTPRTGEVTDITPLQDAIRAMLKVKPDEDAWRTGTYEGKELWLVPNEVGGFTLLFPDDY*
Ga0099829_1122108213300009038Vadose Zone SoilMHKHPLVIVEYTDKQALEDGLLVSVPGEGRVNRVTRAVFDHFTKPMGSSPATGQVIDITPLQDAIRATLKIEPDEDAWRTGTYEGKELWLLPNEVGGFTLLFPEDY*
Ga0099830_1096413123300009088Vadose Zone SoilMDEQSNVIVEYSDKQALEDGVLVSVPGEGKVNRVTRAVFDHFTKPMGSSPATGEVTDITPLQDAIRAMLKIKPDEDIWRTGTYEGKELWLVPNEVGGFTLLFPDDY*
Ga0099830_1150394413300009088Vadose Zone SoilEKPMDNPLDDLFGKPISVYTDAQALDDGVLVAVPGDGGVNRATRAVFDHFTESLGTSPITGTVTNIGPLMDGIRAMLRIEPDDGWRVGEYQGRELWLIPNELGGLTLMFPEDY*
Ga0099828_1058933023300009089Vadose Zone SoilMDEQSNVIVEYSDRQALEDGVLVSVPGEGKVNRVTRAVFDHFTKPMGSSPVTGEVTDITPLQDAIRAMLKVKPDEDIWRTGTYEGKELWLIPNEVGGLTLLFPDDY*
Ga0099828_1176927313300009089Vadose Zone SoilMHKHPLVIVEYTDKQALEDGVLVSVPGEGRVNRVTRAAFDHFTKPMGTSPATGQVIDITPLQDASRAMLKIEPDEDAWRTGTYEGKEFWLLPN
Ga0099827_1030910423300009090Vadose Zone SoilMYKHQLMIVEYTDKQAVEDGVLVSVPGEGRVNRVTRAVFDHFTKPMGSSPATGQVIDITPLQDAIRAMLKIEPDEDAWRTGTYEGKELWLLPNEVGGFTLLFPEDY*
Ga0066709_10119939513300009137Grasslands SoilMEQHDNVISQYTDRQAVEDGVLVAVSGPGGVNRVTRAVFDHFTRFLGNSPMKGPVIDITPLMEVITAIVDIPPDADGWRTGAYQDKMLWLLPNEIGGLTLMFPDDY*
Ga0099792_1055577323300009143Vadose Zone SoilLVSVPGEGKVNRVTRAVFDHFTKPVGSTPLTGEVTDITLLQDAIRALLKVEPDEDAWRTGTYEGKELWLIPNEVGGLTLLFPDDY*
Ga0105092_1038608613300009157Freshwater SedimentMEDLPIIDIYTDRQAIADGVLVSVPGDGRVNRVTSSVFTHFTRPMGNSSLTGPVTDISPLMDAIRAMLRIPADEDGWRTGDYQGQKLWLLPNEVGGFTLMYPDDY*
Ga0114945_1000413773300009444Thermal SpringsMDDDSRVIVSYTDAQALEDGVLVAFPGEGGANRVTRAVFDFFAKPLGSSPLTGPVTDITPVGDAIRAMLGVAPDADGWRTGSYRGKDLWLVPNEVGGLTLMFPDDY*
Ga0116221_151765713300009523Peatlands SoilMQQEPFVIVEYSDAQALEDGVLVSFPGEGKVNRVTRAVFDHFTKPLGRSPATWEVTDITPLQDAIRAMLKIEPDKDTWRTGTYEGKNLWLIPNEVEGLTLLFPDDY*
Ga0105164_1008596633300009777WastewaterMDQHPLLIVEYSDKQALEDGVLVSVPGEGRVNRVTRAVFDYFTKPMGSSPVTGRVTDITPLQDAIRAMLKIEPDEDGWRTGTYEGKELWLLPNEIGGLTLLFPGDY*
Ga0116219_1067939013300009824Peatlands SoilMQQEPFVIVEYSDAQALEDGVLVSFPGEGKVNRVTRAVFDHFTKPLGRSPATGEVTDITPLQDAIRAMLKIEPDKDTWRTGTYEGKNLWLIPNEVEGLTLLFPDDY*
Ga0126378_1110932613300010361Tropical Forest SoilMEEESNVIVEYTDRQALEDGVLVSVPGEGKVNRVTRAVFDHFTKPMGSSPETGEVTDITRLQEAIRAMLKVEPDEDGWRTGAYEGEELWLVPNEVGGFTLLFPDDY*
Ga0136449_10170556613300010379Peatlands SoilMQQEPFVIVEYSDSQALEDGVLVSFPGEGKVNRVTRAVFDHFTKPLGSTPRTGEVTDITPLQDAIRAMLKIEPDKDTWRTGTYEGKNLWLIPNEVEGLTLLFPDDY*
Ga0116241_1000655463300010429Anaerobic Digestor SludgeMNNFGEVIVSYTDAQALEDGVLVAFPGPGKVNRVTRSVFDHFTNALGSSPVTGTVTDITPVMEAIRAILAVPADEDGWRKLTYREKELWLLPNETGGYTLMFPGDY*
Ga0137392_1053560023300011269Vadose Zone SoilMDEQSNVIVEYSDKQALEDGVLVSVPGEGKVNRVTRAVFDHFTKPMGSTPRTGEVTDITPLQDAIRAMLKVKPDEDAWRTGTYEGKELWLLPNEVGGFTLLFPDDY*
Ga0137392_1073477823300011269Vadose Zone SoilSSEGSHISGKELPHALDASCFEVERGVGRETRFFMDEQSNVIVEYSDKQALEDGVLVSVPGEGKVNRVTRAVFDHFTKPMGSSPATGEVTDITPLQDAIRAMLKVKPDEDIWRTGTYEGKELWLIPNEVGGFTLLFPDDY*
Ga0137391_1136655823300011270Vadose Zone SoilVSKQIDEFFGKPIYAYTDAQALDDGVLVAVPGDGGVNCVTRAVFDHFTESLGTSPITGTVTNIGPLMDAIRAMLAIEPDDGWRTGDYQGKRLWLIPNEVGGLTLMFPEDY*
Ga0137391_1148645813300011270Vadose Zone SoilMDEQSNVIVEYSDRQALEDGVLVSVPGEGKVNRVTRAVFDHFTKPMGSSPATGEVTDITPLQDAIRAMLKVKPDEDIWRTGTYEGKELWLIPNEVGGFTLLFSDDY*
Ga0137391_1155027213300011270Vadose Zone SoilENVISEYTNRQAVEDGVLVAVNGEGGVNRVTRAVFDHFTKSMGNSPITGPVINITPLMDAIRAMVKIPPDEGRWRTGTHHGKTLWLVPNEVNGLTLMFPEDY*
Ga0137393_1107670813300011271Vadose Zone SoilMDEQSNVIVEYSDKQALEDGVLVSVPGEGKVNRVTRAVFDHFTKPVGSSPATGEVTDITPLQDAIRAMLKVKPDEDIWRTGTYEGKELWLIPNEVGGLTLLFPDDY
Ga0137463_116035323300011444SoilMAEDWEVISEYTDEEAIEDGFLVVVPGPGGVNRATTAVFNFFTESLGTTPVTGTVTNIGPLMDAIRAMLNVAPDQDGWRTGDWQGKRLWLVPNEVGGLTLMFPDDY*
Ga0137389_1071139823300012096Vadose Zone SoilQALEDGVLVSVPGEGKVNRVTRAVFDHFTKPMGSSPVTGEVTDITPLQDAIRAMLKVKPDEDIWRTGTYEGKELWLIPNEVGGLTLLFPDDY*
Ga0137389_1081847623300012096Vadose Zone SoilVSKQIDEFFGKPIYAYTDAQALDDGVLVAVPGDGGVNRVTSAVFDHFTESLGTSPITGTVTNIGPLMDAIRAMLAIEPDDGWRTGDYQGKRLWLIPNEVGGLTLMFPEDY*
Ga0137389_1097520823300012096Vadose Zone SoilMANDENVISEYTDRQAIEDGALVAVNGEGGVNRVTRAVFDHFAQTMGSSPLTGPVINIGPLMDAIRAMVKIPPDEGGWRTGAYQGKKLWLVPNEVRGLTLMFPEDY*
Ga0137389_1146397813300012096Vadose Zone SoilMDEQSNVIVEYSDKQALEDGVLVSVPGEGKVNRVTRAVFDHFTKPMGSSPATGEVTDITPLQDAIRALLKVKPDEDAWRTGTYEGKELWLLPNEVGGFTLLFPDDY*
Ga0137388_1041203533300012189Vadose Zone SoilVSKQIDELFGNIYVYTDAQALDDGVLVAVPGDGGVNCVTRAVFDHFTESLGTSPITGTVTNIGPLMDAIRAMLAIEPDDGWRTGDYQRKRLWLIPNEVGGLTLMFPEDY*
Ga0137388_1052895413300012189Vadose Zone SoilMHKHPLVIVEYTDKQALEDGLLVSVPGEGRVNRVTRAVFDHFTKPMGSSPATGQVIDITPLQDAIRAMLKIEPDEDAWRTGTYEGKELWLLPNEVGGFTLLFPEDY*
Ga0137388_1099831813300012189Vadose Zone SoilMDQDRQIIVAYTDAEAVADGVLVAFPGERGVNRITRAVFDSFAKPMGSSPATGQVTDITPIQDAIRALLKVEPDERGWRTGAYQGKDLWLVPNEAEGLTLLFPDDY*
Ga0137363_1002522633300012202Vadose Zone SoilMDEQSNVIVEYSDKQALEDGVLVSVPGEGKVNRVTRAVFDHFTEPMGSTPRTGEVTDITPLQDAIRAMLKVKPDEDAWRTGTYEGKDLWLVPNEVGGFTLLFPDDY*
Ga0137363_1006988023300012202Vadose Zone SoilMNKPFEIVAYTDKQALEDGVLVSVPGEGKVNRVTRAVFDHFTAPMGSSPATGQVKDITRLQDAIRALLKVEPDQDAWRTGTYEGKELWLIPNEVAGLTLLFPDDY*
Ga0137363_1075969013300012202Vadose Zone SoilRGQPYSGKELMHALDAPRFEVERGAGKEARFFMDEQSNVIVEYSDRQALEDGVLVSVPGEGKVNRVTRAVFDHFTKPMGSSPATGQVTDITPLQDAIRALLKVKHDEDAWRTGTYEGKELWLLPNEVGGFTLLFPDDY*
Ga0137363_1088448313300012202Vadose Zone SoilMDEQSNVVVEYSDKQALEDGVLVSVPGEGKVNRVTRAVFDHFTKPMESSPATGQATDITRLQDAIRAMLKIEPDEDAWRTGTYEGKELWLLPNEVGGFTLLFPDDY*
Ga0137363_1124052513300012202Vadose Zone SoilKFGRDESQGQAAAPSMAEDDNLIVEYTDRQAVEDGVLVPVPGEGSVNRVTRAVFDYFTRRIGTSPITGPVTNIGPLMDAIRAMLAIKPDAEGWRTGTWRGKELWLLPNEIGGLTLMFPEDY*
Ga0137399_1025737123300012203Vadose Zone SoilMHCALRWNEGGEGNPFLMDEQSNVIVEYSDKQALEDGVLVSVPCEGKVNRVTRAVFDHFTKPMGSSPATGEVTDITPLQDAIRAMLKVKPDEDIWRTGTYEGKELWLVPNEVGGFTLLFPDDY*
Ga0137362_1159259513300012205Vadose Zone SoilMDEQSNVIVEYSDRQALEDGVLVSVPGEGKVNRVTRAVFDHFTKPMGSSPATGEVTDITPLQDAIRAMLKVKPDEDIWRTGTYEGKELWLIPNEVGGLTLLFPDDY*
Ga0137362_1179074713300012205Vadose Zone SoilTGRGARLFMENQPEVIAEYTDKQALEDGVLVSVSGEGKVNRVTRAVFDHFTTPMGSTSLTGEAADLVRLQDAIRALLKIESDEDGWRTGSHEGKELWLVPNEVEGLTLLFPDDY*
Ga0137379_1174424713300012209Vadose Zone SoilQQALQDGVLVAVPDEGGVNRVTRAVFDHFTESLGTSPITGPVANIGPLMDAIRAMLQIEPDDGWRVGEYQGRELWLIPNEVGGLTLMFPEDY*
Ga0137360_1016325423300012361Vadose Zone SoilMQKEPFVIDVYSDKQALEDGVLVSVPGEGRVNRVTRAVFDNFTEPLGSSPATDQVNDITRLQDAIRAMLKVEPDQDAWRTGTYEGKGLWLIPNEVGGLTLLFPDDY*
Ga0137360_1043281313300012361Vadose Zone SoilMEKQPFVIDEYSDKQALEDGVLVSVPGEGKVNRVTRAVFDHFTKPMGSSPETGQVTDITRLQEAIRALLKIEPDEDAWRTGAYEGKELWLIPNEVGGLTLLFPDDY*
Ga0137361_1001714563300012362Vadose Zone SoilMEKQPFVIDEYSDKQALEDGVLVSVPGEGKVNRVTRAVFDHFTKPMGSSPETGQVTDITRLQEAIRALLKIEPDEDAWRTGAYEGKEVWLIPNEVGGLTLLFPDDY*
Ga0137361_1027411223300012362Vadose Zone SoilMDEQSNVIVEYSDKQALEDGVLVSVPGEGKVNRVTRAVFDHFTKPMGSSPATGEVTDITPLQDAIRAMLKVKPDEDAWRTGTYEGKDLWLVPNEVGGFTLLFPDDY*
Ga0137390_1053761223300012363Vadose Zone SoilMDEQSNVIVEYSDRQALEDGVLVSVPGEGKVNRVTRAVFDHFTKPTGSSPATGEVTDITPLQHAIRAMLKVKPDEDIWRTGTYEGKELWLIPNEVGGFTLLFPDDY*
Ga0137390_1080917223300012363Vadose Zone SoilMAEDNNVIFEYTDRQAVEDGVLVPVSSEGGVNRVTRAIFDYLTERIGTSPITGPVTNIGPLMDAIRAMLAIKPDADGWRTGTYRGKELWLLPNEIGGLTLMFPEDY*
Ga0137390_1133683813300012363Vadose Zone SoilSDQIDEFFGGPISTYTDQQALEDGILVAIPGDGGVNRVTRAGFDYFVIPEDDPAQEVSNITPPMDAIRAMLQIAPDDGWRVGEYRGRDLWLIPNEVGGLTLMFPEDY*
Ga0137358_1039297813300012582Vadose Zone SoilMDEQSNVIVEYSDKQALEDGVLVSVPGEGKVNRVTRAVFDHFTKPMGSSPATGEVTDITPLQDAIRAMLKVKPGEDAWRTGTYEGKDLWLVPNEVGGFTLLFPDDY*
Ga0137397_1002761753300012685Vadose Zone SoilMDHQSNVIVEYSDKQALEDGVLVSVPGEGKVNRVTRAVFDHFTKPMGSSPATGEVMDITPLQDAIRAMLKVKLDEDAWRTGTYEGKELWLLPNEVGGFTLLFPDDY*
Ga0137397_1008130123300012685Vadose Zone SoilMDEQSNVIVEYSDRQALEDGVLVSVPGEGKVNRVTRAVFDHFTKPMGSSPATGEVTDITPLQDAIRAVLKVKHDEDIWRTGTYEGKELWLVPNEVGGFTLLFPDDY*
Ga0137396_1054576213300012918Vadose Zone SoilLVIVEYTDKQALEDGVLVSVTGEGRVNRVTRAVFDHFTKPMGSYPATGQVIDITPLQDAIRAMLKIEPDEEAWRTGTYEGKELWLLPNEVGGFTLLFPEDY*
Ga0137396_1068320513300012918Vadose Zone SoilMDEQSNVIVEYSDKQALEDGVLVSVPGEGKVNRVTRAVFDHFTKPMGSSPVTGEVTDITPLQDAIRAMLKVKPDEDAWRTGTYEGKELWLVPNEVGGFTLLFPDDY*
Ga0137359_1000203113300012923Vadose Zone SoilMDEQSNVIVEYSDRQALEDGVLISVPGEGKVNRVTRAVFDHFTKPMGSSPATGEVTDITPLQDAIRAMLKVKPDEDIWRTGTYEGKEL
Ga0137359_1014957223300012923Vadose Zone SoilMAEDNNVIFEYTDRQAVEEGVLVPVSGEGGVNRITRAVFDYFTGRIGTSPITGPVTNIGPLTNAIRAMLGIKADGDGWRRGTYRGKELWLLPNEIGGLTLLFPEDIRRI*
Ga0137419_1042800813300012925Vadose Zone SoilVENQPEVIAEYTDKQALADGVLVSVSGEGKVNRVTRAVFDHFTKPMGSSPVTGEVTDITPLQDAIRAMLKVKPDEDAWRTGTYEGKELWLVPNEVGGFTLLFPDDY*
Ga0137416_1069721623300012927Vadose Zone SoilMDEQSNVIVEYSDKQALEDGVLVSVPCEGKVNRVTRAVFDHFTKPMGSSPATGEVTDITPLQDAIRAMLKVKPDEDAWRTGTYEGKELWLVPNEVGGFTLLFPDDY*
Ga0137404_1012188023300012929Vadose Zone SoilMHRALRWNEGGEGNPFLMDEQSNVIVEYSDKQALEDGVLVSVPGEGKVNRVTRAVFDHFTKPMGSSPATGEVTDITPLQDAIRAMLKVKPDEDVWRTGTYEGKELWLIPNEVGGLTLLFPDDY*
Ga0137404_1233733423300012929Vadose Zone SoilLADGVLVAVPGDGGVNSVTRAVFDHFTESLGTSPITGPVANIGPLMDAICAMLAIEPDDGWRTGDYQGMRLWLIPNEVGGLTLMFPEDY*
Ga0137407_1007127933300012930Vadose Zone SoilMHRALRWNEGGEGNPFLMDEQSNVIVEYSDKQALEDGVLVSVPGEGKVNRVTRAVFDHFTKPMGSSPATGEVTDITPLQDAIRAMLKVKPDEDVWRTGTYEGKELWLVPNEVGGLTLLFPDDY*
Ga0137407_1092119613300012930Vadose Zone SoilMDELFGHIYVYTDAQALDDGVLVAVPGDGGVNCVTRAVFDHFTESLGTSPITGTVTNIGPLMDAISAMLVIEPDDGLRVGEYQGRDLWLIPNELGGLTLLFPEDY*
Ga0157378_1293183113300013297Miscanthus RhizosphereIIEYTDQQALEDGVLVAITGEGGVNRVTRAVFDHFTQAVGSSPITGPVTNVTPLMDAIRAVLHLPPDEDGWRIGKHHDRELWLVPNEVGGLTLMFPEDY*
Ga0137418_1006527243300015241Vadose Zone SoilMHCALRWNEGGEGNPFLMDEQSNVIVEYSDKQALEDGVLVSVPGEGKVNRVTRAVFDHFTKPMGSSPVTGEVTDITPLQDAIRAMLKVKPDEDAWRTGTYEGKELWLVPNEVGGFTLLFPDDY*
Ga0180085_1000030263300015259SoilMTDDANVIYSYTDRQAVEDGVLVPVDGEGQVNRVTRAVFDHFTESMGSSALTGPVIDITPLKEVIREILRASPDEDGWRKLTWQGKELWLVPNEVRGLTLMFPDDY*
Ga0137403_1007576033300015264Vadose Zone SoilMHRALRWNEGGEGNPFLMDEQSNVIVEYSDKQALEDGVLVSVPGEGKVNRVTRAVFDHFTKPMGSSPATGEVTDITPLQDAIRAMLKVKPDEDIWRTGTYEGKELWLIPNEVGGLTLLFPDDY*
Ga0132258_1356292623300015371Arabidopsis RhizosphereMAKDFEVISSYTDREAVEDGVLVAVPGQGGVNRVTRSVFDRFTSPMGSSPTTGPVVDIGPLMEAIRAMVKIPADADGWRTGAWKGQRLWVIPNEVGGLTMMFPEDY*
Ga0134083_1057765023300017659Grasslands SoilMEQHDNVISQYTDRQAVEDGVLVAVSGPGGVNRVTRAVFDHFTRFLGNSPMKGPVIDITPLMEVITAIVDIPPDADGWRTGAYQDKMLWLLPNEIGGLAL
Ga0187819_1008494523300017943Freshwater SedimentMQNEPFVIAEYTDRQALEDGVLVSVPGEGKVNRVTRAVFDHFTKPMGSSPAGQVTDITRLQDAIRALLKVEPDEDAWRTGTYEGKELWLIPNEVGGLTLLFPDDY
Ga0187804_1018646613300018006Freshwater SedimentMQNEPFVIAEYTDRQALEDGVLVSVPGEGKVNRVTRAVFDHFTKPMGSSPAGQVTDITRLQDAIRALLKVEPDEDAWRTGTYEGKELWLV
Ga0187893_1072377623300019487Microbial Mat On RocksHAMVENPNVVFEYTDRQALEDGFLVAVSGDGSVNRVTRAVFDHFVQPMGESPVTGPVMNIGPLMEAIRETLKVAPDSDGWRSGDYQGKRLWLVPNEVRGLTLMFPEDR
Ga0210403_1058576213300020580SoilMENQPFVIDEYSDKQALEDGVLVSVPGEGKVNRVTRAAFDYFTEPMGSSPATGQVNDITRLQDAIRALLKVEPDQGAWRTGTYERKKLWLIPNEVGGLTLLFPDDY
Ga0210401_1000908813300020583SoilLSWNEGGGEGARTLMQKEPFLIAEYTDKQALEDGVLVSVPGEGKVNRVTGAVFDHFTKPVGNSPATGQVTDITRLQEAIRAMLKVEPDQDAWRTGTYQGKELWLLPNEVGGFTLLFPDYY
Ga0179596_1061783613300021086Vadose Zone SoilYMNEQSNVIVEYSDKQALEDGVLVSVPGEGKVNRVTRAVFDHFTKPVGSTPLTGEVTDITPLQDAIRALLKVEPDEDAWRTGTYEGKELWLVPNEVGGFTLLFPDDY
Ga0210386_1129720613300021406SoilMQKEPFLIAEYTDKQALEDGVLVSVPGEGKVNRVTGAVFDHFTKPVGNSPATGQVTDITRLQEAIRAMLKVEPDQDAWRTGTYQGKELWLLPNEVGGFTLLFPDYY
Ga0210384_10001400303300021432SoilMQKQPFMIVEYSDQQALEDDVLVSVPAEGGVNRVTRAVFDHFTQAMGSGPATSHVTDITRLQDAIRALLNVEPDKDAWRTGSYEEKSYG
Ga0187846_1042453213300021476BiofilmMHKQPFVVVEYTDKQALEDGVLVSIPGEGKVNRVTRAVFDHFAKPVGSTPLTGQVTDITPLQDAIRAMLKIEPDKDAWRTGTYEGKEMWLVPNEVGGFTLLFPEDY
Ga0210402_1002395323300021478SoilMQNEPFVISEYTDKQAVEDGVLVSVSGEGRVNRVTRAVFDHFTKPLASSPATDQVNDITRLQDAIRALLKVEPDQDAWRTGAHEGKELWLIPNEVGGLTLLFPDDY
Ga0213853_1103532313300021861WatershedsVEFFGEPISVYTDAQAREDGFLVAVPGPGGVNRVTRAVFDHFVEPMGTSPLTGPVTDITPLMDAIRAMLQIPPDDGWRVGEYRGKRLWLIPNEVRGLTLMFPEDY
Ga0212128_1032504723300022563Thermal SpringsMDDDSRVIVSYTDAQALEDGVLVAFPGEGGANRVTRAVFDFFAKPLGSSPLTGPVTDITPVGDAIRAMLGVAPDADGWRTGSYRGKDLWLVPNEVGGLTLMFPDDY
Ga0247669_105889213300024182SoilPSFMEEQPIVIVEYSDKQALEDGVLVSVPGEGKVNRVTRAVFDHFTKPMGSDPQTGEVIDITRLQDAIRALLKIEPDQDAWRTGTYDGKELWVVPNEVGGFTLLFPDDY
Ga0247680_102925523300024246SoilMEEQPIVIVEYSDKQALEDGVLVSVPGEGKVNRVTRAVFDHFTKPMGSDPQTGEVIDITRLQDAIRALLKIEPDQDAWRTGTYDGKELWVVPNEVGGFTLLFPDDY
Ga0209824_1008839233300025173WastewaterMDQHPLLIVEYSDKQALEDGVLVSVPGEGRVNRVTRAVFDYFTKPMGSSPVTGRVTDITPLQDAIRAMLKIEPDEDGWRTGTYEGKELWLLPNEIGGLTLLFPGDY
Ga0209172_1013981023300025310Hot Spring SedimentVISTYTDSQALDDGVLVAVSGEGGVNRVTRAVFDHFAKPMGESPGTGPVIDIGPVMEAIRAMLKIAADQDGWRTGTYQEKMLWLVPNEVKGLTLMFPEDY
Ga0207646_100000151063300025922Corn, Switchgrass And Miscanthus RhizosphereMDHQPPLIACYTDREALEDGVLVAFPGEGRVNRVTRAVFDHFTKRLGSSPLTGSVTDITLLQEAIRAMLKVESDEDGWRVGSYQEKELWLVPNEVGGFTLLFPDDY
Ga0209055_104069723300026309SoilVIVEYTDKQALEDGVLVSVPGEGRVNRVTRAVFDHFTKPMGSSPATGQVIDITPLQDAIRAMLKIEPDEDAWRTGTYEGKEFWLLPNEVGGFTLLFPEDY
Ga0209761_127102813300026313Grasslands SoilMHKHPLVIVEYTDKQALEDGVLVSVPGEGRVNRVTRAVFDHFTKPMGSSPATGQVIDITPLQDAIRAMLKIEPDEDAWRTGTYEGKELWLLPNEVGGFTLLFPEDY
Ga0209588_105332543300027671Vadose Zone SoilWEGNPFLMDEQSNVIVEYSDKQALEDGVLVSVPGEGKVNRVTRAVFDHFTKPMGSSPATGEVTDITPLQDAIRAMLKVKPDEDIWRTGTYEGKELWLIPNEVGGLTLLFPDDY
Ga0209581_1000113683300027706Surface SoilMRAGSFALNSSEGQNGEDGPVTMPMTEDFEVIDSYTDREAVEDGVLVPVSGEGGVNRVTRAVFDRFTKPMGSSPMTGPVTDIGPLMEAIRAMVKVPVDEDGWRTGAYDGEKIWVVPNEIGGLTMMFPEDY
Ga0209726_1008930433300027815GroundwaterMDMDQNPPVIVAYTDAQALADGVLVAFPGEGGVNRITRAVFDHFAKPMGSSPATGQVTDITPIQDAIRAMLKVEPDKDGWRTGTHQGKELWLVPNEVRGLTLLFPDDY
Ga0209701_1007225323300027862Vadose Zone SoilMHRALRWNEDGEGNPFLMDEQSNVIVEYSDKQALEDGVLVSVPGEGKVNRVTRAVFDHFTKPMGSTPRTGEVTDITPLQDAIRAMLKVKPDEDAWRTGTYEGKELWLVPNEVGGFTLLFPDDY
Ga0209283_1061074313300027875Vadose Zone SoilMDEQSNVIVEYSDRQALEDGVLVSVPGEGKVNRVTRAVFDHFTKPMGSSPVTGEVTDITPLQDAIRAMLKVKPDEDIWRTGTYEGKELWLIPNEVGGLTLLFPDDY
Ga0209283_1064171113300027875Vadose Zone SoilVIVEYSDKQALEDGVLVSVPGEGKVNRVTRAVFDHFTKPMGSSPATGEVTDITPLQDAIRAMLKVKPDEDAWRTGTYEGKELWLLPNEVGGFTLLFPDDY
Ga0209590_1015174713300027882Vadose Zone SoilMYKHQLMIVEYTDKQAVEDGVLVSVPGEGRVNRVTRAVFDHFTKPMGSSPATGQVIDITPLQDAIRAMLKIEPDEDAWRTGTYEGKELWLLPNEVGGFTLLFPEDY
Ga0209068_1058227623300027894WatershedsMQEPFVIVEYSDKQALEDGVLVSVPGEGKVNRVTRAVFDHFTKPLGSTPATGQVTDITRLQDAIRTLLTVKPDGDAWRTGTYEGKDLWLIPNEVAGLTLLFPDDY
Ga0209488_1013443823300027903Vadose Zone SoilMDEQSNVIVEYSDKQALEDGVLVSVPGEGKVNRVTRAVFDHFTKPMGSSPATGEVTDITPLQDAIRAMLKIKPDEDIWRTGTYEGKELWLVPNEVGGFTLLFPDDY
Ga0137415_1012039733300028536Vadose Zone SoilMDEQSNVIVEYSDRQALEDGVLVSVPGEGKVNRVTRAVFDHFTKPMGSSPATGEVTDITPLQDAIRAMLKVKPDEDAWRTGTYEGKELWLVPNEVGGFTLLFPDDY
Ga0302324_10248995613300031236PalsaIEDGVLVSVSSLGEGKVNRVTRAVFDHFTKPMGSTGAIDITRLREAIVALLKVKPDQDAWRTGAYEGKKLWLIPNEVGGLTLLFPDDY
Ga0302326_1172490323300031525PalsaCEYTDKQAIEDGVLVSVSSLGEGKVNRVTRAVFDHFTKPMGSTGAIDITRLREAIVALLKVKPDQDAWRTGAYEGKKLWLIPNEVGGLTLLFPDDY
Ga0307474_1129972713300031718Hardwood Forest SoilILVSIPGEGKVNRVTRAVFDHFTKPIWSSLLVGEVTDITPLQKVIRALLKIEPDQDAWRTGTYEGEDLWLIPNEVGGLTLLFPEDY
Ga0307475_1129726013300031754Hardwood Forest SoilLWEEFFKFGGDEQQGRAVAASMTNNDNVIIAYTDQQAVEDGVLVAVNGASGVNRVTRAVFDDFTKSIGSSPLTGPVIDITPLTEAIRAMVKIPADEDGWRTGMLQGKKLWLVPNEVDGLTLMFPEDY
Ga0307478_1096209513300031823Hardwood Forest SoilMDEESNVIVEYSDRQALEDGVLVSVPGEGKVNRVTRAVFDHFTKPMGSSPQTGEVTDITPLQDAIRAMLKIKLDEDKWRTGTYEGKELWLLPNEVGGFTLLFPDDY
Ga0307479_1017378223300031962Hardwood Forest SoilMDDQSNVIVEYTDKQALEDGVLVSVPGEGNVNRVTRAVFDHFTKPMGSSPQTGEVTDITPLQDAIRAMLKVEPDEDKWRTGTYEGKELWLLPNEVGGFTLLFPDDY
Ga0307479_1188496313300031962Hardwood Forest SoilMDEQSNVIVEYTDRQALEDGVLVSVPGEGKVNRVTRAVFDHFAKPMGSSPQTGEVTDITPLQDAIRALLKVEPDDDAWRTGTYEGKELWLVPNEVGGFTLLFPDDY
Ga0311301_1247217813300032160Peatlands SoilMQQEPFVIVEYSDAQALEDGVLVSFPGEGKVNRVTRAVFDHFTKPLGRSPATGEVTDITPLQDAIRAMLKIEPDKDTWRTGTYEGKNLWLIPNEVEGLTLLFPDDY
Ga0335085_1009604623300032770SoilMQLHSQLCLEAERGAGRGARNSMDDNMPVISEYRDDQALEDGVLVSVSGEGKVNRVTRAVFDHFTKPMGSSPANGEVTDITRLTEAIRALLKVAPDQDAWRTGAYEGKELWLVPNEVGGFTLLFPDDY
Ga0335085_1033903733300032770SoilMKIIAEYTDRRALADGVLVAFPGEGKVNRVTRAVFDYFTKSMGSTPETGEVIDITRLQDAIRILLNQLPDVDGWRTGAYEGKDLWLIPNEVAGLTLLFPDDY
Ga0335078_1214443313300032805SoilMQNEPFVIVEYSDQQAVEDGVLVSVSGEGKVNRVTRAVFDHFTRPLESSPATDQVNDITRLQDAIRALLKVEPDQDAWRTGTYVGKELWLIPNEVGGLTLLFP
Ga0335081_1003591423300032892SoilMQNEPFVIVEYSDQQAVEDGVLVSVSGEGKVNRVTRAVFDHFTRPLESSPATDQVNDITRLQDAIRALLKVEPDQDAWRTGTYVGKELWLIPNEVGGLTLLFPDDY
Ga0335071_1208594513300032897SoilKIIAEYTDRRALADGVLVAFPGEGKVNRVTRAVFDYFTKSMGSTPETGEVIDITRLQDAIRILLNQLPDVDGWRTGAYEGKDLWLIPNEVAGLTLLFPDDY
Ga0335083_1039633823300032954SoilMKIIAEYTDRRALADGVLVAFPGEGKVNRVTRAVFDYFTKSMGSTPETGEVIDITRLQDAIRILLNQLPDVDGWRTGAYEGKDLWLIPNEV
Ga0214471_1052928813300033417SoilMAENREVISTYTDEQAVEDGVLVAVPGDGGVNRVTRAVFDHFTEPMGESPMTGPVTNIGPLFQAIRAMLKIAPDEGGWRTGEYQSKQLWLVPNELGALTLMFPEDY
Ga0316628_10239808713300033513SoilLGLPIAVQGARHKLGRMAGTGGATHMDDFGELISSYSDRDAVADGVLVPIPGEGHVNRVTRAVFDHFTESLGSSPITGPVTNIGPLMEAIRAVLKIQADSDGWRTLTYQGKELWLVPNETGGLTLMFPEDY
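
Each row of the Family Sequences listing above runs its four fields together with no delimiter. The following is a minimal sketch of how such a row might be split, assuming the layout inferred from the rows themselves: a protein ID, then a 10-digit Sample Taxon ID, then a habitat label that ends in a lowercase letter, then an all-uppercase residue string with an optional trailing "*" (assumed here to mark a translated stop). The names `FamilyRow`, `ROW_PATTERN`, and `parse_family_row` are hypothetical helpers for illustration and are not part of NMPFamsDB.

```python
import re
from typing import NamedTuple, Optional


class FamilyRow(NamedTuple):
    protein_id: str
    taxon_id: str
    habitat: str
    sequence: str
    ends_with_stop: bool  # True when the row ends with "*"


# Assumed layout (inferred from the listing, not an official format):
#   protein ID (no spaces), 10-digit Taxon ID, habitat label that starts
#   with an uppercase letter and ends with a lowercase one, then uppercase
#   residues with an optional trailing "*".
ROW_PATTERN = re.compile(
    r"^(?P<protein_id>\S+?)"
    r"(?P<taxon_id>\d{10})"
    r"(?P<habitat>[A-Z][A-Za-z, ]*[a-z])"
    r"(?P<sequence>[A-Z]+)"
    r"(?P<stop>\*?)$"
)


def parse_family_row(line: str) -> Optional[FamilyRow]:
    """Split one concatenated row; return None if it does not fit the layout."""
    match = ROW_PATTERN.match(line.strip())
    if match is None:
        return None
    return FamilyRow(
        protein_id=match.group("protein_id"),
        taxon_id=match.group("taxon_id"),
        habitat=match.group("habitat"),
        sequence=match.group("sequence"),
        ends_with_stop=match.group("stop") == "*",
    )


if __name__ == "__main__":
    # One row copied from the listing above.
    example = (
        "Ga0066692_1104395913300005555Soil"
        "VLVAVPGDGGVNRVTRAVFDRFTESLGTSPITGPVANIGPLMDAIRAMLAIEPDDGWRTGDYQGKRLWLIPNEVGGLTLMFPEDY*"
    )
    row = parse_family_row(example)
    if row is not None:
        print(row.protein_id, row.taxon_id, row.habitat, len(row.sequence))
```

The split relies on the habitat label being the only part of a row that contains lowercase letters after the Taxon ID, so the last lowercase character marks where the sequence begins. Rows that break this assumption come back as `None` and would need to be checked by hand.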

© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming"