NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F069197



Overview

Basic Information
Family ID F069197
Family Type Metagenome / Metatranscriptome
Number of Sequences 124
Average Sequence Length 192 residues
Representative Sequence MDLAQGAPKDLVSRRQALVLLGAVAAAHGADAAEQAIMQPVSLDHVNIRVSNVAKTAEFYMGLFDTLVLRNAALRAQPTSPPSEGFFLKFGDGYLAISQAFAPDTPDLDHYSVGLRDYEKTRLTARLQDNGIAVPPRSSTDVWLADLDGALMQLRQPGGWARQTAAPYQGPARVGPVLSPLSMSRIGLR
Number of Associated Samples 102
Number of Associated Scaffolds 124

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 66.94 %
% of genes near scaffold ends (potentially truncated) 99.19 %
% of genes from short scaffolds (< 2000 bps) 93.55 %
Associated GOLD sequencing projects 96
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.59

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
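
The percentages in the Quality Assessment table read as fractions of the 124 family members. The sketch below shows how such values could be derived from gene counts. The counts (83, 123, 116) are assumptions back-calculated from the rounded percentages, not values reported on this page, and `percent_of_family` is a hypothetical helper name.

```python
FAMILY_SIZE = 124  # "Number of Sequences" from Basic Information


def percent_of_family(count: int, total: int = FAMILY_SIZE) -> float:
    """Return count as a percentage of total, rounded to two decimals."""
    return round(100.0 * count / total, 2)


# Assumed counts, inferred from the rounded percentages on this page.
assumed_counts = {
    "valid RBS motif": 83,
    "near scaffold end": 123,
    "short scaffold (< 2000 bps)": 116,
}

for label, count in assumed_counts.items():
    print(f"{label}: {percent_of_family(count)} %")
```

The short-scaffold count can be cross-checked against the Associated Scaffolds table, where 8 of the 124 listed scaffolds are 2000 bps or longer.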

Most Common Taxonomy
Group Unclassified (70.161 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil (28.226 % of family members)
Environment Ontology (ENVO) Unclassified (30.645 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (61.290 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: Yes
Secondary Structure distribution: α-helix: 19.35%, β-sheet: 20.74%, Coil/Unstructured: 59.91%

Predicted 3D Structure

pTM-score: 0.59

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
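
The note above applies a simple cutoff on the pTM-score. As a minimal sketch, the check can be written as follows. The 0.7 threshold is the one quoted in the note, while the function name `is_low_confidence` is a hypothetical choice for illustration.

```python
PTM_THRESHOLD = 0.7  # cutoff quoted in the note above


def is_low_confidence(ptm_score: float, threshold: float = PTM_THRESHOLD) -> bool:
    """Return True when the model falls below the pTM cutoff."""
    return ptm_score < threshold


# pTM-score reported for family F069197
print(is_low_confidence(0.59))
```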



Gene Neighborhood

Neighboring Pfam domains

Pfam ID   Name             % Frequency in 124 Family Scaffolds
PF03972   MmgE_PrpD        2.42
PF00155   Aminotran_1_2    1.61
PF07731   Cu-oxidase_2     0.81
PF12974   Phosphonate-bd   0.81
PF04229   GrpB             0.81
PF04828   GFA              0.81
PF00108   Thiolase_N       0.81
PF02518   HATPase_c        0.81

Neighboring Clusters of Orthologous Genes (COGs)

COG ID   Name   Functional Category   % Frequency in 124 Family Scaffolds
COG2079   2-methylcitrate dehydratase PrpD   Carbohydrate transport and metabolism [G]   2.42
COG0183   Acetyl-CoA acetyltransferase   Lipid transport and metabolism [I]   0.81
COG2132   Multicopper oxidase with three cupredoxin domains (includes cell division protein FtsP and spore coat protein CotA)   Cell cycle control, cell division, chromosome partitioning [D]   0.81
COG2320   GrpB domain, predicted nucleotidyltransferase, UPF0157 family   General function prediction only [R]   0.81
COG3791   Uncharacterized conserved protein   Function unknown [S]   0.81



Phylogeny

NCBI Taxonomy

Name            Rank   Taxonomy        Distribution
Unclassified    root   N/A             70.16 %
All Organisms   root   All Organisms   29.84 %


Associated Scaffolds


Scaffold   Taxonomy   Length
3300000597|AF_2010_repII_A1DRAFT_10148061   Not Available   565
3300001661|JGI12053J15887_10576372   Not Available   536
3300002245|JGIcombinedJ26739_101389460   All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Corynebacteriales → Nocardiaceae → Rhodococcus → Rhodococcus erythropolis group → Rhodococcus erythropolis   595
3300002910|JGI25615J43890_1105069   Not Available   505
3300002914|JGI25617J43924_10032320   All Organisms → cellular organisms → Bacteria → Proteobacteria   1866
3300002914|JGI25617J43924_10325760   Not Available   531
3300004080|Ga0062385_10877458   Not Available   593
3300005184|Ga0066671_10983618   Not Available   531
3300005332|Ga0066388_103048130   Not Available   856
3300005332|Ga0066388_106527584   Not Available   588
3300005542|Ga0070732_10866040   Not Available   552
3300005764|Ga0066903_103212260   Not Available   884
3300005764|Ga0066903_103466416   Not Available   850
3300005921|Ga0070766_10595641   Not Available   742
3300006057|Ga0075026_100380742   All Organisms → cellular organisms → Bacteria → Proteobacteria   789
3300006086|Ga0075019_10723228   Not Available   631
3300006086|Ga0075019_10890038   Not Available   571
3300006102|Ga0075015_100138640   All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria   1256
3300006174|Ga0075014_100253873   All Organisms → cellular organisms → Bacteria → Proteobacteria → unclassified Proteobacteria → Proteobacteria bacterium   909
3300006176|Ga0070765_101885245   Not Available   560
3300006354|Ga0075021_10864221   Not Available   586
3300007258|Ga0099793_10153623   All Organisms → cellular organisms → Bacteria   1091
3300009090|Ga0099827_10178604   All Organisms → cellular organisms → Bacteria → Proteobacteria   1754
3300009525|Ga0116220_10088176   All Organisms → cellular organisms → Bacteria   1310
3300009792|Ga0126374_10021951   All Organisms → cellular organisms → Bacteria → Proteobacteria   2817
3300010043|Ga0126380_11360492   Not Available   621
3300010046|Ga0126384_10961541   Not Available   775
3300010047|Ga0126382_11870645   Not Available   567
3300010049|Ga0123356_11088263   All Organisms → cellular organisms → Bacteria   968
3300010162|Ga0131853_10725852   Not Available   817
3300010360|Ga0126372_10944154   Not Available   869
3300010362|Ga0126377_10284887   All Organisms → cellular organisms → Bacteria → Proteobacteria   1623
3300010366|Ga0126379_10051194   All Organisms → cellular organisms → Bacteria → Proteobacteria   3401
3300010366|Ga0126379_12097502   Not Available   667
3300010398|Ga0126383_11752485   Not Available   710
3300010398|Ga0126383_13573815   Not Available   508
3300010880|Ga0126350_11720712   Not Available   714
3300010880|Ga0126350_11999912   Not Available   738
3300011270|Ga0137391_10499050   Not Available   1030
3300011271|Ga0137393_10438601   All Organisms → cellular organisms → Bacteria   1119
3300012189|Ga0137388_10270018   All Organisms → cellular organisms → Bacteria → Proteobacteria   1553
3300012189|Ga0137388_10741050   Not Available   912
3300012209|Ga0137379_10393002   All Organisms → cellular organisms → Bacteria   1296
3300012351|Ga0137386_10824408   Not Available   666
3300012357|Ga0137384_11523417   Not Available   518
3300012917|Ga0137395_10113638   All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium   1814
3300012929|Ga0137404_10782398   Not Available   866
3300012948|Ga0126375_10378647   All Organisms → cellular organisms → Bacteria   1015
3300012971|Ga0126369_10621873   All Organisms → cellular organisms → Bacteria   1152
3300015241|Ga0137418_10093585   All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria   2712
3300016270|Ga0182036_10884224   Not Available   732
3300016270|Ga0182036_11683984   Not Available   535
3300016294|Ga0182041_10852631   Not Available   817
3300016294|Ga0182041_12310438   Not Available   503
3300016341|Ga0182035_11501045   Not Available   606
3300016357|Ga0182032_10091684   All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria   2120
3300016357|Ga0182032_11494035   Not Available   586
3300016357|Ga0182032_11619384   Not Available   563
3300016387|Ga0182040_10048086   All Organisms → cellular organisms → Bacteria → Proteobacteria   2635
3300016387|Ga0182040_10291147   Not Available   1245
3300017936|Ga0187821_10272536   Not Available   666
3300017970|Ga0187783_10081625   All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales   2382
3300018060|Ga0187765_10290216   All Organisms → cellular organisms → Bacteria   977
3300018062|Ga0187784_10735812   Not Available   788
3300020580|Ga0210403_11235534   Not Available   574
3300021168|Ga0210406_10440902   Not Available   1038
3300021168|Ga0210406_10732630   Not Available   759
3300021402|Ga0210385_10948446   Not Available   661
3300021404|Ga0210389_10106359   All Organisms → cellular organisms → Bacteria → Proteobacteria   2163
3300021420|Ga0210394_10418695   All Organisms → cellular organisms → Bacteria   1179
3300021420|Ga0210394_11783320   Not Available   513
3300021559|Ga0210409_10552804   Not Available   1017
3300022527|Ga0242664_1095554   Not Available   604
3300022532|Ga0242655_10335947   Not Available   501
3300022533|Ga0242662_10300761   Not Available   535
3300026361|Ga0257176_1044808   Not Available   690
3300026467|Ga0257154_1029641   Not Available   819
3300026481|Ga0257155_1004292   All Organisms → cellular organisms → Bacteria → Proteobacteria   1728
3300026482|Ga0257172_1021620   All Organisms → cellular organisms → Bacteria   1133
3300026551|Ga0209648_10249496   All Organisms → cellular organisms → Bacteria   1315
3300026557|Ga0179587_10167077   All Organisms → cellular organisms → Bacteria   1379
3300026557|Ga0179587_10974345   Not Available   559
3300027528|Ga0208985_1011237   All Organisms → cellular organisms → Bacteria → Proteobacteria   1693
3300027842|Ga0209580_10122885   Not Available   1266
3300027842|Ga0209580_10132813   Not Available   1218
3300027874|Ga0209465_10044421   All Organisms → cellular organisms → Bacteria → Proteobacteria   2117
3300027898|Ga0209067_10809976   Not Available   547
3300027915|Ga0209069_10091389   All Organisms → cellular organisms → Bacteria   1462
3300028906|Ga0308309_10779715   Not Available   831
3300029636|Ga0222749_10329059   Not Available   796
3300031231|Ga0170824_111253699   Not Available   681
3300031561|Ga0318528_10534528   Not Available   629
3300031682|Ga0318560_10635385   Not Available   578
3300031708|Ga0310686_102042696   Not Available   595
3300031713|Ga0318496_10856419   Not Available   501
3300031718|Ga0307474_11492518   Not Available   532
3300031744|Ga0306918_10833322   Not Available   720
3300031747|Ga0318502_10641413   Not Available   641
3300031751|Ga0318494_10935190   Not Available   509
3300031771|Ga0318546_10583304   Not Available   786
3300031833|Ga0310917_10173737   All Organisms → cellular organisms → Bacteria → Proteobacteria   1432
3300031833|Ga0310917_10429491   Not Available   899
3300031846|Ga0318512_10225305   Not Available   922
3300031879|Ga0306919_10385320   Not Available   1074
3300031879|Ga0306919_10523862   Not Available   914
3300031896|Ga0318551_10719498   Not Available   579
3300031912|Ga0306921_11839484   Not Available   650
3300031941|Ga0310912_11047140   Not Available   625
3300031941|Ga0310912_11201155   Not Available   577
3300031942|Ga0310916_10854670   Not Available   764
3300031947|Ga0310909_11057448   Not Available   661
3300032001|Ga0306922_10335651   All Organisms → cellular organisms → Bacteria   1622
3300032001|Ga0306922_10434243   All Organisms → cellular organisms → Bacteria   1404
3300032001|Ga0306922_11647442   Not Available   637
3300032041|Ga0318549_10436456   Not Available   590
3300032043|Ga0318556_10439833   Not Available   682
3300032064|Ga0318510_10394401   Not Available   589
3300032076|Ga0306924_10563491   All Organisms → cellular organisms → Bacteria   1292
3300032091|Ga0318577_10431786   Not Available   629
3300032180|Ga0307471_100845611   All Organisms → cellular organisms → Bacteria   1082
3300032261|Ga0306920_102772196   Not Available   668
3300032515|Ga0348332_10923892   Not Available   667
3300032783|Ga0335079_12008978   Not Available   557
3300033290|Ga0318519_10217396   Not Available   1096
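
Each scaffold identifier in the table above joins two parts with a `|`: a numeric prefix that matches a Taxon OID in the Associated Samples table, and a scaffold name. The sketch below splits identifiers on that delimiter and flags scaffolds shorter than 2000 bps, using four rows copied from the table. The `Scaffold` type and `parse_scaffold` helper are hypothetical names introduced for illustration, not part of any NMPFamsDB or IMG/M tooling.

```python
from typing import NamedTuple


class Scaffold(NamedTuple):
    taxon_oid: str  # part before "|", matches a Taxon OID in Associated Samples
    name: str       # scaffold name within that sample
    length: int     # scaffold length as listed in the table


def parse_scaffold(scaffold_id: str, length: int) -> Scaffold:
    """Split 'taxon_oid|scaffold_name' at the first pipe character."""
    taxon_oid, name = scaffold_id.split("|", 1)
    return Scaffold(taxon_oid, name, length)


# Four rows copied from the Associated Scaffolds table.
rows = [
    ("3300000597|AF_2010_repII_A1DRAFT_10148061", 565),
    ("3300002914|JGI25617J43924_10032320", 1866),
    ("3300009792|Ga0126374_10021951", 2817),
    ("3300010366|Ga0126379_10051194", 3401),
]

scaffolds = [parse_scaffold(sid, length) for sid, length in rows]
short = [s for s in scaffolds if s.length < 2000]
print(f"{len(short)} of {len(scaffolds)} sampled scaffolds are shorter than 2000 bps")
```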




Environmental Properties

Associated Habitat Types

Habitat   Taxonomy   Distribution
Soil   Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil   28.23%
Soil   Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil   15.32%
Vadose Zone Soil   Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil   11.29%
Tropical Forest Soil   Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil   9.68%
Watersheds   Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds   6.45%
Tropical Forest Soil   Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil   4.03%
Grasslands Soil   Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil   3.23%
Surface Soil   Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil   2.42%
Tropical Peatland   Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland   2.42%
Forest Soil   Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil   2.42%
Soil   Environmental → Terrestrial → Soil → Loam → Forest Soil → Soil   2.42%
Forest Soil   Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil   1.61%
Hardwood Forest Soil   Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil   1.61%
Termite Gut   Host-Associated → Arthropoda → Digestive System → Gut → Unclassified → Termite Gut   1.61%
Boreal Forest Soil   Environmental → Terrestrial → Soil → Unclassified → Unclassified → Boreal Forest Soil   1.61%
Bog Forest Soil   Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil   0.81%
Freshwater Sediment   Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment   0.81%
Soil   Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil   0.81%
Peatlands Soil   Environmental → Terrestrial → Soil → Unclassified → Unclassified → Peatlands Soil   0.81%
Soil   Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil   0.81%
Soil   Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil   0.81%
Plant Litter   Environmental → Terrestrial → Plant Litter → Unclassified → Unclassified → Plant Litter   0.81%
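
The habitat taxonomy above is a path of levels joined by `→`, so distributions can be rolled up to a coarser level by truncating each path. The sketch below sums percentages over the first two levels for four rows copied from the table. The helper `top_levels` and the choice of depth 2 are assumptions made for illustration only.

```python
from collections import defaultdict


def top_levels(path: str, depth: int = 2) -> str:
    """Keep the first `depth` levels of an arrow-separated ecosystem path."""
    parts = [part.strip() for part in path.split("→")]
    return " → ".join(parts[:depth])


# Four rows copied from the Associated Habitat Types table: (taxonomy path, percent).
rows = [
    ("Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil", 28.23),
    ("Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds", 6.45),
    ("Host-Associated → Arthropoda → Digestive System → Gut → Unclassified → Termite Gut", 1.61),
    ("Environmental → Aquatic → Freshwater → Wetlands → Bog → Bog Forest Soil", 0.81),
]

# Sum the percentages of all rows that share the same truncated path.
totals = defaultdict(float)
for path, percent in rows:
    totals[top_levels(path)] += percent

for group, total in sorted(totals.items()):
    print(f"{group}: {total:.2f}%")
```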




Associated Samples

Taxon OID   Sample Name   Habitat Type
3300000597   Forest soil microbial communities from Amazon forest - 2010 replicate II A1   Environmental
3300001661   Mediterranean Blodgett CA OM1_O3 (Mediterranean Blodgett coassembly)   Environmental
3300002245   Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027)   Environmental
3300002910   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_80cm   Environmental
3300002914   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm   Environmental
3300004080   Coassembly of ECP04_OM1, ECP04_OM2, ECP04_OM3   Environmental
3300005184   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_120   Environmental
3300005332   Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly)   Environmental
3300005542   Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1   Environmental
3300005764   Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2)   Environmental
3300005921   Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 6   Environmental
3300006057   Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2012   Environmental
3300006086   Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2013   Environmental
3300006102   Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Alex Branch Run_MetaG_ABR_2013   Environmental
3300006174   Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Alex Branch Run_MetaG_ABR_2014   Environmental
3300006176   Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5   Environmental
3300006354   Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2012   Environmental
3300007258   Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3   Environmental
3300009090   Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG   Environmental
3300009525   Peat soil microbial communities from Weissenstadt, Germany - Sb_50d_7_NC metaG   Environmental
3300009792   Tropical forest soil microbial communities from Panama - MetaG Plot_12   Environmental
3300010043   Tropical forest soil microbial communities from Panama - MetaG Plot_26   Environmental
3300010046   Tropical forest soil microbial communities from Panama - MetaG Plot_36   Environmental
3300010047   Tropical forest soil microbial communities from Panama - MetaG Plot_30   Environmental
3300010049   Embiratermes neotenicus P3 segment gut microbial communities from Petit-Saut dam, French Guiana - Emb289 P3   Host-Associated
3300010162   Labiotermes labralis P1 segment gut microbial communities from Petit-Saut dam, French Guiana - Lab288 P1 (version 2)   Host-Associated
3300010360   Tropical forest soil microbial communities from Panama - MetaG Plot_6   Environmental
3300010362   Tropical forest soil microbial communities from Panama - MetaG Plot_22   Environmental
3300010366   Tropical forest soil microbial communities from Panama - MetaG Plot_24   Environmental
3300010398   Tropical forest soil microbial communities from Panama - MetaG Plot_35   Environmental
3300010880   Boreal forest soil eukaryotic communities from Alaska, USA - C5-1 Metatranscriptome (Eukaryote Community Metatranscriptome)   Environmental
3300011270   Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaG   Environmental
3300011271   Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaG   Environmental
3300012189   Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG   Environmental
3300012209   Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaG   Environmental
3300012351   Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaG   Environmental
3300012357   Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaG   Environmental
3300012917   Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaG   Environmental
3300012929   Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)   Environmental
3300012948   Tropical forest soil microbial communities from Panama - MetaG Plot_14   Environmental
3300012971   Tropical forest soil microbial communities from Panama - MetaG Plot_1   Environmental
3300015241   Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly)   Environmental
3300016270   Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.080   Environmental
3300016294   Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.178   Environmental
3300016341   Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170   Environmental
3300016357   Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.000   Environmental
3300016387   Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.176   Environmental
3300017936   Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_1   Environmental
3300017970   Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_SJ02_MP02_20_MG   Environmental
3300018060   Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 0216_QUI02_MP05_10_MG   Environmental
3300018062   Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_SJ02_MP15_20_MG   Environmental
3300020580   Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-M   Environmental
3300021168   Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-M   Environmental
3300021402   Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-O   Environmental
3300021404   Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-O   Environmental
3300021420   Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-M   Environmental
3300021559   Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-M   Environmental
3300022527   Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-4-O (Metagenome Metatranscriptome) (v2)   Environmental
3300022532   Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-4-M (Metagenome Metatranscriptome) (v2)   Environmental
3300022533   Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-7-M (Metagenome Metatranscriptome) (v2)   Environmental
3300026361   Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-03-B   Environmental
3300026467   Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-16-A   Environmental
3300026481   Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NI-19-A   Environmental
3300026482   Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-16-B   Environmental
3300026551   Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes)   Environmental
3300026557   Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal   Environmental
3300027528   Forest soil microbial communities from El Dorado National Forest, California, USA - Mediterranean Blodgett CA Ref_O1 (SPAdes)   Environmental
3300027842   Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1 (SPAdes)   Environmental
3300027874   Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 1 MoBio (SPAdes)   Environmental
3300027898   Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2013 (SPAdes)   Environmental
3300027915   Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2013 (SPAdes)   Environmental
3300028906   Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 (v2)   Environmental
3300029636   Metatranscriptome of lab incubated forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M (Metagenome Metatranscriptome)   Environmental
3300031231   Coassembly Site 11 (all samples) - Champenoux / Amance forest   Environmental
3300031561   Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.052b4f26   Environmental
3300031682   Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.065b5f22   Environmental
3300031708   FICUS49499 Metagenome Czech Republic combined assembly   Environmental
3300031713   Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.109b1f22   Environmental
3300031718   Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_05   Environmental
3300031744   Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.00H (v2)   Environmental
3300031747   Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.174b1f22   Environmental
3300031751   Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.108b1f24   Environmental
3300031771   Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.169b2f19   Environmental
3300031833   Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.LF178   Environmental
3300031846   Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.171b2f19   Environmental
3300031879   Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.172 (v2)   Environmental
3300031896   Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.082b2f19   Environmental
3300031912   Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.080 (v2)   Environmental
3300031941   Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.OX080   Environmental
3300031942   Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.LF176   Environmental
3300031947   Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.T000H   Environmental
3300032001   Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.082 (v2)   Environmental
3300032041   Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.169b2f22   Environmental
3300032043   Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.082b2f24   Environmental
3300032064   Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.171b2f17   Environmental
3300032076   Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.111 (v2)   Environmental
3300032091   Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.089b5f25   Environmental
3300032180   Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515   Environmental
3300032261   Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170 (v2)   Environmental
3300032515   FICUS49499 Metatranscriptome Czech Republic combined assembly (additional data)   Environmental
3300032783   Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.3   Environmental
3300033290   Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.178b2f15   Environmental





Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
AF_2010_repII_A1DRAFT_101480611 | 3300000597 | Forest Soil | MNLARGARTDLLSRRQALILLSAAAAARDVDAAEQAITQPVSLDHINIRVSNVAKTAEFYMELFDTPVLRNTALRAQPTSPPSEGFFLKFGDGYLAISQAFAPDRPDLDHYSVGLRDYDKARLTRRLQDSGIAVPPRSSTDVWVADLDGALMQLRQPGGWARQTATPYQR
JGI12053J15887_105763721 | 3300001661 | Forest Soil | GVRKDLVSRREALALLGAMAATHSAGAAEQAIMQPVSLDHVNIRVSNVARSGAFYMGLFDTPVLRAPPTSPPSEGFFLKFGDGYLAISQAIPPDRPDLDHYSLGIRDYEKAKLAAKLQDNGIAVPPRSSTDVWISDLDGALMQLRPPGGWARQTAKPYQPPARVGPAMSPLSMSRIAL
JGIcombinedJ26739_1013894601 | 3300002245 | Forest Soil | LLAAIAAVRSAGAADQAIMQPVSLDHVNIRVSNTARSGAFYMGLFDTPVLRSATLRAQPTSPPSEGFFLKFGDGYLAISQAFAPDKPDLDHYSLGIRDYDKAKLTTKLQEGGIAALQRSSTDLWVSDPDGALMQLRPPGGWARQTVTPYQGAARVGPALSPLSMSRIGIRVADLARAGDFYGRLFGTEIASAASGRS
JGI25615J43890_11050691 | 3300002910 | Grasslands Soil | AAAHGADAAEQAIMQPVSLDHVNIRVSNVAKTAEFYMGLFDTLVLRNAALRAQPTSPPSEGFFLKFGDGYLAISQAFAPDTPDLDHYSVGLRDYEKTRLTARLQDNGIAVPPRSSTDVWLADLDGALMQLRQPGGWARQTAAPYQGPARVGPVLSPLSMSRIGLRSA
JGI25617J43924_100323202 | 3300002914 | Grasslands Soil | MDLAQGAPKDLVSRRQALVLLGAVAAAHGADAAEQAIMQPVSLDHVNIRVSNVAKTAEFYMGLFDTLVLRNAALRAQPTSPPSEGFFLKFGDGYLAISQAFAPDTPDLDHYSVGLRDYEKTRLTARLQDNGIAVPPRSSTDVWLADLDGALMQLRQPGGWARQTAAPYQGPARVGPVLSPLSMSRIGLR
JGI25617J43924_103257601 | 3300002914 | Grasslands Soil | AAEQAIMQPVSLDHVNIRVSNVARAGEFYMGLFDTPVLRNAALRAQPTSPPSEGFFLRFGDGYLAISQAFAPDRPDLDHYSIGLRDYEKTRLTARLQDNGIAVPPRSSTDVWLADLDGALMQLRQPGGWARQTATPYQGPARVGPALSPLSMSRIGLRSADVGRAGDFYGRLFGTE
Ga0062385_108774581 | 3300004080 | Bog Forest Soil | HAAEAGILQPVSLDHVNIRVANVAKTAQFHMGLFDTPVLRNPALKAQPTSPPSEGFFLKFGDGYLAISQSFAPDRPGLDHYSLGLRDYDKAKMTAKLQESGVAALPRSSGDLWLADPDGGLMQLRPPGGWARQTATPYQPPARVGPALSPLSMSRIGLPCKDLARGGDYYGRVFGTEVASAASARSRTFSVGDSVLE
Ga0066671_109836181 | 3300005184 | Soil | MDFTHAVNSVSRRQALMLLATMAATRGAGAQAGMMQPLSLDHVNIRVSNVARSSAFYMGLFDTPVLRNPSLRAQPGSPPGEAFFLKFGDGYLAISQAFSPETPGLDHYSLGLRDYDKAKIEARLRDNGIAVPPRSGNDLWLNDLDGALMQLRPPGGW
Ga0066388_1030481301 | 3300005332 | Tropical Forest Soil | MNLMRRVRKDLVSRRRALMMLGAVAAVRGASAAEQAIMQPLSLDHVNIRVSNVAKTAEFYMGLFDTPILRNAALRAQPTSPPSEGFFLKFGEGYLAISQAFAPDRPDLDHYSVGLRDYEKARLTTRLQDNGIAVPPRSSTDVWVADPDGALMQLRQPGGWARQTATPYQAPARGSTALLPLSMSRIGLRTADLKRTGDFYGRLFGTEIAS
Ga0066388_1065275841 | 3300005332 | Tropical Forest Soil | PVSLDHVNIRVSNVAKSGAFYVALFDTPVLRSTTLRAQPNSPPSEGFFIKFGDGYLAMSQAFAPDTPGLDHYSLGLRDYDKSKMTAKMQDGGIAALPRSGGDLWVADLDGSLMQLRQPGGWARQSATPYQGPARVGPALSPLSMSRIALPSANLARAGDYYGRLFGTEIASAASSRSRAFSVGDSVLELISAGST
Ga0070732_108660401 | 3300005542 | Surface Soil | LDHVNIRVSNVARSAQFYMALVDTAVLRSQTLRAQPNSPQSNGFFLRFGEGYLAISQAFAPDPPGLDHYSVGLSDYDKAKVAARLQDNGTPAQARSSADVWLSDPDGNVMQLRPPGGWARQTATPYWPPARLGPAFSPLQISRLTLRTADLDRAGNFYRRLFGAEITSPAAYRSRAFPIGDAVL
Ga0066903_1032122601 | 3300005764 | Tropical Forest Soil | MEQQAKNFISRRGALALLGAAAVIRNAHAAEQAVMQPISLDHVNIRVVNSPKTAQFYMGLFDTPVLRNPSLRAQPDSPPGEGFFLKFGDGYLVITPAFGQDRPGLDHYSLGLRDYVKATLEAKLKDNGTPALARSGGDVWLSDLDGSLMQLRQPGGWARQTATPYQAPARVGPALSPSSMSRIGLRSADLARTGSY
Ga0066903_1034664162 | 3300005764 | Tropical Forest Soil | MDRAQGARKDLISRRQTLSMLAAMAAVGNAGAAEQAIMQPVSLDHVNIRVSDPARSGAFYMGLFDTPVLRNAALRAQPTSPPSEGFFLKFGDGYLAISQAFAPDKPDLDHYSLGIRDYDKTKLAAKLQDGGMAALPRSSTDLWVSDPDGILMQLRPPGGWARQTATPYQGPDRVGPALSPLSMSRIGLRIANLARAGDFYSRLF
Ga0070766_105956411 | 3300005921 | Soil | MNLAQGTRRNLLSRRETLALLGAAAAVRSAGAAEQAVMQPISLDHVNIRVSNVAKTAEFLMGLFDTPVLRNPALRAQPTSPPSEGFFLKFGEGYLAISQAFSPDVPGLDHYSLGIRDYDKAKLAARLQDSGMTALPRPSNDLWVSDLDGSLMQLRPTGGWARQTATPYQPPTRIGPALSPLSMSRIGL
Ga0075026_1003807421 | 3300006057 | Watersheds | MDHAQRIRARMVSRRETLALLGAMAAIPSASAAEPAIMQPVSLDHVNVRVSSVAKTAEFYIGLFDTPLLRNPALRARPDVPPSEGYFLKFGDGYLAISQAFPPDRPDLDHYSLGIRDYEKAKLTAKLQGAGIAVPPRSATDVWIADLDGALMQLRPTGGWARQTAAPYQPPARVGPALSPLSMSRIGINCADLAHAGDFYRRLFGTEIAS
Ga0075019_107232281 | 3300006086 | Watersheds | MGLAQGPRQDFVSRRHALALLGAVAATRGAGAAEAGIMQPLTLDHVNIRVSNTAKSGAFYMGLFDTPVLRSAALRAQPASPPSEGFFLKFGDGYLAISQAFAPDMPGLDHYSIGLRDYDQAKLAAKLRDNGIEVPSRSSTDVWVSDLDGNQMQLRSPGGWARQTATPYQGPARVGPALSPLSMSRIALRS
Ga0075019_108900381 | 3300006086 | Watersheds | MGLVHGPRQQFVSRRHALALLGAIAATRGAGAAEAGIMQPLTLDHVNIRVSSTAKSGAFYMGLFDTPVLRSATLRAQPASPPSEGFFLKFGDGYLAISQAFAPDLPGLDHYSIGLRDYDQAKLAAKLRDNGIEVPSRSSTDVWIGDLDGNLMQLRSPG
Ga0075015_1001386402 | 3300006102 | Watersheds | MGLVHGPRQQFVSRRHALALLGAIAATRGAGAAEAGIMQPLTLDHVNIRVSSTAKSGAFYMGLFDTPVLRSAALRAQPASPPSEGFFLKFGDGYLAISQAFAPDMPGLDHYSIGLRDYDQAKLAAKLRDNGIEVPSRSSTDVWVSDLDGNQMQLRSPGGWARQTATPYQGPARVGPALSPLSMSRIALRSA
Ga0075014_1002538731 | 3300006174 | Watersheds | MDHTQSGCRFSVSRRQALALLGAIAVVPAAGAAEQGITQPLSLDHVNIRVSNVARTAELLMGLFDTPVLRAAALRAQPNSPPSEGYFLKFGDGYLAISQAFAPDRPGLDHYSLGLRDYDKARLMAKLQESGIAVPPRSSGDLWVADPDGALMQLRQPGGWARQTATPYRAPGRIGPALSPLSMSRLALSSTDPARAGDFY
Ga0070765_1018852451 | 3300006176 | Soil | SLDHVNIRVSNVAKTGQFHMGLFDTPVLRNPALRAQPTSPPSEGFFLKFGDGYLAISQAFAPDKPGLDHYSLGLRDYDKAKMTAKLQEGGVAALPRSSGDLWLADPDGSLMQLRPPGGWARQTATPYQPPARVGPALSPLSMSRIGLPCKDLARAGDYYARVFGTEIASAASARSRTFSVGDSVLE
Ga0075021_108642211 | 3300006354 | Watersheds | MDLAQGMRQHLVSRRHALAMLGAIAVVPGAGAAEQAIMQPVSLDHVNIRVSSVAKTAEFYMGLFDTPVLRNPALRAQPTSPPSEGFFLKFGDGYLAISQAFAPDRPGLDHYSLGLRDYEKARLTARLQDSGIAVPPRSSTDVWIADLDGALMQLRQPGGWARQTAAPYQGPARVG
Ga0099793_101536231 | 3300007258 | Vadose Zone Soil | MDLAQGARKDLVSRRHALALIGAAAAVRGADAAQQAIMQPVSLDHVNIRVSNVAKTAEFYMGLFDTPVLRNAALRAQPTSPPSEGFFLKFGDGYLAISQAFAPDRPDLDHYSVGLRDYEKTRLTARLQDNGIAVPPRSSTDVWLADLDGALMQLRQPGGWARQTATPYQGPARVGAALSPLSMSRIGLGSADVGRAGDFYGRLFGTEIAAAASGRSRAFGLGDSVLE
Ga0099827_101786042 | 3300009090 | Vadose Zone Soil | MDLAQGVRKGLVSRREALALLGAMAAARSAGAADEAIMQPVSLDHVNIRVSSVAKTAEFYIGLFDTPVLRNAALRAQPTSPPSEGFFLKFGDGYLAISQAFPPDRPDLDHYSLGIRDYEKAKLTAKLQDNGIAVPPRSSTDVWISDLDGALMQLRPPGGWARQ
Ga0116220_100881762 | 3300009525 | Peatlands Soil | MEELPMSLAQGIRRNLLSRRETLALFGAIAAVRSAGAAEQAVMAPISLDHVNIRVSNVAKTAEFLMGLFDTPVLRNPALRAQPTSPPSEGFFLKFGEGYLAISQAFSPDVPGLDHYSLGIRDYDKAKLAARLQDSGMTALPRPSNDLWLSDLDGSLMQLRPTGGWARQTATPYQAPARVGPALSPLSMSRIGLPTTDLGREADFYRRLFGTEVASADA
Ga0126374_100219513 | 3300009792 | Tropical Forest Soil | MNLARGARSDLVSRRQVLILLGAAAAVRDVEAAEQAITQPVSLDHINIRVSSVAKTAEFYMGLFDTPVLRNEVLRAQPTSPPSEGFFLKFGDGYLAISQAFAPDRPDLDHYSVGLRDYDKARLTRRLQDSGIAVPPRSSTDAWVADPDGALMQLRQPGGWARQTAKPYQGPPRVGPALLPLSMSRIGLHSADLGRAGDFYARLFG
Ga0126380_113604921 | 3300010043 | Tropical Forest Soil | LISRRQAVILLGAVAAARSAGAAEAGILQPVSLDHVNIRVSNVAKSGAFYVALFDTPVLRSTTLRAQPNSPPSEGFFIKFGDGYLAMSQAFAPDTPGLDHYSLGLRDYDKVKMTAKMQEGGIAPLPRSGGDLWVADLDGSLMQLRQPGGWARQSATPYQGPARVGPALSPLSM
Ga0126384_109615411 | 3300010046 | Tropical Forest Soil | MNLARGARSDLVSRRQVLILLGAAAAVRDVEAAEQAITQPVSLDHINIRVSSVAKTAEFYMGLFDTPVLRNEVLRAQPTSPPSEGFFLKFGNGYLAISQAFAPDRPDLDHYSVGLRDYDKARLTRRLQDSGIAVPPRASTDAWVADPDGALMQLRPPGGWARQTAKPYQGPPRVGPALLPLSMSRIGLHSTDLGRAGDFYARLFGTEIASSTPARSRAFGLGDSVLELISAP
Ga0126382_118706451 | 3300010047 | Tropical Forest Soil | VSRRQVLILLGAAAAVRDVEAAEQAITQPVSLDHINIRVSSVAKTAEFYMGLFDTPVLRNEVLRAQPTSPPSEGFFLKFGDGYLAISQAFAPDRPDLDHYSVGLRDYDKARLTRRLQDSGIAVPPRSSTDAWVADPDGALMQLRPPGGWARQTAKPYQGPPRVGPALLPLSMSRIGLHSTDLGRAGDF
Ga0123356_110882632 | 3300010049 | Termite Gut | MNLARGACTDFVSRRQALILLGAAAAVREVAAAEQAITQPVLLDHINIRVSNVAKTAEFYMGLFDTPVLRNAALRAQPASPPSEGFFLKFGDGYLAISQAFAPDRPDLDHYSVGLRDYDKTRLTRRLQDSDIAVPPRSSADVWVADLDGALMQLRQPGGWARQTATPYQAPPRVGPALLPLSISRIGL
Ga0131853_107258521 | 3300010162 | Termite Gut | MILLGAAVAVPKVEAAEQAITQPVSLDHINIRVSNVAKTAEFYMGLFDTPVLRNAALRAQPSSPPSEGFFLKFGDGYLAISQAFAPDRPDLDHYSIGLRDYDKARLTKRLQESDIAVPPRSSADVWVADLDGALMQLRQPGGWARQTAKPYQGPPRVGPVLLPLSMSRIGLHSADLARAGDFYA
Ga0126372_109441541 | 3300010360 | Tropical Forest Soil | MDLAQRARKDLVSRRQALVLLGAAAAVRGADAAEQAIMQPVSLDHVNIRVSNVAKTAEFYMGLFETPVLRNAALRAQPTSLPSEGFFLKFGDGYLAISQAFAPDRPDLDHYSVGLRDYEKTRLTARLQDNGIAVPPRSSTDVWLADLDGALMQLRQPGGWARQTATPYQGPARIGPAMSPLSMSRIALRSADLGRAGDFYGRLFGTEIASAASGRSRAFGL
Ga0126377_102848871 | 3300010362 | Tropical Forest Soil | MNLARGARSDLVSRRQVLILLGAAAAVRDVEAAEQAITQPVSLDHINIRVSSVAKTAEFYMGLFDTPVLRNEVLRAQPTSPPSEGFFLKFGDGYLAISQAFAPDRPDLDHYSVGLRDYDKARLTRRLQDSGIAVPPRSSTDAWVADPDGALMQLRQPGGWARQTAKPYQGPPRVGPALLPLSMSR
Ga0126379_100511941 | 3300010366 | Tropical Forest Soil | MNLARGARGDLVSRRQVLILLGAAAAVRDVEAAEQAITQPVSLDHINIRVSSVAKTAEFYMGLFDTPVLRNEVLRAQPTSPPSEGFFLKFGDGYLAISQAFAPDRPDLDHYSVGLRDYDKARLTRRLQDSGIAVPPRSSTDAWVADPDGALMQLRQPGGWARQTAKPYQGSPRVGPALLPLSMSRIG
Ga0126379_120975021 | 3300010366 | Tropical Forest Soil | MKTARGVSASSVSRRQTLGLLGTLAIMRSAGAAESGLMQPFSLDHIHIRVANVARSGAFYMGLFDTPVLRNAALRAQPTSPPSEGYFIKFGDGYLAISQAFSPNMPGLDHYSLGLRDYDKARLEQKLRENGIAAPPRPGADLWLSDVDGNLMQLRPPGG
Ga0126383_117524851 | 3300010398 | Tropical Forest Soil | MNLARGARTDLVSRRQALILLGAAAAVRNVDAAEQAITRPVSLDHINIRVSNVAKTAEFYMGLFDTPVLRNAALRAQPTSPPSEGFFLKFGDGYLAISQAFAPDRPDLDHYSVGLSDYDKARLTRRLQDSGIAVPPRSSTDVWITDLDGALMQLRQPGGWARQTATPYQGPSRVGPALLPLSMSRIGLHCADLGRAGGFYSRLFGTE
Ga0126383_135738151 | 3300010398 | Tropical Forest Soil | AAEQAIMQPVSLDHVNIRVSNVAKTAAFYMGLFDTPVLRNEALRAQPTSPPSEGFFLKFGDGYLAISQAFAPDRPDLDHYSVGLRDYDKTRLTARLQDNGIAVPPRSSTDVWVADLDGALMQLRQPGGWARQTAKPYQGPARVGLALSPLSMSRVGLRSADLGRAGDFY
Ga0126350_117207121 | 3300010880 | Boreal Forest Soil | LVSRREALALLGAIATTRGSGAAEQAIMQPVSLDHVNVRVSSVAKTAEFYMGLFDTPVLKNPALRARPDVPPSEGYFLKFGDGYLAISQAFPPDRPDLDHYSLGIHDYEKAKLATKLQDNGIAVPPRPSTDVWIADLDGALMQLRPPGGWARQTATPYQAPARVGPALSPVSMSRIGIHCADLAHAGDFYRRLFGTEIASA
Ga0126350_119999121 | 3300010880 | Boreal Forest Soil | MNLAHAVRSVVSRRETLALLAAMAAPRSAGAAEVAIMQPVSLDHVNIRVSSVAKTAEFYIGLFDTPVLRNPALRAQPNSPPSEGYFLKFGDGYLAISQAFAPEMPGLDHYSLGIRDYDKAKLAAKLQDNGITVPARSSTDVWVSDLDGSWMQLRPPGGWARQTATPYQPPARVGPALSPLSMSRIGLPCADLGRGGDFYRRLFGTEIASAASNRSRAFGLGDAVLELVSAPANPAPAAG
Ga0137391_104990502 | 3300011270 | Vadose Zone Soil | MLLGAVAIVRSAGAAEPGIMQPVTLDHVNIRVSNVARSGAFYMGLFDTPVLRNAALRAQPTSPPSEGFFLKFGDGYLAISQAFAPDTPGLDHYSLGLRDYEKAKLEAKLRDNGTPALARSSTDVWLSDLDGSLMQLRSPGGWARQTATPYQGPVRVGPALSPLSMSRIGIRTPDLAHAGDLYGRLFGTEIVSAASDRSRAFAIGDSVLELISVPANSAPA
Ga0137393_104386011 | 3300011271 | Vadose Zone Soil | MDLAQGARKGLVSRRQALVLLGAAAAVRGADAAEQAIMQAVSLDHVNIRVSNVARTAEFYMGLFDTPVLRNAALRAQPTSPPSEGFFLRFGDGYLAISQAFAPDRPDLDHYSIGLRDYEKTRLTARLQDNGIAVPPRSSTDVWLADLDGALMQLRQPGGWARQTATPYQGPARVGPALSPLSMSRIGLRSADVGRAGDFYGRLFGTEITSAASGRSRAFGLGD
Ga0137388_102700181 | 3300012189 | Vadose Zone Soil | MDLAQGARQDLVSRRQALVLLAAVAAVRGADAAEPAIMQPVSLDHVNIRVSNVAATAEFYMGLFETPVLRNAALRAQPTSPPSEGFFLKFGDGYLALSQAFAPDRPDLDHYSVGLRDYEKTRLTARLQDSGIAVPPRSSTDIWVADLDGALMQLRQPGGWARQTATPHQGPARVGPALSPLSMSRIGLRSADVGRAGDFYG
Ga0137388_107410501 | 3300012189 | Vadose Zone Soil | MDLAQGARKDLVSRRHALALLGAAVAVRGADAAQQAIAQPVSLDHVNIRVSNVAKTAEFYMGLFDTPVLRNAALRAQPTSPPSEGFFLKFGDGYLAISQAFAPDRPDLDHYSVGLRDYEKTRLTARLQDNGIAVPPRSSTDIWVADLDGALMQLRQPGGWARQTATPYRGPARVGPALSPLSMSRIGLGSADVGRAGDFYGRLFGTEIASAASGRSRAFGLGDSVLELIS
Ga0137379_103930021 | 3300012209 | Vadose Zone Soil | MDLAQGVRKSLVSRREALALLGAMAAARSAGAADEAIMQPVSLDHVNIRVSSVAKTAEFYIGLFDTPVLRNAAFRAQPTSPPSEGFFLKFGDGYLAISQAFPPERPDLDHYSLGIRDYEKAKLTAKLQDNGIAVPPRSSTDVWISDLDGALMQLRPPGGWARQTAAPYQGPVRIGPALSPLSMS
Ga0137386_108244081 | 3300012351 | Vadose Zone Soil | RRQTIVLLGAAAAVRSADAAEPAIMQPVSLDHVNIRVSSVAKTAEFYIGLFDTPVLRNAALRAQPTSPPSEGFFLKFGDGYLAISQAFPPERPDLDHYSLGIRDYEKAKLTAKLQDNGIAVPPRSSTDVWISDLDGALMQLRPPGGWARQTAAPYQVPARTGPALSPLSMSRIGIHCADLAHAGDFYRRLFGTEIASAASSRSRAFGVGDSVLELISAPANS
Ga0137384_115234171 | 3300012357 | Vadose Zone Soil | IPIRSCPERLTKEEMPMDPARGVRKDLVTRRDALTLLGALAAVRSAGAAEQAIMQPISLDHVNIRVLSVAKTAQFLMGLFDTPVLRNAALRAQPTSPPSEGFFLKFGDGYLAISQAFAPDTPGLDHYSLGLRHYDKAKMTAKMQDGGMAPLPRSSGDLWVADLDGSLMQLRQ
Ga0137395_101136383 | 3300012917 | Vadose Zone Soil | MDLAQGARKGLVSRRQALVLLGAAAAVRGADAAEQAIMQAVSLDHVNIRVSNVARTAEFYMGLFDTPVLRNAALRAQPTSPPSEGFFLRFGDGYLAISQAFAPDRPDLDHYSIGLRDYEKTRLTARLQDNGIAVPPRSSTDVWLADLDGALMQLRQPGGWAR
Ga0137404_107823982 | 3300012929 | Vadose Zone Soil | MERAQAVRSNSISRRHALALLGAFAAVRSAGAAEQAIMQPISLDHVNIRVSSVAKTAQFYMGLFETPVLRNPALRAQPNSPPSEGFFLKFGDGYLAISQAFAPEMPDLDHYSLGIRDYDKAKVTAKLQDNGITVPARSSTDVWVADLDGSWMQLRPTGGWARQTAAPYQPPARVGPVLSPLSMSRIGIHCADLARG
Ga0126375_103786471 | 3300012948 | Tropical Forest Soil | MDRAQGARKDLISRRQTLSLLAAMAAVGNAGAAEQAIMQPVSLDHVNIRVSDTARSGAFYMGLFDTPVLRNAALRAQPTSPPSEGFFLKFGDGYLAISQAFAPDKPDLDHYSLGIRDYDKTKLAAKLQDGGMAALPRSSTDLWVSDPDGILMQLRPPGG
Ga0126369_106218731 | 3300012971 | Tropical Forest Soil | VILLGAVAAARSAGAAEAGILQPVSLDHVNIRVSNVAKSGAFYVALFDTPVLRSTTLRAQPNSPPSEGFFIKFGDGYLAMSQAFAPDTPGLDHYSLGLRDYDKVKMTAKMQEGGIAPLPRSGGDLWVADLDGSLMQLRQPGGWARQSATPYQGPARVGPALSPLSMSGMALRSADLARAGDYYNRLFGTEIASA
Ga0137418_100935853 | 3300015241 | Vadose Zone Soil | MDLAQGARKDLVSRRHALALLGAAVAVRGADAAQQAIAQPVSLDHVNIRVSNVAKTAEFYMGLFETPVLRNAALRAQPTSPPSEGFFLKFGDGYLAISQAFAPDRPDLDHYSVGLRDYEKTRLTARLQDNGIAVPPRSSTDVWLADLDGALMQLRQPGGWARQTATPYQGPARVGPALSPLSMSRIGLRSADVGNPSRSATAGYWA*
Ga0182036_108842241 | 3300016270 | Soil | MKTARELSAGSVSRRQALGLLGTLAITRSAGAAEQALMQPVSLDHINIRVANVARSGAFYMGLFDTPVLRNAALRAQPTSPPSEGYFIKFGDGYLAISQAFSPNMPGLDHYSLGLRDYDKAKLEQKLRENGIAAPPRPGADLWLSDVDGNLMQLRPPGGWARQTATPYQPPARTGPALSPLSISRVVLLCADVARAGDYYGKLF
Ga0182036_116839841 | 3300016270 | Soil | MDRAQGARKDLISRRQTLSMLAAMAAVGNAGAAEQAIMQPVSLDHVNIRVSDPARSGAFYMGLFDTPVLRNAALRAQPTSPPSEGFFLKFGDGYLAISQAFAPDKADLDHYSLGIRDYDKTKLAAKLQEGGMAALPRSSTDLWVSDPDGILMQLRLPGGWARQT
Ga0182041_108526312 | 3300016294 | Soil | MKTARGVCPGSISRRQALGLLGTLAITRNAGAAEQGLMQPVSLDHINIRVANVARSGAFYMGLFDTPVLRNAALRAQPTSPPSDGYFIKFGDGYLAISQAFSPNMPGLDHYSLGLRDYDKAKLEQKLRENGIAAPPRPGADLWLSDVDGNLMQLRPPGGWARQTATPYQGPARTGPALSPLSISRVVLLCADVARAGDYYGKLFGTE
Ga0182041_123104381 | 3300016294 | Soil | NAGLISRRHALAVLGAAAAVGIAEAAESGLLQPLSLDHVNIRVSNVAKTGQFYMALFDTPVLRNPALRAQPNSPPSEGFFLKFGDGYLVISPAFAPEMPGLDHYSLGLRDYVKATLEAKLKDNGTPTQARSGGDVWLSDLDGSLMQLRQPGGWARQTATPYQAPART
Ga0182035_115010451 | 3300016341 | Soil | MEQGQAKDFISRRAALALLGAAAVVRNAGAAEQAVMQPISLDHVNIRVADSTKTAQFYMGLFDTPVLRNPSLRAQPNSPPGEGYFLKFGDGYLVISPAFTEMPGLDHYSLGLRDYVKATLEAKLKDNGTPALARSGGDVWLGDLDGSLMQLRQPGGWARQTATPYQPPPRVGPALSPLSISRIGIKSADLARAGD
Ga0182032_100916841 | 3300016357 | Soil | MDHVQRAGLISRRQAVILLGAVAAARSAGAAEAGILQPVSLDHVNIRVSNVAKSGAFYVALFDTPVLRSTTLRAQPNSPPSEGFFIKFGDGYLAMSQAFAPDTPGLDHYSLGLRDYDKVKMTAKMQEGGIAPLPRSGGDLWVADLDGSLMLRQPGGWARQSATPYQGPARVGPALS
Ga0182032_114940351 | 3300016357 | Soil | MKTARGVSASSVSRRQALGLLGTLAIMRSAGAAESGLMQPVSLDHINIRVANVARSGAFYMGLFDTPVLRNAALRAQPTSPPSEGYFIKFGDGYLAISQAFSPNMPGLDHYSLGLRDYDKAKLEQKLRENGIAAPPRPGADLWLSDVDGNLMQLRPPGGWARQTATPYQGPARTGPALSPLSISRVVLLCADVA
Ga0182032_116193841 | 3300016357 | Soil | DFISRRGALALLGAAVLVRDARAAEQAVMQPISLDHVNIRVADSTKTAQFYMGLFDTPVLRNPSLRAQPNSPPGEGYFLKFGHGYLVISPAFTEMPGLDHYSLGLRDYVKATLEAKLKDNGTPALVRSGGDVWLGDLDGSLMQLRQPGGWARQTATPYQPPPRVGPALSPLSISRIGIKSADLARAG
Ga0182040_100480863 | 3300016387 | Soil | MDHVQRAGLISRRQAVILLGAVAAARSAGAAEAGILQPVSLDHVNIRVSNVAKSGAFYVALFDTPVLRSTTLRAQPNSPPSEGFFIKFGDGYLAMSQAFAPDTPGLDHYSLGLRDYDKVKMTAKMQEGGIAPLPRSGGDLWVADLDGSLMQLRQPGGWARQSATPYQGPARVGPALSPLSMSRMALRSADLARAGDYYSRLFGTEIASAASSRARAFSVGDSIVELVSAGSAPGA
Ga0182040_102911471 | 3300016387 | Soil | MKTARGVSAGSVSRRQALGLLGTLAITRGAGAAEQALMQPVSLDHINIRVANVARSGAFYMGLFETPVLRNAALRAQPTSPPSDGYFLKFGDGYLAISQAFSPNMPGLDHYSLGLRDYDKAKLEQKLRENGIAPPPRPGADLWLSDVDGNLMQLRPPGGWARQTATPYQGPARTGPALSPLSISRVVLLCSDVARAGDYYGKLFGTEIASAASSRSRAFSIGDSVLEVIPVAANSA
Ga0187821_102725361 | 3300017936 | Freshwater Sediment | VEFGQGSGRDLISRRGALALFGATAAMRRAGAAEQAIMQPVALDHVNIRVSNVARSAQFYMALVDTAVLRSQTLRAQPNSPQSNGFFLRFGEGYLAISQAFAPDPPGLDHYSVGLSDYDKAKVAARLQDNGTPAQARSSADVWLSDPDGNVMQLRPPGGWARQTATPYWPPARLGPAFSPLQISRLTLRTADL
Ga0187783_100816253 | 3300017970 | Tropical Peatland | MGHGEGVSKKLVSRREALAMVGAIAAIGRAGAAEQGLLQPVMLDHVNIRVSNVARTGAFYMGLFDTPVLRNPGLRAQPGSQPSEAFFLKFGDGYLAISQAFAPNTPDLDHYSVGIRDFDSPKVAARLQGNGIKGTSRGADVWADDPDGSQIQLRSPGGWARQNAMPYQGAARSGPALSPLSI
Ga0187765_102902162 | 3300018060 | Tropical Peatland | MDLAQGARKDLVSRRKALILLGAAAAVRGADAAEQAIMRPVSLDHVNIRVSDVARTAQFYMGLFETPVLRNAALRAQPTSPPSEGFFLKFGDGYLAISQAFAPDRPDLDHYSVGLRDYDKTRLTTKLQDSGIAVPPRSSADVWVADLDGALMQLRQPGGWARQTAVPYQGPARVG
Ga0187784_107358121 | 3300018062 | Tropical Peatland | MVFAQGVGRDSVSRRQALALIGAIATMRSAGAAEAGLVQPVSLDHVNIRVSNVAKTAAFYMGLFDTPVLRNPALRAQPASPPSEGFFLKFGDGYLAISQAFAPNVPDLDHYSVGLRDYDVAKVAAKLRDNGMKADARNVDVWADDPDGSLIQLRPPGGWARQTATPYQ
Ga0210403_112355341 | 3300020580 | Soil | NLTHKVRTDLVSRRQTLMLLGAVAAVRGANAAEQAIMQPVSLDHVNIRVSNVAKTAEFYVGLFDTPVLRNAALRAQPTSPPSEGFFLKFGGGYLAISQAFAPDRPDLDHYSVGLRDYEKTRLTARLQDSGIAVPPRSSTDVWIADLDGALMQLRQPGGWARQTAMPYQGPARIGPALLPLSMSRIGLRSAD
Ga0210406_104409021 | 3300021168 | Soil | MKIGQGSRHDLISRRGTLALLGAIVAMRNAGAAEQAIMQPVSLDHVNIRVSSVARTAEFYMGLFDTPILRSAALRAQPNSPPSEGFFLRFGDGYLAISQAFAPDMPGLDHYSLGLRDYDKAKLAAKLQQNGTPAQARSSADVWLSDLDGSLMQLRPPGGWARQNATPYQGPARAGPAFSPLTISRIALRSADLARAGDFYRRLFGAE
Ga0210406_107326301 | 3300021168 | Soil | MNLTHKIRTDLVSRRQTLMLLGAVAAGYGADAAEQAIMQPVSLDHVNIRVSNVAKTAEFYVGLFDTPVLRNAALRAQPTSPPSEGFFLKFGGGYLAISQAFAPDRPDLDHYSVGLRDYEKTRLTARLQDSGIAVPPRSSTDVWIADLDGALMQLRQPGGWARQTA
Ga0210385_109484461 | 3300021402 | Soil | MDLARRVRKDLVSRRQALALLGAMAAVRSAGAAEQGIMQPISLDHVNIRVSSVAKTAEFYMGLFDTPVLRNAGLRAQPSSPPSEGYFIKFGDGYLAISQAFAPERPDLDHYSLGIRDYEKAKLAARLQDGGFAVPPRSGGDIWVGDLDGAMMQLRPPGGWARQTATPYQPPARVGPSLSPLSMSRIGLR
Ga0210389_101063591 | 3300021404 | Soil | MNLTRGARTDLVSRRQALILLGAAAAARDVEAAEQAITQPVSLDHVNIRVSNVAKTAEFYMGLFDTPVLRNTALRAQPNSPPSEGFFLKFGDGYLAISQAFAPDRPDLDHYSIGLRDYDKARLTRRLQDGDISAPPRSSTDVWVADLDGALMQLRQPGGWARQTAT
Ga0210394_104186952 | 3300021420 | Soil | MNLAQGTRRNLLSRRETLALLGAAAAVRSAGAAEQAVMQPISLDHVNIRVSNVAKTAEFLMGLFDTPVLRNPALRAQPTSPPSEGFFLKFGEGYLAISQAFSPDVPGLDHYSLGIRDYDKAKLAARLQDSGMTALPRPSNDLWVSDLDGSLMQLRPTGGWARQTATP
Ga0210394_117833201 | 3300021420 | Soil | PMNLAQGTRRDLFSRRETLALLGAIAAVRSAGAAEQAVMQPISLDHVNIRVSNVAKTAELLMGLFDTPVLRNPALRAQPTSPPSEGFFLKFGEGYLAISQAFAPDVPGLDHYSLGIRDYDKAKLMARLQDSGMTALPRPSNDLWVSDLDGSLMQLRPTGGWARQTATPYQ
Ga0210409_105528041 | 3300021559 | Soil | LEFGQGSGRDLVSRRGALALFGAIAAMRHAGAAEQAIMQPVALDHVNIRVPNVARTAQFYMALVDTPVLRSQTLRAQPDSPPGEGFFLRFGDGYLAISQAFAPDLPGLDHYSLGLRDYDKAKVTASLQDNGTPAPARSSADVWLSDPDGSVMQLRPPGGWARQSATPYQPPARRGPAFSPLRISRLTLRTADLDRAGNFYRRLFGAE
Ga0242664_10955541 | 3300022527 | Soil | PMNLTRGARTDLVSRRQALILLGAAAAARDVEAAEQAITQPVSLDHVNIRVSNVAKTAEFYMGLFDTPVLRNTALRAQPNSPPSEGFFLKFGDGYLAISQAFAPDRPDLDHYSIGLRDYDKARLTRRLQDGDISAPPRSSTDVWVADLDGALMQLRQPGGWARQTATPYQGPPRVGPALSPQSMSRIGLHSADLGRAGDFY
Ga0242655_103359471 | 3300022532 | Soil | ERPMDLARGVRKDFVSRREALAMLSAVAAVRSAGAAEQGIMQPISLDHVNIRVSSVAKTAEFYMGLFDTPVLRNPSLRAQPNSPPGEGFFLKFGDGYLVISPTFGQEMPGLDHCSLGLHDYVKATLEAKLKDNGTPALARSGGDVWLAELDGSPMQLRQPGGWARQT
Ga0242662_103007611 | 3300022533 | Soil | TRAGLISRRPAVVLLGAVAAARSAGAAEPGILQPVSLDHVNIRVSNVPRSGAFYVALFDTPVLRSTTLRAQPNSPPSEGFFIKFGDGYLAISQAFASDTPGLDHYSLGLRDYDKARMTAKMQDGGISALPRSGGDLWVADLDGSLMQLRRPGGWARQSATPYQGTARVGPALSPLSMS
Ga0257176_10448081 | 3300026361 | Soil | SLCQAIGPSQLKGSFTKFTLYSVEAKARYADPDLTREEVPMDLAQGARKGLVSRRQALVLLGAAAAVRGADAAEQAIMQAVSLDHVNIRVSNVARTAEFYMGLFDTPVLRNAALRAQPTSPPSEGFFLRFGDGYLAISQAFAPDRPDLDHYSIGLRDYEKTRLTARLQDNGIAVPPRSSTDVWLADLDGALMQLRQPGGWARQTATPYQGPARVGPALSPLSMSRIGLR
Ga0257154_10296411 | 3300026467 | Soil | MDLTQGARKDLVSRREALTLLGAIAAVRSAGAAEQAIMQPVSLDHVNIRVSNTARSGAFYSGLFDTPVLRNAALRAQPTSPPSEGFFLKFGDGYLAISQAFAPDKPDLDHYSLGIRDYDKAKLTTKLQEGGIAALQRSSTDLWVSDPDGALMQLRPPGGWARQTAM
Ga0257155_10042922 | 3300026481 | Soil | MDLTQGARKDLVSRREALTLLGAIAAVRSAGAAEQAIMQPVSLDHVNIRVSNTARSGAFYTGLFDTPVLRNAALRAQPTSPPSEGFFLKFGDGYLAISQAFAPDKPDLDHYSLGIRDYDKAKLAVKLQEGGIAALQRSSTDLWVSDPDGALMQLRPPGGWARQTAMPYQGAARVGPALSPLSMSRIGIRVADLARAGDFYGRLFGTEIASAAS
Ga0257172_10216202 | 3300026482 | Soil | MDLAQGARKGLVSRRQALVLLGAAAAVRGADAAEQAIMQAVSLDHVNIRVSNVARTAEFYMGLFDTPVLRNAALRAQPTSPPSEGFFLRFGDGYLAISQAFAPDRPDLDHYSIGLRDYEKTRLTARLQDNGIAVPPRSSTDVWLADLDGALMQLRQPGGWARQTATPYQGPAR
Ga0209648_102494961 | 3300026551 | Grasslands Soil | MDLAQGGRNGVVSRRQALVLLGAAAAVRGADAAEQAIMQPVSLDHVNIRVSNVARAGEFYMGLFDTPVLRNAALRAQPTSPPSEGFFLRFGDGYLAISQAFAPDRPDLDHYSIGLRDYEKTRLTARLQDNGIAVPPRSSTDVWLADLDGALMQLRQPGGWARQTATPYQGPARVGPALSPLSMSRIGLRSADVGRAGDFYGRLFGTEITSAASGRSRAFGLGDSVLELISVP
Ga0179587_101670771 | 3300026557 | Vadose Zone Soil | MDLAQGARKDLVSRRHALALLGAAVAVRGADAAQQAIAQPVSLDHVNIRVSNVAKTAEFYMGLFDTPVLRNAALRAQPTSPPSEGFFLKFGDGYLAISQAFAPDRPDLDHYSVGLRDYEKTRLTARLQDNGIAVPPRSSTDVWLADLDGALMQLRQPGGWARQTATPYQGPARVGAALSPLSMSRIGLGSADVGRAGDFYGRLFGTEIA
Ga0179587_109743451 | 3300026557 | Vadose Zone Soil | MERAQAVRSNSISRRHALTMFGALAAVRSAGAAEQAIMQPISLDHVNIRVSSVAKTAQFYIALFDTPVLRNPGLRAQPNSPPSEGFFLKFGDGYLAISQAFAPETPDLDHYSLGIRDYDKAKVTAKLQDNGITVPARSPTDVWVADLDGSWMQLRPTGGWARQTAAPYQPPARVGPALSP
Ga0208985_10112371 | 3300027528 | Forest Soil | MDHVQGARAGLVSRRHTLMLLGALAAVRSAGAAEAGILQPVSLDHLNIRVSNVARSGAFYMALFDTPVLRNPALRAQPTSPPSEGFFIRFGDGYLAISQAFAPDTPGLDHYSLGLRDYDKAKMAAKLQDGGIAVLPRSSGDLWVADLDGSLMQLRQPGGWARQTATPYQGPVRAGPALSPLSMSRIGLRSADVTRAGDYYGRLFGTEIASAGSRRSRA
Ga0209580_101228852 | 3300027842 | Surface Soil | MELGQGSGRDLISRRGALALFGAIAAMRRAGAAEQAIMAPVALDHVNIRVSNVARSTQFYMALVDTAVLSSQTLRAQPNAPQSEGFFLRFGDGYLAISQAFAPDVPGLDHYSLGLSDYDKAKVTARLQDNGTPAQARSSADVWLSDPDGNVMQLRPPGGWARQTATPYQPPARLGPAFSPLRISRLTLRTADLDRAGNFYRRLFGEITSPA
Ga0209580_101328133 | 3300027842 | Surface Soil | MERRQAKDFISRRATLALLGAAAVVRNACAAEQAVMQPISLDHVNIRVASSPKTAQFYMGLFDTPVLRNPSLRAQPNSPPGEGFFLKFGDGYLVISPTFGQEMPGLDHYSLGLRDYAKATLETKLKDNGTPALARSGGDVWLSDLDGSLMQLRQPGGWTRQTATPYQPPPRVGPALSPLSISRIGIKSADLARG
Ga0209465_100444211 | 3300027874 | Tropical Forest Soil | MNLARGARTNLVSRRQALILLGVAAAVRDVEAAEQAITQPVSLDHINIRVSSVAKTAEFYMGLFDTPVLRNEVLRAQPTSPPSEGFFLKFGDGYLAISQAFAPDRPDLDHYSVGLRDYDKAGLTRRLQDSGIAVPPRSSTDVWVADLDGALMQLRQPGGWARQTAKAYQGPPRVGPALLPLSMSRIGLHSADIGRAGDFYTRLFGTEIASSAP
Ga0209067_108099761 | 3300027898 | Watersheds | MGLAQGPRQDFVSRRHALALLGAVAATRGAGAAEAGIMQPLTLDHVNIRVSNTAKSGAFYMGLFDTPVLRSAALRAQPASPPSEGFFLKFGDGYLAISQAFAPDMPGLDHYSIGLRDYDQAKLAAKLRDNGIEVPSRSSTDVWVSDLDGNQMQLRSPGGWAR
Ga0209069_100913892 | 3300027915 | Watersheds | MDPAQGARRDLVTRREALVLLSALAAMRSAGAAEQAIMQPISLDHVNIRVSNVARTGQFYMGLFDTPVLRNAALRAQPTSPPSEGFFLKFGDGYLAISQAFAPDMPGLDHYSLGIRDYEKAKLAAKLQDNGITVPSRSATDIWVSDLDGSWMQLRPSGGWARQTAAPYRGPARVGPSLSPLSMSRIALLCGDLAHAGDYYGRLFGTEIASAASSRSR
Ga0308309_107797151 | 3300028906 | Soil | LEFGQGSGRDLVSRRGALALFGAIAAMRHAGAAEQAIMQPVALDHVNIRVPNVARTAQFYMALVDTPVLRSQTLRAQPDSPPGEGFFLRFGDGYLAISQAFAPDLPGLDHYSLGLRDYDKAKVTASLQDNGTPAPARSSADVWLSDPDGSVMQLRPPGGWARQSATPYQPPARRGP
Ga0222749_103290591 | 3300029636 | Soil | MEQGQANFISRRAALALLGAAAVVRNACAAEQAVMQPISLDHVNIRVASSPKTAQFYMGLFDTPVLRNPSLRAQPNSPPGEGFFLKFGDGYLVISPTFGQEMPGLDHYSLGLRDYVKATLETKLKDNGTPALARSGGDVWLADLDGSLMQLRQPGGWARQTATPYQPPPRVGPALSPLSISRIGIKSADLARA
Ga0170824_1112536991 | 3300031231 | Forest Soil | LISRRQAVILLGAVAAARSAGAAEPGILQPVSLDHVNIRVSNVPRSGAFYVALFDTPVLRSTTLRAQPNSPPSEGFFIKFGDGYLAISQAFASDTPGLDHYSLGLRDYDKARMTAKMQDGGISALPRSGGDLWVADLDGSLMQLRKPGGWARQSATPYQGPARVGPALSPLSMSRMVLRSADLARAGDYYSRLFGTEIASAASSRARAFSVGDSIVELISAGAAPG
Ga0318528_105345281 | 3300031561 | Soil | MNLAQGARKDLVSRRHALVLLGAVAAVRGADAAEPAIMQPVSLDHVNIRVSNVARTGEFYMRLFETPVLRNAALRAQPTSPPSEGFFLKFGDGYLAISQAFAPDRPDLDHYSVGLRDYDKTRLTTKLQDSGIAVPPRSSTDVWVADLDGALMQLRQPGGWARQTATPYLGPARVGPALSPLSMSRIGIR
Ga0318560_106353851 | 3300031682 | Soil | MKTARGVSAGSVSRRQALGLLGTLAITRGAGAAEQALMQPVSLDHINIRVANVARSGAFYMGLFDTPVLRNAALRAQPTSPPSEGYFIKFGDGYLAISQAFSPNMPGLDHYSLGLRDYDKTKLEEKLRENGIAAPPRPGADLWVSDVDGNLMQLRPPGGWARQTATPHQGAARVGPALSPLSMSRIGKRTAD
Ga0310686_1020426961 | 3300031708 | Soil | AEAGILQPVSLDHVNIRVSNVAGSAAFYLGLCDTPVLRSAGLRARPDSPPSEAYFLRFGEGYLAISQAYAPDSPGLDHYSVGLRDYDQAKMAPKLRDNGFAAEPRGAADIWVRDLDGSYIQLRAPGGWARQTATPFAGAARAGPALSPLAMSRIALRSADLARAGDYYGRLFGTEIASAASRRSRTFSVGDSVLELIS
Ga0318496_108564191 | 3300031713 | Soil | VQRAGLISRRQAVILLGAVAAARSAGAAEAGILQPVSLDHVNIRVSNVAKSGAFYVALFDTPVLRSTTLRAQPNSPPSEGFFIKFGDGYLAMSQAFAPDTPGLDHYSLGLRDYDKVKMTAKMQEGGIAPLPRSGGDLWVADLDGSLMQLRQPGGWARQSATPYQGPA
Ga0307474_114925181 | 3300031718 | Hardwood Forest Soil | MNVARGARTDWVSRRQALILLGAAAAACDVDAAEQAITQPLSLDHINIRVADVAKTAEFYMGLFDTPVLRNAALRAQPTSPPSEGFFLKFGEGYLAISQAFAPDRPDLDHYSIGLRDYDKARLTRRLQDGDIAVPPRSSSDVWVADLDGALMQLRQPGGWARQT
Ga0306918_108333221 | 3300031744 | Soil | MDHVQRAGLISRRQAVILLGAVAAARSAGAAEAGILQPVSLDHVNIRVSNVAKSGAFYVALFDTPVLRSTTLRAQPNSPPSEGFFIKFGDGYLAMSQAFAPDTPGLDHYSLGLRDYDKVKMTAKMQEGGIAPLPRSGGDLWVADLDGSLMQLRQPGGWARQSATRYQGPA
Ga0318502_106414131 | 3300031747 | Soil | MKTARGVSASSVSRRQALGLLGTLAITRSAGAAEQALMQPVSLDHINIRVANVARSGAFYMGLFDTPVLRNAALRAQPTSPPSEGYFIKFGDGYLAISQAFSPNMPGLDHYSLGLRDYDKAKLEQKLRENGIAPPPRPGADLWLSDVDGNLMQLRPPGGWARQTATPYTPVRLGPALSPLSMSRIALLCVDVARAGDY
Ga0318494_109351901 | 3300031751 | Soil | SMLAAMAAVGNAGAAEQAIMQPVSLDHVNIRVSDPARSGAFYMGLFDTPVLRNAALRAQPTSPPSEGFFLKFGDGYLAISQAFAPDKADLDHYSLGIRDYDKTKLAAKLQEGGMAALPRSSTDLWVSDPDGILMQLRLPGGWARQTATPYQGPARVGPALSPLSMSRIG
Ga0318546_105833042 | 3300031771 | Soil | MKTARGVCPSSISRRQALGLLGTLAIPRSAGAAEQALMQPVSLDHINIRVANVARSGAFYMGLFDTPVLRNAALRAQPTSLPSEGYFIKFGDGYLAISQAFSPNMPGLDHYSLGLRDYDKAKLEQKLRENGIAAPPRPGADLWLSDVDGNLMQLRPPGGWARQTATPYQGPARIGPALSPLSVSRVVLACADVARAGDYY
Ga0310917_101737372 | 3300031833 | Soil | MDHVQRAGLISRRQAVILLGAVAAARSAGAAEAGILQPVSLDHVNIRVSNVAKSGAFYVALFDTPVLRSTTLRAQPNSPPSEGFFIKFGDGYLAMSQAFAPDTPGLDHYSLGLRDYDKVKMTAKMQEGGIAPLPRSGGDLWVADLDGSLMQLRQPGGWARQSATPYQGPARVGPALSPLSMSRM
Ga0310917_104294911 | 3300031833 | Soil | VEFGQGSGRDLISRRGAVALFGAIAAVRHAGAAEQAIMQPVALDHVNIRVSNVARTAQFHMALFDTLVLRSQTLRAQPNSPPSEGFFLKFGEGYLAISQAFAPDLPGLDHYSLGLRDYDKAKISARLQDNGTPAQARSSADVWLSDPDGNVMQLRQPGGWARQTATPYQPPAQLG
Ga0318512_102253052 | 3300031846 | Soil | MDHVQRAGLISRRQAVILLGAVAAARSAGAAEAGILQPVSLDHVNIRVSNVAKSGAFYVALFDTPVLRSTTLRAQPNSPPSAGFFIKFGDGYLAMSQAFAPDTPGLDHYSLGLRDYDKVKMTAKMQEGGIAPLPRSGGDLWVADLDGSLMQLRQPGGWARQSATPYQGPARVGPALSPLSMSRMALRSADLARAGDYYSRLFGTEIASA
Ga0306919_103853201 | 3300031879 | Soil | MKTARGVCPSSISRRQALGLLGTLAIPRSAGAAEQALMQPVSLDHINIRVANVARSGAFYMGLFDTPVLRNAALRAQPTSPPSEGYFIKFGDGYLAISQAFSPNMPGLDHYSLGLRDYDKAKLEQKLRENGIAAPPRPGADLWLSDVDGNLMQLRPPGGWARQTATPYQGPARTGPALSPPSISRVVLLCSDVARAGDYY
Ga0306919_105238622 | 3300031879 | Soil | MDRAQGARKDLISRRQTLSMLAAMAAVGNAGAAEQAIMQPVSLDHVNIRVSDPARSGAFYMGLFDTPVLRNAALRAQPTSPPSEGFFLKFGDGYLAISQAFAPDKADLDHYSLGIRDYDKTKLAAKLQEGGMAALPRSSTDLWVSDPDGILMQLRLPGGWARQTATPYQGPARVGPALSPLSMSRIGLRIASLGRAGDFYSRLFGTEIASAAS
Ga0318551_107194981 | 3300031896 | Soil | MKTARGVCPGSISRRQALGLLGTLAITRSAGAAEQALMQPVSLDHINIRVANVARSGAFYMGLFDTPVLRNAALRAQPTSPPSEGYFIKFGDGYLAISQAFSPNMPGLDHYSLGLRDYDKAKLEQKLRENGIAAPPRPGADLWLSDVDGNLMQLRPPGGWARQTATPYQSPARTGPALS
Ga0306921_118394841 | 3300031912 | Soil | MDRAQGARKDLISRRQTLSMLAAMAAVGNAGAAEQAIMQPVSLDHVNIRVSDPARSGAFYMGLFDTPVLRNAALRAQPTSPPSEGFFLKFGDGYLAISQAFAPDKADLDHYSLGIRDYDKTKLAAKLQEGGMAALPRSSTDLWVSDPDGILMQLRPPGGWARQTATPYQGPARVGPALSPLSMSRIGLRVANLTRAGD
Ga0310912_110471401 | 3300031941 | Soil | LGAVAAARSAGAAEAGILQPVSLDHVNIRVSNVAKSGAFYVALFDTPVLRSTTLRAQPNSPPSEGFFIKFGDGYLAMSQAFAPDTPGLDHYSLGLRDYDKVKMTAKMQEGGIAPLPRSGGDLWVADLDGALMQLRQPGGWARQSATPYQGPARVGPALSPLSMSRMALRSADLARAGDYYSRLFGTEIASAASSRARAFSVGDSIVEL
Ga0310912_112011551 | 3300031941 | Soil | MDRAQGARKDLISRRQTLSMLAAMAAVGNAGAAEQAIMQPVSLDHVNIRVSDPARSGAFYMGLFDTPVLRNAALRAQPTSPPSEGFFLKFGDGYLAISQAFAPDKADLDHYSLGIRDYDKTKLAAKLQEGGMAALPRSSTDLWVSDPDGILMQLRLPGGWARQ
Ga0310916_108546701 | 3300031942 | Soil | MDRAQGARKDLISRRQTLSMLAAMAAVGNAGAAEQAIMQPVSLDHVNIRVSDPARSGAFYMGLFDTPVLRNAALRAQPTSPPSEGFFLKFGDGYLAISQAFAPDKADLDHYSLGIRDYDKTKLAAKLQEGGMAALPRSSTDLWVSDPDGILMQLRLPGGWARQTATPYQGPARVGPALSPLSMSRIGLRIASLGRAGDFY
Ga0310909_110574481 | 3300031947 | Soil | MDLAQGARLVSRREALTLLAATAAVRSAGAAEQAIMQPISLDHVNIRVSDVARSGAFYMGLFDTPVLRNAALRAQPTSPPSEGFFLKFGDGYLAISQAFAPDKPDLDHYSLGIRDYEKAKLAAKLQDGGMAALPRSSTDLWVSDPDGALMQLRPPGGWARQTATPHQGAARVGPALSPLSMSRIGKRTADLARAGNFYGR
Ga0306922_103356511 | 3300032001 | Soil | MEFGHGSGRDIISRRGALALFGAIAAMRHARAAEQAIMQPVALDHVNIRVSNVARTAQFHMALFDTLVLRSQTLRAQPNSPPSEGFFLKFGEGYLAISQAFAPDLPGLDHYSLGLRDYDKAKISARLQDNGTPAQARSSADVWLSDPDGNVMQLRQPGGWARQTATPYQPPAQLGPAFSPVRISRLTLRTADLDRAGNFYRRLFGEITSPAAYRSRAFRIGDAVL
Ga0306922_104342431 | 3300032001 | Soil | MDHVQRAGLISRRQAVILLGAVAAARSAGAAEAGILQPVSLDHVNIRVSNVAKSGAFYVALFDTPVLRSTTLRAQPNSPPSEGFFIKFGDGYLAMSQAFAPDTPGLDHYSLGLRDYDKVKMTAKMQEGGIAPLPRSGGDLWVADLDGSLMQLRQPGGWARQSATPYQGPARVGPALSPLSMSRMALRSADLAR
Ga0306922_116474421 | 3300032001 | Soil | LLGAAAVVRNAGAAEQAVMQPISLDHVNIRVADSTKTAQFYMGLFDTPVLRNPSLRAQPNSPPGEGYFLKFGDGYLVISPAFTEMPGLDHYSLGLRDYVKATLEAKLKDNGTPALARSGGDVWLGDLDGGLMQLRQPGGWARQTATPYQPPPRVGPALSPLSISRIGIKSADLARAGDFYRRLFGAEMASAASSGSRAFSVGDSVVQLVAAP
Ga0318549_104364561 | 3300032041 | Soil | LGTLAITRSAGAAEQALMQPVSLDHINIRVANVARSGAFYMGLFDTPVLRNAALRAQPTSPPSEGYFIKFGDGYLAISQAFSPNMPGLDHYSLGLRDYDKAKLDQKLRENGIAAPPRPGADLWLSDVDGNLMQLRPPGGWARQTATPYQGPARTGPALSPLSISRVVLLCSDVARAGDYYAKLFGTEIASAASSRS
Ga0318556_104398331 | 3300032043 | Soil | SRRQALGLLGTLAITRGAGAAEQALMQPVSLDHINIRVANVARSGAFYMGLFDTPVLRNAALRAQPTSPPSEGYFIKFGDGYLAISQAFSPNMPGLDHYSLGLRDYDKAKLEQKLRENGIAAPPRPGADLWLSDVDGNLMQLRPPGGWARQTATPYQGPARTGPALSPLSISRVVLLCSDVARAGDYYAKLFGTEIASAASSRSRAFSIGDSVLEVIPVAANSAAPT
Ga0318510_103944011 | 3300032064 | Soil | MKTARGVSAGSVSRRQALGLLGTLAITRSAGAAEQALMQPVSLDHINIRVANVARSGAFYMGLFDTPVLRNAALRAQPTSPPSEGFFLKFGDGYLAISQAFAPDKPDLDHYSLGIRDYEKAKLAAKLQDGGMAALPRSSTDLWVSDPDGALMQLRPPGGWARQT
Ga0306924_105634912 | 3300032076 | Soil | MEFGHGSGRDIISRRGALALFGAIAAMRHARAAEQAIMQPVALDHVNIRVSNVARTAQFHMALFDTLVLRSQTLRAQPNSPPSEGFFLKFGEGYLAISQAFAPDLPGLDHYSLGLRDYDKAKISARLQDNGTPAQARSSADVWLSDPDGNVMQLRQPGGWARQTATPYQPPAQLGPAFSPVRISRLTLRTADLDRAGN
Ga0318577_104317861 | 3300032091 | Soil | DHVQRAGLISRRQAVILLGAVAAARSAGAAEAGILQPVSLDHVNIRVSNVAKSGAFYVALFDTPVLRSTTLRAQPNSPPSEGFFIKFGDGYLAMSQAFAPDTPGLDHYSLGLRDYDKVKMTAKMQEGGIAPLPRSGGDLWVADLDGSLMQLRQPGGWARQSATPYQGPARVGPALSPLSMSRMALRSADLARAGDYYSRLFGTEIASAA
Ga0307471_1008456111 | 3300032180 | Hardwood Forest Soil | MDLTQGARKDLVSRREVLTLLGAIAAVRSAGAAEQAAMQPVSLDHVNIRVSSTARSGAFYTGLFDTPVLRSAALRAQPTSPPSEGFFLKFGDGYLAISQAFAPDKPDLDHYSLGIRDYDKAKLTTKLQEGGIAALQRSSTDLWVSDPDGALMQLRPPGGWARQTAAPY
Ga0306920_1027721961 | 3300032261 | Soil | MEQQAIDFISRRGALALLGAAVLVRDARAAEQAVMQPISLDHVNIRVADSTKTAQFYMGLFDTPVLRNPSLRAQPNSPPGEGYFLKFGDGYLVISPAFTEMPGLDHYSLGLRDYVKATLEAKLKDNGTPALARSGGDVWLGDLDGGLMQLRQPGGWARQTATPYQPPPRVGPALSPLS
Ga0348332_109238921 | 3300032515 | Plant Litter | MNLAQGVPKDVSRREALAMLGAVAAVRSAGAAEQAIMQPVSLDHVNIRVSSVAKTAEFYMGLFDTPVLRNAALRAQPSSPPSEGYFIKFGDGYLAISQAFAPERPDLDHYSLGIRDYEKAKLAARLQEGGIAVPPRSSGDIWIGDLDGAMMQLRPPGGWARQTATPYQPPARVGPALSPLSMSRIGLPTTDLAREADFYRKLFGTEIAPADATPSRVFGIGD
Ga0335079_120089781 | 3300032783 | Soil | ARSAGAAEPGILQPVSLDHVNIRVSNVAKSGAFYVALFDTPVLRSTTLRAQPNSPPSEGFFIKFGDGYLAISQAFAPDTPGLDHYSLGLRDYDKAKMAAKMQEGGVAPLPRSGGDLWVADLDGALMQLRKPGGWARQSATPYQPPARVGPALSPLSMSRMALRSADLARAGDYYGRLFGTEIASA
Ga0318519_102173961 | 3300033290 | Soil | MEFGHGSGRDIISRRGALALFGAIAAMRHARAAEQAIMQPVALDHVNIRVSNVARTAQFHMALFDTLVLRSQTLRAQPNSPPSEGFFLKFGEGYLAISQAFAPDLPGLDHYSLGLRDYDKAKISARLQDNGTPAQARSSADVWLSDPDGNVMQLRQPGG




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice