NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F043328

Overview

Basic Information
Family ID F043328
Family Type Metagenome
Number of Sequences 156
Average Sequence Length 85 residues
Representative Sequence MKNKWVIIGVAGVSVLLGAAGAAASSYNPLKWIKKGPSPTASEQLAANKEEEKKLSLQLQAVLPPRTSLRDACAGFK
Number of Associated Samples 122
Number of Associated Scaffolds 156

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Bacteria
% of genes with valid RBS motifs 98.72 %
% of genes near scaffold ends (potentially truncated) 96.15 %
% of genes from short scaffolds (< 2000 bps) 88.46 %
Associated GOLD sequencing projects 110
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.29

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.

Most Common Taxonomy
Group Bacteria (96.154 % of family members)
NCBI Taxonomy ID 2
Taxonomy All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(33.974 % of family members)
Environment Ontology (ENVO) Unclassified
(32.051 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(51.282 % of family members)



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: Yes
Secondary Structure distribution: α-helix: 52.38%, β-sheet: 0.00%, Coil/Unstructured: 47.62%

Predicted 3D Structure

Per-residue confidence (pLDDT) ranges: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.29

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
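
The low-confidence rule stated above (pTM < 0.7) and the pLDDT ranges listed for the predicted structure (0-50, 51-70, 71-90, 91-100) can be applied programmatically. The following is a minimal Python sketch, not NMPFamsDB code: the function names are hypothetical, and assigning fractional pLDDT values to the next range up (for example 50.5 to 51-70) is an assumption.

```python
# Cutoff taken from the page text: models with pTM < 0.7 are low confidence.
PTM_LOW_CONFIDENCE_CUTOFF = 0.7

# pLDDT ranges as listed on the page (inclusive integer bounds).
PLDDT_RANGES = [(0, 50), (51, 70), (71, 90), (91, 100)]


def is_low_confidence(ptm: float) -> bool:
    """Return True when a model falls under the page's pTM cutoff."""
    return ptm < PTM_LOW_CONFIDENCE_CUTOFF


def plddt_range(plddt: float) -> str:
    """Return the label of the pLDDT range containing the given value.

    Assumption: a fractional value between two ranges (e.g. 50.5)
    is assigned to the higher range.
    """
    if not 0 <= plddt <= 100:
        raise ValueError(f"pLDDT must be within 0-100, got {plddt}")
    for low, high in PLDDT_RANGES:
        if plddt <= high:
            return f"{low}-{high}"
    raise AssertionError("unreachable: value already validated")


# Family F043328 reports a pTM-score of 0.29.
family_is_low_confidence = is_low_confidence(0.29)
```

With the reported pTM-score of 0.29, `family_is_low_confidence` evaluates to `True`, which matches the "Low Quality Model" label on this page.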


Gene Neighborhood

Neighboring Pfam domains

Pfam ID  Name             % Frequency in 156 Family Scaffolds
PF02621  VitK2_biosynth   13.46
PF00903  Glyoxalase       8.33
PF01112  Asparaginase_2   5.13
PF13377  Peripla_BP_3     2.56
PF00990  GGDEF            1.92
PF12681  Glyoxalase_2     1.92
PF01336  tRNA_anti-codon  1.28
PF01266  DAO              1.28
PF11954  DUF3471          0.64
PF07963  N_methyl         0.64
PF00069  Pkinase          0.64
PF00532  Peripla_BP_1     0.64
PF01261  AP_endonuc_2     0.64
PF13673  Acetyltransf_10  0.64
PF04794  YdjC             0.64
PF01494  FAD_binding_3    0.64
PF13545  HTH_Crp_2        0.64
PF01019  G_glu_transpept  0.64
PF00581  Rhodanese        0.64
PF00756  Esterase         0.64
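
The frequencies above are percentages of the 156 family scaffolds, so they can be converted back to approximate scaffold counts. This is a minimal Python sketch under the assumption that each percentage is a rounded ratio of scaffold count to 156; the helper name `scaffolds_from_percent` is hypothetical.

```python
FAMILY_SCAFFOLDS = 156  # "Number of Associated Scaffolds" for family F043328


def scaffolds_from_percent(percent: float, total: int = FAMILY_SCAFFOLDS) -> int:
    """Estimate how many scaffolds a percentage frequency corresponds to.

    Assumption: the reported percentage is count / total * 100, rounded
    to two decimals, so rounding the product recovers the count.
    """
    return round(percent / 100 * total)


# Four most frequent neighboring Pfam domains from the table above.
pfam_frequency = {
    "PF02621": 13.46,
    "PF00903": 8.33,
    "PF01112": 5.13,
    "PF13377": 2.56,
}

pfam_counts = {
    pfam_id: scaffolds_from_percent(percent)
    for pfam_id, percent in pfam_frequency.items()
}
```

Under that assumption, the lowest reported value, 0.64, corresponds to a single scaffold.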

Neighboring Clusters of Orthologous Genes (COGs)

COG ID  Name  Functional Category  % Frequency in 156 Family Scaffolds
COG1427  Chorismate dehydratase (menaquinone biosynthesis, futalosine pathway)  Coenzyme transport and metabolism [H]  13.46
COG1446  Isoaspartyl peptidase or L-asparaginase, Ntn-hydrolase superfamily  Amino acid transport and metabolism [E]  5.13
COG0515  Serine/threonine protein kinase  Signal transduction mechanisms [T]  2.56
COG0654  2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases  Energy production and conversion [C]  1.28
COG0405  Gamma-glutamyltranspeptidase  Amino acid transport and metabolism [E]  0.64
COG0578  Glycerol-3-phosphate dehydrogenase  Energy production and conversion [C]  0.64
COG0644  Dehydrogenase (flavoprotein)  Energy production and conversion [C]  0.64
COG0665  Glycine/D-amino acid oxidase (deaminating)  Amino acid transport and metabolism [E]  0.64
COG3394  Chitooligosaccharide deacetylase ChbG, YdjC/CelG family  Carbohydrate transport and metabolism [G]  0.64


Phylogeny

NCBI Taxonomy

Name           Rank  Taxonomy       Distribution
All Organisms  root  All Organisms  96.15 %
Unclassified   root  N/A            3.85 %

Associated Scaffolds


Scaffold  Taxonomy  Length
3300001137|JGI12637J13337_1010146  All Organisms → cellular organisms → Bacteria  800
3300002914|JGI25617J43924_10255943  All Organisms → cellular organisms → Bacteria  593
3300002914|JGI25617J43924_10287647  All Organisms → cellular organisms → Bacteria  563
3300002917|JGI25616J43925_10002546  All Organisms → cellular organisms → Bacteria → Acidobacteria  7481
3300004633|Ga0066395_10838914  All Organisms → cellular organisms → Bacteria  553
3300005166|Ga0066674_10516876  All Organisms → cellular organisms → Bacteria  536
3300005184|Ga0066671_10968490  All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Neisseriales → Chromobacteriaceae → Pseudogulbenkiania → Pseudogulbenkiania ferrooxidans  536
3300005537|Ga0070730_10096296  All Organisms → cellular organisms → Bacteria  2053
3300005537|Ga0070730_11068597  Not Available  501
3300005555|Ga0066692_10144751  All Organisms → cellular organisms → Bacteria  1454
3300005557|Ga0066704_10038995  All Organisms → cellular organisms → Bacteria  2948
3300005561|Ga0066699_11154867  All Organisms → cellular organisms → Bacteria  533
3300005568|Ga0066703_10296615  All Organisms → cellular organisms → Bacteria  978
3300005569|Ga0066705_10069307  All Organisms → cellular organisms → Bacteria  2031
3300005574|Ga0066694_10381177  All Organisms → cellular organisms → Bacteria  665
3300005598|Ga0066706_10241553  All Organisms → cellular organisms → Bacteria  1403
3300005610|Ga0070763_10895217  All Organisms → cellular organisms → Bacteria  528
3300006031|Ga0066651_10813771  All Organisms → cellular organisms → Bacteria  507
3300006755|Ga0079222_12114374  All Organisms → cellular organisms → Bacteria  557
3300006796|Ga0066665_10468895  All Organisms → cellular organisms → Bacteria → Acidobacteria  1036
3300006804|Ga0079221_10316312  All Organisms → cellular organisms → Bacteria  924
3300006806|Ga0079220_10290843  Not Available  1003
3300006806|Ga0079220_10717630  All Organisms → cellular organisms → Bacteria  738
3300006854|Ga0075425_101935192  All Organisms → cellular organisms → Bacteria  660
3300006954|Ga0079219_10920002  All Organisms → cellular organisms → Bacteria  711
3300007255|Ga0099791_10003300  All Organisms → cellular organisms → Bacteria → Acidobacteria  6560
3300007255|Ga0099791_10126718  All Organisms → cellular organisms → Bacteria  1185
3300007265|Ga0099794_10110757  All Organisms → cellular organisms → Bacteria  1375
3300009012|Ga0066710_100007190  All Organisms → cellular organisms → Bacteria → Acidobacteria  10586
3300009012|Ga0066710_102710754  All Organisms → cellular organisms → Bacteria → Acidobacteria  706
3300009012|Ga0066710_102817057  All Organisms → cellular organisms → Bacteria  687
3300009012|Ga0066710_103269402  All Organisms → cellular organisms → Bacteria  620
3300009038|Ga0099829_10859940  All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium  753
3300009038|Ga0099829_11443814  All Organisms → cellular organisms → Bacteria  568
3300009088|Ga0099830_11327389  All Organisms → cellular organisms → Bacteria  598
3300009143|Ga0099792_10761962  All Organisms → cellular organisms → Bacteria  631
3300009143|Ga0099792_11082632  All Organisms → cellular organisms → Bacteria  539
3300009792|Ga0126374_10329490  All Organisms → cellular organisms → Bacteria  1037
3300010047|Ga0126382_10869495  All Organisms → cellular organisms → Bacteria  776
3300010048|Ga0126373_10018354  All Organisms → cellular organisms → Bacteria → Acidobacteria  5861
3300010322|Ga0134084_10318787  All Organisms → cellular organisms → Bacteria  583
3300010335|Ga0134063_10556031  All Organisms → cellular organisms → Bacteria  580
3300010358|Ga0126370_10022349  All Organisms → cellular organisms → Bacteria  3663
3300010358|Ga0126370_11916877  All Organisms → cellular organisms → Bacteria  577
3300010360|Ga0126372_11525804  All Organisms → cellular organisms → Bacteria  705
3300010361|Ga0126378_10682493  All Organisms → cellular organisms → Bacteria  1139
3300010361|Ga0126378_12359679  All Organisms → cellular organisms → Bacteria  607
3300010364|Ga0134066_10014465  All Organisms → cellular organisms → Bacteria  1656
3300010376|Ga0126381_100030397  All Organisms → cellular organisms → Bacteria  6438
3300010376|Ga0126381_103748153  All Organisms → cellular organisms → Bacteria  594
3300010401|Ga0134121_10618296  All Organisms → cellular organisms → Bacteria  1016
3300011269|Ga0137392_10261790  All Organisms → cellular organisms → Bacteria  1425
3300011270|Ga0137391_10416427  All Organisms → cellular organisms → Bacteria  1146
3300011270|Ga0137391_11135767  All Organisms → cellular organisms → Bacteria  629
3300011270|Ga0137391_11529798  All Organisms → cellular organisms → Bacteria  512
3300011271|Ga0137393_10596536  All Organisms → cellular organisms → Bacteria  947
3300011271|Ga0137393_10596539  All Organisms → cellular organisms → Bacteria  947
3300011271|Ga0137393_11743148  All Organisms → cellular organisms → Bacteria  511
3300012096|Ga0137389_10731851  All Organisms → cellular organisms → Bacteria  850
3300012096|Ga0137389_11063153  All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae → unclassified Acidobacteriaceae → Acidobacteriaceae bacterium  693
3300012189|Ga0137388_10083404  All Organisms → cellular organisms → Bacteria  2701
3300012189|Ga0137388_10341114  All Organisms → cellular organisms → Bacteria  1380
3300012189|Ga0137388_11494470  All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium  613
3300012199|Ga0137383_11356145  All Organisms → cellular organisms → Bacteria  505
3300012203|Ga0137399_10094516  All Organisms → cellular organisms → Bacteria → Acidobacteria  2299
3300012203|Ga0137399_10212566  All Organisms → cellular organisms → Bacteria  1574
3300012203|Ga0137399_10423989  All Organisms → cellular organisms → Bacteria  1111
3300012205|Ga0137362_10872625  All Organisms → cellular organisms → Bacteria → Acidobacteria  769
3300012207|Ga0137381_10695556  All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium  884
3300012209|Ga0137379_10824192  All Organisms → cellular organisms → Bacteria  832
3300012211|Ga0137377_11559961  All Organisms → cellular organisms → Bacteria  585
3300012285|Ga0137370_10274730  All Organisms → cellular organisms → Bacteria  1002
3300012351|Ga0137386_10267914  All Organisms → cellular organisms → Bacteria  1229
3300012357|Ga0137384_10332086  All Organisms → cellular organisms → Bacteria  1262
3300012357|Ga0137384_10617944  All Organisms → cellular organisms → Bacteria  884
3300012357|Ga0137384_10717339  All Organisms → cellular organisms → Bacteria  812
3300012362|Ga0137361_10921386  All Organisms → cellular organisms → Bacteria  792
3300012363|Ga0137390_10281970  All Organisms → cellular organisms → Bacteria  1645
3300012363|Ga0137390_10975104  All Organisms → cellular organisms → Bacteria  800
3300012683|Ga0137398_10535275  All Organisms → cellular organisms → Bacteria  807
3300012917|Ga0137395_10130187  All Organisms → cellular organisms → Bacteria  1705
3300012917|Ga0137395_10882060  All Organisms → cellular organisms → Bacteria  648
3300012918|Ga0137396_10339907  All Organisms → cellular organisms → Bacteria → Acidobacteria  1111
3300012925|Ga0137419_10018025  All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae  4048
3300012930|Ga0137407_12222393  All Organisms → cellular organisms → Bacteria  524
3300012971|Ga0126369_10091717  All Organisms → cellular organisms → Bacteria → Acidobacteria  2726
3300012976|Ga0134076_10481260  All Organisms → cellular organisms → Bacteria  564
3300014166|Ga0134079_10623289  All Organisms → cellular organisms → Bacteria  539
3300015054|Ga0137420_1326665  All Organisms → cellular organisms → Bacteria  1447
3300015054|Ga0137420_1384581  All Organisms → cellular organisms → Bacteria  4993
3300015241|Ga0137418_10506557  All Organisms → cellular organisms → Bacteria  964
3300015359|Ga0134085_10286239  All Organisms → cellular organisms → Bacteria  722
3300016422|Ga0182039_11744695  All Organisms → cellular organisms → Bacteria  570
3300018431|Ga0066655_11033612  All Organisms → cellular organisms → Bacteria  570
3300018482|Ga0066669_12084750  All Organisms → cellular organisms → Bacteria  534
3300020199|Ga0179592_10284798  All Organisms → cellular organisms → Bacteria  736
3300020580|Ga0210403_10022748  All Organisms → cellular organisms → Bacteria  5005
3300020583|Ga0210401_10582727  All Organisms → cellular organisms → Bacteria  980
3300021170|Ga0210400_11226558  Not Available  603
3300021180|Ga0210396_11198760  All Organisms → cellular organisms → Bacteria  635
3300021401|Ga0210393_11050839  All Organisms → cellular organisms → Bacteria  658
3300021474|Ga0210390_10694166  All Organisms → cellular organisms → Bacteria  848
3300021479|Ga0210410_10492256  All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium  1095
3300021560|Ga0126371_12609503  All Organisms → cellular organisms → Bacteria  612
3300021560|Ga0126371_13008768  All Organisms → cellular organisms → Bacteria  571
3300024187|Ga0247672_1079561  Not Available  563
3300025922|Ga0207646_11948498  All Organisms → cellular organisms → Bacteria  500
3300026297|Ga0209237_1176610  All Organisms → cellular organisms → Bacteria  757
3300026318|Ga0209471_1224258  All Organisms → cellular organisms → Bacteria  687
3300026323|Ga0209472_1107906  All Organisms → cellular organisms → Bacteria  1104
3300026330|Ga0209473_1180526  All Organisms → cellular organisms → Bacteria  821
3300026334|Ga0209377_1122230  All Organisms → cellular organisms → Bacteria  1043
3300026532|Ga0209160_1188128  All Organisms → cellular organisms → Bacteria  837
3300026542|Ga0209805_1152785  All Organisms → cellular organisms → Bacteria  1058
3300026542|Ga0209805_1323188  All Organisms → cellular organisms → Bacteria  585
3300026550|Ga0209474_10061078  All Organisms → cellular organisms → Bacteria → Acidobacteria  2654
3300026550|Ga0209474_10656882  All Organisms → cellular organisms → Bacteria  540
3300026551|Ga0209648_10356961  All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium  1000
3300026557|Ga0179587_10097249  All Organisms → cellular organisms → Bacteria  1776
3300026557|Ga0179587_10893381  All Organisms → cellular organisms → Bacteria  586
3300027061|Ga0209729_1029661  All Organisms → cellular organisms → Bacteria  677
3300027562|Ga0209735_1072678  All Organisms → cellular organisms → Bacteria  744
3300027765|Ga0209073_10061263  Not Available  1258
3300027775|Ga0209177_10126310  All Organisms → cellular organisms → Bacteria  842
3300027846|Ga0209180_10357114  All Organisms → cellular organisms → Bacteria  833
3300027846|Ga0209180_10726612  All Organisms → cellular organisms → Bacteria  538
3300027874|Ga0209465_10573517  All Organisms → cellular organisms → Bacteria  560
3300027875|Ga0209283_10293830  All Organisms → cellular organisms → Bacteria  1073
3300027882|Ga0209590_10175348  All Organisms → cellular organisms → Bacteria  1344
3300027884|Ga0209275_10118348  Not Available  1375
3300027889|Ga0209380_10316592  All Organisms → cellular organisms → Bacteria  916
3300028536|Ga0137415_10322386  All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae  1349
3300031680|Ga0318574_10264841  All Organisms → cellular organisms → Bacteria  996
3300031682|Ga0318560_10781807  All Organisms → cellular organisms → Bacteria  515
3300031718|Ga0307474_10508052  All Organisms → cellular organisms → Bacteria  945
3300031751|Ga0318494_10134154  All Organisms → cellular organisms → Bacteria  1387
3300031754|Ga0307475_10306504  All Organisms → cellular organisms → Bacteria  1275
3300031754|Ga0307475_10582265  All Organisms → cellular organisms → Bacteria  897
3300031754|Ga0307475_10736141  All Organisms → cellular organisms → Bacteria → Acidobacteria  785
3300031763|Ga0318537_10272273  All Organisms → cellular organisms → Bacteria  627
3300031782|Ga0318552_10210162  All Organisms → cellular organisms → Bacteria  985
3300031796|Ga0318576_10116908  All Organisms → cellular organisms → Bacteria  1228
3300031820|Ga0307473_10091888  All Organisms → cellular organisms → Bacteria  1589
3300031823|Ga0307478_11304775  All Organisms → cellular organisms → Bacteria  603
3300031845|Ga0318511_10231537  All Organisms → cellular organisms → Bacteria  826
3300031846|Ga0318512_10053185  All Organisms → cellular organisms → Bacteria  1812
3300031879|Ga0306919_11285453  All Organisms → cellular organisms → Bacteria  554
3300031912|Ga0306921_10397268  All Organisms → cellular organisms → Bacteria  1609
3300031945|Ga0310913_10004570  All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae  7791
3300031945|Ga0310913_10052738  All Organisms → cellular organisms → Bacteria → Acidobacteria  2660
3300031946|Ga0310910_10178293  All Organisms → cellular organisms → Bacteria  1639
3300031962|Ga0307479_11028161  All Organisms → cellular organisms → Bacteria  792
3300032009|Ga0318563_10617714  All Organisms → cellular organisms → Bacteria  584
3300032055|Ga0318575_10701439  All Organisms → cellular organisms → Bacteria  512
3300032076|Ga0306924_12434539  All Organisms → cellular organisms → Bacteria  527
3300032180|Ga0307471_100848828  All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae  1080
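
Each scaffold identifier above has the form `<number>|<scaffold name>`, and the number before the `|` matches a Taxon OID in the Associated Samples table (for example 3300001137). A minimal Python sketch of splitting identifiers and counting scaffolds per sample follows; the function name `split_scaffold_id` is hypothetical, and treating the numeric prefix as the sample's Taxon OID is an inference from this page rather than a documented NMPFamsDB rule.

```python
from collections import Counter


def split_scaffold_id(scaffold_id: str) -> tuple[str, str]:
    """Split '<taxon OID>|<scaffold name>' into its two parts.

    Assumption: the part before '|' is the numeric Taxon OID of the
    sample the scaffold was assembled from.
    """
    taxon_oid, separator, scaffold_name = scaffold_id.partition("|")
    if not separator or not taxon_oid.isdigit() or not scaffold_name:
        raise ValueError(f"unexpected scaffold identifier: {scaffold_id!r}")
    return taxon_oid, scaffold_name


# Three identifiers copied from the Associated Scaffolds table.
example_ids = [
    "3300002914|JGI25617J43924_10255943",
    "3300002914|JGI25617J43924_10287647",
    "3300002917|JGI25616J43925_10002546",
]

# Count how many family scaffolds come from each sample.
scaffolds_per_sample = Counter(split_scaffold_id(s)[0] for s in example_ids)
```

Counting this way over the full table would show which samples contribute more than one scaffold, which is why the family has 156 scaffolds but only 122 associated samples.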



Environmental Properties

Associated Habitat Types

Habitat  Taxonomy  Distribution
Vadose Zone Soil  Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil  33.97%
Soil  Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil  13.46%
Soil  Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil  12.82%
Tropical Forest Soil  Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil  8.33%
Grasslands Soil  Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil  7.05%
Hardwood Forest Soil  Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil  5.13%
Agricultural Soil  Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil  4.49%
Grasslands Soil  Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil  3.85%
Soil  Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil  2.56%
Forest Soil  Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil  1.92%
Soil  Environmental → Terrestrial → Soil → Loam → Forest Soil → Soil  1.92%
Surface Soil  Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil  1.28%
Tropical Forest Soil  Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil  1.28%
Terrestrial Soil  Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil  0.64%
Corn, Switchgrass And Miscanthus Rhizosphere  Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere  0.64%
Populus Rhizosphere  Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere  0.64%



Associated Samples

Taxon OID  Sample Name  Habitat Type
3300001137  Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_M3  Environmental
3300002914  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm  Environmental
3300002917  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_100cm  Environmental
3300004633  Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 1 MoBio  Environmental
3300005166  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123  Environmental
3300005184  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_120  Environmental
3300005537  Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1  Environmental
3300005555  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141  Environmental
3300005557  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153  Environmental
3300005561  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148  Environmental
3300005568  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152  Environmental
3300005569  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154  Environmental
3300005574  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_143  Environmental
3300005598  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155  Environmental
3300005610  Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 3  Environmental
3300006031  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Angelo_100  Environmental
3300006755  Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter  Environmental
3300006796  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114  Environmental
3300006804  Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200  Environmental
3300006806  Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100  Environmental
3300006854  Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD4  Host-Associated
3300006954  Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Control  Environmental
3300007255  Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1  Environmental
3300007265  Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1  Environmental
3300009012  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159  Environmental
3300009038  Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG  Environmental
3300009088  Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG  Environmental
3300009143  Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2  Environmental
3300009792  Tropical forest soil microbial communities from Panama - MetaG Plot_12  Environmental
3300010047  Tropical forest soil microbial communities from Panama - MetaG Plot_30  Environmental
3300010048  Tropical forest soil microbial communities from Panama - MetaG Plot_11  Environmental
3300010322  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_2_24_1 metaG  Environmental
3300010335  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09082015  Environmental
3300010358  Tropical forest soil microbial communities from Panama - MetaG Plot_3  Environmental
3300010360  Tropical forest soil microbial communities from Panama - MetaG Plot_6  Environmental
3300010361  Tropical forest soil microbial communities from Panama - MetaG Plot_23  Environmental
3300010364  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09212015  Environmental
3300010376  Tropical forest soil microbial communities from Panama - MetaG Plot_28  Environmental
3300010401  Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1  Environmental
3300011269  Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaG  Environmental
3300011270  Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaG  Environmental
3300011271  Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaG  Environmental
3300012096  Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG  Environmental
3300012189  Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG  Environmental
3300012199  Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaG  Environmental
3300012203  Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG  Environmental
3300012205  Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG  Environmental
3300012207  Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaG  Environmental
3300012209  Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaG  Environmental
3300012211  Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaG  Environmental
3300012285  Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaG  Environmental
3300012351  Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaG  Environmental
3300012357  Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaG  Environmental
3300012362  Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG  Environmental
3300012363  Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaG  Environmental
3300012683  Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaG  Environmental
3300012917  Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaG  Environmental
3300012918  Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaG  Environmental
3300012925  Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)  Environmental
3300012930  Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)  Environmental
3300012971  Tropical forest soil microbial communities from Panama - MetaG Plot_1  Environmental
3300012976  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_0_1 metaG  Environmental
3300014166  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09182015  Environmental
3300015054  Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (PacBio error correction)  Environmental
3300015241  Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly)  Environmental
3300015359  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09182015  Environmental
3300016422  Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.111  Environmental
3300018431  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104  Environmental
3300018482  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118  Environmental
3300020199  Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly)  Environmental
3300020580  Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-M  Environmental
3300020583  Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-M  Environmental
3300021170  Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-M  Environmental
3300021180  Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-O  Environmental
3300021401  Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-O  Environmental
3300021474  Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-O  Environmental
3300021479  Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-M  Environmental
3300021560  Tropical forest soil microbial communities from Panama - MetaG Plot_4  Environmental
3300024187  Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK13  Environmental
3300025922  Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)  Environmental
3300026297  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm (SPAdes)  Environmental
3300026318  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 (SPAdes)  Environmental
3300026323  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_130 (SPAdes)  Environmental
3300026330  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133 (SPAdes)  Environmental
3300026334  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 (SPAdes)  Environmental
3300026532  Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 (SPAdes)  Environmental
3300026542Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 (SPAdes)EnvironmentalOpen in IMG/M
3300026550Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 (SPAdes)EnvironmentalOpen in IMG/M
3300026551Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes)EnvironmentalOpen in IMG/M
3300026557Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungalEnvironmentalOpen in IMG/M
3300027061Forest soil microbial communities from Davy Crockett National Forest, Groveton, Texas, USA - Texas A ecozone_OM1H0_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027562Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027765Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100 (SPAdes)EnvironmentalOpen in IMG/M
3300027775Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Control (SPAdes)EnvironmentalOpen in IMG/M
3300027846Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027874Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil Plot 1 MoBio (SPAdes)EnvironmentalOpen in IMG/M
3300027875Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027882Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027884Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF2 (SPAdes)EnvironmentalOpen in IMG/M
3300027889Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 6 (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300031680Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.089b5f22EnvironmentalOpen in IMG/M
3300031682Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.065b5f22EnvironmentalOpen in IMG/M
3300031718Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_05EnvironmentalOpen in IMG/M
3300031751Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.108b1f24EnvironmentalOpen in IMG/M
3300031754Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515EnvironmentalOpen in IMG/M
3300031763Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.117b4f29EnvironmentalOpen in IMG/M
3300031782Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.082b2f20EnvironmentalOpen in IMG/M
3300031796Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.089b5f24EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300031823Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_05EnvironmentalOpen in IMG/M
3300031845Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.171b2f18EnvironmentalOpen in IMG/M
3300031846Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.171b2f19EnvironmentalOpen in IMG/M
3300031879Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.172 (v2)EnvironmentalOpen in IMG/M
3300031912Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.080 (v2)EnvironmentalOpen in IMG/M
3300031945Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.OX082EnvironmentalOpen in IMG/M
3300031946Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.HF172EnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M
3300032009Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.066b5f19EnvironmentalOpen in IMG/M
3300032055Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.089b5f23EnvironmentalOpen in IMG/M
3300032076Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statanox.12C.anox.44.000.111 (v2)EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M

Geographical Distribution
[Interactive map of sample locations, powered by OpenStreetMap]




Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI12637J13337_101014613300001137Forest SoilMKSKWIIIGVAAVCVLLGGSGTAAISYNPIKWIKKAPGPTASEQLAANKEEEKKLSVQLHAVLPPRTSVKDACAGFKSLNDCVAALYASHNLQIKFTCLKWDV
JGI25617J43924_1025594323300002914Grasslands SoilMKNRWIIVGVAGVSMLLGAAGAAASSYNPLKWIKKGPSPTASEQLAANQQEEKKLSLQLQAVLPPRTSLRDACAGFKSLNDCVASL
JGI25617J43924_1028764713300002914Grasslands SoilMKNKWVIIGVAGVSVLLGAAGAAASSYNPLKWIKKGPTPTASEQLAANKEEEKKLSLQLQAVLPPRTSLRDACAGF
JGI25616J43925_1000254693300002917Grasslands SoilMKNRWIIVGILGVSVLAGAARAAEHSYNPIKWFKKKPAPTASQQLAGNSEMEKKLTTQLQAELPSRTR
Ga0066395_1083891413300004633Tropical Forest SoilMTKKSFIIGAAAVSMLLCAAGAAARSYNPLKWLKGSPRQTANEQLAANKEEERKLTLQLQALLPPKTTLREACSGFLSLEDCVAALHVSRNLKMKYNCLKWDLTAARPSGDVKTCEAPPRDKPLS
Ga0066674_1051687613300005166SoilMKYKWLIIGAAGISILLGTAGAAARSYNPLKWIKGSPRLTANEQLAANKEEERKLTLQLQALLPP
Ga0066671_1096849013300005184SoilMLLGAAVAAGHSYNPLKWIKKSPGPTANEQLAADKEEERRLTMQLQALLPPKTTLREACSGFLSLEDCVAALHVSRNVKIKFN
Ga0070730_1009629613300005537Surface SoilMKNKWIIVGVAAGCVLVGGAGTAAHTYNPIKWIKRGPTPTASQQLAANKEQDKKLSVQLQAVL
Ga0070730_1106859713300005537Surface SoilMKNKWIIVGVAAVCVLVGGAGTAAHTYNPIKWIKRGPTPTASQQLAANKEQDKKLSVQLQAVL
Ga0066692_1014475123300005555SoilMKNRWIIIGVAGVSVLLGAAGAAASSYNPLKWIKKGPNPTASEQLAANKEEEKKLSLQLQAVLPPRTS
Ga0066704_1003899513300005557SoilMKTKWLVMGAAGIGILLGASGAAAQSYNPLKWIKKNPGPTANEQLAANQEEERKLSLQLQALLPPKTTLREACNGLLSLEDCVAALHVSRNLKIKYNCLKWDMTAAT
Ga0066699_1115486713300005561SoilMKNRWIVVGVAGVSVLLGGATAAAHSYNPIKWIRKGPSPTASEQLAANKEQEKKLSLQLQAVLPPRTSMKDACAGFRTLSDCVASLHVSRNLSIKF
Ga0066703_1029661523300005568SoilMKTKWLVMGAAGIGILLGASGAAAQSYNPLKWIKKNPGPTANEQLAANQEEERKLSLQLQALLPPKTTLREACNGLLSLEDCVAALHV
Ga0066705_1006930733300005569SoilMKHKWLVIGGAGITMLLGAAGAAGHSYNPLKWIKKSPGPTANEQLAADKEEERKLTMQLQALLPPKTTLREACSGFLSLEDCVAALHVSRNVKIKFNCLKWDMTAARPSGDVKACEAPPRDKAVSLD
Ga0066694_1038117713300005574SoilMKNKWFFVGVAGVSILLGATGAAARSYNPLKWIKRNPGPTANEQLAANKEEERKLTLQLQALLPPKIT
Ga0066706_1024155313300005598SoilMKTKWLVMGAAGIGILLGASGAAAQSYNPLKWIKKNPGPTANEQLAANQEEERKLSLQLQALLPPKTTLREACNGLLSL
Ga0070763_1089521713300005610SoilMKSKWIIIGVATVCILLAGSGTAALSYNPLKWIKKPPSPTASEQLAANKDEEKKLSVQLQAVLPPRTSVKDACAGFKSLNDCVA
Ga0066651_1081377113300006031SoilMKYKWLIIGAAGISILLGTAGAAARSYNPLKWIKGSPRLTANEQLAANKEEERKLTLQLQALLPPKTTL
Ga0079222_1211437413300006755Agricultural SoilMKNKWFVVVAAGVSILLGAAGAAARSYNPLKWIKKNPGPTANEQLAANKEEERKLTLQLQALLPPKTTLRDACSGFVSLEECVAALHVSRNVKIKYNCLKWDMTAAR
Ga0066665_1046889523300006796SoilMKNRWIVVGVAGVSVLLGGATAAAHSYNPIKWIRKGPSPTASEQLAGNKEQEKKLSLQLQAVLPPRTSMKDACAGFRTLSDCVASLHVSRNLSIKFNCLKWDVTGAKP
Ga0079221_1031631213300006804Agricultural SoilMKGKWIIIGVTGASVLFGGAAAAARSYNPIKWIKKGPAPTATQELKANAEEERKLTLQLQALLPPRTTLLDACSGFKALDECVAALHVSRNANI
Ga0079220_1029084323300006806Agricultural SoilMKNRWFVVGAVGVSILLGAAGAAARSYNPLKWIKKNPGPTANEQLAANKEEERKLTLQLQALLPPKTTLR
Ga0079220_1071763013300006806Agricultural SoilMKNKWLVIGVASISILLGAVGAAGRSYNPLKWIRGNPRLTANEQLAANKEEERKLTLQLQALLPPKTTLRDACSGFTSLEDCVAALHVSRNVKLKFNCL
Ga0075425_10193519213300006854Populus RhizosphereMKNRWFVVGAVGVSILLGAAGAAARSYNPLKWIKKNPGPTANEQLAANKEEERKLTLQLQALLPPKTTLRDACSGFLSLEDCVA
Ga0079219_1092000213300006954Agricultural SoilMKNKWFVVVAAGVSILLGAAGAAARSYNPLKWIKKNPGPTANEQLAANKEEERKLTLQLQALLPPKTTLRDACSGFVSLEECVAALHVSRNVKIKYNCLKWDMTAARPS
Ga0099791_1000330013300007255Vadose Zone SoilMKNKWVIIGVAGVSVLLGAAGAAASSYNPLKWIKKGPSPTASEQLAANKEEEKKLSLQLQAVLPPRTSLRD
Ga0099791_1012671823300007255Vadose Zone SoilMKNKWIIVGVVAVSVLLGGALAAAHSYNPIKWIKKGPSPTASEQLAANKEEEKKLSLQLQALLPPRTSLKDACAGFK
Ga0099794_1011075713300007265Vadose Zone SoilMKNRWIIVSVAGVSVLLGAAGAAASSYNPLKWIKKGPSPTATEQLAANKEEEKKLSLQLQALLPP
Ga0066710_100007190123300009012Grasslands SoilMKNRWIVIAVAGLGVLLGAAGAAASSYNPLKWIKKGPSPTASEQLAANKEEEKKLSLQLQAVLPPRTSLKDACAGFK
Ga0066710_10271075433300009012Grasslands SoilMKNRWIIVGVVGVSVLLGAAGAAASSYNPLKWIKKGPSPTASEQLAANKEEEKKLSLQLQAVLPPRTSLKDACAGFK
Ga0066710_10281705713300009012Grasslands SoilMKNRWIIVGVAGVSVLLGAAGAAASSYNPLKWIKKGPSPTASEQLAANKEEEKKLSLQLQAVLPPRTSLKDACAGFKNLNDC
Ga0066710_10326940223300009012Grasslands SoilMKNKWIVVGVAGVSVLLGGATAAAHSYNPIKWIRKGPSPTASEQLAANKEQEKKLSLQLQAVLPPRTSMK
Ga0099829_1085994013300009038Vadose Zone SoilMKSKWIIAGVAATGVLLGGAAAEARSYNPVKWIKKSPSPTATEQLAANRDEEKKLTLQLQALLPPRTPLKDACSAFKQLEDCVEALHVSHNLNIKFN
Ga0099829_1144381423300009038Vadose Zone SoilMKNRWIIVGVAGVSVLLGAAGVAASSYNPLKWIKKGPSPTASEQLAANQQEEKKLSLQLQALLPSRTSLRDACAGF
Ga0099830_1132738923300009088Vadose Zone SoilMKNRWIIVGVAAVSVLLGGAGTAASSYNPLKWIKKGPSPTASEQLAANKEEEKKLSLQLQAVLP
Ga0099792_1076196213300009143Vadose Zone SoilMLLGSAGVAARSNNPLKWIKKGPSPTASEQLAANKEEERKLSLQLQAVLPPRTSLKDACA
Ga0099792_1108263213300009143Vadose Zone SoilMKNRWMVIGVAGISVLLGGATAAAHSYNPIKWIRKGPSPTASEQLAANKEQEKKLSQQLQAVLP
Ga0126374_1032949023300009792Tropical Forest SoilMKNKWLVIAAAGISLLVGASGAAARSYNPLKWIKKNPGLTANEQLAANKEEERKLTLQLQALLPPKTTLRE
Ga0126382_1086949513300010047Tropical Forest SoilMKNRWFVVGAAGVSILLGAAGAAARSYNPLKWMKKNPGATATEQLAAHPEEERKLTLQLQALLLSKTILRDVCSGFLMFEDCVAALHINRN
Ga0126373_1001835413300010048Tropical Forest SoilMKNKWLVIAAAGISLLVGASGAAARSYNPLKWIKKNPGLTANEQLAANKEEERKLTLQLQALLPPKTTLREACSGFLSLEDCVAALHVSRNLKIKY
Ga0134084_1031878713300010322Grasslands SoilMKNKWFFVGVAGVSILLGATGAAARSYNPLKWIKRNPGPTANEQLAANKEEERKLTLQLQALLPPKITLRDACSGFLSLEDCVAALHVSRNLK
Ga0134063_1055603113300010335Grasslands SoilMKYKWLIIGAAGISILLGTAGAAARSYNPLKWIKGSPRLTANEQLAANKEEERKLTLQLQALLPPKTTLREACSGFLSLQDCVAA
Ga0126370_1002234913300010358Tropical Forest SoilMKNRWFVVGAAGVSILLGAAGAAARSYNPLKWMKKNPGATATEQLAAHPEEERKLTLQLQALLPPKTTLRDACSGFLTLEDCVAALHVSRNVKLKYNCLKWALTAARPSGDVR
Ga0126370_1191687713300010358Tropical Forest SoilMKNKWLVIAAAGISLLVGASGAAARSYNPLKWIKKNPGLTANEQLAANKEEERKLTLQLQALLPPKTTLREACSGFLSLEDCVAALH
Ga0126372_1152580413300010360Tropical Forest SoilMKNKWLVIGVAGISMLLGAAGAAGRSYNPLKWIKKGPGLTANEELAKNKEEERKLTLQLQALLPPKTTLRDTCSGFTSLEDCV
Ga0126378_1068249313300010361Tropical Forest SoilMKNKWMVIPAVGIGVLLGASAAGARSYNPIKWIKKSPGPTASQQLAANKDEEKKLTLQLQALLPPRTPLKDACTEFKSLEDCVAALHVSRNLKIKFNCLKWDV
Ga0126378_1235967913300010361Tropical Forest SoilMKNKWFVVGAAAIIILFGAAGAAARSYNPLKWIKKNPGPTANEQLAAHPEEERKLTLQLPALLPSKTTLRDACSGFQTLEDCVAALHVSRNVNLK
Ga0134066_1001446513300010364Grasslands SoilMKYKWLIIGAAGISILLGTAGAAARSYNPLKWIKGSPRLTANEQLAANKEEERKLTLQLQALLPPKTTLREACSGFLSLQDCVAALHVSRNLKIKYNCLKWDMTAARPSGDVKSCETPPRNKA
Ga0126381_10003039753300010376Tropical Forest SoilMKNRWFVVGAAAIIILFGAAGAAARSYNPLKWIKKNPGPTANEQLAAHPEEERKLTLQLQALLPPKT
Ga0126381_10374815313300010376Tropical Forest SoilMKNKWLVIGAAGIILLVGATGAAARSSNPLKWIKKNPGLTANEQLAANKEEERKLSLQLQALLPPKTTLREACSGFLSLEDCVAALHVSRNLK
Ga0134121_1061829613300010401Terrestrial SoilMKNKWMIVRVAGVCLLLGSTVTAAHTYNPIKWIKKGPAPTASEQLAANKEQNKKLSVQLQAVLPPRTSLKDACAGFKSLND
Ga0137392_1026179013300011269Vadose Zone SoilMKNRWIIVGVAGVSMLLGAAGAAASSYNPLKWIKKGPSPTASEQLAANQQEEKKLSLQLQAVLPPRTS
Ga0137391_1041642713300011270Vadose Zone SoilMKNRWIIVGVAGVSVLLGAAGVAASSYNPLKWIKKGPSPTASEQLAANQQEEKKLSLQLQAVLPP
Ga0137391_1113576713300011270Vadose Zone SoilMKNRWIFVGAVGVSVLLGAAGTAASSYNPLKWIKKGPSPTASEQLAANKEEEKKLSLQLQAVLPPRTSLRDACAG
Ga0137391_1152979813300011270Vadose Zone SoilMKNKWIIAGVSVLLCAAGAAARSYNPMKWVKKGPSPTATEQLAAHSEEDKKLTLQLQELLPPRTTLKNA
Ga0137393_1059653633300011271Vadose Zone SoilMKNKWIIVAVAALSVLLGGAATAAHSYNPIKWIKKAPSPTASEQLAANKDQEKKLSLQLQAVLPPRTNLKDACAGF
Ga0137393_1059653933300011271Vadose Zone SoilMKNKWIIVAVAAVSVLLGGAATAAHSYNPMKWIKKSPSPTASEQLAANKDQEKKLSLQLQAVLPPRTNLKDACAGF
Ga0137393_1174314813300011271Vadose Zone SoilMKNRWIIVGVAGVSVLLGGAGTAAHSYNPIKWIKKGPGPTASEQLAANKQEEKKLSLQLQAVLPPRTSLRDACAGFKNLNDCVAALH
Ga0137389_1073185133300012096Vadose Zone SoilMKNRWIIVGVAGVSVLLGAAATAAHSYNPIKWIKKAPSPTASQQLAANKEEQKKLSIQLQAVLPPRTSLKDACAGF
Ga0137389_1106315313300012096Vadose Zone SoilMKNRWIIVGVAGVSVLLGAAGVAASSYNPLKWIKKGPSPTASEQLAANQQEEKKLSLQLQALLPSRTS
Ga0137388_1008340433300012189Vadose Zone SoilMKNKWIVIGAVGISVLLGAAGAAGRSYNPMKWIKKSPGPTASEQLAANKEEEKKLTLQLQALMPPRTTLKDACATFKSLDDCVAALHVSRNLKIKFNCLKWDLTAARPTGTLNHAKRPRETGL*
Ga0137388_1034111423300012189Vadose Zone SoilMKNKWIIVGVVAVSVLLGGALAAAHSYNPIKWIKKGPSPTASEQLAANKEEEKKLSLQLQALLPPR
Ga0137388_1149447023300012189Vadose Zone SoilMKNKWIIVAVAALSVLLGGAATAARSYNPIKWIKKAPSPTASEQLAANKEEEKKLSLQLQAVLPPRTSLKDAC
Ga0137383_1135614513300012199Vadose Zone SoilMKNKWIVVGVAGVSILFGSAGAAARTYNPLKWIKKGPSPTASEQLAANKEQERKLSLELQAVLPARTSLKDACAGFKTLNDC
Ga0137399_1009451623300012203Vadose Zone SoilMKSRWIVVGVVSVLLGASGAAAGSYNPLKWIKKSPSPTASEQLAANKEEEKKLSLQLQAVLPPRTTLRMRARDLRT*
Ga0137399_1021256613300012203Vadose Zone SoilMKNRWIIVGVTGVSVLLGAAGPAASSYNPLKWIKKGPSPTATEQLAANKEEEKKLSLQLQAVLPPRTSLRDACAG
Ga0137399_1042398923300012203Vadose Zone SoilMKNRWIIVGLAGVSILFGSAAAAASSYNPLKWIKKGPSPTASEQLAANKEEERKLSLQLQAVLPPRTSLKDACA
Ga0137362_1087262513300012205Vadose Zone SoilMKNRWIVVGVAGVSVLLGGATAAAHSYNPIKWIRKGPSPTASEQLAANKEQEKKLSQQLQAVLPPRTSLKDACAGFRSLNDCVASLH
Ga0137381_1069555613300012207Vadose Zone SoilMKNKWIVIGVVGISVLLGAAGAAARSYNPIKWIKTGPGPTASEQLEANREEEKKLTLQLQALLPPRTTLKDACTAFKGLDDCVAALHV
Ga0137379_1082419213300012209Vadose Zone SoilMKNRWFIIGAAGISILFGAAGAAGRSYNPLKWIKRSPGMTANEQLAANKEQERKLTLQLQALLPPKTTLREACTGFSNLEDCVAALHTSRNLKIKYNCLKW
Ga0137377_1155996123300012211Vadose Zone SoilMKNKWIIAGVSALLFAAGAAARSYNPMKWIKKGPSPTATEQLAAHSEEDKKLTLQLQALL
Ga0137370_1027473013300012285Vadose Zone SoilMKYKWLIIGAAGISILLGTAGAAARSYNPLKWIKGSPRLTANEQLAANKEEERKLTLQLQALLPPKTTLREACSGFLS
Ga0137386_1026791423300012351Vadose Zone SoilMKNKWLIIGAAGISILLGAAGTAARSYNPLKWLKGSPRLTANEQLAANKEEERKLTLQLQALLPPKTTLREACSGFTSLEDCVAALHVSRNLKIKYN
Ga0137384_1033208613300012357Vadose Zone SoilMKNKWIIAGVSVLLCAAGAAARSYNPMKWIKKGPSPTATEQLAAHGEEDKKLTLQLQALLPPRTTLKN
Ga0137384_1061794413300012357Vadose Zone SoilMKNKWIIAGVSVLLCAAGAAARSYNPIKWIKKGPSPTATEQLAAHGEEDKKLTLQLQALLPPRTTLKN
Ga0137384_1071733913300012357Vadose Zone SoilMKSRWIIIGVAGVSILLGGAAAAAHSYNPIKWIKKGPSPTASEQLAANKEEEKKLSQQLQAVLPPRTSLKDACAGFKDLNDCVAALYISHN
Ga0137361_1092138613300012362Vadose Zone SoilMKNRWIIVGVAGVSVLLGGAATAAHSYNPIKWIKKGPSPTASEQLASNNQEEKKLSLQLQAVLPPRTSLKDACAG
Ga0137390_1028197013300012363Vadose Zone SoilMKNRWIIVGVTGVSVLLGAAGPAASSYNPLKWIKKGPSPTASEQLAANKEEEKKLSLQLQAVLPPRTSLRDACAG
Ga0137390_1097510423300012363Vadose Zone SoilMKNKWVIIGVAGVSVLLGAAGAAASSYNPLKWIKKGPSPTASEQLAANKEEEKKLSLQLQAVLPPRTSLRDACAGFKSLNDCVASLHVSH
Ga0137398_1053527513300012683Vadose Zone SoilMKNKWVIIGVAGVSVLLGAAGAAASSYNPLKWIKKGPSPTASEQLAANKEEEKKLSLQLQAVLPPRTSLRDACAGFKSLNDCVASLHV
Ga0137395_1013018733300012917Vadose Zone SoilMKNRWIIVGVAGVSVLFGAVGAAASSYNPLKWIKKGPSPTASEQLAANKEEEKKLSLQLQAVLPSR
Ga0137395_1088206013300012917Vadose Zone SoilMKNRWILVSVVGVGVFFGAAGAAATSYNPIKWIKKGPSPTASEQLAANKEQEKRLSLQLQAVLPPRTSLKDACAGFKNLSDCVAALH
Ga0137396_1033990723300012918Vadose Zone SoilMKNRWIVVGVAGVSVLLGGATAAAHSYNPIKWIRKGPSPTASEQLAANKEQEKKLSQQLQAVLPPRTSLKDACAGFRSLNDC
Ga0137419_1001802513300012925Vadose Zone SoilMKNKWVIIGVAGVSVLLGAAGAAASSYNPLKWIKKGPSPTASEQLAANKEEEKKLSLQLQAVLPPRTSLRDACAGFK
Ga0137407_1222239313300012930Vadose Zone SoilMKNKWIVVGVAGVSILFGSAGAAARTYNPLKWIKKGPTPTASEQLAANKEEERKLSLQLQAVLPPRTSLRNACAGFKGLNECVAALHVSHN
Ga0126369_1009171733300012971Tropical Forest SoilMKNKWLFIGVAAISMLLGAAGAAGRSYNPLKWIKKGSGPSANEELAAHKEEERKLTLQLQALLPPKTTLREACSGFTAWRIA*
Ga0134076_1048126023300012976Grasslands SoilMKNKWIIAGVSALLCAAGAAARSYNPMKWIKKGPSPTATEQLAAHSEQEKKLTLQLQALLPARTALKNACSAFKDLE
Ga0134079_1062328913300014166Grasslands SoilMKTKWLVMGAAGIGILLGASGAAAQSYNPLKWIKKNPGPTANEQLAANQEEERKLSLQLQALLPPKTTLREACNGLLSLEDCVAALHVS
Ga0137420_132666513300015054Vadose Zone SoilMKNRWILVSVVGVGVFFGAAGAAATSYNPIKWIKKGPSPTASEQLAANKEQEKKLSLQLQAVLPPRTSLKDACAGLCLRRF*
Ga0137420_138458123300015054Vadose Zone SoilVGVFFGAAGAAATSYNPIKWIKKGPSPTASEQLAANKEQEKKLSLQLQAVLPPRNQP*
Ga0137418_1050655723300015241Vadose Zone SoilMKNKWIVVGVAGVSILFGSAGAAARTYNPLKWIKKGPTPTASEQLAANKEEERKLSLQLQAVLPPRTSLRNACAGFKGLNECVAALHVS
Ga0134085_1028623923300015359Grasslands SoilMKNKWIIAGVSVLLCAAGAAARSYNPMKWIKKGPSPTATEQLAAHSEEDKKLTLQLQALLPPRTTLKNACSAFKDLEGCVAALHVSHNLQIKFNC
Ga0182039_1174469513300016422SoilMKNKWLVIAAAGISLLVGASGAAARSYNPLKWIKKNPGLTANEQLAANKEEERKLTLQLQALLPPKTTLREACTGFL
Ga0066655_1103361213300018431Grasslands SoilMKNRWIVVGVAGVSVLLGGATAAAHSYNPIKWIRKGPSPTASEQLAGNKEQEKKLSLQLQAVLPPRTSLKDACAGFRTLSDCVASLHVSRNLSIK
Ga0066669_1208475013300018482Grasslands SoilMKNRWIVVGVAGVSVLLGGATAAAHSYNPIKWIRKGPSPTASEQLAGNKEQEKKLSLQLQAVLPPRTSLKDACA
Ga0179592_1028479823300020199Vadose Zone SoilMKNKWIIAAVVAAASVLVGGAACAARSYNPIKWIKKSPGPTASEQLAANKEEEKKLSLQLQALL
Ga0210403_1002274843300020580SoilMKNRNRWIIVGVVGASVLLCGVGTAAHSYNPIKWIKKGPSPTASEQLVANKEEERKLSLQLQAVLPPRTSLKDACAGFKNLNDCVAVLH
Ga0210401_1058272713300020583SoilMKIKWIIVGVAAVCALLAGTGTAAITYNPIKWIKKTPGPTASEQLAANKEEEKKLSVQLQAVLPPRTSLKDACAGFKSLNDCVASLYVSHNLQLKFNCLKWDVTGAKPAGDVK
Ga0210400_1122655823300021170SoilVAYSRHSHARPSKEASMKNKWIIVGVATVSVLLGGAGTAAHSYNPIKWIKRGPSPTASEQLAANKQEEKKLSLQLQALLPPRTSLK
Ga0210396_1119876013300021180SoilMKNRWIIVGIAAVSVLLGAAGTAAHSYNPIKWIKKGPSPTASEQLAANKEEEKKLSLQLQAVLPPR
Ga0210393_1105083913300021401SoilVCVLLAATGTAAIAYNPIKWIKKAPGPTASEQLAANKEEEKKLSVQLQAVLPPRT
Ga0210390_1069416623300021474SoilMKSKWIVIGVAAVCVLLAATGTAAITYNPIKWIKKAPGPTASEQLAANKEEEKKLSVQLQAVLPPRT
Ga0210410_1049225613300021479SoilMKSKWITIGVAAVCVLLGGSVTAAITYNPIKWIKKAPGPTASEQLAANKEEEKKLSVQLQAVLPPRTKID
Ga0126371_1260950313300021560Tropical Forest SoilMKSKWLFMGAAGISILLGASGAAARSYNPLKWIKKNPGPTANEQLAANHEEERKLSLQLQALLPAKTTLREACSGFLSLEDCVAALHVSRNVKIK
Ga0126371_1300876823300021560Tropical Forest SoilMKNKCVMIGAAGITILLGAAGAAAHSYNPLKWIKKGPKPTANEQLAANKDEERKLSLQLQALLPPTTTLREACSGFTALEDCVAALHVSPNQ
Ga0247672_107956113300024187SoilMKNKWIIVGVAAGCVLVGGAGTAAHTYNPIKWIKRGPTPTASQQLAANKEQDKKLSVQLQAVLP
Ga0207646_1194849813300025922Corn, Switchgrass And Miscanthus RhizosphereMKNKWLFVGAAGVSILLGATGAAARSYNPLKWIKGSPRLTANEQLAANKEEERKLTLQLQALLPPKTTLRDACSGFLSLEDCVAALHVSRNLKMKYNCLKWDMTAARPSGDVKSCEAPPRDKPL
Ga0209237_117661023300026297Grasslands SoilLLGAAGAAASSYNPLKWIKKGPSPTASEQLAANKEEEKKLSLQLQAVLPPRTSLKDACAGFKNLNDCVASLYVSHN
Ga0209471_122425813300026318SoilMKTKWLVMGAAGIGILLGASGAAAQSYNPLKWIKKNPGPTANEQLAANQEEERKLSLQLQALLPPKTTLREACNGLLSLEDCVAALHVSRNL
Ga0209472_110790613300026323SoilMKNKWFFVGVAGVSILLGATGADARSYNPLKWIKRNPGPTANEQLAANKEEERKLTLQLQALLPPKITLRDACSGFLSLEDCVAALHVSRNLKLKYNCLKWDITAARPAG
Ga0209473_118052613300026330SoilMKTKWLVMGAAGIGILLGASGAAAQSYNPLKWIKKNPGPTANEQLAANQEEERKLSLQLQALLPPKTTLREACNGLLSLEDCVAALHVSRNLKIKYNCLKWDMTAARPNG
Ga0209377_112223033300026334SoilMKNRWIVVGVAGVSVLLGGATAAAHSYNPIKWIRKGPSPTASEQLAANKEQEKKLSLQLQAVLPPRTSLKDACAGFRTLSDCVASLHVSRNLSIKFNCLKWDVTGAKPAGDVE
Ga0209160_118812813300026532SoilMKTKWLVMGAAGIGILLGASGAAAQSYNPLKWIKKNPGPTANEQLAANQEEERKLSLQLQALLPPKTTLREACNGLLSLEDCVAALHVSRNLKIKYNCLKWDMTAATQSSRLSKPLHASLRVVF
Ga0209805_115278513300026542SoilMKTKWLVMGAAGIGILLGASGAAAQSYNPLKWIKKNPGPTANEQLAANQEEERKLSLQLQALLPPKTTLRE
Ga0209805_132318813300026542SoilMKNRWIVVGVAGVSVLLGGATAAAHSYNPIKWIRKGPSPTASEQLAANKEQEKKLSLQLQAVLPPRTSMKDACAGFRTLSDCVASLHVSRNLSIKFNCL
Ga0209474_1006107843300026550SoilLLGGAGTLAHSYNPIKWIRKGPSPTASEQLAANKEEEKKLSLELQAVLPPRTSLKDACAGFKSLNDCVASL
Ga0209474_1065688213300026550SoilMKNRWIVVGVAGVSVLLGGATAAAHSYNPIKWIRKGPSPTASEQLAANKEQEKKLSLQLQAVLPPRTSMK
Ga0209648_1035696133300026551Grasslands SoilMKNKWIIVGVAAVCVLLAGTGTAAITYNPIKWIKKGPSPTASEQLAANKEEEKKLSVQLQAVL
Ga0179587_1009724933300026557Vadose Zone SoilMKNRWIFIGVAGVSVLLGAAGAAASSYNPLKWIKKGPSPTASEQLTANQQEEKKLSLQLQALL
Ga0179587_1089338113300026557Vadose Zone SoilMKNKWIVVGVAGVSILFGSAGAAARTYNPLKWIKKGPTPTASEPLAANKEEERKLALQLQAVLPPRTSLRNACAGFKGLN
Ga0209729_102966123300027061Forest SoilMKSKRLFMGAAGISILLGASGAAARSYNPLKWIKKNPGPTANEQLAANHEEERKLSLQLQALLPPKTTLREACSGFLSLEDCVAALHVSRNIKIKYNCLKWDLTAARPNGD
Ga0209735_107267813300027562Forest SoilMKSKWIIIGVAAVCVLLGGSGTAAISYNPIKWIKKAPGPTASEQLAANKEEEKKLSVQLHAVLPPRTSVKDACAGFKSLN
Ga0209073_1006126323300027765Agricultural SoilMKNRWFVVGAVGVSILLGAAGAAARSYNPLKWIKKNPGPTANEQLAANKEEERKLTLQLQALLPPKTTLRDACSGF
Ga0209177_1012631013300027775Agricultural SoilMRNKWFVITAASASILLGTAGAAARSYNPLKWIKKNPGPTANEQLAAHPEEERKLTLQLQALLPPKTTLRDACSGFVTLEDCVAALHVSRNLKLKYNCLKWDMTAARPSGDVKSCEAPPRDKALP
Ga0209180_1035711433300027846Vadose Zone SoilMKNKWIIVAVAALSVLLGGAATAAHSYNPIKWIKKAPSPTASEQLAANKDQEKKLSLQLQAV
Ga0209180_1072661213300027846Vadose Zone SoilMKNRWILIGVAGVGMFFGAAGAAATSYNPIRWIKKSPSPTASDQLAANKEQEKKLSLQLQAALPPRTSLRDACAGFKTLNDCVAALHVSH
Ga0209465_1057351713300027874Tropical Forest SoilMTKKSFIIGAAAVSMLLCAAGAAARSYNPLKWLKGSPRQTANEQLAANKEEERKLTLQLQALLPPKTTLREACSGFLSLEDCVAALHVSRNLKMKYNCLKWDLTAARPSGDVKTCEAPPRDKPLSLD
Ga0209283_1029383023300027875Vadose Zone SoilMKNKWIIVGVVAVSVLLGGALAAAHSYNPIKWIKKGPSPTASEQLAANKEEEKKLSLQLQALLPPRTSLKDACAGFKNLND
Ga0209590_1017534823300027882Vadose Zone SoilMKNRWIIVGVAGVSVLLGGAGTAAHSYNPIKWIKKGPGPTASEQLAANKQEEKKLSLQLQAVLPPRTSLRDACAGFKNLNDCV
Ga0209275_1011834813300027884SoilMKSKWIVIGVAGVCVLLAATGTAAITYNPIKWIKKAPSPTASEQLAANKEEEKKLSVQLQAVLPPRTKINDACAGFKSLNDCVASLHASHN
Ga0209380_1031659213300027889SoilMKTKWIIVGVAGACVLLGGTGTAAITYNPIRWIKKSPGPTASEQLAANKEEEKKLSVQLQAVLPPRTSVKDACAGFKSLNDCVASLHASHNLQIKFNCLKWNVTGAKPAG
Ga0137415_1032238633300028536Vadose Zone SoilMKNKWVIIGVAGVSVLLGAAGAAASSYNPLKWIKKGPSPTASEQLAANKEEEKKLSLQLQAVLPPRTSLRDACAGFKSLNDCV
Ga0318574_1026484123300031680SoilMKSKWLFMGAAGISILLGASGAAARSYNPLKWIKKNPGPTANEQLAANHEEERKLSLQLQALLPAKTT
Ga0318560_1078180713300031682SoilMKSKWLFMGAAGISILLGASGAAARSYNPLKWIKKNPGPTANEQLAANHEEERKLSLQLQALLPAKTTLREACSGFLSLEDCVAALHASRNAKIKY
Ga0307474_1050805213300031718Hardwood Forest SoilMKNRWIIIGVAGASVLLGGAGIAARSYNPIKWIKKGPSPTASEQLAANKEEEKKLSLELQAVLPTHTSLKDACAEF
Ga0318494_1013415413300031751SoilMKSKWLFMGAEGISILLGASGAAARSYNPLKWIKKNPGPTANEQLAANHEEERKLSLQLQALLPAKTTLREACSGFLSLEDCVAALHASRNAKIKYNCLKWDMTAARPSGDVKTCEAPPSDKAL
Ga0307475_1030650413300031754Hardwood Forest SoilMKTRWIVVGVVSVLLGASGAAASSYNPLKWIKKSPSPTASEQLAANKEEEKKLSLQLQAVLPPRTSLKDACAGFKNLNDCVASLH
Ga0307475_1058226513300031754Hardwood Forest SoilMKSKWIIIGVAAVCVLLGGSRTAAITYNPIKWIKKAPGPTASEQLAANKEEEKKLSVQLQAV
Ga0307475_1073614123300031754Hardwood Forest SoilMKNRWIVVGVAGVSILLGGATAAAHSYNPIKWIRKGPSPTASEQLAANKEQEKKLSQQLQAVLPPRTSLKDACAGF
Ga0318537_1027227313300031763SoilMKSKWLFMGAAGISILLGASGAAARSYNPLKWIKKNPGPTANEQLAANHEEERKLSLQLQALLPAKTTLREACSGFLSLEDCVAALHASRNAKIKYNCLKWDMTAARPSGDVKTCEAPPRDK
Ga0318552_1021016223300031782SoilMKSKWLFMGAAGISILLGASGAAARSYNPLKWIKKNPGPTANEQLAANHEEERKLSLQLQALLPAKTTLREAVAS
Ga0318576_1011690823300031796SoilMKSKWLFTGAAGISILLGASGAAARSYNPLKWIKKNPGPTANEQLAANHEEERKLSLQLQALLPAKTTLREACSGFLSLEDCVAALHASRNAKIKYNCLKWDMTAARPSGDVKTCEAPP
Ga0307473_1009188823300031820Hardwood Forest SoilMKTKWLVMGAAGIGILLGAGGAAAQSYNPLKWIKKNPGPTANEQLAANQEEERKLSLQLQALLPPKTTLREACSGLLSLEDCVAALHVSRNLKIKYNCLKW
Ga0307478_1130477513300031823Hardwood Forest SoilMKNKWIIVAVAAASVLLGGAACAARSYNPIKWIKKSPGPTASEQLAANKEEEKKLSLQLQALLPPRTSLK
Ga0318511_1023153713300031845SoilMKSKWLFMGAAGISILLGASGAAARSYNPLKWIKKNPGPTANEQLAANHEEERKLSLQLQALLPAKTTLREACSGFLSLEDCLAALHASRNAK
Ga0318512_1005318523300031846SoilMKSKWLFMGAAGISILLGASGAAARSYNPLKWIKKNPGPTANEQLAANHEEERKLSLQLQALLPAKTTLREACSGFLNLEDCVAALH
Ga0306919_1128545313300031879SoilMKNKWLVIAAAGISLLVGASGAAARSYNPLKWIKKNPGLTANEQLAANKEEERKLTLQLQALLPPKTTLREACTGFLSLEDCVAALHVSRNLKIKYNCLK
Ga0306921_1039726813300031912SoilMKSKWLFMGAAGISILLGASGAAARSYNPLKWIKKNPGPTANEQLAANHEEERKLSLQLQALLPAKTTLREACSGFLSLEDC
Ga0310913_1000457013300031945SoilMGAAGISILLGASGAAARSYNPLKWIKKNPGPTANEQLAANHEEERKLSLQLQALLPAKTTLREACSGFLSLEDCVAALHASRNAKIKYNCLKWDMTAARPSGDVKTCEAPPRDK
Ga0310913_1005273833300031945SoilMKNKWLVIAAAGISLLVGASGAAARSYNPLKWIKKNPGLTANEQLAANKEEERKLTLQLQALLPPKTTLREACTGFLSLEDCVAALHVSRNLKIKYNCLKWHITAARP
Ga0310910_1017829323300031946SoilMKSKWLFMGAAGISILLGASGAAARSYNPLKWIKKNPGPTANEQLAANHEEERKLSLQLQALLPAKTTLREACSGFLSLEDCVAALHASRNAKIKYNCLKWDMTAARPSGDVKTCEAPPSDKAL
Ga0307479_1102816123300031962Hardwood Forest SoilMKKRWIIAGIVGVGVLAGAARAAEHSYNPIKWIKKGPSPTASQQLAANSGMEKKLNIQLQAVLPPRI
Ga0318563_1061771413300032009SoilMKSKWLFMGAAGISILLGASGAAARSYNPLKWIKKNPGPTANEQLAANHEEERKLSLQLQALLPAKTTLREACSGFLSLEDCVAALHASRNAKIKYNCLKWDMTAARPSGDVKTCEAPP
Ga0318575_1070143913300032055SoilMKSKWLFMGAAGISILLGASGAAARSYNPLKWIKKNPGPTANEQLAANHEEERKLSLQLQALLPAKTTLREACSGFLSLEDCLAALHASRNAKIKYN
Ga0306924_1243453913300032076SoilMRNKWFVAGAVGISILLGAAGAAARSYNPLKWIKKNPGPTANEELAANKEEERKLTLQLQALLPPKTTLREACSGFLSLEDCVAALHVSRNLRMKYNCLKWDMTAARPSG
Ga0307471_10084882833300032180Hardwood Forest SoilVLLGAAGAAASSYNPLKWIKKSPSPTASEQLAANKEQDRKLSLQLQAVLPARTSLKDACAGFKSLND
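
The rows of the Family Sequences table above have their four columns (Protein ID, Sample Taxon ID, Habitat, Sequence) run together without separators. The following is a minimal sketch of one way to split such a row, not an official NMPFamsDB parser. It rests on assumptions read off the listing itself: every Sample Taxon ID is a 10-digit number starting with `3300`, every habitat label starts with a capital letter and ends with a lowercase letter, and every sequence consists only of uppercase residue letters with an optional `*`. The names `FamilyRow`, `ROW_RE`, and `parse_row` are hypothetical.

```python
import re
from typing import NamedTuple, Optional


class FamilyRow(NamedTuple):
    protein_id: str
    taxon_id: str
    habitat: str
    sequence: str


# Assumptions (inferred from the listing, not documented by the database):
#   - taxon IDs are 10 digits and begin with "3300"
#   - habitat labels contain only letters, commas and spaces,
#     start with an uppercase letter and end with a lowercase letter
#   - sequences are uppercase letters, optionally containing "*"
ROW_RE = re.compile(
    r"^(?P<protein_id>\S+?)"
    r"(?P<taxon_id>3300\d{6})"
    r"(?P<habitat>[A-Z][A-Za-z, ]*[a-z])"
    r"(?P<sequence>[A-Z*]+)$"
)


def parse_row(line: str) -> Optional[FamilyRow]:
    """Split one fused table row into its four columns, or return None."""
    match = ROW_RE.match(line.strip())
    if match is None:
        return None
    return FamilyRow(**match.groupdict())


if __name__ == "__main__":
    example = (
        "Ga0137420_138458123300015054Vadose Zone Soil"
        "VGVFFGAAGAAATSYNPIKWIKKGPSPTASEQLAANKEQEKKLSLQLQAVLPPRNQP*"
    )
    print(parse_row(example))
```

Lines that do not fit the pattern, such as the column header, return `None` rather than a guessed split, so rows that violate the assumptions can be inspected by hand.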




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice