NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F040046

Overview

Basic Information
Family ID: F040046
Family Type: Metagenome
Number of Sequences: 162
Average Sequence Length: 111 residues
Representative Sequence: VGYVVEGVAYVSGTVLIGAGLYLIMRGTFPAWWQRRLLWPLVRVTPAVAHLQGWAAVGLGISILAIVFTSVAPDGIAGILVVAAMAAYLVGLVLFLFSTWLSRRPAS
Number of Associated Samples: 117
Number of Associated Scaffolds: 162

Quality Assessment
Transcriptomic Evidence: No
Most common taxonomic group: Bacteria
% of genes with valid RBS motifs: 48.97 %
% of genes near scaffold ends (potentially truncated): 24.69 %
% of genes from short scaffolds (< 2000 bps): 47.53 %
Associated GOLD sequencing projects: 97
AlphaFold2 3D model prediction: Yes
3D model pTM-score: 0.79

Most Common Taxonomy
Group: Bacteria (89.506 % of family members)
NCBI Taxonomy ID: 2
Taxonomy: All Organisms → cellular organisms → Bacteria

Most Common Ecosystem
GOLD Ecosystem: Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil (30.247 % of family members)
Environment Ontology (ENVO): Unclassified (43.210 % of family members)
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Soil (non-saline) (54.938 % of family members)
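
Each share above is reported as a percentage of family members. A minimal Python sketch, assuming the denominator is the 162 sequences listed under Basic Information (the page does not state this explicitly), recovers approximate member counts. The helper name `members_from_percent` is hypothetical and is not part of NMPFamsDB.

```python
# Assumed denominator: "Number of Sequences" from Basic Information.
FAMILY_SIZE = 162


def members_from_percent(percent: float, family_size: int = FAMILY_SIZE) -> int:
    """Convert a '% of family members' value to the nearest whole member count."""
    if not 0.0 <= percent <= 100.0:
        raise ValueError(f"percent out of range: {percent}")
    return round(percent / 100.0 * family_size)


# Percentages copied from the Most Common Taxonomy / Ecosystem entries.
shares = {
    "Bacteria (NCBI taxonomy)": 89.506,
    "Grasslands soil (GOLD)": 30.247,
    "Unclassified (ENVO)": 43.210,
    "Soil, non-saline (EMPO)": 54.938,
}

for label, pct in shares.items():
    count = members_from_percent(pct)
    print(f"{label}: about {count} of {FAMILY_SIZE} members")
```

Rounding to the nearest integer is a design choice here: the page reports three decimal places, so the reconstructed counts are only as reliable as the assumed denominator.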



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Transmembrane (alpha-helical)
Signal Peptide: No
Secondary Structure distribution: α-helix 65.93%, β-sheet 0.00%, Coil/Unstructured 34.07%

Predicted 3D Structure

pTM-score: 0.79

Structural matches with SCOPe domains

SCOP family | SCOP domain | Representative PDB | TM-score
f.13.1.0: automated matches | d5c1ma_ | 5c1m | 0.69032
f.54.1.2: Amino acid antiporter-like | d3giaa_ | 3gia | 0.6793
a.24.3.1: Cytochrome b562 | d4l6ra2 | 4l6r | 0.67902
a.238.1.1: BAR domain | d2q13a1 | 2q13 | 0.63478
f.13.1.0: automated matches | d6tk6a_ | 6tk6 | 0.6333


Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 162 Family Scaffolds
PF02811 | PHP | 32.72
PF02637 | GatB_Yqey | 16.05
PF02934 | GatB_N | 14.20
PF01425 | Amidase | 4.32
PF04191 | PEMT | 3.70
PF05958 | tRNA_U5-meth_tr | 3.70
PF02686 | Glu-tRNAGln | 2.47
PF02475 | Met_10 | 1.85
PF00155 | Aminotran_1_2 | 1.23
PF00121 | TIM | 0.62
PF01553 | Acyltransferase | 0.62
PF02081 | TrpBP | 0.62
PF13424 | TPR_12 | 0.62
PF12826 | HHH_2 | 0.62
PF13361 | UvrD_C | 0.62
PF00528 | BPD_transp_1 | 0.62
PF00211 | Guanylate_cyc | 0.62
PF14579 | HHH_6 | 0.62
PF02909 | TetR_C_1 | 0.62
PF00196 | GerE | 0.62
PF14520 | HHH_5 | 0.62

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 162 Family Scaffolds
COG0064 | Asp-tRNAAsn/Glu-tRNAGln amidotransferase B subunit | Translation, ribosomal structure and biogenesis [J] | 30.25
COG2511 | Archaeal Glu-tRNAGln amidotransferase subunit E, contains GAD domain | Translation, ribosomal structure and biogenesis [J] | 30.25
COG1610 | Uncharacterized conserved protein YqeY, may have tRNA amino acid amidase activity | General function prediction only [R] | 16.05
COG2265 | tRNA/tmRNA/rRNA uracil-C5-methylase, TrmA/RlmC/RlmD family | Translation, ribosomal structure and biogenesis [J] | 5.56
COG0154 | Asp-tRNAAsn/Glu-tRNAGln amidotransferase A subunit or related amidase | Translation, ribosomal structure and biogenesis [J] | 4.32
COG0721 | Asp-tRNAAsn/Glu-tRNAGln amidotransferase C subunit | Translation, ribosomal structure and biogenesis [J] | 2.47
COG1092 | 23S rRNA G2069 N7-methylase RlmK or C1962 C5-methylase RlmI | Translation, ribosomal structure and biogenesis [J] | 1.85
COG2264 | Ribosomal protein L11 methylase PrmA | Translation, ribosomal structure and biogenesis [J] | 1.85
COG2520 | tRNA G37 N-methylase Trm5 | Translation, ribosomal structure and biogenesis [J] | 1.85
COG0149 | Triosephosphate isomerase | Carbohydrate transport and metabolism [G] | 0.62
COG1309 | DNA-binding protein, AcrR family, includes nucleoid occlusion protein SlmA | Transcription [K] | 0.62
COG2114 | Adenylate cyclase, class 3 | Signal transduction mechanisms [T] | 0.62


Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 89.51 %
Unclassified | root | N/A | 10.49 %


Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300001086|JGI12709J13192_1002117 | All Organisms → cellular organisms → Bacteria | 2711
3300002560|JGI25383J37093_10076368 | All Organisms → cellular organisms → Bacteria | 1026
3300002853|draft_1000296 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 5650
3300002908|JGI25382J43887_10036908 | All Organisms → cellular organisms → Bacteria | 2652
3300002911|JGI25390J43892_10043538 | All Organisms → cellular organisms → Bacteria | 1070
3300002914|JGI25617J43924_10006293 | All Organisms → cellular organisms → Bacteria | 3670
3300005166|Ga0066674_10326478 | All Organisms → cellular organisms → Bacteria | 721
3300005167|Ga0066672_10034653 | All Organisms → cellular organisms → Bacteria | 2793
3300005167|Ga0066672_10271684 | All Organisms → cellular organisms → Bacteria | 1096
3300005175|Ga0066673_10768336 | All Organisms → cellular organisms → Bacteria | 552
3300005176|Ga0066679_10098294 | All Organisms → cellular organisms → Bacteria | 1775
3300005176|Ga0066679_10152585 | All Organisms → cellular organisms → Bacteria | 1447
3300005177|Ga0066690_10314850 | All Organisms → cellular organisms → Bacteria | 1058
3300005179|Ga0066684_10296691 | All Organisms → cellular organisms → Bacteria | 1072
3300005181|Ga0066678_10135772 | All Organisms → cellular organisms → Bacteria | 1525
3300005187|Ga0066675_10304030 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Actinomycetales | 1155
3300005440|Ga0070705_100009492 | All Organisms → cellular organisms → Bacteria | 4840
3300005445|Ga0070708_100000126 | All Organisms → cellular organisms → Bacteria | 52000
3300005447|Ga0066689_10208990 | All Organisms → cellular organisms → Bacteria | 1187
3300005454|Ga0066687_10352168 | All Organisms → cellular organisms → Bacteria | 844
3300005468|Ga0070707_100004424 | All Organisms → cellular organisms → Bacteria | 13161
3300005534|Ga0070735_10129487 | All Organisms → cellular organisms → Bacteria | 1573
3300005536|Ga0070697_100061906 | All Organisms → cellular organisms → Bacteria | 3052
3300005536|Ga0070697_100324676 | All Organisms → cellular organisms → Bacteria | 1326
3300005542|Ga0070732_10371227 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 862
3300005552|Ga0066701_10016574 | All Organisms → cellular organisms → Bacteria | 3549
3300005554|Ga0066661_10023069 | All Organisms → cellular organisms → Bacteria | 3340
3300005555|Ga0066692_10575630 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 709
3300005556|Ga0066707_10708913 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 630
3300005557|Ga0066704_10141358 | All Organisms → cellular organisms → Bacteria | 1609
3300005557|Ga0066704_10252401 | All Organisms → cellular organisms → Bacteria | 1197
3300005558|Ga0066698_11080025 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Chromatiales → Chromatiaceae → Nitrosococcus → Nitrosococcus halophilus | 508
3300005561|Ga0066699_10056091 | All Organisms → cellular organisms → Bacteria | 2444
3300005561|Ga0066699_10249630 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1252
3300005568|Ga0066703_10094885 | All Organisms → cellular organisms → Bacteria | 1746
3300005568|Ga0066703_10294982 | All Organisms → cellular organisms → Bacteria | 981
3300005569|Ga0066705_10013608 | All Organisms → cellular organisms → Bacteria | 3979
3300005575|Ga0066702_10022249 | All Organisms → cellular organisms → Bacteria | 3116
3300005576|Ga0066708_10016163 | All Organisms → cellular organisms → Bacteria | 3697
3300005576|Ga0066708_10329849 | All Organisms → cellular organisms → Bacteria | 979
3300005576|Ga0066708_11072327 | All Organisms → cellular organisms → Bacteria | 500
3300005586|Ga0066691_10001065 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 10254
3300005586|Ga0066691_10691511 | All Organisms → cellular organisms → Bacteria | 603
3300005764|Ga0066903_100302534 | All Organisms → cellular organisms → Bacteria | 2532
3300005764|Ga0066903_108371127 | All Organisms → cellular organisms → Bacteria | 528
3300006032|Ga0066696_10120479 | All Organisms → cellular organisms → Bacteria | 1604
3300006032|Ga0066696_10254875 | All Organisms → cellular organisms → Bacteria | 1134
3300006755|Ga0079222_10514588 | All Organisms → cellular organisms → Bacteria | 882
3300006791|Ga0066653_10377880 | All Organisms → cellular organisms → Bacteria | 723
3300006796|Ga0066665_10044769 | All Organisms → cellular organisms → Bacteria | 3010
3300006797|Ga0066659_10755195 | All Organisms → cellular organisms → Bacteria | 800
3300006806|Ga0079220_10574384 | All Organisms → cellular organisms → Bacteria | 793
3300006904|Ga0075424_100131903 | All Organisms → cellular organisms → Bacteria | 2644
3300006954|Ga0079219_10216136 | All Organisms → cellular organisms → Bacteria | 1104
3300007076|Ga0075435_100027875 | All Organisms → cellular organisms → Bacteria | 4423
3300007255|Ga0099791_10440941 | All Organisms → cellular organisms → Bacteria | 629
3300007265|Ga0099794_10556879 | All Organisms → cellular organisms → Bacteria | 605
3300007982|Ga0102924_1004753 | All Organisms → cellular organisms → Bacteria | 13508
3300009088|Ga0099830_11116895 | All Organisms → cellular organisms → Bacteria | 654
3300009089|Ga0099828_10006748 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 8302
3300009089|Ga0099828_10009315 | All Organisms → cellular organisms → Bacteria | 7238
3300009090|Ga0099827_10020230 | All Organisms → cellular organisms → Bacteria | 4605
3300010361|Ga0126378_10301977 | All Organisms → cellular organisms → Bacteria | 1704
3300011269|Ga0137392_10240495 | All Organisms → cellular organisms → Bacteria | 1488
3300012189|Ga0137388_10138009 | All Organisms → cellular organisms → Bacteria | 2144
3300012189|Ga0137388_10791103 | All Organisms → cellular organisms → Bacteria | 879
3300012201|Ga0137365_10009283 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 7806
3300012202|Ga0137363_11721755 | All Organisms → cellular organisms → Bacteria | 519
3300012203|Ga0137399_10494624 | All Organisms → cellular organisms → Bacteria | 1025
3300012203|Ga0137399_10761444 | All Organisms → cellular organisms → Bacteria | 815
3300012203|Ga0137399_11259991 | All Organisms → cellular organisms → Bacteria | 622
3300012206|Ga0137380_10122644 | All Organisms → cellular organisms → Bacteria | 2374
3300012206|Ga0137380_10162709 | All Organisms → cellular organisms → Bacteria | 2034
3300012207|Ga0137381_10017227 | All Organisms → cellular organisms → Bacteria | 5658
3300012207|Ga0137381_10680543 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 895
3300012208|Ga0137376_10113934 | All Organisms → cellular organisms → Bacteria | 2303
3300012209|Ga0137379_10000172 | All Organisms → cellular organisms → Bacteria | 53645
3300012209|Ga0137379_10001181 | All Organisms → cellular organisms → Bacteria | 23759
3300012210|Ga0137378_10421698 | All Organisms → cellular organisms → Bacteria | 1237
3300012211|Ga0137377_11534652 | All Organisms → cellular organisms → Bacteria | 591
3300012918|Ga0137396_10018062 | All Organisms → cellular organisms → Bacteria | 4474
3300012927|Ga0137416_10100011 | All Organisms → cellular organisms → Bacteria | 2179
3300012927|Ga0137416_10218996 | All Organisms → cellular organisms → Bacteria | 1538
3300012944|Ga0137410_10402513 | All Organisms → cellular organisms → Bacteria | 1103
3300012971|Ga0126369_11984393 | All Organisms → cellular organisms → Bacteria | 670
3300012975|Ga0134110_10007800 | All Organisms → cellular organisms → Bacteria | 4013
3300012977|Ga0134087_10043698 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1744
3300018431|Ga0066655_10048263 | All Organisms → cellular organisms → Bacteria | 2181
3300018433|Ga0066667_10105754 | All Organisms → cellular organisms → Bacteria | 1885
3300018433|Ga0066667_10404579 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1104
3300018433|Ga0066667_11624403 | All Organisms → cellular organisms → Bacteria | 577
3300018482|Ga0066669_10071082 | All Organisms → cellular organisms → Bacteria | 2299
3300022557|Ga0212123_10002493 | All Organisms → cellular organisms → Bacteria | 43717
3300025910|Ga0207684_10000051 | All Organisms → cellular organisms → Bacteria | 230737
3300025910|Ga0207684_10000060 | All Organisms → cellular organisms → Bacteria | 205816
3300025922|Ga0207646_10009288 | All Organisms → cellular organisms → Bacteria | 9729
3300026277|Ga0209350_1062214 | All Organisms → cellular organisms → Bacteria | 1064
3300026296|Ga0209235_1142857 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 967
3300026298|Ga0209236_1052045 | All Organisms → cellular organisms → Bacteria | 2035
3300026300|Ga0209027_1011523 | All Organisms → cellular organisms → Bacteria | 3342
3300026301|Ga0209238_1005573 | All Organisms → cellular organisms → Bacteria | 5036
3300026301|Ga0209238_1071342 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1220
3300026310|Ga0209239_1010335 | All Organisms → cellular organisms → Bacteria | 4972
3300026310|Ga0209239_1023326 | All Organisms → cellular organisms → Bacteria | 3031
3300026313|Ga0209761_1047243 | All Organisms → cellular organisms → Bacteria | 2436
3300026318|Ga0209471_1003977 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Firmicutes → Clostridia | 8241
3300026318|Ga0209471_1046753 | All Organisms → cellular organisms → Bacteria | 2027
3300026318|Ga0209471_1121358 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1112
3300026328|Ga0209802_1035028 | All Organisms → cellular organisms → Bacteria | 2601
3300026331|Ga0209267_1111372 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1158
3300026333|Ga0209158_1027453 | All Organisms → cellular organisms → Bacteria | 2493
3300026335|Ga0209804_1055827 | All Organisms → cellular organisms → Bacteria | 1911
3300026530|Ga0209807_1020793 | All Organisms → cellular organisms → Bacteria | 3196
3300026530|Ga0209807_1206489 | All Organisms → cellular organisms → Bacteria | 684
3300026536|Ga0209058_1111388 | All Organisms → cellular organisms → Bacteria | 1365
3300026547|Ga0209156_10113979 | All Organisms → cellular organisms → Bacteria | 1332
3300026548|Ga0209161_10044133 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 2966
3300026548|Ga0209161_10048892 | All Organisms → cellular organisms → Bacteria | 2778
3300026550|Ga0209474_10042065 | All Organisms → cellular organisms → Bacteria | 3329
3300026551|Ga0209648_10015837 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 6576
3300026552|Ga0209577_10743615 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 550
3300027587|Ga0209220_1001673 | All Organisms → cellular organisms → Bacteria | 6099
3300027643|Ga0209076_1006118 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 2903
3300027674|Ga0209118_1002521 | All Organisms → cellular organisms → Bacteria | 7857
3300027725|Ga0209178_1008281 | All Organisms → cellular organisms → Bacteria | 3268
3300027787|Ga0209074_10220828 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 722
3300027842|Ga0209580_10395220 | All Organisms → cellular organisms → Bacteria | 689
3300027875|Ga0209283_10000464 | All Organisms → cellular organisms → Bacteria | 20533
3300027875|Ga0209283_10084048 | All Organisms → cellular organisms → Bacteria | 2066
3300027882|Ga0209590_10061752 | All Organisms → cellular organisms → Bacteria | 2128
3300028536|Ga0137415_10013223 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Thermomicrobia | 8184
3300028536|Ga0137415_10087183 | All Organisms → cellular organisms → Bacteria | 2978
3300031753|Ga0307477_10482891 | All Organisms → cellular organisms → Bacteria | 843
3300031823|Ga0307478_10084656 | All Organisms → cellular organisms → Bacteria | 2427
3300031896|Ga0318551_10850296 | All Organisms → cellular organisms → Bacteria | 531
3300031962|Ga0307479_10161727 | All Organisms → cellular organisms → Bacteria | 2198
3300031962|Ga0307479_11214019 | All Organisms → cellular organisms → Bacteria | 717
3300032067|Ga0318524_10408878 | All Organisms → cellular organisms → Bacteria | 708
3300032180|Ga0307471_100088245 | All Organisms → cellular organisms → Bacteria | 2761
3300032180|Ga0307471_100654876 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Micromonosporales → Micromonosporaceae | 1213
3300032180|Ga0307471_100894605 | All Organisms → cellular organisms → Bacteria | 1055
3300032180|Ga0307471_101867668 | All Organisms → cellular organisms → Bacteria | 751
3300032180|Ga0307471_102134112 | All Organisms → cellular organisms → Bacteria | 705
3300032180|Ga0307471_104028273 | All Organisms → cellular organisms → Bacteria | 519
3300032205|Ga0307472_100901922 | All Organisms → cellular organisms → Bacteria | 818
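
The scaffold identifiers in this table appear to join a 10-digit IMG/M taxon OID (the same values listed in the Associated Samples table) and a scaffold name with a `|`. A minimal Python sketch, assuming that layout holds for every row, splits an identifier into its two parts. `ScaffoldRef` and `parse_scaffold_id` are hypothetical names used only for illustration.

```python
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    """Two-part scaffold identifier as it appears in the table (assumed layout)."""
    taxon_oid: str       # e.g. "3300001086", matches a Taxon OID in Associated Samples
    scaffold_name: str   # e.g. "JGI12709J13192_1002117"


def parse_scaffold_id(scaffold_id: str) -> ScaffoldRef:
    """Split 'taxon_oid|scaffold_name' at the first '|' and validate both halves."""
    taxon_oid, sep, scaffold_name = scaffold_id.strip().partition("|")
    if not sep or not taxon_oid.isdigit() or not scaffold_name:
        raise ValueError(f"unexpected scaffold identifier: {scaffold_id!r}")
    return ScaffoldRef(taxon_oid, scaffold_name)


ref = parse_scaffold_id("3300001086|JGI12709J13192_1002117")
print(ref.taxon_oid, ref.scaffold_name)
```

Keeping the taxon OID as a string rather than an integer avoids losing leading characters if the identifier scheme ever changes; grouping rows by `taxon_oid` then gives the scaffolds contributed by each sample.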



Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 30.25%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 20.99%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 11.73%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 9.26%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 8.02%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 4.94%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 3.09%
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 1.85%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 1.85%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 1.85%
Iron-Sulfur Acid Spring | Environmental → Aquatic → Thermal Springs → Hot (42-90C) → Acidic → Iron-Sulfur Acid Spring | 1.23%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 1.23%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 1.23%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 1.23%
Hydrocarbon Resource Environments | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Hydrocarbon Resource Environments | 0.62%
Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 0.62%




Associated Samples

Taxon OID | Sample Name | Habitat Type
3300001086 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM3_M3 | Environmental
3300002560 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cm | Environmental
3300002853 | PDIso9.ppmwps2 | Environmental
3300002908 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cm | Environmental
3300002911 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cm | Environmental
3300002914 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm | Environmental
3300005166 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123 | Environmental
3300005167 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121 | Environmental
3300005175 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122 | Environmental
3300005176 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 | Environmental
3300005177 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 | Environmental
3300005179 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133 | Environmental
3300005181 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127 | Environmental
3300005187 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124 | Environmental
3300005440 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-3 metaG | Environmental
3300005445 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG | Environmental
3300005447 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 | Environmental
3300005454 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_136 | Environmental
3300005467 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG | Environmental
3300005468 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG | Environmental
3300005471 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-2 metaG | Environmental
3300005518 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaG | Environmental
3300005534 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen07_05102014_R1 | Environmental
3300005536 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG | Environmental
3300005542 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1 | Environmental
3300005552 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 | Environmental
3300005554 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 | Environmental
3300005555 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 | Environmental
3300005556 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 | Environmental
3300005557 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 | Environmental
3300005558 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 | Environmental
3300005561 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 | Environmental
3300005568 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 | Environmental
3300005569 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154 | Environmental
3300005575 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_151 | Environmental
3300005576 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157 | Environmental
3300005586 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 | Environmental
3300005764 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2) | Environmental
3300006032 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 | Environmental
3300006755 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter | Environmental
3300006791 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_102 | Environmental
3300006796 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 | Environmental
3300006797 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108 | Environmental
3300006804 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200 | Environmental
3300006806 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100 | Environmental
3300006903 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD5 | Host-Associated
3300006904 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3 | Host-Associated
3300006914 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD5 | Host-Associated
3300006954 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Control | Environmental
3300007076 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD4 | Host-Associated
3300007255 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 | Environmental
3300007265 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 | Environmental
3300007982 | Iron sulfur acid spring bacterial and archeal communities from Banff, Canada, to study Microbial Dark Matter (Phase II) - Paint Pots PPM 11 metaG | Environmental
3300009088 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG | Environmental
3300009089 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG | Environmental
3300009090 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG | Environmental
3300010361 | Tropical forest soil microbial communities from Panama - MetaG Plot_23 | Environmental
3300011269 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaG | Environmental
3300012189 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG | Environmental
3300012201 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaG | Environmental
3300012202 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG | Environmental
3300012203 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG | Environmental
3300012206 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaG | Environmental
3300012207 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaG | Environmental
3300012208 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaG | Environmental
3300012209 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaG | Environmental
3300012210 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaG | Environmental
3300012211 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaG | Environmental
3300012361 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaG | Environmental
3300012918 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaG | Environmental
3300012927 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly) | Environmental
3300012944 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly) | Environmental
3300012971 | Tropical forest soil microbial communities from Panama - MetaG Plot_1 | Environmental
3300012975 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_11112015 | Environmental
3300012977 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_5_24_1 metaG | Environmental
3300018431 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104 | Environmental
3300018433 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116 | Environmental
3300018482 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118 | Environmental
3300022557 | Paint Pots_combined assembly | Environmental
3300025910 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes) | Environmental
3300025922 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes) | Environmental
3300026277 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_2_20cm (SPAdes) | Environmental
3300026296Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026298Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026300Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026301Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026310Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026313Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026318Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 (SPAdes)EnvironmentalOpen in IMG/M
3300026328Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 (SPAdes)EnvironmentalOpen in IMG/M
3300026331Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 (SPAdes)EnvironmentalOpen in IMG/M
3300026333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 (SPAdes)EnvironmentalOpen in IMG/M
3300026335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 (SPAdes)EnvironmentalOpen in IMG/M
3300026530Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154 (SPAdes)EnvironmentalOpen in IMG/M
3300026536Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147 (SPAdes)EnvironmentalOpen in IMG/M
3300026547Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124 (SPAdes)EnvironmentalOpen in IMG/M
3300026548Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 (SPAdes)EnvironmentalOpen in IMG/M
3300026550Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 (SPAdes)EnvironmentalOpen in IMG/M
3300026551Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes)EnvironmentalOpen in IMG/M
3300026552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 (SPAdes)EnvironmentalOpen in IMG/M
3300027587Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM3_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027643Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 (SPAdes)EnvironmentalOpen in IMG/M
3300027674Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027725Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS200 (SPAdes)EnvironmentalOpen in IMG/M
3300027787Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter (SPAdes)EnvironmentalOpen in IMG/M
3300027842Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300027875Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027882Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300031753Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_515EnvironmentalOpen in IMG/M
3300031823Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_05EnvironmentalOpen in IMG/M
3300031896Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.082b2f19EnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M
3300032060Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.084b2f18EnvironmentalOpen in IMG/M
3300032067Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.052b4f22EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M

Geographical Distribution

Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI12709J13192_100211723300001086Forest SoilVGYALEGVAYVGGAVLIGAGLFLVMRGSFPTWWQSRLLWPVANVTPMVARLQGWAAIGVGASIIAIVFTTVAPEKVGGILVLAAFAAYLAGVALFAFSTWLSRRPA*
JGI25383J37093_1007636813300002560Grasslands SoilMTRLQSEAVGYVVEGVAYVSGTVLIGAGLYLIMRGTFPAWWQRRLLWPLVRVTPAVAHLQGWAAVGLGISILAIVFTSVAPDGIAGILVVAAMAAYLVGLVLFLFSTWLSRRPAS*
draft_100029663300002853Hydrocarbon Resource EnvironmentsVHVEIRGQGHASRLQFRAVGYVVEGVAYVGGALLIGAGIFLVMRGSFPTGWSNRLLWPLANVTPIVARLQGWAGIAVGASIIAIVFTTVAPEKVGGILVIAAIAAYLAGLVLFVFSTWLSRRPV*
JGI25382J43887_1003690823300002908Grasslands SoilVVEGVAYVSGTVLIGAGLYLIMRGTFPAWWQRRLLWPLVRVTPAVAHLQGWAAVGLGISILAIVFTSVAPDGIAGILVVAAMAAYLVGLVLFLFSTWLSRRPAS*
JGI25390J43892_1004353823300002911Grasslands SoilVVEGVAYVSGTVLIGAGLYLIMRGTFPAWWQRRLLWPLVRVTPAVAHLQGWAAVGLGISILAIVFTSVAPDGIAGILVVAAMAAYLVGLVLFLFSTWLSRRRAA*
JGI25617J43924_1000629333300002914Grasslands SoilMGYAVEGIAYVGGTVLIGAGLYLVMRGTFPAWWRRRLLWPLVRVTPTVSHLQGWAAIGLGVSVIAIVFTTVAPEVVAGILVLLAIALYLLGVALFLFSTWLSRRPAS*
Ga0066674_1032647823300005166SoilMHVEVGREGHRTRLQSAAVGYVVEGVAYVSGTVLIGAGLYLIMRGTFPAWWQRRLLWPLVRVTPAVAHLQGWAAVGLGISILAIVFTSVAPDGIAGILVVAAMAAYLVGLVLFLFSTWLSRRRAA*
Ga0066672_1003465333300005167SoilVFGTVLIGAGLYLVMRGTFPAWWRRRLMWPLVRVTPTVSHLQGWAAIGLGVSVLAIVFTTVAPEVVAGLLVVLALAAYVVGLALFVFSTWLSRRPA*
Ga0066672_1027168423300005167SoilGLYLIMRGTFPAWWQRRLLWPLVRVTPAVAHLQGWAAVGLGISILAIVFTSVAPDGIAGILVVAAMAAYLVGLVLFLLSTWLSRRPAS*
Ga0066673_1076833613300005175SoilCARADQEAEASIGSRLQSASVGYALEGVAYVGGTMLIGAGLYLVMRGNFPAWWRRRLLWPLLNVTPRVAHLQGWAAVGLGISAIAIVFATVAAEVAAGILVVFALTMYVAALALFLLSTWLSRRGATSG*
Ga0066679_1009829413300005176SoilGLYLVMRGTFPAWWRRRLMWPLVRVTPTVSHLQGWAAIGLGVSVLAIVFTTVAPEVVAGLLVVLALAAYVVGLALFVFSTWLSRRPV*
Ga0066679_1015258523300005176SoilVGYVVEGAAYVGGTMLIAAGVYLVMRGTLPAWWQRRMLWPLVRVTPTIAHLQGWTAIVLGISVLAIVFTTVAPELVAGILVVVALAGYLVALALFGFSTWLSRRPA*
Ga0066690_1031485023300005177SoilVGYVAEGFAYVFGTVLIGAGLYLVMRGTFPAWWRRRLMWPLVRVTPTVSHLQGWAAIGLGVSVLAIVFTTVAPEVVAGLLVVLALAAYVVGLALFV
Ga0066684_1029669123300005179SoilVEGVAYVGGTVLIGAGLYLVMRGSFPSWWGQRLLWPLVRVTPTVSHLQGWAAIGLGASILAIVFTTVAPDLVAGILVVLAMAAYVVGVALFLFSTWLSRRPA
Ga0066678_1013577213300005181SoilVGYVAEGFAYVFGTVLIGAGLYLVMRGTFPAWWRRRLMWPLVRVTPTVSHMQGWAAIGLGVSVLAIVFTTVAPEVVAGLLVVLALAAYVVGLALFVFSTWLSRRPA*
Ga0066675_1030403023300005187SoilVGYVLEGVAYVGGTMLIGAGLYLVMRGNFPAWWRRRLLWPLLNVTPRVAHLQGWAAVGLGISAIAIVFATVAAEVAAGILVVFALTMYLAALVLFLLSTWLSRRGATSG*
Ga0070705_10000949223300005440Corn, Switchgrass And Miscanthus RhizosphereVDMEVGRPGHAPRLQFRAVAYVVEGLAFVAGGLLIAGGLYLVMRGAFPAWWQQRLLWPLVRVTPTVAHMQGVAAIGLGASIVTIVLTSVVSEAVGGILVLVAFLAYLVALALFLFSTWLSRRPA*
Ga0070708_100000126333300005445Corn, Switchgrass And Miscanthus RhizosphereMGYVVEGIAYVGGTMLIGAGLYLVMRGTFPAWWRQRLLWPLVRVTPTVSHLQGWAAIGLGVSVIAIVFTTVAPPVVAGLLVLLAMACYVVGVALFLFSTWLSRRPAA*
Ga0070708_10001297223300005445Corn, Switchgrass And Miscanthus RhizosphereVGYGLQGLGYVAGTGLIAAGVYLVVRGNFPPWWSQRLMWPLVNVTRRVSHLQGWAAIALGLSIIAIVFTSAAPVVVGGVLVVVALLLYALALALYLFSTWLSRRPAA*
Ga0070708_10128908823300005445Corn, Switchgrass And Miscanthus RhizosphereVGYGLQALAYVAGTGLIAAGVYLVVRGNFPSWWSQRLMWPLVNVTRRVSHMQGWAAIALGLSIIAIVFTSAAPVLVGGVLVVVALSLYAVALGLYLFSTWLSRRPAA*
Ga0066689_1020899023300005447SoilVEGVAYVGGTVLIGAGLYLVMRGSFPSWWGQRLLWPLVRVTPTVSHLQGWAAIGLGASILAIVFTTVAPDLVAGILVVLAMAAYVVGVALFLFSTWLSRRPAA*
Ga0066687_1035216823300005454SoilVGYVVEGVAYLSGTVLIGAGLYLIMRGTFPAWWQRRFLWPLVRVTPGVAHLQGWAAVGLGISILAIVFTSVAPDRIGGILVLAAMAAYLVGLALFGVSTWLSRRPAG*
Ga0070706_10005076643300005467Corn, Switchgrass And Miscanthus RhizosphereVGYGLQALGYVAGTGLIAAGVYLVVRGNFPSWWSQRLMWPLVNVTRRVSHLQGWAAIALGLSIIAIVFTSAAPVVVGGVLVVVALLLYALALALYLFSTWLSRRPAA*
Ga0070707_10000442423300005468Corn, Switchgrass And Miscanthus RhizosphereVHVEIWRQGHGSRLQFPAVGYVLEGVAYVSGTALIGAGLYLIMRGTFPAWWQRRFLWPLVRVTPTVAHLQGWAAVGLGSSILAIVFTSVAPDGVAGILVLAALAAYLVGLVLFVVSTWLSRRPAA*
Ga0070698_100002823153300005471Corn, Switchgrass And Miscanthus RhizosphereVGYGLEALAYVAGTGLIAAGVYLVMRGNFPAWWSPRLMWPLVNVTRRVSHLQGWAAIALGLSIIAIVFTSAAPEVVGGVLVVVALFFYLLAVALYLFSTWLSRRPAA*
Ga0070699_10119544623300005518Corn, Switchgrass And Miscanthus RhizosphereVGYGLQALGYVAGTGLIAAGVYLVVRGNFPSWWSQRLMWPLVNVTRRVSHLQGWAAIALGLSIIAVVLTSAAPVVVGGVLVVVALLLYALALALYLFSTWLSRRP
Ga0070735_1012948723300005534Surface SoilMTRQGYNPGFVGYVLEGVAYVGGTILIGAGLYLVMRGNFPAWWRRRLLWPLVNVTPRVSHLQGWAAVALGISILAIVFTTVAPEVVAGALAVFALVAYTASVALYVLSTWLSRRPQAS*
Ga0070697_10006190633300005536Corn, Switchgrass And Miscanthus RhizosphereVGYVLEGVAYVSGTVLIGAGLFLIMRGTFPAWWQRRFLWPLVRVTPAVARLQGWAAIGLGGSILAIVFTSVAPDGIAGILVLAALAAYLVGLVLFVVSTWLSRRPAA*
Ga0070697_10018377133300005536Corn, Switchgrass And Miscanthus RhizosphereMGYGLQALGYVAGTGLIAAGVYLVVRGNFPSWWSQRLMWPLVNVTRRVSHLQGWAAIALGLSIIAIVFTSAAPVVVGGVLVVVALLLYALALALYLFSTWLSRRPAA*
Ga0070697_10032467623300005536Corn, Switchgrass And Miscanthus RhizosphereMEVGRPGHAPRLQFRAVAYVVEGLAFVAGGLLIAGGLYLVMRGAFPAWWQQRLLWPLVRVTPTVAHMQGVAAIGLGASIVTIVLTSVVSEAVGGILVLVAFLAYLVALALFLFSTWLSRRPA*
Ga0070732_1037122713300005542Surface SoilGASHDHSVCTWRSGGSGTAVRLPELESPAVGYVLEGVAYVSGVLMIGGGVYLVMRGAFPGWWGDRLVWPLVSVTPGVARLQGWAVIVLGASILAIVFTTVAPAVVGGGLVLGAIAAYVVGVGLFFVSTWLSRRKAA*
Ga0066701_1001657413300005552SoilVGYVVEGVAYVSGTVLIGAGLYLIMRGTFPAWWQRRLLWPLVRVTPAVAHLQGWAAVGLGISILAIVFTSVAPDGIAGILVVAAMAAYLVGLVLFLFSTWLSRRPAS*
Ga0066661_1002306933300005554SoilVHVEIGREGHLVRLQSDSVGYVAEGFAYVFGTVLIGAGLYLVMRGTFPAWWRRRLMWPLVRVTPTVSHLQGWAAIGLGVSVLAIVFTTVAPEVVAGLLVVLALAAYVVGLALFVFSTWLSRRPA*
Ga0066692_1057563013300005555SoilEGHRTRLQSAAVGYVVEGVAYVSGTVLIGAGLYLIMRGTFPAWWQRRLLWPLVRVTPAVAHLQGWAAVGLGISILAIVFTSVAPDGIAGILVVAAMAAYLVGLVLFLFSTWLSRRPAS*
Ga0066707_1070891323300005556SoilLINIAAVGYVAEGFAYIFGTVLIGAGLYLVMRGTFPAWWRRRLLWPLVRVTPAVSHLQGWAAIGLGISVLAIVFTTVAPELVAGLLVVLALAAYIVGLALFAFSTWLSRRPA*
Ga0066704_1014135833300005557SoilGAETVYKIGSVGYVVEGVAYVGGTVLIGAGLYLVMRGSFPNWWGQRLLWPLVRVTPTVSHLQGWAAIGLGASILAIVFTTVAPDLVAGILVVLAMAAYVVGVALFLFSTWLSRRPAA*
Ga0066704_1025240123300005557SoilVGYVAEGFAYVFGTVLIGAGLYLVMRGTFPAWWRRRLMWPLVRVTPTVSHLQGWAAIGLGVSVLAIVFTTVAPEVVAGLLVVLALAAYVVGLALFVFSTWLSRRPA*
Ga0066698_1108002513300005558SoilRPLGVHVEVGREGHRTRLQSAAVGYVVEGVAYVSGTVLIGAGLYLIMRGTFPAWWQRRLLWPLVRVTPAVAHLQGWAAVGLGISILAIVFTSVAPDGIAGILVVAAMAAYLVGLVLFLFSTWLSRRRAA*
Ga0066699_1005609123300005561SoilVHVEVGRKRHRTRLQSEAVGYVVEGVAYLSGTVLIGAGLYLIMRGTFPAWWQRRFLWPLVRVTPGVAHLQGWAAVGLGISILAIVFTSVAPDRIGGILVLAAMAAYLVGLALFGVSTWLSRRPAG*
Ga0066699_1024963023300005561SoilEGFAYVFGTVLIGAGLYLVMRGTFPAWWRRRLMWPLVRVTPTVSHLQGWAAIGLGVSVLAIVFTTVAPEVVAGLLVVLALAAYVVGLALFVFSTWLSRRPA*
Ga0066703_1009488523300005568SoilVGYVVEGAAYVGGTMLIAAGVYLVMRGTLPAWWQRRMLWPLVRVTPTIAHLQGWTAIVLGISVLAIVFTTVAPELLAGILVVVALAGYLVALALFGFSTWLSRRPA*
Ga0066703_1029498223300005568SoilVHVEVGRKRHRTRLQSEAVGYVVEGVAYLSGTVLIGAGLYLIMRGTFPAWWQRRFLWPLIRVTPGVAHLQGWAAVGLGISILAIVFTSVAPDRIGGILVLAAMVAYLVGLALFGVSTWLSRRPAG*
Ga0066705_1001360823300005569SoilLINIAAVGYVAEGFAYIFGTVLIGAGLYLVMRGTFPAWWRRRLLWPLVRVTPAVSHLQGWAAIGLGISVLAIVFTTVAPELVAGLLVVLALAAYTVGLALFVFSTWLSRRPA*
Ga0066702_1002224933300005575SoilVHVEVGRKRHRTRLQSEAVGYVVEGVAYLSGTVLIGAGLYLIMRGTFPAWWQRRFLWPLIRVTPGVAHLQGWAAVGLGISILAIVFTSVAPDRIGGILVLAAMAAYLVGLALFGVSTWLSRRPAG*
Ga0066708_1001616333300005576SoilVGYVVEGVAYVSGTVLIGAGLYLIMRGTFPAWWQRRLLWPLVRVTPAVAHLQGWAAVGLGISILAIVFTSVAPDGIAGILVVAAMAAYLVGLVLFLFSTWLSRRRAA*
Ga0066708_1032984913300005576SoilVGYALEGVAYVGGTMLIGAGLYLVMRGNFPAWWRRRLLWPLLNVTPRVAHLQGWAAVGLGISAIAIVFATVAAEVAAGILVVFALTMYLAALALFLLSTWLSRRGATSG*
Ga0066708_1107232723300005576SoilCPSGAARHLPMNGEEINIAAVGYVAEGFAYIFGTVLIGAGLYLVMRGTFPAWWRRRLLWPLVRVTPAVSHLQGWAAIGLGISVLAIVFTTVAPELVAGLLVVLALAAYTVGLALFVFSTWLSRRPA*
Ga0066691_1000106533300005586SoilVEIGREGHLVRLQSDSVGYVAEGFAYVFGTVLIGAGLYLVMRGTFPAWWRRRLMWPLVRVTPTVSHLQGWAAIGLGVSVLAIVFTTVAPEVVAGLLVVLALAAYVVGLALFVFSTWLSRRPA*
Ga0066691_1069151113300005586SoilGAGLYLIMRGTFPAWWQRRLLWPLVRVTPAVAHLQGWAAVGLGISILAIVFTSVAPDGIAGILVVAAMAAYLVGLVLFLFSTWLSRRRAA*
Ga0066903_10030253433300005764Tropical Forest SoilVGYAIETVAYVGGALLIGAGLYLVIRGKFPGWWARRFLWPVVRVTPTIARMQGWAAFGAGASIIAIVFTTVASERVSGMLVLAAIAAYVVAVLLFLFSTWLSRRPAA*
Ga0066903_10837112723300005764Tropical Forest SoilVDVEVWRYGHAPRLQFADVGYVVEGVAYVSGVFLIGAGLYLVLRGSFPAWWHRRLLWPLVRVTPTVSHLQGLAAIGLGLSILAIVFSSAVSDSLAGLLVLGALVAYVLGLALFVLSTWLSRRPAS
Ga0066696_1012047923300006032SoilVVRPLGVHVEVGREGHRTRLQSAAVGYVVEGVAYVSGTVLIGAGLYLIMRGTFPAWWQRRLLWPLVRVTPAVAHLQGWAAVGLGISILAIVFTSVAPDGIAGILVVAAMAAYLVGLVLFLFSTWLSRRRAA*
Ga0066696_1025487523300006032SoilVGYALEGVAYVGGTMLIGAGLYLVMRGNFPAWWRRRLLWPLLNVTPRVAHLQGWAAVGLGISAIAIVFATVAAEVAAGILVVFALTMYLAALVLFLLSTWLSRRSAA*
Ga0079222_1051458823300006755Agricultural SoilVGYVVEGLALVGGAFLVGAGVYLVRRGTFPEWWRRRMLWPLVHVTPRVSHLQGWAAIALGVSILSIVLTTVAPDGAAGILVVLALLAYIGGVALYLLSTWISRRPAA*
Ga0066653_1037788023300006791SoilMHVEVGREGHRTRLQSAAVGYVVEGVAYVSGTVLIGAGLYLIMRGTFPAWWQRRLLWPLVRVTPAVAHLQGWAAVGLGISILAIVFTSVAPDGIAGILVVAAMAAYLVGLV
Ga0066665_1004476923300006796SoilLINIAAVGYVAEGFAYIFGTVLIGAGLYLVMRGTFPAWWRRRLMWPLVRVTPTVSHLQGWAAIGLGVSVLAIVFTTVAPELVAGLLVVLALAAYILGLVLFVFSTWLSRRPA*
Ga0066659_1075519523300006797SoilVGYVAEGFAYVFGTVLIGAGLYLVMRGTFPAWWRRRLMWPLVRVTPTVSHMQGWAAIGLGVSVLAIVFTTVAPEVVAGLLVVLALAAYVVGLALFVFSTWLSRRPV*
Ga0079221_1008387733300006804Agricultural SoilVHVEISGQGHPSRLQSASVGYGLEALAYVAGTALIAAGVYLVMRGNFPSWWSPRLMWPLVNVTRRVSHLQGWAAIALGLSIIAIVFTSAAPEVVGGVLVVVALLLYVLAVALYLFSTWLSRRPAA*
Ga0079220_1057438423300006806Agricultural SoilVHVEVGWEGHQLRLMARLELPAVGYVVEGVAYVGGTVLIGAGLYLVLRGNFPAWWRRRLLWPLVRVTPRVSHLQGWAAIGLGVSLISIVFTTVAPELVAGVLVLVAMAAYVIGVALFLFSTWLSRRPT
Ga0075426_1096030213300006903Populus RhizosphereGQGHQSRLQSASVGYGLEALAYVAGTGLIAAGVYLVIRGNFPSWWSRRLMWPLVNVTRRVSHLQGWAAIALGLSIIAIVFTSAAPEVVGGVLVVVALFFYVLAVALYLFSTWLSRRPAA*
Ga0075424_10013190343300006904Populus RhizosphereVGYLVQAAAYLSGVLLIGGGLYLVLRGSFPGWWQKRLTWPLVRVTPTVAHLQGWAAVGVGASILAIVFTSVAPEGVSGLLVLLAMVAYLVGLLLFLFSTWLSRRPAS*
Ga0075436_10070938023300006914Populus RhizosphereAGTGLIAAGVYLVIRGNFPSWWSRRLMWPLVNVTRRVSHLQGWAAIALGLSIIAIVFTSAAPEVVGGVLVVVALFFYMLAVALYLFSTWLSRRPTA*
Ga0075436_10127066923300006914Populus RhizosphereAGTGLIAAGVYLVIRGNFPSWWSRRLMWPLVNVTRRVSHLQGWAAIALGLSIIAIVFTSAAPEVVGGVLVVVALFFYVLAVALYLFSTWLSRRPAA*
Ga0079219_1012409923300006954Agricultural SoilVHVEISGQGHPSRLQSASVGYGLEALAYVAGTGLIAAGVYLVMRGNFPSWWSPRLMWPLVNVTRRVSHLQGWAAIALGLSIIAIVFTSAAPEVVGGVLVVVALLLYVLAVALYLFSTWLSRRPAA*
Ga0079219_1021613613300006954Agricultural SoilVGYVVEGLALVGGAFLVGAGFYLVRRGTFPEWWRRRMLWPLVHVTPRVSHLQGWAAIALGVSILSIVLTTVAPDGAAGILVVLALLAYIGG
Ga0075435_10002787543300007076Populus RhizosphereVGYLVQAVAYLSGVLLIGGGLYLVLRGSFPGWWQKRLTWPLVRVTPTVAHLQGWAAVGVGASILAIVFTSVAPEGVSGLLVLLAMVAYLVGLLLFLFSTWLSRRPAS*
Ga0099791_1044094123300007255Vadose Zone SoilVGYVVEGIAYVSGVVLIGAGGYLIMRGGFPEWWQRRLLWPLVRVTPRVSHLQGWAAVGLGISILAIVFTTVVPELVAGILVVVALALYLVGLALFLFSTWLSRRPAS*
Ga0099794_1055687923300007265Vadose Zone SoilLIGAGGYLIMRGGFPEWWQRRLLWPLVRVTPRVSHLQGWAAVGLGISILAIVFTTVVPELVAGILVVVALALYLVGLALFLFSTWLSRRPAS*
Ga0102924_100475373300007982Iron-Sulfur Acid SpringVGYVVEGVAYVGGALLIGAGIFLVMRGSFPTGWSNRLLWPLAHVTPIVARLQGWAGIAVGASIIAIVFTTVAPEKVGGILVIAAIAAYLAGLVLFVFSTWLSRRPV*
Ga0099830_1111689523300009088Vadose Zone SoilVVEGIAYVSGVFLIGAGGYLIMRGSFPDWWQRRFLWPLVRITPRVSHLQGWALVGLGISILAIVFTSVAPETVAGVLVVVALALYLAGLALFLLSTWLSRRQAS*
Ga0099828_1000674873300009089Vadose Zone SoilMTRLQFRAVGYALEGIAYVGGALLIGAGIFLVMRGSFPAGWSNRLVWPLANVTPIVARLQGWAAIAVGASIIAIVFTTVAPERLGGVLVLAAIAAYLAGLALFAFSTWLSRRPAA*
Ga0099828_1000931573300009089Vadose Zone SoilMLIGAGLYLVMRGTFPAWWRSRLLWPLVRLTPTVSHLQGWAAIGLGISVLAIVFTTVAPELVAGVLVVLAMAAYLVGLVLFLFSTWLSRRPAG*
Ga0099827_1002023063300009090Vadose Zone SoilVVEGVAYVSGTVLIGAGLYLIMRGTFPAWWQRRLLWPLVRVTPAVAHLQGWAAVGLGISILAIVFTSVAPDGVAGILVVAAMAAYLVGLVLFLFSTWVSRRAS*
Ga0126378_1030197723300010361Tropical Forest SoilVGYAVETVAYVAGALLIGAGLYLVIRGKFPGWWARRFLWPVVRVTPTVARLQGWAALGAGASIVAIVFSGVAPERVGGILVLAAIAAYVVAVLLFLFSTWLSRRPAA*
Ga0137392_1024049533300011269Vadose Zone SoilVVEGIAYISGVFLIGAGGYLIMRGSFPDWWQRRFLWPLVRITPRVSHLQGWALVGLGISILAIVFTSVAPETVAGVLVVVALALYLAGLALFLLSTWLSRRQAS*
Ga0137388_1013800913300012189Vadose Zone SoilVHVEIRWQSHPSRLQFPAVGYVVEGIAYVSGVFLIGAGGYLIMRGSFPDWWQRRFLWPLVRITPRVSHLQGWAVVGLGISILAIVFTSVVPETVAGILVVVALTLYIAGLALFLFSTWLSRRPTA*
Ga0137388_1079110323300012189Vadose Zone SoilVGYVLEAVAYISGVCLVGGGLYVVLRGTFPAWWPERLLWPLIRITPTVSHLQGLAAIGLGVSILAIVLSSAVPESTAGLLVLGALLAYLIGLLLFIFSSWLSRRPAS*
Ga0137365_1000928373300012201Vadose Zone SoilVGYLVQAAAYVSGVLLIGGGLYLILRGSLPGWWQRRLMWPLVRVTPTVAHLQGWAAVGVGASILAIVVTSVAPEGVSGLLVWLAMVAYLAGLLLFLFSTWLSRRPAS*
Ga0137363_1172175513300012202Vadose Zone SoilIGVVGEGDCGKPELDRSLAASFHRPAAAVVRPLGVHVEVGGQRHASRLQFRAVGYVVEGIAYVSGVFFIGVGGYLLMRGSFPEWWQKRFRWPLVQITPRVSHLQGLAAVGIGISIVALVFTTAVPDVAGGILVAFALAAYLAGLVLYLFSTWLSRRPAA*
Ga0137399_1049462423300012203Vadose Zone SoilVVEGIAYVSGVFLIGAGGYLIMRGSFPDWWQRRFLWPLVRITPRVSHLQGWAVVGLGISILAIVFTSVVPETVAGILVVVALALYIAGLALFLFSTWLSRRPTA*
Ga0137399_1076144423300012203Vadose Zone SoilVGYVIEGVAYISGTVLIGAGLYLIMRGTFPAWWQRRLLWPLVRVTPGVAHLQGWAAVGLGISILAIVFTSVAPAVIGGILVLAAMAAYLVGLVLFAISTWLSRRRAA*
Ga0137399_1125999113300012203Vadose Zone SoilVHVEVWGKGHRTRLQSEAVGYVVEGVAYVSGACLIGGGLYLVMRGTFPGWWPQRILWPLVRVTPSVARLQGLTAVAAGASILTIVFTSIVPETTGGILVL
Ga0137380_1012264423300012206Vadose Zone SoilVHVEIGREGHPVRLQSGSVGYVVEGFAYVFGTMLIGAGLYLVMRGTFPTWWRRRLLWPLVRVTPTVSHLQGWAAIGLGISVLAIVFTTVAPELVAGLLVVLALAAYLVGVALFVFSTWLSRRPA*
Ga0137380_1016270923300012206Vadose Zone SoilVGYLVQAAAYVSGVLLIGGGLYLILRGSFPCWWQRRLMWPLVRVTPTVAHLQGWAAVGVGASILAIVVTSVAPEGVSGLLVWLAMVAYLAGLLLFLFSTWLSRRPAS*
Ga0137381_1001722743300012207Vadose Zone SoilVGYVVEGFAYVFGTMLIGAGLYLVMRGTFPTWWRRRLLWPLVRVTPTVSHLQGWAAIGVGISVLAIVFTTVAPELVAGLLVVLALAAYLVGVALFVFSTWLSRRPA*
Ga0137381_1068054323300012207Vadose Zone SoilVGYLVQAAAYVSGVLLIGGGLYLILRGSFPGWWQRRLTWPLVRVTPTVAHLQGWAAVGVGASILAIVVTSVAPEGVSGLLVWLAMVAYLAGLLLFLFSTWLSRRPAS*
Ga0137376_1011393423300012208Vadose Zone SoilVVEGFAYVFGTMLIGAGLYLVMRGTFPAWWRRRLMWPLVRVTPTVSHMQGWAAIGLGISVLAIVFTTVAPEVVAGLLVVLALAAYVVGLALFVFSTWLSRRPA*
Ga0137379_1000017253300012209Vadose Zone SoilVGYLVQAAAYVSGVLLIGGGLYLILRGSFPGWWQRRLMWPLVRVTPTVAHLQGWAAVGVGASILAIVLTSVAPEGVSGLLVWLAMVAYLAGLLLFLFSTWLSRRPAS*
Ga0137379_10001181123300012209Vadose Zone SoilVHVEIGREGHPVRLQSGSVGYVVEGFAYVFGTMLIGAGLYLVMRGTFPTWWRRRLLWPLVRVTPTVSHLQGWAAIGLGISVLAIVFTTVAPQLVAGLLVVLALAAYLVGVALFVFSTWLSRRPA*
Ga0137378_1042169823300012210Vadose Zone SoilVGYVVEGFAYVFGTMLIGAGLYLVMRGTFPTWWRRRLLWPLVRVTPTVSHLQGWAAIGLGISVLAIVFTTVAPQLVAGLLVVLALAAYLVGVALFVFSTWLSRRPA*
Ga0137377_1153465223300012211Vadose Zone SoilVAEGFAYVFGTVLIGAGLYLVMRGTFPAWWRRRLMWPLVRVTPAVSHLQGWAAIGLGISVLAIVFTTVAPELVAGLLVVLALAAYTVGLALFVFSTWLSRRPA*
Ga0137360_1055064123300012361Vadose Zone SoilLAYVAGTGLIAAGVYLVMRGNFPSWWSRRLMWPLVNVTRRVSHLQGWAAIALGLSIIAIVFTSAAPEVVGGVLVVVALFLYVLAVALYLFSTWLSRRPAA*
Ga0137396_1001806233300012918Vadose Zone SoilVVEGVAYVSGTVLIGAVLYLIMRGTFPAWWQRRLLWPLVRVTPAVAHLQGWAAVGLGISILAIVFTSVAPDGISGILVLGAMAAYLVGLVLYVVSTWLSRRRAV*
Ga0137416_1010001123300012927Vadose Zone SoilVHVEFRWQSHPSRLQFPAVGYVVEGIAYVSGVFLIGAGGYLIMRGSFPDWWQRRFLWPLVRITPRVSHLQGWAVVGLGISILAIVFTSVVPETVAGILVVVALALYIAGLALFLFSTWLSRRPTA*
Ga0137416_1021899623300012927Vadose Zone SoilVVEGVAYVSGTVLIGAGLYLIMRGTFPAWWQRRLLLSLVRVTPAVAHLQGWAALGLGTSILAIVFTSVAPDGIAGILVVAAMAAYLVGLVLFLFSTWLSRRPAS*
Ga0137410_1040251313300012944Vadose Zone SoilVAYVSGTVLIGAGLYLIMRGTFPAWWQRRFLWPLVRVTPGVAHLQGWAAVGLGGSILAIVFTPVAPDGIAGILVLAALAAYLVGLVLFVVSTWLLSAARSPAH*
Ga0126369_1198439323300012971Tropical Forest SoilVGYALETVAYVGGALLIGVGLYLVIRGKFPGWWARRFLWPVVRVTPTVARMQGWAAFGAGASIIAIVFTTVASERVSGMLVLAAIAAYVVAVLLFLFSTWLSRRPAA*
Ga0134110_1000780033300012975Grasslands SoilVVEGVAYVSGTVLIGAGLYLIMRGTFPAWWQRRLLWPLVRVTPAVAHLKGWAAVGLGISILAIVFTSVAPDGIAGILVVAAMAAYLVGLVLFLFSTWLSRRRAA*
Ga0134087_1004369823300012977Grasslands SoilVVEGVAYVSGTVLIGAGLYLIMRGTFPAWWQRRLLWPLVRVTPAVAHLQGWAAVGLGISILAIVFTSVAPAGIAGILVVAAMAAYLVGLVLFLFSTWLSRRRAA*
Ga0066655_1004826333300018431Grasslands SoilMHVEVGREGHRTRLQSAAVGYVVEGVAYVSGTVLIGAGLYLIMRGTFPAWWQRRLLWPLVRVTPAVAHLQGWAAVGLGISILAIVFTSVAPAGIAGILVVAAMAAYLVGLVLFLFSTWLSRRRAA
Ga0066667_1010575413300018433Grasslands SoilMHVEVGREGHRTRLQSAAVGYVVEGVAYVSGTVLIGAGLYLIMRGTFPAWWQRRLLWPLVRVTPAVAHLQGWAAVGLGISILAIVFTSVAPDGIAGILVVAAMATYLVGLVLFLFSTWLSRRRAA
Ga0066667_1040457923300018433Grasslands SoilVSGTVLIGAGLYLIMRGTFPAWWQRRLLWPLVRVTPAVAHLQGWAAVGLGISILAIVFTSVAPDGIAGILVVAAMAAYLVGLVLFLFSTWLSRRRAA
Ga0066667_1162440313300018433Grasslands SoilVGYALEGVAYVGGTMLIGAGLYLVMRGNFPAWWRRRLLWPLLNVTPRVAHLQGWAAVGLGISAIAIVFATVAAEVAAGILVVFALTMYL
Ga0066669_1007108223300018482Grasslands SoilMHVEVGREGHRTRLQSAAVGYVVEGVAYVSGTVLIGAGLYLIMRGTFPAWWQRRLLWPLVRVTPAVAHLQGWAAVGLGISILAIVFTSVAPDGIAGILVVAAMAAYLVGLVLFLFSTWLSRRRAA
Ga0212123_10002493183300022557Iron-Sulfur Acid SpringVHVEIRGQGHASRLQFRAVGYVVEGVAYVGGALLIGAGIFLVMRGSFPTGWSNRLLWPLANVTPIVARLQGWAGIAVGASIIAIVFTTVAPEKVGGILVIAAIAAYLAGLVLFVFSTWLSRRPV
Ga0207684_100000511643300025910Corn, Switchgrass And Miscanthus RhizosphereMGYVVEGIAYVGGTMLIGAGLYLVMRGTFPAWWRQRLLWPLVRVTPTVSHLQGWAAIGLGVSVIAIVFTTVAPPVVAGLLVLLAMACYVVGVALFLFSTWLSRRPAA
Ga0207684_10000060783300025910Corn, Switchgrass And Miscanthus RhizosphereVDMEVGRPGHAPRLQFRAVAYVVEGLAFVAGGLLIAGGLYLVMRGAFPAWWQQRLLWPLVRVTPTVAHMQGVAAIGLGASIVTIVLTSVVSEAVGGILVLVAFLAYLVALALFLFSTWLSRRPA
Ga0207684_1008988733300025910Corn, Switchgrass And Miscanthus RhizosphereMGYGLQALGYVAGTGLIAAGVYLVVRGNFPSWWSQRLMWPLVNVTRRVSHLQGWAAIALGLSIIAIVFTSAAPVVVGGVLVVVALLLYALALALYLFSTWLSRRPAA
Ga0207646_1000928863300025922Corn, Switchgrass And Miscanthus RhizosphereVHVEIWRQGHGSRLQFPAVGYVLEGVAYVSGTALIGAGLYLIMRGTFPAWWQRRFLWPLVRVTPTVAHLQGWAAVGLGSSILAIVFTSVAPDGVAGILVLAALAAYLVGLVLFVVSTWLSRRPAA
Ga0209350_106221413300026277Grasslands SoilVGYVVEGAAYVGGTMLIAAGVYLVMRGTLPAWWQRRMLWPLVRVTPTIAHLQGWTAIVLGISVLAIVFTTVAPELVAGILVVVALAGYLVALALFG
Ga0209235_114285713300026296Grasslands SoilRPLGMHVEVGREGHMTRLQSEAVGYVVEGVAYVSGTVLIGAGLYLIMRGTFPAWWQRRLLWPLVRVTPAVAHLQGWAAVGLGISILAIVFTSVAPDGIAGILVVAAMAAYLVGLVLFLFSTWLSRRPAS
Ga0209236_105204523300026298Grasslands SoilVGYVVEGVAYVSGTVLIGAGLYLIMRGTFPAWWQRRLLWPLVRVTPAVAHLQGWAAVGLGISILAIVFTSVAPDGIAGILVVAAMAAYLVGLVLFLFSTWLSRRPAS
Ga0209027_101152313300026300Grasslands SoilVGYALEGVAYVGGTMLIGAGLYLVMRGNFPAWWRRRLLWPLLNVTPRVAHLQGWAAVGLGISAIAIVFATVAAEVAAGILVVFALTMYLAALVLFLLSTWLSRRSAAA
Ga0209238_100557353300026301Grasslands SoilVGYVVEGVAYVSGTVLIGAGLYLIMRGTFPAWWQGRLLWPLVRVTPAVAHLQGWAAVGLGISILAIVFTSVAPDGIAGILVVAAMAAYLVGLVLFLFSTWLSRRRAA
Ga0209238_107134223300026301Grasslands SoilEASIGSRLQSASVGYALEGVAYVGGTMLIGAGLYLVMRGNFPAWWRRRLLWPLLNVTPRVAHLQGWAAVGLGISAIAIVFATVAAEVAAGILVVFALTMYLAALVLFLLSTWLSRRGATS
Ga0209239_101033543300026310Grasslands SoilVVEGVAYVSGTVLIGAGLYLIMRGTFPAWWQRRLLWPLVRVTPAVAHLQGWAAVGLGISILAIVFTSVAPDGIAGILVVAAMATYLVGLVLFLFSTWLSRRRAA
Ga0209239_102332633300026310Grasslands SoilVGYVVEGAAYVGGTMLIAAGVYLVMRGTLPAWWQRRMLWPLVRVTPTIAHLQGWTAIVLGISVLAIVFTTVAPELVAGILVVVALAGYLVALALFGFSTWLSRRPA
Ga0209761_104724333300026313Grasslands SoilMHVEVGREGHRTRLQSEAVGYVVEGVAYVSGTVLIGAGLYLIMRGTFPAWWQRRLLWPLVRVTPAVAHLQGWAAVGLGISILAIVFTSVAPDGIAGILVVAAMAAYLVGLVLFLFSTWLSRRPAS
Ga0209471_100397773300026318SoilVGYVAEGFAYVFGTVLIGAGLYLVMRGTFPAWWRRRLMWPLVRVTPTVSHLQGWAAIGLGVSVLAIVFTTVAPEVVAGLLVVLALAAYVVGLALFVFSTWLSRR
Ga0209471_104675333300026318SoilVGYVVEGVAYVSGTVLIGAGLYLIMRGTFPAWWQRRLLWPLVRVTPAVAHLQGWAAVGLGISILAIVFTSVAPDGIAGILVVAAMAAYLVGLVLFLFSTWLSRRRAA
Ga0209471_112135823300026318SoilIAAGVYLVMRGTLPAWWQRRMLWPLVRVTPTIAHLQGWTAIVLGISVLAIVFTTVAPELVAGILVVVALAGYLVALALFGFSTWLSRRPA
Ga0209802_103502823300026328SoilVGYVVEGVAYVSGTVLVGAGLYLIMRGTFPAWWQRRLLWPLVRVTPAVAHLQGWAAVGLGISILAIVFTSVAPDGIAGILVVAAMAAYLVGLVLFLFSTWLSRRPAS
Ga0209267_111137223300026331SoilAAVGYVVEGVAYVSGTVLIGAGLYLIMRGTFPAWWQRRLLWPLVRVTPAVAHLQGWAAVGLGISILAIVFTSVAPDGIAGILVVAAMAAYLVGLVLFLFSTWLSRRRAA
Ga0209158_102745323300026333SoilVEIGREGHLVRLQSDSVGYVAEGFAYVFGTVLIGAGLYLVMRGTFPAWWRRRLMWPLVRVTPTVSHLQGWAAIGLGVSVLAIVFTTVAPEVVAGLLVVLALAAYVVGLALFVFSTWLSRRPA
Ga0209804_105582733300026335SoilVGYVAEGFAYVFGTVLIGAGLYLVMRGTFPAWWRRRLMWPLVRVTPTVSHLQGWAAIGLGVSVLAIVFTTVAPEVVAGLLVVLALAAYVVGLALFVFSTWLSRRPA
Ga0209807_102079323300026530SoilLINIAAVGYVAEGFAYIFGTVLIGAGLYLVMRGTFPAWWRRRLLWPLVRVTPAVSHLQGWAAIGLGISVLAIVFTTVAPELVAGLLVVLALAAYTVGLALFVFSTWLSRRPA
Ga0209807_120648923300026530SoilYVSGTVLIGAGLYLIMRGTFPAWWQRRLLWPLVRVTPAVAHLQGWAAVGLGISILAIVFTSVAPDGIAGILVVAAMAAYLVGLVLFLFSTWLSRRRAA
Ga0209058_111138813300026536SoilVGYVVEGVAYVSGTVLIGAGLYLIMRGTFPAWWQRRLLWPLVRVTPAVAHLQGWAAVGLGISILAIVFTSVAPDGIAGILVVAAMAAYLVGLVLFLFSTWLSR
Ga0209156_1011397933300026547SoilVGYVVEGVAYVSGTVLIGAGLYLIMRGTFPAWWQRRLLWPLVRVTPAVTHLQGWAAVGLGISILAIVFTSVAPDGIAGILVVAAMAAYLVG
Ga0209161_1004413333300026548SoilLINIAAVGYVAEGFAYIFGTVLIGAGLYLVMRGTFPAWWRRRLLWPLVRVTPAVSHLQGWAAIGLGISVLAIVFTTVAPEPVGGLLVVLALAAYLVGTVLFVFSTWLSRRPA
Ga0209161_1004889223300026548SoilMHVEVGREGHRTRLQSAAVGYVVEGVAYVSGTVLIGAGLYLIMRGTFPAWWQRRLLWPLVRVTPAVAHLQGWAAVGLGISILAIVFTSVAPAGIAGILVVAAMAAYLVGLVLFLFSTWLSRRPAS
Ga0209474_1004206533300026550SoilMHVEVGREGHRTRLQSAAVGYVVEGVAYVSGTVLIGAGLYLIMRGTFPAWWQRRLLWPLVRVTPAVAHLQGWVAVGLGISILAIVFTSVAPDGIAGILVVAAMAAYLVGLVLFLFSTWLSRRRAA
Ga0209648_1001583733300026551Grasslands SoilMGYAVEGIAYVGGTVLIGAGLYLVMRGTFPAWWRRRLLWPLVRVTPTVSHLQGWAAIGLGVSVIAIVFTTVAPEVVAGILVLLAIALYLLGVALFLFSTWLSRRPAS
Ga0209577_1074361523300026552SoilIGAGLYLVMRGTFPAWWRRRLMWPLVRVTPTVSHLQGWAAIGLGVSVLAIVFTTVAPEVVAGLLVVLALAAYVVGLALFVFSTWLSRRPV
Ga0209220_100167323300027587Forest SoilVHVEIGRQGHRSRLQFRAVGYALEGVAYVGGAVLIGAGLFLVMRGSFPTWWQSRLLWPVANVTPMVARLQGWAAIGVGASIIAIVFTTVAPEKVGGILVLAAFAAYLAGVALFAFSTWLSRRPA
Ga0209076_100611833300027643Vadose Zone SoilMHVEVGREGHRTRLQSAAVGYVVEGVAYVSGTVLIGAGLYLIMRGTFPAWWQRRLLWPLVRVTPAVAHLQGWAAVGLGISILAIVFTSVAPDGIAGILVVAAMAAYLVGLVLFLFSTWLSRRPAS
Ga0209118_100252163300027674Forest SoilVGYALEGVAYVGGAVLIGAGLFLVMRGSFPTWWQSRLLWPVANVTPMVARLQGWAAIGVGASIIAIVFTTVAPEKVGGILVLAAFAAYLAGVALFAFSTWLSRRPA
Ga0209178_100828123300027725Agricultural SoilVGYVVEGLALVGGAFLVGAGVYLVRRGTFPEWWRRRMLWPLVHVTPRVSHLQGWAAIALGVSILSIVLTTVAPDGAAGILVVLALLAYIGGVALYLLSTWISRRPAA
Ga0209178_121106513300027725Agricultural SoilGTGLIAAGVYLVMRGNFPSWWSPRLMWPLVNVTRRVSHLQGWAAIALGLSIIAIVFTSAAPEVVGGVLVVVALLLYVLAVALYLFSTWLSRRPAA
Ga0209074_1022082823300027787Agricultural SoilRTGGSTGQGYNPDLVGYVVEGLALVGGAFLVGAGVYLVRRGTFPEWWRRRMLWPLVHVTPRVSHLQGWAAIALGVSILSIVLTTVAPDGAAGILVVLALLAYIGGVALYLLSTWISRRPA
Ga0209580_1039522023300027842Surface SoilVLEGVAYVGGSALIGAGLFLVMRGNFPTWWRERFSWPLVNITPTVSHLQGAAAIVIGASVLAIVFTTVAPESVAGWLVILALAAYVAGAALFLFSTWLSRRPV
Ga0209283_1000046433300027875Vadose Zone SoilVEIGRENHAVRLQSASVGYVLEGFAYVFGTMLIGAGLYLVMRGTFPAWWRSRLLWPLVRLTPTVSHLQGWAAIGLGISVLAIVFTTVAPELVAGVLVVLAMAAYLVGLVLFLFSTWLSRRPAG
Ga0209283_1008404823300027875Vadose Zone SoilMTRLQFRAVGYALEGIAYVGGALLIGAGIFLVMRGSFPAGWSNRLVWPLANVTPIVARLQGWAAIAVGASIIAIVFTTVAPERLGGVLVLAAIAAYLAGLALFAFSTWLSRRPAA
Ga0209590_1006175223300027882Vadose Zone SoilVGYVVEGVAYVSGTVLIGAGLYLIMRGTFPAWWQRRLLWPLVRVTPAVAHLQGWAAVGLGISILAIVFTSVAPDGVAGILVVAAMAAYLVGLVLFLFSTWVSRRAS
Ga0137415_1001322353300028536Vadose Zone SoilVGYVVEGIAYVSGVFLIGAGGYLIMRGSFPDWWQRRFLWPLVRITPRVSHLQGWAVVGLGISILAIVFTSVVPETVAGILVVVALALYIAGLALFLFSTWLSRRPTA
Ga0137415_1008718333300028536Vadose Zone SoilMEVGREGHRTRLQSAAVGYVVEGVAYVSGTVLIGAGLYLIMRGTFPAWWQRRLLLSLVRVTPAVAHLQGWAALGLGTSILAIVFTSVAPDGIAGILVVAAMAAYLVGLVLFLFSTWLSRRPAS
Ga0307477_1048289123300031753Hardwood Forest SoilVGYVVEGIAFVGGALLVGAGLYLVRRGTFPAWWRQRMLWPLVHVTPRVSHLQGWAAIALGVSILSIVFTTVAPDGIAGVLVVLGLLAYLGGVALYLLSTWLSRRPTV
Ga0307478_1008465633300031823Hardwood Forest SoilVGYVVEGIAFVGGALLVGAGLYLVRRGTFPAWWRQRMLWPLVHVTPRVSHLQGWAAIALGVSILSIVFTTVAPDGIAGILVVLGLLAYLAGVALYVLSSWLSRRPTA
Ga0318551_1085029613300031896SoilHRGGDTPQGYNPTFVGHVIEGVAYVGGTFLIGAGLYLVLRGTFPAWWQRRMLWPLVRVTPRVSHLQGWTAVALGISILSIVFTTAAPETLAGLLVAIALVMYVGAVALYLFSTWLSRRPA
Ga0307479_1016172733300031962Hardwood Forest SoilVHVEVGWEGHLTRLQSASVGYGLAALAYISGSALIAIGVYLVMRGSYPAWWGARLQWPLVNVTPRVSHLQGWAAIGLGVSVLAIVFSTVAGTVIAGFLVLFALAAYVLAVGLFAFSTWLSRRPAD
Ga0307479_1121401923300031962Hardwood Forest SoilISGSALIAVGVYLVMRGSYPAWWGARLQWPLVNVTPRVSHLQGWAAIGLGVSVLAIVFSTVAGTVIAGFLVLFALAAYLLSVGLFAFSTWLSRRPAG
Ga0318505_1033761723300032060SoilLQSASVGYGLEAVAYVAGTALIAAGVYLVMRGAFPAWWSPRLLWPLVNVTRRVSHLHGWAAIALGVSIIAIVFATAAGDVVSGLLVMVAIAFYLLALALFLFSTWLSRRRAA
Ga0318524_1040887813300032067SoilVGHVIEGVAYVGGTFLIGAGLYLVLRGTFPAWWQRRMLWPLVRVTPRVSHLQGWTAVALGISILSIVFTTAAPETLAGLLVAIALVMYVGAVALYLFSTWLSRRPAA
Ga0307471_10008824533300032180Hardwood Forest SoilVGYVVEGAAYVGGTFLIGAGLYLVMRGTFPAWWRQRMLWPLVNLTPRVSHLQGWAAVALGISILSIVFTTVAPDTVAGILVVVAFVAYLGGVALYLFSTWLSRRPTA
Ga0307471_10065487623300032180Hardwood Forest SoilVGYVVEGLAYVGGTFLIGAGLYLVMRGTFPSWWRQRMLWPLVNVTPRVSHLQGWAAIALGVSILSIVFTTVAPDGVAGILVVLALVTYLGGVALYLLSTWLSRRPAA
Ga0307471_10089460513300032180Hardwood Forest SoilVHVEVGWEGHLTRLQSASVGYGLEALAYISGSALIAIGVYLVMRGSYPAWWGARLQWPLVNVTPRVSHLQGWAAIGLGVSVLAIVFSTVAGTVIAGFLVLFALAAYLLSVG
Ga0307471_10108274623300032180Hardwood Forest SoilGLEALAYVAGTGFIAAGVYLVMRGNFPSWWSRRLMWPLVNVTRRVSHLQGWAAIALGLSIIAIVFTSAAPEVVGGVLVVVALFLYVLALALYLFSTWLSRRPAA
Ga0307471_10186766813300032180Hardwood Forest SoilHVEVRRQRHPSRLQSASVGYALEAAAYVGGTVLIACGVYLVMRGSFPAWWSDRLLWPVVNVTRRVSHLQGWAAIGLGVSVLAIVFSTVAGAVIAGFLVVFALAAYLLAVGLFALSTWLSRRPAG
Ga0307471_10213411223300032180Hardwood Forest SoilVGYALEAVAYVGGAMLIGAGLYLIMRGNFPAWWRSRLLWPLVNVTPRVAHLQGWAAVGLGASMIAVVFATVAAEVAAGLLVVFALTAYIGALALFLLSTWLSRRTAA
Ga0307471_10402827323300032180Hardwood Forest SoilTFLIGAGLFLIMRGTFPAWWQRRFLWPLARVTPPVARLQGWAAIGLGSSILAIVFTSVAPDGIAGILVLAALAAYLVGLVLFVVSTWLSRRPVS
Ga0307472_10000122423300032205Hardwood Forest SoilVGYGLEALAYVAGTGFIAAGVYLVMRGNFPSWWSRRLMWPLVNVTRRVSHLQGWAAIALGLSIIAIVFTSAAPEVVGGVLVVVALFLYVLALALYLFSTWLSRRPAA
Ga0307472_10090192213300032205Hardwood Forest SoilVHVEVWWKRHRVSRLQSALVGYALEAAAYIGGTVLIACGVYLVMRGNFPAWWGDRLLWPLVNVTRRVSHLQGWAAIGLGVSVLAIVFSTVAGAVIAGFLVAFALAAYLLAAGLFAVSTWLSRRPAG

© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice