NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F104941


Overview

Basic Information
Family ID F104941
Family Type Metagenome
Number of Sequences 100
Average Sequence Length 144 residues
Representative Sequence MFPDKAKMVRSANALFVIFAVAPGLYLMLAALATFGKPGFARDPNIIPWLFLELALVSLANIGVTIFVQTSTKLMSERARYDPIGRTYLIMATGAVLSEAHAIYGLVLTLLSGSIFYGIGFTIVAWASLWWVWKRFKQSLASLPNE
Number of Associated Samples 68
Number of Associated Scaffolds 100

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Archaea
% of genes with valid RBS motifs 62.00 %
% of genes near scaffold ends (potentially truncated) 21.00 %
% of genes from short scaffolds (< 2000 bps) 57.00 %
Associated GOLD sequencing projects 58
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.53

Most Common Taxonomy
Group Archaea (84.000 % of family members)
NCBI Taxonomy ID 2157
Taxonomy All Organisms → cellular organisms → Archaea

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (58.000 % of family members)
Environment Ontology (ENVO) Unclassified (50.000 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere (62.000 % of family members)



Structure & Topology

Predicted Secondary Structure and Topology

Classification: Transmembrane (alpha-helical)
Signal Peptide: No
Secondary Structure distribution: α-helix: 71.84%, β-sheet: 0.00%, Coil/Unstructured: 28.16%

Predicted 3D Structure

pTM-score: 0.53

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
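
The cutoff quoted in the note can be written as a one-line check. The following is a minimal sketch, not NMPFamsDB code: the names `PTM_THRESHOLD` and `model_quality` are hypothetical, and only the 0.7 threshold and this family's pTM-score of 0.53 come from the page.

```python
# Threshold taken from the note above: pTM < 0.7 is treated as low confidence.
PTM_THRESHOLD = 0.7


def model_quality(ptm_score: float) -> str:
    """Label a predicted model by its pTM-score (hypothetical helper)."""
    if not 0.0 <= ptm_score <= 1.0:
        raise ValueError("pTM-score is expected to lie in [0, 1]")
    return "low" if ptm_score < PTM_THRESHOLD else "high"


# pTM-score reported for family F104941
print(model_quality(0.53))
```

With the reported score of 0.53 the helper returns "low", which matches the label shown for this family.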



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 100 Family Scaffolds
PF04299 | FMN_bind_2 | 23.00
PF00848 | Ring_hydroxyl_A | 8.00
PF01872 | RibD_C | 5.00
PF00254 | FKBP_C | 3.00
PF01850 | PIN | 2.00
PF02219 | MTHFR | 2.00
PF06197 | DUF998 | 1.00
PF13470 | PIN_3 | 1.00
PF13668 | Ferritin_2 | 1.00
PF07088 | GvpD_P-loop | 1.00
PF01963 | TraB_PrgY_gumN | 1.00
PF00924 | MS_channel | 1.00
PF01380 | SIS | 1.00
PF00355 | Rieske | 1.00

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 100 Family Scaffolds
COG4638 | Phenylpropionate dioxygenase or related ring-hydroxylating dioxygenase, large terminal subunit | Inorganic ion transport and metabolism [P] | 16.00
COG0262 | Dihydrofolate reductase | Coenzyme transport and metabolism [H] | 5.00
COG1985 | Pyrimidine reductase, riboflavin biosynthesis | Coenzyme transport and metabolism [H] | 5.00
COG0685 | 5,10-methylenetetrahydrofolate reductase | Amino acid transport and metabolism [E] | 2.00
COG0668 | Small-conductance mechanosensitive channel | Cell wall/membrane/envelope biogenesis [M] | 1.00
COG1916 | Pheromone shutdown protein TraB, contains GTxH motif (function unknown) | Function unknown [S] | 1.00
COG3264 | Small-conductance mechanosensitive channel MscK | Cell wall/membrane/envelope biogenesis [M] | 1.00
COG3371 | Uncharacterized membrane protein | Function unknown [S] | 1.00



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 94.00 %
Unclassified | root | N/A | 6.00 %

Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
3300002908|JGI25382J43887_10027912 | Not Available | 3042
3300005166|Ga0066674_10114047 | All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon | 1261
3300005167|Ga0066672_10002376 | All Organisms → cellular organisms → Archaea | 7480
3300005171|Ga0066677_10004289 | All Organisms → cellular organisms → Bacteria | 5656
3300005174|Ga0066680_10008539 | All Organisms → cellular organisms → Archaea | 5131
3300005177|Ga0066690_10175264 | All Organisms → cellular organisms → Archaea | 1419
3300005180|Ga0066685_10632260 | All Organisms → cellular organisms → Archaea | 736
3300005445|Ga0070708_100162081 | All Organisms → cellular organisms → Archaea → TACK group → Crenarchaeota → Thermoprotei | 2084
3300005468|Ga0070707_100004780 | All Organisms → cellular organisms → Archaea | 12677
3300005468|Ga0070707_100035181 | All Organisms → cellular organisms → Archaea | 4778
3300005553|Ga0066695_10105198 | All Organisms → cellular organisms → Archaea | 1733
3300005554|Ga0066661_10002902 | All Organisms → cellular organisms → Bacteria | 7459
3300005555|Ga0066692_10395329 | All Organisms → cellular organisms → Archaea | 876
3300005559|Ga0066700_10083956 | All Organisms → cellular organisms → Archaea | 2051
3300005586|Ga0066691_10423113 | Not Available | 793
3300005598|Ga0066706_11244205 | All Organisms → cellular organisms → Archaea | 564
3300005598|Ga0066706_11388139 | Not Available | 530
3300006034|Ga0066656_10085481 | All Organisms → cellular organisms → Archaea | 1891
3300006755|Ga0079222_10275481 | All Organisms → cellular organisms → Archaea | 1078
3300006796|Ga0066665_10400905 | All Organisms → cellular organisms → Archaea | 1131
3300006800|Ga0066660_10449850 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1076
3300007255|Ga0099791_10061298 | All Organisms → cellular organisms → Archaea | 1694
3300007255|Ga0099791_10625497 | All Organisms → cellular organisms → Archaea | 527
3300007258|Ga0099793_10029732 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales → Thalassobaculaceae → Nisaea | 2311
3300007258|Ga0099793_10307461 | All Organisms → cellular organisms → Archaea | 771
3300007258|Ga0099793_10371533 | All Organisms → cellular organisms → Archaea | 701
3300007265|Ga0099794_10000107 | All Organisms → cellular organisms → Archaea | 30372
3300007265|Ga0099794_10275891 | All Organisms → cellular organisms → Archaea | 869
3300009038|Ga0099829_10036390 | Not Available | 3552
3300009038|Ga0099829_11202756 | All Organisms → cellular organisms → Archaea | 628
3300009089|Ga0099828_10487409 | All Organisms → cellular organisms → Archaea | 1112
3300009089|Ga0099828_11020141 | All Organisms → cellular organisms → Archaea | 737
3300009137|Ga0066709_100251469 | All Organisms → cellular organisms → Archaea | 2366
3300011269|Ga0137392_10348890 | All Organisms → cellular organisms → Archaea | 1226
3300011270|Ga0137391_10545661 | All Organisms → cellular organisms → Archaea | 977
3300012096|Ga0137389_10053156 | All Organisms → cellular organisms → Archaea | 3069
3300012096|Ga0137389_10861397 | All Organisms → cellular organisms → Archaea → TACK group → Crenarchaeota → unclassified Thermoproteota → Crenarchaeota archaeon 13_1_20CM_2_51_8 | 778
3300012199|Ga0137383_10021842 | All Organisms → cellular organisms → Archaea | 4437
3300012199|Ga0137383_10025725 | All Organisms → cellular organisms → Bacteria | 4109
3300012200|Ga0137382_10763352 | All Organisms → cellular organisms → Archaea | 695
3300012202|Ga0137363_11256681 | All Organisms → cellular organisms → Archaea | 628
3300012203|Ga0137399_10001594 | All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon | 10542
3300012203|Ga0137399_10045784 | All Organisms → cellular organisms → Archaea | 3142
3300012203|Ga0137399_10225795 | All Organisms → cellular organisms → Archaea | 1529
3300012203|Ga0137399_10941992 | All Organisms → cellular organisms → Archaea | 727
3300012206|Ga0137380_10092485 | All Organisms → cellular organisms → Archaea | 2764
3300012206|Ga0137380_10553291 | All Organisms → cellular organisms → Archaea | 1010
3300012206|Ga0137380_10912471 | All Organisms → cellular organisms → Archaea | 754
3300012206|Ga0137380_11278587 | All Organisms → cellular organisms → Archaea | 619
3300012207|Ga0137381_10522026 | All Organisms → cellular organisms → Bacteria | 1036
3300012209|Ga0137379_10126297 | All Organisms → cellular organisms → Archaea | 2458
3300012209|Ga0137379_10339481 | All Organisms → cellular organisms → Archaea | 1412
3300012209|Ga0137379_10983656 | All Organisms → cellular organisms → Archaea → TACK group → Candidatus Bathyarchaeota → unclassified Candidatus Bathyarchaeota → Candidatus Bathyarchaeota archaeon | 749
3300012210|Ga0137378_10036902 | All Organisms → cellular organisms → Archaea | 4358
3300012211|Ga0137377_10033484 | All Organisms → cellular organisms → Archaea | 4677
3300012211|Ga0137377_10374594 | All Organisms → cellular organisms → Archaea | 1361
3300012349|Ga0137387_10878871 | All Organisms → cellular organisms → Archaea | 649
3300012349|Ga0137387_11188810 | All Organisms → cellular organisms → Archaea | 539
3300012351|Ga0137386_10433943 | All Organisms → cellular organisms → Archaea | 946
3300012351|Ga0137386_11278563 | All Organisms → cellular organisms → Archaea | 510
3300012351|Ga0137386_11313037 | All Organisms → cellular organisms → Archaea | 501
3300012356|Ga0137371_10356642 | All Organisms → cellular organisms → Archaea | 1136
3300012357|Ga0137384_10153592 | All Organisms → cellular organisms → Archaea | 1927
3300012357|Ga0137384_10724057 | All Organisms → cellular organisms → Archaea | 807
3300012357|Ga0137384_11051302 | All Organisms → cellular organisms → Archaea | 654
3300012361|Ga0137360_10108386 | All Organisms → cellular organisms → Archaea | 2139
3300012362|Ga0137361_10627205 | All Organisms → cellular organisms → Archaea | 984
3300012362|Ga0137361_10947264 | All Organisms → cellular organisms → Archaea | 779
3300012918|Ga0137396_10055052 | All Organisms → cellular organisms → Archaea | 2739
3300012918|Ga0137396_10828402 | All Organisms → cellular organisms → Archaea | 680
3300012925|Ga0137419_10045726 | All Organisms → cellular organisms → Archaea | 2794
3300012927|Ga0137416_11373945 | All Organisms → cellular organisms → Archaea | 639
3300012944|Ga0137410_10379752 | All Organisms → cellular organisms → Archaea | 1135
3300015241|Ga0137418_10001968 | All Organisms → cellular organisms → Archaea | 18854
3300017657|Ga0134074_1200834 | Not Available | 707
3300018433|Ga0066667_10370843 | All Organisms → cellular organisms → Archaea | 1146
3300018468|Ga0066662_10047912 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales → Thalassobaculaceae → Nisaea | 2700
3300021046|Ga0215015_10141569 | Not Available | 3251
3300021046|Ga0215015_10281623 | All Organisms → cellular organisms → Archaea | 525
3300021088|Ga0210404_10003619 | All Organisms → cellular organisms → Archaea | 6255
3300025922|Ga0207646_10020062 | All Organisms → cellular organisms → Archaea | 6198
3300026297|Ga0209237_1027027 | All Organisms → cellular organisms → Archaea | 3182
3300026309|Ga0209055_1005291 | All Organisms → cellular organisms → Archaea | 7468
3300026315|Ga0209686_1005435 | All Organisms → cellular organisms → Bacteria | 5631
3300026315|Ga0209686_1049985 | All Organisms → cellular organisms → Archaea | 1531
3300026317|Ga0209154_1003285 | All Organisms → cellular organisms → Archaea | 8806
3300026335|Ga0209804_1323868 | All Organisms → cellular organisms → Archaea | 519
3300026524|Ga0209690_1153688 | All Organisms → cellular organisms → Archaea | 838
3300026542|Ga0209805_1043215 | All Organisms → cellular organisms → Archaea | 2271
3300026551|Ga0209648_10429340 | All Organisms → cellular organisms → Archaea | 839
3300027643|Ga0209076_1002008 | All Organisms → cellular organisms → Archaea | 4320
3300027643|Ga0209076_1004940 | All Organisms → cellular organisms → Archaea | 3137
3300027671|Ga0209588_1000037 | All Organisms → cellular organisms → Archaea | 63830
3300027846|Ga0209180_10005863 | All Organisms → cellular organisms → Archaea | 6208
3300028536|Ga0137415_10210120 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1763
3300028536|Ga0137415_10967285 | All Organisms → cellular organisms → Archaea | 662
3300031962|Ga0307479_10000401 | All Organisms → cellular organisms → Archaea | 38771
3300031962|Ga0307479_10015483 | All Organisms → cellular organisms → Bacteria | 7157
3300031962|Ga0307479_10079148 | All Organisms → cellular organisms → Archaea | 3181
3300032180|Ga0307471_100074063 | All Organisms → cellular organisms → Archaea | 2956
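
The scaffold rows above can be tallied into the kind of summary figures the Overview reports (share of members per domain, share of scaffolds shorter than 2000 bp). The sketch below is illustrative only: `ROWS` holds four rows copied from the table, and the helper names `domain_of` and `summarize` are hypothetical, as is the choice to count "Not Available" as "Unclassified".

```python
from collections import Counter

# Four rows copied from the table above: (scaffold, taxonomy, length in bp).
ROWS = [
    ("3300002908|JGI25382J43887_10027912", "Not Available", 3042),
    ("3300005167|Ga0066672_10002376", "All Organisms → cellular organisms → Archaea", 7480),
    ("3300005171|Ga0066677_10004289", "All Organisms → cellular organisms → Bacteria", 5656),
    ("3300005180|Ga0066685_10632260", "All Organisms → cellular organisms → Archaea", 736),
]


def domain_of(taxonomy: str) -> str:
    """Return the third lineage level (the domain), or 'Unclassified'.

    Assumption: a lineage with fewer than three levels, such as
    'Not Available', is counted as unclassified.
    """
    parts = [p.strip() for p in taxonomy.split("→")]
    return parts[2] if len(parts) >= 3 else "Unclassified"


def summarize(rows, short_cutoff=2000):
    """Return (percent of rows per domain, percent of scaffolds below the cutoff)."""
    n = len(rows)
    domains = Counter(domain_of(tax) for _, tax, _ in rows)
    short = sum(1 for _, _, length in rows if length < short_cutoff)
    return {d: 100.0 * c / n for d, c in domains.items()}, 100.0 * short / n


domain_pct, short_pct = summarize(ROWS)
print(domain_pct, short_pct)
```

Running the same tally over all 100 rows, rather than this four-row sample, is what would be comparable to the percentages listed in the Overview.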




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 58.00%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 20.00%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 6.00%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 4.00%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 4.00%
Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 3.00%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 2.00%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 1.00%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 1.00%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 1.00%



Associated Samples

Taxon OID | Sample Name | Habitat Type
3300002908 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cm | Environmental
3300005166 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123 | Environmental
3300005167 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121 | Environmental
3300005171 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126 | Environmental
3300005174 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 | Environmental
3300005177 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 | Environmental
3300005180 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 | Environmental
3300005445 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG | Environmental
3300005468 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG | Environmental
3300005553 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144 | Environmental
3300005554 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 | Environmental
3300005555 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 | Environmental
3300005559 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 | Environmental
3300005586 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 | Environmental
3300005598 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 | Environmental
3300006034 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105 | Environmental
3300006755 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter | Environmental
3300006796 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 | Environmental
3300006800 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 | Environmental
3300007255 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 | Environmental
3300007258 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 | Environmental
3300007265 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 | Environmental
3300009038 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG | Environmental
3300009089 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaG | Environmental
3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental
3300011269 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaG | Environmental
3300011270 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaG | Environmental
3300012096 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG | Environmental
3300012199 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaG | Environmental
3300012200 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaG | Environmental
3300012202 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG | Environmental
3300012203 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG | Environmental
3300012206 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaG | Environmental
3300012207 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaG | Environmental
3300012209 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaG | Environmental
3300012210 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaG | Environmental
3300012211 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaG | Environmental
3300012349 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaG | Environmental
3300012351 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaG | Environmental
3300012356 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaG | Environmental
3300012357 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaG | Environmental
3300012361 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaG | Environmental
3300012362 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaG | Environmental
3300012918 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaG | Environmental
3300012925 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly) | Environmental
3300012927 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly) | Environmental
3300012944 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly) | Environmental
3300015241 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly) | Environmental
3300017657 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09212015 | Environmental
3300018433 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116 | Environmental
3300018468 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111 | Environmental
3300021046 | Soil microbial communities from Shale Hills CZO, Pennsylvania, United States - 90cm depth | Environmental
3300021088 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-M | Environmental
3300025922 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes) | Environmental
3300026297 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm (SPAdes) | Environmental
3300026309 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110 (SPAdes) | Environmental
3300026315 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126 (SPAdes) | Environmental
3300026317 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121 (SPAdes) | Environmental
3300026335 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 (SPAdes) | Environmental
3300026524 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 (SPAdes) | Environmental
3300026542 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 (SPAdes) | Environmental
3300026551 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes) | Environmental
3300027643 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 (SPAdes) | Environmental
3300027671 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 (SPAdes) | Environmental
3300027846 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes) | Environmental
3300028536 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly) | Environmental
3300031962 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515 | Environmental
3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental


Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI25382J43887_100279122 | 3300002908 | Grasslands Soil | MAIKPMFPDKTRMVRSANALFVIFAVAPGLYLMLAALVTFGKPGFARDPNIIPWLFLGLALFSLANIGVTVFVQTSTKLMSERARYDPIGRTYLIMATGAVLSEAHAIYGLVLMLLSGSTFYGIGFTILTWASLWWVWKRFKQNLASLPNA*
Ga0066674_101140472 | 3300005166 | Soil | LAKDRMFPNKGKMVRSANFLFVTLAVAPVLYLVTAVLATFGRPGLGVDPNIIPWLFIGLALVSLANIGVAVLVQSSTKLMSERARYDPINRAYLIMATGAVLSEAHAIYGLVLTLLSGSIFYGIGFTIVTWTSLWWVRKRFKQNLASLPDA*
Ga0066672_100023765 | 3300005167 | Soil | MFPNKAEMVRRANTLLVMLAVAPVLYLVVAFLVTFGKPGFEGNPVFVSVFFAVLALVSLANIGFTVFIQTSKKLMSSRASIDPVGRTFHIMSLGAVLSEVHAVYGLALTLVSGSILYGIGFTIVTWASLWWVRKRFKQNSASLPSAPEL*
Ga0066677_100042893 | 3300005171 | Soil | MFPNKAKMVRSANFLFVILAVAPGLYLVTALLVTDGKQGLARDSTIIPFLFLGLALFSLANIGVTILFQTSTKLMSERARYDPIGRAYLIASLGAVLSEAHAIYGLVLTLLSGSIFYGIGFTIIA*
Ga0066680_100085393 | 3300005174 | Soil | MFPNKAEMVRRANTLLVMLAVAPVLYLVVAFLVTFGKPGFEGNPVFVSVFFVVLALVSLANIGFTVFIQTSKKLMSSRASIDPVGRTFHIMSLGAVLSEVHAVYGLALTLVSGSILYGIGFTIVTWASLWWVRKRFKQNSASLPSAPEL*
Ga0066690_101752642 | 3300005177 | Soil | VVYPATAVLATVGRPGFARDPNVIPWLFLGLALVSLADIGVTVFVQTSRKLMLERARYDPINRAYLIMATGAVLSEAHAIYGQVLTLLSGSIFYGIGLTFVTWASLWWVRKRFKQSLESVPNG*
Ga0066685_106322602 | 3300005180 | Soil | MFPNKGKMVRSANFLFVTLAVAPVLYLVTAVLATFGRPGLGVDPNIIPWLFIGLALVSLANIGVAVLVQSSTKLMSERARYDPINRAYLIMATGAVLSEAHAIYGLVLTLLSGSIFYGIGFTIVTWTSLWWVRKRFKQNLASLPDA*
Ga0070708_1001620812 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | LFRDKAKMVRTANALLVRLAFAPGLYLVIAVLVTFGKPGLTGDPIFIRVLFIVLAVVSLANIGLTVFIETSKKLMSARTRYDPVGGTFQTMSLGAILSETHAIYGLVLTLLSGSIFYGIGFSIFTWTSLWWVRRRFKQNLASLPNA*
Ga0070707_1000047803 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | MVRTANALLVRLAFAPGLYLVIAVLVTFGKPGLTGDPIFIRVLFIVLAVVSLANIGLTVFIETSNKLMSARTRYDPVGGTFQIMSLGAILSETHAIYGLVLTLLSGSILYGIGFSIFTWTSLWWVRRRFKQNLASLPNA*
Ga0070707_1000351813 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | MVKGANALLVIFAVAPVLWPVIAFLVTLGNPGFARDPNIIPVLFIGLALVSLATIGFTVFIQTSRKLMSARARYDPIGRTFQLMSLGAILSEEHAIYGLVLSLLSGSIFYGIGFTIVTWASLWWIWKRFKQSLASLPNM*
Ga0066695_101051982 | 3300005553 | Soil | MFPDKAKMVRTANALLMTSAVAPGLYLVTAVLVTFGKPGLARDLNIIPWLFLGLALFSLANIGVTIFVQTSTKLMSERARYDPVGRTYLIMSTGAILSEAHAIYGLVLTLLSGSIFYEIGFTIVAWQVSGGSGNDSSRISETFQTRRPKR*
Ga0066661_100029024 | 3300005554 | Soil | LARKPMFPNKAEMVRRANTLLVMLAVAPVLYLVVAFLVTFGKPGFEGNPVFVSVFFAVLALVSLANIGFTVFIQTSKKLMSSRASIDPVGRTFHIMSLGAVLSEVHAVYGLALTLVSGSILYGIGFTIVTWASLWWVRKRFKQNSASLPSAPEL*
Ga0066692_103953292 | 3300005555 | Soil | GVYLVTAVLVTFGKPGFARDLSIIPWLFLGLALVSLASIGVTIFVQTSTKLMLERGRYDPTGRTYLIMSTGAVLSEAHAIYGFVLTLLSGSIFYGIGFTIVTWTSLWWVRKRFKQNLASLPDA*
Ga0066700_100839561 | 3300005559 | Soil | DGQNCELPFRCICRCAVVYPATAVLATVGRPGFARDPNVIPWLFLGLALVSLADIGVTVFVQTSRKLMLERARYDPINRAYLIMATGAVLSEAHAIYRQVLTLLSGSIFYGIGLTFVTWASLWWVRKRFKQSLESVPNG*
Ga0066691_104231132 | 3300005586 | Soil | MVRSANFLFVVFAVAPVVYLATAVIVNFGKPGFTGDPNIIPWLFLGLALVSLANIGVTISVQTSTKLMSERVRYDPIGRAYLIVSIGAVLSEAHAIYGLVLTLLSGSIFYGIGFTIVTWAVSGGCGNDSGRVSKAFPTGRPIRYL*
Ga0066706_112442051 | 3300005598 | Soil | RWKSPLFPDKAKMVRNANLLFVILAVAPVVYLVSAVQVTFGNPGLARDPNIIPWLFLGLALFSLANIGLTIFVQTSTKMMIDRARYDPIGRAYLIASLGAVQSEAHAIYGLVLMLLSGSIVYGIGFTVVAWASLWWVWKRFRQNLASLPNA*
Ga0066706_113881391 | 3300005598 | Soil | MFPNKGKMVRSANFLFVTLAVAPVLYLVTAVLATFGRPGLGVDPNIIPWLFIGLALVSLANIGVAVLVQSSTKLMSERARYDPINRAYLIMATGAVLSEAHAIYGLVLTLLSGSIFYGIG
Ga0066656_100854813 | 3300006034 | Soil | LAKEPRFPNKAKMVRSANALFVMFAAAPGLYLVIAALATFGKPGFARDPNIIPWLFLGLALVSLANIDLTISVQTSTKLMSERARYDPIGRAYLIASLGAILSEAHAIYGLVLTLLSGSIFYGIGFTIIAWASLWWVRKRFKQSLSSLPDA*
Ga0079222_102754811 | 3300006755 | Agricultural Soil | LARKLMFPNKAAMVRRANVLFVMLAVAPVLYLVIAFLVTFGKPGFAGNPVFVSVFFIVLASVSVANIGFTVFIQTSKKLTSARARYDPVGRIFQVMSLGAVLSEVHAVYGLALTLLAGSIMYGIGFCVLTWASLWWVRIRFKQNLARLPNS*
Ga0066665_104009052 | 3300006796 | Soil | LFPDKAKMVRNANLLFVILAVAPVVYLVSAVQVTFGNPGLARDPNIIPWLFLGLALFSLANIGLTIFVQTSTKMMIDRARYDPIGRAFLIASLGAVQSEAHAIYGLVLMLLSGSIVYGIGFTVVAWASLWWVWKRFRQNLASLPNA*
Ga0066660_104498502 | 3300006800 | Soil | MFPNKAEMVRHANTLLVMLAVAPVLYLVVAFLVTFGKPGFEGNPVFVSVFFAVLALVSLANIGFTVFIQTSKKLMSSRASIDPVGRTFHIMSLGAVLSEVHAVYGLALTLVSGSILYGIGFTIVTWASLWWVRKRFKQNSASLPSAPEHARR*
Ga0099791_100612982 | 3300007255 | Vadose Zone Soil | MFPNKAKMVRRANLLFVIFAFAPDLYLVTATLATLGKPGFARDPNVIPWLFLGLVLVSLANIGATVFVQTSRKLMLERARYDPIGRAYLIASLGAVLSEAHAIYGLVLTLLSGSIFYGIGFTIVAWASLWWVRRRFKQNLASLPNE*
Ga0099791_106254971 | 3300007255 | Vadose Zone Soil | LFPDKAKMVRSANALFVTLAVATGVYLVIAILLTFGKPGFARDPNIIPWLFLGLALVSLANIGVTVFVQTSKKLMSERLRYDPISRTYLIMSTGAILSEAHSIFGLVLTLLSGSIFYGIGFTIVTWASLWWVRRRFKQSLASLPNE*
Ga0099793_100297323 | 3300007258 | Vadose Zone Soil | LFPDKAKMVRSAITLFVIWFVAPVLYLVTAGLATFGKPGFARDPNIIPWLFLGIALFSLANVGVTIFVQTSKRLMSERARYDPIGRTYLIMSTGAILSEAHSIFGLVLTLLSGSIFYGIGFAIVTWASLWWVRRRFKQSLASLPNE*
Ga0099793_103074612 | 3300007258 | Vadose Zone Soil | LFPDKAKMVRSANLLFVIFAVAPVLWLVIAFLVTFGKPGFARDPNIIPWLFLGLALVSLANIGLTIFVQTSTRLMLERARYDPIGRAYLLSSLGAVLSEAHAIYGIVLTLLSGSILYELGFTIITWTSLWWVWKRFKQNLASLPNA*
Ga0099793_103715331 | 3300007258 | Vadose Zone Soil | LAKEPRFTNEAKMVRSANALFVMFAAAPGLYLVIAVLATFGKPGFARDPNVIPWLFLGLALVSLANIGLTIFVQTSRKLMSERARYDPIGRAYLIASLGAVLSEAHAIYGLVLTFLSGSIFYGIGFTIIAWASLWWVRKRFKQSLSSLPDA*
Ga0099794_1000010728 | 3300007265 | Vadose Zone Soil | LFPDKAKMVRSANALFVTLAVATGVYLVIAILLTFGKPGFARDPNIIPWLFLGLALVSLANIGVTVFVQTSKKLMSERLRYDPISRTYLIMSTGAILSEAHAIYGLVLTLLSGSVFYGVGFTIVALASLWWVWKRFKQSLASLPNE*
Ga0099794_102758911 | 3300007265 | Vadose Zone Soil | MFPNKAKMVRRSNLLFVIFVFAPDLYLVTATLATLGKPGFARDPNVIPWLFLGLCLFSLANIGVTVFIQTSKKIMSERARYDPIGRAYLIASLGAVLSEAHAIYGLVLTLLSGSIFYGIGFTIVAWASLWWVRRRFRQNLASLPDA*
Ga0099829_100363904 | 3300009038 | Vadose Zone Soil | LFPDKAQMVRSANLLFVIFAVAPVLYLVTAFLATFERSGLARDPSIIPVLFIGLALVSLTTIRLTVFVQTSKKVMPDRARYDPISRTYLIMSTGSILSEAHSIYGLVLTLLSGSILYGIGFSILAWASLWWVRKRFKRNLASLPDA*
Ga0099829_112027561 | 3300009038 | Vadose Zone Soil | MFPNKAKMVRSANLLFVMFAVAPGLYLVLAALATFGKPGFARDPNIIPLLFLGLALVSLANIGVTIFVQTSTKLMSERARYDPINRTYLIMAIGAVLSETHAIYGLVLTLLSGSILYGIGFSILTWASLWWVRKRFKQNLASLPNE*
Ga0099828_104874092 | 3300009089 | Vadose Zone Soil | MIRSANFLLVTFALAPVVYLVEAFLATFGKPGFARDPSIIPVLFIGLALVSLTTIRLTVFVQTSKKVMPDRARYDPISRTYLIMSTGSILSEAHSIYGLVLTLLSGSILYGIGFSILAWASLWWVRKRFKRNLASLPDA*
Ga0099828_110201412 | 3300009089 | Vadose Zone Soil | MFPNKAKMVRSANSLFVIFAVAPGLYLVLAALATFGKPGFARDPNVIPLLFLGLALVSLANIGVTIFVQTSTKLMSERASYDPINRTYLIMAIGTVLSEAHAIYGLVLTLLSGSIFHGIGFSILTWASLWWVRKRFKQSLARLPNE*
Ga0066709_1002514691 | 3300009137 | Grasslands Soil | MFPNKGKMVRSANFLFVTLAVAPVLYLVTAVLATFGRPGLGVDPNIIPWLFIGLALVSLANIGVAVLVQSSTKLMSERARYDPINRAYLIMATAAVLSEAHAIYGLVLTLLSGSIFYGIGFTIVTWTSLWWVRKRFKQNLASLPDA*
Ga0137392_103488902 | 3300011269 | Vadose Zone Soil | LFPDKAKMVRSANLLFVIFAVAPVLYLVTAFLATVERSGLARDPSIIPVLFIGLALVSLTTIRLTVFVQTSKKVMPDRARYDPISRTYLIMSTGSILSEAHSIYGLVLTLLSGSILYGIGFSILAWASLWWVRKRFKRNLASLPDA*
Ga0137391_105456612 | 3300011270 | Vadose Zone Soil | MFPNKAKMVRSANFLLVTFALAPVVYLVEAFLATFGKPGFARDPNIIPGLFIGLALASLANISFTVFIQTSNRVMSGRARYDPINRTYLIMATSAILSEAHSIYGLVLTLLSGSIFYGIGFTIVTWAALLWVRKRFKQNLPKLPNA*
Ga0137389_100531564 | 3300012096 | Vadose Zone Soil | ANLLVVIFGVAPGLYLVRAFLATFERSGLARDPSIIPVLFIGLALVSLTTIRLTVFVQTSKKVMPDRARYDPISRTYLIMSTGSILSEAHSIYGLVLTLLSGSILYGIGFSILAWASLWWVRKRFKRNLASLPDA*
Ga0137389_108613971 | 3300012096 | Vadose Zone Soil | APGLYLMTAFLATFGKPGFARDPNSVPWLFLGIALFSLANIGVTIFVQTNMKLMSERARYDPIGRTYLIMSTGAVLSEAHAIYGMVLTLLSSSIFYGSGFTILTWASLWWVWKRFKQNLASLPDT*
Ga0137383_100218426 | 3300012199 | Vadose Zone Soil | LFPDKAKMVRSVNALFVMLAVATGVYLVIAVLVTFGKPGFARDENFILWLFFGLALISLANIGVTIFVQTSKKLMSERARYDPIGRTYLIVSTSAILSEAHSIYGLVLALLSGTVIYGIGFTVVAWASLWWVRKRFKQSLASLPDA*
Ga0137383_100257253 | 3300012199 | Vadose Zone Soil | MFPNKAKMVRTANALLVMLAVAPGVYLVTAVLVTFGGPGFARDPNIILWLFVGIALFSLANIGVTVFVQTSTKLMAERARYDPIGRTYLVMSLGAVLSEAHAIYGLVLTLLSGSILYEIGFTIVTWASLWWIWKRFKQNLGNIPDTST*
Ga0137382_107633521 | 3300012200 | Vadose Zone Soil | MFPNKGKMVRSANFLFVTLAVAPVLYLVTAVLATFGRPGLGVDPNIIPWLFIGLALVSLANIGVAVLVQSSTKLMSERARYDPINRAYLIMATGAVLSEAHAIYGLVLTLLSGLIFYGIGFTIVTWTSLWWVRKRFKQNLASLPDA*
Ga0137363_112566811 | 3300012202 | Vadose Zone Soil | MFPNKAKMVRRANLLFVIFAFAPDLYLVTATLATLGKPGFARDPNVIPWLFLGLVLVSLANIGATVFVQTSRKLMLERARYDPIGRAYLIASLGAVLSEAHAIYGLVLTLLSGSIFYGIGFTIIAWASLWWVRKRFKQN
Ga0137399_1000159410 | 3300012203 | Vadose Zone Soil | MFPNKAKMVRRANLLFVIFAFAPDLYLVTATLATLGKPGFARDPNVIPLLFLGLALVSLANISATVFVQTSRKLMLERARYDPIGRAYLIASLGAVLSEAHAIFGLVLSLLSGSIFYGIGFTIVTWASLWWVRKRFKQNLPKLPNA*
Ga0137399_100457845 | 3300012203 | Vadose Zone Soil | ALFVILAVAPVLYLMTAVPVTFGGSGLARDPNLIAWLFLGLAFFSLANIGFTVFIQTSTKLMSERARYDPIGRTYLLMSLGAVLSEAHAIYGLVLTLLSGSIFYGIGFTILTWASLWWVWKRFKQSLASLPDT*
Ga0137399_102257952 | 3300012203 | Vadose Zone Soil | LFPDKAKMVRSAITLFVIWFVAPVLYLVTAGLATFGKPGFARDPNIIPWLFLGIALFSLANVGVTIFVQTSKRLMSERARYDPIGRTYLIMSTGAILSEAHSIFGLVLTLLSGSIFYGIGFTIVTWASLWWVRRRFKQSLASLPNE*
Ga0137399_109419921 | 3300012203 | Vadose Zone Soil | LAKEPRFTNEAKMVRSANALFVMFAAAPGLYLVIAVLATFGKPGFARDPNVIPWLFLGLALVSLANIGLTIFVQTSRKLMSERARYDPIGRAYLIASLGAVLSEAHAIYGLVLTLLSGSIFYGIGFTIIAWASLWWVRKRFKQSLSSLPDA*
Ga0137380_100924852 | 3300012206 | Vadose Zone Soil | MFPNKAEMVRRANTLLVMLAVAPVLYLVVAFLVTFGKPGFEGNPVFVSVFFVVLALVSLANIGFTVFIQTSKKLMSSRASIDPVGRTFHIMSLGAVLSEVHTVYGLALTLVSGSILYGIGFSIVTWASLWWVRKRFKQNLGNLPNTSAQPVLE*
Ga0137380_1055329123300012206Vadose Zone SoilMFPDKAKMVRSANFLFVTLAVAPVLYLVTAVLATFGKPGLARDPNIIPVLFLGLALVSLANIGLTVFVQTSTKLMSERARYDPINRAYLIMAAGAVLSEAHAIYGLVLTLLSGSVFYEIGFTIITWASLWWVRRRFGQSLASLPNA*
Ga0137380_1091247113300012206Vadose Zone SoilMFPNKAKMVRSANFLFVVFAVAPGVYLATAVLVTFGKPGFARDPNIILWLFLALALVSLASIGVTIFVQTSTKLMLERGRYDPTGRTYLIMSTGAVLSEAHAIYGFVLTLLSGSIFYGIGFTIVTWTSLWWVRKRFKQNLASLPDA*
Ga0137380_1127858713300012206Vadose Zone SoilMFPDKTKMVRSTAALFVILAVAPGLYLVLAALVTVGKPGFARDPNIIPWLFLGLALVSLANIGVTIFVQTSTKLMPERARYDPTGRTYLIISTGAILSEAHSIYGLVITLLSGSLFFGIGFTSVTWASLWWVRKRFKQNLGNLPNTSAQPVLE*
Ga0137381_1052202623300012207Vadose Zone SoilMIRMANLLFVMFAVAPVVYLATAVLVTFGRPGFARDPNIIPWLFLGLALVSLSSIGVTIFVQTSTKLMSERARYDPINRAYLIMATGAVLSETHAIYGLVLTLLSGSIFYEIGFTIVAWASLLWVRRRFRQSLASLPDA*
Ga0137379_1012629723300012209Vadose Zone SoilMFPDKAKMIRTANLLFVIFAVAPGVYLVTAVLVTFGKPGFARDLSIIPWLFLGLALVSLASIGVTIFVQTSTKLMLERGRYDPTGRTYLIMSTGAVLSEAHAIYGFVLTLLSGSIFYGIGFTIVTWTSLWWVRKRFKQNLASLPDA*
Ga0137379_1033948113300012209Vadose Zone SoilLFPNKAKMIRMANLLFVMFAVAPVVYLATAVLVTFGRPGFARDPNIIPWLFLGLALVSLSSIGVTIFVQTSTKLMSERARYDPINRAYLIMATGAVLSETHAIYGLVLTLLSGSIFYEIGFTIVAWASLLWVRRRFRQSL
Ga0137379_1098365613300012209Vadose Zone SoilMFPNKAKMVRNANFLFVTLAVAPVLYLVTAVLATFGKPGLARDPNIIPWLFLGLALVSLANIGVTVFVQSSTKLMSERARYDPINRAYLIMATGAVLSEAHAIYGLVLTLLSGSIFYEIGFTIVAWASLLWVRRRFRQSL
Ga0137378_1003690233300012210Vadose Zone SoilMFPNKAKVVRTANALLVMLAVAPGVYLVTAVLVTFGGPGFARDPNIILWLFVGIALFSLANIGVTVFVQTSTKLMAERARYDPIGRTYLVMSLGAVLSEAHAIYGLVLTLLSGSILYEIGFTIVTWASLWWIWKRFKQNLGNIPDTST*
Ga0137377_1003348433300012211Vadose Zone SoilMVRTANALLVMLAVVPGVYLVTAVLVTFGGPGFARDPNIILWLFVGIALFSLANIGVTVFVQTSTKLMAERARYDPIGRTYLVMSLGAVLSEAHAIYGLVLTLLSGSILYEIGFTIVTWASLWWIWKRFKQNLGNIPDTST*
Ga0137377_1037459423300012211Vadose Zone SoilMFPNKAKMVGSANFLFVVFAVAAGLYLVIAVLVTFGRPGFARDPNFIPWLFLGLAIVSLASIGLTVFVQTSTKLMSERGRYDPIGRTYMIVSIGAILSEAHAIYGLVLTLVSGSILYEIGFTIVTWTSLWWVWKRFKQSLASLPDV*
Ga0137387_1087887123300012349Vadose Zone SoilMFPNKAKMVRNANFLFVTLAVAPVLYLVTAVLATFGKPGLARDPNIIPWLFLGLALVSLANIGVTVFVQSSTKLMSERARYDPINRAYLIMATGAVLSEAHAIYGLVLTLLSSSIFYEIGFTIVTWASLWWVRKRFKQNL
Ga0137387_1118881013300012349Vadose Zone SoilMFPDKTKMVRSTAALFVILAVAPGLYLVLAALVTVGKPGFARDPNIIPWLFLGLALVSLANIGVTIFVQTSTKLMPERARYDPTGRTYLIMSTGAILSEAHSIYGLVITLLSGSLFFGIGFTSVTWASLWWVRKRFK
Ga0137386_1043394313300012351Vadose Zone SoilMQLRYVFSADSIRRRLGLVANKPMFPNKAKMVRTANALLVMLAVAPGVYLVTAVLVTFGGPGFARDPNIILWLFVGIALFSLANIGVTVFVQTSTKLMAERARYDPIGRTYLVMSLGAVLSEAHAIYGLVLTLLSGSILYEIGFTIVTWASLWWIWKRFKQNLGNIPDTST*
Ga0137386_1127856313300012351Vadose Zone SoilGLARKRMFPNKAEMVRRANTLLVMLAVAPVLYLVVAFLVTFGKPGFEGNPVFVSVFFVVLALVSLANIGFTVFIQTSKKLMSSRASIDPVGRTFHIMSLGAVLSEVHTVYGLALTLVSGSILYGIGFSIVTWASLWWVRKRFKQNLGNLPNTSAQPVLE*
Ga0137386_1131303713300012351Vadose Zone SoilNKPMFPDKTKMVRSTAALFVILAVAPGLYLVLAALVTVGKPGFARDPNIIPWLFLGLALVSLANIGVTIFVQTSTKLMPERARYDPTGRTYLIISTGAILSEAHSIYGLVITLLSGSLFFGIGFTSVTWASLWWVRKRFKQNLGNLPNTSAQPVLE*
Ga0137371_1035664213300012356Vadose Zone SoilANLLFVMFAVAPVVYLATAVLVTFGRPGFARDPNIIPWLFLGLALVSLSSIGVTIFVQTSTKLMSERARYDPIGRTYMIMSTGAVLSEAHAIYGLVLTLLSGSIFYGIGFTIVTWTSLWWVRKRFKQNLASLPDA*
Ga0137384_1015359233300012357Vadose Zone SoilMFPNKAKMVRNANFLFVTLAVAPVLYLVTAVLATFGKPGLARDPNIIPWLFLGLALVSLANIGVTVFVQSSTKLMSERARYDPINRAYLIMATGAVLSETHAIYGLVLTLLSGSIFYEIGFTIVAWASLLWVRRRFRQSLASLPDA*
Ga0137384_1072405723300012357Vadose Zone SoilANFLFVTLAVAPVLYLVTAVLATFGKPGLARDPNIIPVLFLGLALVSLANIGLTVFVQTSTKLMSERARYDPINRAYLIMAAGAVLSEAHAIYGLVLTLLSGSVFYEIGFTIITWASLWWVRRRFGQSLASLPNA*
Ga0137384_1105130223300012357Vadose Zone SoilMFPDKAKMIRTANLLFVMFAVAPVVYLATAVLVTFGRPGFARDPNIIPWLFLGLALVSLSSIGVTIFVQTSTKLMLERGRYDPTGRTYMIMSTGAVLSEAHAIYGLVLTLLSGSIFYEIGFTILAWASLLWVRRRFRQSLASLPDA*
Ga0137360_1010838613300012361Vadose Zone SoilMVRSANSLFVIFAVAPGLYLMTAFLATFGKPGLARDPNSIPWLFLGIAFFSLANIGVTIFVQTSTRLMSERARYDPIGRTYLITSTGAILSEAHAIFGLVLTLLSGSIFYGIGFTIVTWASLWWVWKRFKQSLASLPKE*
Ga0137361_1062720523300012362Vadose Zone SoilMFPNKAKMVRRANLLFVIFAFAPDLYLVTATLATLGKPGFARDPNVIPLLFLGLALVSLANISATVFVQTSRKLMLERARYDPIGRAYLIASLGAVLSEAHAIYGLVLTILSGSIFYGIGFTIVAWASLWWVRKLFKQNLASLPNE*
Ga0137361_1094726413300012362Vadose Zone SoilMFPNKAEMVRRANALLVMLAVAPVLYLVVAFLVTFGKPGFEGNPVFVSVFFVVLALVSLANIGFTVFIQTSKKLMSSRASIDPVGRTFHIMSLGAVLSEVHAVYGLALTLVSGSILYGIEFSIVTWASLWWVWKRFKQSLASLPNA*
Ga0137396_1005505223300012918Vadose Zone SoilMFPDKAKMVRSANALFVIFAVAPGLYLMLAALATFGKPGFARDPNIIPWLFLELALVSLANIGVTIFVQTSTKLMSERARYDPIGRTYLIMATGAVLSEAHAIYGLVLTLLSGSIFYGIGFTIVAWASLWWVWKRFKQSLASLPNE*
Ga0137396_1082840213300012918Vadose Zone SoilMFPNKAKMVRSANLLFVIFAVAPVLYLVTAALATFGKPGFARDPNIIPLLFLGLALVSLANIGLTIFVQTSTKIMMERARYDPIGRAYLIASLGSVQSEAHAIYGLVLTLLSGSIFYGIGFTIVTWASLWWVRTRFKQSLASLPDA*
Ga0137419_1004572623300012925Vadose Zone SoilMFPNKAKMVRRANLLFVIFAFAPDLYLVTATLATLGKPGFARDPNVIPLLFLGLALVSLANISATVFVQTSRKLMLERARYDPIGRAYLIASLGAVLSEAHAIYGLVLTLLSGSIFYGIGFTIVAWASLWWVRRRFKQNLASLPNE*
Ga0137416_1137394513300012927Vadose Zone SoilLAKEPRFTNEAKMVRSANALFVMFAAAPGLYLVIAVLATFGKPGFARDPNVIPWLFLGLALVSLANIGLTIFVQTSRKLMSERARYDPIGRAYLIASLGAVLSEAHAIYGLVLTLLSGSIFYGIGFTI
Ga0137410_1037975213300012944Vadose Zone SoilMFPNKAKMVRSANLLFVIFAVAPGLYLLLAALATLGKPGFARDPNIIPLLFFGLALVSLANIGVTVFVQTSRKLMSERASYDPINRTYLIMAIGAVLSEAHGIYGLVLTLLSGSIFYGIGFTIITWTSLWWVRRRFTQSLVSLPNA*
Ga0137418_1000196833300015241Vadose Zone SoilMFPNKAKMVRRANLLFVIFAFAPDLYLVTATLATLGKPGFARDPNVIPLLFLGLALVSLANISATVFVQTSRKLMLERARYDPIGRAYLIASLGAVLSEAHAIFGLVLSLLSGSIFYGIGFTIVAWASLWWVRRRFKQNLASLPNE*
Ga0134074_120083413300017657Grasslands SoilTANALLMTSAVAPGLYLVTAVLVTFGKPGLARDLNIIPWLFLGLALFSLANIGVTIFVQTSTKLMSERARYDPVGRTYLIMSTGAILSEAHAIYGLVLTLLSGSIFYEIGFTIVAWQVSGGSGNDSSRISETFQTRRPKR
Ga0066667_1037084323300018433Grasslands SoilLAKDRMFPNKGKMVRSANFLFVTLAVAPVLYLVTAVLATFGRPGLGVDPNIIPWLFIGLALVSLANIGVAVLVQSSTKLMSERARYDPINRAYLIMATGAVLSEAHAIYGLVLTLLSGSIFYGIGFTIVTWTSLWWVRKRFKQNLASLPDA
Ga0066662_1004791233300018468Grasslands SoilMFPNKAEMVRRANTLLVMLAVAPVLYLVVAFLVTFGKPGFEGNPVFVSVFFVVLALVSLANIGFTVFIQTSKKLMSSRASIDPVGRTFHIMSLGAVLSEVHAVYGLALTLVSGSILYGIGFTIVTWASLWWVRKRFKQNSASLPSAPEL
Ga0215015_1014156953300021046SoilLFPDKAKMVRVANVLFLILFFAPVLYLATAVQVTFGNMGLARDPNIIPVLFLGLALSSLANIGVIILFQTSKKVMSERARYDPIGRTYTIMTLGAVLSEAHAIYGMLLTLLSGSILYGIGFTIVAWASLWWVWKRFKQNLASLPTA
Ga0215015_1028162313300021046SoilDKMVRSANALLAIFSVAPALWLVIAFLVTFGKRGFARDPSIIPILFIGLALVSLASIRLTIYVQTSRKLMSSSARYDPIGRTFLIMSRGAIVSEAHAIYGLVLTLLSGSLFYGIGFTIVTWGSLWWVRKRFKKNLASLPNP
Ga0210404_1000361923300021088SoilMVGVVFQLIRRRLGDKWISPLFPDKAKMVRSANALPVIFAIAPVLWLVTAFLVTYGKPGFLRDPSFIPNLFIGLALVSLANIGLTVFLQTSKKVMSARSSYDPVDRVFLKMVTGSILSEAHAIYGLVLTLLSGPILYELGFTVVTWASLWWVWKRFKQNITSLPNR
Ga0207646_1002006243300025922Corn, Switchgrass And Miscanthus RhizosphereMVKGANALLVIFAVAPVLWPVIAFLVTLGNPGFARDPNIIPVLFIGLALVSLATIGFTVFIQTSRKLMSARARYDPIGRTFQLMSLGAILSEEHAIYGLVLSLLSGSIFYGIGFTIVTWASLWWIWKRFKQSLASLPNM
Ga0209237_102702753300026297Grasslands SoilMAIKPMFPDKTRMVRSANALFVIFAVAPGLYLMLAALVTFGKPGFARDPNIIPWLFLGLALFSLANIGVTVFVQTSTKLMSERARYDPIGRTYLIMATGAVLSEAHAIYGLVLMLLSGSTFYGIGFTILTWASLWWVWKRFKQNLASLPNA
Ga0209055_100529143300026309SoilLARKPMFPNKAEMVRRANTLLVMLAVAPVLYLVVAFLVTFGKPGFEGNPVFVSVFFAVLALVSLANIGFTVFIQTSKKLMSSRASIDPVGRTFHIMSLGAVLSEVHAVYGLALTLVSGSILYGIGFTIVTWASLWWVRKRFKQNSASLPSAPEL
Ga0209686_100543533300026315SoilMFPNKAKMVRSANFLFVILAVAPGLYLVTALLVTDGKQGLARDSTIIPFLFLGLALFSLANIGVTILFQTSTKLMSERARYDPIGRAYLIASLGAVLSEAHAIYGLVLTLLSGSIFYGIGFTIIA
Ga0209686_104998523300026315SoilVVYPATAVLATVGRPGFARDPNVIPWLFLGLALVSLADIGVTVFVQTSRKLMLERARYDPINRAYLIMATGAVLSEAHAIYGQVLTLLSGSIFYGIGLTFVTWASLWWVRKRFKQSLESVPNG
Ga0209154_100328563300026317SoilMFPNKAEMVRRANTLLVMLAVAPVLYLVVAFLVTFGKPGFEGNPVFVSVFFAVLALVSLANIGFTVFIQTSKKLMSSRASIDPVGRTFHIMSLGAVLSEVHAVYGLALTLVSGSILYGIGFTIVTWASLWWVRKRFKQNSASLPSAPEL
Ga0209804_132386813300026335SoilELPFRCICRCAVVYPATAVLATVGRPGFARDPNVIPWLFLGLALVSLADIGVTVFVQTSRKLMLERARYDPINRAYLIMATGAVLSEAHAIYGQVLTLLSGSIFYGIGLTFVTWASLWWVRKRFKQSLESVPNG
Ga0209690_115368813300026524SoilNKAEMVRRANTLLVMLAVAPVLYLVVAFLVTFGKPGFEGNPVFVSVFFAVLALVSLANIGFTVFIQTSKKLMSSRASIDPVGRTFHIMSLGAVLSEVHAVYGLALTLVSGSILYGIGFTIVTWASLWWVRKRFKQNSASLPSAPEL
Ga0209805_104321523300026542SoilVYPATAVLATVGRPGFARDPNVIPWLFLGLALVSLADIGVTVFVQTSRKLMLERARYDPINRAYLIMATGAVLSEAHAIYGQVLTLLSGSIFYGIGLTFVTWASLWWVRKRFKQSLESVPNG
Ga0209648_1042934013300026551Grasslands SoilLFPDKAKMVRSANLLFVTFAVAPVLYLVLDFLATFGKPGFARDPSIIPLLFVGLALVSLASIGVTIFIETSTRLMSEMARYDPIGRTFQTMFLGAVLSEAHAIFGQVLTLLSGSILYGIGFTIVTWASLWWVQRRFKQNLGSLDASLY
Ga0209076_100200843300027643Vadose Zone SoilMFPNKAKMVRRANLLFVIFAFAPDLYLVTATLATLGKPGFARDPNVIPWLFLGLVLVSLANIGATVFVQTSRKLMLERARYDPIGRAYLIASLGAVLSEAHAIYGLVLTLLSGSIFYGIGFTIVAWASLWWVRRRFKQNLASLPNE
Ga0209076_100494023300027643Vadose Zone SoilLFPDKAKMVRSAITLFVIWFVAPVLYLVTAGLATFGKPGFARDPNIIPWLFLGIALFSLANVGVTIFVQTSKRLMSERARYDPIGRTYLIMSTGAILSEAHSIFGLVLTLLSGSIFYGIGFAIVTWASLWWVRRRFKQSLASLPNE
Ga0209588_1000037383300027671Vadose Zone SoilLFPDKAKMVRSANALFVTLAVATGVYLVIAILLTFGKPGFARDPNIIPWLFLGLALVSLANIGVTVFVQTSKKLMSERLRYDPISRTYLIMSTGAILSEAHAIYGLVLTLLSGSVFYGVGFTIVALASLWWVWKRFKQSLASLPNE
Ga0209180_1000586393300027846Vadose Zone SoilLFPDKAQMVRSANLLFVIFAVAPVLYLVTAFLATFERSGLARDPSIIPVLFIGLALVSLTTIRLTVFVQTSKKVMPDRARYDPISRTYLIMSTGSILSEAHSIYGLVLTLLSGSILYGIGFSILAWASLWWVRKRFKRNLASLPDA
Ga0137415_1021012023300028536Vadose Zone SoilLFPDKAKMVRSAITLFVIWFVAPVLYLVTAGLATFGKPGFARDPNIIPWLFLGIALFSLANVGVTIFVQTSKRLMSERARYDPIGRTYLIMSTGAILSEAHSIFGLVLTLLSGSIFYGIGFTIVTWASLWWVRRRFKQSLASLPNE
Ga0137415_1096728513300028536Vadose Zone SoilMFPNKAKMVRRANLLFVIFAFAPDLYLVTATLATLGKPGFARDPNVIPLLFLGLALVSLANISATVFVQTSRKLMLERARYDPIGRAYLIASLGAVLSEAHAIFGLVLSLLSGSIFYGIGFTIVTWASLWWVRKRFKQNLPKLPNA
Ga0307479_10000401363300031962Hardwood Forest SoilLARKPMFPNKAAMVRRANVLFVMLAVAPVFYLMIAFLVTFGKSGFAGNPVSISVFFIVLASVSAANIGFTVFTQTSKKLMSARARHDPVGRIFQVMSLGAVLSEVHAVYGLALTLLAGSIMYGIGFCIVTWASLWWVRIRFKQNLANIPNT
Ga0307479_1001548323300031962Hardwood Forest SoilMFPNKAAMVRRANVLLVMLAVAPVLYLVIAFLVTFGKPGFAGNPVFISVFFTVLASVSVANIGFTVFTQTSKKLMSARAHYDPVGRIFQVMSLGAVLSEVHAVYGLALTLLAGSIMYGIGFCILTWASLWWVRKRFKQNLARLPERVDPTRISEL
Ga0307479_1007914833300031962Hardwood Forest SoilMFPNRAEMVRRANVLLVMLALAPVLYLVIAFLVTFGKPGFGGNPLFISVFFIVLASVSVANVGFTVFTQTSKKLMSARARYDPVGRIFQVMSLGAVLSEVYAVYGLALTLLAGSIMYGIGFCIITWASLWWVRRRFKQNLANLPNA
Ga0307471_10007406323300032180Hardwood Forest SoilMFPNRAAMVRRANALLVMLAIAPVLYLVIAFLVTFGKPGFSGNPIFISVFFTVLASVSVANIGFTVFTQTSKKLMSARARYDPVGRIFQVMSLGAVLSEVHAIYGLALTLLAGSIMYGIGFCILTWASLWWVRIRFKQNLANLPNA




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice