NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F077824


Overview

Basic Information
Family ID F077824
Family Type Metagenome / Metatranscriptome
Number of Sequences 117
Average Sequence Length 98 residues
Representative Sequence MKTRILLPVLLLSAVFVSPASANWFSNPNWNINLHIGSAPNPTPDDVRAERQPMLVRDADGNVIAMIDPATGKMIAIAEPPAPAQPSRGIANNAPAAAPAR
Number of Associated Samples 98
Number of Associated Scaffolds 117

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 52.99 %
% of genes near scaffold ends (potentially truncated) 42.74 %
% of genes from short scaffolds (< 2000 bps) 87.18 %
Associated GOLD sequencing projects 91
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.29
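
The percentages above can be turned back into approximate gene counts. The sketch below assumes each percentage was computed over the 117 family members (one gene per sequence); the function name `approx_count` and the dictionary labels are illustrative and not part of NMPFamsDB.

```python
# Minimal sketch: convert the reported quality percentages into approximate
# gene counts, assuming they are fractions of the 117 family members.
FAMILY_SIZE = 117  # "Number of Sequences" reported for F077824

def approx_count(percent: float, total: int) -> int:
    """Return the nearest whole number of genes for a percentage of `total`."""
    return round(percent / 100 * total)

# Percentages copied from the Quality Assessment block.
quality = {
    "valid RBS motif": 52.99,
    "near scaffold end": 42.74,
    "short scaffold (< 2000 bp)": 87.18,
}

counts = {label: approx_count(pct, FAMILY_SIZE) for label, pct in quality.items()}

for label, n in counts.items():
    print(f"{label}: ~{n} of {FAMILY_SIZE} genes")
```

Under that assumption, roughly 62, 50, and 102 genes fall into the three categories, so most family members come from short scaffolds and many may be truncated.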


Most Common Taxonomy
Group Unclassified (51.282 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(23.077 % of family members)
Environment Ontology (ENVO) Unclassified
(40.171 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(47.863 % of family members)



Multiple Sequence Alignments


Structure & Topology

Predicted Secondary Structure and Topology

Classification: Fibrous
Signal Peptide: Yes
Secondary Structure distribution: α-helix: 17.83%, β-sheet: 11.63%, Coil/Unstructured: 70.54%

Predicted 3D Structure

Per-residue confidence (pLDDT) bands: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.29

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
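
The confidence rules stated in this section (pLDDT bands of 0-50, 51-70, 71-90, 91-100 and the pTM < 0.7 cutoff for a low confidence model) can be sketched as two small helpers. The function names and the handling of fractional pLDDT values at band edges are assumptions made for illustration; they are not taken from the NMPFamsDB code.

```python
# Minimal sketch of the confidence rules shown on this page.
# Assumption: a fractional pLDDT such as 50.5 falls into the next band up.
PLDDT_BANDS = [(50, "0-50"), (70, "51-70"), (90, "71-90"), (100, "91-100")]

def plddt_band(score: float) -> str:
    """Map a per-residue pLDDT score (0-100) to the band label used in the legend."""
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must be between 0 and 100")
    for upper, label in PLDDT_BANDS:
        if score <= upper:
            return label
    raise AssertionError("unreachable: score already validated")

def is_low_confidence(ptm: float, threshold: float = 0.7) -> bool:
    """Return True when the pTM-score is below the page's 0.7 cutoff."""
    return ptm < threshold

print(plddt_band(83.2))         # band for a single residue
print(is_low_confidence(0.29))  # pTM-score reported for F077824
```

With the reported pTM-score of 0.29, `is_low_confidence` returns `True`, which matches the "Low Quality Model" notice above.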



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 117 Family Scaffolds
PF08238 | Sel1 | 23.08
PF01979 | Amidohydro_1 | 8.55
PF13360 | PQQ_2 | 2.56
PF08241 | Methyltransf_11 | 1.71
PF04255 | DUF433 | 1.71
PF00497 | SBP_bac_3 | 0.85
PF00478 | IMPDH | 0.85
PF07690 | MFS_1 | 0.85
PF00456 | Transketolase_N | 0.85
PF11730 | DUF3297 | 0.85
PF13975 | gag-asp_proteas | 0.85
PF07040 | DUF1326 | 0.85
PF13581 | HATPase_c_2 | 0.85
PF01546 | Peptidase_M20 | 0.85

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 117 Family Scaffolds
COG2442 | Predicted antitoxin component of a toxin-antitoxin system, DUF433 family | Defense mechanisms [V] | 1.71
COG0021 | Transketolase | Carbohydrate transport and metabolism [G] | 0.85
COG3959 | Transketolase, N-terminal subunit | Carbohydrate transport and metabolism [G] | 0.85
COG5588 | Uncharacterized conserved protein, DUF1326 domain | Function unknown [S] | 0.85



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 51.28 %
All Organisms | root | All Organisms | 48.72 %


Associated Scaffolds


Scaffold | Taxonomy | Length
3300002907|JGI25613J43889_10200207 | Not Available | 535
3300003324|soilH2_10090986 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1805
3300004120|Ga0058901_1447447 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Burkholderiaceae → Burkholderia → pseudomallei group → Burkholderia pseudomallei | 564
3300004153|Ga0063455_101617078 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Burkholderiaceae → Burkholderia → pseudomallei group → Burkholderia pseudomallei | 511
3300005165|Ga0066869_10114709 | Not Available | 552
3300005171|Ga0066677_10766614 | Not Available | 536
3300005330|Ga0070690_100553463 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 868
3300005331|Ga0070670_102214754 | Not Available | 506
3300005332|Ga0066388_100526327 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1815
3300005335|Ga0070666_11324468 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 537
3300005338|Ga0068868_100366373 | All Organisms → cellular organisms → Bacteria | 1237
3300005355|Ga0070671_100801440 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 820
3300005364|Ga0070673_100958562 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 795
3300005365|Ga0070688_100496950 | Not Available | 919
3300005456|Ga0070678_100688080 | Not Available | 920
3300005458|Ga0070681_10968669 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 770
3300005533|Ga0070734_10104101 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1664
3300005561|Ga0066699_10052900 | All Organisms → cellular organisms → Bacteria | 2504
3300005576|Ga0066708_10097940 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1737
3300006050|Ga0075028_100404584 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → unclassified Betaproteobacteria → Betaproteobacteria bacterium | 781
3300006163|Ga0070715_10826548 | Not Available | 564
3300006176|Ga0070765_100238393 | All Organisms → cellular organisms → Bacteria | 1664
3300006358|Ga0068871_101095909 | Not Available | 744
3300006794|Ga0066658_10314886 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 836
3300006800|Ga0066660_10425032 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1104
3300007788|Ga0099795_10051005 | Not Available | 1507
3300009143|Ga0099792_10034844 | All Organisms → cellular organisms → Bacteria | 2360
3300009148|Ga0105243_10462622 | Not Available | 1193
3300009174|Ga0105241_10224108 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1582
3300009176|Ga0105242_10994022 | Not Available | 846
3300010047|Ga0126382_10711835 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 844
3300010359|Ga0126376_10237897 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 1539
3300010359|Ga0126376_10380554 | All Organisms → cellular organisms → Bacteria | 1263
3300010362|Ga0126377_12435431 | Not Available | 599
3300010373|Ga0134128_12073086 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 626
3300010376|Ga0126381_102922687 | Not Available | 680
3300010397|Ga0134124_12214932 | Not Available | 589
3300010400|Ga0134122_10000674 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 23522
3300010400|Ga0134122_11276652 | Not Available | 740
3300010401|Ga0134121_12544637 | Not Available | 555
3300010857|Ga0126354_1071184 | Not Available | 631
3300010858|Ga0126345_1211514 | Not Available | 563
3300010859|Ga0126352_1223059 | All Organisms → cellular organisms → Bacteria | 961
3300010861|Ga0126349_1003169 | Not Available | 502
3300011120|Ga0150983_14226469 | Not Available | 574
3300011120|Ga0150983_14257674 | Not Available | 550
3300011120|Ga0150983_15369620 | All Organisms → cellular organisms → Bacteria | 641
3300012189|Ga0137388_11956802 | All Organisms → cellular organisms → Bacteria | 515
3300012200|Ga0137382_10516082 | Not Available | 850
3300012203|Ga0137399_10023121 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 4122
3300012205|Ga0137362_10099369 | Not Available | 2449
3300012207|Ga0137381_11279428 | Not Available | 626
3300012208|Ga0137376_11504931 | Not Available | 564
3300012208|Ga0137376_11631708 | Not Available | 536
3300012209|Ga0137379_10214540 | Not Available | 1839
3300012210|Ga0137378_11047201 | Not Available | 730
3300012211|Ga0137377_10150467 | All Organisms → cellular organisms → Bacteria | 2229
3300012211|Ga0137377_10403308 | Not Available | 1305
3300012212|Ga0150985_107967875 | Not Available | 719
3300012212|Ga0150985_109992934 | Not Available | 686
3300012212|Ga0150985_111257300 | Not Available | 504
3300012212|Ga0150985_116945145 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 584
3300012357|Ga0137384_10196903 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1686
3300012357|Ga0137384_11209413 | Not Available | 600
3300012469|Ga0150984_123373664 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 586
3300012683|Ga0137398_10042069 | Not Available | 2662
3300012917|Ga0137395_10206004 | All Organisms → cellular organisms → Bacteria | 1371
3300012922|Ga0137394_10530785 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 998
3300012922|Ga0137394_10537970 | Not Available | 991
3300012925|Ga0137419_11404501 | Not Available | 589
3300012925|Ga0137419_11575458 | All Organisms → cellular organisms → Bacteria | 558
3300012929|Ga0137404_11117581 | Not Available | 723
3300012930|Ga0137407_11636776 | Not Available | 613
3300012944|Ga0137410_10202205 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 1539
3300012944|Ga0137410_10646000 | All Organisms → cellular organisms → Bacteria | 878
3300012944|Ga0137410_11334829 | All Organisms → cellular organisms → Bacteria | 621
3300012960|Ga0164301_10061772 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiaceae → unclassified Hyphomicrobiaceae → Hyphomicrobiaceae bacterium | 1987
3300012984|Ga0164309_10523331 | All Organisms → cellular organisms → Bacteria | 912
3300013297|Ga0157378_12868054 | Not Available | 534
3300014969|Ga0157376_11220875 | All Organisms → cellular organisms → Bacteria | 780
3300015371|Ga0132258_11002225 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 2111
3300015372|Ga0132256_102792259 | Not Available | 587
3300016319|Ga0182033_11375177 | Not Available | 635
3300017974|Ga0187777_10939609 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodospirillales → unclassified Rhodospirillales → Rhodospirillales bacterium RIFCSPLOWO2_12_FULL_58_28 | 623
3300019890|Ga0193728_1182313 | Not Available | 897
3300020068|Ga0184649_1360693 | All Organisms → cellular organisms → Bacteria | 657
3300021363|Ga0193699_10001599 | All Organisms → cellular organisms → Bacteria | 8893
3300021560|Ga0126371_13773085 | Not Available | 511
3300022532|Ga0242655_10324485 | Not Available | 507
3300022724|Ga0242665_10369886 | Not Available | 517
3300024331|Ga0247668_1014813 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 1615
3300025925|Ga0207650_10824830 | Not Available | 786
3300025925|Ga0207650_11430997 | Not Available | 588
3300025931|Ga0207644_10590432 | Not Available | 922
3300025960|Ga0207651_11224821 | Not Available | 674
3300026023|Ga0207677_10574233 | All Organisms → cellular organisms → Bacteria | 986
3300026320|Ga0209131_1090702 | Not Available | 1629
3300026320|Ga0209131_1318970 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 578
3300027815|Ga0209726_10282089 | All Organisms → cellular organisms → Bacteria | 821
3300027826|Ga0209060_10061949 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1784
3300028536|Ga0137415_10476536 | All Organisms → cellular organisms → Bacteria | 1055
3300028556|Ga0265337_1005392 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 5076
3300029636|Ga0222749_10017267 | Not Available | 2957
3300031231|Ga0170824_101544895 | Not Available | 529
3300031231|Ga0170824_102298152 | Not Available | 716
3300031231|Ga0170824_102724545 | Not Available | 524
3300031231|Ga0170824_116367223 | Not Available | 524
3300031344|Ga0265316_10606943 | Not Available | 776
3300031421|Ga0308194_10303374 | Not Available | 554
3300031446|Ga0170820_11663551 | Not Available | 506
3300031740|Ga0307468_100099241 | All Organisms → cellular organisms → Bacteria | 1725
3300031753|Ga0307477_10000479 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 44067
3300031879|Ga0306919_11384751 | Not Available | 531
3300031912|Ga0306921_10018896 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 7526
3300032261|Ga0306920_103220392 | Not Available | 611
3300032783|Ga0335079_10163062 | All Organisms → cellular organisms → Bacteria | 2494
3300033433|Ga0326726_10043218 | All Organisms → cellular organisms → Bacteria | 3936




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 23.08%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Switchgrass Rhizosphere | 6.84%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 5.13%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 4.27%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 4.27%
Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Forest Soil | 4.27%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 3.42%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 3.42%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 3.42%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 3.42%
Avena Fatua Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Avena Fatua Rhizosphere | 3.42%
Boreal Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Boreal Forest Soil | 3.42%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 2.56%
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 1.71%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 1.71%
Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 1.71%
Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere | 1.71%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Miscanthus Rhizosphere | 1.71%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 1.71%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 1.71%
Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Rhizosphere | 1.71%
Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 0.85%
Groundwater | Environmental → Aquatic → Freshwater → Groundwater → Contaminated → Groundwater | 0.85%
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 0.85%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 0.85%
Sugarcane Root And Bulk Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Sugarcane Root And Bulk Soil | 0.85%
Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 0.85%
Tropical Peatland | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Tropical Peatland | 0.85%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 0.85%
Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Soil | 0.85%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 0.85%
Corn Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere | 0.85%
Peat Soil | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Peat Soil | 0.85%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 0.85%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizoplane → Soil → Unclassified → Miscanthus Rhizosphere | 0.85%
Miscanthus Rhizosphere | Host-Associated → Plants → Roots → Rhizosphere → Soil → Miscanthus Rhizosphere | 0.85%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere | 0.85%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Arabidopsis Rhizosphere | 0.85%
Avena Fatua Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Avena Fatua Rhizosphere | 0.85%




Associated Samples

Taxon OID | Sample Name | Habitat Type
3300002907 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_40cm | Environmental
3300003324 | Sugarcane bulk soil Sample H2 | Environmental
3300004120 | Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF238 (Metagenome Metatranscriptome) | Environmental
3300004153 | Grasslands soil microbial communities from Hopland, California, USA (version 2) | Environmental
3300005165 | Soil and rhizosphere microbial communities from Laval, Canada - mgHMC | Environmental
3300005171 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126 | Environmental
3300005330 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3H metaG | Environmental
3300005331 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S6-3 metaG | Host-Associated
3300005332 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 6 (Hybrid Assembly) | Environmental
3300005335 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S2-3 metaG | Host-Associated
3300005338 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M4-2 | Host-Associated
3300005355 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S7-3 metaG | Host-Associated
3300005364 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-3 metaG | Host-Associated
3300005365 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S1-3H metaG | Environmental
3300005456 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M7-3 metaG | Host-Associated
3300005458 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C8-3B metaG | Environmental
3300005533 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen06_05102014_R1 | Environmental
3300005561 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148 | Environmental
3300005576 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_157 | Environmental
3300006050 | Freshwater sediment microbial communities from Pennsylvania, USA - Little Laurel Run_MetaG_LLR_2014 | Environmental
3300006163 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L11-1 metaG | Environmental
3300006176 | Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 | Environmental
3300006358 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M7-2 | Host-Associated
3300006794 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_107 | Environmental
3300006800 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 | Environmental
3300007788 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_2 | Environmental
3300009143 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2 | Environmental
3300009148 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M3-4 metaG | Host-Associated
3300009174 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-4 metaG | Host-Associated
3300009176 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-4 metaG | Host-Associated
3300010047 | Tropical forest soil microbial communities from Panama - MetaG Plot_30 | Environmental
3300010359 | Tropical forest soil microbial communities from Panama - MetaG Plot_15 | Environmental
3300010362 | Tropical forest soil microbial communities from Panama - MetaG Plot_22 | Environmental
3300010373 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-4 | Environmental
3300010376 | Tropical forest soil microbial communities from Panama - MetaG Plot_28 | Environmental
3300010397 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-4 | Environmental
3300010400 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2 | Environmental
3300010401 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1 | Environmental
3300010857 | Boreal forest soil eukaryotic communities from Alaska, USA - W1-3 Metatranscriptome (Eukaryote Community Metatranscriptome) | Environmental
3300010858 | Boreal forest soil eukaryotic communities from Alaska, USA - C3-2 Metatranscriptome (Eukaryote Community Metatranscriptome) | Environmental
3300010859 | Boreal forest soil eukaryotic communities from Alaska, USA - C5-5 Metatranscriptome (Eukaryote Community Metatranscriptome) | Environmental
3300010861 | Boreal forest soil eukaryotic communities from Alaska, USA - C4-5 Metatranscriptome (Eukaryote Community Metatranscriptome) | Environmental
3300011120 | Combined assembly of Microbial Forest Soil metaT | Environmental
3300012189 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG | Environmental
3300012200 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaG | Environmental
3300012203 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG | Environmental
3300012205 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG | Environmental
3300012207 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_115_16 metaG | Environmental
3300012208 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaG | Environmental
3300012209 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaG | Environmental
3300012210 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaG | Environmental
3300012211 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaG | Environmental
3300012212 | Combined assembly of Hopland grassland soil | Host-Associated
3300012357 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaG | Environmental
3300012469 | Combined assembly of Soil carbon rhizosphere | Host-Associated
3300012683 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaG | Environmental
3300012917 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaG | Environmental
3300012922 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaG | Environmental
3300012925 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly) | Environmental
3300012929 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly) | Environmental
3300012930 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly) | Environmental
3300012944 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly) | Environmental
3300012960 | Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_231_MG | Environmental
3300012984 | Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_247_MG | Environmental
3300013297 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M6-5 metaG | Host-Associated
3300014969 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M4-5 metaG | Host-Associated
3300015371 | Combined assembly of cpr5 and col0 rhizosphere and soil | Host-Associated
3300015372 | Soil combined assembly | Host-Associated
3300016319 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - timezero.00C.oxic.00.000.00H | Environmental
3300017974 | Tropical peat soil microbial communities from peatlands in Department of Meta, Colombia - 1015_Q2_SP5_10_MG | Environmental
3300019890 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1c1 | Environmental
3300020068 | Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_65 (Metagenome Metatranscriptome) | Environmental
3300021363 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - L3c2 | Environmental
3300021560 | Tropical forest soil microbial communities from Panama - MetaG Plot_4 | Environmental
3300022532 | Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-4-M (Metagenome Metatranscriptome) (v2) | Environmental
3300022724 | Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-17-M (Metagenome Metatranscriptome) (v2) | Environmental
3300024331 | Soil microbial communities from Purdue University Martell Research Forest, Indiana, United States - CNK09 | Environmental
3300025925 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S6-3 metaG (SPAdes) | Host-Associated
3300025931 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S7-3 metaG (SPAdes) | Host-Associated
3300025960 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M2-3 metaG (SPAdes) | Host-Associated
3300026023 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Miscanthus M4-2 (SPAdes) | Host-Associated
3300026320 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_40cm (SPAdes) | Environmental
3300027815 | Subsurface groundwater microbial communities from S. Glens Falls, New York, USA - GMW46 contaminated, 5.4 m (SPAdes) | Environmental
3300027826 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen06_05102014_R1 (SPAdes) | Environmental
3300028536 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly) | Environmental
3300028556 | Rhizosphere microbial communities from Carex aquatilis grown in University of Washington, Seatle, WA, United States - 4-21-22 metaG | Host-Associated
3300029636 | Metatranscriptome of lab incubated forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M (Metagenome Metatranscriptome) | Environmental
3300031231 | Coassembly Site 11 (all samples) - Champenoux / Amance forest | Environmental
3300031344 | Rhizosphere microbial communities from Carex aquatilis grown in University of Washington, Seatle, WA, United States - 4-5-22 metaG | Host-Associated
3300031421 | Metatranscriptome of soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_RNA_195 (Metagenome Metatranscriptome) | Environmental
3300031446 | Fir Summer Coassembly Site 11 - Champenoux / Amance forest | Environmental
3300031740 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05 | Environmental
3300031753 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_515 | Environmental
3300031879 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.172 (v2) | Environmental
3300031912 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - statoxic.12C.oxic.44.000.080 (v2) | Environmental
3300032261 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux4day.12C.oxic.44.000.170 (v2) | Environmental
3300032783 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.3 | Environmental
3300033433 | Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF15MN | Environmental

Geographical Distribution

Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI25613J43889_102002071 | 3300002907 | Grasslands Soil | MKTPIVLSALLLPAIVSPASANWFSNPDWNINLHIGSAPSPTPEDIRLEKRPMLVRDSDGNIIAMIDPVTGKVIATVEPSARLNAAAAKIAPPP
soilH2_100909862 | 3300003324 | Sugarcane Root And Bulk Soil | MKTRLILSVLLFSVAFVSSASANWFSNPDWNINLHVGSAPNPTPEDIRTGQQPMLVRDADGNVIAMIDPASGKVLAIAEPPATAQPSSRTANVVKAATPSR*
Ga0058901_14474471 | 3300004120 | Forest Soil | MKTRILLPVMLLSAVFVSPASANWFSNPSVGIDLNIGSTRSPTPDEVRADRQPMLVRDADGNVIAMIDPTTGKMIAIAEPSAAAQPKNGGAKVTPAAARAH*
Ga0063455_1016170781 | 3300004153 | Soil | MKTRLVLPVLLFSLPFVSPASANWFSNPDWNINLHVGSAPSPRPDDIRAGRQPMLVRDADGNIIAMIDPSSGTVIAVAEPPATAQPISRAATVKAAAPAR*
Ga0066869_101147091 | 3300005165 | Soil | MKTRILLPMLLLSAVFVSPASANWFSNPDVNINLNLGSAPNPTPGDIRSDRQPMLVRDAEGNVIAMIDPTTGKMIAIAEPSAAAQPKNGAAKVAPAAAHAR*
Ga0066677_107666141 | 3300005171 | Soil | MRAHLILTGLIVASVAASPASANWFSNPNIDINLNIGSAPNPTPDDVLVERLPMLVRDADGNVIAMIDSTSGKVIATAEPSPTANAKPAAAQPAQARSSGR*
Ga0070690_1005534632 | 3300005330 | Switchgrass Rhizosphere | MSTRLILLALLLSGVAASPASANWFSNPAWNINLHIGSAPSPTPDDIRVGRQPMLVRDADGNIIAMIDPATGKIIATAEPPPSAANGVVTAKPAAAPAR*
Ga0070670_1022147542 | 3300005331 | Switchgrass Rhizosphere | MTTRILLPMLLLSAVFVSPASANWFANPSWNINLHIGSAPNPTPDDVRSERQPMLVRDADGNIIAMIDPASGKIIATAEPPATAQATKPAAAGR*
Ga0066388_1005263272 | 3300005332 | Tropical Forest Soil | MKTRIALSALLLTTVFVSPASANWFSNLDWNINLNIGSAPSPTPDDIRNMRQPMLVRDADGNVIAMIDPATGKVLATAEVPAKANGGAPKTVPAAAPAR*
Ga0070666_113244681 | 3300005335 | Switchgrass Rhizosphere | SGVAASPASANWFSNPAWNINLHIGSAPSPTPDDIRVGRQPMLVRDADGNIIAMIDPATGKIIATAEPPPSAANGVVTAKPAAAPAR*
Ga0068868_1003663734 | 3300005338 | Miscanthus Rhizosphere | NWFANPSWNINLHIGSAPNPTPDDVRSERQPMLVRDADGNIIAMIDPASGKIIATAEPPATAQATKPAAAGR*
Ga0070671_1008014401 | 3300005355 | Switchgrass Rhizosphere | LLALLLSGVAASPASANWFSNPAWNINLHIGSAPSPTPDDIRVGRQPMLVRDADGSIIAMIDPATGKIIATAEPPPSAANGVVTAKPAAAPAR*
Ga0070673_1009585622 | 3300005364 | Switchgrass Rhizosphere | MSIRLILLALLLSGVAASPASANWFSNPAWNINLHIGSAPSPTPDDIRVGRQPMLVRDADGNIIAMIDPATGKIIATAEPPPSAANGVVTAKPAAAPAR*
Ga0070688_1004969501 | 3300005365 | Switchgrass Rhizosphere | MKMRLIWSVLLLSVALASSASANWFSNPAWNINLHIGSAPNPTPDDIRVGRQPMLVRDADGNVIAMIDPASGKIIAVAEPPPA
Ga0070678_1006880802 | 3300005456 | Miscanthus Rhizosphere | MKMRLIWSVLLLSVALASSASANWFSNPAWNINLHIGSAPNPTPDDIRVGRQPMLVRDADGNVIAMIDPASGKIIAVAEPPPAQASSQTVNVTKAAAPVR*
Ga0070681_109686691 | 3300005458 | Corn Rhizosphere | MKTRLILPALLVSVAFVSPAFGNWFSNPDWNINLHIGSAPGPTPEDIRAAHQPMLVRDSDGNVIAMIDPSSGKVLATAEPPPSASQPRNAVVPAKAAAAPAR*
Ga0070734_101041012 | 3300005533 | Surface Soil | MKTRLILPTLLVSVVFVSPAFGNWFSNPDWNINLHVGSAPNPTPDDVRAGHQPMLVRDADGNVIAMIDPTTGKVIATAEPPPNAAQPRNALVPAKATASPAR*
Ga0066699_100529003 | 3300005561 | Soil | MRAHLILTGLIVASVAASPASANWFSNPNIDINLNIGSAPNPTPDDVLVERLPMLVRDADGNVIAMIDSASGKVIATAEPSASAKKPAATPAPQARSSGR*
Ga0066708_100979402 | 3300005576 | Soil | MRAYLILTGLIVASVAASPASANWFSNPNIDINLNIGSAPNPTPDDVLVERQPMLVRDADGNVIAMIDSTSGKVIATVEPSPSAKKPAATQTPQARSSGR*
Ga0075028_1004045842 | 3300006050 | Watersheds | MKTRMLLPVLMLSAAFVSPASANWFSNPDVNINLNIGSAPNPTPGDIRSDRQPMLVRDAEGNVIAMIDPTTGKMIAIAEPSAAAQPKNGAAKVAPAAAHAR*
Ga0070715_108265482 | 3300006163 | Corn, Switchgrass And Miscanthus Rhizosphere | MSTRLILLALFLSGAAASPAFANWFSNPAWNINLHIGSAPSPTPDDIRVGRQPMLVRDADGNIIAMIDPVTGKVIATAEPPPSAARSRGVATPAKAVAAPLR*
Ga0070765_1002383932 | 3300006176 | Soil | MKTRILLPVMLLSAVFVSPASANWFSNPSVGIDLNIGSTRSPTPDEVRADRQPMLVRDADGNVIAMIDPTTGKMIAIAEPSATAQPKNGAAKVAPAAARAR*
Ga0068871_1010959092 | 3300006358 | Miscanthus Rhizosphere | MKTRLILPTILVSLAFVSPALGNWFSNPDWNINLHVGSAPNPTPDDIRAARQPMLVRDADGNIIAMIDPVSGKVIATAEPPPAAAQSIKAAVPAKAAAPGR*
Ga0066658_103148862 | 3300006794 | Soil | MRAHLILTGLIVASVAAPPASANWFSNPNIDINLNIGSAPNPTPDDVLVERLPMLVRDADGNVIAMIDSASGKVIATAEPSPSAKKPAATQTPQARSSGR*
Ga0066660_104250322 | 3300006800 | Soil | MKRHILLSALLLSAVFVSPASANWFSNPQININLNIGSAPSPTPDDVRGERQPMLVRDAEGNVIAMIDPATGKVIATAEPQAKNNGAAKIAPAAAPTR*
Ga0099795_100510052 | 3300007788 | Vadose Zone Soil | MESMMKTRILLPVLLLSAVFVSPASANWFSNPSLGIDLNIGSAPSPTPDQVRADRQPMLVRDAEGNVIAMIDPATGKMIAIAEPTAAAQKSNNAAKATPAAAKAH*
Ga0099792_100348442 | 3300009143 | Vadose Zone Soil | MESMMKTRILLPVLLLSAVFVSPASANWFSNPSLGIDLNIGSAPSPTPDQVRADRQPMLVRDAEGNVIAMIDPATGKMIAIAEPTAAAQKSNNAAKVTPAAARAH*
Ga0105243_104626222 | 3300009148 | Miscanthus Rhizosphere | MEYKMSTRLILVALLLSGAAASPAFANWFSNPSWNINLHVGSAPNPTPDDIRVGRQPMLVRDADGNIIAMIDPATGKIIATAEPPPTAANGVVAAKPAAAPVR*
Ga0105241_102241082 | 3300009174 | Corn Rhizosphere | MEYKMSTRLILLALLLSGVAASPASANWFSNPAWNINLHIGSAPSPTPDDIRVGRQPMLVRDADGNIIAMIDPATGKIIATAEPPPSAANGVVTAKPAAAPAR*
Ga0105242_109940221 | 3300009176 | Miscanthus Rhizosphere | MEYKMSTRLILLALLLSGVAASPASANWFSNPAWNINLHIGSAPSPTPDDIRVGRQPMLVRDADGNIIAMIDPATGKIIATAEPPPSAANGVVTAKPAA
Ga0126382_107118352 | 3300010047 | Tropical Forest Soil | FVSPASANWFSNIYWNIQLNLGSAPSPTPEQVRENRLPMLVRDADGNVIAMIDPATGKVIATAEPPAPPKANGVTNTAAPGTPARR*
Ga0126376_102378972 | 3300010359 | Tropical Forest Soil | MRTRIILSALLLSAIVSPASANWFSNPDWNINLHIGSAPSPTPEDIRLEKRPMLVRDADGNVIAMIDPATGKVIATVEPSARMNAAAARLYLGAPNPGAPKGK*
Ga0126376_103805542 | 3300010359 | Tropical Forest Soil | MELAMKTRILLSALLLSAAFVSPASANWFSNIYWNIQLNLGSAPSPTPEQVRENRLPMLVRDADGNVIAMIDPATGKVIATAEPPAPPKANGVTNTAAPGTPARR*
Ga0126377_124354311 | 3300010362 | Tropical Forest Soil | HNMRTRIILSALLLSAIVSPASANWFSNPDWNINLHIGSAPSPTPEDIRLEKRPMLVRDADGNVIAMIDPATGKVIATVEPSARMNAAAARLYLGAPNPGAPKGK*
Ga0134128_120730862 | 3300010373 | Terrestrial Soil | MKSRNLLTVLVFSVVAVSPAAANWFSNPRLGVNLHIGTAPGPTPEDILSGRQPMLVKDADGNVIAMIDTATGKVIATAEPPAPPARASVKPAQPQTRAR*
Ga0126381_1029226872 | 3300010376 | Tropical Forest Soil | MEHKMKTRLILPMLLVSLAVASPAFGNWFSNPDWNINLHIGSAPGPTPDDIRAARQPMLVRDADGNIIAMIDPATGKVIATAEPPPSASQPRNGVVPAKAAAPAR*
Ga0134124_122149322 | 3300010397 | Terrestrial Soil | MEHEMKTRLILPTILVSLAFVSPALGNWFSNPDWNINLHVGSAPNPTPDDIRAARQPMLVRDADGNIIAMIDPVSGKVIATAEPPPAAAQSIKAAVPAKAAAPGR*
Ga0134122_1000067419 | 3300010400 | Terrestrial Soil | MECIMKTRLLLSVLLLSAAFVSPASANWFSNTIWNLQLNIGSAPNPTPSDVRENKVPMLVRDADGNVIAMIDPATGKVIAIAEPPAPPSAVANTAPAARPAR*
Ga0134122_112766522 | 3300010400 | Terrestrial Soil | MEYKMSTRLILLALLLSGVAASPASANWFSNPAWNINLHIGSAPSPTPDDIRVGRQPMLVRDADGNIIAMIDPATGKIIATAEPPPTAANGVVAAKPAAAPAR*
Ga0134121_125446371 | 3300010401 | Terrestrial Soil | GVAASPASANWFSNPAWNINLHIGSAPSPTPDDIRVGRQPMLVRDADGNIIAMIDPATGKIIATAEPPPSAANGVVTAKPVAAPAR*
Ga0126354_10711842 | 3300010857 | Boreal Forest Soil | MECMMKRHILLSALLLSAVFVSPASANWFSNPDININLNIGSAPSPTPDDVRGERQPMLVRDAEGNVIAMIDPATGKVIATAEPRATAQPRGSAAKIVPAAARSR*
Ga0126345_12115141 | 3300010858 | Boreal Forest Soil | DLPPSMESTMKTRILLPVLLLSAVSVSPASANWFANPSWGINLHIGSAPSPTPDQVRSDRQPMLVRDADGNVIAMVDPITGKMIAIAEPVAPAPASNANKTAAAPVR*
Ga0126352_12230591 | 3300010859 | Boreal Forest Soil | ITGPLPLMECTMKTRILLPVMLLSAVFVSPASANWFSNPSVGIDLNIGSTRSPTPDEVRADRQPMLVRDADGNVIAMIDPTTGKMIAIAEPSATAQPKSGAAKVAPAAARAR*
Ga0126349_10031691 | 3300010861 | Boreal Forest Soil | MFLSALLLSAVFVSPASANWFSNPEININLNIGSAPNPTPDDVRGERQPMLVRDAEGNVIAMIDPATGKVIATAEPKATNSGTAKIAPAPARTR*
Ga0150983_142264691 | 3300011120 | Forest Soil | MERTMKTQSILSMLLLTAMFVSPASANWFANPKVGINLQLGSAPSPTPDDVLSGRQPMLVRDADGNVLAMIDTVTGKVIATAEPPAPVTARSNSVPTKTPARAH*
Ga0150983_142576741 | 3300011120 | Forest Soil | MECTMKTRILLPVLMLSAVFVSPASANWFSNPGLSIDLNIGSAPSPTPDQVRSDRQPMLVRDAAGNVIAMIDPTTGKMIAIAEPSAVAQPKNGAPKVTPAAARAR*
Ga0150983_153696202 | 3300011120 | Forest Soil | MECTMKTRILLPVLMLSAVFVSPASANWFSNPGVGIDLNIGSASSPTPDQVRSDRQPMLVRDADGNVIAMIDPTTGKMIAIAEPSAAAQPKNGGAKVTPAAARAH*
Ga0137388_119568022 | 3300012189 | Vadose Zone Soil | SPASANWFSNPSLGIDLNVGSAPSPTPDQVRADRQPMLVRDAEGNVIAMIDPATGKMIAIAEPTAAAQKSNNAAKVTPAAARAH*
Ga0137382_105160822 | 3300012200 | Vadose Zone Soil | MESMMKTRILLPVLMLSAVFVSPASANWFSNPGVNINLNIGSAPNPTPGDIRSDRQPMLVRDAEGNVIAMIDPATGKMIAIAEPTAAAQKS
Ga0137399_100231211 | 3300012203 | Vadose Zone Soil | MLLPVLLLTAAFVSPVSANWFHNPGWNLNLNLGSAPSPTPDDIRADKQPMLVRDADGNVIAMIDPATGKVIATAEPPVTARPSNGTARNAPAAAPAR*
Ga0137362_100993692 | 3300012205 | Vadose Zone Soil | MEYSMKTRILLPVLLLSAVFVSPASANWFHNPAWNLNLNLGSAPSPTPEDIRADKKPMLVRDADGNIIAMIDPATGKVIATAEPPATARPRSGVAPAAAAPAR*
Ga0137381_112794282 | 3300012207 | Vadose Zone Soil | MLLLSAVFVSPASANWFSNPSLGIDLNVGSAPSPTPDQVRADRQPMLVRDSEGNVIAMFDPATGKMIAIAEPTAAAQKSNNAAKVTPAAARAH*
Ga0137376_115049311 | 3300012208 | Vadose Zone Soil | MEHTMKTRMLLPVLLLTAAFVSPASANWFHNPSWNLNLNLGSAPSPTPDDIRADKQPMLVRDADGNIIAMIDPATGKVIATAEPPVTARP
Ga0137376_116317081 | 3300012208 | Vadose Zone Soil | MESMMKTRILLPVLLLSAVFVSPASANWFSNPSLGIDLNIGSAPSPTPDQVRADRQPMLVRDAEGNVIAMIDPATGKMIAI
Ga0137379_102145403 | 3300012209 | Vadose Zone Soil | MEPMMKTRILLPMLLLSAVFVSPASANWFSNPSLGIDLNIGSAPTPTPDQVRADRQPMLVRDAEGNVIAMIDPATGKMIAIAEPTAAAQKSNNAAKVTPAAARAH*
Ga0137378_110472011 | 3300012210 | Vadose Zone Soil | MECTMKRHVLLSALLLSAVSVSPASANWFSNPEININLNIGSAPSPTPDDIRGERQPMLVRDAEGNVIAMIDPATGKVIATAEPQAKNSGAAKIAPAAVPTR*
Ga0137377_101504674 | 3300012211 | Vadose Zone Soil | NWFSNPNIDINLNIGSAPNPTPDDVLVERLPMLVRDADGNVIAMIDSTSGKVIATAEPSPTANAKPAAAQPAQARSSGR*
Ga0137377_104033082 | 3300012211 | Vadose Zone Soil | MECTMKRHILLSALLLSAVFVSPASANWFSNPEININLNIGSAPNPTPDDVRGERQPMLVRDAEGNVIAMIDPATGRVIATAEPQAKNSGAAKIAPAAAPTR*
Ga0150985_1079678751 | 3300012212 | Avena Fatua Rhizosphere | HLGQSMEHEMKTRLILPTILVSLVFVSPALGNWFSNPDWNINLHVGSAPSPTPDDIRAGRQPMLVRDSDGNVIAMIDPVSGKVIAIAEPPPTAAQPRSVAVPAKTAVAPGR*
Ga0150985_1099929342 | 3300012212 | Avena Fatua Rhizosphere | LDLFTEHKMKTRLVLPVLLFSLPFVSPASANWFSNPDWNINLHVGSAPSPRPDDIRAGRQPMLVRDADGNIIAMIDPSSGTVIAVAEPPATAQPISRAATAVKAAAPAR*
Ga0150985_1112573001 | 3300012212 | Avena Fatua Rhizosphere | MECTMKKAMLLSVLLLSAVSVSPASANWFSNPGVNINLNLGSAPSPTPDDVRADRQPMLVRDAEGNVIAMIDPATGKMIAIAEPSAAPQGKGGPAKVTAATTKAR*
Ga0150985_1169451451 | 3300012212 | Avena Fatua Rhizosphere | HHWDPPMEYKMSSRLILLTLLLSGAAISPAFANWFSNPSWNINLHVGSAPNPTPDDIRVGRQPMLVRDADGNIIAMIDPATGKIIATAEPPPNAAAAGKPAAVPAR*
Ga0137384_101969032 | 3300012357 | Vadose Zone Soil | MLLLSAVFVSPASANWFSNPSLGIDLNVGSAPSPTPDQVRADRQPMLVRDAEGNVIAMIDPATGKMIAIAEPTAAAQKSNNAAKVTPAAARAH*
Ga0137384_112094131 | 3300012357 | Vadose Zone Soil | MFLGGPQARPHLMECTMKRHVLLSALLLSAVSVSPASANWFSNPEININLNIGSAPSPTPDDIRGERQPMLVRDAEGNVIAMIDPATGKVIATAEPQAKNSGAAKIAPAAVPTR*
Ga0150984_1233736641 | 3300012469 | Avena Fatua Rhizosphere | HHWDPPMEYKMSSRLILLTLLLSGAAISPAFANWFSNPSWNINLHVGSAPNPTPDDIRVGRQPMLVRDADGNIIAMIDPATGKIIATAEPPPSAAAAGKPAAVPAR*
Ga0137398_100420692 | 3300012683 | Vadose Zone Soil | MKTPIVLSALLLPAIVSPASANWFSNPDWNINLHIGSAPSPTPEDIRLEKRPMLVRDSDGNIIAMIDPVTGKVIATVEPSARLNAAAAKIAPPPARAR*
Ga0137395_102060042 | 3300012917 | Vadose Zone Soil | MECMMKTRILLPVLLLSAVFVSPASANWFSNPSLGIDLNIGSAPSPTPDQVRADRQPMLVRDAEGNVIAMIDPATGKMIAIAEPTAAAQKSNNAAKVTPAAARAH*
Ga0137394_105307852 | 3300012922 | Vadose Zone Soil | MESAMKTRIILPVLLLAAVFVSPASANWFHNPAWNLNLNLGSAPSPTPNDIRDNKQPMLVRDADGNVIAMIDPATGKVLATAEPPPTAQSRSGVAPAAAPAR*
Ga0137394_105379702 | 3300012922 | Vadose Zone Soil | MEYTMKTRLLLSVLLLSAAFVSPASANWFSNVYWNIHLNIGSAANPTPSDIRENKRPMLVRDADGNVIAMIDPDTGKVIATAEPPAPPSSAANVAPAARPAR*
Ga0137419_114045011 | 3300012925 | Vadose Zone Soil | MFLPVLLLTAVFASPASANWFHNPGWNLNLNLGSAPSPTPDDIRADKQPMLVRDADGNIIAMIDPATGKVIATAEPPVTVRANNATARSTPA
Ga0137419_115754582 | 3300012925 | Vadose Zone Soil | STMKTRILLPVLMLSAVFVSPASANWFSNPGLNIDLNIGSAPSPTPDQVRADRQPMLVRDADGNVIAMIDPATGKMIAIAEPSAAAQQKNGAAKVTPASAKAR*
Ga0137404_111175812 | 3300012929 | Vadose Zone Soil | LLSAVFVSPASANWFHNPAWNLNLNLGSAPSPTPEDIRADKKPMLVRDADGNIIAMIDPATGKVIATAEPPATARPRSGVAPAAAAPAR*
Ga0137407_116367762 | 3300012930 | Vadose Zone Soil | MEYIMKTRLLLSVLLLSAAFVSPASANWFSNVYWNIQLNIGSAANPTPSDIRENKRPMLVRDADGNVIAMIDPDTGKVIATAEPPAPPS
Ga0137410_102022051 | 3300012944 | Vadose Zone Soil | TAFVAPASANWFHNPLWNLNLNLGSAASPTPEDIRADKKPMLVRDTDGNIIAMIDPATGKVIATAEPPVTAQANNSVARIAPAPARAR*
Ga0137410_106460002 | 3300012944 | Vadose Zone Soil | SANWFHNPGWNLNLNLGSAPSPTPDDIRADKQPMLVRDADGNIIAMIDPATGKVIATAEPPVTARPSNGTARNAPAAAPAR*
Ga0137410_113348291 | 3300012944 | Vadose Zone Soil | NWFSNPGLSIDLNIGSAPSPTPDQVRSDRQPMLVRDADGNVIAMIDPATGKMIAIAEPSAAAQQKNGAAKVAPPAAHAR*
Ga0164301_100617723 | 3300012960 | Soil | MLLLSAVVVSSTYTNWYSNPDVNINLSLGSAPKPTPGDSRSDRQPMLVRDAEGNVIAMIDPTTGKMIAIAEPSAAAQPKNGAAKVAPAAAHAR*
Ga0164309_105233312 | 3300012984 | Soil | MLLLSAVFVSPASANWFSNPDVNINLNLGSAPNPTPGDIRSDRQPMLVRDAEGNVIAMIDPTTGKMIAIAEPSAAAQPKNGAAKVAPAAAHAR*
Ga0157378_128680541 | 3300013297 | Miscanthus Rhizosphere | MESTMKTRILLPMLLLSAVFVSPASANWFANPSWNINLHIGSAPNPTPDDVRAERQPMLVRYADGHIIAMIDPASGKIIATAEPPATAQAAKPAS
Ga0157376_112208751 | 3300014969 | Miscanthus Rhizosphere | PMLLLSAVFVSPASANWFANPSWNINLHIGSAPNPTPDDVRSERQPMLVRDADGNIIAMIDPASGKIIATAEPPATAQATKPAAAGR*
Ga0132258_110022252 | 3300015371 | Arabidopsis Rhizosphere | MEHEMKTRLILPTILVSLVFVSPALGNWFSNPDWNINLHVGSAPNPTPDDIRAARQPMLVRDADGNVIAMIDPSTGKVIATAEPPPAAAQPIKAAVPAKAAAPVR*
Ga0132256_1027922592 | 3300015372 | Arabidopsis Rhizosphere | MEFLMKTRVLLSALLVSAAFVSPASANWFSNTVWNLTLNIGSAANPTPNDVRENKTPMLVRDADGNVIAMIDPATGKVIAIAEPPAPPSAVANTAPAARPAR*
Ga0182033_113751772 | 3300016319 | Soil | MKTRIALPALLLTTVFVSPASANWFSNPDWNINLNIGSAPTPTPDDIRNERRPMLVRDADGNIIAMIDPLTGKVIATIEPSSQAKDATGK
Ga0187777_109396092 | 3300017974 | Tropical Peatland | MTLRHLLPVLLLSAVAASPVSANWFSNAPWGLNLAIGSAPSPTPEDVRAGRQPMLVRDADGNIIAMIDPSSGKVIATAEPVAPPPPNVRAASTPK
Ga0193728_11823131 | 3300019890 | Soil | MKSILLPVLLLSAVFVSPASANWFSNPTWGINLHIGSAPSPTPDQVRADRQPMLVRDADGNVIAMVDPNTGKMIAIAEPPAPAQASANKAPTAAPV
Ga0184649_13606931 | 3300020068 | Groundwater Sediment | MKTRILLPVLLLSAVFVSPASANWFSNPGWNINLAIGSAPNPTPEDIRQMRQPMLVRDAEGNVIAMIDPATGKMIATAEPPAAAQPSSGVAQPAPAAAPGR
Ga0193699_100015992 | 3300021363 | Soil | MKTRILLPVLLLPAVFVSPASANWFSNPEVNINLNLGSAPNPTPDDVRTDRQPMLVRDADGNVIAMIDPTTGKMIAIAEPSATAKPSSSAAKVTPAAAPAR
Ga0126371_137730852 | 3300021560 | Tropical Forest Soil | DWNINLHVGSAPNPTPEDVRIGHQPMLVRDADGNIIAMIDPATGKVLATAEPPAKANGGAPKTVPAAAPAR
Ga0242655_103244851 | 3300022532 | Soil | MKTRILLPILLLSAAFVSPASANWFASPQLGINLNIGSAPSPTPDHVSTGRQPMLVRDADGNIIAMIDSATGKIIATAEPPVAKQSSSAATKTAPALARAR
Ga0242665_103698862 | 3300022724 | Soil | VPYMERTMKTQSILSMLLLTAMFVSPASANWFANPRVGINLQLGSAPSPTPDDVLSGRQPMLVRDADGNVLAMIDTVTGKVIATAEPPAPVTARSNSVPTKTAASPARAH
Ga0247668_10148132 | 3300024331 | Soil | MKTKLLLSVLLFSAMFVSPGSANWFSNPDWGINLHIGSAPNPTPADVRADRQPMLVRDSDGNIIAMVDPATGKMIAIAEPAAQAQATNAAPKKTAATAPVR
Ga0207650_108248301 | 3300025925 | Switchgrass Rhizosphere | MKMRLIWSVLLLSVALASSASANWFSNPAWNINLHIGSAPNPTPDDIRVGRQPMLVRDADGNVIAMIDPASGKIIAVAEPPPAQASSQTVNVTKAAAPVR
Ga0207650_114309971 | 3300025925 | Switchgrass Rhizosphere | MTTRILLPMLLLSAVFVSPASANWFANPSWNINLHIGSAPNPTPDDVRSERQPMLVRDADGNIIAMIDPASGKIIATAEPPATAQATKPAAAGR
Ga0207644_105904321 | 3300025931 | Switchgrass Rhizosphere | LLALLLSGVAASPASANWFSNPAWNINLHIGSAPSPTPDDIRVGRQPMLVRDADGSIIAMIDPATGKIIATAEPPPSAANGVVTAKPAAAPAR
Ga0207651_112248211 | 3300025960 | Switchgrass Rhizosphere | MSIRLILLALLLSGVAASPASANWFSNPAWNINLHIGSAPSPTPDDIRVGRQPMLVRDADGNIIAMIDPATGKIIATAEPPPSAANGVVTAKPAAAPAR
Ga0207677_105742332 | 3300026023 | Miscanthus Rhizosphere | MLLLSAVFVSPASANWFANPSWNINLHIGSAPNPTPDDVRSERQPMLVRDADGNIIAMIDPASGKIIATAEPPATAQATKPAAAGR
Ga0209131_10907021 | 3300026320 | Grasslands Soil | MKTPIVLSALLLPAIVSPASANWFSNPDWNINLHIGSAPSPTPEDIRLEKRPMLVRDSDGNIIAMIDPVTGKVIATVEPSARLNAAAAKIAPPPARAR
Ga0209131_13189702 | 3300026320 | Grasslands Soil | LLSAVFVSPASANWFSNPDVNINLNLGSAPSPTPDDVRTERQPMLVRDADGNVIAMIDPTTGKMIAIAEPSATAQQKNTPAKVTPAATRAH
Ga0209726_102820892 | 3300027815 | Groundwater | MKTRILLPVLLLSAVCVSPASANWFANSGWNINLFVGSAPNPTPEDIRANKQPTLVQDADGNVIAMIDPRTGQTMAVAESLVTAQPASGAGNAAPASAPKR
Ga0209060_100619492 | 3300027826 | Surface Soil | MKTRLILPTLLVSVVFVSPAFGNWFSNPDWNINLHVGSAPNPTPDDVRAGHQPMLVRDADGNVIAMIDPTTGKVIATAEPPPNAAQPRNALVPAKATASPAR
Ga0137415_104765361 | 3300028536 | Vadose Zone Soil | LTAAFVSPASANWFHNPGWNLNLNLGSAPSPTPDDIRADKQPMLVRDADGNIIAMIDPATGKVIATAEPPVTARPSNGTARNAPAAAPAR
Ga0265337_10053923 | 3300028556 | Rhizosphere | MRTLLLFPALLFSAVLASPASANWFSNPALDINLHIGSARSPTPDQVRNEAQPMLVRDADGNVIAMIDPRSGKIIATAEPAPKAQAASRPANQPATPAR
Ga0222749_100172673 | 3300029636 | Soil | MKTQSILSMLLLTAMFVSPASANWFANPKVGINLQLGSAPSPTPDDVLSGRQPMLVRDADGNVLAMIDTVTGKVIATAEPPAPVTARSNSVPTKTPARAH
Ga0170824_1015448951 | 3300031231 | Forest Soil | VVLPWYRSSCSGGRTPPPFMESALKTRILLPVLLLSAVFVSPASANWFSNPSIGIDLNIGSAPSPTPDQVRADRQPMLVRDADGNVIAMIDPTTGKMIAIAEPSAAAQSKNGAAKVAPPPARAR
Ga0170824_1022981521 | 3300031231 | Forest Soil | MKIHSVPPILLLTAMLVSPASANWFSNPTWGINREIATAPSPTPEDVRSGRQPMLVKDADGNIIAMIDTVTGKVIATAEPPAGVTALPTATPVKPAVAPVRAASH
Ga0170824_1027245451 | 3300031231 | Forest Soil | PPLMESAMKTRIILPVLLLSAVFVSPASANWFANPGWGINLHIGSAPSPTPDQVRSDRQPMLVRDADGNVIAMVDPVTGKMLAIAEPTAPAPASNANKSAAAPVR
Ga0170824_1163672231 | 3300031231 | Forest Soil | RGTRDHPPLMESTMKTRILLPVLLLSAVFVSPASANWFSNPDVNINLNLGSAPNPTPGDVRSDRQPMLVRDADGNVIAMIDPTTGKMIAIAEPSAAAQPKSGAAKVTPAAAHAR
Ga0265316_106069431 | 3300031344 | Rhizosphere | MRTLLLFPALLFSAVLASPASANWFSNPALDINLHIGSARSPTPDQVRNEAQPMLVRDADGNVIAMIDPRSGKIIATAEPAPK
Ga0308194_103033741 | 3300031421 | Soil | SPPLMECSMKRHILLSALLLSVVFVSPASANWFSNPNININLNIGSAPSPTPDDVRGERQPMLVRDAEGNVIAMIDPATGKVIATAEPQATNSGAAKIAPAAARGR
Ga0170820_116635512 | 3300031446 | Forest Soil | MKTRILLPVLLLSAVFVSPASANWFSNPDVNINLNLGSAPNPTPGDVRSDRQPMLVRDADGNVIAMIDPTTGKMIAIAEPSAAAQPKSGAAKVTPAAAHAR
Ga0307468_1000992412 | 3300031740 | Hardwood Forest Soil | MKTRVLLSALLLSAAFVSPAAANWFSNTVWNIQLNIGSAANPTPSDVRENKVPMLVRDADGNVIAMIDPATGKVIAIAEPPAPPSAVANTAPAARPAR
Ga0307477_100004796 | 3300031753 | Hardwood Forest Soil | MTRILLAGLFLAPAFATPAAANWFSNPNLDINLNLGSAPSPTPSDVLGERQPMLVRDADGNVIAMIDPSTGKIIATAEPPAQAKSAGSVARPIAAKASAR
Ga0306919_113847511 | 3300031879 | Soil | TVFVSPASANWFSNPDWNINLNIGSAPTPTPDDIRNERRPMLVRDADGNIIAMIDPLTGKVIATIEPSSQAKDATGKPPPPRAR
Ga0306921_100188966 | 3300031912 | Soil | MKTRIALPALLLTTVFVSPASANWFSNPDWNINLNIGSAPTPTPDDIRNERRPMLVRDADGNIIAMIDPLTGKVIATIEPSSQAKDATGKPPPPRAR
Ga0306920_1032203921 | 3300032261 | Soil | MKTRIALPALLLTTVFVSPASANWFSNPDWNINLNIGSAPTPTPDDIRNERRPMLVRDADGNIIAMIDPLTGKVIATIEPSSQAKDAT
Ga0335079_101630622 | 3300032783 | Soil | MTLRHLLPVLLLSAVAASPVSANWFSNAPWGLNLAIGSAPSPTPEDVRAGRQPMLVRDADGNIIAMIDPSSGKVIATAEPVAPPPPNVRAASTPKTSAR
Ga0326726_100432186 | 3300033433 | Peat Soil | MKTRILLPVLLLSAVFVSPASANWFSNPNWNINLHIGSAPNPTPDDVRAERQPMLVRDADGNVIAMIDPATGKMIAIAEPPAPAQPSRGIANNAPAAAPAR




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice