NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F097652



Overview

Basic Information
Family ID F097652
Family Type Metagenome / Metatranscriptome
Number of Sequences 104
Average Sequence Length 114 residues
Representative Sequence VATWIFTTPTVAEAPFAWNPLMERYRMNRGLSVVEVSPCVYETTRYDAYTNEIGAVNLPVNPNAQDTDFWPAPSAGLHYFRGGYEHLVDDATKACLISSGVADNSNFVLAPNQGFGQGGFGEGGFGS
Number of Associated Samples 90
Number of Associated Scaffolds 104

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Viruses
% of genes with valid RBS motifs 77.88 %
% of genes near scaffold ends (potentially truncated) 25.96 %
% of genes from short scaffolds (< 2000 bps) 53.85 %
Associated GOLD sequencing projects 80
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.71

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
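
The percentages in the Quality Assessment block can be turned back into approximate gene counts. The sketch below assumes each percentage is computed over all 104 family members; the page does not state the denominator, so that is an assumption, and the helper name `implied_count` is hypothetical.

```python
FAMILY_SIZE = 104  # "Number of Sequences" reported in Basic Information


def implied_count(percent: float, total: int = FAMILY_SIZE) -> int:
    """Return the whole number of members closest to `percent` of `total`."""
    return round(percent / 100 * total)


# Percentages copied from the Quality Assessment block.
reported = {
    "valid RBS motif": 77.88,
    "near scaffold end (potentially truncated)": 25.96,
    "short scaffold (< 2000 bps)": 53.85,
}

for label, pct in reported.items():
    n = implied_count(pct)
    # Recompute the percentage from the integer count as a consistency check.
    print(f"{label}: {n} of {FAMILY_SIZE} ({n / FAMILY_SIZE:.2%})")
```

Under that assumption, the recomputed percentages match the reported ones to two decimal places, which suggests the counts are 81, 27, and 56 genes respectively.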
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Duplodnaviria (61.538 % of family members)
NCBI Taxonomy ID 2731341
Taxonomy All Organisms → Viruses → Duplodnaviria

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil
(16.346 % of family members)
Environment Ontology (ENVO) Unclassified
(25.000 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(46.154 % of family members)




Multiple Sequence Alignments


Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 12.90%, β-sheet: 21.94%, Coil/Unstructured: 65.16%

Predicted 3D Structure

Per-residue confidence (pLDDT) bins: 0-50, 51-70, 71-90, 91-100
pTM-score: 0.71

Potential Novel Structural Fold:

This family has a high-confidence model (pTM ≥ 0.7) with no significant hits to either SCOPe or PDB biological assemblies. It is therefore classified as a potential novel structural fold.
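
The rule stated above can be sketched as a small predicate. This is only an illustration of the two stated conditions, not the database's actual pipeline: the `FamilyModel` class and its boolean hit flags are hypothetical names, and what counts as a "significant" hit is not defined on this page.

```python
from dataclasses import dataclass

PTM_THRESHOLD = 0.7  # threshold quoted in the text above


@dataclass
class FamilyModel:
    family_id: str
    ptm_score: float
    has_scope_hit: bool  # hypothetical flag: significant hit to SCOPe
    has_pdb_hit: bool    # hypothetical flag: significant hit to PDB biological assemblies


def is_potential_novel_fold(model: FamilyModel) -> bool:
    """High-confidence model (pTM >= 0.7) with no significant structural hits."""
    confident = model.ptm_score >= PTM_THRESHOLD
    no_hits = not (model.has_scope_hit or model.has_pdb_hit)
    return confident and no_hits


# Values for this family as reported on the page: pTM 0.71, no significant hits.
f097652 = FamilyModel("F097652", ptm_score=0.71, has_scope_hit=False, has_pdb_hit=False)
print(is_potential_novel_fold(f097652))
```

A model that falls below the pTM threshold, or that has a significant hit in either resource, would not be flagged under this reading of the rule.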



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 104 Family Scaffolds
PF12708 | Pectate_lyase_3 | 5.77
PF13252 | DUF4043 | 5.77
PF00877 | NLPC_P60 | 5.77
PF12850 | Metallophos_2 | 2.88
PF13472 | Lipase_GDSL_2 | 1.92
PF00383 | dCMP_cyt_deam_1 | 1.92
PF01464 | SLT | 1.92
PF00961 | LAGLIDADG_1 | 0.96
PF04545 | Sigma70_r4 | 0.96
PF00657 | Lipase_GDSL | 0.96
PF03699 | UPF0182 | 0.96
PF01510 | Amidase_2 | 0.96
PF03237 | Terminase_6N | 0.96
PF04686 | SsgA | 0.96
PF13155 | Toprim_2 | 0.96
PF00578 | AhpC-TSA | 0.96
PF13392 | HNH_3 | 0.96
PF02511 | Thy1 | 0.96
PF13481 | AAA_25 | 0.96

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 104 Family Scaffolds
COG0791 | Cell wall-associated hydrolase, NlpC_P60 family | Cell wall/membrane/envelope biogenesis [M] | 5.77
COG1351 | Thymidylate synthase ThyX, FAD-dependent family | Nucleotide transport and metabolism [F] | 0.96
COG1615 | Uncharacterized membrane protein, UPF0182 family | Function unknown [S] | 0.96



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 95.19 %
Unclassified | root | N/A | 4.81 %


Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
2038011000|ACOD_FV90NF401C7I92 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 532
3300001990|JGI24737J22298_10142193 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 709
3300002886|JGI25612J43240_1000156 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 5577
3300002910|JGI25615J43890_1093142 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 534
3300003316|rootH1_10002237 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 44869
3300003316|rootH1_10015574 | All Organisms → Viruses → Predicted Viral | 3015
3300003321|soilH1_10092804 | All Organisms → Viruses → Predicted Viral | 2416
3300003321|soilH1_10350483 | All Organisms → cellular organisms → Bacteria | 2385
3300003322|rootL2_10000402 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 36913
3300003322|rootL2_10000430 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 28917
3300003324|soilH2_10137982 | All Organisms → Viruses → Predicted Viral | 1196
3300003838|Ga0058691_1000369 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 33277
3300004479|Ga0062595_100610604 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 852
3300005158|Ga0066816_1000004 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 14542
3300005330|Ga0070690_100104891 | All Organisms → Viruses → Predicted Viral | 1878
3300005343|Ga0070687_100314103 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 999
3300005345|Ga0070692_11396732 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 505
3300005439|Ga0070711_100020474 | All Organisms → Viruses → Predicted Viral | 4264
3300005446|Ga0066686_10118889 | All Organisms → Viruses → Predicted Viral | 1721
3300005555|Ga0066692_10036912 | Not Available | 2634
3300005563|Ga0068855_100000581 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 44917
3300005586|Ga0066691_10390975 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 826
3300006195|Ga0075366_10000581 | Not Available | 17175
3300006755|Ga0079222_10115102 | All Organisms → Viruses → Predicted Viral | 1450
3300006871|Ga0075434_101917906 | All Organisms → cellular organisms → Bacteria | 598
3300006904|Ga0075424_102208538 | All Organisms → cellular organisms → Bacteria | 579
3300010335|Ga0134063_10268120 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Streptomycetales → Streptomycetaceae → Streptomyces | 816
3300010397|Ga0134124_10033779 | All Organisms → Viruses → Predicted Viral | 4248
3300012189|Ga0137388_11473913 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 618
3300012202|Ga0137363_11112830 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 672
3300012203|Ga0137399_10570808 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 950
3300012361|Ga0137360_10557992 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 978
3300012361|Ga0137360_11861428 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 507
3300012397|Ga0134056_1278968 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 853
3300012469|Ga0150984_105798169 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 13616
3300012582|Ga0137358_10000022 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 45420
3300012924|Ga0137413_10004202 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 6399
3300012927|Ga0137416_11784412 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 562
3300012960|Ga0164301_10471268 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 898
3300012984|Ga0164309_10003806 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 7034
3300012984|Ga0164309_10018363 | All Organisms → Viruses → Predicted Viral | 3616
3300012984|Ga0164309_10846083 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 741
3300012985|Ga0164308_10489774 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 1026
3300012987|Ga0164307_10101111 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 1804
3300012988|Ga0164306_11245048 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 626
3300014203|Ga0172378_10726742 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 722
3300016294|Ga0182041_11232139 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 683
3300017656|Ga0134112_10070053 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Streptomycetales → Streptomycetaceae → Streptomyces | 1292
3300017659|Ga0134083_10012660 | All Organisms → Viruses → Predicted Viral | 2858
3300017936|Ga0187821_10011560 | All Organisms → Viruses → Predicted Viral | 3024
3300017994|Ga0187822_10014527 | All Organisms → Viruses → Predicted Viral | 1943
3300018052|Ga0184638_1134835 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 898
3300018075|Ga0184632_10050464 | All Organisms → Viruses → Predicted Viral | 1795
3300019867|Ga0193704_1000139 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 11740
3300019875|Ga0193701_1012911 | All Organisms → Viruses → Predicted Viral | 1668
3300019880|Ga0193712_1044610 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 964
3300019882|Ga0193713_1000017 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 45272
3300025916|Ga0207663_10000190 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 27388
3300025916|Ga0207663_10293261 | All Organisms → Viruses → Predicted Viral | 1212
3300025918|Ga0207662_10007279 | Not Available | 6017
3300025920|Ga0207649_11678290 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 503
3300025922|Ga0207646_10322398 | All Organisms → Viruses → Predicted Viral | 1396
3300025949|Ga0207667_10000657 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 44835
3300025981|Ga0207640_10904058 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 771
3300026285|Ga0209438_1000045 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 30963
3300026334|Ga0209377_1038311 | All Organisms → Viruses → Predicted Viral | 2227
3300026475|Ga0257147_1011793 | All Organisms → Viruses → Predicted Viral | 1164
3300026537|Ga0209157_1202326 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 837
3300026843|Ga0208126_1000325 | Not Available | 2232
3300027606|Ga0209370_1000572 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 46020
3300028381|Ga0268264_10001278 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 23852
3300028536|Ga0137415_10135173 | All Organisms → cellular organisms → Bacteria | 2308
3300028587|Ga0247828_10000616 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 11804
3300028587|Ga0247828_10006990 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 4015
3300028587|Ga0247828_10332583 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 851
3300028589|Ga0247818_10014040 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 5166
3300028589|Ga0247818_10494115 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 834
3300028590|Ga0247823_10135188 | All Organisms → Viruses → Predicted Viral | 1884
3300028592|Ga0247822_10000045 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 45212
3300028592|Ga0247822_10002066 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 12697
3300028596|Ga0247821_10635781 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 691
3300028597|Ga0247820_10009557 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 5421
3300028787|Ga0307323_10000287 | Not Available | 15531
3300028787|Ga0307323_10000715 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 10297
3300028794|Ga0307515_10080444 | All Organisms → Viruses → Predicted Viral | 4250
3300028809|Ga0247824_10203892 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 1083
3300028889|Ga0247827_10029782 | All Organisms → Viruses → Predicted Viral | 2352
3300029288|Ga0265297_10635061 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 676
3300030336|Ga0247826_10484853 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 931
(restricted) 3300031150|Ga0255311_1140597 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 534
3300031199|Ga0307495_10000001 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 47518
3300031226|Ga0307497_10001012 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Streptosporangiales → Streptosporangiaceae → Herbidospora → Herbidospora cretacea | 6840
3300031507|Ga0307509_10022077 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 7185
3300031507|Ga0307509_10030073 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 6013
3300031616|Ga0307508_10005766 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 11715
3300031724|Ga0318500_10386308 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 695
3300031954|Ga0306926_10577334 | All Organisms → Viruses → Predicted Viral | 1375
3300032075|Ga0310890_11405945 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 573
3300032144|Ga0315910_11489214 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 528
3300032180|Ga0307471_100423070 | All Organisms → Viruses → Predicted Viral | 1465
3300033004|Ga0335084_10664854 | All Organisms → Viruses → Predicted Viral | 1063
3300033004|Ga0335084_10785393 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Streptomycetales → Streptomycetaceae → Streptomyces → unclassified Streptomyces → Streptomyces sp. BK022 | 968
3300033412|Ga0310810_10094938 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 3563
3300033550|Ga0247829_10414091 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Beephvirinae | 1109
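
Each scaffold identifier in the table above appears to combine a numeric prefix and a scaffold name separated by `|`, and the prefix matches the Taxon OID values listed under Associated Samples. A minimal sketch for splitting such identifiers follows, assuming that layout holds for every entry; `ScaffoldRef` and `parse_scaffold_id` are hypothetical names, not part of any NMPFamsDB or IMG/M tooling.

```python
from typing import NamedTuple

RESTRICTED_TAG = "(restricted)"  # marker that precedes some identifiers on this page


class ScaffoldRef(NamedTuple):
    taxon_oid: str
    scaffold_name: str
    restricted: bool


def parse_scaffold_id(raw: str) -> ScaffoldRef:
    """Split 'TAXON_OID|SCAFFOLD_NAME', tolerating a leading '(restricted)' tag."""
    text = raw.strip()
    restricted = text.startswith(RESTRICTED_TAG)
    if restricted:
        text = text[len(RESTRICTED_TAG):].strip()
    taxon_oid, sep, scaffold_name = text.partition("|")
    # Assumption: the prefix is purely numeric and the name is non-empty.
    if not sep or not taxon_oid.isdigit() or not scaffold_name:
        raise ValueError(f"unexpected scaffold identifier: {raw!r}")
    return ScaffoldRef(taxon_oid, scaffold_name, restricted)


print(parse_scaffold_id("3300003316|rootH1_10002237"))
print(parse_scaffold_id("(restricted) 3300031150|Ga0255311_1140597"))
```

Keeping the Taxon OID as a string avoids any accidental numeric reformatting when it is later matched against the sample list.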

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 16.35%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 14.42%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 8.65%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 4.81%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 4.81%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 3.85%
Corn Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere | 3.85%
Ectomycorrhiza | Host-Associated → Plants → Roots → Unclassified → Unclassified → Ectomycorrhiza | 3.85%
Sugarcane Root And Bulk Soil | Host-Associated → Plants → Rhizome → Unclassified → Unclassified → Sugarcane Root And Bulk Soil | 3.85%
Sugarcane Root And Bulk Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Sugarcane Root And Bulk Soil | 2.88%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 2.88%
Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere | 2.88%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment | 1.92%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 1.92%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 1.92%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 1.92%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 1.92%
Soil | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Soil | 1.92%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 1.92%
Agave | Host-Associated → Plants → Phyllosphere → Phylloplane/Leaf Surface → Unclassified → Agave | 1.92%
Groundwater | Environmental → Aquatic → Freshwater → Groundwater → Unclassified → Groundwater | 0.96%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 0.96%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 0.96%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 0.96%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 0.96%
Sandy Soil | Environmental → Terrestrial → Soil → Sand → Unclassified → Sandy Soil | 0.96%
Fungus Garden | Host-Associated → Arthropoda → Symbiotic Fungal Gardens And Galleries → Fungus Garden → Unclassified → Fungus Garden | 0.96%
Switchgrass Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Switchgrass Rhizosphere | 0.96%
Populus Endosphere | Host-Associated → Plants → Roots → Bulk Soil → Unclassified → Populus Endosphere | 0.96%
Corn Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Corn Rhizosphere | 0.96%
Avena Fatua Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Avena Fatua Rhizosphere | 0.96%
Landfill Leachate | Engineered → Solid Waste → Landfill → Unclassified → Unclassified → Landfill Leachate | 0.96%




Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OID | Sample Name | Habitat Type
2038011000 | Fungus garden microbial communities from Atta colombica in Panama - from dump top | Host-Associated
3300001990 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - C3 | Host-Associated
3300002886 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_20cm | Environmental
3300002910 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_80cm | Environmental
3300003316 | Sugarcane root Sample L1 | Host-Associated
3300003321 | Sugarcane bulk soil Sample H1 | Environmental
3300003322 | Sugarcane root Sample L2 | Host-Associated
3300003324 | Sugarcane bulk soil Sample H2 | Environmental
3300003838 | Agave microbial communities from Guanajuato, Mexico - At.P.rz | Host-Associated
3300004479 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of All WPAs | Environmental
3300005158 | Soil and rhizosphere microbial communities from Laval, Canada - mgHAA | Environmental
3300005330 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3H metaG | Environmental
3300005343 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3L metaG | Environmental
3300005345 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-10-2 metaG | Environmental
3300005439 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG | Environmental
3300005446 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 | Environmental
3300005555 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 | Environmental
3300005563 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C5-2 | Host-Associated
3300005586 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 | Environmental
3300006195 | Populus root and rhizosphere microbial communities from Tennessee, USA - Endosphere MetaG P. TD hybrid TD303-1 | Host-Associated
3300006755 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Plitter | Environmental
3300006871 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD3 | Host-Associated
3300006904 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD3 | Host-Associated
3300010335 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09082015 | Environmental
3300010397 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-4 | Environmental
3300012189 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG | Environmental
3300012202 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaG | Environmental
3300012203 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG | Environmental
3300012361 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaG | Environmental
3300012397 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_24_1 metaT (Metagenome Metatranscriptome) | Environmental
3300012469 | Combined assembly of Soil carbon rhizosphere | Host-Associated
3300012582 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaG | Environmental
3300012924 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Illumina Assembly) | Environmental
3300012927 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly) | Environmental
3300012960 | Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_231_MG | Environmental
3300012984 | Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_247_MG | Environmental
3300012985 | Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_246_MG | Environmental
3300012987 | Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_243_MG | Environmental
3300012988 | Soil microbial communities amended with fresh organic matter from upstate New York, USA - Whitman soil sample_242_MG | Environmental
3300014203 | Groundwater microbial communities from an aquifer near a municipal landfill in Southern Ontario, Canada - Pumphouse #3_1 metaG | Environmental
3300016294 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.178 | Environmental
3300017656 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_11112015 | Environmental
3300017659 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaG | Environmental
3300017936 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_1 | Environmental
3300017994 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SourceSoil_2 | Environmental
3300018052 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b2 | Environmental
3300018075 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_50_b1 | Environmental
3300019867 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U3m1 | Environmental
3300019875 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U3s2 | Environmental
3300019880 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H3a1 | Environmental
3300019882 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H3a2 | Environmental
3300025916 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-3 metaG (SPAdes) | Environmental
3300025918 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S3-3L metaG (SPAdes) | Environmental
3300025920 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C4-3 metaG (SPAdes) | Host-Associated
3300025922 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes) | Environmental
3300025949 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C5-2 (SPAdes) | Host-Associated
3300025981 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C4-2 (SPAdes) | Host-Associated
3300026285 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_20cm (SPAdes) | Environmental
3300026334 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 (SPAdes) | Environmental
3300026475 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-12-A | Environmental
3300026537 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 (SPAdes) | Environmental
3300026843 | Soil and rhizosphere microbial communities from Laval, Canada - mgHAA (SPAdes) | Environmental
3300027606 | Agave microbial communities from Guanajuato, Mexico - At.P.rz (SPAdes) | Host-Associated
3300028381 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Switchgrass S3-2 (SPAdes) | Host-Associated
3300028536 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly) | Environmental
3300028587 | Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day3 | Environmental
3300028589 | Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_Glucose_Day1 | Environmental
3300028590 | Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_PalmiticAcid_Day30 | Environmental
3300028592 | Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_Cellulose_Day30 | Environmental
3300028596 | Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_Glycerol_Day14 | Environmental
3300028597 | Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_Glucose_Day14 | Environmental
3300028787 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_381 | Environmental
3300028794 | Populus trichocarpa ectomycorrhiza microbial communities from riparian zone in the Pacific Northwest, United States - 17_EM | Host-Associated
3300028809 | Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_PalmiticAcid_Day48 | Environmental
3300028889 | Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day2 | Environmental
3300029288 | Leachate microbial communities from a municipal landfill in Southern Ontario, Canada - Leachate well 137-91 | Engineered
3300030336 | Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day1 | Environmental
3300031150 (restricted) | Sandy soil microbial communities from University of British Columbia, Vancouver, Canada - EtOH4_T0_E4 | Environmental
3300031199 | Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 7_S | Environmental
3300031226 | Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 10_S | Environmental
3300031507 | Populus trichocarpa ectomycorrhiza microbial communities from riparian zone in the Pacific Northwest, United States - 10_EM | Host-Associated
3300031616 | Populus trichocarpa ectomycorrhiza microbial communities from riparian zone in the Pacific Northwest, United States - 9_EM | Host-Associated
3300031724 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.174b1f20 | Environmental
3300031954 | Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - flux8day.12C.oxic.44.000.178 (v2) | Environmental
3300032075 | Lab incubated soil microbial communities from West Virginia University Organic Research Farm, Morgantown, WV, United States - T20D3 | Environmental
3300032144 | Garden soil microbial communities collected in Santa Monica, California, United States - Edamame soil | Environmental
3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental
3300033004 | Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_4.4 | Environmental
3300033412 | Soil microbial communities from experimental microcosm in Duke University, North Carolina, United States - NC | Environmental
3300033550Soil microbial communities from agricultural site in Penn Yan, New York, United States - 12C_Control_Day4EnvironmentalOpen in IMG/M

Geographical Distribution

Family Sequences

Note: Some of these sequences are restricted under the data usage policy of the Joint Genome Institute (JGI). Using any of their features below requires a license from the corresponding author(s) of the respective datasets.

Protein ID Sample Taxon ID Habitat Sequence
ACODT_182493202038011000Fungus GardenMWKYTTRTVEEGPFAWNDLMVRYRMPRGISVQEMAPCEFEEIRFYAYTDELGATNLPPNPNQDTDFWPAPSAGLRFFRGGYEHIVDDETKACLISSGVADESNFEATSGFG
JGI24737J22298_1014219313300001990Corn RhizosphereMTCWTLTTPTVDEAPFAWNPLMERFRIPRAVSIVEVSPGVYKQTRYDAYTNEIGATNLPVNPNEQDTDFWPAPSAGLHYFRGGYEWQVDDQIRADIIAS
JGI25612J43240_100015683300002886Grasslands SoilVTSWIFTTPTIAETPFAWNPLMQRFRMDRGISIQEVSPCVYQQVRYDAYTQEIGATNLPTNPNADDTDFWPAPSGGLHYFRGGYEWIVDDATKACLISSDVGVDESNFTITPNQGFGQGGFGEGGFGE*
JGI25615J43890_109314213300002910Grasslands SoilKGVSVTSWIFTTPTIAETPFAWNPLMQRFRMDRGISIQEVSPCVYQQVRYDAYTQEIGATNLPTNPNADDTDFWPAPSGGLHYFRGGYEWIVDDATKACLISSDVGVDESNFTITPNQGFGQGGFGEGGFGE*
rootH1_10002237483300003316Sugarcane Root And Bulk SoilMANWYFTTPTVAEAPFAWNPLMERFRMDRALSVVETAPGVFATNRYYAYTDEIGASNLPPNPNANDTNFYPAPAAGLRYYRGGYQHLVSDQDRTDLINSGVVDASNFTPAP*
rootH1_1001557423300003316Sugarcane Root And Bulk SoilLACWILTCPTVDEAPFAWSPLMERFRISRAISIKEVSPGQYVQVRYDAYTNELGAVNLGGDPNQDSDFWPAPSAGLHYFRGGYEWKVDDTIKAQIIASGAATEANFTPCPGTFGEGGFGEGLFGE*
soilH1_1009280433300003321Sugarcane Root And Bulk SoilVASWYFTTPTVAEAPFAWNPLMERFRMDRALSVIETAPGVFATSRYYAYTDEIGASNLPVNPNANDTDFYPAPATGLRYYRGGYQHIVSTQDKNDLIASGVVTESNFTLLPGFGFGQGGFGQGGFGS*
soilH1_1035048333300003321Sugarcane Root And Bulk SoilVADWKYTTRTVAEAPFAWNPLMERYRMDRAISVVEVSPGVYEEVRYDAYTNELGAVNLPPTNAAPEFDQPRAGLHYFRGGYEWIVDDQVRTDLINSGVADASNFVPA*
rootL2_10000402403300003322Sugarcane Root And Bulk SoilVAHWLFTTPTVSEAPFAWNPLMERYRMDRAISVVEVSPHVYEQTRYDAYTNEIGAVNYPVNPNENDTDFYPAPATGLHYFRGGYEWIVDDTVKNDIIASGAADASNFSPAPGYGFGEGGFGEGGFGL*
rootL2_10000430193300003322Sugarcane Root And Bulk SoilMDRAISVVEVSPHVYEQTRYDAYTNEIGAVNYPVNPNENDTDFYPAPATGLHYFRGGYEWIVDDTVKNDIIASGAADASNFTPAPGYGFGEGGFGEGGFGL*
soilH2_1013798233300003324Sugarcane Root And Bulk SoilVADWKYTTRTVAEAPFAWNPLMERFRMDRAISVVEVSPGVYEEVRYDAYTNELGAVNLPNVDYPPEYEVPRAGLHYFRGGYEWIVSDQVRTDLINSGVADASNFTPA*
Ga0058691_1000369253300003838AgaveMANWVFITPTVDEAPFAWSPLMERFRLTRGVSVVEVSPGQYETTRYDAYTNELGAENLPQNPNQDTEFWPAERAGLHYFRGGYEWIVDDQIRSDIIASGAATAANFTPV*
Ga0062595_10061060423300004479SoilRYRMNRGLSVVEVSPCVYETTRYDAYTNEIGALNLPVNPNAQDTDFWPAPSAGLHYFRGGYEHLVDDATKACLISSGVADNSNFVPAPDQGFGEGGFGEGGFGS*
Ga0066816_100000473300005158SoilVATWKYTTRTVAEAPFAWNDLMVRYSMNRGVSVQEVSPCNYEVVRYYAYTDELGAKNLPQNPNQDTTFWPAPSAGLNFFRGGYEHIVDDATKACLIASGVADESNFQSTTPDTGFGEGGFGEGGFGE*
Ga0070690_10010489133300005330Switchgrass RhizosphereMNRGLSVVEVSPCVYETTRYDAYTNEIGAVNLPVNPNAQDTDFWPAPSAGLHYFRGGYEHLVDDATKACLISSGVADNSNFVLAPNQGFGQGGFGEGGFGS*
Ga0070687_10031410333300005343Switchgrass RhizosphereVATWIFTTPTVAEAPFAWNPLMERYRMNRGLSVVEVSPCVYETTRYDAYTNEIGAVNLPVNPNAQDTDFWPAPSAGLHYFRGGYEHLVDDATKACLISSGVADNSNFVLAPNQGFGQGGFGEGGFGS*
Ga0070692_1139673213300005345Corn, Switchgrass And Miscanthus RhizosphereSVANWYFNTPTVAEAPFAWNPLMERFRMDRAVSVVETAPGVYEQVRYDAYTNEIGAVNLPANPNANDTDFYPAPATGLHYFRGGYVHTVDDAVKANIIASGAADASNFTPAP*
Ga0070711_10002047463300005439Corn, Switchgrass And Miscanthus RhizosphereVATWIFTTPTVAEAPFAWNPLMERYRMNRALSVVEVSPCVYETTRYDAYTNEIGALNLPVNPNAQDTDFWPAPSAGLHYFRGGYEHLVDDATKACLISSGVADNSNFVLAPNQGFGQGGFGEGGFGDE*
Ga0066686_1011888933300005446SoilVANWYFTTPTVAEAPFAWSPLMERFRLDRAISIMETAPGVYEQVRYDAYTNEIGAVNLPPNPNERDTAFYPAPSAGLHYFRGGYQHIVNDAVKADIIASGAADSSNFTPAP*
Ga0066692_1003691233300005555SoilMANWFFTTPTVAEAPFAWSPLMERFRMDRALSVQETSPGVFATTRYDAYTNEIGATNLPVNPNAGDTDFWPAASAGLRYYRGGYAWQVSDQDKADLIASGVVNSSNFVIAPDQGFGQGGFGQGGFGS*
Ga0068855_100000581343300005563Corn RhizosphereVANWYFNTPTVAEAPFAWNPLMERFRMDRAVSVVETAPGVYEQVRYDAYTNEIGAVNLPANPNANDTDFYPAPATGLHYFRGGYVHTVDDTVKANIIASGAADASNFTPAP*
Ga0066691_1039097523300005586SoilMALWQFTTPTVAEAPFAWNPLMERFRMDRALSVQETSPGVFVTTRYDAYTNEIGATNLPVNPNAGDTDFWPAASAGLRYYRGGYAWQVSDQDKADLIASGVVNSSNFVIAPDQGFGQGGFGQGGFGS*
Ga0075366_10000581323300006195Populus EndosphereMNRAISIVEVSPGVYEQVRYDAYTNEIGAVNYPPNPNELDTDFYPAVRTGLHYFRGGYEWLVDDTVKNNIIASGVADESNFTLTPNQGFGQGGFGEGGFGS*
Ga0079222_1011510263300006755Agricultural SoilVASWYFTTPTVAEAPFAWSPLMERYRIDRAVSVQETSPGVFQTTRYSAYTDEIGAINLPPNPNADDTTFYPAPSAGLRFYRGGYQHIVSDQDPADLIASGVVDASNFTPAP*
Ga0075434_10191790613300006871Populus RhizosphereVEVSPGVYEQVRYDAYTNEIGAVNYPPNPNELDTDFYPAVRTGLHYFRGGYEWLVDDTVKNNIIASGVADESNFTLTPNQGFGQGGFGEGGFGS*
Ga0075424_10220853823300006904Populus RhizosphereAEAPFAWNPLMERFRMNRAISIVEVSPGVYEQVRYDAYTNEIGAVNYPPNPNELDTDFYPAVRTGLHYFRGGYEWLVDDTVKNNIIASGVADESNFTLTPNQGFGQGGFGEGGFGS*
Ga0134063_1026812023300010335Grasslands SoilERFRMNRGISIQEVSPGVFQETRYGAYTDEIGAINLPPNPNAGDTSFYPAPSAGLRYYRGGYQHTVSDQDRTDLIASGLVDSTNFTLLPNQGFGQGGFGEGGFGDE*
Ga0134124_1003377953300010397Terrestrial SoilMERYRMNRGLSVVEVSPCVYETTRYDAYTNEIGAVNLPVNPNEQDTTFWPAPSAGLHYFRGGYEHLVDDATKACLISSGVADNSNFVQYQPGFGEGGFGEGGFGS*
Ga0137388_1147391323300012189Vadose Zone SoilMERFRLDRALSVVEVSPHVYETTRYDAYTNEIGAVNYPANPNEQDTTFWPAAAQGLHYFRGGYEWIVDDTVKADLIASGVASESNFTPAP*
Ga0137363_1111283013300012202Vadose Zone SoilMANWLYTTNTVAEAPFAWNPLMERYRIDRAVSTVEISPGLYEEVRYDAYTSTIGAVNLPTNPNEQDTDFYPAERAGLHYFQGGHEHIVDDTVKANLIASGVADTTNFVLLP*
Ga0137399_1057080823300012203Vadose Zone SoilMERYRIDRGVSTVEMSPGVYEEVRYDAYTSTIGAVNLPPNPNAQDTDFYPAERAGLHYFQGGYEHIVNDAVKADLIASAVADASNFVLLP*
Ga0137360_1055799223300012361Vadose Zone SoilMQRYRITRGISIVETSPCVYQQVRYDAYTNELGATNLPVNPNATDLPFWPAPSVGLHYFRGGYEWLVDNATKACLLGSLDAGVTESNFVPAPGTYGAGLYGDGIYGG*
Ga0137360_1186142823300012361Vadose Zone SoilVKGASAMASWIYTTNTVAEAPFAWNTLMFRYRMNRGISVQEVSPCVYEEVRYDSYSNEIGAINLPANPNAQDTDFWPAPSAGLHYFRGGYEHTVNDAVKVCLISSGVATESNFTLIDG
Ga0134056_127896823300012397Grasslands SoilMERFRMNRGISIQEVSPGVFQETRYGAYTDEIGAINLPPNPNAGDTSFYPAPSAGLRYYRGGYQHTVSDQDRTDLIASGLVDSTNFTLLPNQGFGQGGFGEGGFGDE*
Ga0150984_105798169123300012469Avena Fatua RhizosphereMERFRIDRGVSVVEVSPHVYEQVRFDAYTNELGAVNLGPNPNADDPDFYPAPRTGLHYFRGGYEWRVDDQVRSDIITSGAADASNFVLCPDQSFGYGEGGYGDGEYGG*
Ga0137358_10000022673300012582Vadose Zone SoilMERFRMNRALSVQEVSPGVYVTTRYGAYTDEIGATNLPVNPNAQDTTFWPASSAGLHYFRGGYEWPVSDQVKADLIASGVADSSNFVLAPGQGFGQGGFGEGGFGS*
Ga0137413_1000420253300012924Vadose Zone SoilVKGASAMASWIYTTNTVAEAPFAWNTLMFRYRMNRGISVQEVSPCVYEEVRYDSYSNEIGAINLPPNPNAQDTDFWPAPSAGLHYFRGGYEHTVNDAVKACLISSGVATESNFTLIDGPGFGEGGFGQGGFGS*
Ga0137416_1178441213300012927Vadose Zone SoilGASAMANWLYTTRTVSEAPFGWTPLMERYRIDRGVSTVEMSPGVYEEVRYDAYTSTIGAVNLPPNPNAQDTDFYPAERAGLHYFQGGYEHIVNDAVKADLIASAVADASNFVLLP*
Ga0164301_1047126823300012960SoilVDEAPFAWNPLMERFRIPRAVSIVEVSPGVYKQTRYDAYTNEIGATNLPVNPNEQDTDFWPAPQAGLNYFRGGYEWRVDDQTKADIIASGVATSANFTPCDLSNTYGYGGYGDGVYGG*
Ga0164309_1000380663300012984SoilVDEAPFAWNPLMERYRIARAVSVVEVSPGVYKQVRYDAYTNEIGAVNLPTNPNEQDTDFWAAPQAGLHYFRGGYEWRVDDQTRADIIASGAATSANFMPCDGTGLYGYGGYGEGG*
Ga0164309_1001836333300012984SoilVANWLYTTNTVEEAPFAWNSLHERFRMPRGVSVQEVSPGVYEEIRYYAYTDELGAVNLPPNPNQQDPDFWPAPSAGLHFFRGGYEHIVNDTVKADLIASGVATAANFVPAP*
Ga0164309_1084608313300012984SoilMDRGISVQEVAPCQYEEVRYYTYADELGAANLPTNPNQDTTFWPAPSAGLNFFRGGYEHIVDDATKACLISSGVADESNFESTTPLTGFGEGGFGEGGVLSEWVGVSEDR*
Ga0164308_1048977423300012985SoilMDRGISVQEVAPCQYEEVRYYTYTDELGAANLPTNPNQDTTFWPAPSAGLNFFRGGYEHIVDDATKACLISSGVADESNFESTTPLTGFGEGGFGEGGFGE*
Ga0164307_1010111123300012987SoilMAWKFTTPTVEEGPFAWNDLMVRYRMDRGVSVQEVSPCQYELIRYYAYTDELGAENLPQNPNQDNEFWPAPSAGLNFFRGGYEHIVDDATRACLIASDIGIDDSNFEPISGFGVGPFGLGPFGGEA*
Ga0164306_1124504823300012988SoilVANWIDTTNTVEEAPFAWNPLMERFRMPRGISTQEVSPGVYEEIRYYSYSDEIGAVNLPPNPNELDTGFYPAPSAGLHFFRGGYEHIVNDQVRTDLINSGVATASNFVPAP*
Ga0172378_1072674223300014203GroundwaterMERYRMDREISVVEVAPCQYEEVRYDAYTNELGAVNLPQNPNQDTDFWPAPQAGLHYFRGGYEHTVDDDVKACLISSGVADESNFVLVPGQLLGYGEGGYGENGYGGTT*
Ga0182041_1123213923300016294SoilWNYLHNRYRIPRAESVVEVTPGVYELTRYDAYTNEIGAVNYPSNPNADHTDFWPAPQAGLHYFRGGYEWLVDDTTKANLIASNIGIDNTNFSIAPGTFGYGGFGQGGFGG
Ga0134112_1007005333300017656Grasslands SoilAFVASWYFTTPTVAEAPFAWNPLMERFRMNRGISIQEVSPGVFQETRYGAYTDEIGAINLPPNPNAGDTSFYPAPSAGLRYYRGGYQHTVSDQDRTDLIASGLVDSTNFTLLPNQGFGQGGFGEGGFGDE
Ga0134083_1001266023300017659Grasslands SoilVASWYFTTPTVAEAPFAWNPLMERFRMNRGISIQEVSPGVFQETRYGAYTDEIGAINLPPNPNAGDTSFYPAPSAGLRYYRGGYQHTVSDQDRTDLIASGLVDSTNFTLLPNQGFGQGGFGEGGFGDE
Ga0187821_1001156063300017936Freshwater SedimentVTTWIFTTPTVAEAPFAWNPLMERFRMDRGVSVVEVSPCVYKQVRYDAYTNEIGAVNLPPNPNGEDTDFWPAPSAGLHYFRGGYEHLVDDSVKACIIASGVADETNFTVVANQGFGQGGFGEGGFGS
Ga0187822_1001452743300017994Freshwater SedimentVTTWIFTTPTVAEAPFAWNPLMERFRMDRGVSVVEVSPCIYKQVRYDAYTNEIGAVNLPPNPNGEDTDFWPAPSAGLHYFRGGYEHLVDDSVKACIIASGVADETNFTVVANQGFGQGGFGEGGFGS
Ga0184638_113483523300018052Groundwater SedimentVATYIFTTPTTEEAPFAWNDLMIRYRIPRGISIQEVSPCVYEPVRFYAYTEELGAENLPQNPNQNTDFWPAPSAGLNFFRGGYEHEVSEEVKACLISSGVATESNFTIASGFGIGGFGEGPFGGP
Ga0184632_1005046413300018075Groundwater SedimentAFRTCTSSPRRGPLVATYIFTTPTTEEAPFAWNDLMIRYRIPRGISIQEVSPCVYEPVRFYAYTEELGAENLPQNPNQNTDFWPAPSAGLNFFRGGYEHEVSEEVKACLISSGVATESNFTIASGFGIGGFGEGPFGGP
Ga0193704_1000139103300019867SoilMERYRMNRALSVQEVSPGVFATTRYSAYTDEIGALNFPPNPNAGDTTFWPAPSAGLRFYRGGYEWLVSSQDRADLIASGVVDASNFALSPAGQGFGEGGFGEGGFGD
Ga0193701_101291123300019875SoilMERYRMNRALSVQEVSPGVFATTRYSAYTDEIGALNFPQNPNAGDTTFWPAPSAGLRFYRGGYEWRVSSQDRADLIASGVVDASNFVLDPAGQGFGEGGFGEGGFGD
Ga0193712_104461023300019880SoilVATWIFTTPTVAEAPFAWSPLMERFRMDRALSVVEVSPCVYETTRYDAYTNEIGAVNLPVNPNAGDTDFWPAPSAGLHYFRGGYEHLVDDATKACLISSGVADNSNFILAPDQGFGEGGFGEGGFGS
Ga0193713_100001793300019882SoilMERFRMDRALSVVEVSPCVYETTRYDAYTNEIGAVNLPVNPNAGDTDFWPAPSAGLHYFRGGYEHLVDDATKACLISSGVADNSNFILAPDQGFGEGGFGEGGFGS
Ga0207663_10000190343300025916Corn, Switchgrass And Miscanthus RhizosphereVATWIFTTPTVAEAPFAWNPLMERYRMNRALSVVEVSPCVYETTRYDAYTNEIGALNLPVNPNAQDTDFWPAPSAGLHYFRGGYEHLVDDATKACLISSGVADNSNFVLAPNQGFGQGGFGEGGFGDE
Ga0207663_1029326113300025916Corn, Switchgrass And Miscanthus RhizosphereWIFTTPTVAEAPFAWNPLMERYRMNRALSVVEVSPCVYETTRYDAYTNEIGALNLPVNPNAGDTDFWPAPSAGLHYFRGGYEHLVDDATKACLISSGVADNSNFVPAPDQGFGEGGFGEGGFGS
Ga0207662_1000727943300025918Switchgrass RhizosphereVATWIFTTPTVAEAPFAWNPLMERYRMNRGLSVVEVSPCVYETTRYDAYTNEIGAVNLPVNPNAQDTDFWPAPSAGLHYFRGGYEHLVDDATKACLISSGVADNSNFVLAPNQGFGQGGFGEGGFGS
Ga0207649_1167829013300025920Corn RhizosphereMTCWTLTTPTVDEAPFAWNPLMERFRIPRAVSIVEVSPGVYKQTRYDAYTNEIGATNLPVNPNEQDTDFWPAPSAGLHYFRGGYEWQVDDQIRADIIASGAATAANFTPCSGVGFGDGGY
Ga0207646_1032239813300025922Corn, Switchgrass And Miscanthus RhizosphereVATWIFTTPTVAEAPFAWNPLMERYRMNRGLSVVEVSPCVYETTRYDAYTNEIGAVNLPVNPNAQDTDFWPAPSAGLHYFRGGYEHLVDDATKACLISSGVAD
Ga0207667_10000657383300025949Corn RhizosphereVANWYFNTPTVAEAPFAWNPLMERFRMDRAVSVVETAPGVYEQVRYDAYTNEIGAVNLPANPNANDTDFYPAPATGLHYFRGGYVHTVDDTVKANIIASGAADASNFTPAP
Ga0207640_1090405823300025981Corn RhizosphereMTCWTLTTPTVDEAPFAWNPLMERFRIPRAVSIVEVSPGVYKQTRYDAYTNEIGATNLPVNPNEQDTDFWPAPSAGLHYFRGGYEWQVDDQIRADIIASGAATAANFTPCSGVGFGDGGYGEGG
Ga0209438_1000045513300026285Grasslands SoilVTSWIFTTPTIAETPFAWNPLMQRFRMDRGISIQEVSPCVYQQVRYDAYTQEIGATNLPTNPNADDTDFWPAPSGGLHYFRGGYEWIVDDATKACLISSDVGVDESNFTITPNQGFGQGGFGEGGFGE
Ga0209377_103831163300026334SoilMANWFFTTPTVAEAPFAWSPLMERFRMDRALSVQETSPGVFATTRYDAYTNEIGATNLPVNPNAGDTDFWPAASAGLRYYRGGYQHTVSDQDRTDLINSGVVDSSNFTPAP
Ga0257147_101179333300026475SoilVATWIFTTPTVAEAPFAWNPLMERFRMDRALSVVETAPGVFATTRYDAYTNEIGATNLPSNPNAGDTDFWPAPSAGLRYFRGGYAHLVSDQDRIDLIASGVVTAANFTPA
Ga0209157_120232613300026537SoilSHPSSPKGASVANWYFTTPTVAEAPFAWSPLMERFRLDRAISIMETAPGVYEQVRYDAYTNEIGAVNLPPNPNERDTAFYPAPSAGLHYFRGGYQHIVNDAVKADIIASGAADSSNFTPA
Ga0208126_100032543300026843SoilVATWKYTTRTVAEAPFAWNDLMVRYSMNRGVSVQEVSPCNYEVVRYYAYTDELGAKNLPQNPNQDTTFWPAPSAGLNFFRGGYEHIVDDATKACLIASGVADESNFQSTTPDTGFGEGGFGEGGFGE
Ga0209370_1000572453300027606AgaveMANWVFITPTVDEAPFAWSPLMERFRLTRGVSVVEVSPGQYETTRYDAYTNELGAENLPQNPNQDTEFWPAERAGLHYFRGGYEWIVDDQIRSDIIASGAATAANFTPV
Ga0268264_10001278153300028381Switchgrass RhizosphereMNRGLSVVEVSPCVYETTRYDAYTNEIGAVNLPVNPNAQDTDFWPAPSAGLHYFRGGYEHLVDDATKACLISSGVADNSNFVLAPNQGFGQGGFGEGGFGS
Ga0137415_1013517313300028536Vadose Zone SoilMANWLYTTRTVSEAPFGWTPLMERYRIDRGVSTVEMSPGVYEEVRYDAYTSTIGAVNLPPNPNAQDTDFYPAERAGLHYFQGGYEHIVNDAVKADLIASAVADASNFVLLP
Ga0247828_10000616253300028587SoilVAVWIFTTPTVAEAPFAWNPLMERFRMDRAISVVEVSPGVYEQVRFDAYTNELGAINYPANPNQDTDFWPAVREGLHYFRGGYEWQVSSQVRADIIASGAADASNFVLAPGMGFGDGGFGEGGFGE
Ga0247828_1000699043300028587SoilVANWIYTTNSVEEAPFAWNSLHERFRMPRGVSVQEVSPGVYEEIRYYAYTDELGAENLPQNPNQNTDFWPAPSAGLHFFRGGYEHTVDDAVKADLIASGVATLANFVPAP
Ga0247828_1033258323300028587SoilVANWIYSTNVVEEAPFAWNSLHERFRMPRGISVQEISPGVYEEIRYYSYTDELGAENLPQNPNQNTEFWPAPSAGLNFFRGGYEHTVDDAVKADLIASGVATAANFTPAP
Ga0247818_1001404013300028589SoilTTNSVEEAPFAWNSLHERFRMPRGVSVQEVSPGVYEEIRYYAYTDELGAENLPQNPNQNTDFWPAPSAGLHFFRGGYEHTVDDAVKADLIASGVATLANFVPAP
Ga0247818_1049411513300028589SoilVAVWIFTTPTVAEAPFAWNPLMERFRMDRAISVVEVSPGVYEQVRFDAYTNELGAINYPANPNQDTDFWPAVWEGLHYFRGGYEWQVSSQVRADIIASGAADASNFVLAPGMGF
Ga0247823_1013518843300028590SoilMAWIYTTNSVEEAPFAWNSLHERFRMPRGVSVQEVSPGVYEEIRYYAYTDELGAENLPQNPNQNTEFWPAPSAGLNFFRGGYEHTVDDAVKADLIASGVATAANFTPAP
Ga0247822_10000045553300028592SoilMDRAISVVEVSPGVYEQVRFDAYTNELGAINYPANPNQDTDFWPAVREGLHYFRGGYEWQVSSQVRADIIASGAADASNFVLAPGMGFGDGGFGEGGFGE
Ga0247822_1000206653300028592SoilMAWIYTTNSVEEAPFAWNSLHERFRMPRGVSVQEVSPGVYEEIRYYAYTDELGAENLPQNPNQNTDFWPAPSAGLHFFRGGYEHTVDDAVKADLIASGVATLANFVPAP
Ga0247821_1063578123300028596SoilVANWIYSTNVVEEAPFAWNSLHERFRMPRGVSVQEVAPGVYEEIRYYAYTDELGAENLPQNPNQNTEFWPAPSAGLNFFRGGYEHTVDDTVKADLIASGVATLANFTPAP
Ga0247820_1000955753300028597SoilMALWKYTTRAVEEAPFAYNDLMFRYRIPRGISVQEVAPCQYEEIRYYAFTDELGAENLPQNPNQDTAFWPAPSAGLNFFRGGYEHEVDDDTKACLISSGVADESNFTEVL
Ga0307323_10000287153300028787SoilVATYIFTTPTTEEAPFAWNDLMIRYRIPRGISIQEVSPCVYEPVRFYAYTEELGAENLPRNPNQNTDFWPAPSAGLNFFRGGYEHEVSEEVKACLISSGVATESNFTIASGFGIGGFGEGPFGGP
Ga0307323_10000715113300028787SoilMNRALSVQEVSPGVFATTRYSAYTDEIGALNFPPNPNAGDTTFWPAPSAGLRFYRGGYEWLVSSQDRADLIASGVVDASNFVLSPAGQGFGEGGFGEGGFGD
Ga0307515_1008044463300028794EctomycorrhizaMDRAISVVEVAPCQYEEVRYDAYTNAIGAVNLPTNPNANDTDFYPAPRTGLHYFQGGYEHIVSTEVRACLISSGVADASNFTLFPGQGFGEGGFGEGGFGL
Ga0247824_1020389233300028809SoilMAWIYTTNSVEEAPFAWNSLHERFRMPRGVSVQEVSPGVYEEIRYYAYTDELGAENLPQNPNQNTEFWPAPSAGLHFFRGGYEHTVDDAVKADLIASGVATLANFVPA
Ga0247827_1002978243300028889SoilMAWIYTTNSVEEAPFAWNSLHERFRMPRGVSVQEVSPGVYEEIRYYAYTDELGAENLPQNPNQNTEFWPAPSAGLHFFRGGYEHTVDDAVKADLIASGVATLANFVPAP
Ga0265297_1063506113300029288Landfill LeachateVATWKYTTRTVAEAPFAWNDLMVRYSMNRGVSVQEVSPCNYEVVRYYAYTDELGAKNLPQNPNQDTTFWPAPSAGLNFFRGGYEHIVDDATKACLIASGVADESNFQSTTPDTGFGEGGFGE
Ga0247826_1048485313300030336SoilMAWIYTTNSVEEAPFAWNSLHERFRMPRGISVQEISPGVYEEIRYYSYTDELGAENLPQNPNQNTDFWPAPSAGLHFFRGGYEHTVDDAVKADLIASGVATLANFVPAP
(restricted) Ga0255311_114059713300031150Sandy SoilMATWTFTTPTVAEAPFAWNPLMERFRMDRALSVQEVSPGQYVTTRYGAYTDELGATNLPVNPNQDTDFWPAASAGLHYFRGGYEWLVDSQVRSDLIASGVATASNFVLTPGQGFGQGGFGQGGFGV
Ga0307495_10000001343300031199SoilVASYTYTTNTVAEAPFAWNSLMQRFRIDRGISTVEVTPGVYEEIRYDAYTEEIGAVNLPPNPNELDTPFYPAPRAGLHYFRGGYEHTVNDTVRADLIASGVATSSNFVLIP
Ga0307497_1000101283300031226SoilMANYRYTTRTIAETPFAWNPLMVRFRMERGISVEEVSPGVYEEVRYTDYTDEIGAVNLPPNPNEQDTAFWPATRAGLHFFRGGYAHTVDDTVRSNLIASGVADASNF
Ga0307509_1002207723300031507EctomycorrhizaVASWIFTTPTVAEAPFAWNPLMERFRMDRGVSVVEVSPCEYQQVRYDAYTNEIGAVNLPPNPNEQDTAFWDAPRVGLHYFRGGYEHIVDSSVRACIISSGAADASNFTLVPDQGFGEGGFGEGGFGL
Ga0307509_1003007323300031507EctomycorrhizaVASYVFTTPTVAEAPFAWSPLMERFRIDRAVSVVEVSPCVYQQVRYDAYTNEIGAVNLPPNPNEQDTAFWDAPRVGLHYFRGGYEHIVDSSVRACIISSGAADASNFTLVPDQGFGEGGFGEGGFGL
Ga0307508_1000576693300031616EctomycorrhizaMNRAVSVVEVSPGIYEQTRYDAYTNEIGAVNRPVNPNEQDTTFWPAPSAGLHYFRGGYEWIVSDQVRADIIASGAADASNFIPA
Ga0318500_1038630823300031724SoilMSLWLFTTPTVDEAPMAWNYLHNRYRIPRAESVVEVTPGVYELTRYDAYTNEIGAVNYPSNPNADHTDFWPAPQAGLHYFRGGYEWLVDDTTKANLIASNIGIDNTNFSIAPGTFGYGGFGQGGFGG
Ga0306926_1057733423300031954SoilAPMAWNYLHNRYRIPRAESVVEVTPGVYELTRYDAYTNEIGAVNYPSNPNADHTDFWPAPQAGLHYFRGGYEWLVDDTTKANLIASNIGIDNTNFSIAPGTFGYGGFGQGGFGG
Ga0310890_1140594513300032075SoilMATWKYTTRTVEEAPFAWNDLMVRYRMPRGISVQEVAPCQFEEIRYYAYTDELGATNLPQNPNQNTGFWPAPSAGLKFFRGGYEHEVDDATKACLIASGVADESNFTLTTGFGAGGIGEGPFG
Ga0315910_1148921423300032144SoilQYEQVRYDAYTNELGAINYPVNPNQDTDFWPAVREGLHYFRGGYEWQVSSQVRADIIASGAADASNFVLAPGMGFGDGGFGEGGFGE
Ga0307471_10042307033300032180Hardwood Forest SoilVAHWIFTTPTVAEAPFAWNPLMERFRMDRAISIVETSPCVYAQVRYDAYTNEIGAVNLPPNPNEQDTDFWPAPRAGLHYFRGGYEHIVDDDVRDCIIASGVADASNFTPAPSFGFGAGGFGEGGFGE
Ga0335084_1066485423300033004SoilVTTWIFTTPTVAEAPFAWNPLMERFRMDRGVSVVEVSPCVYKQVRYDAYTNEIGAVNLPPNPNGQDTDFWPAPSAGLHYFRGGYEHLVDDSVKACIISSGVADETNFTAVPDQGFGEGGFGEGGFGT
Ga0335084_1078539313300033004SoilSPLMERFRMDRGVSVVEVSPCVYKTVRYDAYTNEIGAVNLPPNPNGQDTDFWPAASAGLHYFRGGYEHLVDDSVKACIISSGVADETNFTAVPDQGFGEGGFGEGGFGT
Ga0310810_1009493853300033412SoilVACWTFTTPTVAEAPFAWNPLMERFRMDRAVSIVEVSPGVYQQVRYDAYTNEIGATNLPVNPNEQDTAFWPAARAGLHYFRGGYEHMVDDTVKADIIASGVATEANFTYCAGGFGIGPFGVGPFGGLH
Ga0247829_1041409123300033550SoilMATWKYTTRTVEEAPFAWNDLMVRYRMPRGISVQEVAPCQFEEIRYYAYTDELGATNLPQNPNQNTGFWPAPSAGLKFFRGGYEHEVDDATKACLIASGVADESNFTLTTGFGAGGFGEGPFGG


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice