NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F072045

Overview

Basic Information
Family ID F072045
Family Type Metagenome / Metatranscriptome
Number of Sequences 121
Average Sequence Length 95 residues
Representative Sequence MPLLHVRKPSEAPLPSRSSRAVREQQQKYDEFVRPIEVSDVGDLELEPGENVRSVKVRLRRASSRVGLDLDIWDANGHVYFRRVTRRGRPRKQA
Number of Associated Samples 90
Number of Associated Scaffolds 121

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 85.12 %
% of genes near scaffold ends (potentially truncated) 23.14 %
% of genes from short scaffolds (< 2000 bps) 84.30 %
Associated GOLD sequencing projects 81
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.65

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
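
The percentages in this overview can be translated back into whole sequence counts. The sketch below assumes, as the table labels suggest but the page does not state outright, that each percentage is a fraction of the 121 family sequences. The function and variable names are illustrative and not part of NMPFamsDB.

```python
# Assumption: every percentage on this page is computed over the
# "Number of Sequences" value (121) given under Basic Information.
FAMILY_SIZE = 121


def pct_to_count(pct: float, total: int = FAMILY_SIZE) -> int:
    """Return the nearest whole number of members for a reported percentage."""
    return round(pct * total / 100)


# Percentages copied from the Quality Assessment and Most Common Taxonomy blocks.
reported = {
    "genes with valid RBS motifs": 85.12,
    "genes near scaffold ends": 23.14,
    "genes from short scaffolds": 84.30,
    "most common taxonomic group (Unclassified)": 64.463,
}

counts = {label: pct_to_count(pct) for label, pct in reported.items()}

for label, n in counts.items():
    print(f"{label}: about {n} of {FAMILY_SIZE}")
```

Under that assumption, a frequency of 0.83 % in the tables further down corresponds to a single family member.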

Most Common Taxonomy
Group Unclassified (64.463 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil
(19.008 % of family members)
Environment Ontology (ENVO) Unclassified
(29.752 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Subsurface (non-saline)
(26.446 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 26.23%, β-sheet: 13.93%, Coil/Unstructured: 59.84%

Predicted 3D Structure

pTM-score: 0.65

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 121 Family Scaffolds
PF04909 | Amidohydro_2 | 8.26
PF03054 | tRNA_Me_trans | 7.44
PF12850 | Metallophos_2 | 0.83
PF03320 | FBPase_glpX | 0.83
PF02410 | RsfS | 0.83
PF00596 | Aldolase_II | 0.83
PF03721 | UDPG_MGDP_dh_N | 0.83
PF04116 | FA_hydroxylase | 0.83
PF03816 | LytR_cpsA_psr | 0.83
PF00581 | Rhodanese | 0.83
PF01979 | Amidohydro_1 | 0.83
PF02595 | Gly_kinase | 0.83
PF13425 | O-antigen_lig | 0.83
PF04203 | Sortase | 0.83

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 121 Family Scaffolds
COG0482 | tRNA U34 2-thiouridine synthase MnmA/TrmU, contains the PP-loop ATPase domain | Translation, ribosomal structure and biogenesis [J] | 7.44
COG0240 | Glycerol-3-phosphate dehydrogenase | Energy production and conversion [C] | 0.83
COG0677 | UDP-N-acetyl-D-mannosaminuronate dehydrogenase | Cell wall/membrane/envelope biogenesis [M] | 0.83
COG0799 | Ribosomal silencing factor RsfS, regulates association of 30S and 50S subunits | Translation, ribosomal structure and biogenesis [J] | 0.83
COG1004 | UDP-glucose 6-dehydrogenase | Cell wall/membrane/envelope biogenesis [M] | 0.83
COG1250 | 3-hydroxyacyl-CoA dehydrogenase | Lipid transport and metabolism [I] | 0.83
COG1316 | Anionic cell wall polymer biosynthesis enzyme TagV/TagU, LytR-Cps2A-Psr (LCP) family (peptidoglycan teichoic acid transferase) | Cell wall/membrane/envelope biogenesis [M] | 0.83
COG1494 | Fructose-1,6-bisphosphatase/sedoheptulose 1,7-bisphosphatase or related protein | Carbohydrate transport and metabolism [G] | 0.83
COG1893 | Ketopantoate reductase | Coenzyme transport and metabolism [H] | 0.83
COG1929 | Glycerate kinase | Carbohydrate transport and metabolism [G] | 0.83
COG3000 | Sterol desaturase/sphingolipid hydroxylase, fatty acid hydroxylase superfamily | Lipid transport and metabolism [I] | 0.83
COG3764 | Sortase (surface protein transpeptidase) | Cell wall/membrane/envelope biogenesis [M] | 0.83



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 64.46 %
All Organisms | root | All Organisms | 35.54 %


Associated Scaffolds


Scaffold | Taxonomy | Length
2124908009|FWIRA_GRAM18401EGVQD | Not Available | 505
3300002121|C687J26615_10127053 | Not Available | 642
3300002124|C687J26631_10014135 | All Organisms → cellular organisms → Bacteria | 2835
3300002124|C687J26631_10016515 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Dehalococcoidia → unclassified Dehalococcoidia → Dehalococcoidia bacterium | 2619
3300002124|C687J26631_10118274 | Not Available | 894
3300002568|C688J35102_118034837 | Not Available | 524
3300003892|Ga0063012_10005348 | All Organisms → cellular organisms → Bacteria | 33859
3300003892|Ga0063012_10075392 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 2512
3300003995|Ga0055438_10198706 | All Organisms → cellular organisms → Bacteria | 609
3300003998|Ga0055472_10216841 | Not Available | 594
3300004005|Ga0055448_10387458 | All Organisms → cellular organisms → Bacteria | 556
3300004064|Ga0055479_10106296 | All Organisms → cellular organisms → Bacteria | 1004
3300004145|Ga0055489_10011368 | All Organisms → cellular organisms → Bacteria | 1968
3300004463|Ga0063356_104269600 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiaceae → Hyphomicrobium → unclassified Hyphomicrobium → Hyphomicrobium sp. MC1 | 615
3300004463|Ga0063356_106232436 | Not Available | 511
3300004480|Ga0062592_101124290 | Not Available | 728
3300005436|Ga0070713_101166710 | Not Available | 745
3300005458|Ga0070681_10491009 | All Organisms → cellular organisms → Bacteria | 1141
3300005466|Ga0070685_10752896 | Not Available | 714
3300005468|Ga0070707_102167387 | Not Available | 523
3300005529|Ga0070741_10111113 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 2866
3300005529|Ga0070741_10424492 | Not Available | 1217
3300005549|Ga0070704_101164163 | Not Available | 702
3300005616|Ga0068852_101311492 | Not Available | 746
3300005836|Ga0074470_10619310 | All Organisms → cellular organisms → Bacteria | 796
3300006051|Ga0075364_10147141 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 1586
3300006051|Ga0075364_11156856 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 525
3300006865|Ga0073934_10198323 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Dehalococcoidia → unclassified Dehalococcoidia → Dehalococcoidia bacterium | 1368
3300006876|Ga0079217_11440056 | Not Available | 541
3300006903|Ga0075426_10987163 | Not Available | 636
3300007004|Ga0079218_10522665 | All Organisms → cellular organisms → Bacteria | 1057
3300007004|Ga0079218_11403688 | Not Available | 746
3300009095|Ga0079224_100669474 | All Organisms → cellular organisms → Bacteria | 1488
3300009137|Ga0066709_101054772 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Dehalococcoidia → unclassified Dehalococcoidia → Dehalococcoidia bacterium | 1193
3300009873|Ga0131077_10076451 | All Organisms → cellular organisms → Bacteria | 4280
3300010391|Ga0136847_10472267 | Not Available | 586
3300010391|Ga0136847_11978385 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 1377
3300010401|Ga0134121_12235887 | Not Available | 584
3300012210|Ga0137378_10996069 | Not Available | 752
3300012350|Ga0137372_10750534 | Not Available | 703
3300012353|Ga0137367_10609468 | Not Available | 764
3300012360|Ga0137375_10093642 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Dehalococcoidia → unclassified Dehalococcoidia → Dehalococcoidia bacterium | 3085
3300012469|Ga0150984_118548981 | Not Available | 668
3300013056|Ga0164270_136982 | Not Available | 675
3300013297|Ga0157378_11673170 | Not Available | 683
3300014269|Ga0075302_1168190 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Bacteroidia → Bacteroidales | 543
3300014302|Ga0075310_1148214 | Not Available | 532
3300014314|Ga0075316_1050023 | All Organisms → cellular organisms → Bacteria | 906
3300014325|Ga0163163_11068880 | Not Available | 870
3300015373|Ga0132257_102303900 | Not Available | 698
3300017540|Ga0180217_1035242 | All Organisms → cellular organisms → Bacteria | 2255
3300017560|Ga0182747_1159079 | Not Available | 918
3300018056|Ga0184623_10397524 | Not Available | 607
3300018074|Ga0184640_10440971 | Not Available | 580
3300018077|Ga0184633_10445075 | Not Available | 641
3300018077|Ga0184633_10475672 | Not Available | 611
3300018079|Ga0184627_10545232 | Not Available | 592
3300018084|Ga0184629_10507319 | Not Available | 627
3300018422|Ga0190265_11928294 | All Organisms → cellular organisms → Bacteria | 697
3300018432|Ga0190275_11933018 | Not Available | 668
3300018465|Ga0190269_10243534 | All Organisms → cellular organisms → Bacteria | 1008
3300018466|Ga0190268_11231077 | Not Available | 624
3300018481|Ga0190271_10902142 | All Organisms → cellular organisms → Bacteria | 1008
3300018481|Ga0190271_13494634 | Not Available | 526
3300019249|Ga0184648_1356648 | Not Available | 518
3300020200|Ga0194121_10505894 | Not Available | 588
3300022226|Ga0224512_10524765 | Not Available | 577
3300025001|Ga0209618_1053942 | Not Available | 630
3300025155|Ga0209320_10386375 | Not Available | 565
3300025159|Ga0209619_10112746 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Dehalococcoidia → unclassified Dehalococcoidia → Dehalococcoidia bacterium | 1611
3300025159|Ga0209619_10359108 | Not Available | 759
3300025160|Ga0209109_10085568 | All Organisms → cellular organisms → Bacteria | 1637
3300025160|Ga0209109_10137852 | Not Available | 1236
3300025165|Ga0209108_10352436 | Not Available | 729
3300025310|Ga0209172_10024093 | All Organisms → cellular organisms → Bacteria | 4253
3300025313|Ga0209431_10376254 | Not Available | 1100
3300025318|Ga0209519_10061539 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Dehalococcoidia → unclassified Dehalococcoidia → Dehalococcoidia bacterium | 2155
3300025318|Ga0209519_10765839 | Not Available | 513
3300025322|Ga0209641_10530404 | Not Available | 830
3300025324|Ga0209640_10003901 | All Organisms → cellular organisms → Bacteria | 13103
3300025324|Ga0209640_10090842 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → Dehalococcoidia → unclassified Dehalococcoidia → Dehalococcoidia bacterium | 2634
3300025324|Ga0209640_11329190 | Not Available | 530
3300025324|Ga0209640_11389852 | Not Available | 513
3300025325|Ga0209341_10389890 | All Organisms → cellular organisms → Bacteria | 1130
3300025325|Ga0209341_10786809 | Not Available | 722
3300025326|Ga0209342_10728269 | Not Available | 790
3300025327|Ga0209751_10577054 | Not Available | 908
3300025327|Ga0209751_10794280 | Not Available | 738
3300025792|Ga0210143_1110478 | Not Available | 509
3300025912|Ga0207707_11414182 | Not Available | 554
3300025959|Ga0210116_1016441 | Not Available | 1359
3300026029|Ga0208002_1019725 | Not Available | 653
3300027657|Ga0256865_1080271 | Not Available | 822
3300027717|Ga0209998_10071981 | Not Available | 827
3300027815|Ga0209726_10187764 | Not Available | 1103
(restricted) 3300027872|Ga0255058_10575345 | Not Available | 551
3300027964|Ga0256864_1250115 | Not Available | 508
3300028792|Ga0307504_10267492 | Not Available | 632
3300028812|Ga0247825_10151381 | Not Available | 1591
3300028812|Ga0247825_10178709 | Not Available | 1463
3300028812|Ga0247825_11272722 | Not Available | 537
3300030006|Ga0299907_10067582 | Not Available | 2883
3300030006|Ga0299907_10213516 | Not Available | 1591
3300030006|Ga0299907_11058180 | Not Available | 592
3300030606|Ga0299906_10067664 | All Organisms → cellular organisms → Bacteria | 2846
3300030606|Ga0299906_10086239 | All Organisms → cellular organisms → Bacteria | 2501
3300030620|Ga0302046_11181791 | Not Available | 602
3300030620|Ga0302046_11191308 | Not Available | 599
3300031229|Ga0299913_10048577 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 4049
3300031229|Ga0299913_10498802 | All Organisms → cellular organisms → Bacteria | 1204
3300031229|Ga0299913_11317496 | Not Available | 679
3300031801|Ga0310121_10049340 | Not Available | 2842
3300031965|Ga0326597_10081264 | All Organisms → cellular organisms → Bacteria | 3983
3300031965|Ga0326597_10311291 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1786
3300031965|Ga0326597_10342635 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1683
3300031965|Ga0326597_10580837 | All Organisms → cellular organisms → Bacteria | 1203
3300031965|Ga0326597_11536977 | Not Available | 637
3300033291|Ga0307417_10186028 | Not Available | 810
3300033417|Ga0214471_10000785 | All Organisms → cellular organisms → Bacteria | 25819
3300033417|Ga0214471_11408256 | Not Available | 525
3300033433|Ga0326726_11900162 | Not Available | 580

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
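
Each scaffold identifier in the Associated Scaffolds table pairs a numeric prefix with a scaffold name, separated by a vertical bar. The numeric prefix matches a Taxon OID in the Associated Samples table. Below is a minimal sketch for splitting such identifiers, assuming this two-part layout (with an optional "(restricted)" marker in front) holds for every row. The helper name and tuple type are hypothetical and not part of NMPFamsDB or IMG/M.

```python
from typing import NamedTuple

RESTRICTED_MARK = "(restricted)"


class ScaffoldRef(NamedTuple):
    taxon_oid: str    # numeric prefix, e.g. "3300002124"
    scaffold: str     # scaffold name, e.g. "C687J26631_10014135"
    restricted: bool  # True when the row carries the "(restricted)" marker


def parse_scaffold_id(text: str) -> ScaffoldRef:
    """Split '<taxon OID>|<scaffold name>' into its parts.

    Assumes the layout seen on this page; raises ValueError otherwise.
    """
    text = text.strip()
    restricted = text.startswith(RESTRICTED_MARK)
    if restricted:
        text = text[len(RESTRICTED_MARK):].strip()
    taxon_oid, sep, scaffold = text.partition("|")
    if not sep or not taxon_oid.isdigit() or not scaffold:
        raise ValueError(f"unexpected scaffold identifier: {text!r}")
    return ScaffoldRef(taxon_oid, scaffold, restricted)


ref = parse_scaffold_id("3300002124|C687J26631_10014135")
print(ref.taxon_oid, ref.scaffold, ref.restricted)
```

Grouping the parsed rows by taxon_oid would then show which scaffolds come from the same sample.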




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 19.01%
Soil | Environmental → Terrestrial → Soil → Loam → Unclassified → Soil | 12.40%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 8.26%
Natural And Restored Wetlands | Environmental → Aquatic → Marine → Wetlands → Unclassified → Natural And Restored Wetlands | 5.79%
Groundwater Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Groundwater Sediment | 4.96%
Soil | Environmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil | 4.13%
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 3.31%
Natural And Restored Wetlands | Environmental → Terrestrial → Soil → Wetlands → Unclassified → Natural And Restored Wetlands | 3.31%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 2.48%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural → Soil | 2.48%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 2.48%
Arabidopsis Thaliana Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Arabidopsis Thaliana Rhizosphere | 2.48%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Lake → Sediment → Freshwater Sediment | 1.65%
Hot Spring Sediment | Environmental → Aquatic → Thermal Springs → Hot (42-90C) → Unclassified → Hot Spring Sediment | 1.65%
Hot Spring Sediment | Environmental → Aquatic → Thermal Springs → Sediment → Unclassified → Hot Spring Sediment | 1.65%
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 1.65%
Corn Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere | 1.65%
Populus Endosphere | Host-Associated → Plants → Roots → Bulk Soil → Unclassified → Populus Endosphere | 1.65%
Groundwater Sediment | Environmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment | 0.83%
Freshwater Lake | Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater Lake | 0.83%
Groundwater | Environmental → Aquatic → Freshwater → Groundwater → Contaminated → Groundwater | 0.83%
Marine | Environmental → Aquatic → Marine → Oceanic → Unclassified → Marine | 0.83%
Salt Marsh | Environmental → Aquatic → Marine → Intertidal Zone → Salt Marsh → Salt Marsh | 0.83%
Sediment | Environmental → Aquatic → Marine → Sediment → Unclassified → Sediment | 0.83%
Seawater | Environmental → Aquatic → Marine → Gulf → Unclassified → Seawater | 0.83%
Sediment (Intertidal) | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Sediment (Intertidal) | 0.83%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 0.83%
Compost | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Compost | 0.83%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Agricultural Soil | 0.83%
Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil | 0.83%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 0.83%
Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 0.83%
Switchgrass Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Switchgrass Rhizosphere | 0.83%
Peat Soil | Environmental → Terrestrial → Peat → Unclassified → Unclassified → Peat Soil | 0.83%
Arabidopsis Rhizosphere | Host-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Arabidopsis Rhizosphere | 0.83%
Corn Rhizosphere | Host-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere | 0.83%
Switchgrass Rhizosphere | Host-Associated → Plants → Roots → Rhizosphere → Soil → Switchgrass Rhizosphere | 0.83%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 0.83%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 0.83%
Avena Fatua Rhizosphere | Host-Associated → Plants → Rhizosphere → Soil → Unclassified → Avena Fatua Rhizosphere | 0.83%
Wastewater | Engineered → Wastewater → Activated Sludge → Unclassified → Unclassified → Wastewater | 0.83%




Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OID | Sample Name | Habitat Type
2124908009 | Soil microbial communities from sample at FACE Site Metagenome WIR_Amb2 | Environmental
3300002121 | Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 10_1 | Environmental
3300002124 | Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 10_3 | Environmental
3300002568 | Grasslands soil microbial communities from Hopland, California, USA - 2 | Environmental
3300003892 | Hot spring sediment microbial communities from Chocolate Pots, Yellowstone National Park, Wyoming that are Fe(III) reducing - CP Core 2, 1cm | Environmental
3300003995 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_TuleA_D2 | Environmental
3300003998 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberryNW_TuleC_D2 | Environmental
3300004005 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - China_Galinas_PWA_D2 | Environmental
3300004064 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - Muzzi_PWC_D1 | Environmental
3300004145 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - RushOxbow_ThreeSqB_D2 | Environmental
3300004463 | Combined assembly of Arabidopsis thaliana microbial communities | Host-Associated
3300004480 | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of AARS Block 4 | Environmental
3300005436 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-2 metaG | Environmental
3300005458 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C8-3B metaG | Environmental
3300005466 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS S1-3L metaG | Environmental
3300005468 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG | Environmental
3300005529 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen16_06102014_R1 | Environmental
3300005549 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-2 metaG | Environmental
3300005616 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C2-2 | Host-Associated
3300005836 | Microbial communities from Youngs Bay mouth sediment, Columbia River estuary, Oregon - S.42_YBB | Environmental
3300006051 | Populus root and rhizosphere microbial communities from Tennessee, USA - Endosphere MetaG P. deltoides DD176-4 | Host-Associated
3300006865 | Hot spring sediment bacterial and archeal communities from British Columbia, Canada, to study Microbial Dark Matter (Phase II) - Larsen N4 metaG | Environmental
3300006876 | Agricultural soil microbial communities from Utah to study Nitrogen management - NC AS200 | Environmental
3300006903 | Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD5 | Host-Associated
3300007004 | Agricultural soil microbial communities from Utah to study Nitrogen management - NC Compost | Environmental
3300009095 | Agricultural soil microbial communities from Utah to study Nitrogen management - Steer compost 2015 | Environmental
3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental
3300009873 | Activated sludge microbial diversity in wastewater treatment plant from Taiwan - Wenshan plant | Engineered
3300010391 | Freshwater sediment microbial communities from Lake Superior, USA - Station SU-17. Combined Assembly of Gp0155404, Gp0155335, Gp0155336, Gp0155336, Gp0155403, Gp0155406 | Environmental
3300010401 | Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1 | Environmental
3300012210 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaG | Environmental
3300012350 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_60_16 metaG | Environmental
3300012353 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_80_16 metaG | Environmental
3300012360 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_113_16 metaG | Environmental
3300012469 | Combined assembly of Soil carbon rhizosphere | Host-Associated
3300013056 | Enriched Organic Plus compost microbial communities from Emeryville, California, USA - RNA 3rd pass 37_C BE-Lig OP (Metagenome Metatranscriptome) | Environmental
3300013297 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - M6-5 metaG | Host-Associated
3300014269 | Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - Browns_TuleB_D1 | Environmental
3300014302 | Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberryNE_CattailA_D2 | Environmental
3300014314 | Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberryNE_TuleA_D2 | Environmental
3300014325 | Switchgrass rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - S6-5 metaG | Host-Associated
3300015373 | Combined assembly of cpr5 rhizosphere | Host-Associated
3300017540 | Enriched Organic Plus compost microbial communities from Emeryville, California, USA - eDNA 3rd pass 37_C BE-Lig OP (version 2) | Environmental
3300017560 | Enriched backyard soil microbial communities from Emeryville, California, USA - eDNA 3rd pass 30_C BE-Lig BY (version 2) | Environmental
3300018056 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM1_100_b1 | Environmental
3300018074 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_200_b2 | Environmental
3300018077 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3-1_170_b1 | Environmental
3300018079 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM3_127_b1 | Environmental
3300018084 | Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32_b1 | Environmental
3300018422 | Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 124 T | Environmental
3300018432 | Populus adjacent soil microbial communities from riparian zone of Yellowstone River, Montana, USA - 550 T | Environmental
3300018465 | Populus adjacent soil microbial communities from riparian zone of Blue River, Arizona, USA - 249 IS | Environmental
3300018466 | Populus adjacent soil microbial communities from riparian zone of Blue River, Arizona, USA - 249 T | Environmental
3300018481 | Populus adjacent soil microbial communities from riparian zone of Weber River, Utah, USA - 356 T | Environmental
3300019249 | Metatranscriptome of groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM4_32 (Metagenome Metatranscriptome) | Environmental
3300020200 | Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015020 Mahale Deep Cast 50m | Environmental
3300022226 | Sediment microbial communities from San Francisco Bay, California, United States - SF_May12_sed_USGS_13 | Environmental
3300025001 | Soil microbial communities from Rifle, Colorado, USA - sediment 13ft 3 (SPAdes) | Environmental
3300025155 | Soil microbial communities from Rifle, Colorado, USA - sediment 13ft 4 | Environmental
3300025159 | Soil microbial communities from Rifle, Colorado, USA - sediment 16ft 3 | Environmental
3300025160 | Soil microbial communities from Rifle, Colorado, USA - sediment 10ft 2 | Environmental
3300025165 | Soil microbial communities from Rifle, Colorado, USA - sediment 10ft 1 | Environmental
3300025310 | Hot spring sediment bacterial and archeal communities from British Columbia, Canada, to study Microbial Dark Matter (Phase II) - Larsen N4 metaG (SPAdes) | Environmental
3300025313 | Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 13_3 (SPAdes) | Environmental
3300025318 | Soil microbial communities from Rifle, Colorado, USA - sediment 13ft 1 | Environmental
3300025322 | Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 16_1 (SPAdes) | Environmental
3300025324 | Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 10_1 (SPAdes) | Environmental
3300025325 | Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 13_2 (SPAdes) | Environmental
3300025326 | Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 16_2 (SPAdes) | Environmental
3300025327 | Soil microbial communities from Rifle, Colorado - Rifle CSP2_sed 13_1 (SPAdes) | Environmental
3300025792 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberryNW_TuleC_D2 (SPAdes) | Environmental
3300025912 | Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C8-3B metaG (SPAdes) | Environmental
3300025959 | Wetland microbial communities from San Francisco Bay, California, USA, that impact long-term carbon sequestration - RushOxbow_ThreeSqB_D2 (SPAdes) | Environmental
3300026029 | Natural and restored wetland microbial communities from the San Francisco Bay, California, USA, that impact long-term carbon sequestration - MayberryNE_CattailA_D2 (SPAdes) | Environmental
3300027657 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT145D125 HiSeq | Environmental
3300027717 | Arabidopsis thaliana rhizosphere microbial communities from the Joint Genome Institute, USA, that affect carbon cycling - Endophyte Co-N S PM (SPAdes) | Host-Associated
3300027815 | Subsurface groundwater microbial communities from S. Glens Falls, New York, USA - GMW46 contaminated, 5.4 m (SPAdes) | Environmental
3300027872 (restricted) | Seawater microbial communities from Amundsen Gulf, Northwest Territories, Canada - Cases_109_9 | Environmental
3300027964 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT147D111 HiSeq | Environmental
3300028792 | Soil microbial communities from Populus trichocarpa stands in riparian zone in the Pacific Northwest, United States - 19_S | Environmental
3300028812 | Soil microbial communities from agricultural site in Penn Yan, New York, United States - 13C_Vanillin_Day48 | Environmental
3300030006 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT152D67 | Environmental
3300030606 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT145D125 | Environmental
3300030620 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT147D111 | Environmental
3300031229 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT155D38 | Environmental
3300031801 | Marine microbial communities from Western Arctic Ocean, Canada - CB27_Tmax_986 | Environmental
3300031965 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT100D185 | Environmental
3300033291 | Salt marsh sediment microbial communities from the Plum Island Ecosystem LTER, Massachusetts, United States - WE1602-10 | Environmental
3300033417 | Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT142D155 | Environmental
3300033433 | Lab enriched peat soil microbial communities from Michigan Hollow, Ithaca, NY, United States - MHF15MN | Environmental

Geographical Distribution
(Interactive map of sample locations, powered by OpenStreetMap.)




Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID | Sample Taxon ID | Habitat | Sequence
FWIRA_07142000 | 2124908009 | Soil | SLPVLHIRKPGEAPSPSRSSRAVRELQQKYDDFVKGIDPSEVGDLELDPNDNVRSVKVRLRRASSRLGANLNIWDVSGHVYFQHVARRGRPRKTA
C687J26615_101270531 | 3300002121 | Soil | MPLLHVRKPNEAPLPSRSSRAVREQQQKYDDFVRRIDTGEVGDLELEPSENVRSVKVRLRRASSRVGIDLDIWDANGHVYFRRLTRRGRPRKQG*
C687J26631_100141353 | 3300002124 | Soil | MPLLHVRKPNEAPPPSRSSKAVREQQQKYDDFIRKIDSTDVGDLELEPSENVRSVKVRLRRASSRLGIDIDIWDANSHVYFRRVTRRGRPRRASQP*
C687J26631_100165152 | 3300002124 | Soil | MPQLHVRKPSEAPPPSRSSRAVREQQQKYDEFVRQIDAADVGDLELEPNENVRSVKVRLRRASSRMGVDLDIWDMNGHVYFKRVTRRGRPRKVS*
C687J26631_101182742 | 3300002124 | Soil | MPQLHIRKPSEAPPPSRSSRAVREQQQKYDDFVRGIETSDVGDLELEPKENVRSVKVRLRRASSRLGIDLEIWDTNGHVYFRRVTRRGRPRKKS*
C688J35102_1180348371 | 3300002568 | Soil | MLLTKRLIIPSSRRSYTLPTLHVRKPGEAPTPSRSSRAVRELQQKYDDFVRSIDPSEVGDLELDSSDNVRSVKVRLRRASSRIGANLNIWDVNGHVYFQHVTRRGRPRKVA*
Ga0063012_1000534824 | 3300003892 | Hot Spring Sediment | MPTLHIRKPHEAPPPSRSSKAVREQQQKYDEFVRRIDTNDVGDLELEPNENVRSVKVRLRRASSRLGMDIDIWDANGHVYFRRITRRGRPRKQA*
Ga0063012_100753925 | 3300003892 | Hot Spring Sediment | MPKLHVRKPGEVPPPSRASRAVREQQQKFDDFVRGIDVSDVGDLELDPNENLRSVKVRLRRASSRLGVDLDIWDANGHVYFKRVTRRGRRRQQQQA*
Ga0055438_101987062 | 3300003995 | Natural And Restored Wetlands | MPVLHVRKPNEAPPPSRSSKAVREQQQKYDDFVRKIDTTDVGDLELEPNENVRSVKVRLRRASSRLGIDLDIWDANSHVYFRRVTRRGRPRRTA*
Ga0055472_102168411 | 3300003998 | Natural And Restored Wetlands | MPKLHVRKPNEAPPPSRSSKAVREQQQKYDDFIRGIDTNDVGDLELDPGENVRSVKVRLRRASSRLNSDIDIWDVNGHVYFRKVTRRGRPRKQV*
Ga0055448_103874581 | 3300004005 | Natural And Restored Wetlands | MPQLHVRKPNEAPPPSRSSKAVREQQEKYDNFIRGIDTNDVGDLELDPGENVRSVKVRLRRASSRLNSDIDIWDVNGHVYFRKVTRRGRPRKQA*
Ga0055479_101062961 | 3300004064 | Natural And Restored Wetlands | MPQLHVRKPNEAPLPSRSSKAVRELQEKYDDFIRGVDAADVGDLELDPGENVRSVKVRLRRASSRLNTDIDIWDVNGHVYFRKVTRRGRPRKLA*
Ga0055489_100113682 | 3300004145 | Natural And Restored Wetlands | MPLLHVRKPSEAPLPSRSSKAVREQQQKYDDFIRSIDTSEVGDLELEPNENVRSVKVRLRRASSRIGVDVDIWDANGHVYFRRVTRRGRPRRSA*
Ga0063356_1042696001 | 3300004463 | Arabidopsis Thaliana Rhizosphere | MPVLHLRKPNEAPLPSRSSRAVREQQQKYDEFVRRVELNDVGDLELEPAENVRSVKVRLRRASSRLSIDLDIWDVNGHVYFRRVTR
Ga0063356_1062324361 | 3300004463 | Arabidopsis Thaliana Rhizosphere | GAEFANTITRNGNIDMPMLHVRKPNEAPLPSRSSRAVREQQQKYDDFVRKIEVSDVGDLELDEGENVRSVKVRLRRASSRLSIDLDIWDANGHVYFRRVTRRGRPRRQP*
Ga0062592_1011242902 | 3300004480 | Soil | AVALLQGLIHCEGLFKMLLTKRLLLQTSMRRQTLPTLHVRKPGEAPTPSRSSRAVRELQQKYDDFVRRIDPSEVGDLELDPSDNVRSVKVRLRRASSRLGANLNIWDVNGHVYFQHVARRGRPRKTA*
Ga0070713_1011667101 | 3300005436 | Corn, Switchgrass And Miscanthus Rhizosphere | MPTLHVRKPGEAPTPSRSSRAVRELQQKYDDFVKGIDASQIGDLELEASDNVRSVKVRLRRASSRLGINLNIWDVNGHVYFQHVARRGRPRK
Ga0070681_104910092 | 3300005458 | Corn Rhizosphere | MPTLHVRKPGEAPTPSRSSRAVRELQQKYDDFVKGIDASQIGDLELEPSDNVRSVKVRLRRASSRLGINLNIWDVNGHVYFQHVARRGRPRKQA*
Ga0070685_107528961 | 3300005466 | Switchgrass Rhizosphere | LPTLHVRKPGEAPSPSRSSRAVRDLQQKYDDFVKGIDTSEVGDLELDPNDNVRSVKVRLRRASSRLGANLNIWDVSGHVYFQHVARRGRPRKTA*
Ga0070707_1021673872 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | PHLIVKKATDVPLPSRASRAVRELQQKYDEFVRGVDVNEAGELELEPGDNIRSVKVRLRRASSRLGLDMDIWDANGRVYFRRVARRGRQRRPAG*
Ga0070741_101111135 | 3300005529 | Surface Soil | MPRLHVRKPGEVPPPSRASRAVREQQQKFDDFVRGIDVSDVGDLELDPSENVRSVKVRLRRASSRLGVDLDIWDANGHVYFKRVTRRGRRRQQQQA*
Ga0070741_104244922 | 3300005529 | Surface Soil | MPTLHVRKPGEAPTPSRSSRAVRELQQKYDDFVRGIDASQIGDLELEPSDNVRSVKVRLRRASSRLGINLNIWDVNGHVYFQHVARRGRPRKQA*
Ga0070704_1011641631 | 3300005549 | Corn, Switchgrass And Miscanthus Rhizosphere | MRRQTLPTLHVRKPGEVPTPSRSSRAVRELQQKYDDFVRGIDPSEVGDLELDPSDNVRSVKVRLRRASSRLGANLNIWDVNGH
Ga0068852_1013114921 | 3300005616 | Corn Rhizosphere | LPQLHVRKPNEAPPPSRSSRAVREQQQKYDEFVRQIDTSDVGDLELEPSENVRSVKVRLRRASSRIGVDLDIWDVSGHVYFKRVIRRGRPRKVT*
Ga0074470_106193102 | 3300005836 | Sediment (Intertidal) | MPVLHVRKPNEAPLPSRSSKAVREQQQKYDDFVRKIDSNDVGDLELEPSENVRSVKVRLRRASSRLGIDIDIWDVNSHVYFRRVTRRGRPRRTA*
Ga0075364_101471411 | 3300006051 | Populus Endosphere | LPVLHIRKPGEAPSPSRSSRAVRELQQKYDDFVKGIDPSEVGDLELDPNDNVRSVKVRLRRASSRLGANLNIWDVSGHVYFQHVARRGRPRKTA*
Ga0075364_111568561 | 3300006051 | Populus Endosphere | LPILHIRKPGEAPSPSRSSRTVRELQQKYDDFVKGIDPSEVGDLELDPNDNVRSVKVRLRRASSRLGANLNIWDVSGHVYFQHVARRGRPRK
Ga0073934_101983231 | 3300006865 | Hot Spring Sediment | MPTLHIRRPNEAPPPSRSSKAVREQQQKYDEFVRRIDTNDVGDLELEPNENVRSVKVRLRRASSRLGMDIDIWDANGHVYFRRITRRGRPRKQA*
Ga0079217_114400562 | 3300006876 | Agricultural Soil | MPVLHLRKPNEAPQPSRSSRAVRELQQKYDEFVRRVETSDVGDLELEPNENVRSVKVRLRRASSRVGVDLGIWDVNGHVYFRRVTRRGRPRRPA*
Ga0075426_109871632 | 3300006903 | Populus Rhizosphere | MPHLIVKKATEVPLPSRASRAVRELQQKYDEFVRRVEVNEAGELELEPGDNIRSVKVRLRRASSRLGLDIDIWDSNGRVYFRRVTRRGRQRRPA*
Ga0079218_105226651 | 3300007004 | Agricultural Soil | REISMPQLHVRKPSEAPLPSRSSKAVREQQQKYDEFVRRVDTTDVGDLELDTNENVRSVKVRLRRASSRLNVDLDIWDADGHVYFRRVTRRGRPRKQP*
Ga0079218_114036881 | 3300007004 | Agricultural Soil | MPQLHVRKPSEAPLPSRSSKAVREQQQKYDEFVRRVDTADVGDLELDTNENVRSVKVRLRRASSRLNVDLDIWDADGHVYFRRVTRRGRPRKQP*
Ga0079224_1006694742 | 3300009095 | Agricultural Soil | MPKLHVRKPNEAPPPSRSSKAVREQQQRYDDFIRGIDATEVGDLELEPGENVRSVKVRLRRASSRLNTDIDIWDVNGHVYFRKVTRRGRPRKQA*
Ga0066709_1010547724 | 3300009137 | Grasslands Soil | MPLLHVRKPSEAPLPSRSSRAVREQQQKYDEFVRPIEVSDVGDLELEPGENVRSVKVRLRRASSRVGLDLDIWDANGHVYFRRVTRRGRPRKQA*
Ga0131077_100764517 | 3300009873 | Wastewater | MRLVSGRKREVTVPVLHVRKASEAPPPSRSSRAVREQQQKYDDFVRGIEANEVGDLELEPSENVRSVKVRLRRASSRLGLDIEIWDANGHVYFQRVARRARRKPA*
Ga0136847_104722671 | 3300010391 | Freshwater Sediment | MPRLHVRKPGDVPPPSRASRAVREQQQKFDDFVRGIDVSDVGDLELEAGENLRSVKVRLRRASSRLGVDLDIWDANGHVYFKRVTRRGRPRRQQA*
Ga0136847_119783852 | 3300010391 | Freshwater Sediment | LPSRASRAVRELQQKYDNFVRGVDVNEAGELELEAGDNIRSVKVRLRRASSRLSVDLDIWDANGKVYFRRVTRRGRPRRQA*
Ga0134121_122358872 | 3300010401 | Terrestrial Soil | HVRKPGEAPSPSRSARAVRDLQQKYDDFVKGIDTSEVGDLELDPNDNVRSVKVRLRRASSRLGANLNIWDVSGHVYFQHVARRGRPRKTA*
Ga0137378_109960691 | 3300012210 | Vadose Zone Soil | MPHLIVKKATDVPLPSRASRAVRELQQKYDEFVRGIDINEAGELELEAGDNIRSVKVRLRRASSRLGLDIDIWDANGRVYFRRVTRRGRQRRPA*
Ga0137372_107505342 | 3300012350 | Vadose Zone Soil | MRLEQKGAFRMPHLIVKKATDVPLPSRASRAVRELQQKYDEFVRGVDVNEAGELELEPGDNIRSVKVRLRRASSRLGLDIDIWDANGRVYFRRVTRKGRQRRPA*
Ga0137367_106094682 | 3300012353 | Vadose Zone Soil | MPHLIVKKATDVPLPSRASRAVRELQQKYDEFVRGVDVNEAGELELEPGDNIRSVKVRLRRASSRLGLDIDIWDANGRVYFRRVTRKGRQRRPA*
Ga0137375_100936423 | 3300012360 | Vadose Zone Soil | LPQLHIRKPNEAPLPSRSSRAVREQQEKYDDFVRTIDASEVGDLELEPAENVRSVKVRLRRASSRVGVDLDIWDVNSHVYFKRVTRRGRPRKTS*
Ga0150984_1185489811 | 3300012469 | Avena Fatua Rhizosphere | RSYTLPTLHVRKPGEAPTPSRSSRAVRELQQKYDDFVRSIDPSEVGDLELDSSDNVRSVKVRLRRASSRIGANLNIWDVNGHVYFQHVTRRGRPRKVA*
Ga0164270_1369821 | 3300013056 | Soil | LWSDLIDQSISLKEEISVPVLHVRKPNEAPPPSRSSRAVREQQQKYDEFVRKIDANDVGDLELEPNENLRSVKVRLRRASSRLGVDIDIWDANGHVYFQRVTRRGRRRQA*
Ga0157378_116731702 | 3300013297 | Miscanthus Rhizosphere | LPVLHVRKPGEAPTPSRSSRAVRELQQKYDDFVKGIDPSEVGDLELDPSDNVRSVKVRLRRASSRLGANLNIWDVSGHVYFQHVTRRGRPRKTA*
Ga0075302_11681901 | 3300014269 | Natural And Restored Wetlands | MPTLHIRKPNEAPLPSRSSRAVREQQQRYDEFVRRVEVTDVGDLELEPSENVRSIKVRLRRASSRLGIDLDIWDANGHVYFRRVTRRGRTR
Ga0075310_11482142 | 3300014302 | Natural And Restored Wetlands | MPVLHLRKPNEAPLPSRSSRAVREQQQRYDEFVRRVEVSDVGDLELEAGENIRSVKVRLRRASSRLGIDLDIWDANGRVYFRRVTRR
Ga0075316_10500231 | 3300014314 | Natural And Restored Wetlands | ITTLYQESYMPQLHVRKPNEAPPPSRSSKAVREQQQKYDDFIRGIDTNDVGDLELDPGENVRSVKVRLRRASSRLSSDIDIWDVNGHVYFRKVTRRGRPRKQA*
Ga0163163_110688801 | 3300014325 | Switchgrass Rhizosphere | TLHVRKPGEAPTPSRSSRAVRELQQKYDDFVRRIDPSEVGDLELDPADNVRSVKVRLRRASSRIGAYLNIWDVNGHVYFQHVARRGRPRKTA*
Ga0132257_1023039002 | 3300015373 | Arabidopsis Rhizosphere | LPTLHVRKPGEAPTPSRSSRAVRELQQKYDDFVRRIDPSEVGDLELDPSDNVRSVKVRLRRASSRIGANLNIWDVNGHIYFQHVTRRGRPRKIA*
Ga0180217_10352425 | 3300017540 | Compost | VPVLHVRKPNEAPPPSRSSRAVREQQQKYDEFVRKIDANDVGDLELEPNENLRSVKVRLRRASSRLGVDIDIWDANGHVYFQRVTRRGRRRQA
Ga0182747_11590792 | 3300017560 | Soil | LPVLHVRKPNEAPPPSRSSRAVREQQQKYDDFVRKIDTNDVGDLELEPNENLRSVKVRLRRASSRLGLDIDIWDANGHVYFQRVTRRSRRRQS
Ga0184623_103975241 | 3300018056 | Groundwater Sediment | MPVLHLRKPNEAPLPSRSSRAVREQQQRYDEFVRRVEVSDVGDLELDQGENIRSVKVRLRRASSRLGIDLDIWDANGRVYFRRITRRGRPRSRQA
Ga0184640_104409712 | 3300018074 | Groundwater Sediment | MPRLHVRKPGEVPPPSRASRAVREQQQKFDDFVRGIDVSDVGDLELEAGENLRSVKVRLRRASSRLGVDLDIWDANGHVYFKRVTRRGRPRRQQS
Ga0184633_104450751 | 3300018077 | Groundwater Sediment | MPILHLRKPSEAPLPSRSSRAVREQQQKYDEFVRRVELNDVGDLELEPSENVRSVKVRLRRASSRLSIDLDIWDVNGHVYFRRVTRRGRPRKS
Ga0184633_104756722 | 3300018077 | Groundwater Sediment | MPVLHVRKPNEAPLPSRSSRTVREQQQKYDEFVRRIEVSDVGDLELDQGENLRSVKVRLRRASSRLGLDIDIWDANGRVYFRRVTRRGRPRKQG
Ga0184627_105452321 | 3300018079 | Groundwater Sediment | LPQLHVRKPSEAPVPSKSSRAVREQQEKYDDFVRRIDASEVGDLELESTENVRSVKVRLRRASSRVAVELDIWDVNGHVYFKRVTRRGRPRKAS
Ga0184629_105073191 | 3300018084 | Groundwater Sediment | LPQLHVRKPSEAPVPSRSSRAVRELQEKYDDFVHKIDASEVGDLELDASENVRSVKVRLRRASSRVGVDLDIWDVNGHVYFKRVTRRGRPRKAS
Ga0190265_119282941 | 3300018422 | Soil | RIVGVLRGTTMPQLHVRKPGEAPPPSRSSRAVRELQQKYDEFIRGIEGNNVGELVLEPHKENVRSVKVRLRRASSRLGIDLSIWDVNGHVYFQRQSRRGRPRKRSA
Ga0190275_119330181 | 3300018432 | Soil | MPVLQIRKPSDASRPSRSSRAAREQQQKYDGFIKSVEGTDIGELELDPGENLRSIKVRLRRASSRLSVDIETWDANGRIYFRRITRRPRTRRQA
Ga0190269_102435341 | 3300018465 | Soil | MPVLHVRKPNEAPLPSRSSRAVREQQQKYDEFVRGIEVNDVGDLELEPSENIRSVKVRLRRASSRLGIDLDIWDANGHVYFRRVTRRGRARRPQQ
Ga0190268_112310771 | 3300018466 | Soil | MPQLHVRKPSEAPLPSRSSKAVREQQQKYDEFVRRVDTADVGDLELDTNENVRSVKVRLRRASSRLNVDLDIWDADGHVYFRRVTRRGRPRKQP
Ga0190271_109021422 | 3300018481 | Soil | MPQLHVRKPSEAPLPSRSSKAVREQQQKYDEFVRRVDTMDVGDLELDPNENVRSVKVRLRRASSRLNVDLDIWDAEGHVYFRRVTRRGRPRKQA
Ga0190271_134946341 | 3300018481 | Soil | MPQLHVRKPSEAPLPSRSSKAVREQQQKYDEFVRHVDTADVGDLELDTNENVRSVKVRLRRASSRLNVDLDIWDADGHVYFRRVTRRGRPRKQP
Ga0184648_13566481 | 3300019249 | Groundwater Sediment | LPQLHVRKPSEAPVPSRSSRAVRELQEKYDDFVHKIDASEVGDLELDASENVRSVKVRLRRASSRVGVDLDIWDANGHVYFKRVTRRGRPRKAS
Ga0194121_105058941 | 3300020200 | Freshwater Lake | LPTLHIRKPGEAPSPSRSSRAVRELQQKYDGFVRGIETSEVGHLELDPADNVRSVKVRLRRASTHVGANLTIWDVNGHVYFKHVARRGRPRKSVVA
Ga0224512_105247652 | 3300022226 | Sediment | MPQLHIRKPSEAPLPSRSTKAVRELQEKYDEFIRGVDATEVGDLELEPGENVRSVKVRLRRASSRLNTDIDIWDVNGHVYFRKVTRRGRPRKQV
Ga0209618_10539421 | 3300025001 | Soil | MPLLHVRKPNEAPLPSRSSRAVREQQQKYDDFVRRIDTGEVGDLELEPSENVRSVKVRLRRASSRVGIDLDIWDANGHVYFRRLTRRGRPRKQG
Ga0209320_103863751 | 3300025155 | Soil | MPVLRVKKPNEVPAASRASRAVREQQQRYDSFVLGVETGDAGELELEPGETVRSVKVRLRRASSRLGLDLEIWDASGKVYFRRVTKRGRGRRSA
Ga0209619_101127461 | 3300025159 | Soil | MPLLHVRKPSEAPLPSRSSRAVREQQQKYDEFVRRVETGEVGDLELDPKENVRSVKVRLRRASSRVGIDLDIWDANGHVYFRRITRRGRPRKQS
Ga0209619_103591082 | 3300025159 | Soil | MPVLHVKKPGDVPPPSRASRAVREQQEKYDDFVRRIDVSDVGDLELEPAENLRSVKVRLRRASSRLSVDLDIWDANGRVYFRRVTRRGRPR
Ga0209109_100855682 | 3300025160 | Soil | MPLLHVRKPNEAPPPSRSSKAVREQQQKYDDFIRKIDSTDVGDLELEPSENVRSVKVRLRRASSRLGIDIDIWDANSHVYFRRVTRRGRPRRASQP
Ga0209109_101378523 | 3300025160 | Soil | MPKLHVRKPGDVPPPSRASRAVREQQQKFDEFVRGIDVSDVGDLELEAGENLRSVKVRLRRASSRLGVDLDIWDANGHVYFKRVTRRGRPRRQQA
Ga0209108_103524362 | 3300025165 | Soil | LPQLHIRKPSEAPLPSRSSRAVREQQEKYDDFVRKIDASEVGDLELEPAENVRSVKVRLRRASTRVGVDVDIWDVNGHVYFKRVTRRGRPRKAS
Ga0209172_100240934 | 3300025310 | Hot Spring Sediment | MPTLHIRRPNEAPPPSRSSKAVREQQQKYDEFVRRIDTNDVGDLELEPNENVRSVKVRLRRASSRLGMDIDIWDANGHVYFRRITRRGRPRKQA
Ga0209431_103762541 | 3300025313 | Soil | LPSRSSRAVREQQQKYDEFVRRVELNDVGDLELEPSENVRSVKVRLRRASSRLSIDLDIWDVNGHVYFRRVTRRGRPRKT
Ga0209519_100615391 | 3300025318 | Soil | MPLLHVRKPNEAPLPSRSSRAVREQQQKYDDFVRRIDTGEVGDLELEPSENVRSVKVRLRRASSRVGIDLDIWDANGHVYFRRLTRRGRPRKQGGG
Ga0209519_107658391 | 3300025318 | Soil | MPLLHVRKPSEAPLPSRSSRAVREQQQKYDEFVRRVETGEVGDLELDPKENVRSVKVRLRRASSRVGIDLDIWDANGHVYFRRITR
Ga0209641_105304041 | 3300025322 | Soil | MPLLHVRKPNEAPLPSRSSRAVREQQQKYDEFVRRVETGEVGDLELDPKENVRSVKVRLRRASWRVGIDLDIWDANGHVYFRRITRRGRPRKQS
Ga0209640_100039018 | 3300025324 | Soil | MPLLHVRKPSEAPPPSRSSKAVREQQQKYDEFIRKIDTNDVGDLELEPGENVRSVKVRLRRASSRLGVDIDIWDVNSHVYFRRVTRRGRPRRAQTA
Ga0209640_100908425 | 3300025324 | Soil | MPKLHVRKPGDVPPPSRASRAVREQQQKFDEFVRGIDVSDVGDLELEAGENLRSVKVRLRRASSRLGVDVDIWDANGHVYFKRVTRRGRPRRQQA
Ga0209640_113291901 | 3300025324 | Soil | MPVLHVRKPSEAPPPSRSSKAVREQQQKYDDFVRKIDSNDVGDLELEPSENVRSVKVRLRRASSRLSIDIDIWDVNSHVYFRRVTRRGRPRRAP
Ga0209640_113898521 | 3300025324 | Soil | MPTLHVRKPSEAPLPSRSSRAVREQQQKYDDFVRTIDTNDVGDLELEDGENVRSVKVRLRRASSRVGIDLDIWDTNGHVYFRRVTRRGRPRKQA
Ga0209341_103898901 | 3300025325 | Soil | MPVLRVKKPSEVPAASRASRAVREQQQRYDGFVLGVETGDAGELELEPGETVRSVKVRLRRASSRLGVELDIWDASGKVYFRRVTKRGRGRRSA
Ga0209341_107868091 | 3300025325 | Soil | MPVLHLRKPNEAPLPSRSSRAVREQQQKYDEFVRRVELNDVGDLELEPSENVRSVKVRLRRASSRLSIDLDIWDVNGHVYFRRVTRRGRPRKT
Ga0209342_107282693 | 3300025326 | Soil | KKPSEVPAASRASRAVREQQQRYDGFVLGVETGDAGELELEPGETVRSVKVRLRRASSRLGLDLEIWDASGKVYFRRVTKRGRGRRSA
Ga0209751_105770541 | 3300025327 | Soil | MPVLHLRKPSEAPQPSRSSRAVRELQQKYDEFVRRVETSDVGDLELEPNENVRSVKVRLRRASSRVGVDLDIWDVNGHVYFRRVTRRGRPRRQA
Ga0209751_107942802 | 3300025327 | Soil | MPVLRVKKPNEVPAASRASRAVREQQQRYDGFVLGVETGDAGELELEPGETVRSVKVRLRRASSRLGLDLEIWDASGKVYFRRVTKRGRGRRSA
Ga0210143_11104781 | 3300025792 | Natural And Restored Wetlands | MPKLHVRKPNEAPPPSRSSKAVREQQQKYDDFIRGIDTNDVGDLELDPGENVRSVKVRLRRASSRLNSDIDIWDVNGHVYFRKVTRRGRPRKQV
Ga0207707_114141821 | 3300025912 | Corn Rhizosphere | MPTLHVRKPGEAPTPSRSSRAVRELQQKYDDFVKGIDASQIGDLELEPSDNVRSVKVRLRRASSRLGINLNIWDVNGHVYFQHVARRGRPRKQA
Ga0210116_10164412 | 3300025959 | Natural And Restored Wetlands | MPLLHVRKPSEAPLPSRSSKAVREQQQKYDDFIRSIDTSEVGDLELEPNENVRSVKVRLRRASSRIGVDVDIWDANGHVYFRRVTRRGRPRRSA
Ga0208002_10197251 | 3300026029 | Natural And Restored Wetlands | MPVLHLRKPNEAPLPSRSSRAVREQQQKYDEFVRRVEVTDVGDLELEPSENVRSVKVRLRRASSRLGIDLDIWDVNGHVYFRRVTRRGRPRKT
Ga0256865_10802713 | 3300027657 | Soil | MPLLHVRKPSEAPPPSRSSKAVREQQQKYDDFIRRIDTSEVGDLELEPNENVRSVKVRLRRASSRISVDVDIWDANGHVYFRRVTRRGRPRRNA
Ga0209998_100719811 | 3300027717 | Arabidopsis Thaliana Rhizosphere | LLPVTVLNLSLAYDLNHSYTFKAIPAHLWIDSREISMPQLHVRKPSEAPLPSRSSKAVREQQQKYDEFVRRVDTTDVGDLELDTNENVRSVKVRLRRASSRLNVDLDIWDADGHVYFRRVTRRGRPRKQP
Ga0209726_101877641 | 3300027815 | Groundwater | MPVLHLRKPTETPLPSRSSRAVREQQQKYDEFVRRVEVSDVGDLELEPSENVRSVKVRLRRASSRVGIDLDIWDANGHVYFRRVTRRGRPRKQG
(restricted) Ga0255058_105753452 | 3300027872 | Seawater | MPVLHLRKPHEAPLPSRSPRVVRERQQKFDDFIRRIEGNDVGELELEPGESLRSVRVRLRRASSRLGVEIDIWDASGHIYFRRVSRRGRPRKQA
Ga0256864_12501151 | 3300027964 | Soil | MPLLHVRKPSEAPPPSRSSKAVREQQQKYDDFIRRIDASEVGDLELEPSENVRSVKVRLRRASSRIGVDVDIWDANGHVYFRRVTRRGRPRRTA
Ga0307504_102674922 | 3300028792 | Soil | MPLLHIRKPNEAPPPSRSSRAVREQQQKYDDFVRGIDANEVGDLELEPRENVRSVKVRLRRASSRLALDLDIWDANGHVYFRRVSRRGRPRKQAS
Ga0247825_101513814 | 3300028812 | Soil | MPVLQLRKPSDAPSPSRSSRAAREQQQKYDGFIRSVEGSDVGELELEPGENLRSIKVRLRRASSRLGVDIETWDANGHIYFRRITRRPRTRRQA
Ga0247825_101787093 | 3300028812 | Soil | MPVLQLRKPSDAPSPSRSSRAAREQQQKYDGFIRSVEGTDIGELELDPGENLRSVKVRLRRASSRLGVDIETWDANGHIYFRRITRRPRTRRQA
Ga0247825_112727221 | 3300028812 | Soil | LNRSYTFKAIPACLSIDSRENSMPQLHVRKPSEAPLPSRSSKAVREQQQKYDEFVRRVDTADVGDLELDTNENVRSVKVRLRRASSRLNVDLDIWDADGHVYFRRVTRRGRPRKQP
Ga0299907_100675826 | 3300030006 | Soil | MPLLHVRKPSEAPPPSRSSKAVREQQQKYDDFIRRIDTSDVGDLELEPSENVRSVKVRLRRASSRIGIDIDIWDANGHVYFRRVTRRGRPRRSA
Ga0299907_102135164 | 3300030006 | Soil | MPLLHVRKPSEAPPPSRSSKAVREQQQKYDDFIRRIDTSEVGDLELEPNENVRSVKVRLRRASSRIGVDVDIWDANGHVYFRRVTRRGRPRRSA
Ga0299907_110581801 | 3300030006 | Soil | MPQLHVRKPSEAPPPSRSSKAVREQQQRYDDFIRGVDANDVGDLELDPGENVRSVKVRLRRASSRLNTDIDIWDVNGHVYFRKVTRRGRPRKQA
Ga0299906_100676646 | 3300030606 | Soil | MPQLHVRKPTEAPPPSRSSKTVREQQQKYDDFIRGIDTTDVGDLELDPGENVRSVKVRLRRASSRLNTDIDIWDVNGHVYFRKVTRRGRPRKLA
Ga0299906_100862393 | 3300030606 | Soil | MPQLHVRKPSEAPPPSRSSKAVREQQQRYDDFIRGIDANDVGDLELDPGENVRSVKVRLRRASSRLNTDIDIWDVNGHVYFRRVTRRGRPRKQA
Ga0302046_111817911 | 3300030620 | Soil | PLLHVRKPSEAPPPSRSSKAVREQQQKYDDFIRRIDTSEVGDLELEPNENVRSVKVRLRRASSRISVDVDIWDANGHVYFRRVTRRGRPRRNA
Ga0302046_111913081 | 3300030620 | Soil | MPQLHVRKPNEAPPPSRSSKAVREQQQKYDDFIRGIDTNDVGDLELDAGENVRSVKVRLRRASSRLNADIDIWDVNGHVYFRKVTRRG
Ga0299913_100485774 | 3300031229 | Soil | MPLLHVRKPSEAPPPSRSSKAVREQQQKYDDFIRSIDTSEVGDLELEPNENVRSVKVRLRRASSRIGIDVDIWDANGHVYFRRVTRRGRPRRSA
Ga0299913_104988022 | 3300031229 | Soil | MPQLHVRKPNEAPPPSRSSKAVREQQQKYDDFIRGVDTNDVGDLELDPGENVRSVKVRLRRASSRLNTDIDIWDVNGHVYFRKVTRRGRPRKLA
Ga0299913_113174961 | 3300031229 | Soil | MPLLHVRKPSEAPPPSRSSKAVREQQQKYDDFIRSIDASEVGDLELEPNENVRSVKVRLRRASSRIGIDIDIWDTNGHVYFRRVTRRGRPRRTA
Ga0310121_100493401 | 3300031801 | Marine | MPLLHVRKPSEAPPLSRSSKAVREQQQKYDDFIRRIDTSEVGDLELEPNENVRSVKVRLRRASSRIAIDVDIWDVNGHVYFRRVTRRGRPRRSA
Ga0326597_100812645 | 3300031965 | Soil | MPQLHVRKPNEAPPPSRSSKAVREQQQKYDDFIRGIDTNDVGDLELDQGENVRSVKVRLRRASSRLNTDIDIWDVNGHVYFRKVTRRGRPRKQP
Ga0326597_103112912 | 3300031965 | Soil | MPVLHLRKPAEAPLPSRSSRAVREQQQRFDEFVRRVEVTDVGDLELEAGENLRSIKVRLRRASSRLGIDIDIWDADGHIYFRRVTRRGRPRKQA
Ga0326597_103426351 | 3300031965 | Soil | MPVLRLLKPGEAPPPSRVPRAMREQQQRYEEFVRGIELNEVGDLQLDAGENLRSVKVRLRRAALRLGEDLDIWDANGRVYFRRVTRRRRPRRQTAGA
Ga0326597_105808371 | 3300031965 | Soil | MPTLHVRKPSEAPLPSRSSRAVREQQQRYDEFVRRVEVTDVGDLELDPSENVRSIKVRLRRASSRLGIDLDIWDANGHVYFRRVTRRGRTRR
Ga0326597_115369772 | 3300031965 | Soil | LPQLHIRKPSEAPLPSRSSRAVREQQEKYDDFVRKIDASEVGDLELEPAENVRSVKVRLRRASTRVGVDVDIWDVNGHVYFKRVTRRGRPR
Ga0307417_101860281 | 3300033291 | Salt Marsh | MPILHIRKPSEAPPPSRSSRAVREQQQKYDDFVRGIDANDVGDLELDPNENVRSVKVRLRRASSRLGLDLDIWDANGHVYFRRVMRRGRPRKS
Ga0214471_100007853 | 3300033417 | Soil | MPLLHVRKPSEAPLPSRSSKAVREQQQKYDDFIRSIDTSEVGDLELEPNENVRSVKVRLRRASSRIGVDVDIWDANGHVYFRRVTRRGRPRRTA
Ga0214471_114082561 | 3300033417 | Soil | MPVLHLRKPSEAPLPSRSSRAVREQQQKYDDFVRRVEVSDVGELELEPSDNVRSVKVRLRRASSRLGIDLDIWDVNSRVYFRRVTRRGRPRKT
Ga0326726_119001622 | 3300033433 | Peat Soil | MPKLHVRKPGDVPPPSRASRAVREQQQKFDDFVRGIDIADVGDLELDAGENLRSVKVRLRRASSRLGVDLDIWDANGHVYFKRVTRRGRPRRQPQA




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice