NMPFamsDB


A database of Novel Metagenome Protein Families

Metagenome / Metatranscriptome Family F104096



Overview

Basic Information
Family ID F104096
Family Type Metagenome / Metatranscriptome
Number of Sequences 101
Average Sequence Length 52 residues
Representative Sequence GQGFRALLEKDPEVMKRIGGRLDEIFDPWAGLEHTDLAYERLQLGVKAR
Number of Associated Samples 87
Number of Associated Scaffolds 101
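As a quick consistency check on the fields above, the representative sequence can be measured directly. The sketch below is illustrative only; the helper name `residue_counts` is hypothetical and not part of NMPFamsDB.

```python
from collections import Counter

# Representative sequence as listed for family F104096.
REPRESENTATIVE = "GQGFRALLEKDPEVMKRIGGRLDEIFDPWAGLEHTDLAYERLQLGVKAR"


def residue_counts(sequence: str) -> Counter:
    """Count each one-letter amino acid code in the sequence (hypothetical helper)."""
    return Counter(sequence)


if __name__ == "__main__":
    counts = residue_counts(REPRESENTATIVE)
    print(f"length: {len(REPRESENTATIVE)} residues")
    # Show the five most frequent residues.
    for residue, n in counts.most_common(5):
        print(f"{residue}: {n}")
```

The representative sequence is 49 residues long, slightly below the reported family average of 52.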

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 1.98 %
% of genes near scaffold ends (potentially truncated) 97.03 %
% of genes from short scaffolds (< 2000 bps) 90.10 %
Associated GOLD sequencing projects 82
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.45

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
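The percentages in the Quality Assessment table can be read back as gene counts. The sketch below assumes each percentage is taken over the 101 family members listed under Basic Information; the page does not state the denominator, so treat this as an assumption. The function name `implied_count` is hypothetical.

```python
FAMILY_SIZE = 101  # "Number of Sequences" reported for this family

# Percentages copied from the Quality Assessment table.
REPORTED_PERCENTAGES = {
    "valid RBS motif": 1.98,
    "near scaffold end": 97.03,
    "short scaffold (< 2000 bp)": 90.10,
}


def implied_count(percentage: float, total: int = FAMILY_SIZE) -> int:
    """Return the whole number of genes closest to `percentage` of `total`.

    Assumes the percentage was computed over `total` genes.
    """
    return round(percentage * total / 100)


if __name__ == "__main__":
    for label, pct in REPORTED_PERCENTAGES.items():
        n = implied_count(pct)
        print(f"{label}: {pct:.2f} % is about {n} of {FAMILY_SIZE} genes")
```

Under that assumption, the three rows correspond to 2, 98, and 91 genes respectively.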

Most Common Taxonomy
Group Unclassified (55.446 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (19.802 % of family members)
Environment Ontology (ENVO) Unclassified (25.743 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (59.406 % of family members)




Multiple Sequence Alignments


Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 38.96%; β-sheet: 0.00%; Coil/Unstructured: 61.04%

Predicted 3D Structure

pTM-score: 0.45

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
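The note above applies a fixed cutoff to the pTM-score. A minimal sketch of that rule, assuming the 0.7 threshold quoted on this page is the only criterion (the function name is hypothetical and not an NMPFamsDB API):

```python
LOW_CONFIDENCE_PTM = 0.7  # threshold quoted in the note above


def is_low_confidence(ptm_score: float, threshold: float = LOW_CONFIDENCE_PTM) -> bool:
    """Return True when the pTM-score falls strictly below the threshold."""
    return ptm_score < threshold


if __name__ == "__main__":
    # pTM-score reported for family F104096.
    print(is_low_confidence(0.45))
```

With the reported score of 0.45, the check returns `True`, which matches the low-quality label shown for this family.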



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 101 Family Scaffolds
PF01259 | SAICAR_synt | 79.21
PF13507 | GATase_5 | 7.92
PF02700 | PurS | 4.95
PF10397 | ADSL_C | 2.97
PF00586 | AIRS | 1.98
PF02769 | AIRS_C | 0.99
PF13205 | Big_5 | 0.99

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 101 Family Scaffolds
COG0152 | Phosphoribosylaminoimidazole-succinocarboxamide synthase | Nucleotide transport and metabolism [F] | 79.21
COG1828 | Phosphoribosylformylglycinamidine (FGAM) synthase, PurS subunit | Nucleotide transport and metabolism [F] | 4.95



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 55.45 %
All Organisms | root | All Organisms | 44.55 %


Associated Scaffolds


Scaffold | Taxonomy | Length (bp)
2088090004|P1_DRAFT_NODE_278484_len_8286_cov_16_702631 | All Organisms → cellular organisms → Bacteria | 8336
3300000887|AL16A1W_10082347 | Not Available | 645
3300001361|A30PFW6_1238299 | All Organisms → cellular organisms → Bacteria | 645
3300001414|JGI20174J14864_1004401 | All Organisms → cellular organisms → Bacteria | 776
3300001537|A2065W1_11430998 | Not Available | 505
3300001593|JGI12635J15846_10782393 | Not Available | 547
3300001661|JGI12053J15887_10212278 | Not Available | 978
3300001661|JGI12053J15887_10391816 | Not Available | 668
3300002245|JGIcombinedJ26739_100648687 | All Organisms → cellular organisms → Bacteria | 933
3300002911|JGI25390J43892_10017923 | All Organisms → cellular organisms → Bacteria | 1693
3300002912|JGI25386J43895_10025573 | All Organisms → cellular organisms → Bacteria | 1738
3300002914|JGI25617J43924_10306936 | All Organisms → cellular organisms → Bacteria | 547
3300005176|Ga0066679_10954786 | Not Available | 536
3300005177|Ga0066690_11015955 | Not Available | 520
3300005180|Ga0066685_10911916 | All Organisms → cellular organisms → Bacteria | 586
3300005406|Ga0070703_10109157 | Not Available | 987
3300005435|Ga0070714_101300396 | All Organisms → cellular organisms → Bacteria | 710
3300005444|Ga0070694_100946105 | Not Available | 713
3300005444|Ga0070694_101863428 | Not Available | 513
3300005445|Ga0070708_100938347 | Not Available | 812
3300005445|Ga0070708_100962910 | All Organisms → cellular organisms → Bacteria | 800
3300005445|Ga0070708_101542163 | Not Available | 619
3300005446|Ga0066686_10873931 | All Organisms → cellular organisms → Bacteria | 592
3300005447|Ga0066689_10442725 | All Organisms → cellular organisms → Bacteria | 815
3300005467|Ga0070706_100078349 | All Organisms → cellular organisms → Bacteria | 3060
3300005518|Ga0070699_100246774 | All Organisms → cellular organisms → Bacteria | 1595
3300005536|Ga0070697_100872486 | All Organisms → cellular organisms → Bacteria | 798
3300005568|Ga0066703_10239090 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1103
3300005587|Ga0066654_10024863 | All Organisms → cellular organisms → Bacteria | 2438
3300005764|Ga0066903_108842334 | Not Available | 511
3300006057|Ga0075026_100327262 | Not Available | 844
3300006057|Ga0075026_100637073 | Not Available | 630
3300006638|Ga0075522_10464486 | All Organisms → cellular organisms → Bacteria | 590
3300006800|Ga0066660_10300745 | All Organisms → cellular organisms → Bacteria | 1280
3300006800|Ga0066660_10506255 | Not Available | 1015
3300006806|Ga0079220_11186896 | All Organisms → cellular organisms → Bacteria | 628
3300006914|Ga0075436_100126402 | All Organisms → cellular organisms → Bacteria | 1791
3300006954|Ga0079219_10410366 | All Organisms → cellular organisms → Bacteria | 903
3300006954|Ga0079219_11069069 | All Organisms → cellular organisms → Bacteria | 680
3300009088|Ga0099830_10071403 | All Organisms → cellular organisms → Bacteria | 2519
3300009090|Ga0099827_10078547 | All Organisms → cellular organisms → Bacteria | 2569
3300009137|Ga0066709_100368574 | All Organisms → cellular organisms → Bacteria | 1980
3300009137|Ga0066709_102378245 | Not Available | 721
3300010047|Ga0126382_12536958 | Not Available | 501
3300010114|Ga0127460_1078946 | Not Available | 603
3300010358|Ga0126370_11247765 | Not Available | 694
3300010358|Ga0126370_11521746 | Not Available | 637
3300010361|Ga0126378_10137348 | All Organisms → cellular organisms → Bacteria | 2472
3300010361|Ga0126378_12259529 | Not Available | 621
3300010396|Ga0134126_11796832 | Not Available | 672
3300011269|Ga0137392_10186520 | All Organisms → cellular organisms → Bacteria | 1691
3300011270|Ga0137391_10610659 | Not Available | 913
3300011270|Ga0137391_11506164 | Not Available | 518
3300012001|Ga0120167_1098006 | Not Available | 601
3300012096|Ga0137389_10718778 | Not Available | 858
3300012189|Ga0137388_10854926 | Not Available | 842
3300012208|Ga0137376_10430818 | Not Available | 1145
3300012209|Ga0137379_10775440 | Not Available | 864
3300012285|Ga0137370_10071917 | All Organisms → cellular organisms → Bacteria | 1900
3300012358|Ga0137368_10632445 | Not Available | 678
3300012363|Ga0137390_10160276 | All Organisms → cellular organisms → Bacteria | 2237
3300012393|Ga0134052_1276906 | All Organisms → cellular organisms → Bacteria | 1259
3300012918|Ga0137396_11259777 | Not Available | 517
3300012923|Ga0137359_11617102 | Not Available | 536
3300012923|Ga0137359_11617104 | Not Available | 536
3300012925|Ga0137419_10358524 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1130
3300012927|Ga0137416_10287890 | All Organisms → cellular organisms → Bacteria | 1357
3300013294|Ga0120150_1015236 | All Organisms → cellular organisms → Bacteria | 1604
3300013768|Ga0120155_1173153 | Not Available | 579
3300013768|Ga0120155_1177335 | Not Available | 571
3300013770|Ga0120123_1049605 | Not Available | 900
3300014052|Ga0120109_1163734 | Not Available | 520
3300014056|Ga0120125_1190626 | Not Available | 508
3300015086|Ga0167655_1045801 | Not Available | 666
3300015241|Ga0137418_10111669 | All Organisms → cellular organisms → Bacteria | 2447
3300015359|Ga0134085_10081970 | All Organisms → cellular organisms → Bacteria | 1323
3300018482|Ga0066669_12363154 | Not Available | 508
3300019866|Ga0193756_1047908 | Not Available | 587
3300020002|Ga0193730_1023397 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 1786
3300021046|Ga0215015_11006293 | Not Available | 792
3300021432|Ga0210384_10088269 | All Organisms → cellular organisms → Bacteria | 2786
3300025509|Ga0208848_1055791 | Not Available | 838
3300025906|Ga0207699_11135691 | Not Available | 579
3300025922|Ga0207646_10983207 | Not Available | 746
3300025927|Ga0207687_10454653 | Not Available | 1063
3300026301|Ga0209238_1136492 | Not Available | 784
3300026330|Ga0209473_1079281 | All Organisms → cellular organisms → Bacteria | 1388
3300026333|Ga0209158_1066325 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1432
3300026529|Ga0209806_1146034 | Not Available | 910
3300026532|Ga0209160_1058825 | All Organisms → cellular organisms → Bacteria | 2164
3300026540|Ga0209376_1156415 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi → unclassified Chloroflexi → Chloroflexi bacterium | 1086
3300027671|Ga0209588_1047161 | All Organisms → cellular organisms → Bacteria | 1393
3300027678|Ga0209011_1166157 | Not Available | 615
3300027748|Ga0209689_1187465 | Not Available | 926
3300027748|Ga0209689_1254008 | Not Available | 716
3300027857|Ga0209166_10097177 | All Organisms → cellular organisms → Bacteria | 1647
3300027862|Ga0209701_10159035 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Chloroflexi | 1374
3300027986|Ga0209168_10643780 | Not Available | 505
3300028784|Ga0307282_10400776 | Not Available | 664
3300031740|Ga0307468_101690117 | Not Available | 595
3300031771|Ga0318546_11265965 | Not Available | 518
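
Each scaffold identifier in the table above joins a numeric prefix and a scaffold name with `|`. The prefixes match the Taxon OID values listed under Associated Samples, so the sketch below treats the prefix as the sample's Taxon OID. That reading, along with the names `ScaffoldRef` and `parse_scaffold_id`, is an assumption made for illustration.

```python
from collections import Counter
from typing import NamedTuple


class ScaffoldRef(NamedTuple):
    """Hypothetical container for the two parts of a scaffold identifier."""
    taxon_oid: str
    scaffold_name: str


def parse_scaffold_id(identifier: str) -> ScaffoldRef:
    """Split 'TAXON_OID|SCAFFOLD_NAME' at the first '|'."""
    taxon_oid, sep, name = identifier.partition("|")
    if not sep or not taxon_oid.isdigit() or not name:
        raise ValueError(f"unexpected scaffold identifier: {identifier!r}")
    return ScaffoldRef(taxon_oid, name)


# Three identifiers copied from the Associated Scaffolds table.
EXAMPLES = [
    "3300001661|JGI12053J15887_10212278",
    "3300001661|JGI12053J15887_10391816",
    "3300005176|Ga0066679_10954786",
]

if __name__ == "__main__":
    # Tally how many of the example scaffolds come from each sample.
    per_sample = Counter(parse_scaffold_id(s).taxon_oid for s in EXAMPLES)
    for taxon_oid, n in per_sample.items():
        print(taxon_oid, n)
```

Grouping by the prefix in this way is one route to relating the 101 scaffolds to the 87 associated samples, since several samples contribute more than one scaffold.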




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 19.80%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 15.84%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 10.89%
Permafrost | Environmental → Terrestrial → Soil → Unclassified → Permafrost → Permafrost | 9.90%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 6.93%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil | 4.95%
Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil | 4.95%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 3.96%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 2.97%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 2.97%
Arctic Peat Soil | Environmental → Terrestrial → Soil → Unclassified → Permafrost → Arctic Peat Soil | 2.97%
Watersheds | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds | 1.98%
Surface Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil | 1.98%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 1.98%
Terrestrial Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil | 0.99%
Glacier Forefield Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Glacier Forefield Soil | 0.99%
Soil | Environmental → Terrestrial → Soil → Unclassified → Permafrost → Soil | 0.99%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 0.99%
Tropical Forest Soil | Environmental → Terrestrial → Soil → Loam → Forest Soil → Tropical Forest Soil | 0.99%
Agricultural Soil | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Agricultural Soil | 0.99%
Populus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere | 0.99%
Miscanthus Rhizosphere | Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere | 0.99%
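
The taxonomy strings above are arrow-separated paths. As an illustrative sketch (the helper names are hypothetical, and only four rows are copied from the table), the paths can be split into levels and the reported percentages summed by their first three levels:

```python
from collections import defaultdict

# Four rows copied from the habitat table: (taxonomy path, % of family members).
ROWS = [
    ("Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil", 19.80),
    ("Environmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds", 1.98),
    ("Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere", 0.99),
    ("Host-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Miscanthus Rhizosphere", 0.99),
]


def split_path(path):
    """Split an arrow-separated taxonomy path into a list of level names."""
    return [level.strip() for level in path.split("→")]


def sum_by_prefix(rows, depth=3):
    """Sum percentages over rows that share the first `depth` path levels."""
    totals = defaultdict(float)
    for path, percent in rows:
        key = tuple(split_path(path)[:depth])
        totals[key] += percent
    return dict(totals)


if __name__ == "__main__":
    for key, total in sum_by_prefix(ROWS).items():
        print(" → ".join(key), f"{total:.2f}%")
```

Running the same aggregation over all 22 rows would show how much of the family comes from terrestrial soil compared with sediment and plant-associated samples.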




Associated Samples

Taxon OID | Sample Name | Habitat Type
2088090004 | Permafrost microbial communities from permafrost in Bonanza Creek, Alaska - Permafrost Layer P1 | Environmental
3300000887 | Permafrost active layer microbial communities from McGill Arctic Research Station, Canada - (A3-65cm-16A)- 1 week illumina | Environmental
3300001361 | Permafrost active layer microbial communities from McGill Arctic Research Station, Canada - (A30-PF)- 6 month illumina | Environmental
3300001414 | Arctic peat soil from Barrow, Alaska - NGEE Surface sample 210-1 shallow-072012 | Environmental
3300001537 | Permafrost active layer microbial communities from McGill Arctic Research Station, Canada - (A20-65 cm-11A)- 1 week illumina | Environmental
3300001593 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M2 | Environmental
3300001661 | Mediterranean Blodgett CA OM1_O3 (Mediterranean Blodgett coassembly) | Environmental
3300002245 | Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027) | Environmental
3300002911 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cm | Environmental
3300002912 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm | Environmental
3300002914 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm | Environmental
3300005176 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 | Environmental
3300005177 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 | Environmental
3300005180 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 | Environmental
3300005406 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-1 metaG | Environmental
3300005435 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaG | Environmental
3300005444 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-25-1 metaG | Environmental
3300005445 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaG | Environmental
3300005446 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 | Environmental
3300005447 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_138 | Environmental
3300005467 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG | Environmental
3300005518 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaG | Environmental
3300005536 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-1 metaG | Environmental
3300005568 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 | Environmental
3300005587 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_103 | Environmental
3300005764 | Tropical forest soil microbial communities from Panama analyzed to predict greenhouse gas emissions - Panama Soil - Plot 1 (version 2) | Environmental
3300006057 | Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Straight Creek_MetaG_SC_2012 | Environmental
3300006638 | Arctic peat soil microbial communities from the Barrow Environmental Observatory site, Barrow, Alaska, USA - NGEE PermafrostL2-A | Environmental
3300006800 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 | Environmental
3300006806 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100 | Environmental
3300006914 | Populus root and rhizosphere microbial communities from Tennessee, USA - Rhizosphere MetaG P. TD hybrid SRZTD5 | Host-Associated
3300006954 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA Control | Environmental
3300009088 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG | Environmental
3300009090 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG | Environmental
3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental
3300010047 | Tropical forest soil microbial communities from Panama - MetaG Plot_30 | Environmental
3300010114 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Wat_40_5_16_2 metaT (Metagenome Metatranscriptome) | Environmental
3300010358 | Tropical forest soil microbial communities from Panama - MetaG Plot_3 | Environmental
3300010361 | Tropical forest soil microbial communities from Panama - MetaG Plot_23 | Environmental
3300010396 | Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-2 | Environmental
3300011269 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaG | Environmental
3300011270 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaG | Environmental
3300012001 | Permafrost microbial communities from Nunavut, Canada - A24_80cm_12M | Environmental
3300012096 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaG | Environmental
3300012189 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG | Environmental
3300012208 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaG | Environmental
3300012209 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaG | Environmental
3300012285 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaG | Environmental
3300012358 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_100_16 metaG | Environmental
3300012363 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaG | Environmental
3300012393 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_R_Glu_40cm_5_0_1 metaT (Metagenome Metatranscriptome) | Environmental
3300012918 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaG | Environmental
3300012923 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaG | Environmental
3300012925 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly) | Environmental
3300012927 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly) | Environmental
3300013294 | Permafrost microbial communities from Nunavut, Canada - A3_65cm_0M | Environmental
3300013768 | Permafrost microbial communities from Nunavut, Canada - A35_65cm_0M | Environmental
3300013770 | Permafrost microbial communities from Nunavut, Canada - A15_5cm_18M | Environmental
3300014052 | Permafrost microbial communities from Nunavut, Canada - A23_35cm_12M | Environmental
3300014056 | Permafrost microbial communities from Nunavut, Canada - A20_5cm_0M | Environmental
3300015086 | Arctic soil microbial communities from a glacier forefield, Storglaciären, Tarfala, Sweden (Sample st-5c, rocky medial moraine) | Environmental
3300015241 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly) | Environmental
3300015359 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09182015 | Environmental
3300018482 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118 | Environmental
3300019866 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - H1m1 | Environmental
3300020002 | Soil microbial communities from a riparian zone of the East river system, Colorado, United States - U1a1 | Environmental
3300021046 | Soil microbial communities from Shale Hills CZO, Pennsylvania, United States - 90cm depth | Environmental
3300021432 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M | Environmental
3300025509 | Arctic peat soil from Barrow, Alaska - NGEE Surface sample 210-1 shallow-072012 (SPAdes) | Environmental
3300025906 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaG (SPAdes) | Environmental
3300025922 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes) | Environmental
3300025927 | Miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS M5-4 metaG (SPAdes) | Host-Associated
3300026301 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cm (SPAdes) | Environmental
3300026330 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_133 (SPAdes) | Environmental
3300026333 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 (SPAdes) | Environmental
3300026529 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 (SPAdes) | Environmental
3300026532 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 (SPAdes) | Environmental
3300026540 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 (SPAdes) | Environmental
3300027671 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 (SPAdes) | Environmental
3300027678 | Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM1_M3 (SPAdes) | Environmental
3300027748 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_149 (SPAdes) | Environmental
3300027857 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen01_05102014_R1 (SPAdes) | Environmental
3300027862 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes) | Environmental
3300027986 | Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen07_05102014_R1 (SPAdes) | Environmental
3300028784 | Soil microbial communities from the East River watershed near Crested Butte, Colorado, United States - ER_DNA_121 | Environmental
3300031740 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05 | Environmental
3300031771 | Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.169b2f19 | Environmental


Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
P1_DRAFT_00578800 | 2088090004 | Soil | LIAGPAXXXXFRGLLEQNAEVMTRIGSKLDEVFDPWAGLEHTDLAYERLGLGAKAQ
AL16A1W_100823472 | 3300000887 | Permafrost | VLEKDSEVMSRIGTRMLEIFDPWTGLEHTDLAYDRLGLGSGALKK*
A30PFW6_12382992 | 3300001361 | Permafrost | LEADPEVMSRIGTKLDEVFNPWAGLEHTDLAFEKLGLGVNTR*
JGI20174J14864_10044011 | 3300001414 | Arctic Peat Soil | YLVVQKAAMQAMDDAGGAQGVVDPGFRSLLEQNHEVMSRIGAKLDQVFDPWAGLEHTDLAYERLAIGRALRT*
A2065W1_114309982 | 3300001537 | Permafrost | EDGPGFRSALEKDNEVMSRIGPQMEGLFDPWAGLEHTDLAYDRLGLGSEARKK*
JGI12635J15846_107823932 | 3300001593 | Forest Soil | AMAEDGAGFRLRLENDKEVMSRIGTRMEEIFDPWTGLEHTDLAYDRLGLGSEVRHK*
JGI12053J15887_102122782 | 3300001661 | Forest Soil | RGFRAILEQDKEVMSRIGTRLXEVFDPWVGLEHTDLAYEKLGLGAKAH*
JGI12053J15887_103918161 | 3300001661 | Forest Soil | EMLEKDREVMTRIGTRMEEIFNPWTGLEHTDLAYDRLGLEAKAK*
JGIcombinedJ26739_1006486872 | 3300002245 | Forest Soil | AMKALDEDGPGFRALLEKDPTVMERIGGRLDEVFDPWAGLEHTDLAYDRLQLGVKAR*
JGI25390J43892_100179234 | 3300002911 | Grasslands Soil | GDGFRALLEKNEDVTNRIGPRLDEVFDPWAGLEHTDLAYQRLGLGVPAN*
JGI25386J43895_100255734 | 3300002912 | Grasslands Soil | ENSGDGFRALLEKNEDVTKRIGPRLDEVFDPWAGLEHTDLAYQRLGLGVPAN*
JGI25617J43924_103069361 | 3300002914 | Grasslands Soil | PGFRALLEKDAEVMKRIGGRLDDVFDPWAGLEHTDLAYDRLGLGVPR*
Ga0066679_109547861 | 3300005176 | Soil | LLEKNDEVMRRIGSKLEAVFDPWAGLEHTDLAYERLGLGVPTK*
Ga0066690_110159552 | 3300005177 | Soil | MEAMQESGAGFRTLLEKNDEVMRRIGSKLEAVFDPWAGLEHTDLAYERLGLGVPTK*
Ga0066685_109119161 | 3300005180 | Soil | REAGFRDLLEKNDGVQRRIGARMDKVFDPWAGLEHTDLVYERLGLGVAAP*
Ga0070703_101091572 | 3300005406 | Corn, Switchgrass And Miscanthus Rhizosphere | PGFRELLEKDADVMKRIGGHLEEVFDPWAGLEHTDLAYDRIGLGRALRA*
Ga0070714_1013003961 | 3300005435 | Agricultural Soil | RGFRELLEKDEEVARLIGRKMDQVFDPWAGLEHTDLAYERLGLGVTAK*
Ga0070694_1009461051 | 3300005444 | Corn, Switchgrass And Miscanthus Rhizosphere | LVVQKAALDATAGQGPGFRQTLEANPEVMRRIGGRLDEVFDPWAGLEHTELAYERLGLGVKAI*
Ga0070694_1018634281 | 3300005444 | Corn, Switchgrass And Miscanthus Rhizosphere | EKDADVMKRIGAHLEEVFDPWAGLEHTDLAYDRIGLGRALRA*
Ga0070708_1009383472 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | MEALQEDGPGFRAQVEKNKDVAGRLGPRLDAAFDPWAGLEHTDLAFDRLGLGVETR*
Ga0070708_1009629101 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | GQGFRALLEKDPEVMKRIGGRLDEIFDPWAGLEHTDLAYERLQLGVKAR*
Ga0070708_1015421632 | 3300005445 | Corn, Switchgrass And Miscanthus Rhizosphere | AYLVVQKAALDATAGQGPGFRQTLEANPEVMRRIGGRLDEVFDPWAGLEHTELAYERLGLGVKAI*
Ga0066686_108739312 | 3300005446 | Soil | AGFRDLLEKNDGVQRRIGARMDKVFDPWAGLEHTDLVYERLGLGVAAP*
Ga0066689_104427252 | 3300005447 | Soil | ILEKDAEVMRRIGARLDEVFDPWAGLEHTDLAYERLGLGVRR*
Ga0070706_1000783491 | 3300005467 | Corn, Switchgrass And Miscanthus Rhizosphere | KEFRALLQENDEVMRRIGDRLDQLFDAWAGLEHTDLAYERLGLGVSAK*
Ga0070699_1002467741 | 3300005518 | Corn, Switchgrass And Miscanthus Rhizosphere | QEDGPGFRAQMEKNKDVAGRLGPRLDAAFDPWAGLEHTDLAFDRLGLGVETR*
Ga0070697_1008724861 | 3300005536 | Corn, Switchgrass And Miscanthus Rhizosphere | PGGKEFRALLQENDEVTRRIGDRLDQLFDPWAGLEHTDLAYERLGLGVSAK*
Ga0066703_102390901 | 3300005568 | Soil | LENDAEVMKRIGAHLDAVFDPWAGLEHTDLAYERLQLGVKSR*
Ga0066654_100248631 | 3300005587 | Soil | GFRAQVESNPEVSSRIGERLDAVFDPWAGLEHADLAYDRLGLGVRAR*
Ga0066903_1088423342 | 3300005764 | Tropical Forest Soil | QKAAMQAMEGGGPGFRTLLEKNDEVMQRIGGKLESVFDPWAGLEHTDLAYERLRLGVQA*
Ga0075026_1003272622 | 3300006057 | Watersheds | EEDGPGFRKLLEGDTEVMSRIGSMINEMFNPWAGLEHTDLAYERLGLGVKTN*
Ga0075026_1006370732 | 3300006057 | Watersheds | FRTILEKNTEVMGRIGGRLDEVFDPWAGLEHTEIAYERLGLGSALRS*
Ga0075522_104644861 | 3300006638 | Arctic Peat Soil | GDGAGFRKLLEQDAEVMSRIGSKLDEVFDPWAGLEHTDLAYERLGLAVRAK*
Ga0066660_103007453 | 3300006800 | Soil | EGPGFRSLVEKDAAVRSRIGDRLEQVFDPWAGLERTDLAYERLGLGVKSR*
Ga0066660_105062552 | 3300006800 | Soil | ALLERNDEVSGRIGPKMDHLFDPWRGLEHTDLAYERLGLGVRTTT*
Ga0079220_111868962 | 3300006806 | Agricultural Soil | VQKAAMQAMEPGGKEFRALLQENDEVMRRIGDRLDHLFDPWAGLEHTDLAYERLGLGVSAK*
Ga0075436_1001264021 | 3300006914 | Populus Rhizosphere | TILEKDADVARRIGGRMDEVFDPWAGLEHTDLAYERLGLGVNAK*
Ga0079219_104103661 | 3300006954 | Agricultural Soil | RSGGQGGQGFRTLLQENDEVMKRIGGKLDQVFDPWAGLEHTDLAYERLGLGVSAK*
Ga0079219_110690692 | 3300006954 | Agricultural Soil | GGIGFRELLEKNADVVQRIGPMLATVFDPWTGLEHTDLAYERLGLGALK*
Ga0099830_100714031 | 3300009088 | Vadose Zone Soil | ALLEDDAEVMKRIGARLDEIFDPWAGLEHTNLAFDRLGLGAKAR*
Ga0099827_100785475 | 3300009090 | Vadose Zone Soil | IEKDAEVMKRIGGRLDDVFDPWAGLEHTELAYERVGLAVPAE*
Ga0066709_1003685744 | 3300009137 | Grasslands Soil | DGPGFRTMLEKDEEVMKRIGSRMEEVFDPWTGLEHTDLAYERLGLGAKAK*
Ga0066709_1023782452 | 3300009137 | Grasslands Soil | GGKCREILEKDAEVMTRIGVRLDEVFDPWAGLEHTDLAYERLGLGVRR*
Ga0126382_125369582 | 3300010047 | Tropical Forest Soil | RRLLERDEEVSRRIGPQMDRVFDPWAGLEHTDLAYEKLGLGVRAK*
Ga0127460_10789462 | 3300010114 | Grasslands Soil | EKNDGVQRRIGARMDKVFDPWAGLEHTDLVYERLGLGVAAP*
Ga0126370_112477652 | 3300010358 | Tropical Forest Soil | VQKAALEAMEGSGPGFRARLEKNDGVMRRIGSKMDAVFDPWQGLEHTDLAYERIGLGVASK*
Ga0126370_115217461 | 3300010358 | Tropical Forest Soil | TGFRALLESDEEVVRRIGTRMDSVFDPWAGLEHTDLAYERLGLGVTAK*
Ga0126378_101373485 | 3300010361 | Tropical Forest Soil | FRALLEKDTEVTRRIGPKLDTVFDPWQGLEHTDLAYERIGLGVTTK*
Ga0126378_122595291 | 3300010361 | Tropical Forest Soil | AMQAMDSQAVGFRTLLENDDEVRQRIGSRIDQIFDPWSGLEHTDLAYERLGLGVQA*
Ga0134126_117968322 | 3300010396 | Terrestrial Soil | YLVVQKAALDATAGEGPGFRQTLEANPEVMRRIGGRIDEVFDPWAGLEHTELAYERLGLGAKAI*
Ga0137392_101865204 | 3300011269 | Vadose Zone Soil | RALIEKDAEVMKRIGGRLDDVFDPWAGLEHTELAYERVGLAVPAE*
Ga0137391_106106592 | 3300011270 | Vadose Zone Soil | FRELLEKDADVMKRIGPHLEEVFDPWAGLEHTDLAYERLRLGVKTS*
Ga0137391_115061642 | 3300011270 | Vadose Zone Soil | GGPGFRSQIEKNRAVMDRIGARLDQVFDPWSGLEHTDLAYDRLGLAVKTR*
Ga0120167_10980062 | 3300012001 | Permafrost | MGPAFRSALEKDNEVMSRIGPQMEGLFDPWAGLEHTDLAYDRLGLGTEARKK*
Ga0137389_107187782 | 3300012096 | Vadose Zone Soil | DEEGPGFRALLEKDGEVMKRIGGRLDDVFDPWAGLEHTDLAYDRLGLGVPR*
Ga0137388_108549262 | 3300012189 | Vadose Zone Soil | LLEKDDGVMKRIGPHLEEVFDPWAGLEHTDLAYDRLGLGVKTT*
Ga0137376_104308182 | 3300012208 | Vadose Zone Soil | EGSGPGFRVLLEKDEELMKRIGPRMDKIFDPWTGLEHTDLAYERLGLAVRTT*
Ga0137379_107754402 | 3300012209 | Vadose Zone Soil | LEKDTEVVSRIGSKLDEVFDPWKGLEHTELAFDRLNLGSALPGVKAH*
Ga0137370_100719171 | 3300012285 | Vadose Zone Soil | MQAMEDPGAGFRALLEKNEDVMKRIGSKMDQVFDPWAGLEHTDLAYERLGLAVTTK*
Ga0137368_106324451 | 3300012358 | Vadose Zone Soil | ILEKDTEVVSRIGSKLDEVFDPWKGLEHTELAFDRLNLGSALPGVKAH*
Ga0137390_101602761 | 3300012363 | Vadose Zone Soil | ALDEEGAGFRALLEKDPEVMGRIGGRLDEIFDPWAGLEHTELAYDRIGLGVKTR*
Ga0134052_12769063 | 3300012393 | Grasslands Soil | DNPGTGFRTLLENNDEVANRIGTRMDQAFDPWSGLEHTDLAYERLGLGVTAK*
Ga0137396_112597772 | 3300012918 | Vadose Zone Soil | VQKAAMQALDEDGAGFRALIEKEAEVMKRIGGRLDEVFDPWAGLEHTDLAYERLGLAVPAE*
Ga0137359_116171022 | 3300012923 | Vadose Zone Soil | AMQALEGDGPGFRSLLEKDADVMKRIGSRLDEVFDPWAGLEHADLAYDRLGLGVKTT*
Ga0137359_116171042 | 3300012923 | Vadose Zone Soil | AMQALEGDGPGFRALLEKDAEVMERIGERLDEVFDPWTGLEHTDLAYERLGLGVKAR*
Ga0137419_103585242 | 3300012925 | Vadose Zone Soil | AAMQSMENAGDGFRTLLEKNEDVMNRIGARMDEVFDPWAGLEHTDLAYQRLGLGVPAN*
Ga0137416_102878901 | 3300012927 | Vadose Zone Soil | GFRKLLEKDTDVMSRIGSKLDDVFDPWAGLEHTDLAYERLGLGVKAK*
Ga0120150_10152361 | 3300013294 | Permafrost | GSGFRTLLEKDVAVMKRIGGRLDDVFDPWAGLEHTELAYERLGLGVKAR*
Ga0120155_11731531 | 3300013768 | Permafrost | PIVQKAAMEAMADDGAGFRTILEANPEVMSRIGTKLDEVFNPWAGLEHTDLAYEKLGLGAKTT*
Ga0120155_11773351 | 3300013768 | Permafrost | GPGFRATLEKDKDVMSRIGRRIDEVFDPWKGLEHTELAYERLGLGRTLRT*
Ga0120123_10496052 | 3300013770 | Permafrost | LVVQRSAMQAMAEDGTGFRSILEKDKEVMSRIGDRMEEVFDPWMGLEHTDLAYERLALGVKAK*
Ga0120109_11637341 | 3300014052 | Permafrost | LSVLEKDNEVMSRIGTRMLEIFDPWTGLEHTDLAYDRLGLGSEALKK*
Ga0120125_11906262 | 3300014056 | Permafrost | VQRHAMQAMGEDGPGFRSALEKDNEVMSRIGPQMEGLFDPWAGLEHTDLAYDRLGLGSEARKK*
Ga0167655_10458011 | 3300015086 | Glacier Forefield Soil | AMAEDGSGFRSALEKDNDVMSRIGARMEELFDPWAGLEHTDLAYDRLGLGTEARKS*
Ga0137418_1011166953300015241Vadose Zone SoilAMENSSDGFRALLEKNEDVTKRIGPRLDEVFDPWAGLEHTDLAYQRLGLGVPAN*
Ga0134085_1008197033300015359Grasslands SoilDPGAGFRALLEKNEDVMKRIGSKMDQVFDPWAGLEHTDLAYERLGLAVTTK*
Ga0066669_1236315413300018482Grasslands SoilGGRGFRALLQDNQDVMQRLGNRMEEVFDPWSGLEHTDLAYEKLGLGVSAS
Ga0193756_104790823300019866SoilVQQAAMRAMDGDGKGFRALLEENQEVMSRIGSKVKEVFDPWAGLEHTDLAYESLGLGVKA
Ga0193730_102339723300020002SoilMAEDGAGFRAMLEKDKDVMTRIGTRMDVIFDPWAGLEHTDLAYERLGLGRTLRT
Ga0215015_1100629323300021046SoilAVDGDGPGFRDLLAMDADVMRRIGAHLDEVFDPWAGLEHTDLAYERLGLGVKTS
Ga0210384_1008826913300021432SoilGQENSEETGFRKLLEKDTEVMSRIGPKLEEAFNPWAGLEHTDLAYERLGLGVKTK
Ga0208848_105579123300025509Arctic Peat SoilALEKDNEVMSRIGTRMEELFDPWAGLEHTDLAYDRLGLGTEARTS
Ga0207699_1113569113300025906Corn, Switchgrass And Miscanthus RhizosphereGFRELLEKDEEVARLIGRTMDQVFDPWAGLEHTDLAYERLGLGVTAN
Ga0207646_1098320723300025922Corn, Switchgrass And Miscanthus RhizosphereAGFRALLEKNEDVMKRIGSKLDHVFDPWAGLEHTDLAYERLGLGVTSK
Ga0207687_1045465313300025927Miscanthus RhizosphereDATAGQGPGFRQTLEANPEVMRRIGGRLDEVFDPWAGLEHTELAYERQGLGVKAI
Ga0209238_113649213300026301Grasslands SoilKAAMRSMENAGEGFRTLLEKDEDVMKRIGGRMDEIFDPWAGLEHTDLAYQRLGLGVTTK
Ga0209473_107928143300026330SoilLLEKNDEVMRRIGSKLEAVFDPWAGLEHTDLAYERLGLGVPTK
Ga0209158_106632543300026333SoilLLEKNEDVTKRIGARMDQVFDPWAGLEHTDLAYERLGLGVTTK
Ga0209806_114603423300026529SoilEGGPGFRPQIEKNRAVMERIGARLDEVFDPWAGLEHTDLAYQRLGLGVPAN
Ga0209160_105882553300026532SoilAAMQAMEGGTGFRALLQQNDEVMKRIGPKMDAVFDPWAGLEHTDLAYERLGLGVTTK
Ga0209376_115641513300026540SoilAAMRTMENAGDGFRELLEKNEDVTKRIGARMDQVFDPWAGLEHTDLAYERLGLGVTTK
Ga0209588_104716113300027671Vadose Zone SoilPGFRELLEMDAGVMKRIGPHLEEVFDPWAGLEHTDLAYERLRLGVKTS
Ga0209011_116615723300027678Forest SoilATGQTQVGFRELLEQDQQVMSRIGQKLDAVFDPWVGLEHTEIAYDRLGLGAKAH
Ga0209689_118746513300027748SoilKAAMQAMDDDGLGFRAQLENDAEVMKRIGGQLDAVFDPWAGLEHTDLAYERLQLGVKAR
Ga0209689_125400823300027748SoilAAMEAMQESGAGFRTLLEKNDEVMRRIGSKLEAVFDPWAGLEHTDLAYERLGLGVPTK
Ga0209166_1009717713300027857Surface SoilFRAQLEKNKDVMSRIGGALDAVFDPWVGLEHTDLAYERLGLGVKAV
Ga0209701_1015903513300027862Vadose Zone SoilEGPGFRALLEDDAEVMKRIGARLDEIFDPWAGLEHTNLAFDRLGLGAKAR
Ga0209168_1064378023300027986Surface SoilTDGTGYGFRELLEKNDEVRKRIGERLAGLFDPWNGLEHTDLAYERLGLGVSANN
Ga0307282_1040077613300028784SoilLIVQKAAMQGMDEQGAGFRAILEQDKEVMSRIGSRLDEVFDPWVGLEHTDLAFQRLGLGVKAQ
Ga0307468_10169011713300031740Hardwood Forest SoilRPGFRALLEKDKEVMNRIGTRLEEIFDPWTGLEHTELAYERLGLGAKAK
Ga0318546_1126596513300031771SoilQKAAMNAMETGGPGFRSLLEKDAEVVRRIGPKMDSVFDPWQGLEHTELAYERLGLGVTAK




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice