NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F045249

Metagenome / Metatranscriptome Family F045249

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F045249
Family Type Metagenome / Metatranscriptome
Number of Sequences 153
Average Sequence Length 105 residues
Representative Sequence MKKMNAISGRTTKMLLGFAPLVTLLAAVGASSRGQGQTSPHGIVTPLASANLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVPAHYHTAEEQLMAVRGDV
Number of Associated Samples 112
Number of Associated Scaffolds 153

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 100.00 %
% of genes near scaffold ends (potentially truncated) 3.27 %
% of genes from short scaffolds (< 2000 bps) 1.96 %
Associated GOLD sequencing projects 101
AlphaFold2 3D model prediction Yes
3D model pTM-score0.26

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (96.078 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(44.444 % of family members)
Environment Ontology (ENVO) Unclassified
(37.255 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(46.405 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: Yes Secondary Structure distribution: α-helix: 30.08%    β-sheet: 9.02%    Coil/Unstructured: 60.90%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.26
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 153 Family Scaffolds
PF00756Esterase 44.44
PF01850PIN 1.96
PF13470PIN_3 1.31
PF02922CBM_48 1.31
PF01112Asparaginase_2 1.31
PF01243Putative_PNPOx 0.65
PF00903Glyoxalase 0.65
PF14493HTH_40 0.65
PF00282Pyridoxal_deC 0.65
PF04185Phosphoesterase 0.65
PF08450SGL 0.65
PF14310Fn3-like 0.65
PF02475Met_10 0.65
PF03449GreA_GreB_N 0.65
PF02308MgtC 0.65
PF02604PhdYeFM_antitox 0.65
PF09957VapB_antitoxin 0.65

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 153 Family Scaffolds
COG1446Isoaspartyl peptidase or L-asparaginase, Ntn-hydrolase superfamilyAmino acid transport and metabolism [E] 1.31
COG0076Glutamate or tyrosine decarboxylase or a related PLP-dependent proteinAmino acid transport and metabolism [E] 0.65
COG0782Transcription elongation factor, GreA/GreB familyTranscription [K] 0.65
COG109223S rRNA G2069 N7-methylase RlmK or C1962 C5-methylase RlmITranslation, ribosomal structure and biogenesis [J] 0.65
COG1285Magnesium uptake protein YhiD/SapB, involved in acid resistanceInorganic ion transport and metabolism [P] 0.65
COG2161Antitoxin component YafN of the YafNO toxin-antitoxin module, PHD/YefM familyDefense mechanisms [V] 0.65
COG2264Ribosomal protein L11 methylase PrmATranslation, ribosomal structure and biogenesis [J] 0.65
COG2265tRNA/tmRNA/rRNA uracil-C5-methylase, TrmA/RlmC/RlmD familyTranslation, ribosomal structure and biogenesis [J] 0.65
COG2520tRNA G37 N-methylase Trm5Translation, ribosomal structure and biogenesis [J] 0.65
COG3174Membrane component of predicted Mg2+ transport system, contains DUF4010 domainInorganic ion transport and metabolism [P] 0.65
COG3386Sugar lactone lactonase YvrECarbohydrate transport and metabolism [G] 0.65
COG3391DNA-binding beta-propeller fold protein YncEGeneral function prediction only [R] 0.65
COG3511Phospholipase CCell wall/membrane/envelope biogenesis [M] 0.65
COG4118Antitoxin component of toxin-antitoxin stability system, DNA-binding transcriptional repressorDefense mechanisms [V] 0.65


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A96.08 %
All OrganismsrootAll Organisms3.92 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300005180|Ga0066685_10002073All Organisms → cellular organisms → Bacteria → Acidobacteria → Acidobacteriia → Acidobacteriales → Acidobacteriaceae9165Open in IMG/M
3300012685|Ga0137397_10043518All Organisms → cellular organisms → Bacteria3211Open in IMG/M
3300012925|Ga0137419_10409363All Organisms → cellular organisms → Bacteria → Acidobacteria1061Open in IMG/M
3300021168|Ga0210406_10444773All Organisms → cellular organisms → Bacteria → Acidobacteria1032Open in IMG/M
3300027862|Ga0209701_10005256All Organisms → cellular organisms → Bacteria8496Open in IMG/M
3300031820|Ga0307473_10188750All Organisms → cellular organisms → Bacteria → Acidobacteria1213Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil44.44%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil16.34%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil15.03%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil5.23%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil4.58%
SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Soil4.58%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil3.27%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil2.61%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil1.96%
WatershedsEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds0.65%
Surface SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil0.65%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere0.65%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300000789Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300001084Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM1_O1EnvironmentalOpen in IMG/M
3300002907Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_40cmEnvironmentalOpen in IMG/M
3300002908Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cmEnvironmentalOpen in IMG/M
3300002914Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cmEnvironmentalOpen in IMG/M
3300005166Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123EnvironmentalOpen in IMG/M
3300005167Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121EnvironmentalOpen in IMG/M
3300005178Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137EnvironmentalOpen in IMG/M
3300005180Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134EnvironmentalOpen in IMG/M
3300005181Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127EnvironmentalOpen in IMG/M
3300005187Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_124EnvironmentalOpen in IMG/M
3300005446Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135EnvironmentalOpen in IMG/M
3300005554Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_110EnvironmentalOpen in IMG/M
3300005556Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156EnvironmentalOpen in IMG/M
3300005558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_147EnvironmentalOpen in IMG/M
3300005586Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140EnvironmentalOpen in IMG/M
3300005591Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF1EnvironmentalOpen in IMG/M
3300005602Reference soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire, USA - Hubbard Brook CCASE Soil Metagenome REF2EnvironmentalOpen in IMG/M
3300005610Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 3EnvironmentalOpen in IMG/M
3300005712Warmed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WRM 4EnvironmentalOpen in IMG/M
3300006032Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145EnvironmentalOpen in IMG/M
3300006034Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105EnvironmentalOpen in IMG/M
3300006046Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_101EnvironmentalOpen in IMG/M
3300006052Freshwater sediment microbial communities from North America - Little Laurel Run_MetaG_LLR_2013EnvironmentalOpen in IMG/M
3300006175Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-1 metaGEnvironmentalOpen in IMG/M
3300006796Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114EnvironmentalOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300006800Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109EnvironmentalOpen in IMG/M
3300007258Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3EnvironmentalOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300007788Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_2EnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009038Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaGEnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300009143Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_2EnvironmentalOpen in IMG/M
3300010335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09082015EnvironmentalOpen in IMG/M
3300011269Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaGEnvironmentalOpen in IMG/M
3300011270Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4B metaGEnvironmentalOpen in IMG/M
3300011271Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaGEnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012208Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012349Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Sage2_R_115_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012582Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_20_16 metaGEnvironmentalOpen in IMG/M
3300012683Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz2.16 metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012917Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012923Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaGEnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012930Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012972Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300014150Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300015051Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300015241Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300020021Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? H2c1EnvironmentalOpen in IMG/M
3300020581Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-MEnvironmentalOpen in IMG/M
3300020583Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-MEnvironmentalOpen in IMG/M
3300021086Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300021168Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-30-MEnvironmentalOpen in IMG/M
3300021170Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-MEnvironmentalOpen in IMG/M
3300021181Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-OEnvironmentalOpen in IMG/M
3300021407Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-OEnvironmentalOpen in IMG/M
3300021420Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-MEnvironmentalOpen in IMG/M
3300021478Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-MEnvironmentalOpen in IMG/M
3300021479Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-MEnvironmentalOpen in IMG/M
3300021559Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-MEnvironmentalOpen in IMG/M
3300022532Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-4-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300024330Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300026331Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_137 (SPAdes)EnvironmentalOpen in IMG/M
3300026334Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 (SPAdes)EnvironmentalOpen in IMG/M
3300026371Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NR-01-BEnvironmentalOpen in IMG/M
3300026497Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - CO-08-BEnvironmentalOpen in IMG/M
3300026514Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DL-13-BEnvironmentalOpen in IMG/M
3300026515Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-03-AEnvironmentalOpen in IMG/M
3300026528Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_156 (SPAdes)EnvironmentalOpen in IMG/M
3300026530Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154 (SPAdes)EnvironmentalOpen in IMG/M
3300026537Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 (SPAdes)EnvironmentalOpen in IMG/M
3300026555Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300026557Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungalEnvironmentalOpen in IMG/M
3300027587Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM3_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027643Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 (SPAdes)EnvironmentalOpen in IMG/M
3300027671Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 (SPAdes)EnvironmentalOpen in IMG/M
3300027674Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300027678Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM1_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027846Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027862Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027869Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen03_05102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300028047Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300028906Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5 (v2)EnvironmentalOpen in IMG/M
3300030743Forest Soil Metatranscriptomes Boreal Montmorency Forest, Quebec, Canada VCO Co-assemblyEnvironmentalOpen in IMG/M
3300030844Forest soil microbial communities from France for metatranscriptomics studies - Site 11 - Champenoux / Amance forest - FA11 EcM (Eukaryote Community Metatranscriptome)EnvironmentalOpen in IMG/M
3300030935Forest soil microbial communities from France for metatranscriptomics studies - Site 11 - Champenoux / Amance forest - OA7 SO (Eukaryote Community Metatranscriptome)EnvironmentalOpen in IMG/M
3300030991Metatranscriptome of forest soil microbial communities from Montana, USA - Site 5 -Soil GP-1A (Eukaryote Community Metatranscriptome)EnvironmentalOpen in IMG/M
3300031708FICUS49499 Metagenome Czech Republic combined assemblyEnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031753Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_515EnvironmentalOpen in IMG/M
3300031754Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515EnvironmentalOpen in IMG/M
3300031820Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_515EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI1027J11758_1166598013300000789SoilMLLGLAAVAMLLGAVSATLYGLGHGRRQGQSAAHGMVTPLASANLVFDGEPACLKVARENGDPDIGASTFLLEAPSGCVVPAHYHTAEEQLMVVRGD
JGI12648J13191_103127813300001084Forest SoilMKKMNGMSGRTAKSLFGVLALMALLVAVAASSRGQEQISKRGIVTPLASANLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVPAHYHTAEEQLMVVQGDVLTGMDGMAEATLGPGGFAVMPS
JGI25613J43889_1004121023300002907Grasslands SoilMVLGLAPLVILLVAAGAASRGQGQTSPHGIVTPLAGAXLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVPAHYHTAEEQLMVVR
JGI25382J43887_1009832413300002908Grasslands SoilMKKMNAISRRTTKILLGFAPLVMLLVAVGASSRGQGQISPRGIVTPLASANLVFDGEPACLKVARENGDPDKGPSTFLLEAPSGCVVPAHYHTAEEQLMVVRGDV
JGI25617J43924_1003680113300002914Grasslands SoilMNEMNAISGRTAKMLLGFAPLVILLAAVGPSLRGQGQVSPRGIVTPLASATLVFDGEPACLKVSRENGDPDKGPSTFLLEAPSGCVVPAHYHTAEE
Ga0066674_1035915513300005166SoilMEKMNGISGRTATSLLGVVALMALLAAVAASSRGQEQISKRGIVTPLASANLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVPAHYHTAEEQLMVV
Ga0066672_1089381923300005167SoilMKKTNAISGRTTKMLLGFAPLVILLVAVGASSRGQGQISPRGIVTPLSSAKLVFDGEPVCLKVARENGDPDKGASTFLLEAPSGCVVPAHYHSAEEQLMVVRGDVLTGMDGMAEATLGPGGF
Ga0066688_1078906523300005178SoilMKQRNVISGRTTKMLLGFAALMLLLAAFGASSRGQGQTSPHGVVTPLASANLVFDGEPACLKVARENGDPDKGPSTFLLEAPSGCVVPPHYHTAEEQLIVVRGDVLTGMEGMAEATLGPGGFAM
Ga0066685_10002073103300005180SoilMKKMNATSGTTTKMLLSFAPLVILFVAVSASSQGQGQISPHGMVTPLASANLVFDGEPACLKVARENGDPDKGASAFLLEAPSGCVVPAHYHTAEEQLMVVRGDVLTGMDGMAEATL
Ga0066678_1108866913300005181SoilMLLSFVPLMILLVAVSASSRGQGQTSPHGIVTPLASANLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVPAHYHTAEEQLMVVRGDVLTGMDGMAEATLGPGGF
Ga0066675_1116160523300005187SoilMKKMNATSGTTTKMLLSFVPLMILLVALSASSRGQGQTSPHGIVTPLASANLVFDGEPACLKVARENGDPDKRASTFLLEAPSGCVVPAHYHTAEEQLM
Ga0066686_1053816923300005446SoilMKKMNAISGRTKKMLLGFAPLVILLAAVGASSRGQGQISPRGIVTPLASANLVFDGEPACLKVARENGDPDKRASTFLLEAPSGCVVPAHYHTAEEQ
Ga0066661_1079765713300005554SoilMKKMNATSGTTTKILLGFAPLVILLVGVRASSQGQGQGQTHGVVTPLASANLVFDGEPACLKVARENGDPDKGPSTFLLEAPSGCVVPAHYHTAEEQLMVVRGDVLTGMDGMAEATL
Ga0066661_1079765913300005554SoilMKKMNVISGRTTKMLLSFVPLMILLVAVSASSRGQGQTSPHGIVTPLASANLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVPAHYHTAEEQLMVVRGDVLTGMDGMAEATL
Ga0066707_1041279523300005556SoilMKKMNATSGTTTKMLLSFAPLVILFVAVSASSQGQGQISPHGMVTPLASANLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVPAHYHTAEEQLMVVRGDVLTGMDGMAEATLGPGGFA
Ga0066698_1020444523300005558SoilMKKMNAISGRTKKMLLGFAPLVILLAAVGASSRGQGQISPRGIVTPLASANLVFDGEPACLKVARENGDPDKRASTFLLEAPSGCVVPAHYHTAEEQLMVVRGDVLTGMDGMAEATL
Ga0066691_1085153613300005586SoilMKKMSTISGRTTKMVLGFAPLVMLLVAVGASSQGQGQISPHGIVTPLASAHLVFDGEPACLKVARENGDPDKGPSTFLLEAPSGCVVPAHYHTAEEQLMVVRG
Ga0070761_1007110653300005591SoilMKKMNGMSGRTAKSLFGVLALMALLVAVAASSRGQEQISKRGIVTPLASANLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVPAHYHTA
Ga0070761_1008269813300005591SoilMKKMNDISRRTTKSLLGVVALMVLLVAVAASSRGQEQISKHGIVTPLASANLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVPAHYHTA
Ga0070762_1010051723300005602SoilMEKMNSISGRTAKPVLGVVALMALFVAVSSPGQEEIPKRGIVTPLASANLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVPAHYHTAEEQLMVVQGDVLAGMDSMAEATLGPGGFARMPSKA
Ga0070762_1038415813300005602SoilMICAQSEFSSLFFVEADMKKMNGISGRTAKSLLGVVALMALLVAVAASSRGQEQISKRGIVTPLANANLVFDGEPACLKVARENGDPDKGTSTFLLEAPSGCVVPAHYHTAEEQLMVVQGDVL
Ga0070763_1068076413300005610SoilMKKVMKKLNAISGRATKMLLGFAPLVILLAAVGASSRGQAQTSPHGVVTPLASANLVFDGEPACLKVARENGDPDKGPSAFLLQAPSGCVVPAHYH
Ga0070764_1049555223300005712SoilMKKMSGISGRTTKSLLGVVALMVLLVAVTASSRGQEQSSKRGMVTPLASANLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVPAHYHTAEEQLMVVQGDVLAGMDSMAEATLGPGGFARMPSKA
Ga0066696_1058428413300006032SoilMKKMNLISGRTTKMLLSFVPLMILLVAVSASSRGQGQTSPHGIVTPLASANLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVPAHYHTAEEQLMVVRGDVLTGMDGMAEAT
Ga0066656_1012211023300006034SoilMKMRNGISARAAKMLLGVAALAMLLGAVGAGLHGRGHGQGQTAAHGVVTPLASANLVFDGEPTCLKVARENGDPDIGASTFLLEAPSGCVVPAHYHTAEEQL
Ga0066652_10180286913300006046SoilMKKMNGISGRTAKSLLGVVALMALLVAVAASSRGQEQISKRGIVTPLASANLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVP
Ga0075029_10050460813300006052WatershedsMEKTNAISRRAAKLLLGFVPLVILLVAVGASSRAQGQNSPHGTVTPLASANLVFDGQPACLKVARENGDPDLGPSTFLLEAPSGCVVP
Ga0070712_10109993913300006175Corn, Switchgrass And Miscanthus RhizosphereMKKMNGISGRTAKSLLGVVALMALLVAVAASSRGQEQISKRGIVTPLASANLVFDGEPTCLKVARENGDPDKGASTFLLEAPSGCVV
Ga0066665_1143848823300006796SoilMKKMNVISGRTTKMLLSFVPLMILLVAVSASSRGQGQTSPHGIVTPLASANLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVPAHYHTAEEQLMVVRGDVLTGMD
Ga0066659_1011759523300006797SoilMKRMNATSGTTTKMLLSFVPLMILLVALSASSRGQGQTSPHGIVTPLASANLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVPAHYHTAEEQLMVVRGDVLTGMDGMAEATLGPGGFA
Ga0066660_1048775023300006800SoilMKQRNVISGRTTKMLLGFAALMLLLAAFGASSRGQGQTSPHGVVTPLASANLVFDGEPACLKVARENGDPDKGPSTFLLEAPSGCVVPPHYHTTEEQLIVVRGDVLTGMEGMAEATLGPG
Ga0099793_1008738913300007258Vadose Zone SoilMKQIKGVSGRTTRIVLGLAPLVILLVAVGAASRGQGQTSPHGIVTPLAGANLVFDGEPACLKVARENGNPDKGASTFLLEAPSGCVVPAHYHTAEEQLMVVRGDVLTGMD
Ga0099794_1011381713300007265Vadose Zone SoilMKMMNAISARTTKMLLGLMALAILAASVGATLHGPGHGQGQGQTAAHGIVTPLASASLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVPAHYHT
Ga0099794_1027383913300007265Vadose Zone SoilMKMMNAISSRTTKMLLGIAALAILIAAVGATLHGRGQAHGVVTPLASANLVFDGEPACLKVARENGDPDIGASTFLLEAPSGCVVPAHHHTAEEQLMVVRGDVLTGMDGMPETTMGPG
Ga0099794_1051332013300007265Vadose Zone SoilMNKMNAISGRTAKMLLGFAPLVILLAAVGPSLRGQTSPHGIVTPLSSATLVFDGEPACLKVARENGDPDKGPSTFLLEAPSGCVVPAHYHTAEEQLMV
Ga0099794_1072711023300007265Vadose Zone SoilMKKMNAISGRTTKMLLGFAPLVILLAAVGASSRGQGQISPRGIVTPLASANLVFDGEPTCLKVARENGDPDKGASTFLLEAPSGCVVPAH
Ga0099795_1015974913300007788Vadose Zone SoilMKKMQAISGQTTKMLVHFTPLAILLVAVCSSLRGQDQISARGIVTPLASAGLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVPTHYHTAEEQLMV
Ga0066710_10157360423300009012Grasslands SoilMKKMNATSGTTTKMLLSFAPLVILFVAVSASSQGQGQISPHGMVTPLASANLVFDGEPACLKVARENGDPDKGASAFLLEAPSGCVVPAHYHTAEEQLMVVRGDVLTGMDGMAEAT
Ga0066710_10358523523300009012Grasslands SoilMMNAISRRTTKMLLGLLALAVLVASVGATFRGPGHGQGQGQTAAHGIVTPLAGANLVFDGEPACLKVARENGDPDKGASTFLL
Ga0066710_10397670423300009012Grasslands SoilMKKMNVISGRTTKMLLSFVPLMILLVAVSASSRGQGQTSPHGIVTPLASANLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVPAHYHTAEEQLMVVRGDVLTGMDGMAEAT
Ga0099829_1099242713300009038Vadose Zone SoilMKKMNAISGKTTKMLLGFTALVILLAAVGPSLRGLGQTSPHGIVTPLSSATLIFDGEPACLKVARENGDPDKGPSTFLLEAPTGCVVPAHYHTEEEQVMVVQCDVLTGMDGMAEATLGPGGF
Ga0099830_1147297823300009088Vadose Zone SoilMKKINVISGRTTKMLLGLAPLAILLAAVVAASSQGQGQTPPHGVVTPLAGANLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVPAH*
Ga0099828_1078635323300009089Vadose Zone SoilMKKMNAISGRTTKMLLGFAPLVILLVAVGATSRGQAQISPRGIVTPLASANLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVPAHYHTAEEQLMVV
Ga0099792_1081133813300009143Vadose Zone SoilMNKMNAISGRTAKMLLGFAPLVILLAAVGPSLRGQGQTSPHGIVTPLASATLVFDGEPACLKVARENGDPDKGPSTFLLEAPTGCVVPAHYHTAEEQLMVARGDVLTGMDGMAEATL
Ga0134063_1051141423300010335Grasslands SoilMKKMNVISGRTTKMLLSFVPLMILLVAVSASSRGQGQTSPHGIVTPLASVNLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVPAHYHT
Ga0137392_1000684893300011269Vadose Zone SoilMKKMNTISRRTTKMLLGFALLVILLVAASSRGRGQVSPHGTVTPLASANLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVPAHYHTAEEQLMVVRGDVLTGMDGMAEATLG
Ga0137392_1130157913300011269Vadose Zone SoilMKMMNAISARTTKMLLGLMALAILAASVGATLHGPGHGQGQGQTAAHGIVTPLASASLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVPAHYHTAEEQLMVVRG
Ga0137391_1079255613300011270Vadose Zone SoilMKKMNTISRRTTKMLLGFALLVILLVAASSRGRGQVSPHGTVTPLASANLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVPAHYHTAEEQLMVV
Ga0137393_10000998183300011271Vadose Zone SoilMKMMNAISARTTKMLLGLAALAMLLGAIGVSLQGQTTLHGVLTPLASANLVFDGEPACLKVARENGDPDIGASTFLLEAPSGCVVPAH
Ga0137393_1081439923300011271Vadose Zone SoilMKRINGVSGRTTRIVLGLAPLVILVVAVGAASRGQGRTSPHGIVTPLASANLVFDGEPACLRVARENGDPDNGASTFLLEAPSGCVVPAHYHTAEEQLMVVRGDVLTGMDGMAEATLGPG
Ga0137389_1055511813300012096Vadose Zone SoilMKMMNAISARTTKMLLGLMALAILAASVGATLHGPGHGQGQGQTAAHGIVTPLASASLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVPAHYHTAEEQLMVVRGDVLTGM
Ga0137389_1067544423300012096Vadose Zone SoilMKMRNAISARTTKMLLGVAALVMLPGAIGVSLRGQTAAHGVVTPLASANLVFDGEPACLKVARENGDPDIGASTFLLEAPSGCVVPAHYHTA
Ga0137389_1106865913300012096Vadose Zone SoilMKRINGVSGRTTRIVLGLAPLVILVVAVGAASRGQGRTSPHGIVTPLASANLVFDGEPACLRVARENGDPDNGASTFLLEAPSGCVVPAHYHTAEEQLMVVRGDV
Ga0137363_1071574313300012202Vadose Zone SoilMKKMNAISGRTTKMLLGFAPLVILLAAVGASSRGQGQISPRGIVTPLASANLVFDGEPTCLKVARENGDPDKGPSTFLLQAPSGCVVPAHYHTAEEQLM
Ga0137363_1089639513300012202Vadose Zone SoilMKRINGVSGRTTRIVLGLAPLVILLVAVGAASRGQGQGKTSPHGIVTPLASANLVFDGEPACLRVARENGDPDNGASTFLLEAPSGCVVPAHYHTAEEQLMVVRGDVLTGM
Ga0137363_1115337013300012202Vadose Zone SoilMKMRNAISAKTAKMLLGVAALAMLLGALGATLQGRGQGQTAAHGVVTPLASANLVFDGEPACLKVARENGDPDIGASTFLLEAPSG
Ga0137399_1050675613300012203Vadose Zone SoilMLLGLRALAMLLGAMGATLYGRGHGRGQAQSAARGVVTPLASANLVFDGEPACLKVARENGDPDIGASTFLLEAPSGCVVPAHYHTAEEQLMVVR
Ga0137399_1104011923300012203Vadose Zone SoilMKKMNAISGRATKMLLGLAPLVILLAAVGVSSQGPGQVSPRGIVTPLASATLVFDGEPACLKVARENGDPDKGPSTFLLEAPSGCVVPAHYHTAEEQLMVVRGDVLTGMDGMAEATLGPG
Ga0137399_1110251823300012203Vadose Zone SoilMKKMNTISRRTTKMLLGFALLVILLVAASSRGQGQVSPHGIVTPLASANLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVPAHYHTAEEQLMVVRGDVLT
Ga0137362_1013802713300012205Vadose Zone SoilMKRINGVSGRTTRIVLGLAPLVILLVAVGAASRGQGQGKTSPHGIVTPLASANLVFDGEPACLRVARENGDPDNGASTFLLEAPSGCVVPAHYHTAEEQLMV
Ga0137376_1027950413300012208Vadose Zone SoilMKKMNTISGRTTKMVLGFAPLVMLLVAVGASSQGQGQISPHGIVTPLASANLVFDGEPACLKVARENGDPDKGPSTFLLEAPSGCVVPAHYHTAEEQLMVVQGDVLTGMDGMAEAT
Ga0137387_1020265313300012349Vadose Zone SoilMKKMNAISRRTTKMLLGFAPLVMLLVAVGASSRGQGQISPRGIVTPLSSAKLVFDGEPACLKVVRENGDPDKGASTFLLEAPSGCVVPAHYHSAEEQ
Ga0137360_1071321313300012361Vadose Zone SoilMKKMNAISGRTTKMLLAFAPLVILLVAVSTSSREQGQISPHGTVTPLASANLVFDGEPACLKVARENGDPDKGPSTFLLEAPSGCVVPAHYHTAEEQLMVVRGDVLTGMDGMPEATLGPGGF
Ga0137360_1074462123300012361Vadose Zone SoilMKKMNAISGRTTKMLLGFVPLVILLAAVGASSQGQVSPRGIVTPLASATLVFDGEPACLKVARENGDPDKGPSTFLLEAPSGCVVPAHYHTAEEQLMVVRGDVLT
Ga0137360_1125609123300012361Vadose Zone SoilMKMRNAISAKTAKMLLGVAALAMLLGALGATLQGRGQGQTAAHGVVTPLASANLVFDGEPACLKVARENGDPDIGASTFLLEAPSGCVVPAHYHTAEE
Ga0137361_1083179413300012362Vadose Zone SoilMKKMNGISGRTAKPLLGVVALMALLVAVAASSRGQEQISKRGKVTPLAGANLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVPAHYHTAEEQLMVVQGDVLTGMDGMAEATLGPGGF
Ga0137358_1044359223300012582Vadose Zone SoilMLLGLTAVAMLLGAMGATLYGRGHGRGQAQSAAHGVVTPLASANLVFDGEPACLKVARENGDPDIGASTFLLEAPSGCVVPAHYHTAEEQLMVVRGDVLTGMDGMPE
Ga0137358_1063833813300012582Vadose Zone SoilMKMKMRNGISARTAKMLLGVAALAMLLGAVGAGLHGRGHGQGQTAAHGVVTPLASANLVFDGEPACLKVARENGDPDIGASTFLLEAPSGCVVP
Ga0137398_1019795113300012683Vadose Zone SoilMKKMNAISGRTTKMLLGFAPLVILLAAVGPSLRGQGQPSPHGIVTPLASATLVFDGEPACLKVARENGDPDKGPSTFLLEAPSGCVVPAHYHTAEEQLMVVQGDV
Ga0137397_1004351853300012685Vadose Zone SoilMKMMNAISARMTKTLLGLAALAILLAAVGATLQGRGHGQGQGQTAAHGVVTPFASANLVFDGEPACLKVARENGDPDIGASTFLLEAPSGCVVPAHYHTAEEQL
Ga0137397_1131656823300012685Vadose Zone SoilMKKMNGVSRRTTRIVLGLAPLVILLVAAGAASRGQGQTSPHGIVTPLAGANLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVPAHYHTAEEQLMVVRGDVLTGMDGMPEQT
Ga0137395_1082652523300012917Vadose Zone SoilMKKMNGISGRTTKSLLGVVALMALFVAVAASSRGQEQISKRGIVTPLASANLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVPAHYHTAEE
Ga0137396_1015781413300012918Vadose Zone SoilMEKMNAISGRTTKMLLGFVPLVILLAAVGASSRGQGQISPRGIVTPLASANLVFDGEPACLKVARENGDPDKGPSTFLLQAPSGCVVPAHYHTAEEQLMVVRGDVLTGMDSMAEATL
Ga0137396_1015943923300012918Vadose Zone SoilMLLGLGALAMMVGAVGATLHGHGHGQGQGQTAAHGTVTPLASANLVFDGEPACLKVARENGDPDIGASTFLLEAPSGCVVPAHDHTAEEQLMVVRGDVLTGMDG
Ga0137396_1119163213300012918Vadose Zone SoilMKKRNAISGRTTKMLLGSAPLVILLAAVGARLHGHTHGLGYGQGQTAPHGVVTPLASANLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVPAHYHTAEEQLMVVRGDVLTGM
Ga0137394_1002695653300012922Vadose Zone SoilMLLGLGALAMMVGAVGATLHGHGHGQGQGQTAAHGTVTPLASANLVFDGEPACLKVARENGDPDIGASTFLLEAPSGCVVPAHYHTA
Ga0137394_1118385913300012922Vadose Zone SoilMLLGLRALAMLLGAMGATLYGRGHGRGQAQSAARGVVTPLASANLVFDGEPACLKVARENGDPDIGASTFLLEAPSGCVVPAHYHTA
Ga0137359_1138918513300012923Vadose Zone SoilMKKMNAISGRTTKMLLAFAPLVILLVAVSTSSREQGQISPHGTVTPLASANLVFDGEPACLKVARENGDPDKGPSTFLLEAPSGCVVPAHYHTAEEQLMVVRGDVLTG
Ga0137419_1040936313300012925Vadose Zone SoilMKKMNGISGRTAKPLLGVVALMALLVGVAASSRGQEQISKRGIVTPLAGANLVFDGEPACLKVARENGDPDKGASTFLLEAPSGC
Ga0137419_1099352923300012925Vadose Zone SoilMKKINVISGRTTKILLGLAPLAILLAAVVAASSQRQGQIPPHGVVTPLAGANLVFDGEPACLKVARENGNPDKDASTFLLEAPSGCVVPAHYHTAEEQLMVVRGDVLTGMDGMA
Ga0137419_1101059623300012925Vadose Zone SoilMKKMNAISGRTRRIVLGFAPLVILLVVGAASLQGQGQNSPHGIVTPLASANLVFDGEPACLKVARENGDPDTGASTFLLEAPSGCVVPAHYHTAEEQLMVVRGDVLT
Ga0137416_1182871523300012927Vadose Zone SoilMKKINVISGRTTKILLGLAPLAILLAAVVAASSQRQGQIPPHGVVTPLAGANLVFDGEPACLKVARENGNPDKSASTFLLEAPSGCVVPAHYHTAEEQLMVVRGDVL
Ga0137404_1078297723300012929Vadose Zone SoilMKKMNAISGRTTKMFLGFVPLVILLVAVGASSRGQGQIFPHGIVTPLTNADLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVPAHYHTAEEQLMVVRGDVL
Ga0137404_1146523023300012929Vadose Zone SoilMLLGFAALAMLFGAIGVSLQGQTASHGIVTPLASANLVFDGEPACLKVARENGDPDIGASTFLLEAPSGCVVPAHYHTAEEQ
Ga0137407_1065104023300012930Vadose Zone SoilMLLGFAALAMLFGAIGVSLQGQTASHGIVTPLASANLVFDGEPACLKVARENGDPDIGASTFLLEAPSGCVVPAHYHT
Ga0137410_1010248213300012944Vadose Zone SoilMMNAISARMTKTLLGLAALAILLAAVGATLQGRGHGQGQGQTAAHGVVTPFASANLVFDGEPACLKVARENGDPDIGASTFLLEAPSGCVVPAHY
Ga0134077_1006543123300012972Grasslands SoilMKQRNVISGRTTKMLLGFAALMLLLAAFGASSRGQGQTSPHGVVTPLASANLVFDGEPACLKVARENGDPDKRASTFLLEAPSGCVVPAHYHTAEEQLMVVRG
Ga0134077_1042881913300012972Grasslands SoilMKKMNAISGRTKKMLLGFAPLVILLAAVGASSRGQGQISPRGIVTPLASANLVFDGEPACLKVARENGDPDKRASTFLLEAPSGCVVPAHYHTAEEQLMVVRG
Ga0134081_1001672913300014150Grasslands SoilMKQRNVISGRTTKMLLGFAALMLLLAAFGASSRGQGQTSPHGVVTPLASANLVFDGEPACLKVARENGDPDKGPSTFLLEAPSGCVVPPHYHTAEEQLIVVRGDVLT
Ga0137414_117276823300015051Vadose Zone SoilMKKMNAISGRTRRIVLGFAPLVILLVVGAASLQGQGQNSPHGIVTPLASANLVFDGEPACLKVARENGDPDTGASTFLLEAPSGCVVPAHYHTAEEQLMVVRGDVLTGMDGMAEATLGPAALP*
Ga0137418_1125987613300015241Vadose Zone SoilMLLGVAAPAILLAAVGVSLQGQTAAHGVVTPLASANLVFDGEPACLKVARENGDPDIGASTFLLEAPSGCVVPAHYHTAEEQLMVVR
Ga0066662_1068837413300018468Grasslands SoilMKKMSTISGRTTKMVLGFAPLVMLLVAVGASSQGQEQISPHGIVTPLASANLVFDGEPACLKVARENGDPDKGPSTFLLEAPSGCVVPAHYHTAEEQLMVVRG
Ga0066662_1225246523300018468Grasslands SoilMKKMNAISGRTTKMLLGFVPLVILLVAVGPSLRGQGQPSPRGIVTPLASATLVFDGEPACLKVARENGDPDKGPSTFLLEAPSGCVVPAHYHTAEEQLMVVRGDVLAGMDGMPEATPGPGGLGMNPQKALHGVHWYI
Ga0193726_107312123300020021SoilMKKKNGISGRTTKSLFGVVALTALLVAESSRGQEQISKRGIVTPLASATLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVPAHYHTAE
Ga0210399_1013371733300020581SoilMKKLNAISGRTLKMLLGFVPMLILFAAVGASSRGQGQNSPHGIVTPLASANLVFDGEPACLKVARENGDPDNGPSTFLLEAPSGCVVPAHYHTAEEQLMVVRGDVLTGMDGMVEATLGP
Ga0210399_1135792323300020581SoilMLVAGVASLRGQGQNSSRGRVIPLASANLVFDGEPACLKVARENGDPDTGASTFLLEAPSGCVVPAHYHTAEEQLMVV
Ga0210401_1027131313300020583SoilMTEFSSLLFVEADMKKMNGISGRTAKSLFGVAASMALLVAVTASSRGQEQISKRGIVTPLASANLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVPAHYHTAEEQLMVVQGDVLTGMDGMA
Ga0179596_1039720923300021086Vadose Zone SoilMKKMNAISGRTTKMLLGFAPLVILLAAVGASSRGQGQISPRGIVTPLASANLVFDGEPTCLKVARENGDPDKGPSTFLLQAPSGCVVPAHY
Ga0210406_1044477323300021168SoilMKKMNGISGGTVKSVLGVVALIVLLVAVAASSRGQEQISKHGIVTPLASANLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVPAHYHTAEEQLMVVRGDVLTGMDGMAEATL
Ga0210406_1056319713300021168SoilMLVAGVASLRGQGQNSSRGRVAPLASANLVFDGEPACLKVARENGDPDTGASTFLLEAPSGCVVPAHYHTAEEQLMV
Ga0210406_1057617813300021168SoilMALFVVVAALSRGQEQISKRGIATPLASANLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVIPA
Ga0210400_1122983413300021170SoilMKKMNAISGRTTKMLLGFAPLVTLLVAVGASSRGQGQISPRGIVTPLAGANLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVPAHKHTAEEQLMVVRGDVLT
Ga0210388_1048293513300021181SoilMKKMNDISGRTAKLLHGVVVLMVLLVAVVVSSRGQEQISKHGMVTPLASANLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVPAHYHTAEEQLM
Ga0210383_1073008113300021407SoilVKKINGSSGRTTKPLLGVALMALFVVVAASSRGQEQISKRGVVTPLAGVNLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVIPAHYHTAEEQLMVIQGDVLTGMDGM
Ga0210394_1133631013300021420SoilMEKMNGISGRTAKSLLGLVGLMALLVAVAASSRGQEQISKRGIVTPLASANLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVPA
Ga0210402_1004322113300021478SoilMKKMSGISGRTAKSLLGVVALMALLVAVAASSRGQEQISKRGIVTPLASANLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVPAHYHTAEEQLMVVRGDVLTGMDGMAEAT
Ga0210410_1020911313300021479SoilMKKVNGISGRTTRIVLGLAPLVILLVAVGAASRGQGQGQTSPHGTVTPLANASLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVPAHYHTAEEQLMVVR
Ga0210409_1038823613300021559SoilMEKMNGISGRTAKSLLGLVGLMALLVAVAASSRGQEQISKRGIVTPLASANLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVPGHYHTAEEQ
Ga0242655_1000865313300022532SoilMEKMNGISGRTAKSLLGLVGLMALLVAVAASSRGQEQISKRGIVTPLASANLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVPGHYHTAEEQLMVVQGDVLTGMDGMAEAKL
Ga0242655_1010987023300022532SoilMAKSLLGVVALMVLLVAVAASSRGQEQSSKRGMVTPLASANLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVPGHY
Ga0137417_116507823300024330Vadose Zone SoilMKKMNGVSPRTTRIVLGLAPLVILLVAAGAASRGQGQTSPHGIVTPLAGANLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVPAHYHTAE
Ga0209267_129754223300026331SoilMKQRNVISGRTTKMLLGFAALMLLLAAFGASSRGQGQTSPHGVVTPLASANLVFDGEPACLKVARENGDPDKGPSTFLLEAPSGCVVPPHYHTAEEQLIVVRGD
Ga0209377_118208223300026334SoilMKKMNAISRRTTKILLGFAPLVILLVAVDASSRGQGQISPHGIVTPLASANLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVPAHYHTAEEQLMV
Ga0257179_105263223300026371SoilMKQIKGVSGRTTRIVLGLAPLVILLVAVGAASRGQGQTSPHGIVTPLAGANLVFDGEPACLKVARENGNPDKGASTFLLEAPSGCVVPAHYHTAEEQLMVVRGDVLTGMDGMAE
Ga0257164_103299323300026497SoilMKKMNAISGRTMKMFLGFAPLMILLVAAGTSSRGQGQISPRGIVTPLASANLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVPAHYHSAEEQLMV
Ga0257168_110581713300026514SoilMKKMHAISGRTKKMTLGFAPLLILLVAVGASSRGQGQISPRGIVTPLASANLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVPAHYHTAEEQLMVVRGDVLTGMDGMAEATLG
Ga0257158_109129213300026515SoilMNKMNVTSGRTMRIVLGLAPLVILLVVGAASLQGQGQNSTHGIVTPLASAHLVFDGEPACLKVARENGDPDTGASTFLLEAPSGCVVPAHYHTAEEQLMVVRGDVLTGMDGMAEATLGPGGFALM
Ga0209378_106058413300026528SoilMKKMNVISGRTTKMLLSFVPLMILLVAVSASSRGQGQTSPHGIVTPLASANLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVPAHYHTAEEQLMVVRGDVLTGMDGMAEATLGPGGFAM
Ga0209807_112092713300026530SoilMKKMNATSGTTTKILLGFAPLVILLVGVRASSQGQGQGQTHGVVTPLASANLVFDGEPACLKVARENGDPDKGPSTFLLEAPSGCVVPAHYHTAEEQLMVVRGDVLTGMDGMAEATLG
Ga0209157_132188923300026537SoilMKKMNATSGTTTKMLLSFAPLVILFVAVSASSQGQGQISPHGMVTPLASANLVFDGEPACLKVARENGDPDKGASAFLLEAPSGCVVP
Ga0209157_134470323300026537SoilMKKMNAISGRTKKMLLGFAPLVILLAAVGASSRGQGQISPRGIVTPLASANLVFDGEPACLKVARENGDPDKRASTFLLEAPSGCVVPAHYHTA
Ga0179593_118902353300026555Vadose Zone SoilMVLGLMPLVLLLVAAGAASRGQGQTSPHGIVTTPLAGANLVFDGEPACLKVARENGDPDKGASTFLLEAPSRCVVPAHYHTAQEQLMVVRGDVLTGMDGMPEQTLGPGGFAM
Ga0179587_1009846913300026557Vadose Zone SoilMKKMNTISRRTTKMLLSFALLVVLLVAASSRGRGQVSPHGIVTPLASAHLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVPAHYHTAEEQ
Ga0179587_1045820813300026557Vadose Zone SoilMVLGLMPLVLLLVAAGAASRGQGQTSPHGIVTPLAGANLVFDGEPACLKVARENGDPDKGASTFLLEAPSRCVVPAHYHTAQEQLMV
Ga0209220_102589323300027587Forest SoilMFLCFAPFLIVLAAVASPLEQTPDHGVVTPLASAKLASDGEPACLKSALENGDPDTGPSTFLLEAVPGCVVPAHYHTAEEQLIVVRGDVLTGMDGMSEKTLGPG
Ga0209076_102331523300027643Vadose Zone SoilMKRINGVSGRTTRIVLGLAPLVILLVAVGAASRGQGQGKTSPHGIVTPLASANLVFDGEPACLRVARENGDPDNGASTFLLEAPSGCVVPAHYHTAEEQLMVVRGDVLTGMDGMP
Ga0209076_107989123300027643Vadose Zone SoilMKMMNAISSRTTKMLLGIAALAILIAAVGATLHGRGQAHGVVTPLASANLVFDGEPACLKVARENGDPDIGASTFLLEAPSGCVVPAH
Ga0209076_115223813300027643Vadose Zone SoilMKQIKGVSGRTTRIVLGLAPLVILLVAVGAASRGQGQTSPHRIVTPLAGANLVFDGEPACLKVARENGDPDKGPSTFLLEAPSGCVVPAHYHTAEEQLMVVRGDVLTGMDGMAEA
Ga0209588_104608613300027671Vadose Zone SoilMNKMNAISGRTAKMLLGFAPLVILLAAVGPSLRGQTSPHGIVTPLSSATLVFDGEPACLKVARENGDPDKGPSTFLLEAPSGCVVPAHYHTAEEQLMVARGDVLT
Ga0209588_110532213300027671Vadose Zone SoilMKKMNAISGRTTKMLLGFVPLVILLAAVGASSRGQGQISPRGIVTPLASANLVFDGEPACLKVARENGDPDKGSSTFLLQAPSGCVVPAHY
Ga0209588_112660813300027671Vadose Zone SoilMKKMNAISARTTKMLLGFAPLVILLVAVSASSRGQGQISPRGTVTPLASANLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVPAHYHTAEEQLMVVRGDVLTGMDGMAEATLGPG
Ga0209588_116890413300027671Vadose Zone SoilMKKMNAISARTTKMLLGFAPLVILLVAVGASSRGQGQISPHGIVTPLASANLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCV
Ga0209118_104641813300027674Forest SoilMKMMNAISARTTKMLLGLAALMILLGAIGVSLQGQTAAHGVVTPLASANLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVPPHYHTAEEQLMVVRGDVLTGMDGMA
Ga0209011_117181213300027678Forest SoilMRNAISARTTKMLVGLAALALLLGGVGATLYGRGHGGGQGQSAAHGVVTPLASANLVFDGEPECLKVARENGDPDTGASTFLLEAPSGCVVPAHYHTAEEQLMVVRGDVLTGMDGMPETTMGP
Ga0209180_1014837423300027846Vadose Zone SoilMKKMNAISGKTTKMLLGFAPLVILLAAVGPSLRGQGQTSPHGIVTPLASATLIFDGEPACLKVARENGDPDKGPSTFLLEAPTGCVV
Ga0209701_10005256113300027862Vadose Zone SoilMKMMNAISARTTKMLLGLAALAMLLGAIGVSLQGQTTLHGVLTPLASANLVFDGEPACLKVARENGDPDIGASTFLLEAPSGCVVPAHYHTAEEELMVVRGDVLTG
Ga0209579_1017136423300027869Surface SoilMKLVETNKNRKQATISFGIGLKRILCFTSLTIVLVAVAISSAPQNTHSGVVTPLASAKLVFDGEPECLKVARENGDPDTGPSAFLLEAPSGCVVPAHSHTAEEQMMVVRGDVLTGM
Ga0209526_1071435613300028047Forest SoilMKTMSVIRREITGVFLCTAPLIIVLAAVRSPEGQGPHHGVLTPLASAKLASDGQPACLKSALENGDPQTGPSTFLLEAAPGCVVPAHSHTAEEQLTVIRGDVLTGMD
Ga0137415_1017189813300028536Vadose Zone SoilMKKMNAISGRTTKMLLGFVPLAILLAAVGASSRGQGQISPRGIVTPLASANLVFDGEPACLKVARENGDPDKGPSTFLLEAPSACVVPAHYHTAEEQLMVVRGNLLTGMDGMAEETLGP
Ga0308309_1117276213300028906SoilMKKINGISGRAAKSLLGVVALMVLLVAVAASSRGQEQISKRGTVTPLANANFVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVPAHYHTAEEQFMVVRGDVLTGMDGMAEATLGPGGFAVMPSKAMHW
Ga0265461_1223090323300030743SoilVRLLLDTSLLFVEADMKKMNGISGRTAKSLLGVVALMVLLVAASSRGQEQVSKPGIVTSLASANLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVPAHYHTAEEQLMVVQGDVLTGWNRAVAQKI
Ga0075377_1160912713300030844SoilMKKMNGISGRTAKSLLGVGALMALLVAVAASSRGQEQISKRGIVTPLASANLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVPAHYHTAEEQLMVVPGDVLTGMDGMAEATLGPGGFAM
Ga0075401_1083661113300030935SoilMKKMNGISERTAKSLLGVVALMALLVAVAASSRGQEQISKRGIVTPLASANPVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVPAHYHTAEEQLMVVQGDVLTGMDG
Ga0073994_1001547013300030991SoilMKKMNGISGRTAKSLLGIVALMVLLVAVAESSRGQEQISKRGMVTPLASANLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVPAHYHTAEEQLMVVRG
Ga0310686_10127816833300031708SoilMKKMNGISGRTAKSLLGVVALMAPLVAASSRGQEQISKRGMVTPLASANLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVPA
Ga0310686_11433258923300031708SoilMEKMNGVSGRTAKSLLGVVALMALLVAVAASSRGQEQSSKRGMVTPLASANLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVPARY
Ga0307469_1074079123300031720Hardwood Forest SoilMRNAISARTTKMLLGLAALAMLLGAIGVSLQGQTASHGIVTPLASAKLVFDGEPECLKVARENGDPDIGASTFLLEAPSGCVVPAHYHTAEEQLMVVRDVLTGMDGMPETTMGPGGFAMMPSKAMHWFTCKSKETCLMFVTFDRKYDIVWAKPAK
Ga0307469_1166248523300031720Hardwood Forest SoilMKKMNAISGRTKMFLCLTPLMILLVSVGTGSQGQAPSHGVVTPLASANLVFDGEPACLKVARENGDPDKGPSTFLLEAPSGCVVPAHYHTAEEQLMVV
Ga0307477_1032852323300031753Hardwood Forest SoilMKKLNSFSGRTIKVLLGFAPLVILLVAVGASSGGQGQISPRGIVTPLASANLVFDGEPACLNVARENGDPDKGASTFLLEAPPGCVVPAHYHTAEEQFMVVWGDVLTGMDGVAETRLGPGGF
Ga0307475_1057606923300031754Hardwood Forest SoilMKKAMKKRNTISGRTTKMLLGFAPLLILLVAASASSRRQGQTSPHGIVTPLASANLVFDGEPACLKVARENGDPDTGPSTFLLEAPSGCVVPAHYHTAEEQLMVVRGDVLTGMDGMAEATLGPGGFAM
Ga0307473_1018875013300031820Hardwood Forest SoilMKKMNGISGRTAKSLFGVVALMALLVAVAASSRGQEQISKRGIVTPLASANLVFDGEPACLKVARENGDPDKGASTFLLEAPSGVRGSRPLSHC
Ga0307471_10086114323300032180Hardwood Forest SoilMKKMNAISGRTTKMLLGFAPLVTLLAAVGASSRGQGQTSPHGIVTPLASANLVFDGEPACLKVARENGDPDKGASTFLLEAPSGCVVPAHYHTAEEQLMAVRGDV
Ga0307472_10072378313300032205Hardwood Forest SoilMKMRNAISARTTKMLLGFAALALLLGTVCATLYGRGHGREQGQSAAHGVVTPLASANLVFDGEPACLKVARENGDPDTGASTFLLEA


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.