NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome Family F102695

Metagenome Family F102695

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F102695
Family Type Metagenome
Number of Sequences 101
Average Sequence Length 92 residues
Representative Sequence VATIYVCRKCKHSKELQSALARKTDATVKLVGCQDVCTQPLAGTRVEGCLQWFGGLDQSKRQRALIDLVNDGGRGPVPAVLAKARSKKRAGKSPR
Number of Associated Samples 75
Number of Associated Scaffolds 101

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 0.00 %
% of genes from short scaffolds (< 2000 bps) 0.00 %
Associated GOLD sequencing projects 67
AlphaFold2 3D model prediction Yes
3D model pTM-score0.65

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (100.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(13.861 % of family members)
Environment Ontology (ENVO) Unclassified
(31.683 % of family members)
Earth Microbiome Project Ontology (EMPO) Host-associated → Plant → Plant rhizosphere
(45.545 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 21.14%    β-sheet: 17.07%    Coil/Unstructured: 61.79%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.65
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 101 Family Scaffolds
PF00210Ferritin 29.70
PF00440TetR_N 9.90
PF00108Thiolase_N 4.95
PF00296Bac_luciferase 2.97
PF13631Cytochrom_B_N_2 1.98
PF00082Peptidase_S8 1.98
PF00248Aldo_ket_red 0.99
PF02332Phenol_Hydrox 0.99
PF02803Thiolase_C 0.99
PF00106adh_short 0.99
PF02308MgtC 0.99
PF01047MarR 0.99
PF02786CPSase_L_D2 0.99
PF01613Flavin_Reduct 0.99
PF00528BPD_transp_1 0.99

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 101 Family Scaffolds
COG0183Acetyl-CoA acetyltransferaseLipid transport and metabolism [I] 5.94
COG2141Flavin-dependent oxidoreductase, luciferase family (includes alkanesulfonate monooxygenase SsuD and methylene tetrahydromethanopterin reductase)Coenzyme transport and metabolism [H] 2.97
COG1285Magnesium uptake protein YhiD/SapB, involved in acid resistanceInorganic ion transport and metabolism [P] 0.99
COG1853FMN reductase RutF, DIM6/NTAB familyEnergy production and conversion [C] 0.99
COG3174Membrane component of predicted Mg2+ transport system, contains DUF4010 domainInorganic ion transport and metabolism [P] 0.99


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

NameRankTaxonomyDistribution
UnclassifiedrootN/A100.00 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil13.86%
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil13.86%
Agricultural SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil8.91%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere8.91%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil5.94%
Corn RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn Rhizosphere5.94%
Surface SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil4.95%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil4.95%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil4.95%
SoilEnvironmental → Terrestrial → Soil → Sand → Desert → Soil4.95%
RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Rhizosphere3.96%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil2.97%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil1.98%
Agricultural SoilEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Agricultural Soil1.98%
Groundwater SedimentEnvironmental → Aquatic → Freshwater → Sediment → Unclassified → Groundwater Sediment0.99%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil0.99%
Sugarcane Root And Bulk SoilEnvironmental → Terrestrial → Soil → Unclassified → Agricultural Land → Sugarcane Root And Bulk Soil0.99%
Sub-Biocrust SoilEnvironmental → Terrestrial → Soil → Unclassified → Desert → Sub-Biocrust Soil0.99%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil0.99%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Uranium Contaminated → Soil0.99%
SoilEnvironmental → Terrestrial → Soil → Loam → Grasslands → Soil0.99%
Avena Fatua RhizosphereHost-Associated → Plants → Rhizoplane → Unclassified → Unclassified → Avena Fatua Rhizosphere0.99%
Corn RhizosphereHost-Associated → Plants → Rhizoplane → Epiphytes → Unclassified → Corn Rhizosphere0.99%
Sugarcane Root And Bulk SoilHost-Associated → Plants → Rhizome → Unclassified → Unclassified → Sugarcane Root And Bulk Soil0.99%
Populus RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Populus Rhizosphere0.99%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere0.99%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300000956Soil microbial communities from Great Prairies - Kansas, Native Prairie soilEnvironmentalOpen in IMG/M
3300003319Sugarcane bulk soil Sample L2EnvironmentalOpen in IMG/M
3300003322Sugarcane root Sample L2Host-AssociatedOpen in IMG/M
3300005093Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling - Combined assembly of KBS All BlocksEnvironmentalOpen in IMG/M
3300005175Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_122EnvironmentalOpen in IMG/M
3300005345Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-10-2 metaGEnvironmentalOpen in IMG/M
3300005435Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaGEnvironmentalOpen in IMG/M
3300005440Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-25-3 metaGEnvironmentalOpen in IMG/M
3300005445Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-3 metaGEnvironmentalOpen in IMG/M
3300005458Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C8-3B metaGEnvironmentalOpen in IMG/M
3300005468Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaGEnvironmentalOpen in IMG/M
3300005518Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K1-50-3 metaGEnvironmentalOpen in IMG/M
3300005529Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen16_06102014_R1EnvironmentalOpen in IMG/M
3300005530Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C6-3B metaGEnvironmentalOpen in IMG/M
3300005535Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C7.2-3L metaGEnvironmentalOpen in IMG/M
3300005555Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141EnvironmentalOpen in IMG/M
3300005560Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_119EnvironmentalOpen in IMG/M
3300005563Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS Corn C5-2Host-AssociatedOpen in IMG/M
3300006032Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145EnvironmentalOpen in IMG/M
3300006034Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105EnvironmentalOpen in IMG/M
3300006876Agricultural soil microbial communities from Utah to study Nitrogen management - NC AS200EnvironmentalOpen in IMG/M
3300006894Agricultural soil microbial communities from Utah to study Nitrogen management - NC ControlEnvironmentalOpen in IMG/M
3300006903Populus root and rhizosphere microbial communities from Tennessee, USA - Soil MetaG P. TD hybrid SBSTD5Host-AssociatedOpen in IMG/M
3300007004Agricultural soil microbial communities from Utah to study Nitrogen management - NC CompostEnvironmentalOpen in IMG/M
3300009012Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_159EnvironmentalOpen in IMG/M
3300009093Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C5-4 metaGHost-AssociatedOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300010399Terrestrial soil microbial communities with excess Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-175-3EnvironmentalOpen in IMG/M
3300010400Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-2EnvironmentalOpen in IMG/M
3300012198Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012212Combined assembly of Hopland grassland soilHost-AssociatedOpen in IMG/M
3300012355Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_113_16 metaGEnvironmentalOpen in IMG/M
3300012362Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_80_16 metaGEnvironmentalOpen in IMG/M
3300012532Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_80_16 metaGEnvironmentalOpen in IMG/M
3300012924Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012930Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012944Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300014488Bulk soil microbial communities from Mexico - San Felipe (SF) metaGEnvironmentalOpen in IMG/M
3300015242Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015245Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300018422Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 124 TEnvironmentalOpen in IMG/M
3300018429Populus adjacent soil microbial communities from riparian zone of Shoshone River, Wyoming, USA - 504 TEnvironmentalOpen in IMG/M
3300018432Populus adjacent soil microbial communities from riparian zone of Yellowstone River, Montana, USA - 550 TEnvironmentalOpen in IMG/M
3300018466Populus adjacent soil microbial communities from riparian zone of Blue River, Arizona, USA - 249 TEnvironmentalOpen in IMG/M
3300019377Populus adjacent soil microbial communities from riparian zone of Indian Creek, Utah, USA - 112 TEnvironmentalOpen in IMG/M
3300019789Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300020181Soil microbial communities from Anza Borrego desert, Southern California, United States - S1+v_10EnvironmentalOpen in IMG/M
3300021080Groundwater sediment microbial communities from an aquifer in East River, Colorado, USA - PLM0_60_coex redoEnvironmentalOpen in IMG/M
3300021184Soil microbial communities from Anza Borrego desert, Southern California, United States - S1+v_20EnvironmentalOpen in IMG/M
3300021976Soil microbial communities from a riparian zone of the East river system, Colorado, United States ? L2c1EnvironmentalOpen in IMG/M
3300024430Soil microbial communities from Anza Borrego desert, Southern California, United States - S3+v_20EnvironmentalOpen in IMG/M
3300025910Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025912Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS C8-3B metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025922Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300025929Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L8-3 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026075Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-10-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026295Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300027637Agricultural soil microbial communities from Utah to study Nitrogen management - NC AS200 (SPAdes)EnvironmentalOpen in IMG/M
3300027691Agricultural soil microbial communities from Utah to study Nitrogen management - NC AS100 (SPAdes)EnvironmentalOpen in IMG/M
3300027886Agricultural soil microbial communities from Utah to study Nitrogen management - NC Compost (SPAdes)EnvironmentalOpen in IMG/M
3300027986Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen07_05102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300030006Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT152D67EnvironmentalOpen in IMG/M
3300031229Soil microbial communities from uranium-contaminated site in the Upper Colorado River Basin, Wyoming, United States - RVT155D38EnvironmentalOpen in IMG/M
3300031548Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - 322HYB-C-3Host-AssociatedOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031903Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - 322HYB-O-1Host-AssociatedOpen in IMG/M
3300032002Maize rhizosphere microbial communities from greenhouse at UC Davis, California, United States - DK15-C-3Host-AssociatedOpen in IMG/M
3300032174Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_05EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M
3300032205Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM5C_05EnvironmentalOpen in IMG/M
3300034172Sub-biocrust soil microbial communities from Mojave Desert, California, United States - 9HMSEnvironmentalOpen in IMG/M
3300034268Forest soil microbial communities from Eldorado National Forest, California, USA - SNFC_MG_FRD_1.2EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGI10216J12902_11066191023300000956SoilIRLVGCQDVCKQPVAGTRVDGCLQWFGGLDQSKRQRVLIDLVNDGGHGPVSPVLAKAGSRKRAGKAPR*
soilL2_1018162723300003319Sugarcane Root And Bulk SoilVATIYVCHKCKRSKQLQDALERKTDATIKLVGCQDVCEQPVAGTRVDGCLQWFGGLDKPKRQKAVIELVNNGSRGPLPDALAKSRSKKRAGKSPR*
rootL2_1028142123300003322Sugarcane Root And Bulk SoilVATIYVCHKCKRSKHLQDALERRTDATIKLVGCQDVCEQPVAGTRVDGCLQWFGGLDKPKRQKAVIELVNNGSRGPLPDVLAKSRSKKRAGKSPR*
Ga0062594_10034605923300005093SoilVATIYVCQKCKHSKELQSALERKTDATVKLVGCQDVCHQPVAGTRVDGCLQWFEGLDRAKRQRALIDLVNAEGRGPVPAVLVKARSKKRAGKAPR*
Ga0062594_10236798113300005093SoilMATIYVCRKCKHSKGLQKALEDGTDATVKLVGCQDVCTQPVAGTRVDGCLQWFGGLDTGKRTRALIDLVTDGGRGPVSPVLAKARSKKRAGKAPR*
Ga0062594_10267018113300005093SoilVATIYVCRKCKHSKELQHALTRKTDATLKLVGCQDVCKQPVAGIRVEGCLEWFGCLDQSKRQRALIDLVNDGGRGPVPAVLAKVRSKKRAGKAR*
Ga0066673_1087018123300005175SoilMATLYVCRKCKHSKSLQKAVERETDATVRLVGCQDVCQQPLAGTRVDGRLEWFGGLDKAKRQQALIELLNDEPRRIPAALEKARVAKRSGKGPR*
Ga0070692_1135211113300005345Corn, Switchgrass And Miscanthus RhizosphereVATIYVCQKCKHSKELQSALERKTDATVKLVGCQDVCHQPVAGTRVDGCLQWFEGLDRAKRQRALIDLVNAEGRGPVPAVLAKARSKKRAGKAPR*
Ga0070714_10056958413300005435Agricultural SoilGGGSQVGTVYVCRKCKHSKALQQAIERGSAATVKLVGCQDVCHQPVAGTRVDDRLEWFGGLDKPKRRAALVELLNDKPPRRIPKALDKARVAKRAGKDPR*
Ga0070705_10148692813300005440Corn, Switchgrass And Miscanthus RhizosphereSRGPRGERVATIYVCSECKHSKTLQCALKRKTDATVKLVGCQDVCEQPLAGTRVDGALQWFGGLDRPKRQRALIDLVNDGGRGPLPEVLTKARSKKRAGKAPR*
Ga0070708_10049292323300005445Corn, Switchgrass And Miscanthus RhizosphereMATIYVCRKCKHSKCLQKAVERGSDATVRLVGCQDVCEQPVAGTRVDGRLEWFGGLDKAKRQQAIIDLLNDQPKRVPDALDKARVAKRAGKGPR*
Ga0070708_10110360023300005445Corn, Switchgrass And Miscanthus RhizosphereLERKTDATVKLVGCQDVCHQPVAGTRVDGCLQWFEGLDRAKRQRALIELVNAEGRGPVPAVLVKARSKKRAGKAPR*
Ga0070681_1004634043300005458Corn RhizosphereVGTVYVCRKCKHSKALQQAIERGSAATVKLVGCQDVCHQPVAGTRVDDRLEWFGGLDKPKRRAALVELLNDKPPRRIPKALDKARVAKRAGKDPR*
Ga0070681_1042115933300005458Corn RhizosphereQMATIYVCRKCKHSKGLQKALEKGTDATIRLVGCQDVCEQPVAGTRVEGCLQWFGGLDRPKRQQALIDLVNHHGDGPVPVVLAKTRSRKRAGKAPR*
Ga0070707_10206362823300005468Corn, Switchgrass And Miscanthus RhizosphereKTDATVKLVGCQDVCQQPVAGTRVDGRLEWFGGLDRAKRQRALIDLVKDGGRGPVPVVLAKARSKKRAGKAPR*
Ga0070699_10084405023300005518Corn, Switchgrass And Miscanthus RhizosphereMATIYVCRKCKHSKALQTALERGTDATVKLVGCQDVCKQPVAGTRVDGCLQWFEGLDRPKRQKALIDLVNDGGDGPLPEVLVKARSRKRAGRGPR*
Ga0070741_100004521403300005529Surface SoilMATLYVCRECKHSKALQKAVARETDATVKLVGCQDVCKEPVAGARVGGRLEWFGGLDAAKRQRALIRLLTDQPRKVPEALEKARVAKRSGKEPR*
Ga0070741_10004828203300005529Surface SoilVYVCRKCKHSKVLQKAIEGGSDATVKLVGCQDICRQPVAGTRVDDRLEWFGGLDKPKRRAALLELLNDPRPRRVPKALAKVRAPKRSGKDPR*
Ga0070741_1001295763300005529Surface SoilVGTVYVCRKCKHSKALQQAIERGSDATVKLVGCQDVCRQPVAGTRVDDRLEWFGGLDKPKRRAALVDLLNDKAPRRIPKALEGARVAKRAGKDPR*
Ga0070741_1020544133300005529Surface SoilVATIYVCRKCKHSKGLQKALEKGTDATVRLVGCQDVCSQPLAGTRVEGSLQWFGGLDRPKRQRAFIDLVNHGGNDPVSPVLAKARSRKRAGKAPR*
Ga0070679_10044968923300005530Corn RhizosphereVATIYVYQKCKHSKELQSALERKTDATVKLVGCQDVCHQPVAGTRVDGCLQWFEGLDRAKRQRALIELVNAEGRGPVPAVLVKARSKKRAGKAPR*
Ga0070684_10121740423300005535Corn RhizosphereMATIYVCRKCKHSKGLQKALEKGTDATIRLVGCQDVCEQPVAGTRVEGCLQWFGGLDRPKRQQALIDLVNHHGDGPVPVVLAKTRSRKRAGKAPR*
Ga0066692_1083627023300005555SoilCKHSKGLQKAVERGTDATIRLVGCQDVCQQPVAGTRVGGRLEWFGGLDKAKRQRALVDLLNDGSRRIPDELAKARVGKRSGKGPR*
Ga0066670_1025603723300005560SoilMATTPGVGGCPPTMYVCRKCKHSKGLQKAVERGTDATIRLVGCQDVCQQPVAGTRVGGRLEWFGGLDKAKRQRALVDLLNDGSRRIPDELAKARVGKRSGKGPR*
Ga0068855_10100371213300005563Corn RhizosphereHSKGLLQAIERGSDATVKLVGCQDICHQPVAGTRVDDRLEWFGGLDKPKRRAALVELLNDKPPRRIPKALDKARVAKRAGKDPR*
Ga0066696_1005625023300006032SoilMATLYVCRKCKHSKSLQKAVERETDATIRLVGCQDVCQQPLAGTRVDGRLEWFGGLDKAKRQQALIELLNDEPRRIPAALEKARVAKRSGKGPR*
Ga0066656_1040897223300006034SoilMATIYVCRKCKHSKGLQKAIAQGSDATVKLVGCQDVCEQPVAGTRVDGCLQWFGGLDKAKRQRALIDLLNDQPKKIPDGLEKARAAKRAGKAPR*
Ga0079217_1013505523300006876Agricultural SoilMATIYVCRKCKHSKGLQKALERGTDATVKLIGCQDVCQQPVAGTRAEGCLQWFGGLDKPKRQRALIDLVNGETDGEVPGPLAKARTRKQKAGKGLR*
Ga0079217_1016322423300006876Agricultural SoilVATIYVCRKCKHSKELQDALSRKTDSTLKLVGCQDVCKQPVAGTRVDGCLQWFGGLDQSKRRRALIDLVNDGARGPVPAVLAKASAKKRAGKSPR*
Ga0079215_1007804823300006894Agricultural SoilMATIYVCRKCKHSKGLQKALERGTDATVKLIGCQDVCQQPVAGTRAEGCLQWFGGLDKPKRQRALINLLNGETAGEVPVTLAKARTRKQKAGKGPR*
Ga0079215_1170212513300006894Agricultural SoilVSTTYVCLKCKHSKELQEALARKTDATLKLVGCQDVCKQPVAGTRVDGCLEWFGGLDQSKRRRALIDLVNDGNRGPVPAVLAKASSKKRAGKSPR*
Ga0075426_1010629633300006903Populus RhizosphereVGTVYVCRKCKHSKALQRAIERGSDATVKLVGCQDVCQQPVAGTRVDDRLEWFGGLDKAKRRAALVELLNDKPPRRIPKAL
Ga0079218_1006153023300007004Agricultural SoilVATIYVCRKCKHSKELQDALSRKTDSTLKLVGCQDVCKQPVAGTRVDGCLEWFGGLDQSKRRRALIDLVNDGNRGPVPAVLAKASSKKRAGKSPR*
Ga0079218_1307395513300007004Agricultural SoilMPTIYVCRECKHSKSLQKALEKGTDATIKLVGCQDVCKQPVAGTRVDGSLQWFGALDQAKRQRALIDLVNDGGRGPVSPLLAKTRSRKRAGKTPR*
Ga0066710_10235006823300009012Grasslands SoilMATIYVCRKCKHSKDLQKAVGRGTDATVKLVGCQDVCEQPVAGTRVDGCLQWFGGLDQGKRQRALIDLLNEPRPRELPKVLAKARVPKRAGKSPR
Ga0066710_10484428913300009012Grasslands SoilMATLYVCRKCKHSKSLQKAVERETDATVRLVGCQDVCQQPVAGTRVDSRLEWFGGLDKAKRQRALIDLLNHAPRRVPDALEK
Ga0105240_1122653113300009093Corn RhizosphereVGTVYVCRKCKHSKALQQAIERGSAATVKLVGCQDVCHQPVAGTRVDDRLEWFGGLDKPKRRAALVELLNDKPPRRIPKALHKARVAKRAGKDPR*
Ga0066709_10066819213300009137Grasslands SoilMYVCRKCKHSKGLQKAVERGTDATIRLVGCQDVCQQPVAGTRVGGRLEWFGGLDKAKRQRALVDLLNDGSRRIPDELAKARVGKRSGKGPR*
Ga0134127_1159228623300010399Terrestrial SoilVATIYVCSECKHSKTLQCALKRKTDATVKLVGCQDVCEQPLAGTRVDGALQWFGGLDRPKRQRAMIDLVNDGGRGPLPEVLAKARSKKRAGKAPR*
Ga0134122_1161537213300010400Terrestrial SoilVATIYVCQKCKHSKELQSALERKTDATVKLVGCQDVCHQPVAGTRVDGCLQWFEGLDRAKRQRALIELVNAEGRGPVPAVLVKARSKKRAGKAPR*
Ga0137364_1026440023300012198Vadose Zone SoilMATIYVCRKCKHSKDLQKAVGRGTDATVKLVGCQDVCEQPVAGTRVDGCLQWFGGLDQGKRQRALIDLLNEPRPRELPKVLAKARVPKRAGKSPR*
Ga0137363_1042467723300012202Vadose Zone SoilMIYVCRKCKHSKDLHKAVERGTDATVKLVGCQDICDQPVAGTRVDGCLQWFGGLDRPKRQKALINLVNDGGDGPLPEVLVKARSRKRAGKAPR*
Ga0150985_10287673323300012212Avena Fatua RhizosphereVATIYVCRKCKHSKDLQKALERGTDASIRLVGCQEVCEQPVAGTRADGCLQWFGGLDKPKRQRALIDLVNHGGDGKIPDVLAKSRSRMR
Ga0137369_1063998413300012355Vadose Zone SoilSASLQKALEQRTDATVKLVGCQDVCQQPLAGTRVDGCLQWFGGLDRAKRQRALIDLVNDGGHGPLAPVLVKARSKKRAGKAPR*
Ga0137361_1104332523300012362Vadose Zone SoilGCQDVCQQPVAGTRVDGSHQWFGGLDRPKRQKALINLVNDGGDGPLPEVLVKARSRKRAGKAPR*
Ga0137373_1016170023300012532Vadose Zone SoilMATIYVCRKCKHSKGLRCALERGTGATVKLVGCQDVCKEPVAGTKVDGCLQWFGGLDRPERQRALIDLLKDPRPRAVPDALVKARAPKRAGKGPR*
Ga0137413_1079270223300012924Vadose Zone SoilLERGTDATIRLVGCQDVCEQPLAGTRVDGCLQWFGGLDKSKRQHALIELVNDGGHGKVPDVLAKARSRKRAGKSPR*
Ga0137416_1213171823300012927Vadose Zone SoilMGTLYVCRKCKHSKGLQKAVERGTDATIRLVGCQDVCQQPVAGTRVTGRLEWFGGLDKAKRQQALIDLLNDQPKRVPDALDKARVAKRAGKGPR*
Ga0137404_1009587733300012929Vadose Zone SoilMATIYVCRKCKHSKDLHKAVERGTDATVKLVGCQDICAQPVAGTRVDGCLQWFGGLDRPKRQKALINLVNDVGDGPLPEVLVKARSRKRAGKAPR*
Ga0137407_1025481123300012930Vadose Zone SoilMATIYVCRKCKHSKDLHKAVERGTDATVKLVGCQDICDQPVAGTRIDGCLQWFGGLDRPKRQKALINLVNDGGDGPLPEVLVKARSRKRAGKAPR*
Ga0137410_1033547823300012944Vadose Zone SoilMATIYVCRKCKHSKDLHKAVERGTDATVKLVGCQDICDQPVAGTRVDGCLQWFGGLDRPKRQKALINLVNHGGDGPLPEVLVKARSRKRAGKAPR*
Ga0182001_1071041413300014488SoilVATKPTIYVCRKCKHSKELQGTLSRKTDATLKLVGCQEVCKQPVAGTRVDGGLQWFGGLDQSKRQRALIDLVNDGSRGPVPAVLAKASAKKRAGKSPR*
Ga0137412_1094409713300015242Vadose Zone SoilKCKHSKDLQKALERGTDATISLVGCQDVCEQPLAGTRVDGCLQWFGGLDKSKRQHALIELVNDGGHGKVPDVLAKARSRKRAGKSPR*
Ga0137409_1036455823300015245Vadose Zone SoilMATIYVCRKCKHSKDLHKAVERGTDATVKLVGCQDICDQPVAGTRVDGCLQWFGGLDRPKRQKALINLVNDGGDGPLPEVLVKARSRKRAGKAPR*
Ga0190265_1031563413300018422SoilTIYVCRKCKHSKELQNALARKTDGTIKLVGCQDICKQPVAGTRVDGELQWFGGLDQCKRQRALIDLVNDGGRGPVPVVLAKARSKKRAGKSPR
Ga0190265_1041366423300018422SoilMATIYVCRKCKHSKALQNALERGTDATVKLIGCQDVCEQPVAGTRVDGCLQWFGGLDKSKRQRALIDLVNDGSTGRLPEVLVKARDRKRAGKAPR
Ga0190265_1143904523300018422SoilLATIYVCRKCKHSKALQSALGKGTDASIKLIGCQDVCKQPVAGTRAEGCLQWFGGLDSGKRQRALIDLVNGQTDGKVPAVLVKARSGKKKAGKTPR
Ga0190272_1002384513300018429SoilMATIYVCRKCKHSKDLQKALEKGTDATIKLVGCQDVCSQPLAGTRVDGSLQWFGGLEQGKRQRALIDLVNDGGKGPVSPVLAKARSRKRAGKAPR
Ga0190272_1044958033300018429SoilMATLYVCRKCKHSKALQNAIERGTDATVKLIGCQDVCKQPVAGTKAEGCMQWFGGLDQPKRQRALIDLVNGRTDGRVPEVLVKARSSKRKGKAPR
Ga0190275_1267491813300018432SoilMKTIYVCRKCKHSKALQNALGRETDATVKLIGCQDVCEQPVAGTRVDGCLQWFGGLDKSKRQRALIDLVNGGSTGRLPEVLVKARDRKRAGKTPR
Ga0190268_1003803423300018466SoilERPRGERVATIYVCRKCKHSKELQSALARKTDATVKLVGCQDVCTQPLAGTRVEGCLQWFGGLDQSKRQRALIDLVNDGGRGPVPAVLAKARSKKRAGKSPR
Ga0190264_1006650113300019377SoilVSTIYVCRKCKHSKELQDALARKTDATVKLVGCQDVCKQPVAGTRVDGCLEWFGGLDQSKRRRALIDLVNDGNRGPVPAVLAKASSKKRAGKSPR
Ga0190264_1028394923300019377SoilATIYVCRKCKHSKALQNAIAQGTDASIKLVGCQDVCKQPVAGTRIEGCLQWFGGLDQPKRQKALIDVVNGRGDGRLPDVLVKARNPKRTGKAPR
Ga0190264_1060449313300019377SoilVATIYVCRKCKHSKELQSALARKTDATVKLVGCQDVCTQPLAGTRVEGCLQWFGGLDQSKRQRALIDLVNDGGRGPVPAVLAKARSKKRAGKSPR
Ga0190264_1160009713300019377SoilVATIYVCRECKHSKALQCALKRKTDATVKLVGCQDVCEQPLAGTRVDGALQWFGGLDRPKRQRAVIDLVNDGGRGPLPNVLIKARSKKRAGKAPR
Ga0137408_123201023300019789Vadose Zone SoilMATIYVCRKCKHSKDLHKAVERGTDATVKLVGCQDICDQPVAGTRIDGCLQWFGGLDRPKRQKALINLVNDGGDGPLPEVLVKARSRKRAGKAPR
Ga0137408_123201123300019789Vadose Zone SoilIYVCRKCKHSKDLHKAVERGTDATVKLVGCQDICDQPVAGTRIDGCLQWFGGLDRPKRQKALINLVNDGGDGPLPEVLVKARSRKRAGKAPR
Ga0196958_1015902413300020181SoilMATIYVCRKCKHSKCLRKELERKTGATIKLIGCQDVCKQPVAGTRVDGRLQWFGGLDRPKRQRALIDLVNDGGEGPVPVVLAKARSTKRAGKAPRS
Ga0210382_1041899213300021080Groundwater SedimentCQDVCKQPVAGTRVDGCVQWFGGLDDAKRQRALIDLLNGRWDGRLPDVLARARSRKRAGKKPR
Ga0196959_1006351913300021184SoilVATIYVCRKCKHSKELREALARKTDATVKLVGCQDVCKQPVAGTRVDGCLQWFGGLDRPKRQRALIDLINDGGRGPVPLVLTKARSGKRAGKAPR
Ga0196959_1012660113300021184SoilVATIYVCRKCKHSEDLQAALARKTDATVKLVGCQDVCKQPVAGTRVDGCLQWFGGLDRPKRRRAIIDLVNDGGRGPVPVVLAKARSGKRAGKAPR
Ga0193742_108351823300021976SoilVQDLQKALEKGTDATIKLVGCQDVCSQPLAGTRVDGSLQWFGGLEQGKRQRALIDLVNDGGKGPVPPVLAKARSRKRAGKAPR
Ga0196962_1000108543300024430SoilVSTIYVCSKCKHSKQLQCALGRGTDATLKLVGCQDVCQQPVAGTRVEGRLQWFGGLDKPKRQRALIDLVNDGGRGPVPEVLAKARSKKRAGKSPR
Ga0196962_1010312123300024430SoilVRVTTIYVCRKCKHSKELQGALTRKTDATLKLVGCQEVCKQPVAGTRVEGGLQWFGGLDQAKRRRAFIDLVNDGGRGPVPAELVKARSKKRAGKSPR
Ga0207684_1171130813300025910Corn, Switchgrass And Miscanthus RhizosphereVATIYVCQKCKHSKELQSALERKTDATVKLVGCQDVCHQPVAGTRVDGCLQWFEGLDRAKRQRALIDLVNAEGRGPVPAVLVKARSKKRAGKAPR
Ga0207707_1015317813300025912Corn RhizosphereATVKLVGCQDVCHQPVAGTRVDGCLQWFEGLDRAKRQRALIDLVNAEGRGPVPAVLVKARSKKRAGKAPR
Ga0207707_1072873823300025912Corn RhizosphereQMATIYVCRKCKHSKGLQKALEKGTDATIRLVGCQDVCEQPVAGTRVEGCLQWFGGLDRPKRQQALIDLVNHHGDGPVPVVLAKTRSRKRAGKAPR
Ga0207646_1173669523300025922Corn, Switchgrass And Miscanthus RhizosphereTDATVKLVGCQDVCQQPVAGTRVDGRLEWFGGLDRAKRQRALIDLVKDGGRGPVPVVLAKARSKKRAGKAPR
Ga0207664_1105033123300025929Agricultural SoilKCKHSKALQQAIERGSDATVKLVGCQDVCRQPVAGTRVDDRLEWFGGLDKPKRRAALVELLNDKPPRRIPKALDKARVAKRAGKDPR
Ga0207708_1070604523300026075Corn, Switchgrass And Miscanthus RhizosphereVATIYVCQKCKHSKELQSALERKTDATVKLVGCQDVCHQPVAGTRVDGCLQWFEGLDRAKRQRALIDLVNAEGRGPVPTVLAKARSKKRAGKAPR
Ga0209234_105269633300026295Grasslands SoilMATTPGVGGCPPTMYVCRKCKHSKGLQKAVERGTDATIRLVGCQDVCQQPVAGTRVGGRLEWFGGLDKAKRQRALVDLLNDGSRRIPDELAKARVGKRSGKGPR
Ga0209234_118059723300026295Grasslands SoilVATIYVCRKCKHSKELQSALERKTDATVKLVGCQDVCHQPVAGTRVDGTLQWFEGLDRAKRQRALIELVNDGGRGPLPDVLAKARSKKQAGKAPR
Ga0209818_104957713300027637Agricultural SoilVATIYVCRKCKHSKELQDALSRKTDSTLKLVGCQDVCKQPVAGTRVDGCLQWFGGLDQSKRRRALIDLVNDGARGPVPAVLAKASAKKRAGKSPR
Ga0209485_105667023300027691Agricultural SoilVATIYVCRKCKHSKELQEALSRKTDSTLKLVGCQDVCKQPVAGTRVDGCLQWFGGLDQSKRRRALIDLVNDGARGPVPAVLAKASAKKRAGKSPR
Ga0209486_1115111713300027886Agricultural SoilMPTIYVCRECKHSKSLQKALEKGTDATIKLVGCQDVCKQPVAGTRVDGSLQWFGALDQAKRQRALIDLVNDGGRGPVSPLLAKTRSRKRAGKTPR
Ga0209168_1010642733300027986Surface SoilVGTVYVCRKCKHSKALQQAIERGSDATVKLVGCQDVCRQPVAGTRVDDRLEWFGGLDKPKRRAALVDLLNDKAPRRIPKALEGARVAKRAGKDPR
Ga0299907_1003853353300030006SoilVATIYVCSKCKHSKALQCAVERRTDATIKLVGCQDVCEQPLAGTRVDGVLQWFGGLDRPKRQRAVIDLLNDGGRRPLPDVLNKARAKKRAGKAPR
Ga0299907_1033060323300030006SoilMATIYVCRKCKHSKGLQKALERGTDATVKLIGCQDVCQQPVAGTRAEGCLQWFGGLDKPKRQRALIELVNGETDGEVPGPLARARTRKQKAGKGPR
Ga0299913_1050017333300031229SoilVGKKPGVVGSLSVVYVCRKCKHSKGLQAALARETDASVRLVGCQDVCQQPVAGCRVDGCLEWFGRLDKKKRQRALVELVNDGGRGPLS
Ga0307408_10013974523300031548RhizosphereVSTIYVCRKCKHSKELQDALARRTDATLKLVGCQDVCKQPLAGTRVDGCLEWFGGLDQSKRQRALIDLVNDGSRGPVPAVLAKARSKKRAGQSPR
Ga0307469_1045427123300031720Hardwood Forest SoilMATLYVCRKCKHSASLQKALEQRTDATVKLVGCQDVCQQPVAGTRVDGSFCWFGGLDGAKRQRALIDLVNTGGHGPLSPVLVKARSKKRAGKAPR
Ga0307407_1136599113300031903RhizosphereVATIYVCRKCKHSKELQSAIARKTDATVKLVGCQDVCKQPVAGTRVDGELQWFGGLDQCKRQRALIDLVNDGGRGPVPVVLTKARSKKRAGKSPR
Ga0307416_10010583923300032002RhizosphereVSTIYVCRKCKHSKELQDALARRTDATLKLVGCQDVCKQPLAGTRVDGCLEWFGGLDQSKRQRALIDLVNDGSRGPVPAVLAKARSKKRAGKSPR
Ga0307416_10179379823300032002RhizosphereYVCRECKHSKGLQKALDKGTDATIKLVGCQDVCKQPVAGARVDGCLQWFGGLDQGKRQRALIDLVNDGGHGPVSPVLAKAGSRKRAGKAPR
Ga0307470_1095465613300032174Hardwood Forest SoilMPTNPGVGGCPPTIYVCRKCKHSKALREAIERGSDAAVKLVGCQDVCREPVAGTRVDGRMEWFGGLDKAKRQRALLSLLNGSAPREIPEALAKARVPKRAGKAPR
Ga0307471_10006778133300032180Hardwood Forest SoilMASIYVCRKCKHSKALREAIERGSDAAVKLVGCQDVCREPVAGTRVDGRMEWFAGLDKPKRQRALLSLLNASAPREIPEALVKARVPKRAGKGPR
Ga0307471_10042155913300032180Hardwood Forest SoilMGTIYVCRKCKHSKGLQKALERGTDATVKLVGCQDVCEQPLAGTRVDGCLEWFGGLDRPERQRALVNLLNDSRPKGIPKVLDKARVPKRAGKGPR
Ga0307471_10243549013300032180Hardwood Forest SoilMATPTIYVCRKCKHSKCLQKAVEHGTDATVRLVGCQDVCEQPVAGTRIDGRLEWFGGLDKAKRQRALVDLLNDGSRRIPDELAKARVGKRSGKGPR
Ga0307472_10084663523300032205Hardwood Forest SoilMPTNPGVGGCPPTIYVCRKCKHSKALREAIERGSDAAVKLVGCQDVCREPVAGIRVDGRMEWFAGLDKAKRQRALLSLLNASAPREIPEALVKARVPKRAGKGPR
Ga0334913_071888_484_7293300034172Sub-Biocrust SoilVATIYVCSECKHSKTLQDALKRKTDATVKLVGCQDVCEQPVAGTRVDGCLQWFGGLDRPKRRRAVIDLLNDGGRGPLPEVLV
Ga0372943_0026262_2051_23383300034268SoilVATIYVCRKCKHSKDLQKGLERGTDATIRLVGCQDVCDQPVAGTRVDGCLQWFGGLDKPKRQRALINLVKDGGDGPVPDVLAKTRSRKRAGKSPR


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.