NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F095218

Metagenome / Metatranscriptome Family F095218

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F095218
Family Type Metagenome / Metatranscriptome
Number of Sequences 105
Average Sequence Length 67 residues
Representative Sequence MRFRTSLCLATLFLVCFTIAAWSTPLPAQPAADSLQPTPDNQSLSGQIASVGDAEFSVQVTKDKDVNTVQFL
Number of Associated Samples 80
Number of Associated Scaffolds 105

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 100.00 %
% of genes near scaffold ends (potentially truncated) 3.81 %
% of genes from short scaffolds (< 2000 bps) 3.81 %
Associated GOLD sequencing projects 69
AlphaFold2 3D model prediction Yes
3D model pTM-score0.31

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (95.238 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(44.762 % of family members)
Environment Ontology (ENVO) Unclassified
(38.095 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(47.619 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: Yes Secondary Structure distribution: α-helix: 21.00%    β-sheet: 21.00%    Coil/Unstructured: 58.00%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.31
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 105 Family Scaffolds
PF02620YceD 11.43
PF06271RDD 10.48
PF13432TPR_16 2.86
PF03747ADP_ribosyl_GH 2.86
PF13620CarboxypepD_reg 1.90
PF13248zf-ribbon_3 1.90
PF01522Polysacc_deac_1 0.95
PF00106adh_short 0.95
PF14294DUF4372 0.95
PF03544TonB_C 0.95
PF00982Glyco_transf_20 0.95
PF08281Sigma70_r4_2 0.95
PF01566Nramp 0.95
PF13424TPR_12 0.95
PF02469Fasciclin 0.95

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 105 Family Scaffolds
COG139923S rRNA accumulation protein YceD (essential in plants, uncharacterized in bacteria)Translation, ribosomal structure and biogenesis [J] 11.43
COG1714Uncharacterized membrane protein YckC, RDD familyFunction unknown [S] 10.48
COG1397ADP-ribosylglycohydrolasePosttranslational modification, protein turnover, chaperones [O] 2.86
COG0380Trehalose-6-phosphate synthase, GT20 familyCarbohydrate transport and metabolism [G] 0.95
COG0726Peptidoglycan/xylan/chitin deacetylase, PgdA/NodB/CDA1 familyCell wall/membrane/envelope biogenesis [M] 0.95
COG0810Periplasmic protein TonB, links inner and outer membranesCell wall/membrane/envelope biogenesis [M] 0.95
COG1914Mn2+ or Fe2+ transporter, NRAMP familyInorganic ion transport and metabolism [P] 0.95
COG2335Uncaracterized surface protein containing fasciclin (FAS1) repeatsGeneral function prediction only [R] 0.95


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A95.24 %
All OrganismsrootAll Organisms4.76 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300007265|Ga0099794_10399202All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium718Open in IMG/M
3300009089|Ga0099828_10199713All Organisms → cellular organisms → Bacteria1784Open in IMG/M
3300012923|Ga0137359_10206279Not Available1755Open in IMG/M
3300020583|Ga0210401_10079053All Organisms → cellular organisms → Bacteria → Acidobacteria3095Open in IMG/M
3300026538|Ga0209056_10278579All Organisms → cellular organisms → Bacteria1170Open in IMG/M
3300026551|Ga0209648_10081540All Organisms → cellular organisms → Bacteria → Acidobacteria2699Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil44.76%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil19.05%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil10.48%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil9.52%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil7.62%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil4.76%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil3.81%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300000364Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300000789Soil microbial communities from Great Prairies - Iowa, Native Prairie soilEnvironmentalOpen in IMG/M
3300002558Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cmEnvironmentalOpen in IMG/M
3300002911Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cmEnvironmentalOpen in IMG/M
3300002914Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cmEnvironmentalOpen in IMG/M
3300002916Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cmEnvironmentalOpen in IMG/M
3300005172Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132EnvironmentalOpen in IMG/M
3300005177Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139EnvironmentalOpen in IMG/M
3300005561Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_148EnvironmentalOpen in IMG/M
3300005568Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152EnvironmentalOpen in IMG/M
3300005569Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_154EnvironmentalOpen in IMG/M
3300006032Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145EnvironmentalOpen in IMG/M
3300006796Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114EnvironmentalOpen in IMG/M
3300006797Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108EnvironmentalOpen in IMG/M
3300007255Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1EnvironmentalOpen in IMG/M
3300007258Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3EnvironmentalOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300009089Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H1.8 metaGEnvironmentalOpen in IMG/M
3300009137Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158EnvironmentalOpen in IMG/M
3300010303Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09182015EnvironmentalOpen in IMG/M
3300010320Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_11112015EnvironmentalOpen in IMG/M
3300010325Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_0_1 metaGEnvironmentalOpen in IMG/M
3300010364Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_2_09212015EnvironmentalOpen in IMG/M
3300012096Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4B metaGEnvironmentalOpen in IMG/M
3300012189Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaGEnvironmentalOpen in IMG/M
3300012199Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012200Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012202Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_115_16 metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012210Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaGEnvironmentalOpen in IMG/M
3300012211Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012285Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_20_16 metaGEnvironmentalOpen in IMG/M
3300012356Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012363Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaGEnvironmentalOpen in IMG/M
3300012685Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz1.16 metaGEnvironmentalOpen in IMG/M
3300012918Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaGEnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012923Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaGEnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012977Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Glu_20cm_5_24_1 metaGEnvironmentalOpen in IMG/M
3300015245Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015264Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300018431Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104EnvironmentalOpen in IMG/M
3300018433Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116EnvironmentalOpen in IMG/M
3300018482Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_118EnvironmentalOpen in IMG/M
3300019789Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_2_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300020579Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-MEnvironmentalOpen in IMG/M
3300020581Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-14-MEnvironmentalOpen in IMG/M
3300020583Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-32-MEnvironmentalOpen in IMG/M
3300021088Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-MEnvironmentalOpen in IMG/M
3300021170Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-26-MEnvironmentalOpen in IMG/M
3300021479Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-MEnvironmentalOpen in IMG/M
3300021559Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-MEnvironmentalOpen in IMG/M
3300022724Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-H-17-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300024330Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300026300Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026301Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cm (SPAdes)EnvironmentalOpen in IMG/M
3300026318Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 (SPAdes)EnvironmentalOpen in IMG/M
3300026335Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_139 (SPAdes)EnvironmentalOpen in IMG/M
3300026377Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - DW-10-BEnvironmentalOpen in IMG/M
3300026529Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_152 (SPAdes)EnvironmentalOpen in IMG/M
3300026538Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 (SPAdes)EnvironmentalOpen in IMG/M
3300026551Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes)EnvironmentalOpen in IMG/M
3300027587Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM3_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027645Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM2_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027655Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 (SPAdes)EnvironmentalOpen in IMG/M
3300027667Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_OM3_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027671Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 (SPAdes)EnvironmentalOpen in IMG/M
3300027684Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM1H0_M2 (SPAdes)EnvironmentalOpen in IMG/M
3300027862Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300029636Metatranscriptome of lab incubated forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-M (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031720Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM2C_515EnvironmentalOpen in IMG/M
3300031740Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gases AM2C_05EnvironmentalOpen in IMG/M
3300031753Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_515EnvironmentalOpen in IMG/M
3300031754Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM1C_515EnvironmentalOpen in IMG/M
3300032180Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515EnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
INPhiseqgaiiFebDRAFT_10154605613300000364SoilMRFRTSMVLATLFVACFTIAAWSTPPTVRPSPRPLLPTPDTQSLSGKIASVGDAEFSVQVTKGQDVNTVQFLVDDKTKVEGKLAVGAQA
INPhiseqgaiiFebDRAFT_10154635313300000364SoilMRFRTSMFLITMFVACCTIAAWSTPLTADTQSLSGRIATIGDGEFSVQVTKDQDVKTVQFLVDDKTKVEGKLAVGA
JGI1027J11758_1289563313300000789SoilMRFRTSMVLATLFVACFTIAAWSTPPTVRPSPRPLLPTPDTQSLSGKIASVGDAEFSVQVTKGQDVNTVQFLVDDKTKVEGKLAVG
JGI25385J37094_1016716713300002558Grasslands SoilMRFRTSMCLATLFLFCFTIAAWSAPHLILAAGHTEPTPDNQSLSGQITSVGDAEFSVQVAKNKDISTVKFMVDDKT
JGI25390J43892_1003311423300002911Grasslands SoilMRFRTSLCLATLFLVCFTIAAWSTALPAQPAADNLQPTPDNQSLSGQIASVGDAEFSVQVTKDKNVNTVQFLVDGKTKVEGKLAVGAE
JGI25617J43924_1031297013300002914Grasslands SoilMCLTTLFLVCFTIAAWSTPPPAQPAAGHPQSTPDNQSLSGQITSVGDAEFSLQVAKNKEASTVHFQVDDKTKVXGKLAVGAQASV
JGI25389J43894_108026113300002916Grasslands SoilMRFRTSLXLATLVLVSFTMGAWCTPLPAHSAAINLQPTPDKQSLSRQMAPGRSRILRTVRKDQGVNTVQFLVGGKTKIQGK
Ga0066683_1088338013300005172SoilMRFRTSLCLATLFLVCFTIAAWSTPLPAQTAADSLQPAPDNQSLSGPIASVGDAEFSVQVTKDKDVNTVQFLVDGKT
Ga0066690_1044680213300005177SoilMRFRTSLCLATLFLVCFTMAAWSTLLPAPSAAINEQPTPDQQSLWGQIASVGD
Ga0066699_1011982713300005561SoilMRFRTSLCLATLFLVCFTMAAWSTLLPAPSAAINEQPTPDQQSLWGQIASVGDREFSVQVKKDQGVNTVPFLVD
Ga0066703_1022873123300005568SoilMRFRTSLCLATLFLVCFTIAAWSTPLPAQTAADSLQPAPDNQSLSGPIASVGDAEFSVQVTKDKDVNTVQF
Ga0066703_1056427013300005568SoilMRFRTSLCLATLFLVCFTIAAWSTPLPAETAADSLQPAPDNQSLSGPIASVGDAEFSVQVTKDKDVNTVQF
Ga0066705_1015218423300005569SoilMRFRASLCLATLFLVCFTIAAWSTPPAAHPAAGPLEPTPDNQSLSGQITSVGDAEFSVQVAK
Ga0066705_1051343713300005569SoilMRFRTSLRLATLFVVCFTMAAWYTPLPVHSAAVNLQPIPHKQSLSRQ
Ga0066696_1022360323300006032SoilMRFRTSLCLATLFLVCFTIAAWSTPLPAETAADSLQPAPDNQSLSGPIASV
Ga0066696_1042969523300006032SoilMRFRTSLCLATLFLVCFTMAAWSTLLPAPSAAINEQPTPDQQSLWGQIASVGDREFSVQVKK
Ga0066665_1045674313300006796SoilMRFRTWLCRATLFLVCFTIAAWSAPLPAQPAADSRQPALDNQALPGQIASVGGAEFSVPVKRTKM*
Ga0066659_1012798613300006797SoilMRFRTSLCLATLFLVCFTIAAWSTPLPAQTAADSLQPAPDNQSLSGPIA
Ga0066659_1013598713300006797SoilMRFRTSLCLATLFLVCFTIAAWSTPLPAETAADSLQPAPDNQSLSGPIASVGDAEFSVQVTKDKDVNTVQFL
Ga0099791_1011946713300007255Vadose Zone SoilMCLATLFLFCLTIAAWSTPHPILAAGHAQPTPDNQSLSGQITSVGDAEFSVQVAKNKDISTVRFMVDDKTKVEGKLAVGAQATVEYR
Ga0099791_1057715523300007255Vadose Zone SoilVFVVCFTIAAWSAPLPSVPEADHLSPTPDNQSVSGTVAAIGDAEFSLQVTKNQEVNTVQFLVDDKTKVEGKLAVG
Ga0099791_1058874613300007255Vadose Zone SoilMCLATLFLFCLTIAAWSTPLPILAAGHSQPTPDNQSLSGQITSVGDAEFSVQVAKNKDISTV
Ga0099793_1015223323300007258Vadose Zone SoilMRFRNSLCLATLFLVCFTIAAWSTPLPAQTAADSLQPAPDNQSLSGPIASVGDAEFSV
Ga0099793_1041141423300007258Vadose Zone SoilMRFRTSLVLATLFVVCFTIAAWSAPLPALPVADHGSPGPDNQSLSG
Ga0099794_1010918513300007265Vadose Zone SoilMLFLFFFTITAWSTPHPILAAGCSEPTPDNQSLSGQITSVGDAEFSVQVAKNKDSS
Ga0099794_1030980423300007265Vadose Zone SoilMCLATLFLFCLTIAAWSTPHPILAAGHAQPTPDNQSLSGQITSVGDAEFSVQVAKNKDISTV
Ga0099794_1039920213300007265Vadose Zone SoilMCLATLFLCCLTIAAWSTPLPILAAGHSQPTPDNQSLSGQITSVGDAEFSVQVAKNKDISTVQFM
Ga0099828_1019971313300009089Vadose Zone SoilMRFRTSLCLATLFLVCFTIAAWSTPLPAQTEADSLQPAPDNQSLSGPIAS
Ga0066709_10452642013300009137Grasslands SoilMRFRAPLCLATLFLVCFTIAAWSTPPAAHPAAGPLEPTPDNQSLSGQITSVGDAEFSVQVAKNKDASTVQFL
Ga0134082_1019917323300010303Grasslands SoilMRFRTSLCLATLFLVCFTIAAWSTPLPAETAADSLQPAPDNQSLSGPIASVGDAEFS
Ga0134109_1022440623300010320Grasslands SoilMRFRGSLCLATLFLVCFTIAAWSTPPAAHSAAGPLKPTPDNQSLSGQITSIG
Ga0134064_1009368233300010325Grasslands SoilMRFRTSLCLATLFLVCFTMAAWSTPLPDQQSLWGQIASVGDREFS
Ga0134066_1004944713300010364Grasslands SoilMRFRTSLCLATLFLVCFTMAAWCTPLPAHSAAVNLQPTPDKQSLSRQMASVGDREFSVQVRKDQGVNTVQ
Ga0137389_1005317013300012096Vadose Zone SoilMRFRTSLCLATLFLVCFTIAAWSTPLPAQTVADSLQPTPDNQSLSGP
Ga0137388_1049406823300012189Vadose Zone SoilMCLATLFLFCLTIAAWSTPLPILAAGHSQPTPDNQSLSGQITSVGDAEFSVQVAKNKDISTVQFMVDDKTKVEGKLAVG
Ga0137383_1092394413300012199Vadose Zone SoilMRFRTSLCLATLFLVCFTIAAWSTPLPAQTAADSLQPAPDNQSLSGPIASVGDAEFSVQVTKDKDVNTVQFL
Ga0137382_1026895123300012200Vadose Zone SoilMRFRASLCLATLFFVCFTIAAWSTPPAAQPAAGRLDPTPDNQSLSGQITSIGDAEFSVQVAKNKDASTVQFLV
Ga0137363_1101792923300012202Vadose Zone SoilMCLVTLFVVCFTIAAWSTPLRALPPAGYARPTPDSQSLSGTIASVGDAEFSVQAAKDKDA
Ga0137363_1104438413300012202Vadose Zone SoilMCLATLFLFCLTIAAWSTPHPILAAGHAQPIPDNQSLSGQITSVGDAEFSVQVAKNKDISTVRFMVD
Ga0137363_1125439913300012202Vadose Zone SoilMCLATLFLFCLTIAAWSTPHPILAAGHSQPTPDNQSLSGQITSVGDAEFSVQVAKNKDISTVQ
Ga0137363_1174271813300012202Vadose Zone SoilMCLATLFLFCLTIAAWSTPLPILAAGHSQPTPDNQSLSGQITSVGDAEFSVQVAKNKDISTVQFMVDDK
Ga0137399_1034031413300012203Vadose Zone SoilMRFRTSLVLATLFVVCFTIAAWSAPPPALPGADHLSPTPDNQSLSGQITSVGDAEFSVQVAKNKDISTVQFM
Ga0137399_1101240313300012203Vadose Zone SoilMRFRTSLCLATLFLFCFTIAAWSAPHLILAAGHTEPTPDNQSLSGQITSVGDAEFSVQVAKNKDISTV
Ga0137362_1060146123300012205Vadose Zone SoilMCLATLFLFCLTIAAWSTPHPILAAGHSQPTPDNQSLSGQITSVGDAEFSVQVAKNKDISTV
Ga0137378_1113081313300012210Vadose Zone SoilMRFRASLCLATLFLVCFTIASWSTPPAAQPAAGPLEPTPDNHSLSGQIASVGDAEFSVQVAKNKDASTVQFLVDDKT
Ga0137377_1105561323300012211Vadose Zone SoilMRFRTSLCLATLFLVCFTMAAWSTPLPAETAADSLQPAPDNQSLSGPIASVG
Ga0137377_1189601223300012211Vadose Zone SoilMRFRTSLCLATLFLVCFTIAAWSTPLPAETAADSLQPAPDNQSLSG
Ga0137370_1013378713300012285Vadose Zone SoilMRFRASLCLATLFLVCFTIAAWSTPPAAQPAAGPLEPRPDNQSLSGQITSIGDAEFSVQVAKNKDA
Ga0137371_1113039523300012356Vadose Zone SoilMRFRTSLCLATLFLVCFTIAGRSTPLLAETAADSLQPAPDNQSLSGPIASVGDAEFSVQV
Ga0137360_1102717313300012361Vadose Zone SoilMFLATLFLVCFTIAAWSTPPTVQPAAGPLLPTPDTQSLSGKIASVGDAEFSVQMTKDQDVNTVQFLVDDKTKVEGKLAVGAQATV
Ga0137390_1052183813300012363Vadose Zone SoilMRFRTSLVLATLFVVCFTIAAWSAPPPALPGADHLSPTPDNQSLSGQITSVGDAEFS
Ga0137397_1048173913300012685Vadose Zone SoilMRFRTSLVLATLFVVCFTIAAWSAPHPILAAGHSEPTPDNQSLSGQITAVGDAEFSVQVAKNKDISTVQFMVDDQTKVEGKL
Ga0137396_1050043523300012918Vadose Zone SoilMRFRASLVLATLFVVCFTIAAWSAPYPILAAGHSEPTPDNQSLSGQITSVGDA
Ga0137394_1012830833300012922Vadose Zone SoilMRFRTSLVLATLFVVCFTIAAWSAPHPILAAGHSQPTPDNQSLSGQITSVGDAEFSVQVAKNKD
Ga0137394_1063514923300012922Vadose Zone SoilVFVVCFTIAAWSAPLPALPVVDHVSPGPDNQSLSGTVASVGDAEFSLQVAKNQDVNTVQFMVDDKTKVEGKLAVGAQATVEYR
Ga0137394_1076826323300012922Vadose Zone SoilMRFRTSLVLATLFVVCFTIAAWSAPHPILAAGHSQPTPDNQSLSGQITSV
Ga0137359_1020627913300012923Vadose Zone SoilMRFRTSMCLATLFLFCLMIAAWSTPLPTLAAGHSQPTPDNQSLSGQITSVG
Ga0137359_1129073013300012923Vadose Zone SoilMRFRTSLCLATLFLVCFTIAAWSAPLPAQTAADSLQPAPDNQSLSGPIASV
Ga0137416_1056163223300012927Vadose Zone SoilMRFRTSLVLATLFVVCFTIAAWSAPLPALPVADHGSPGPDNQSLSGTVASVGDAEFSLQVAKNQDVNTVQFMVD
Ga0137416_1168120213300012927Vadose Zone SoilMRFRTSLCLATLFLVCFTIAAWSTPLPAQPAADSLQPTPDNQSLSGQIASVGDAEFSVQVTKDKDVNTVQFL
Ga0137416_1217509113300012927Vadose Zone SoilMRFRTSLCLATLFLVCFTIAAWSTPLPAQTAADSLQPAPDNQSLSGPIASVGDAEFSVQVTKDKDVNTVQFLVD
Ga0137404_1192085713300012929Vadose Zone SoilMCLATLFLFCFTIAAWSAPHLILAAGHTEPTPDNQSLSGQITSVGDAEFSVQVTKNKDISTVQFMVD
Ga0134087_1018038523300012977Grasslands SoilMRFRTSLCLATLFLVCFTMAAWSTLLPAPSAAINEQPTPDQQSLWGQ
Ga0137409_1017088533300015245Vadose Zone SoilMRFRTSMCLATLFLFCFTIAAWSTPHPILAAGHAQPTPDNQSLSGQITSVGDAEFSVQV
Ga0137409_1036241023300015245Vadose Zone SoilMRFRTSLCLATLFLFCLTIAAWSKPYPTLAAGHSQPTPDNQSLSGQITSVGDAEFSVQV
Ga0137403_1043965623300015264Vadose Zone SoilMRFRTSMCLATLFLFCLTIAAWSTPLPTLAAGHSQPTPDNQSLSGQITSVGDAEFSVQVAKNKD
Ga0066655_1139822813300018431Grasslands SoilMRFRASLCLATLFLVCFTIAAWSTPPAAHPAAGPLEPTPDNQSLSGQITSVGDAEFSVQVAKNKDASTVQFLA
Ga0066667_1215735913300018433Grasslands SoilMRFRTSLCLATLFLVCFTIAAWSTPLPAQTAADSLQPAPDNQSLSGPIASVGDAEFSVQV
Ga0066669_1155721423300018482Grasslands SoilMRFRTSLCLATLFVVCFTIAAWCTPLPAQSAAVNLQPTPDKQSLSRQMASVGDREFSVQVRKDQGVNTVQFLVDGQTK
Ga0137408_143909573300019789Vadose Zone SoilMCLAPLFLFIFTIGAWSTPHPTLAAGHSQPTPDNQSLSGQITSVGDAEFSVQVAKNKDISTVQFMVMTRRKSKGKLAVGAQATGRVPFERKAKTSPFTSS
Ga0210407_1055225723300020579SoilMRFRTSLCLATLFLVCFTIAAWSTPLPAQTVADSLQPTPDNQSLSGPIASVGDAEFSVQVKKDKDVNTV
Ga0210399_1040978923300020581SoilMRFRTSLCLATLFLAFFTIAAWSTPLPARPAADRLQPAPDNQSLSGQILSVGDAEFSLQVKKNQDVNTVQFLVDDKTTVEGKLSVGARATV
Ga0210401_1007905313300020583SoilMRFRASMFLATLFVACFTIAAWSTPLPAQPAAGDPQPIPDNQSLSGTIAAIGDAEFSVQVAKDK
Ga0210404_1029053713300021088SoilMRFRTSLCLATLFLAFFTIAAWSTPLPARPAADRLQPAPDNQSLSGQILSVGDA
Ga0210400_1116219313300021170SoilMRFRTSLCLATLFLAFFTIAAWSTPLPARPAADRLQPAPDNQSLSGQILSVGDAEFSLQVKKNQDVNTVQFLVDDKTTVEGKLSVGARATVE
Ga0210410_1143069623300021479SoilMRFRTSLCLATLFLAFFTIAAWSTPLTARPAADRLQPAPDNQSLSGQILSVGDAEFSLQVKKNQDVNTV
Ga0210409_1130533523300021559SoilMRFRTSLCLATLFLVCFTIAAWSTPLPAQTVADSLQPTPDNQSLSGPIASVGDAEFSVQVKKDKDVNTVQ
Ga0242665_1028599113300022724SoilIAAWSTPLRALPAAGDARATPDSQSSSGTIASVGDAEFSVQVAKNKDESAVQFVVDDKTKVEGKLTVGARATVE
Ga0137417_110767123300024330Vadose Zone SoilMRFRTSLVLATLFVVCFTIAAWSAPPPALPGADHLSPTPDNQSLSGQI
Ga0209027_121161523300026300Grasslands SoilMRFRTSLCLATLFLVCFTIAAWSTPLPAQTAADSLQPAPDNQSLSGPIASVGDAEFSVQVTKDKDVNTVQFLVDGK
Ga0209238_116437313300026301Grasslands SoilMRFRGSLCLATLFLVCFTIAAWSTPPAAHSAAGPLKPTPDNPSLSGQITSIGDAEFSVQVAKNKDASTVQF
Ga0209471_109146313300026318SoilMRFRTSLCLATLFLVCFTIAAWSTPLPAETAADSLQPAPDNQSLSGPIASVGDAEFSVQVTKDKDVNTVQFLVDGKTKVEGKLAVGAQ
Ga0209804_121607113300026335SoilMRFRTSLCLATLFLVCFTMAAWSTLLPAPSAAINEQPTPDQQSLWGQIASVGDRE
Ga0257171_101709513300026377SoilMRFRTSMCLATLFLFCLTIAAWSTPHPILAAGHAQPTPDNQSLSGQITSVGDAEFSVQVAKNKDISTVKFMVDDKTKVEGKL
Ga0209806_118006613300026529SoilMRFRTSMCLATLFLFCFTIAAWSAPHLILAAGHTEPTPDNQSLSGQITSVGDAEFSVQVAKNKDISTVKFMVDDKTKVE
Ga0209056_1002815313300026538SoilMRFRTSLCLATLFLVCFTMAAWSTLLPAPSAAINEQPTPDQQSLWGQIASVGDREFSVQVKKDQGVNTVPFLVDGQTKVE
Ga0209056_1027857923300026538SoilMRFRTWLCRATLFLVCFTIAAWSAPLPAQPAADSRQPALDNQALPGQIASVGGAEFSVPVKRTKM
Ga0209648_1008154043300026551Grasslands SoilVRFRTPMCLTTLFLVCFTIAAWSTPPPAQPAAGHPQSTPDNQSLSGQITSVGDAEFSLQVAKNKEASTVHFQVDDKTKV
Ga0209220_118879413300027587Forest SoilMRFRASLLVILFLFCITTAAWSAPLPIQAVNHSQPTPDNQSLSGQITAVSAAQFSVQVAKNKDASIVQFV
Ga0209117_118171913300027645Forest SoilMRFRTSLCLATLFLVCFTIAAWSTPLPAQTAADSLQPAPDNQSLSGPIASVGDAEFSVRV
Ga0209388_104325413300027655Vadose Zone SoilMRFRTSMCLATLFLFCLTIAAWSTPHPILAAGHAQPTPDNQSLSGQITSVGDAEFSVQVAKNKDISTVR
Ga0209009_103104013300027667Forest SoilMRFRISLCLATLFLVCFTVAAWSTPLPARTVADSLQPTPDNQSLSGPIASVADAEFSIQVTKDKDVNTVQFLVDGILKSIHAHGESI
Ga0209588_122834313300027671Vadose Zone SoilMRFRTSLCLATLFLFCFTIAAWSTPYPILAAGHSLPTAENQSLSGQITSVGDAEFSVQV
Ga0209626_118361113300027684Forest SoilMRFRTSLCLATLFLVCFTIAAWSTPLPAQTVADSLQPTPDNQSLSGPIASVG
Ga0209701_1006420713300027862Vadose Zone SoilMRFRTSLCLATLFLFFFTITAWSTPHPILAAGYSEPTPDNQSLSGQITSVGDAEFSVQVA
Ga0137415_1083312913300028536Vadose Zone SoilMRFRTSLVLATLFVVCFTIAAWSAPPPALPGADHLSPTPDNQSLSGQITSVGDAEFSV
Ga0222749_1050048613300029636SoilMRFRASMFLATLFVACFTIAAWSTPLPAQPVAGDPQPIPDNQSLSGTIAAIGDAEFSVQVAKDKDVSTVQFLVDDKTKVEGKLTVGAQAT
Ga0307469_1059033523300031720Hardwood Forest SoilSLCLATLFLVCFTMAAWSTPLPAQSAAVNLQPTPDKQSLWGPIASAGDREFSVPIALGVFVTLE
Ga0307469_1064346723300031720Hardwood Forest SoilMRFWASLLVTLFLFCITTAAWSAPLPILAMSHSQPTPDNQSLSGQITAVSAAQFSVQVAKNKDASIVQF
Ga0307468_10146121223300031740Hardwood Forest SoilMRFRTSLFLATAFLFCFTIAAWSTPLPILAAGPSQPTPDNQSLSGQIISVGDAEFSVQVAKNKDVSTVQFMVD
Ga0307477_1100062123300031753Hardwood Forest SoilMRFLASMLLVTLFVACFTIAAWSTSLTAQLATGALQPTADNQSLSGQITSIGDAEFSVQVAKNKDVN
Ga0307475_1048953623300031754Hardwood Forest SoilMRFRTALCLATLFLVCFTVAAWSTPRSAQLLDYPRPTPDNQSLSGKIASVGDATFSVEVTKNQEVSTIEFLIDGDT
Ga0307471_10005845113300032180Hardwood Forest SoilMRFRTSLRLATLFLVCFTMAAWSTPLPAQSAAVNLQPTPDKQSLWGPIASAGDREFSVPIALGVFVTLE
Ga0307471_10271238623300032180Hardwood Forest SoilMRFRASLCLATLFLVCFTIAAWSTPPAAHPAAGPLEPTPDNQSLSGQITSVGDAEFSVQ
Ga0307471_10272804813300032180Hardwood Forest SoilMRFRTSLCLATLFLFCLTIAAWSTPHPILAAGHSQPTPDNQSLSGQITSV


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.