NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Metagenome / Metatranscriptome Family F105578

Metagenome / Metatranscriptome Family F105578

Go to section:
Overview Alignments Structure & Topology Gene Neighborhood Phylogeny Ecosystems Sequences
Select file to download:
   Download


Overview

Basic Information
Family ID F105578
Family Type Metagenome / Metatranscriptome
Number of Sequences 100
Average Sequence Length 60 residues
Representative Sequence DHDTILYTLTVVDPKAYTKPIEGRHVTLKLRPHEEIEELPCVWSEENAFTKRIREPAAAKPSK
Number of Associated Samples 82
Number of Associated Scaffolds 100

Quality Assessment
Transcriptomic Evidence Yes
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 0.00 %
% of genes near scaffold ends (potentially truncated) 1.00 %
% of genes from short scaffolds (< 2000 bps) 1.00 %
Associated GOLD sequencing projects 76
AlphaFold2 3D model prediction Yes
3D model pTM-score0.27

Note: High quality evidence is represented by blue. Low quality evidence is represented by red.
Hidden Markov Model
Powered by Skylign

Most Common Taxonomy
Group Unclassified (99.000 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil
(35.000 % of family members)
Environment Ontology (ENVO) Unclassified
(36.000 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline)
(42.000 % of family members)



 ⦗Top⦘

Multiple Sequence Alignments

Select alignment to view:      


 ⦗Top⦘

Structure & Topology

Predicted Secondary Structure and Topology

Predicted Topology & Secondary Structure
Classification: Globular Signal Peptide: No Secondary Structure distribution: α-helix: 16.48%    β-sheet: 18.68%    Coil/Unstructured: 64.84%
Feature Viewer
Powered by Feature Viewer

Predicted 3D Structure

Structure Viewer
Per-residue confidence (pLDDT):
  0-50   51-70   71-90   91-100  
pTM-score: 0.27
Powered by PDBe Molstar

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.


 ⦗Top⦘

Gene Neighborhood

Neighboring Pfam domains

Pfam IDName % Frequency in 100 Family Scaffolds
PF07676PD40 6.00
PF00248Aldo_ket_red 4.00
PF01145Band_7 2.00
PF00903Glyoxalase 2.00
PF00578AhpC-TSA 2.00
PF00080Sod_Cu 2.00
PF13620CarboxypepD_reg 2.00
PF13442Cytochrome_CBB3 2.00
PF01425Amidase 1.00
PF07690MFS_1 1.00
PF12838Fer4_7 1.00
PF13709DUF4159 1.00
PF12681Glyoxalase_2 1.00
PF03352Adenine_glyco 1.00
PF01717Meth_synt_2 1.00
PF00266Aminotran_5 1.00
PF01494FAD_binding_3 1.00
PF11737DUF3300 1.00
PF13545HTH_Crp_2 1.00
PF02922CBM_48 1.00
PF11008DUF2846 1.00
PF01925TauE 1.00

Neighboring Clusters of Orthologous Genes (COGs)

COG IDNameFunctional Category % Frequency in 100 Family Scaffolds
COG06542-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductasesEnergy production and conversion [C] 2.00
COG2032Cu/Zn superoxide dismutaseInorganic ion transport and metabolism [P] 2.00
COG0154Asp-tRNAAsn/Glu-tRNAGln amidotransferase A subunit or related amidaseTranslation, ribosomal structure and biogenesis [J] 1.00
COG0578Glycerol-3-phosphate dehydrogenaseEnergy production and conversion [C] 1.00
COG0620Methionine synthase II (cobalamin-independent)Amino acid transport and metabolism [E] 1.00
COG0644Dehydrogenase (flavoprotein)Energy production and conversion [C] 1.00
COG0665Glycine/D-amino acid oxidase (deaminating)Amino acid transport and metabolism [E] 1.00
COG0730Sulfite exporter TauE/SafE/YfcA and related permeases, UPF0721 familyInorganic ion transport and metabolism [P] 1.00
COG28183-methyladenine DNA glycosylase TagReplication, recombination and repair [L] 1.00


 ⦗Top⦘

Phylogeny

NCBI Taxonomy

Select NCBI taxonomy Level:
NameRankTaxonomyDistribution
UnclassifiedrootN/A99.00 %
All OrganismsrootAll Organisms1.00 %

Visualization
Powered by ApexCharts

Associated Scaffolds


ScaffoldTaxonomyLengthIMG/M Link
3300028047|Ga0209526_10279977All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1134Open in IMG/M



 ⦗Top⦘

Environmental Properties

Associated Habitat Types

Select Environment Taxonomy Level:
HabitatTaxonomyDistribution
Vadose Zone SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil35.00%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil21.00%
Freshwater SedimentEnvironmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment6.00%
Forest SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil6.00%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Soil3.00%
SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Soil3.00%
Grasslands SoilEnvironmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil3.00%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Unclassified → Soil3.00%
Surface SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Surface Soil2.00%
Peatlands SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Peatlands Soil2.00%
Hardwood Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil2.00%
SoilEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Soil2.00%
BogEnvironmental → Aquatic → Freshwater → Wetlands → Bog → Bog1.00%
Iron-Sulfur Acid SpringEnvironmental → Aquatic → Thermal Springs → Hot (42-90C) → Acidic → Iron-Sulfur Acid Spring1.00%
WatershedsEnvironmental → Aquatic → Sediment → Unclassified → Unclassified → Watersheds1.00%
Terrestrial SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Terrestrial Soil1.00%
Tropical Forest SoilEnvironmental → Terrestrial → Soil → Unclassified → Unclassified → Tropical Forest Soil1.00%
SoilEnvironmental → Terrestrial → Soil → Wetlands → Permafrost → Soil1.00%
PermafrostEnvironmental → Terrestrial → Soil → Wetlands → Permafrost → Permafrost1.00%
Corn, Switchgrass And Miscanthus RhizosphereEnvironmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere1.00%
PalsaEnvironmental → Terrestrial → Peat → Unclassified → Unclassified → Palsa1.00%
BiofilmEnvironmental → Terrestrial → Cave → Unclassified → Unclassified → Biofilm1.00%
RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Rhizosphere1.00%
Corn RhizosphereHost-Associated → Plants → Rhizosphere → Unclassified → Unclassified → Corn Rhizosphere1.00%

Visualization
Powered by ApexCharts



Associated Samples

Taxon OIDSample NameHabitat TypeIMG/M Link
3300002245Jack Pine, Ontario site 1_JW_OM2H0_M3 (Jack Pine, Ontario combined, ASSEMBLY_DATE=20131027)EnvironmentalOpen in IMG/M
3300002907Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_40cmEnvironmentalOpen in IMG/M
3300005167Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121EnvironmentalOpen in IMG/M
3300005534Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen07_05102014_R1EnvironmentalOpen in IMG/M
3300005921Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 6EnvironmentalOpen in IMG/M
3300006176Warmed and freeze-thawed soil microbial communities from the Hubbard Brook experimental Forest, New Hampshire - Hubbard Brook CCASE Soil Metagenome WFT 5EnvironmentalOpen in IMG/M
3300006893Iron sulfur acid spring bacterial and archeal communities from Banff, Canada, to study Microbial Dark Matter (Phase II) - Paint Pots PPA 5.5 metaGEnvironmentalOpen in IMG/M
3300007255Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1EnvironmentalOpen in IMG/M
3300007258Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3EnvironmentalOpen in IMG/M
3300007265Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1EnvironmentalOpen in IMG/M
3300009088Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaGEnvironmentalOpen in IMG/M
3300010359Tropical forest soil microbial communities from Panama - MetaG Plot_15EnvironmentalOpen in IMG/M
3300010401Terrestrial soil microbial communities without Nitrogen fertilizer from Kellogg Biological Station, Michigan, USA - KB3-0-1EnvironmentalOpen in IMG/M
3300011269Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaGEnvironmentalOpen in IMG/M
3300012203Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaGEnvironmentalOpen in IMG/M
3300012205Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaGEnvironmentalOpen in IMG/M
3300012208Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaGEnvironmentalOpen in IMG/M
3300012211Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaGEnvironmentalOpen in IMG/M
3300012351Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_100_16 metaGEnvironmentalOpen in IMG/M
3300012361Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaGEnvironmentalOpen in IMG/M
3300012363Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaGEnvironmentalOpen in IMG/M
3300012917Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk2.16 metaGEnvironmentalOpen in IMG/M
3300012922Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czobulk1.16 metaGEnvironmentalOpen in IMG/M
3300012923Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_40_16 metaGEnvironmentalOpen in IMG/M
3300012924Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012925Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012927Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012929Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300012955Unamended control soil microbial communities from upstate New York, USA - Whitman soil sample_216_MGEnvironmentalOpen in IMG/M
3300012986Soil microbial communities amended with pyrogenic organic matter from upstate New York, USA - Whitman soil sample_217_MGEnvironmentalOpen in IMG/M
3300013105Corn rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - C2-5 metaGHost-AssociatedOpen in IMG/M
3300014158Peatland microbial communities from Houghton, MN, USA - PEATcosm2014_Bin02_60_metaGEnvironmentalOpen in IMG/M
3300014501Permafrost microbial communities from Stordalen Mire, Sweden - P3-2 metaG (Illumina Assembly)EnvironmentalOpen in IMG/M
3300015241Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015245Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300015264Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad2_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300017823Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_3EnvironmentalOpen in IMG/M
3300017943Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_4EnvironmentalOpen in IMG/M
3300017955Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - SO4_2EnvironmentalOpen in IMG/M
3300018012Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - ASW_5EnvironmentalOpen in IMG/M
3300018468Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111EnvironmentalOpen in IMG/M
3300020170Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZOMad1_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300020199Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungal (Illumina Assembly)EnvironmentalOpen in IMG/M
3300020580Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-19-MEnvironmentalOpen in IMG/M
3300021171Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-11-MEnvironmentalOpen in IMG/M
3300021178Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-4-MEnvironmentalOpen in IMG/M
3300021180Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-OEnvironmentalOpen in IMG/M
3300021401Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-27-OEnvironmentalOpen in IMG/M
3300021420Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-12-MEnvironmentalOpen in IMG/M
3300021432Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-2-MEnvironmentalOpen in IMG/M
3300021476Biofilm microbial communities from the roof of an iron ore cave, State of Minas Gerais, Brazil - TC_06 Biofilm (v2)EnvironmentalOpen in IMG/M
3300021478Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-7-MEnvironmentalOpen in IMG/M
3300021479Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-C-4-MEnvironmentalOpen in IMG/M
3300021559Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-17-MEnvironmentalOpen in IMG/M
3300022722Metatranscriptome of forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Native-BW-C-12-M (Metagenome Metatranscriptome) (v2)EnvironmentalOpen in IMG/M
3300024227Spruce rhizosphere microbial communities from Bohemian Forest, Czech Republic - CZU4Host-AssociatedOpen in IMG/M
3300024330Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (PacBio error correction)EnvironmentalOpen in IMG/M
3300025906Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - LAR L5-1 metaG (SPAdes)EnvironmentalOpen in IMG/M
3300026291Permafrost soil microbial communities from the Arctic, to analyse light accelerated degradation of dissolved organic matter (DOM) - Organic soil replicate 3 DNA2013-049 (SPAdes)EnvironmentalOpen in IMG/M
3300026320Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_40cm (SPAdes)EnvironmentalOpen in IMG/M
3300026333Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_140 (SPAdes)EnvironmentalOpen in IMG/M
3300026552Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 (SPAdes)EnvironmentalOpen in IMG/M
3300026557Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug2_1_08_16fungalEnvironmentalOpen in IMG/M
3300027570Peat soil microbial communities from Weissenstadt, Germany - Sb_50d_9_AC metaG (SPAdes)EnvironmentalOpen in IMG/M
3300027660Forest soil microbial communities from Thunder Bay, Ontario, Canada - Black Spruce, Ontario site 2_A8_Ref_M3 (SPAdes)EnvironmentalOpen in IMG/M
3300027842Surface soil microbial communities from Centralia Pennsylvania, which are recovering from an underground coalmine fire - Coalmine Soil_Cen04_05102014_R1 (SPAdes)EnvironmentalOpen in IMG/M
3300027894Freshwater sediment microbial communities in response to fracking from Pennsylvania, USA - Cold Stream Run_MetaG_CSR_2012 (SPAdes)EnvironmentalOpen in IMG/M
3300027905Peat soil microbial communities from Weissenstadt, Germany - SII-SIP-2007 (SPAdes)EnvironmentalOpen in IMG/M
3300027908Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_Ref_O2 (SPAdes)EnvironmentalOpen in IMG/M
3300028047Forest soil microbial communities from Algoma, Ontario, Canada - Jack Pine, Ontario site 1_JW_OM2H0_M1 (SPAdes)EnvironmentalOpen in IMG/M
3300028536Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly)EnvironmentalOpen in IMG/M
3300030862Metatranscriptome of soil microbial communities from Maridalen valley, Oslo, Norway - NSE5 (Metagenome Metatranscriptome)EnvironmentalOpen in IMG/M
3300031234Peat permafrost microbial communities from Stordalen Mire near Abisko, Sweden - Palsa_T0_2EnvironmentalOpen in IMG/M
3300031679Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.065b5f23EnvironmentalOpen in IMG/M
3300031715Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM2C_05EnvironmentalOpen in IMG/M
3300031833Lab enrichment of tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.bulkMG.LF178EnvironmentalOpen in IMG/M
3300031962Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesECM4C_515EnvironmentalOpen in IMG/M
3300032052Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.084b2f19EnvironmentalOpen in IMG/M
3300032063Tropical soil microbial communities from Luquillo Experimental Forest, Puerto Rico - GRE.SIPMG.084b2f17EnvironmentalOpen in IMG/M
3300032783Soil microbial communities from Loxahatchee National Wildlife Refuge, Florida, United States - Lox_Sample_3.3EnvironmentalOpen in IMG/M
3300033480Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_M1_C1_D5_BEnvironmentalOpen in IMG/M
3300033486Wetland soil microbial communities from Old Woman Creek delta, Ohio, United States - OWC_Aug_N3_C1_D5_AEnvironmentalOpen in IMG/M

Geographical Distribution
Zoom:     Powered by OpenStreetMap



 ⦗Top⦘

Family Sequences

Protein ID Sample Taxon ID Habitat Sequence
JGIcombinedJ26739_10089986013300002245Forest SoilDTIVYDITITDPMAYTKPIVAPQKTLKLKPHGEIDELPCVWSEENEFTQRIRNPATPKPAK*
JGIcombinedJ26739_10091245713300002245Forest SoilHDTILYTLTVVDPKAYTKPIEGRHVTLKLRPHQEIEELPCVWSEENAFTRRIREPAAANPTK*
JGI25613J43889_1008153523300002907Grasslands SoilKPIEGRHVTFKLKPRVEIEELPCVWSEENAFTKRIREPAAAKPAK*
Ga0066672_1077318313300005167SoilMRVTERYQRVDHDTIRYNIRVEDPKAYTKPIIGPERAMKLRPGVQINELPCVWSEENSFTKRIREPAVRKPN*
Ga0070735_1070301513300005534Surface SoilRTDRDTVVYDMTIFDPIAYTKPIVGPQRILKLKPGEEIGEYPCVYTEEHEFTQRIREPATPKQTK*
Ga0070766_1125689313300005921SoilVTDPMAYTKPIVAPQRTMKLRPHEEIEEQPCVWSQENEFAKRIREPATRNPTK*
Ga0070765_10040493513300006176SoilHSEDMRVTERYQRVDRNTILYNIRVDDPKAYTKPIIGPGRMMKLRSGVEIGELPCVWSEENSFTKRIREPAVGKPNR*
Ga0073928_1070714313300006893Iron-Sulfur Acid SpringRNTIVYNIRVDDPKAYTKPVVGPERAMKLRPGVQIGELPCVWSEENSFTKRIREPAVGKPNR*
Ga0099791_1002685353300007255Vadose Zone SoilMAYTKPIEGRHVTLKLRPHEEIEELPCVWSEENAFTKRIREPAAAKTTK*
Ga0099793_1047122913300007258Vadose Zone SoilDHDTILYTLTVVDPKAYTKPIEGRHVTLKLRPHEEIEELPCVWSEENAFTKRIREPAAAKPSK*
Ga0099794_1070069413300007265Vadose Zone SoilKPIEGRHVTLKLRPHEEIEELPCVWSEENAFTKRIREPAAAKATK*
Ga0099830_1155725313300009088Vadose Zone SoilVTERYHRVDHDTILYDMTVTDPMAYTKPIVAPQKTMKLKPHEQIEEQPCVWSQENEFAKRIREPATSNPAK*
Ga0126376_1071302223300010359Tropical Forest SoilKPIVGPQRTMKLRPKEEIEELVCVWSDETAFAKRIREPAAAKRK*
Ga0134121_1281086423300010401Terrestrial SoilVDSNTIRYNITVEDTKAYSKPIVAPERIMKLRPGAQIEELPCVWSEENAFTRRIRKPAVGPPNR*
Ga0137392_1087830333300011269Vadose Zone SoilTRPIIGPERIMKLRPGVEIGELPCVWSEENSFTKGIREPAVGKPNR*
Ga0137399_1091703723300012203Vadose Zone SoilHDTILYTLTVVDPMAYTKPIEGRHVTLKLRPHEEIEELPCVWSEENAFTKRIREPAAAKTTK*
Ga0137399_1130887423300012203Vadose Zone SoilVDPMAYTKPIEGRHVTLKLRPHEEIEELPCVWSEENAFTKRIREPAAAKTTK*
Ga0137362_1007468923300012205Vadose Zone SoilPMAYTKPIEGRHVTLKLRPHEEIEELPCVWSEENAFTKRIREPAAAKTTK*
Ga0137376_1142571813300012208Vadose Zone SoilITERYQRVDHDTILYTLTVVDPKAYTKPIEGRHVTLKLRPHEEIEELPCVWSEENAFTKRIREPAAAKATK*
Ga0137377_1002483113300012211Vadose Zone SoilMAYTKPIEGRHVTLKLRPHEEIEELPCVWSEENAFTKRIREPAAAKATK*
Ga0137386_1079592413300012351Vadose Zone SoilYQRVDHDTILYTLTVVDPKAYTKPIEGRHVTLKLRPHEEIEELPCVWSEENAFTKRIREPAAAKTTR*
Ga0137360_1004826113300012361Vadose Zone SoilKPIEGRHVTLKLRPHEEIEELPCVWSEENAFTKRIREPAAKTTK*
Ga0137390_1050823413300012363Vadose Zone SoilDTILYDMTVTDPMAYTKPIVAPQKTMKLKPHEQIEEQPCVWSQENEFAKRIREPATSNPAK*
Ga0137395_1002634523300012917Vadose Zone SoilPHTEDMRITERYQRVDHDTILYSLTVVDPMAYTKPIEGRHVTLKLRPHEEIEELPCVWSEENAFTKRIREPAAAKTAK*
Ga0137394_1006439223300012922Vadose Zone SoilILYTLTVVDPMAYTKPIEGRHVTLKLRPHEEIEELPCVWSEENAFTKRIREPAAAKTTK*
Ga0137359_1090440213300012923Vadose Zone SoilVVDPKAYTKPIDGRHVTLKLRPHEEIEELPCVWSEENAFTKRIREPAAAKTTK*
Ga0137413_1017567513300012924Vadose Zone SoilAKPIVGPQRIMKLKPGGEIDELPCVWSEENAFTHRIREPAASNPAK*
Ga0137419_1097113123300012925Vadose Zone SoilRYQRVDHDTILYTLTVVDPKAYTKPIEGRHVTLKLRPHEEIEELPCVWSEENAFTKRIREPAAAKTTK*
Ga0137419_1150082313300012925Vadose Zone SoilILYYVTVVDPKAYAKPIEGRHVTLKLRPKLEIEELPCVWSEENAFTKRIREPAAAKTTK*
Ga0137419_1154216613300012925Vadose Zone SoilVIDPKAYTKPIEGRHVTLKLRPHEEIEELPCVWSEENAFTKRIREPAAAKTTK*
Ga0137416_1007588613300012927Vadose Zone SoilSPAGFPHTEDMRITERYQRVDHDTILYTLTVLDPMAYTKPIEGRHVTLKLRPHEEIEELPCVWSEENAFTKRIREPAAAKTTK*
Ga0137416_1075270123300012927Vadose Zone SoilERYQRIDHDTILYNISVVDLKAYTKPIVGLQKTFKLRPHAEIEEFPCVWSEENSFTKRIREPAAAKPAK*
Ga0137416_1099343013300012927Vadose Zone SoilPMAYTKPIEGRHVTLKLRPHEEIEELPCVWSEENAFTKRIREPAAAKTSK*
Ga0137416_1224057813300012927Vadose Zone SoilILYDMTVTDPKAYSKPIVAPQRTMKLKPHEEIEEQPCVWSQENEFAKRIREPATRNPTR*
Ga0137404_1054541213300012929Vadose Zone SoilHDTFLYNITVADPKAYSKPIEGRHVTFKLKTHVEIEELPCVWSEENAFTKRIREPAAAKPAK*
Ga0164298_1131032313300012955SoilHDTIIYDLTISDPKAYAKTIVTPHRTLKLKPTGEIAEYPCVWTEENSFTKRIREPATPKNNQ*
Ga0164304_1045056823300012986SoilYNRVDHDTIIYDLTISDPKAYAKTIVTPHRTLKLKPTGEIAEYPCVWTEENSFTKRIREPATPKNNQ*
Ga0157369_1143469723300013105Corn RhizosphereDTILYDITVSDPKAYTRPVVAPQRIMKLRPHQEIEEFVCVASDEQAFSKRINEPATKTPGK*
Ga0181521_1036187213300014158BogPKAYVKPIVAPERIMKLKPGAEMEELPCVWSEENSFTKRIRQPAVGKPNR*
Ga0182024_1259444223300014501PermafrostMRVTERYQRVDHDTLLYNITVDDPKAYTKSIVAPQKTMKLKPKEEIEELVCVWSEENSFAKRIREPASAKSTQ*
Ga0137418_1011840723300015241Vadose Zone SoilKPIEGRHVTLKLRPHEEIEELPCVWSEENAFTKRIREPAAAKTTK*
Ga0137409_1086471913300015245Vadose Zone SoilIERYHRLDHDTIQYNMTLTDPTAYSQPIVVPQKIMKLKPHGEIEEQPCVWSQENAFAKRIREPAAAKPAK*
Ga0137403_1032186813300015264Vadose Zone SoilITVVDPKAYAKPIEGRHVTFKLKPRVEIEELPCVWSEENAFTKRIREPAAAKPAK*
Ga0187818_1039121413300017823Freshwater SedimentIAYDMTINDPKSYTKPIVTPHRFLKLKPGGEINEYPCVWSEENEFTQRIRNPATPKPAK
Ga0187819_1066799423300017943Freshwater SedimentAYDITITDPVAYTKPIVTPHRIMKLRPGVEIPESPCVWSQENEFTQRIRNPATPKPAK
Ga0187817_1045886223300017955Freshwater SedimentDPKAYTKPIVAPERIMKLKPGAEMEELPCVWSEENSFTKRIREPAVGKPNR
Ga0187817_1054532713300017955Freshwater SedimentTIDDPKAYVKPIVAPERIMKLKPGAEMEELPCVWSEENSFTKRIRQPAVGKPNH
Ga0187817_1074812813300017955Freshwater SedimentTIADPVAYTKPIVTPHRIMKLRPGVEIPESPCVWSQENEFTQRIRNPATPKPPK
Ga0187810_1043364613300018012Freshwater SedimentITIDDPKAYVKPIVAPERIMKLKPGAEMEELPCVWSEENSFTRRIRQPAVGKPNH
Ga0066662_1031466613300018468Grasslands SoilQRVDGNTIVYNIRVDDPKAYTRPIIGPERIMKLRPGVEIGELPCVWSEENSFTKRIREPAVGKPNR
Ga0179594_1035586313300020170Vadose Zone SoilYTLSVVDPMAYTKPIEGRHVTLKLRPHEEIEELPCVWSEENAFTKRIREPAAAKTTK
Ga0179592_1012671123300020199Vadose Zone SoilDLKAYTKPIEGRHVTLKLRPKLEIEELPCVWSEENAFTKRIREPAAKTAK
Ga0210403_1092172013300020580SoilVDHDTILYDLTVTDPMAYTKPIVAPQRTMKLRPHEEIEEQPCVWSQENEFAKRIREPATPKLSK
Ga0210405_1005102743300021171SoilNIRVDDPKAYTKPVVGPERIMKLRPGQEIGELPCVWSEENSFTRRIREPAVGKPNR
Ga0210408_1130790113300021178SoilVTERYHRVDHDTILYDITVTDPMAYTKPIVALQKIMKLRPHAEIEEQPCVWSQENEFTQRIRNPATPKAAK
Ga0210408_1141924813300021178SoilQRVDRNTILYNIRVDDPKAYTKPIIGPERMMKLRPGVEIGELPCVWSEENSFTKRIREPAVGKPNR
Ga0210396_1042439713300021180SoilRVDHDTIAYDMTITDPKAYAKPVVTPHRVMKLKPNEEISEQPCVWSQENEFAQRIRNPATPQPAK
Ga0210396_1087493123300021180SoilYTRVDRDTISYDLTITDPTAYTKPVITPHRTLKLKTGVEVPESICVWSEENEFTQRIRNPATPKPAK
Ga0210393_1072046223300021401SoilTILYNITVDDPKAYAKAIVGPQRTMKLRPNAEIEELVCAWSEENAFTKRIREPAANKPTK
Ga0210394_1087202413300021420SoilDTILYNISVTDPMSYARPIPGPQRIMKLKPGAEIDELPCVWSDENEFTKRIREPAASKSA
Ga0210394_1096855423300021420SoilSEEMRTTERYHRVDHDTILYDLTVTDPMAYTKPIVAPQRTMKLRPHEEIEEQPCVWSQENEFAKRIREPATRNPTK
Ga0210384_1156968213300021432SoilYKRVDSNTIRYNITVEDTKAYSKPIVAPERTMKLRPGAQIEELPCVWSEENAFTRRIRKPAVGPPNR
Ga0187846_1014048223300021476BiofilmTERYQRVDRDTMRYNIRVDDPKAYTKPIIAPERIMKLRPGVELSELPCVWSEENSFTKRIREPAVRSPN
Ga0210402_1138756113300021478SoilNIRVDDPKAYTKPIIGPERMMKLRPGVEIGELPCVWSEENSFTKRIREPAVGKPNR
Ga0210410_1069390313300021479SoilYTRVDHDTISYDLTITDPTAYTKPIVTPHRTLKMKTGVEIPESICVWSEENEFTQRIRNPATPKPAK
Ga0210409_1028492133300021559SoilLYDMTVTDPMAYTKPIVAPQRTMKLKPREEIEEQPCVWSLENEFAKRIREPAAHNPAK
Ga0210409_1059489223300021559SoilVDHDTILYTLTVVDPKAYTKPIEGRHVTLKLRPHQEIEELPCVWSEENSFTKRIREPAASKPAK
Ga0210409_1079235623300021559SoilMRVTERYQRVDRDTIRYNITVVDPKAYTKPIVGPDRTMKLRPGAHVEELPCVWSEENSFTNRIRKPAVGQPAR
Ga0210409_1085222033300021559SoilSEDMRVTERYQRIDRNTIVYNIRVDDPKAYTKPIIGPERIMKLRTGVEIGELPCVWSEENSFTKRIREPAVGKPNR
Ga0242657_104291513300022722SoilKPIIGPERMMKLRPGVEIGELPCVWSEENSFTKRIREPAVGKPNR
Ga0228598_101823313300024227RhizosphereTERYHRVDHDTILYDLTISDPTAYTKPIVTPRRTLKLKTGEEIAEYPCVWSEENSFTKRIREPATPKPAQ
Ga0137417_116037813300024330Vadose Zone SoilDMHITERYQRVDHDTILYTLTVIDPKAYTKPIEPIEGRHVTLKLRPHEEIEELPCVWSEENAFTKRIREPAAAKTTK
Ga0207699_1122781313300025906Corn, Switchgrass And Miscanthus RhizosphereRNTIVYNIRVDDPKAYTKPIVGPERMMKLRPGGEISELPCVWSDENSFTKRIREPAVGKPNR
Ga0209890_1002311533300026291SoilMTITDPKAYAKPVVTPHRTMKLKPNEEISEQPCVWSQENEFAQRIRNPATPKTTKQ
Ga0209131_136246513300026320Grasslands SoilQRVDRDTFLYNITVVDPKAYTKPIEGRHVTFKLKPRVEIEELPCVWSEENAFTKRIREPAAAKPAK
Ga0209158_127073813300026333SoilYQRVDGNTIVYNTRVEDPKAYTRPIIGSERIMKLRPGVEIGELPCVWSEENSFTKRIREPAVGKPNR
Ga0209577_1037433513300026552SoilYTKPVIGPERIMKLRPGVEIGELPCVWSEENSFTKRIREPAVGKPNR
Ga0179587_1083438123300026557Vadose Zone SoilLYNITIDDPKAYGKPIVGPPRTMKLRPNDELIESVCVMSEEKTFAKRIGEPAASKPVK
Ga0179587_1084286723300026557Vadose Zone SoilVDPMAYTKPIEGRHVTLKLRPHEEIEELPCVWSEENAFTKRIREPAAAKTTK
Ga0208043_115415923300027570Peatlands SoilDTIVYDMTVTDPMAYTKPIVAPQRIMKLKPHEEIEEQPCVWSLENEFAKRIREPAAHNPA
Ga0209736_118454113300027660Forest SoilYQRVDRNTILYNIRVDDAKAYTKPIIGPERMMKLRPGVEIGELPCVWSEENSFTKRIREPAVGKPNR
Ga0209580_1044705623300027842Surface SoilTDPMAYTKPIVAPQRIMKLKPHEEIEEQPCVWSLENEFAKRIREPAAHKPAK
Ga0209068_1033990513300027894WatershedsITVTDPKAYTKPIAGPQRTMKLKPGAEIEEQPCVWSQENAFAKRIREPAASKPAK
Ga0209415_1094355213300027905Peatlands SoilDHDTIQYDITVTDPMAYTKPIVGPRRYLKLRPKAEIEELPCVWSEENSFAKRIREPAAAKPAK
Ga0209006_1050972913300027908Forest SoilTERYHRVDHDTIAYDLTISDPMAYTKPIVTPRRILKLKTGEEIGEYPCVWSEENSFTKRIREPATPKPAK
Ga0209006_1055744033300027908Forest SoilAYTKPIVAPQKTLKLKPHGEIDELPCVWSEENEFTQRIRNPATPKPAK
Ga0209526_1027997733300028047Forest SoilTILYYVTVVDPKAYTKPIEGRHVTLKLRPKLEIEELPCVWSEENAFTKRIREPAAAKPAK
Ga0137415_1008902923300028536Vadose Zone SoilKPIEGRHVTLKLRPHEEIEELPCVWSEENAFTKRIREPAAAKTTK
Ga0137415_1036525613300028536Vadose Zone SoilMRITERYQRIDHDTILYNISVVDLKAYTKPIVGLQKTFKLRPHAEIEEFPCVWSEENSFTKRIREPAAAKPAK
Ga0265753_105404913300030862SoilMTVTDPMAYTKPIVAPQRIMKLKPHEEIEEQPCVWSLENEFAKRIREPAAHNPAK
Ga0302325_1254413923300031234PalsaERYHRVDHDTIVYDLTITDPMAYTKPVVATQKILKLKPHQEIEELTCVWSEENEFTQRIRIPATPKPAK
Ga0318561_1081656723300031679SoilYSRPIVGPQRTFKLRPKAEIEELVCVWSEENAFAKRIREPAAAPPPK
Ga0307476_1105706123300031715Hardwood Forest SoilDMTVTDPKAYTKPIVAQQRIMKLKPHEEIEEQPCVWSLENEFAKRIREPAARNPAK
Ga0310917_1098695113300031833SoilQRVAHDTLLYNSTIDDPKAYSRPIVGPQRTFKLRPKAEIEELVCVWSEENAFAKRIREPAAAPPPK
Ga0307479_1026755423300031962Hardwood Forest SoilMRVTERYQRVDRNTIAYNIRVEDPKAYTRPVVGPERMMKLRPGVEIGELPCVWSEENSFTKRIREPAVGKPNR
Ga0318506_1022431423300032052SoilLYNSTIDDPKAYSRPIVGPQRTFKLRPKAEIEELVCVWSEENAFAKRIREPAAAPPPK
Ga0318504_1054237713300032063SoilPKAYSKPIVGPQRTFKLRPKAEIEELVCVWSEENAFAKRIREPAAAPPPK
Ga0335079_1033625323300032783SoilDQNTLLYSITVTDPKAYTKPVTSPQRTFKLRPGAEIEELPCVWSEENEFTKRIREPAAQKPAK
Ga0316620_1018586433300033480SoilKHIVGPQRTYKRRPGAEIAELPCVWSEENSFAKRIREPAASKPAK
Ga0316624_1022221713300033486SoilRIVERYRRVDLDTIHYNITVTDPKAYTKHIVGPQRTYKRRPGAEIAELPCVWSEENSFAKRIREPAASKPAK


 ⦗Top⦘


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.