NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F099223



Overview

Basic Information
Family ID F099223
Family Type Metagenome
Number of Sequences 103
Average Sequence Length 73 residues
Representative Sequence MAEGHYLTDDSDPRICIMMDQHVFDAINAHAERCGEPFSNVARELLRCAVEDGKLDEYYPTKRRRRNQWYRNMYT
Number of Associated Samples 45
Number of Associated Scaffolds 103

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Unclassified
% of genes with valid RBS motifs 73.27 %
% of genes near scaffold ends (potentially truncated) 31.07 %
% of genes from short scaffolds (< 2000 bps) 74.76 %
Associated GOLD sequencing projects 35
AlphaFold2 3D model prediction No


Most Common Taxonomy
Group Unclassified (48.544 % of family members)
NCBI Taxonomy ID N/A
Taxonomy N/A

Most Common Ecosystem
GOLD Ecosystem Environmental → Aquatic → Marine → Coastal → Unclassified → Aqueous (66.990 % of family members)
Environment Ontology (ENVO) Unclassified (77.670 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Saline → Water (saline) (79.612 % of family members)




Structure & Topology

Predicted Topology & Secondary Structure
Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 53.33%, β-sheet: 6.67%, Coil/Unstructured: 40.00%
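The secondary-structure percentages can be turned back into residue counts. The sketch below assumes that the reported distribution refers to the representative sequence listed under Basic Information; the page does not state this explicitly, so treat the mapping as an assumption. The names `REPRESENTATIVE`, `DISTRIBUTION` and `residues_per_state` are illustrative only.

```python
# Representative sequence copied from the Basic Information table.
REPRESENTATIVE = (
    "MAEGHYLTDDSDPRICIMMDQHVFDAINAHAERCGEPFSNVARELLRCAVEDGKLDEYYPTKRRRRNQWYRNMYT"
)

# Secondary-structure shares (in percent) as reported on this page.
DISTRIBUTION = {"alpha_helix": 53.33, "beta_sheet": 6.67, "coil": 40.00}


def residues_per_state(sequence, distribution):
    """Convert percentage shares into whole-residue counts for one sequence.

    Assumption: the percentages were computed over this exact sequence.
    """
    length = len(sequence)
    return {state: round(length * pct / 100) for state, pct in distribution.items()}


counts = residues_per_state(REPRESENTATIVE, DISTRIBUTION)
print(len(REPRESENTATIVE), counts)
```

Under that assumption the three counts add up to the full length of the representative sequence, which is a quick sanity check that the percentages and the sequence belong together.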



Gene Neighborhood

Neighboring Pfam domains

Pfam ID	Name	% Frequency in 103 Family Scaffolds
PF04488	Gly_transf_sug	16.50
PF04545	Sigma70_r4	8.74
PF01467	CTP_transf_like	4.85
PF00271	Helicase_C	3.88
PF00551	Formyl_trans_N	2.91
PF09588	YqaJ	1.94
PF12802	MarR_2	1.94
PF03592	Terminase_2	1.94
PF04448	DUF551	1.94
PF13481	AAA_25	0.97
PF13662	Toprim_4	0.97
PF01381	HTH_3	0.97
PF00392	GntR	0.97
PF12728	HTH_17	0.97
PF09374	PG_binding_3	0.97
PF05551	zf-His_Me_endon	0.97
PF12957	DUF3846	0.97

Neighboring Clusters of Orthologous Genes (COGs)

COG ID	Name	Functional Category	% Frequency in 103 Family Scaffolds
COG3774	Mannosyltransferase OCH1 or related enzyme	Cell wall/membrane/envelope biogenesis [M]	16.50
COG3728	Phage terminase, small subunit	Mobilome: prophages, transposons [X]	1.94



Phylogeny

NCBI Taxonomy

Name	Rank	Taxonomy	Distribution
All Organisms	root	All Organisms	51.46 %
Unclassified	root	N/A	48.54 %

Associated Scaffolds


Scaffold	Taxonomy	Length (bp)
3300005527|Ga0068876_10465493	Not Available	698
3300005805|Ga0079957_1035719	All Organisms → Viruses → Predicted Viral	3225
3300006637|Ga0075461_10007506	All Organisms → Viruses → Predicted Viral	3609
3300006637|Ga0075461_10139672	All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage	746
3300006637|Ga0075461_10232213	Not Available	545
3300006802|Ga0070749_10003204	Not Available	10940
3300006802|Ga0070749_10187758	All Organisms → Viruses → Predicted Viral	1188
3300006802|Ga0070749_10335058	Not Available	843
3300006802|Ga0070749_10362451	Not Available	804
3300006802|Ga0070749_10490348	Not Available	670
3300006802|Ga0070749_10537219	Not Available	634
3300006802|Ga0070749_10691926	Not Available	545
3300007363|Ga0075458_10098138	Not Available	913
3300007538|Ga0099851_1056286	All Organisms → Viruses → Predicted Viral	1539
3300007538|Ga0099851_1116185	All Organisms → Viruses → Predicted Viral	1013
3300007538|Ga0099851_1255944	Not Available	625
3300007538|Ga0099851_1280595	All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage	591
3300007538|Ga0099851_1304308	Not Available	562
3300007538|Ga0099851_1334047	Not Available	531
3300007538|Ga0099851_1347487	Not Available	518
3300007538|Ga0099851_1357462	Not Available	509
3300007539|Ga0099849_1016545	All Organisms → cellular organisms → Bacteria → Proteobacteria	3234
3300007539|Ga0099849_1038574	All Organisms → Viruses → Predicted Viral	2022
3300007539|Ga0099849_1088965	All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage	1241
3300007540|Ga0099847_1042237	All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage	1447
3300007541|Ga0099848_1030139	All Organisms → Viruses → Predicted Viral	2264
3300007541|Ga0099848_1058106	All Organisms → Viruses → Predicted Viral	1543
3300007541|Ga0099848_1228219	All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage	658
3300007541|Ga0099848_1243144	Not Available	632
3300007541|Ga0099848_1259095	Not Available	606
3300007541|Ga0099848_1342490	All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage	505
3300007542|Ga0099846_1013102	Not Available	3270
3300007542|Ga0099846_1049598	All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage	1593
3300007542|Ga0099846_1071062	All Organisms → Viruses → Predicted Viral	1301
3300007542|Ga0099846_1206897	Not Available	690
3300007960|Ga0099850_1220632	All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage	739
3300007960|Ga0099850_1281548	Not Available	635
3300007960|Ga0099850_1316146	Not Available	590
3300008266|Ga0114363_1026032	Not Available	2516
3300010299|Ga0129342_1009010	Not Available	4249
3300010299|Ga0129342_1203776	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiaceae → Limoniibacter → Limoniibacter endophyticus	702
3300010299|Ga0129342_1326142	Not Available	525
3300010318|Ga0136656_1031614	All Organisms → cellular organisms → Bacteria → Proteobacteria	1919
3300010318|Ga0136656_1125291	All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage	889
3300010354|Ga0129333_10227968	All Organisms → Viruses → Predicted Viral	1687
3300010354|Ga0129333_10486998	All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage	1082
3300010368|Ga0129324_10252084	All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage	704
3300010370|Ga0129336_10062236	All Organisms → Viruses → Predicted Viral	2219
3300010370|Ga0129336_10385186	Not Available	767
(restricted) 3300013126|Ga0172367_10147000	All Organisms → cellular organisms → Bacteria → Proteobacteria	1563
(restricted) 3300013126|Ga0172367_10187212	All Organisms → Viruses → Predicted Viral	1319
(restricted) 3300013127|Ga0172365_10854035	Not Available	511
(restricted) 3300013131|Ga0172373_10240183	Not Available	1202
3300017697|Ga0180120_10216703	Not Available	788
3300017788|Ga0169931_10263875	All Organisms → Viruses → Predicted Viral	1392
3300017963|Ga0180437_10233299	All Organisms → Viruses → Predicted Viral	1429
3300017963|Ga0180437_11052242	Not Available	581
3300020183|Ga0194115_10014171	Not Available	6864
3300020220|Ga0194119_10748827	Not Available	584
3300021961|Ga0222714_10000944	All Organisms → cellular organisms → Bacteria	32856
3300021961|Ga0222714_10063293	All Organisms → Viruses → Predicted Viral	2488
3300022063|Ga0212029_1022374	Not Available	856
3300022176|Ga0212031_1009409	All Organisms → Viruses → Predicted Viral	1353
3300022198|Ga0196905_1014526	All Organisms → Viruses → Predicted Viral	2543
3300022198|Ga0196905_1016372	Not Available	2372
3300022198|Ga0196905_1026824	All Organisms → Viruses → Predicted Viral	1759
3300022198|Ga0196905_1052448	Not Available	1159
3300022198|Ga0196905_1088274	All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage	838
3300022198|Ga0196905_1152612	Not Available	594
3300022198|Ga0196905_1165973	All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage	563
3300022198|Ga0196905_1201173	Not Available	500
3300022200|Ga0196901_1033854	All Organisms → Viruses → Predicted Viral	1981
3300022200|Ga0196901_1160908	Not Available	742
3300022200|Ga0196901_1186139	All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage	673
3300022200|Ga0196901_1248479	Not Available	552
3300025630|Ga0208004_1134922	Not Available	549
3300025646|Ga0208161_1135247	All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage	634
3300025646|Ga0208161_1147580	Not Available	591
3300025646|Ga0208161_1171244	Not Available	522
3300025647|Ga0208160_1003542	All Organisms → cellular organisms → Bacteria	6068
3300025647|Ga0208160_1094714	Not Available	782
3300025655|Ga0208795_1094363	All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage	811
3300025655|Ga0208795_1111471	All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → environmental samples → uncultured Caudovirales phage	721
3300025655|Ga0208795_1182089	Not Available	504
3300025671|Ga0208898_1001069	Not Available	18875
3300025687|Ga0208019_1024068	All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Caulobacterales → Caulobacteraceae → Phenylobacterium → unclassified Phenylobacterium → Phenylobacterium sp.	2343
3300025687|Ga0208019_1083058	All Organisms → Viruses → Predicted Viral	1019
3300025687|Ga0208019_1211251	Not Available	501
3300025818|Ga0208542_1202464	Not Available	512
3300025889|Ga0208644_1000737	Not Available	29115
3300025889|Ga0208644_1024507	All Organisms → Viruses → Predicted Viral	3732
3300025889|Ga0208644_1028715	All Organisms → Viruses → Predicted Viral	3365
3300025889|Ga0208644_1059729	All Organisms → Viruses → Predicted Viral	2058
3300027793|Ga0209972_10026705	All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Myxococcales → Cystobacterineae → Myxococcaceae → Myxococcus → unclassified Myxococcus → Myxococcus sp. AM011	3414
3300027901|Ga0209427_10282093	All Organisms → Viruses → Predicted Viral	1337
3300027917|Ga0209536_100041341	All Organisms → cellular organisms → Bacteria	6135
3300031857|Ga0315909_10285230	All Organisms → Viruses → Predicted Viral	1244
3300034072|Ga0310127_096118	Not Available	1290
3300034073|Ga0310130_0028698	All Organisms → Viruses → Predicted Viral	1724
3300034073|Ga0310130_0045145	All Organisms → Viruses → Predicted Viral	1326
3300034073|Ga0310130_0102686	Not Available	860

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).
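Each scaffold identifier above has the form `<Taxon OID>|<scaffold name>`, where the part before the `|` matches a Taxon OID in the Associated Samples table. The following is a minimal sketch of how such rows could be split and summarized. It uses only five rows copied from the table, so the resulting fraction is illustrative and does not reproduce the family-wide figure; the helper names and the `short_cutoff` parameter are hypothetical.

```python
from collections import Counter

# A handful of (identifier, length in bp) rows copied from the table; not the full list.
SCAFFOLDS = [
    ("3300005527|Ga0068876_10465493", 698),
    ("3300005805|Ga0079957_1035719", 3225),
    ("3300006637|Ga0075461_10007506", 3609),
    ("3300006637|Ga0075461_10139672", 746),
    ("3300006637|Ga0075461_10232213", 545),
]


def split_scaffold_id(identifier):
    """Split '<taxon_oid>|<scaffold>' into its two parts."""
    taxon_oid, scaffold = identifier.split("|", 1)
    return taxon_oid, scaffold


def summarize(rows, short_cutoff=2000):
    """Count scaffolds per sample and the share shorter than short_cutoff bp."""
    per_sample = Counter(split_scaffold_id(identifier)[0] for identifier, _ in rows)
    short = sum(1 for _, length in rows if length < short_cutoff)
    return per_sample, short / len(rows)


per_sample, short_share = summarize(SCAFFOLDS)
print(per_sample.most_common(1), short_share)
```

The 2000 bp cutoff mirrors the "short scaffolds (< 2000 bps)" criterion in the Quality Assessment block.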




Environmental Properties

Associated Habitat Types

Habitat	Taxonomy	Distribution
Aqueous	Environmental → Aquatic → Marine → Coastal → Unclassified → Aqueous	66.99%
Freshwater To Marine Saline Gradient	Environmental → Aquatic → Marine → Coastal → Unclassified → Freshwater To Marine Saline Gradient	10.68%
Freshwater	Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater	3.88%
Fracking Water	Environmental → Terrestrial → Deep Subsurface → Unclassified → Unclassified → Fracking Water	3.88%
Freshwater Lake	Environmental → Aquatic → Freshwater → Lentic → Unclassified → Freshwater Lake	1.94%
Freshwater Lake	Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater Lake	1.94%
Marine Sediment	Environmental → Aquatic → Marine → Oceanic → Sediment → Marine Sediment	1.94%
Estuarine Water	Environmental → Aquatic → Marine → Unclassified → Unclassified → Estuarine Water	1.94%
Hypersaline Lake Sediment	Environmental → Aquatic → Non-Marine Saline And Alkaline → Hypersaline → Sediment → Hypersaline Lake Sediment	1.94%
Freshwater, Plankton	Environmental → Aquatic → Freshwater → Lake → Unclassified → Freshwater, Plankton	0.97%
Lake	Environmental → Aquatic → Freshwater → Lake → Unclassified → Lake	0.97%
Sediment	Environmental → Aquatic → Freshwater → Lake → Sediment → Sediment	0.97%
Freshwater	Environmental → Aquatic → Freshwater → Unclassified → Unclassified → Freshwater	0.97%
Marine Methane Seep Sediment	Environmental → Aquatic → Sediment → Unclassified → Unclassified → Marine Methane Seep Sediment	0.97%



Associated Samples

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Taxon OID	Sample Name	Habitat Type
3300005527	Freshwater lake microbial communities from Lake Erie, under a cyanobacterial bloom - NOAA_Erie_Diel5S_2200h metaG	Environmental
3300005805	Microbial and algae communities from Cheney Reservoir in Wichita, Kansas, USA	Environmental
3300006637	Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - DEBay_Fall_15_>0.8_DNA	Environmental
3300006802	Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - Viral MetaG DEL_Nov_18	Environmental
3300007363	Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - DEBay_Fall_0.3_<0.8_DNA	Environmental
3300007538	Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1508_2 Viral MetaG	Environmental
3300007539	Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1508_1M Viral MetaG	Environmental
3300007540	Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1504_2 Viral MetaG	Environmental
3300007541	Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1508_1S Viral MetaG	Environmental
3300007542	Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1504_1 Viral MetaG	Environmental
3300007960	Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1508_1D Viral MetaG	Environmental
3300008266	Freshwater microbial communities from Harmful Algal Blooms in Lake Erie, Western Basin, USA - Station WLE12, Sample HABS-E2014-0108-C-NA	Environmental
3300010299	Freshwater to marine salinity gradient microbial communities from Chesapeake Bay, USA - CPBay_Sum_15_0.2_DNA	Environmental
3300010318	Freshwater to marine salinity gradient microbial communities from Chesapeake Bay, USA - CPBay_Sum_15_0.8_DNA	Environmental
3300010354	Freshwater to marine salinity gradient microbial communities from Chesapeake Bay, USA - CPBay_Sum_0.6_0.8_DNA	Environmental
3300010368	Freshwater to marine salinity gradient microbial communities from Chesapeake Bay, USA - CPBay_Spr_15_0.2_DNA	Environmental
3300010370	Freshwater to marine salinity gradient microbial communities from Chesapeake Bay, USA - CPBay_Sum_0.6_0.2_DNA	Environmental
3300010389	Marine sediment microbial communities from methane seeps within Baltimore Canyon, US Atlantic Margin - Baltimore Canyon MUC-11 12-14 cmbsf	Environmental
3300013126 (restricted)	Freshwater microbial communities from Kabuno Bay, South-Kivu, Congo - kab_022012_10m	Environmental
3300013127 (restricted)	Sediment microbial communities from Lake Kivu, Rwanda - Sediment site 48cm	Environmental
3300013131 (restricted)	Freshwater microbial communities from Kabuno Bay, South-Kivu, Congo - kab_092012_10m	Environmental
3300017697	Freshwater to marine salinity gradient microbial communities from Chesapeake Bay, USA - CPBay_Spr_31_0.2_DNA (version 2)	Environmental
3300017788	Freshwater microbial communities from Lake Kivu, Western Province, Rwanda to study Microbial Dark Matter (Phase II) - Kivu_15m_20L	Environmental
3300017963	Hypersaline lake sediment archaeal communities from the Salton Sea, California, USA - SS_3_D_1 metaG	Environmental
3300020183	Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015002 Mahale S4 surface	Environmental
3300020220	Freshwater microbial communities from Lake Tanganyika, Tanzania - TA2015018 Mahale Deep Cast 100m	Environmental
3300021961	Estuarine water microbial communities from San Francisco Bay, California, United States - C33_3D	Environmental
3300022063	Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1504_1 Viral MetaG (v2)	Environmental
3300022176	Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1508_1S Viral MetaG (v2)	Environmental
3300022198	Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1508_1S Viral MetaG (v3)	Environmental
3300022200	Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1504_1 Viral MetaG (v3)	Environmental
3300025630	Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - DEBay_Fall_15_>0.8_DNA (SPAdes)	Environmental
3300025646	Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1508_1S Viral MetaG (SPAdes)	Environmental
3300025647	Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1504_1 Viral MetaG (SPAdes)	Environmental
3300025655	Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1508_2 Viral MetaG (SPAdes)	Environmental
3300025671	Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - Viral MetaG DEL_Mar_4 (SPAdes)	Environmental
3300025687	Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1508_1D Viral MetaG (SPAdes)	Environmental
3300025818	Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - DEBay_Fall_15_<0.8_DNA (SPAdes)	Environmental
3300025889	Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - Viral MetaG DEL_Nov_18 (SPAdes)	Environmental
3300027793	Freshwater lake microbial communities from Lake Erie, under a cyanobacterial bloom - NOAA_Erie_Diel1S_2200h metaG (SPAdes)	Environmental
3300027901	Marine sediment microbial communities from White Oak River estuary, North Carolina - WOR-1-36_30 (SPAdes)	Environmental
3300027917	Marine sediment microbial communities from White Oak River estuary, North Carolina - WOR-2-8_12 (SPAdes)	Environmental
3300031857	Freshwater fungal communities from buoy surface, Lake Erie, Ohio, United States - Buoy 2 MA125	Environmental
3300034072	Fracking water microbial communities from deep shales in Oklahoma, United States - MC-3-A	Environmental
3300034073	Fracking water microbial communities from deep shales in Oklahoma, United States - MC-6-XL	Environmental


Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID	Sample Taxon ID	Habitat	Sequence
Ga0068876_104654933	3300005527	Freshwater Lake	MAEGHYLTDDSDPRICIMMDQHVFDAINAHSAKCGEPFTTVARELLRCAVEDGKLDEYYPKKRRRKGWKLLSLHTT*
Ga0079957_10357199	3300005805	Lake	MAEGHYLTDDSDPRICIMMDQHVFDAINAHAERCGEPFSNVARELLRCAVEDGKLEEYYPTKRRRRNQWYRNMYT*
Ga0075461_1000750612	3300006637	Aqueous	MAQGHYLYIDSDPRICITMDQHVFDAINQHAAKCREPFQNVARELLRCAVEDGKLQEYYPNQRFYK*
Ga0075461_101396721	3300006637	Aqueous	MAEGHYLTDDSDPRICIMMDQHVFDAINAHANRCREPFSNVARELLRCAVEDGKLDEYYPRRRKRWKSA*
Ga0075461_102322132	3300006637	Aqueous	MAEGHYLTEDSDPRICIMMDQHVFDAINAHAERCGEPFSNVARELLRCAVEDRKLNEYYPTKRRRRN
Ga0070749_100032047	3300006802	Aqueous	MAEGHYLTEDSDPRICIMMDQHIFDAINAHANRCREPFSNVARELLRCAVEDGKLDEYYPKTRRRRNQWYRNMYT*
Ga0070749_100294863	3300006802	Aqueous	MAEGYYTSPDSDPRITVTMYQWVFDAINDHALRCGEPFTVVACELLRCAVEDGKLEEYFPVTKEG*
Ga0070749_101877582	3300006802	Aqueous	MAEGHYLTEDSDPRICIMMDQHVFDAINAHAERCGEPFSNVARELLRCAVEDRKLNEYYPTKRRRRNQWYRNMYI*
Ga0070749_103350583	3300006802	Aqueous	MAEGHYLTDDSDPRICIMMDQHIFDAINAHAERCGEPFSNIARELLRCAVEDGKLNEYYPTKRRRRNQWYRDMYT*
Ga0070749_103624512	3300006802	Aqueous	MAEGHYLTDDSDPRICIMMDQHVFDAINAHAEKCREPFGTVARELLRCAVEDGKLDEYYPRRRKKWKSA*
Ga0070749_104903482	3300006802	Aqueous	MAEGHYLTEDGDPRICIMMDQHVFDAVNEHAARVNRPFSDVARELLRCAVEDGKLDEYYPRSNR*
Ga0070749_105372192	3300006802	Aqueous	MAEGHYLTDDSDPRICIMMDQHVFDAINAHAGRCGEPFSNVARELLRCAVEDGKLNEYYPTKRRRRNQWYRDMYT*
Ga0070749_106919261	3300006802	Aqueous	SAKAAAGVCVMAEGHYLTDDSDPRICIMMDQHVFDAINAHANRCREPFSNVARELLRCAVEDGKLDEYYPRRRKRWKSA*
Ga0075458_100981384	3300007363	Aqueous	LTDDSDPRICIMMDQHVFDAINAHAERCGEPFSNVARELLRCAVEDGKLDEYYPKRRKRWKSA*
Ga0099851_10562862	3300007538	Aqueous	MAEGHYLTDDSDPRICIMMDQHVFDAINAHAAKCGEPFGTVARELLRCAVEDGKLDEYYPKKRRRRNQWYRSMYT*
Ga0099851_11161852	3300007538	Aqueous	MAEGHYLTDDSDPRICIMMDQHVFDAINAHAERCGEPFSNVARELLRCAVEDRKLNEYYPTKRRRRNQWYRNMYT*
Ga0099851_12559443	3300007538	Aqueous	MAEGHYLTEDSDPRICIMMDQHVFDAINDHAAVCGEPFSTVARELLRCAVEDGKLSEYYPKKRRRRGWKLLSLHTT*
Ga0099851_12805952	3300007538	Aqueous	MAEGHYLTEDSDPRICIMMDQHVFDAINAHAERCGEPFSNVARELLRCAVEDGKLDEYYPTKRKRRNQWYRNMYT*
Ga0099851_13043082	3300007538	Aqueous	MAEGHYLTDDSDPRICIMMDQHVFDAINAHAEKCGEPFGTVARELLRCAVEDGKLSEYYPRKRRRRGWKLLSLHTM*
Ga0099851_13340472	3300007538	Aqueous	MAEGHYLTDDSDPRICIMMDQHVFDAINAHANRCREPFSNVARELLRCAVEDGKLDEYYPTKRRRRNQWYRSMYT*
Ga0099851_13474873	3300007538	Aqueous	MAEGHYLTEDSDPRICIMMDQHVFDAINAHAAKCREPFGTVARELLRCAVEDGKLDEYYPRRRERWKSA*
Ga0099851_13574622	3300007538	Aqueous	MAEGHYLTDDSDPRICIMMDQHIFDAINAHAERCGEPFSNVARELLRCAVEDGKLDEYYPKKRSKMYRDLYT*
Ga0099849_10165456	3300007539	Aqueous	MAEGHYLTDDSDPRICIMMDQHVFDAINDHAAACGESFTTVARELLRCAVEDGKLDEYYPRRRKRWKTA*
Ga0099849_10385745	3300007539	Aqueous	MAEGHYLTDDSDPRICIMMDQHVFDAINDHAEKCREPFGTVARELLRCAVEDGKLDEYYPRRRKKWKSA*
Ga0099849_10889652	3300007539	Aqueous	MAEGHYLTEDSDPRICIMMDQHVFDAINAHAERCGEPFSNVARELLRCAVEDGKLDEYYPKTRRRRNQWYRNMYT*
Ga0099847_10422372	3300007540	Aqueous	MAEGHYLTDDSDPRICIMMDQHVFDAINAHAERCGEPFSNVARELLRCAVEDGKLDEYYPTKRRRRNQWYRSMYT*
Ga0099848_103013910	3300007541	Aqueous	MAEGHYLTDDSDPRICIMMDQHVFDAINAHAERCGEPFTTVARELLRCAVEDGKLDEYYPRRRKRWKTA*
Ga0099848_10581066	3300007541	Aqueous	MAEGHYLTEDSDPRICIMMDQHVFDAINDHAAVCGEPFSTVARELLRCAVEDGKLSEYYPKKRRRRGWKLLSL
Ga0099848_12282192	3300007541	Aqueous	MAEGHYLTDDSDPRICIMMDQHVFDAINAHAERCGEPFSNVARELLRCAVEDGKLDEYYPTKRRRRNQWYRNMYT*
Ga0099848_12431442	3300007541	Aqueous	MAEGHYLTDDSDPRICIMMDQHVFDAINNHAAACGESFTTVARELLRCAVEDGKLDEYYPKTRRRRNQWYRNMYT*
Ga0099848_12590951	3300007541	Aqueous	GDPQDRAGFEEPRSVEGCVMAEGHYLTDDSDPRICIMMDQHVFDAINAHANRCREPFSNVARELLRCAVEDGKLDEYYPRRRKRWKSA*
Ga0099848_13424901	3300007541	Aqueous	GDPQDRAGFEEPRSVEGCVMAEGHYLTDDSDPRICIMMDQHIFDAINAHANRCREPFSNVARELLRCAVEDGKLDEYYPKKRRRRNQ*
Ga0099846_10131025	3300007542	Aqueous	MAEGHYLTEDSDPRICIMMDQHVFDAINDHAAVCGEPFSTVARELLRCAVEDGKLNEYYPRKRRRRGWKLLSLHTT*
Ga0099846_10495981	3300007542	Aqueous	HYLTEDSDPRICIMMDQHVFDAINAHAERCGEPFSNVARELLRCAVEDGKLDEYYPKTRRRRNQWYRNMYT*
Ga0099846_10710621	3300007542	Aqueous	PRSVEGCVMAEGHYLTDDSDPRICIMMDQHVFDAINAHAERCGEPFSNVARELLRCAVEDRKLNEYYPTKRRRRNQWYRNMYI*
Ga0099846_12068971	3300007542	Aqueous	KPRSMEGCVMAEGHYLTDDSDPRICIMMDQHIFDAINAHAERCGEPFSNVARELLRCAVEDGKLDEYYPKKRSKMYRDLYT*
Ga0099850_12206322	3300007960	Aqueous	RICIMMDQHVFDAINAHAERCGEPFSNVARELLRCAVEDGKLDEYYPKTRRRRNQWYRNMYT*
Ga0099850_12815482	3300007960	Aqueous	MAEGHYLTDDSDPRICIMMDQHIFDAINAHAERCGEPFSNVARELLRCAVEDGKLNEYYPTKRRRRNQWYRDMYT*
Ga0099850_13161461	3300007960	Aqueous	SVEGCVMAEGHYLTDDSDPRICIMMDQHVFDAINAHAERCGEPFSNVARELLRCAVEDRKLNEYYPTKRRRRNQWYRNMYI*
Ga0114363_10260326	3300008266	Freshwater, Plankton	MAEGHYLTDDSDPRICIMMDQHVFDAINAHSAKCGEPFTTVARELLRCAVEDGKLDEYYPRKRRRRGWKSLSLHTT*
Ga0129342_10090101	3300010299	Freshwater To Marine Saline Gradient	EDAEKPRSVEGCVMAEGHYLTDDSDPRICIMMDQHIFDAINAHAERCGEPFSNVARELLRCAVEDGKLNEYYPTKRRRRNQWYRDMYT*
Ga0129342_12037761	3300010299	Freshwater To Marine Saline Gradient	MAEGHYLTEDSDPRICIMMDQHVFDAINAHAERCGEPFSNVARELLRCAVEDGKLDEYYPKKRRRRNQ*
Ga0129342_13261423	3300010299	Freshwater To Marine Saline Gradient	EKAGEDAEKPRSVEGCVMAEGHYLTDDSDPRICIMMDQHVFDAINAHANRCREPFSNVARELLRCAVEDGKLDEYYPRRRKRWKSA*
Ga0136656_10316145	3300010318	Freshwater To Marine Saline Gradient	RADDNQGGGDPDNSGSVEGCVMAEGHYLTEDSDPRICIMMDQHVFDAINAHANRCREPFSNVARELLRCAVEDGKLDEYYPRRRKRWKSA*
Ga0136656_11252912	3300010318	Freshwater To Marine Saline Gradient	TDDSDPRICIMMDQHVFDAINAHAERCGEPFSNVARELLRCAVEDGKLDEYYPTKRRRRNQWYRNMYT*
Ga0129333_102279682	3300010354	Freshwater To Marine Saline Gradient	MAQGHYLYVDSDPRICITMDQHVFDAINQHAAKCKEPFQNVARELLRCAVEDGKLQEYYPTQRFYK*
Ga0129333_104869981	3300010354	Freshwater To Marine Saline Gradient	EDSDPRICIMMDQHVFDAINAHAERCGEPFSNVARELLRCAVEDGKLDEYYPTKRRRRNQWYRSMYT*
Ga0129324_102520842	3300010368	Freshwater To Marine Saline Gradient	MAEGHYLTEDSDPRICIMMDQHVFDAINAHAERCGEPFSNVARELLRCAVEDRKLNEYYPTKRRRRTSGTETCTHDVADEGNS*
Ga0129336_100622369	3300010370	Freshwater To Marine Saline Gradient	MAQGHYLYVDSDPRICITMDQHVFDAINQHAAKCKEPFQNVARELLRCAVEDGKLQEYYPNQRFYK*
Ga0129336_103851862	3300010370	Freshwater To Marine Saline Gradient	MAEGHYLTEDSDPRICIMMDQHVFDAINAHANRCREPFSNVARELLRCAVEDGKLDEYYPTKRRRRNQWYRNMYT*
Ga0136549_103289292	3300010389	Marine Methane Seep Sediment	MARGYRRAPWADPRISVEMYEYVFDAINDHAFRCGESFTTVACELLRCAVEDGKLEEYFPVERR*
(restricted) Ga0172367_101470002	3300013126	Freshwater	MAEGHYITDDSDPRICIMMDQHVFDAINAHAERCRDPFGAVARELLRCAVEDGKLDEYYPRRRR*
(restricted) Ga0172367_101872124	3300013126	Freshwater	MAEGHYLTEDSDPRICIMMDQHVFDAVNKHAARVNRPFSDVARELLRCAVEDGKLDEYYPRSNR*
(restricted) Ga0172365_108540352	3300013127	Sediment	MAEGHYLTENSGPRICIMMDQHVFDAVNAHAARVNRPFSDVARELLRCAVEDGKLDEYYPRRRR*
(restricted) Ga0172373_102401832	3300013131	Freshwater	MAEGHYLTEDSDPRICIMMDQHVFDEVNKHAARVNRPFSDVARELLRCAVEDGKLDEYYPRSNR*
Ga0180120_102167033	3300017697	Freshwater To Marine Saline Gradient	SRALAERGTNLMAEGHYLTDDSDPRICIMMDQHVFDAINAHANRCREPFSNVARELLRCAVEDGKLDEYYPRRRKRWKSA
Ga0169931_102638752	3300017788	Freshwater	MAEGHYLTEDSDPRICIMMDQHVFDEVNKHAARVNRPFSDVARELLRCAVEDGKLDEYYPRSNR
Ga0180437_102332995	3300017963	Hypersaline Lake Sediment	MAEGHYLTDDSDPRICIMMDQHVFDAINAHAEKCREPFGTVARELLRCAVEDGKLDEYYPRRR
Ga0180437_110522422	3300017963	Hypersaline Lake Sediment	MAEGHYLTDDSDPRICIMMDQHIFDAINAHAGRCGEPFSNVARELLRCAVEDGKLDEYYPTKRRRRNQWYRDMYT
Ga0194115_1001417110	3300020183	Freshwater Lake	MADGHYITDDSDPRICIMMDQHVFDAINAHAERCRDPFGAVARELLRCAVEDGKLDEYYPRRRR
Ga0194119_107488271	3300020220	Freshwater Lake	MAEGHYITDDSDPRICIMMDQHVFDAINAHAERCRDPFGAVARELLRCAVEDGKLDEYYPRRRR
Ga0222714_1000094411	3300021961	Estuarine Water	MAEGHYLTDDSDPRICIMMDQHVFDAINAHANRCREPFSNVARELLRCAVEDGKLDEYYPRRMKRWKSA
Ga0222714_100632933	3300021961	Estuarine Water	MAQGHYLYIDSDPRICITMDQHVFDAINQHAAKCKEPFQNVARELLRCAVEDGKLQEYYPTKRFYK
Ga0212029_10223743	3300022063	Aqueous	GMAQGHYLYIDSDPRICITMDQHVFDAINQHAAKCREPFQNVARELLRCAVEDGKLQEYYPNQRFYK
Ga0212031_10094095	3300022176	Aqueous	MAQGHYLYIDSDPRICITMDQHVFDAINQHAAKCREPFQNVARELLRCAVEDGKLQEYYPNQRFYK
Ga0196905_10145263	3300022198	Aqueous	MAEGHYLTDDSDPRICIMMDQHVFDAINAHAAKCGEPFGTVARELLRCAVEDGKLDEYYPKKRRRRNQWYRSMYT
Ga0196905_10163722	3300022198	Aqueous	MAEGHYLTDDSDPRICIMMDQHIFDAINAHAERCGEPFSNVARELLRCAVEDGKLDEYYPKKRSKMYRDLYT
Ga0196905_10268244	3300022198	Aqueous	MAEGHYLTEDSDPRICIMMDQHVFDAINDHAAVCGEPFSTVARELLRCAVEDGKLNEYYPRKRRRRGWKLLSLHTT
Ga0196905_10524482	3300022198	Aqueous	MAEGHYLTDDSDPRICIMMDQHVFDAINAHAERCGEPFTTVARELLRCAVEDGKLDEYYPRRRKRWKTA
Ga0196905_10882742	3300022198	Aqueous	MAEGHYLTDDSDPRICIMMDQHVFDAINAHAERCGEPFSNVARELLRCAVEDRKLNEYYPTKRRRRNQWYRNMYI
Ga0196905_11526121	3300022198	Aqueous	GFEEPRSVEGCVMAEGHYLTDDSDPRICIMMDQHVFDAINAHANRCREPFSNVARELLRCAVEDGKLDEYYPRRRKRWKSA
Ga0196905_11659732	3300022198	Aqueous	HYLTEDSDPRICIMMDQHIFDAINAHANRCREPFSNVARELLRCAVEDGKLDEYYPKKRRRRNQ
Ga0196905_12011732	3300022198	Aqueous	MAEGHYLTDDSDPRICIMMDQHVFDAINAHAEKCREPFGTVARELLRCAVEDGKLDEYYPRRRKKWKSA
Ga0196901_103385410	3300022200	Aqueous	MAEGHYLTDDSDPRICIMMDQHIFDAINAHAERCGEPFSNIARELLRCAVEDGKLNEYYPTKRRRRNQWYRDMYT
Ga0196901_11609081	3300022200	Aqueous	LMAEGHYLTEDSDPRICIMMDQHVFDAINDHAAVCGEPFSTVARELLRCAVEDGKLDEYYPRKRRRRGWKLLSLHTT
Ga0196901_11861392	3300022200	Aqueous	MAEGHYLTDDSDPRICIMMDQHVFDAINAHAERCGEPFSNVARELLRCAVEDGKLDEYYPTKRKRRNQWYRNMYT
Ga0196901_12484793	3300022200	Aqueous	RSVEGCVMAEGHYLTDDSDPRICIMMDQHIFDAINAHAGRCGEPFSNVARELLRCAVEDGKLDEYYPRRRKRWKSA
Ga0208004_11349222	3300025630	Aqueous	MAEGHYLTEDSDPRICIMMDQHVFDAINAHANRCREPFSNVARELLRCAVEDGKLDEYYPTKRKRRNQWYRNMYI
Ga0208161_11352472	3300025646	Aqueous	MAEGHYLTDDSDPRICIMMDQHVFDAINAHAERCGEPFSNVARELLRCAVEDGKLDEYYPTKRRRRNQWYRNMYT
Ga0208161_11475803	3300025646	Aqueous	MAEGHYLTDDSDPRICIMMDQHVFDAINAHANRCREPFSNVARELLRCAVEDGKLDEYYPRRRKRWKSA
Ga0208161_11712442	3300025646	Aqueous	MAEGHYLTEDSDPRICIMMDQHVFDAINDHAAVCGEPFSTVARELLRCAVEDGKLSEYYPKKRRRRGWKLLSLHTT
Ga0208160_100354210	3300025647	Aqueous	MAEGHYLTDDSDPRICIMMDQHIFDAINAHAGRCGEPFSNVARELLRCAVEDGKLNEYYPTKRRRRNQWYRDMYT
Ga0208160_10947142	3300025647	Aqueous	MAEGHYLTEDSDPRICIMMDQHVFDAINAHANRCREPFSNVARELLRCAVEDGKLDEYYPKKRRRRNQ
Ga0208795_10943631	3300025655	Aqueous	LAARGANLMAEGHYLTEDSDPRICIMMDQHVFDAINAHAERCGEPFSNVARELLRCAVEDGKLDEYYPTKRKRRNQWYRNMYT
Ga0208795_11114711	3300025655	Aqueous	QDRAGVEEPRSVEGCVMAEGHYLTEDSDPRICIMMDQHVFDAINAHANRCREPFSNVARELLRCAVEDGKLDEYYPKKRRRRNQ
Ga0208795_11820891	3300025655	Aqueous	GVVRADDDQGGGDPQDRAGFEEPRSVEGCVMAEGHYLTDDSDPRICIMMDQHVFDAINAHANRCREPFSNVARELLRCAVEDGKLDEYYPRRRKRWKSA
Ga0208898_100106922	3300025671	Aqueous	MAQGHYLYIDSDPRICITMDQHVFDAINQHAAKCREPFQNVARELLRCAVEDGKLQEYYPTQRFYK
Ga0208019_10240685	3300025687	Aqueous	LAARGANLMAEGHYLTDDSDPRICIMMDQHVFDAINAHAEKCGEPFGTVARELLRCAVEDGKLSEYYPRKRRRRGWKLLSLHTT
Ga0208019_10830582	3300025687	Aqueous	HGRRNQEDPAGFEEPRSVEGCVMAEGHYLTDDSDPRICIMMDQHVFDAINAHAERCGEPFSNVARELLRCAVEDRKLNEYYPTKRRRRNQWYRNMYI
Ga0208019_12112513	3300025687	Aqueous	EGHYLTEDSDPRICIMMDQHVFDAINAHAAKCREPFGTVARELLRCAVEDGKLDEYYPRRRERWKSA
Ga0208542_12024641	3300025818	Aqueous	CVMAEGHYLTDDSDPRICIMMDQHVFDAINAHANRCREPFSNVARELLRCAVEDGKLDEYYPRRRKRWKSA
Ga0208644_10007377	3300025889	Aqueous	MAEGHYLTEDSDPRICIMMDQHIFDAINAHANRCREPFSNVARELLRCAVEDGKLDEYYPKTRRRRNQWYRNMYT
Ga0208644_10245077	3300025889	Aqueous	MAEGHYLTEDSDPRICIMMDQHVFDAINAHAERCGEPFSNVARELLRCAVEDRKLNEYYPTKRRRRNQWYRNMYI
Ga0208644_102871514	3300025889	Aqueous	MAEGHYLTDDSDPRICIMMDQHVFDAINDHAERCGESFTTVARELLRCAVEDGKLEEYYPRRRKRWK
Ga0208644_105972910	3300025889	Aqueous	MAEGHYLTDDSDPRICIMMDQHIFDAINAHAERCGEPFSNIARELLRCAVEDGKLNEYYP
Ga0209972_100267054	3300027793	Freshwater Lake	MAEGHYLTDDSDPRICIMMDQHVFDAINAHSAKCGEPFTTVARELLRCAVEDGKLDEYYPKKRRRKGWKLLSLHTT
Ga0209427_102820932	3300027901	Marine Sediment	MAEGHYLTDDSAPRICIMMDQHVFDAINAHANRCREPFSNVARELLRCAVEDGKLDEYYPTKRRRRNQWYRDMYT
Ga0209536_10004134110	3300027917	Marine Sediment	MAEGHYLTDDSDPRICIMMDQHIFDAINAHAGRCGEPFSNVARELLRCAVEDGKLDEYYPKKRSKMYRDLYT
Ga0315909_102852301	3300031857	Freshwater	MAEGHYLTDDSDPRICIMMDQHVFDAINAHSAKCGEPFTTVARELLRCAVEDGKLDEYYPKKRRRKGWKLL
Ga0310127_096118_939_1148	3300034072	Fracking Water	MAEGHYLTDDSDPRICIMMDQHVFDAINAHAERCGEPFSNVARELLRCAVEDGKLDEYYPRRRKRWKSA
Ga0310130_0028698_950_1177	3300034073	Fracking Water	MAEGHYLTEDSDPRICIMMDQHVFDAINAHAAKCGEPFGTVARELLRCAVEDGKLDEYYPKKRRRRNQWYRSMYT
Ga0310130_0045145_664_891	3300034073	Fracking Water	MAEGHYLTDDSDPRICIMMDQHIFDAINAHAERCGEPFSNVARELLRCAVEDGKLNEYYPTKRRRRNQWYRNTYT
Ga0310130_0102686_37_246	3300034073	Fracking Water	MAEGHYLTEDSDPRICIMMDQHIFDAINAHANRCREPFSNVARELLRCAVEDGKLDEYYPRRRKRWKSA
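For downstream tools, the family sequences are usually easier to handle as FASTA. The sketch below converts two rows from the table into FASTA records. Two things are assumptions rather than facts stated on this page: the field split (protein ID, then a 10-digit sample Taxon ID, then habitat, then sequence), and the reading of the trailing `*` as a stop-codon marker that should be dropped. The header layout and the function name `to_fasta` are illustrative, not a format defined by NMPFamsDB.

```python
import textwrap

# Two rows from the Family Sequences table, split into
# (protein ID, sample Taxon ID, habitat, sequence). The split is an assumption.
RECORDS = [
    ("Ga0070749_104903482", "3300006802", "Aqueous",
     "MAEGHYLTEDGDPRICIMMDQHVFDAVNEHAARVNRPFSDVARELLRCAVEDGKLDEYYPRSNR*"),
    ("Ga0169931_102638752", "3300017788", "Freshwater",
     "MAEGHYLTEDSDPRICIMMDQHVFDEVNKHAARVNRPFSDVARELLRCAVEDGKLDEYYPRSNR"),
]


def to_fasta(records, width=60):
    """Render records as FASTA text, wrapping sequences at `width` residues."""
    lines = []
    for protein_id, taxon_oid, habitat, sequence in records:
        # Assumption: a trailing '*' marks a stop codon and is not a residue.
        residues = sequence.rstrip("*")
        lines.append(f">{protein_id} taxon_oid={taxon_oid} habitat={habitat}")
        lines.extend(textwrap.wrap(residues, width))
    return "\n".join(lines) + "\n"


fasta = to_fasta(RECORDS)
print(fasta, end="")
```

Sequences flagged as restricted are subject to the JGI data usage policy noted above, so any export of this kind should keep that flag alongside the record.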




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice