NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F104782

Overview

Basic Information
Family ID: F104782
Family Type: Metagenome
Number of Sequences: 100
Average Sequence Length: 67 residues
Representative Sequence: MPEYVMAMSPKEMTEYLTSEFNDLPDEAKRCIATMMAMIMDHSDFLEDQGLTEKFEFEYDSNEGELH
Number of Associated Samples: 68
Number of Associated Scaffolds: 100
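
As a quick consistency check, the representative sequence can be measured against the reported average sequence length. The sketch below is illustrative only; the variable names are assumptions and are not part of NMPFamsDB.

```python
# Representative sequence copied from the Basic Information block of family F104782.
representative = "MPEYVMAMSPKEMTEYLTSEFNDLPDEAKRCIATMMAMIMDHSDFLEDQGLTEKFEFEYDSNEGELH"

# "Average Sequence Length" as reported on the page.
reported_average_length = 67

# Compare the representative's length with the reported family average.
length = len(representative)
print(length, length == reported_average_length)
```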

Quality Assessment
Transcriptomic Evidence: No
Most common taxonomic group: Unclassified
% of genes with valid RBS motifs: 80.00 %
% of genes near scaffold ends (potentially truncated): 25.00 %
% of genes from short scaffolds (< 2000 bps): 79.00 %
Associated GOLD sequencing projects: 56
AlphaFold2 3D model prediction: No


Most Common Taxonomy
Group: Unclassified (52.000 % of family members)
NCBI Taxonomy ID: N/A
Taxonomy: N/A

Most Common Ecosystem
GOLD Ecosystem: Environmental → Aquatic → Marine → Coastal → Unclassified → Aqueous (34.000 % of family members)
Environment Ontology (ENVO): Unclassified (57.000 % of family members)
Earth Microbiome Project Ontology (EMPO): Free-living → Saline → Water (saline) (57.000 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 49.25%, β-sheet: 0.00%, Coil/Unstructured: 50.75%
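
The secondary structure percentages can be converted into approximate residue counts. The sketch below assumes the distribution was computed over the 67-residue representative sequence; the page does not state which sequence was used, so this is an assumption, and the names are illustrative.

```python
# Assumption: percentages refer to the 67-residue representative sequence.
sequence_length = 67

# Secondary structure distribution as reported on the page (percent).
distribution = {"alpha-helix": 49.25, "beta-sheet": 0.00, "coil/unstructured": 50.75}

# Convert each percentage to the nearest whole number of residues.
residues = {name: round(pct / 100 * sequence_length) for name, pct in distribution.items()}

print(residues)
print(sum(residues.values()) == sequence_length)
```

Under that assumption, the reported values correspond to 33 helical and 34 coil or unstructured residues.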



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 100 Family Scaffolds
PF11753 | DUF3310 | 38.00
PF00959 | Phage_lysozyme | 20.00
PF05766 | NinG | 14.00
PF11351 | GTA_holin_3TM | 3.00
PF01807 | zf-CHC2 | 1.00
PF01381 | HTH_3 | 1.00
PF00149 | Metallophos | 1.00
PF04851 | ResIII | 1.00
PF09588 | YqaJ | 1.00
PF02839 | CBM_5_12 | 1.00

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 100 Family Scaffolds
COG0358 | DNA primase (bacterial type) | Replication, recombination and repair [L] | 1.00



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
Unclassified | root | N/A | 52.00 %
All Organisms | root | All Organisms | 48.00 %

Associated Scaffolds


Scaffold | Taxonomy | Length
3300000124|BS_KBA_SWE12_21mDRAFT_c10008494 | Not Available | 3409
3300000418|P_2C_Liq_1_UnCtyDRAFT_1086167 | Not Available | 509
3300000792|BS_KBA_SWE02_21mDRAFT_10002313 | All Organisms → cellular organisms → Bacteria | 7164
3300000947|BBAY92_10031319 | Not Available | 1451
3300000949|BBAY94_10008952 | All Organisms → Viruses → Predicted Viral | 2768
3300004460|Ga0066222_1084339 | All Organisms → Viruses → Predicted Viral | 1732
3300004460|Ga0066222_1084340 | Not Available | 518
3300005613|Ga0074649_1009133 | All Organisms → cellular organisms → Bacteria | 7299
3300005747|Ga0076924_1128697 | All Organisms → Viruses → Predicted Viral | 4125
3300006025|Ga0075474_10015919 | All Organisms → Viruses → Predicted Viral | 2789
3300006026|Ga0075478_10050612 | All Organisms → cellular organisms → Bacteria | 1361
3300006802|Ga0070749_10039342 | Not Available | 2912
3300006802|Ga0070749_10114353 | All Organisms → cellular organisms → Bacteria | 1588
3300006802|Ga0070749_10285924 | Not Available | 927
3300006810|Ga0070754_10050054 | All Organisms → cellular organisms → Bacteria | 2215
3300006810|Ga0070754_10089486 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 1538
3300006920|Ga0070748_1149954 | Not Available | 868
3300007276|Ga0070747_1030836 | Not Available | 2129
3300007276|Ga0070747_1094034 | All Organisms → Viruses → Predicted Viral | 1110
3300007346|Ga0070753_1066140 | All Organisms → cellular organisms → Bacteria | 1453
3300007346|Ga0070753_1194490 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 753
3300007538|Ga0099851_1062923 | All Organisms → Viruses → Predicted Viral | 1446
3300007538|Ga0099851_1283933 | Not Available | 586
3300007540|Ga0099847_1027358 | All Organisms → Viruses → Predicted Viral | 1839
3300007540|Ga0099847_1096564 | All Organisms → cellular organisms → Bacteria | 902
3300007540|Ga0099847_1176302 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 629
3300007542|Ga0099846_1231804 | All Organisms → cellular organisms → Bacteria | 644
3300007609|Ga0102945_1015124 | All Organisms → cellular organisms → Bacteria | 1768
3300007609|Ga0102945_1015429 | Not Available | 1746
3300007609|Ga0102945_1064106 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 695
3300009074|Ga0115549_1037909 | Not Available | 1780
3300009076|Ga0115550_1010219 | Not Available | 5073
3300009434|Ga0115562_1136939 | Not Available | 926
3300009438|Ga0115559_1019205 | All Organisms → Viruses → Predicted Viral | 3423
3300009509|Ga0123573_11562058 | Not Available | 613
3300010316|Ga0136655_1042588 | All Organisms → cellular organisms → Bacteria | 1437
3300011254|Ga0151675_1060751 | Not Available | 1925
3300013010|Ga0129327_10743517 | Not Available | 553
3300014042|Ga0117790_1050057 | All Organisms → cellular organisms → Bacteria → Terrabacteria group | 727
3300017727|Ga0181401_1063766 | Not Available | 984
3300017824|Ga0181552_10492092 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 578
3300018416|Ga0181553_10043992 | All Organisms → Viruses → Predicted Viral | 3019
3300021085|Ga0206677_10005186 | All Organisms → cellular organisms → Bacteria | 10354
3300021085|Ga0206677_10031847 | All Organisms → Viruses → Predicted Viral | 2952
3300021085|Ga0206677_10035166 | All Organisms → cellular organisms → Bacteria | 2760
3300022053|Ga0212030_1008934 | Not Available | 1212
3300022053|Ga0212030_1050918 | Not Available | 588
3300022169|Ga0196903_1006787 | All Organisms → cellular organisms → Bacteria | 1474
3300022169|Ga0196903_1022034 | Not Available | 767
3300022178|Ga0196887_1088607 | Not Available | 711
3300022187|Ga0196899_1148986 | Not Available | 652
3300022220|Ga0224513_10000053 | Not Available | 35101
(restricted) 3300023109|Ga0233432_10119024 | All Organisms → cellular organisms → Bacteria | 1446
(restricted) 3300024059|Ga0255040_10139562 | Not Available | 969
(restricted) 3300024062|Ga0255039_10366779 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 620
3300024281|Ga0228610_1046548 | Not Available | 599
(restricted) 3300024519|Ga0255046_10361498 | Not Available | 685
3300025543|Ga0208303_1035568 | All Organisms → cellular organisms → Bacteria | 1295
3300025543|Ga0208303_1042641 | Not Available | 1142
3300025617|Ga0209138_1009343 | All Organisms → cellular organisms → Bacteria | 5430
3300025621|Ga0209504_1018381 | All Organisms → Viruses → Predicted Viral | 2769
3300025626|Ga0209716_1108079 | Not Available | 779
3300025641|Ga0209833_1047117 | All Organisms → Viruses → Predicted Viral | 1482
3300025652|Ga0208134_1069695 | Not Available | 1048
3300025653|Ga0208428_1000946 | All Organisms → cellular organisms → Bacteria | 12590
3300025655|Ga0208795_1103029 | Not Available | 762
3300025759|Ga0208899_1023344 | All Organisms → Viruses → Predicted Viral | 3041
3300025759|Ga0208899_1084992 | Not Available | 1223
3300025821|Ga0209600_1174878 | Not Available | 580
3300025889|Ga0208644_1119643 | All Organisms → cellular organisms → Bacteria | 1259
3300026097|Ga0209953_1018574 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1252
3300026511|Ga0233395_1080182 | Not Available | 881
(restricted) 3300027861|Ga0233415_10138950 | Not Available | 1096
3300027917|Ga0209536_100027685 | Not Available | 7770
(restricted) 3300027996|Ga0233413_10524534 | Not Available | 526
3300031539|Ga0307380_10238990 | All Organisms → Viruses → Predicted Viral | 1725
3300031539|Ga0307380_10382126 | All Organisms → Viruses → Predicted Viral | 1276
3300031539|Ga0307380_10602391 | Not Available | 945
3300031539|Ga0307380_10706691 | All Organisms → cellular organisms → Bacteria | 848
3300031539|Ga0307380_11020612 | Not Available | 658
3300031565|Ga0307379_10386114 | All Organisms → cellular organisms → Bacteria | 1347
3300031565|Ga0307379_10504184 | All Organisms → Viruses → Predicted Viral | 1131
3300031566|Ga0307378_11146251 | Not Available | 620
3300031566|Ga0307378_11329838 | Not Available | 559
3300031566|Ga0307378_11494081 | All Organisms → cellular organisms → Bacteria | 515
3300031578|Ga0307376_10205134 | All Organisms → Viruses → Predicted Viral | 1344
3300031578|Ga0307376_10364078 | Not Available | 956
3300031578|Ga0307376_10384858 | Not Available | 924
3300031578|Ga0307376_10468033 | Not Available | 819
3300031578|Ga0307376_10743357 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 611
3300031578|Ga0307376_10872164 | Not Available | 551
3300031669|Ga0307375_10158580 | Not Available | 1557
3300031669|Ga0307375_10497661 | Not Available | 735
3300031669|Ga0307375_10700542 | Not Available | 583
3300031669|Ga0307375_10825614 | Not Available | 521
3300031673|Ga0307377_10572846 | Not Available | 812
3300032272|Ga0316189_11087931 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria | 604
3300033742|Ga0314858_025968 | Not Available | 1323
3300034374|Ga0348335_131988 | Not Available | 718
3300034418|Ga0348337_189728 | Not Available | 526
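
Each scaffold identifier in this table joins a 10-digit numeric prefix and a scaffold name with a "|" character; the prefix appears to match the Taxon OID column of the Associated Samples table. A minimal Python sketch of splitting one identifier follows. The helper name `split_scaffold_id` is hypothetical and not part of NMPFamsDB or IMG/M.

```python
def split_scaffold_id(scaffold_id: str) -> tuple[str, str]:
    """Split '<taxon OID>|<scaffold name>' at the first '|' (hypothetical helper)."""
    taxon_oid, scaffold_name = scaffold_id.split("|", 1)
    return taxon_oid, scaffold_name


# Identifier copied from the first row of the Associated Scaffolds table.
taxon_oid, scaffold_name = split_scaffold_id("3300000124|BS_KBA_SWE12_21mDRAFT_c10008494")
print(taxon_oid, scaffold_name)
```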

Note: Some of these datasets are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Aqueous | Environmental → Aquatic → Marine → Coastal → Unclassified → Aqueous | 34.00%
Soil | Environmental → Terrestrial → Soil → Clay → Unclassified → Soil | 21.00%
Pelagic Marine | Environmental → Aquatic → Marine → Pelagic → Unclassified → Pelagic Marine | 8.00%
Seawater | Environmental → Aquatic → Marine → Strait → Unclassified → Seawater | 4.00%
Pond Water | Environmental → Aquatic → Non-Marine Saline And Alkaline → Saline → Unclassified → Pond Water | 4.00%
Seawater | Environmental → Aquatic → Marine → Inlet → Unclassified → Seawater | 3.00%
Seawater | Environmental → Aquatic → Marine → Intertidal Zone → Unclassified → Seawater | 3.00%
Seawater | Environmental → Aquatic → Marine → Coastal → Unclassified → Seawater | 2.00%
Freshwater To Marine Saline Gradient | Environmental → Aquatic → Marine → Coastal → Unclassified → Freshwater To Marine Saline Gradient | 2.00%
Marine | Environmental → Aquatic → Marine → Coastal → Unclassified → Marine | 2.00%
Salt Marsh | Environmental → Aquatic → Marine → Intertidal Zone → Salt Marsh → Salt Marsh | 2.00%
Macroalgal Surface | Host-Associated → Algae → Green Algae → Ectosymbionts → Unclassified → Macroalgal Surface | 2.00%
Marine Sediment | Environmental → Aquatic → Marine → Oceanic → Sediment → Marine Sediment | 1.00%
Worm Burrow | Environmental → Aquatic → Marine → Coastal → Sediment → Worm Burrow | 1.00%
Marine | Environmental → Aquatic → Marine → Coastal → Unclassified → Marine | 1.00%
Sea-Ice Brine | Environmental → Aquatic → Marine → Coastal → Unclassified → Sea-Ice Brine | 1.00%
Marine | Environmental → Aquatic → Marine → Intertidal Zone → Unclassified → Marine | 1.00%
Marine | Environmental → Aquatic → Marine → Intertidal Zone → Sediment → Marine | 1.00%
Marine | Environmental → Aquatic → Marine → Wetlands → Sediment → Marine | 1.00%
Marine | Environmental → Aquatic → Marine → Unclassified → Unclassified → Marine | 1.00%
Enviromental | Environmental → Aquatic → Marine → Unclassified → Unclassified → Enviromental | 1.00%
Sediment | Environmental → Aquatic → Marine → Sediment → Unclassified → Sediment | 1.00%
Saline Water And Sediment | Environmental → Aquatic → Non-Marine Saline And Alkaline → Saline → Sediment → Saline Water And Sediment | 1.00%
Mangrove Sediment | Environmental → Aquatic → Sediment → Unclassified → Unclassified → Mangrove Sediment | 1.00%
Epidermal Mucus | Host-Associated → Fish → Skin → Epidermal Mucus → Unclassified → Epidermal Mucus | 1.00%



Associated Samples


Taxon OID | Sample Name | Habitat Type
3300000124 | Marine microbial communities from chronically polluted sediments in the Baltic Sea - site KBA sample SWE 12_21m | Environmental
3300000418 | Marine microbial community from Union City, CA, USA - Pond 2C Liquid 1 | Environmental
3300000792 | Marine microbial communities from chronically polluted sediments in the Baltic Sea - site KBA sample SWE 02_21m | Environmental
3300000947 | Macroalgal surface ecosystem from Botany Bay, Sydney, Australia - BBAY92 | Host-Associated
3300000949 | Macroalgal surface ecosystem from Botany Bay, Sydney, Australia - BBAY94 | Host-Associated
3300004460 | Marine viral communities from Newfoundland, Canada BC-1 | Environmental
3300005613 | Saline sediment microbial communities from Etoliko Lagoon, Greece - sediment | Environmental
3300005747 | Seawater microbial communities from Vineyard Sound, MA, USA - control T14 | Environmental
3300006025 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - DEBay_Sum_22_D_<0.8_DNA | Environmental
3300006026 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - DEBay_Sum_29_D_<0.8_DNA | Environmental
3300006802 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - Viral MetaG DEL_Nov_18 | Environmental
3300006810 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - Viral MetaG DEL_Sep_01 | Environmental
3300006920 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - Viral MetaG DEL_Nov_12 | Environmental
3300007276 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - Viral MetaG DEL_Mar_31 | Environmental
3300007346 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - Viral MetaG DEL_Aug_31 | Environmental
3300007538 | Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1508_2 Viral MetaG | Environmental
3300007540 | Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1504_2 Viral MetaG | Environmental
3300007542 | Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1504_1 Viral MetaG | Environmental
3300007609 | Salt pond water microbial communities from South San Francisco under conditions of wetland restoration - Salt Pond MetaG R2_restored_H2O_MG | Environmental
3300009074 | Pelagic marine microbial communities from North Sea - COGITO_mtgs_100430 | Environmental
3300009076 | Pelagic marine microbial communities from North Sea - COGITO_mtgs_100511 | Environmental
3300009434 | Pelagic marine microbial communities from North Sea - COGITO_mtgs_110516 | Environmental
3300009438 | Pelagic marine microbial communities from North Sea - COGITO_mtgs_110506 | Environmental
3300009509 | Mangrove sediment microbial communities from Mai Po Nature Reserve Marshes in Hong Kong, China - Maipo_11 | Environmental
3300010316 | Freshwater to marine salinity gradient microbial communities from Chesapeake Bay, USA - CPBay_Spr_15_0.8_DNA | Environmental
3300011254 | Seawater microbial communities from Japan Sea near Toyama Prefecture, Japan - 2015_1, 0.02 | Environmental
3300013010 | Freshwater to marine salinity gradient microbial communities from Chesapeake Bay, USA - CPBay_Spr_31_0.8_DNA | Environmental
3300014042 | Epidermal mucus viral and microbial communities from European eel in Spain - Ebro delta (0.22 um filter) | Host-Associated
3300017727 | Marine viral communities from the oligotrophic San Pedro Time Series (SPOT) site, San Pedro Channel, CA, USA - 24 SPOT_SRF_2011-07-20 | Environmental
3300017824 | Coastal salt marsh microbial communities from the Groves Creek Marsh, Skidaway Island, Georgia - 011501BT metaG (megahit assembly) | Environmental
3300018416 | Coastal salt marsh microbial communities from the Groves Creek Marsh, Skidaway Island, Georgia - 011502XT metaG (megahit assembly) | Environmental
3300021085 | Ammonia-oxidizing marine archaeal communities from Monterey Bay, California, United States - M1 30m 12015 | Environmental
3300022053 | Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1504_2 Viral MetaG (v2) | Environmental
3300022169 | Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1504_2 Viral MetaG (v3) | Environmental
3300022178 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - Viral MetaG DEL_Mar_31 (v3) | Environmental
3300022187 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - Viral MetaG DEL_Sep_01 (v3) | Environmental
3300022220 | Sediment microbial communities from San Francisco Bay, California, United States - SF_May12_sed_USGS_21 | Environmental
3300023109 (restricted) | Seawater microbial communities from Saanich Inlet, British Columbia, Canada - SI_122_August2016_10_MG | Environmental
3300024059 (restricted) | Seawater microbial communities from Strait of Georgia, British Columbia, Canada - BC1_12_2 | Environmental
3300024062 (restricted) | Seawater microbial communities from Strait of Georgia, British Columbia, Canada - BC1_12_1 | Environmental
3300024281 | Seawater microbial communities from Monterey Bay, California, United States - 11D | Environmental
3300024519 (restricted) | Seawater microbial communities from Strait of Georgia, British Columbia, Canada - BC1_12_27 | Environmental
3300025543 | Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1504_2 Viral MetaG (SPAdes) | Environmental
3300025617 | Marine microbial communities from expanding oxygen minimum zones in the Saanich Inlet - ESP_153SG_22_DNA (SPAdes) | Environmental
3300025621 | Pelagic marine microbial communities from North Sea - COGITO_mtgs_100511 (SPAdes) | Environmental
3300025626 | Pelagic marine microbial communities from North Sea - COGITO_mtgs_120531 (SPAdes) | Environmental
3300025641 | Pelagic marine microbial communities from North Sea - COGITO_mtgs_110506 (SPAdes) | Environmental
3300025652 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - Viral MetaG DEL_Mar_31 (SPAdes) | Environmental
3300025653 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - DEBay_Sum_29_N_>0.8_DNA (SPAdes) | Environmental
3300025655 | Freshwater to marine saline gradient viral communities from Chesapeake Bay - CB_1508_2 Viral MetaG (SPAdes) | Environmental
3300025759 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - Viral MetaG DEL_Nov_24 (SPAdes) | Environmental
3300025821 | Pelagic marine microbial communities from North Sea - COGITO_mtgs_110421 (SPAdes) | Environmental
3300025889 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - Viral MetaG DEL_Nov_18 (SPAdes) | Environmental
3300026097 | Salt pond water microbial communities from South San Francisco under conditions of wetland restoration - Salt Pond MetaG R2_restored_H2O_MG (SPAdes) | Environmental
3300026511 | Seawater microbial communities from Monterey Bay, California, United States - 27D | Environmental
3300027861 (restricted) | Seawater microbial communities from Saanich Inlet, British Columbia, Canada - Na_anoxic_12_MG | Environmental
3300027917 | Marine sediment microbial communities from White Oak River estuary, North Carolina - WOR-2-8_12 (SPAdes) | Environmental
3300027996 (restricted) | Seawater microbial communities from Saanich Inlet, British Columbia, Canada - Na_anoxic_6_MG | Environmental
3300031539 | Soil microbial communities from Risofladan, Vaasa, Finland - UN-3 | Environmental
3300031565 | Soil microbial communities from Risofladan, Vaasa, Finland - UN-2 | Environmental
3300031566 | Soil microbial communities from Risofladan, Vaasa, Finland - UN-1 | Environmental
3300031578 | Soil microbial communities from Risofladan, Vaasa, Finland - TR-2 | Environmental
3300031669 | Soil microbial communities from Risofladan, Vaasa, Finland - TR-1 | Environmental
3300031673 | Soil microbial communities from Risofladan, Vaasa, Finland - TR-3 | Environmental
3300032272 | Coastal sediment microbial communities from Maine, United States - Lowes Cove worm burrow | Environmental
3300033742 | Sea-ice brine viral communities from Beaufort Sea near Barrow, Alaska, United States - 2018 seawater | Environmental
3300034374 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - Viral MetaG DEL_Aug_31 (v4) | Environmental
3300034418 | Aqueous microbial communities from the Delaware River and Bay under freshwater to marine salinity gradient to study organic matter cycling in a time-series - Viral MetaG DEL_Aug_28 (v4) | Environmental


Family Sequences

Note: Some of these sequences are restricted, as per the data usage policy of the Joint Genome Institute (JGI). Utilizing any of their features below requires obtaining a license from the datasets' corresponding author(s).

Protein ID | Sample Taxon ID | Habitat | Sequence
BS_KBA_SWE12_21mDRAFT_100084945 | 3300000124 | Marine | MTEYVIAMSAKEMTEYLSNEFNDLPDDARRCIATMMAMIMDHSDFLEDQGLTEKFEFEYDSNEGELH*
P_2C_Liq_1_UnCtyDRAFT_10861671 | 3300000418 | Enviromental | VFVRR*TPSPDTKPLAKRSSERVKMTEFVFAMSAQEMSEYLQNDFHDLPDDARRCIATMMAMIMDHXEFLEXQGLIEKFEFEYDSNEGDLH*NQQTIK*
BS_KBA_SWE02_21mDRAFT_100023138 | 3300000792 | Marine | MTEYVIAMSAKEMTEYLSNEXNXLPDDARRCIATMMAMIMDHSDFLEDQGLTEKFEFEYDSNEGELH*
BBAY92_100313192 | 3300000947 | Macroalgal Surface | MAKRSSEGLAMTEFIFAMSSQEMTDYLSNEFHDLPDDARRCIATMMAMIMDHTDFLEDQGLIEKFEFEYDSNEGELH*
BBAY94_100089522 | 3300000949 | Macroalgal Surface | MAKRSSEGLAMTEFIFAMSSQEMTDYLSNEFHDLPDDARRCIATMMAMIMDHTDFLEDQGLTEKFEFEYDSNEGELH*
Ga0066222_10843396 | 3300004460 | Marine | MPEYVMAMSPAEMTEFLESEFYELEEGPKRCIATMMAMLMDHSEFLEEQGLEEKFDFLYDKDGGDLH*
Ga0066222_10843402 | 3300004460 | Marine | MAMSPKELTEYLTSEFYDLPDEAKRCIATMMAMIMDHSDFLEDQGLTEKFEFEYDRDEGELH*
Ga0074649_10091338 | 3300005613 | Saline Water And Sediment | MTEYVIAMSAKEMTEYLSNEFNDLPDDARRCIATMMAMIMDHHDFLEDQGLTEKFEFEYDSNEGELH*
Ga0076924_11286979 | 3300005747 | Marine | MPEYVIAMSAEEMSEFLESEFYDLEDNPKRCIATMMAMLIDHTEFLEEQGLEEKFDFLYDKDGGDLH*
Ga0075474_100159191 | 3300006025 | Aqueous | SQEMTDYLSNEFHDLPDEAKRCIATMMAMIMDHTDFLEEQGLKEKFEFEYDSNEGELH*
Ga0075478_100506121 | 3300006026 | Aqueous | MTEFIFAMSSQEMTDYLTNEFHDLPDDARRCIATMMAMIMDHTDFLEEQGLKEKFEFEYDSNEGELH*
Ga0070749_100393424 | 3300006802 | Aqueous | MPEYVLAMSAEEMNEYLMNDFHDLPDEAKRCIATMMAMIMDHSDFLEDQGLTEKFEFEYDSNEGELH*
Ga0070749_101143533 | 3300006802 | Aqueous | MTEFIFAMSSQEMTDYLSNEFHDLPDEAKRCIATMMAMIMDHTDFLEEQGLKEKFEFEYDSNEGELH*
Ga0070749_102859242 | 3300006802 | Aqueous | MPEYVLAMSETELTEYLMSEFHDLPDDAKRCIATMMSMIMDHSDFLEDQGLTEKFEFEYDSNEGELH*
Ga0070754_100500546 | 3300006810 | Aqueous | MPEYVMAMSPKELTEYLTSDFYDLPDEAKRCIATMMAMIMDHSDFLEDQGLTEKFEFEYDRDEGELH*
Ga0070754_100894861 | 3300006810 | Aqueous | GLKMTEFIFAMSSQEMTDYLSNEFHDLPDEAKRCIATMMAMIMDHTDFLEEQGLKEKFEFEYDSNEGELH*
Ga0070748_11499541 | 3300006920 | Aqueous | MPEYVLAMSAEEMNEYLMSDFHDLPDEAKRCIATMMAMIMDHSDFLEDQGLTEKFEFEYDSNEGELH*
Ga0070747_10308363 | 3300007276 | Aqueous | MTEYVMAMSPKELTEYLTSEFYDLPDEAKRCIATMMAMIMDHSDFLEDQGLTEKFEFEYDSDEGELH*
Ga0070747_10940342 | 3300007276 | Aqueous | MPEYVLAMSATELTEYLMSDFHDLPDEAKRCIATMMSMIMDHSDFLEDQGLTEKFEFEYDSNEGELH*
Ga0070753_10661401 | 3300007346 | Aqueous | MTEFIFAMSSQEMTDYLTNEFHDLPDDARRCIATMMAMIMDHTDFLEEQGLKEKFEFEYDSNEG
Ga0070753_11944901 | 3300007346 | Aqueous | MPEYVLAMSAKEMHEYLMSDFHDLPDEAKRCIATMMAMIMDHSDFLEDQGLTEKFEFEYDSNEGELH*
Ga0099851_10629233 | 3300007538 | Aqueous | MPEYVLAMSETELTEYLMSEFHDLPDEAKRCIATMMSMIMDHSDFLEDQGLTEKFEFEYDSNEGELH*
Ga0099851_12839331 | 3300007538 | Aqueous | MTEFIFAMSSQEMTDYLSNEFHDLPDDARRCIATMMAMIMDYFDFLEEQGLKEKFEFEYDSNEGELH*
Ga0099847_10273584 | 3300007540 | Aqueous | MTEFIFAMSSQEMTDYLSNEFHDLPDDARRCIATMMAMIMDYTDFLEEQGLKEKFEFEYDSNEGELH*
Ga0099847_10965641 | 3300007540 | Aqueous | MPEYVLAMSETELTEYLMSDFHDLPDEAKRCIATMMSMIMDHSDFLEDQGLTEKFEFEYDSNEGELH*
Ga0099847_11763023 | 3300007540 | Aqueous | VMAMSPKELTEYLTSEFYELPDEAKRCIATMMAMIMDHSDFLEDQGLTEKFEFEYDRDEGELH*
Ga0099846_12318042 | 3300007542 | Aqueous | MPEYVMAMSPKELTEYLTSEFYELPDEAKRCIATMMAMIMDHSDFLEDQGLTEKFEFEYDSDEGE
Ga0102945_10151247 | 3300007609 | Pond Water | VKMTEFIFAMSAQEMSEYLHNDFHDLPDDARRCIATMMAMIMDHNEFLEDQGLIEKFEFEYDSNEGDLH*
Ga0102945_10154293 | 3300007609 | Pond Water | MTEFIFAMSAQEMSEYLHNDFHDLPDDARRCIATMMAMIMDHTDFLEDQGLTEKFEFEYDSNEGELH*
Ga0102945_10641063 | 3300007609 | Pond Water | VKMTEFIFAMSAQEMSEYLHNDFHDLPDDARRCIATMMAMIMDHNEFLEDQGLTKKFEFEYDSNEGDLH*
Ga0115549_10379092 | 3300009074 | Pelagic Marine | MPEYLMAMSPEEMTEFLESEFYDLEDNPKRCIATMMAMLMDHNEFLEEQGLEEKFDFLYDNDEGDLH*
Ga0115550_101021912 | 3300009076 | Pelagic Marine | MPEYVMAMSPKEMTEYLTNEFYELPDEAKRCIATMMAMIMDHSDFLEDQGLTEKFEFEYDRDEGELH*
Ga0115562_11369392 | 3300009434 | Pelagic Marine | MPEYVMAMSPKEMTEYLTNEFYELPDEAKRCIATMMAMIIDHSDFLEDQGLTEKFEFEYDRDEGELH*
Ga0115559_10192054 | 3300009438 | Pelagic Marine | MPEYVMAMSPKELTEYLTSEFYELPDEAKRCIATMMAMIMDHSDFLEDQGLTEKFEFEYDRDEGELH*
Ga0123573_115620582 | 3300009509 | Mangrove Sediment | MPEYVLAMSAEEMTEFLENEFHDLEESPKRCIATMMSMLMDHNEFLEEQGLEEKFEFLYDNNEGELH*
Ga0136655_10425883 | 3300010316 | Freshwater To Marine Saline Gradient | MTEFIFAMSSQEMTDYLSNEFHDLPDDAKRCIATMMAVIMDHTDFLEEQGLKEKFEFEYDSNEGELH*
Ga0151675_10607513 | 3300011254 | Marine | MTQYIIAMTSEEMAEFLDGEFHDLEEGPKRCIATMMAMLMDHNEFLEEQGLEEKFDFLYDNESGELH*
Ga0129327_107435173 | 3300013010 | Freshwater To Marine Saline Gradient | DNLKEIKQMPEYVMAMSPKELTEYLTSDFYDLPDEAKRCIATMMAMIMDHSDFLEDQGLTEKFEFEYDRDEGELH*
Ga0117790_10500571 | 3300014042 | Epidermal Mucus | MPEYVIAMSAKEMTDYLSNEFNDLPDDARRCIATMMAMIMDHNDFLEDQGLTEKFEFEYDSNEGELH*
Ga0181401_10637663 | 3300017727 | Seawater | MTEFIFAMSAQEMSEYLHNDFHDLPDDARRCIATMMAMIMDHNEFLEDQGLIEKFEFEYDSNEGDLH
Ga0181552_104920923 | 3300017824 | Salt Marsh | TEYLMSDFHDLPDEAKRCIATMMAMIMDHSDFLEDQGLTEKFEFEYDSNEGELH
Ga0181553_1004399210 | 3300018416 | Salt Marsh | MPEYVLAMSAEEITEYLMSDFHDLPDEAKRCIATMMAMIMDHSDFLEDQGLTEKFEFEYDSNEGELH
Ga0206677_1000518610 | 3300021085 | Seawater | MPEYVMAMSPAEMTEFLESEFYDLEDGPKRCIATMMAMLMDHSEFLEEQGLEEKFDFLYDNDGGDLH
Ga0206677_100318478 | 3300021085 | Seawater | MPEYVIAMSAEEMSEFLESEFYDLEDKPKRCIATMMTMLIDHTEFLEEQGLEEKFDFLYDNDGGELH
Ga0206677_100351666 | 3300021085 | Seawater | MPEYVMAMSAEEMTEFLDGEFHDLEEGPKRCIATMMAMLMDHSEFLEEQGLEEKFDFLYDNDGGDLH
Ga0212030_10089343 | 3300022053 | Aqueous | MTEYVMAMSPKELTEYLTSEFYDLPDEAKRCIATMMAMIMDHSDFLEDQGLTEKFEFEYDSDEGELH
Ga0212030_10509183 | 3300022053 | Aqueous | MPEYVMAMSPKELTEYLTSEFYELPDEAKRCIATMMAMIMDHSDFLEDQGLTEKFEFEYDRDEGELHXNPQITK
Ga0196903_10067874 | 3300022169 | Aqueous | MPEYVLAMSETELTEYLMSEFHDLPDDAKRCIATMMSMIMDHSDFLEDQGLTEKFEFEYDSNEGELH
Ga0196903_10220343 | 3300022169 | Aqueous | MTEYVMAMSPKELTEYLTSEFYELPDEAKRCIATMMAMIMDHSDFLEDQGLTEKFEFEYDSDEGELH
Ga0196887_10886073 | 3300022178 | Aqueous | MPEYVMAMSPKELTEYLTSEFYELPDEAKRCIATMMAMIMDHSDFLEDQGLTEKFEFEYDRDEGELH
Ga0196899_11489862 | 3300022187 | Aqueous | MPEYVMAMSPKELTEYLTSDFYDLPDEAKRCIATMMAMIMDHSDFLEDQGLTEKFEFEYDRDEGELH
Ga0224513_1000005335 | 3300022220 | Sediment | MPEYVIAMSAEEMSEFLESEFYDLEDNPKRCIATMMAMLIDHTEFLEEQGLEEKFDFLYDKDGGDLH
(restricted) Ga0233432_101190242 | 3300023109 | Seawater | MPEYVMAMSAEEMTEFLESEFYDLEDAPKRCIATMMAMLMDHTEFLEEQGLEEKFDFLYDNDGGDLH
(restricted) Ga0255040_101395622 | 3300024059 | Seawater | MPEYVMAMSAEEMTEFLESEFYDLEDGPKRCIATMMAMLMDHSEFLEEQGLEEKFDFLYDNDGGDLH
(restricted) Ga0255039_103667791 | 3300024062 | Seawater | MPEYVMAMSAEAMTEFLESEFYDLEDGPKRCIATMMAMLMDHSEFLEEQGLEEKFDFLYDNDGGDLH
Ga0228610_10465482 | 3300024281 | Seawater | MTEYVMAMSPKELTEYLTSDFYELPDEAKRCIATMMAMIMDHSDFLEDQGLTEKFEFEYDRDEGELH
(restricted) Ga0255046_103614981 | 3300024519 | Seawater | MPEYVMAMSPAEMTEFLESEFYDLEDGPKRCIATMMAMLMDHTEFLEEQGLEEKFDFLYDNDGGDLH
Ga0208303_10355684 | 3300025543 | Aqueous | MPEYVLAMSETELTEYLMSDFHDLPDEAKRCIATMMSMIMDHSDFLEDQGLTEKFEFEYDSNEGELH
Ga0208303_10426414 | 3300025543 | Aqueous | MTEFIFAMSSQEMTDYLSNEFHDLPDDARRCIATMMAMIMDYTDFLEEQGLKEKFEFEYDSNEGELH
Ga0209138_10093439 | 3300025617 | Marine | MPEYVMAMSPAEMTEFLESEFYDLEDSPKRCIATMMAMLMDHSEFLEEQGLQEKFDFMYDKDGGDLH
Ga0209504_10183815 | 3300025621 | Pelagic Marine | MPEYLMAMSPEEMTEFLESEFYDLEDNPKRCIATMMAMLMDHNEFLEEQGLEEKFDFLYDNDEGDLH
Ga0209716_11080792 | 3300025626 | Pelagic Marine | MPEYVMAMSPKEMTEYLTNEFYELPDEAKRCIATMMAMIMDHSDFLEDQGLTEKFEFEYDRDEGELH
Ga0209833_10471172 | 3300025641 | Pelagic Marine | MPEYVMAMSPKEMTEYLTSEFYELPDEAKRCIATMMAMIMDHSDFLEDQGLTEKFEFEYDRDEGELH
Ga0208134_10696952 | 3300025652 | Aqueous | MPEYVLAMSATELTEYLMSDFHDLPDEAKRCIATMMSMIMDHSDFLEDQGLTEKFEFEYDSNEGELH
Ga0208428_100094613 | 3300025653 | Aqueous | MTEFIFAMSSQEMTDYLSNEFHDLPDEAKRCIATMMAMIMDHTDFLEEQGLKEKFEFEYDSNEGELH
Ga0208795_11030291 | 3300025655 | Aqueous | MPEYVLAMSETELTEYLMSEFHDLPDEAKRCIATMMSMIMDHSDFLEDQGLTEKFEFEYDSNEG
Ga0208899_10233445 | 3300025759 | Aqueous | MPEYVLAMSAEEMNEYLMNDFHDLPDEAKRCIATMMAMIMDHSDFLEDQGLTEKFEFEYDSNEGELH
Ga0208899_10849923 | 3300025759 | Aqueous | LAKYSHKGLKMTEFIFAMSSQEMTDYLSNEFHDLPDEAKRCIATMMAMIMDHTDFLEEQGLKEKFEFEYDSNEGELH
Ga0209600_11748783 | 3300025821 | Pelagic Marine | QLKNCGKDSQSRGSKMPEYVMAMSPKELTEYLTSEFYELPDEAKRCIATMMAMIMDHSDFLEDQGLTEKFEFEYDRDEGELH
Ga0208644_11196431 | 3300025889 | Aqueous | SAEEMNEYLMSDFHDLPDEAKRCIATMMAMIMDHSDFLEDQGLTEKFEFEYDSNEGELH
Ga0209953_10185745 | 3300026097 | Pond Water | PDTKPLAKRSSERVKMTEFIFAMSAQEMSEYLHNDFHDLPDDARRCIATMMAMIMDHNEFLEDQGLIEKFEFEYDSNEGDLH
Ga0233395_10801823 | 3300026511 | Seawater | TEYVMAMSPKELTEYLTSDFYELPDEAKRCIATMMAMIMDHSDFLEDQGLTEKFEFEYDRDEGELH
(restricted) Ga0233415_101389502 | 3300027861 | Seawater | MPEYVMAMSAEEMTEFLESEFYDLEDGPKRCIATMMAMLMDHTEFLEEQGLEEKFDFLYDNDGGDLH
Ga0209536_10002768512 | 3300027917 | Marine Sediment | MPEYVLAMSAEEMHEYLMSDFHDLPDEAKRCIATMMAMIMDHSDFLEDQGLTEKFEFEYDSNEGELH
(restricted) Ga0233413_105245342 | 3300027996 | Seawater | MTQYIIAMTSEEMTEFLDGEFHDLEDGPKRCIATMMAMLMDHTEFLEEQGLEEKFDFLYDNDGGDLH
Ga0307380_102389905 | 3300031539 | Soil | MPEYLMAMSPKELTEYLTSEFDDLPDDARRCIATMMAMIMDHSDFLEDQGLTEKFEFEYDRDEGELH
Ga0307380_103821263 | 3300031539 | Soil | MTEYVIAMSAKEMTEYLSNEFNDLPDDARRCIATMMAMIMDHSDFLEDQGLTEKFEFEYDSNEGELH
Ga0307380_106023912 | 3300031539 | Soil | MPEYVMAMSPKEMTEYLTSEFYELPDEAKRCIATMMAMIMDHSDFLEDQGLTEKFEFEYDSNEGELH
Ga0307380_107066913 | 3300031539 | Soil | MPEYLMAMSPAEMTEFLESEFYELEEAPKRCIATMMAMLMDHSEFLEEQGLEEKFDFLYDRDEGELH
Ga0307380_110206122 | 3300031539 | Soil | MNEFIFAMSAKEMTEYLSNEFNDLPDDARRCIATMMAMIMDHNDFLEDQGLTEKFEFEYDRDEGELH
Ga0307379_103861144 | 3300031565 | Soil | MPEYLMAMSPAEMTEFLESEFYELEEGPKRCIATMMAMLMDHSEFLEEQGLEEKFDFLYDRDEGELH
Ga0307379_105041841 | 3300031565 | Soil | MPEYVIAMSAKEMTDYLSNEFDDLPDDARRCIATMMAMIMDHNDFLEDQGLTEKFEFEYDRDEGELH
Ga0307378_111462512 | 3300031566 | Soil | MPEYVIAMSAKEMTDYLSNEFDDLPDDARRCIATMMAMIMDHNDFLEDQGLTKKFEFEY
Ga0307378_113298382 | 3300031566 | Soil | MPEYLMAMSPKELTEYLTSEFYDLPDEAKRCIATMMAMLMDHSEFLEEQGLEEKFDFLYDRDEGELH
Ga0307378_114940811 | 3300031566 | Soil | MPEYVMAMSPKEMTEYLTSEFYELPDEAKRCIATMMAMIMDHSDFLEDQGLTEKFEFEYDSNEGELHXNPQTTK
Ga0307376_102051343 | 3300031578 | Soil | MPEYLMAMSPKELTEYLTSEFYDLPDEAKRCIATMMAMLMDHSEFLEEQGLEEKFDFLYDKDGGDLH
Ga0307376_103640782 | 3300031578 | Soil | MPEYVIAMSAKEMTDYLSNEFDDLPDDARRCIATMMAMIMDHNDFLEDQGLTKKFEFEYDSNEGELH
Ga0307376_103848583 | 3300031578 | Soil | MPEYLMAMSLEEMTEFLESEFYDLEDNPKRCIATMMAMLMDHNEFLEEQGLEEKFDFLYDNDEGDLH
Ga0307376_104680331 | 3300031578 | Soil | MTEFIFAMSAKEMTEYLSNEFNDLPDDARRCIATMMAIIMDHNDFLEDQGLTEKFEFEYDSNEGELH
Ga0307376_107433571 | 3300031578 | Soil | MPEYLMAMSPAEMTEFLESEFYELEEAPKRCIATMMAMLMDHSEFLEEQGLEEKFDFLYDKDGGDLH
Ga0307376_108721641 | 3300031578 | Soil | MPEYVMAMSPKEMTEYLTSEFNDLPDEAKRCIATMMAMIMDHSDFLEDQGLTEKFEFEYDSNEGELH
Ga0307375_101585806 | 3300031669 | Soil | PEYVIAMSPKEMTDYLTSEFNDLPDEAKRCIATMMAMIMAHNDFLEDQGLTEKFEFEYDSNEGELH
Ga0307375_104976613 | 3300031669 | Soil | MPEYVIAMSAKEMTEYLTSEFDDLPDDARRCIATMMAMIMDHSDFLEDQGLTEKF
Ga0307375_107005422 | 3300031669 | Soil | MPEYLMAMSPAEMTEFLESEFYELEEGPKRCIATMMAMLMDHSEFLEEQGLEEKFDFLY
Ga0307375_108256142 | 3300031669 | Soil | NLKEIKPMPEYLMAMSPAEMTEFLESEFYELEEAPKRCIATMMAMLMDHSEFLEEQGLEEKFDFLYDKDGGDLH
Ga0307377_105728461 | 3300031673 | Soil | MTEFIFAMSAKEMTEYLTSEFDELPDDARRCIATMMAMIMDHNDFLEDQGLTEKFEFEYDRDEGELH
Ga0316189_110879313 | 3300032272 | Worm Burrow | EYVIAMSAEEMSEFLESEFYDLEDNPKRCIATMMAMLIDHTEFLEEQGLEEKFDFLYDKDGGDLH
Ga0314858_025968_675_878 | 3300033742 | Sea-Ice Brine | MHEYVMAMSAEEMTEFLESEFYDLEDGPKRCIATMMAMLMDHSEFLEEQGLEEKFDFLYDNDGGDLH
Ga0348335_131988_536_718 | 3300034374 | Aqueous | MTEFIFAMSSQEMTDYLTNEFHDLPDDARRCIATMMAMIMDHTDFLEEQGLKEKFEFEYD
Ga0348337_189728_283_486 | 3300034418 | Aqueous | MPEYVLAMSAEEMNEYLMSDFHDLPDEAKRCIATMMAMIMDHSDFLEDQGLTEKFEFEYDSNEGELH


© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming"