NMPFamsDB

NMPFamsDB

NMPFamsDB

A database of Novel Metagenome Protein Families

A database of Novel Metagenome Protein Clusters

A database of Novel Metagenome Protein Clusters
x
This website uses cookies to improve user experience. By using NMPFamDB you consent to all cookies in accordance with our privacy policy. OK
Sample 3300001157

3300001157: Forest soil microbial communities from Davy Crockett National Forest, Groveton, Texas, USA - Texas A ecozone_OM2H0_O2



Overview

Basic Information
IMG/M Taxon OID3300001157 Open in IMG/M
GOLD Reference
(Study | Sequencing Project | Analysis Project)
Gs0071004 | Gp0055411 | Ga0002908
Sample NameForest soil microbial communities from Davy Crockett National Forest, Groveton, Texas, USA - Texas A ecozone_OM2H0_O2
Sequencing StatusPermanent Draft
Sequencing CenterDOE Joint Genome Institute (JGI)
Published?N
Use PolicyOpen

Dataset Contents
Total Genome Size25993306
Sequencing Scaffolds25
Novel Protein Genes27
Associated Families27

Dataset Phylogeny
Taxonomy GroupsNumber of Scaffolds
All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → Chloracidobacterium → Chloracidobacterium thermophilum1
All Organisms → cellular organisms → Bacteria7
All Organisms → cellular organisms → Bacteria → Acidobacteria3
All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium3
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Armatimonadetes → unclassified Armatimonadetes → Armatimonadetes bacterium 13_1_40CM_3_65_71
Not Available9
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Cyanobacteria/Melainabacteria group → Cyanobacteria → unclassified Cyanobacteria → Cyanobacteria bacterium 13_1_40CM_2_61_41

Ecosystem and Geography

Ecosystem Assignment (GOLD)
NameForest Soil Microbial Communities From Multiple Locations In Canada And Usa
TypeEnvironmental
TaxonomyEnvironmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil → Forest Soil Microbial Communities From Multiple Locations In Canada And Usa

Alternative Ecosystem Assignments
Environment Ontology (ENVO)forest biomelandforest soil
Earth Microbiome Project Ontology (EMPO)Free-living → Non-saline → Soil (non-saline)

Location Information
LocationDavy Crockett National Forest, Groveton, Texas, USA
CoordinatesLat. (o)31.11Long. (o)-95.15Alt. (m)N/ADepth (m)N/A
Location on Map
Zoom:    Powered by OpenStreetMap ©


Associated Families

FamilyCategoryNumber of Sequences3D Structure?
F000621Metagenome981Y
F001123Metagenome / Metatranscriptome770Y
F002113Metagenome592Y
F002379Metagenome / Metatranscriptome566Y
F002675Metagenome538Y
F006953Metagenome / Metatranscriptome361Y
F007823Metagenome / Metatranscriptome344Y
F007889Metagenome / Metatranscriptome343Y
F009919Metagenome311Y
F020968Metagenome / Metatranscriptome221Y
F022661Metagenome213Y
F025373Metagenome202Y
F029282Metagenome / Metatranscriptome189Y
F029794Metagenome187N
F036256Metagenome170Y
F037169Metagenome / Metatranscriptome168Y
F040704Metagenome161Y
F042000Metagenome / Metatranscriptome159Y
F048489Metagenome / Metatranscriptome148Y
F048508Metagenome148Y
F050596Metagenome / Metatranscriptome145Y
F051223Metagenome / Metatranscriptome144Y
F054068Metagenome / Metatranscriptome140Y
F070499Metagenome123Y
F082332Metagenome113Y
F089392Metagenome109Y
F097047Metagenome104Y

Associated Scaffolds

ScaffoldTaxonomyLengthIMG/M Link
JGI12686J13345_100213All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → Chloracidobacterium → Chloracidobacterium thermophilum8771Open in IMG/M
JGI12686J13345_100502All Organisms → cellular organisms → Bacteria5470Open in IMG/M
JGI12686J13345_100563All Organisms → cellular organisms → Bacteria5040Open in IMG/M
JGI12686J13345_100605All Organisms → cellular organisms → Bacteria4748Open in IMG/M
JGI12686J13345_100843All Organisms → cellular organisms → Bacteria3580Open in IMG/M
JGI12686J13345_101048All Organisms → cellular organisms → Bacteria → Acidobacteria2741Open in IMG/M
JGI12686J13345_101197All Organisms → cellular organisms → Bacteria2279Open in IMG/M
JGI12686J13345_101612All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1390Open in IMG/M
JGI12686J13345_101690All Organisms → cellular organisms → Bacteria → Terrabacteria group → Armatimonadetes → unclassified Armatimonadetes → Armatimonadetes bacterium 13_1_40CM_3_65_71296Open in IMG/M
JGI12686J13345_101714Not Available1277Open in IMG/M
JGI12686J13345_101786Not Available1225Open in IMG/M
JGI12686J13345_101925All Organisms → cellular organisms → Bacteria → Acidobacteria1133Open in IMG/M
JGI12686J13345_102107All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium1046Open in IMG/M
JGI12686J13345_102731All Organisms → cellular organisms → Bacteria → Acidobacteria877Open in IMG/M
JGI12686J13345_103053All Organisms → cellular organisms → Bacteria815Open in IMG/M
JGI12686J13345_103859Not Available715Open in IMG/M
JGI12686J13345_104092Not Available694Open in IMG/M
JGI12686J13345_104535All Organisms → cellular organisms → Bacteria657Open in IMG/M
JGI12686J13345_104602Not Available652Open in IMG/M
JGI12686J13345_106088All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium575Open in IMG/M
JGI12686J13345_106471Not Available558Open in IMG/M
JGI12686J13345_106656Not Available550Open in IMG/M
JGI12686J13345_106748Not Available547Open in IMG/M
JGI12686J13345_107378Not Available525Open in IMG/M
JGI12686J13345_107579All Organisms → cellular organisms → Bacteria → Terrabacteria group → Cyanobacteria/Melainabacteria group → Cyanobacteria → unclassified Cyanobacteria → Cyanobacteria bacterium 13_1_40CM_2_61_4519Open in IMG/M

Sequences

Scaffold IDProtein IDFamilySequence
JGI12686J13345_100213JGI12686J13345_1002131F002675VAFSVAIILLLLINPFPSVHSQNQDSAARQTGVLRLRVRVKADDNAPLKGLARKRFFLIRGSLEQNKSVTQIAEQRTLVSRDCYYSKLGASQLFIAWLKANDCESVYCRGIDPQFVTGTNAVPEFANAYAAGQKEFGSDTTALNWLTTNLPENLRDGFYRDRQALIQTLIKQAEVSSGTSVLSVMTDRNGTAYFTDLTPGTYILTNLVPTELPQTTASWSCEVEIKVGDIATEKPYLVSNRKDRNVKCVAVEKPLPVCDK*
JGI12686J13345_100502JGI12686J13345_1005021F042000ILLIDPACFSPGVITGDHILTSYGVITGDGSAFLATNLLFQDLLAEGGYLRDGVITGDGVITGDGVITGDLVATAQSAMILGDTTSVMTTDPDDGIDCLDY*
JGI12686J13345_100563JGI12686J13345_1005632F036256MNDEQFAIAKTIMTRIVMEKIPKAKRWSVSKEFKSWIDGLSKEDFQVYLDRLRLLTQTLSDEGNADQEASRVTGTFIFFGAYPPAEIEQRDQLARIGGEATAYYALALWRSLQVEELQKELRATN*
JGI12686J13345_100605JGI12686J13345_1006057F007823MDQIWGTFVTLIPVFFLLTIVFGAGIYIGRVSKRE*
JGI12686J13345_100843JGI12686J13345_1008434F020968MTGAVTIISRTYRKIKGKYYARVRYKIDDGKPLDILRQVDNKSAIKPKQAEIEVELLKNGPAQLVAGKVTFRQLAQHAKDNIYVSALYDNQETKIKGVRSVVPAHSALNNLVAFFGDVDIRKINEKKLTKYQVARLTGTIKNLRKVSLSTVDRELSKAR
JGI12686J13345_101048JGI12686J13345_1010485F022661TAKGEVVELKYAYAGRAVRFGNESIVIFLTDKPIPSDKVAAEVKEPAMLESEQIRGLEYVIDGDGMWVRFHPSQYQESTSNKLKDYKVEGDVVSGVDEGNDLSSGKYSRSVKFSASIVK*
JGI12686J13345_101197JGI12686J13345_1011974F040704RIFPTKWLASSIDHDRKIAEDLFHFTGCDVNTPMNQLGTDLTSVWRRAFRYRADREATANALRVREGKSIETTSRCSDGGWMFDGKMLRFSREIATPSPDTAMRLVLRVKP*
JGI12686J13345_101612JGI12686J13345_1016122F048489MVTRKAATKVATKVSVLKFNPEWFSDPPPPFFKNLDRVAQRELAAAKKEFTARVKEILAKGQQR*
JGI12686J13345_101690JGI12686J13345_1016902F025373MTQPEPIIAASQAIRFPRLYAAIKWTIITSLILLAVLAAAFWTMVLGGFAASGFRASGPATGAGESLVITPECAWPLSVHEQDAGTVCRMFYNLTPEQRARVLARRK*
JGI12686J13345_101714JGI12686J13345_1017142F097047MPIQESSATALVRFTGLGIVCFNQEHQRGEIAGIRDNKHVLTIKIQRPVYQDGAEKDVIAYKDVVTYENLPNDDVRIEIKASGNPPIAGYEIYQNGEFDRLGAHDVNDFRWLVNMSTLHGDSALQTSPQQRYPLTKIYISNGLFYAHKLDRNLFFEKVEKNANGLSTLSEVFGNVA
JGI12686J13345_101786JGI12686J13345_1017861F002113MEDLLAIEAEAKYLLAASDRSHEQKLMQELLRIFKEAELLFGPRDVSYQLSVPRITECAAARSYIFQPLRKTRIYLSREAGTKPWIASLELAHEAIHVLSPVPFGSGPTILEEGLAEWFAQRYADRVHGLTFERACDPKADDVMRAVSTLLAKNQFVIKTLRTRQPVISKIDEKLLVEVAGIGPSEAKFLCTDFKSYWRTSPLSKFAAQGAERVAACLRSIWDQK*
JGI12686J13345_101925JGI12686J13345_1019251F002379MYFRRLSLALLTLLFAASPLAAQAQKPAPKPADAWAEWETITPEGEEFTVTMPKNPTTESTKFPYHKMELNTRLYLAKSTSGPVLAIASFSGIKANPALYTEFQRFNSYIDAFKDFFPPKVRTKDTAITKLTLVSSRPFNGHTGRSYKVSIGELNGVLHAF
JGI12686J13345_102107JGI12686J13345_1021072F007889THANWGAELVDADEDGYQELLFSGKDSSESRNLRRLILFVPNEKRTYSMQMTGETTSSGTPRIQWLSNATGTEAAAYRTALRQKARLLVSKKR*
JGI12686J13345_102731JGI12686J13345_1027311F037169LILLKIGEELGIPVDIQNERLDPVDANISKLPIEDVARQLSPHIKLYYRADLTRAEKRALRMVLAEPPKATQGP*
JGI12686J13345_102731JGI12686J13345_1027312F001123MQNEIINEENSVKNANGADELKKAFVEPAISVPVDVLEATTFFQVASSGATN*
JGI12686J13345_103053JGI12686J13345_1030532F006953MRDTKRHKRYKTILQIHLCLLCFLCLFVATVEAKGRPEILGVSLGMTREATQQRLKTIGRLEKEERKRQEVWAVNDPRISHILVGYDTDYRVRYITAIARAGGPRIRYQEVVDVKHAQRVNYQGNYKFTLEARRGQVAYVTIAHGRDP
JGI12686J13345_103859JGI12686J13345_1038591F089392MPTTFRAALALIILNLSFSTVASAQSHSLSSTPIRAAEEENLRTLTAEYGRALGAGDLDAMRKFWNPQSPNLTSQLRSYKNVFLQARLELTSPEVTRLEIDGDKAVSQLTLDERRLDRKTGAVLLTFDPFRGACRSFEWTRTSSGWKIEREVLVQDELAARLEATRSDRQRDEILEQEKRFVNNTLIGA
JGI12686J13345_104092JGI12686J13345_1040921F070499PIKRSCRSCSGERKQRRPQACCLRSDQMTKLIKTLFLTCLICFVTIGALGQRSARQQGGSVDGSKLLELGDYYYRSNDISDAADRYYQQAIDSSPGSQTAGYAQYNRGNYWFRKFYVLNEQRSKPDHSALTQAERHYYNFVDKFARQTNTVGLLADAEFYLALVYLQQGNREYAIGWLNHLQAEAAKQDQSIYVYKVVWSSRTGDVVDRNLDTAQLAAYARDAIKKNTDTN
JGI12686J13345_104535JGI12686J13345_1045351F054068LLAIFKMVYMRRAAATMLRVVQFFVGFLPGQTVPRFTMPADPNLTLPYAVAICLGSVVSFFLFRV*
JGI12686J13345_104602JGI12686J13345_1046021F000621MPTTKHELLDWLMDVPEDAEIGTDGAGLALLAILGTNVHLLEVGHIPNADELYAEAINQAMMERLRRIHASGGETDTGVIIVTFQGYISGIPSLFSSDFDTAFVFKNKEQAEAFITEFADELHNPQILDCP*
JGI12686J13345_106088JGI12686J13345_1060882F048508MKFLALIFLSVVSVFQTADVKNFNANSVSFDYPNGWVLNDDSNSDAQQLTLARPNNDVQIRVFVHKGKITTEKFADAKKSFIDPYIASTNKQFVAMGAKPEQSPDSTEIGGAKAEGVKIAANLGEPGAAKIYWALVGQRVVILTIFGPDREIKQFTPAW
JGI12686J13345_106471JGI12686J13345_1064711F029794MPTTKHELLDWLLDVPEDAEIGTDGQGLALLAILGTNVHLLEVGRIPNADELYAEAINQAMMERLRRIDAEGGET
JGI12686J13345_106641JGI12686J13345_1066411F029282SSGCKQCGARAVGEPLPRPAHELPSYGRALALSISGSLIVLVFLVQTIASMVQRGTGWFEFWSWAAAAETAAWRLKWISLPVMFVTLWFGAKLYRSIRQQPAKFCGLTYARSGLIASATVAFLIALLIGVTVPARLRQRELAKEAQIRADWHTFEAAALEYQHRFHTYPADLKDLRDRLPDPY
JGI12686J13345_106656JGI12686J13345_1066561F051223MKMSKISRALGAAMLCLGLLVSASVVIAKQDPAMPAKEKKQKTTMINGTVSAVTDSALTVIDSQKAEHTLAVTAETKVTKGGKDAVLADVKANDVVTIEAQK
JGI12686J13345_106748JGI12686J13345_1067481F082332PATSAEIFKYNIGVRKSLAARARYFAYMLRPTDSDFGSRALPPGFGFAYYLIRPFRLLFKNKQTGLQN*
JGI12686J13345_107378JGI12686J13345_1073782F009919NIQYKLNNWVTFALEEGYYRTRAANNSVFDFGGLPLYRGIPSRTAHNIRSEFATIFNF*
JGI12686J13345_107579JGI12686J13345_1075791F050596RRGLTLTQALIAAGGVTPKAKEARLGRDDGRGFLTVTRIKLKDIESGKVPDPQVRPGDRITIDK*

 ⦗Top⦘



© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice
Make sure JavaScript is enabled in your browser settings to achieve functionality.