NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300002660

3300002660: Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF103 (Metagenome Metatranscriptome, Counting Only)



Overview

Basic Information
IMG/M Taxon OID: 3300002660
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0085736 | Gp0056633 | Ga0005454
Sample Name: Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF103 (Metagenome Metatranscriptome, Counting Only)
Sequencing Status: Permanent Draft
Sequencing Center: DOE Joint Genome Institute (JGI)
Published: N
Use Policy: Open

Dataset Contents
Total Genome Size: 6397062
Sequencing Scaffolds: 37
Novel Protein Genes: 39
Associated Families: 35

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
Not Available | 30
All Organisms → cellular organisms → Bacteria | 3
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Burkholderiaceae → Burkholderia → Burkholderia cepacia complex → Burkholderia cepacia | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Micromonosporales → Micromonosporaceae → Micromonospora → unclassified Micromonospora → Micromonospora sp. L5 | 1
All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Forest Soil Microbial Communities From Harvard Forest Long Term Ecological Research (Lter) Site In Petersham, Ma, For Long-Term Soil Warming Studies
Type: Environmental
Taxonomy: Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil → Forest Soil Microbial Communities From Harvard Forest Long Term Ecological Research (Lter) Site In Petersham, Ma, For Long-Term Soil Warming Studies

Alternative Ecosystem Assignments
Environment Ontology (ENVO): forest biome | land | forest soil
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Soil (non-saline)

Location Information
Location: Harvard Forest LTER, Petersham, MA, USA
Coordinates: Lat. (°) 42.532967 | Long. (°) -72.180244 | Alt. (m) N/A | Depth (m) 0 to 0.1


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F000159 | Metagenome / Metatranscriptome | 1863 | Y
F000396 | Metagenome / Metatranscriptome | 1185 | Y
F001233 | Metagenome / Metatranscriptome | 741 | Y
F001418 | Metagenome / Metatranscriptome | 698 | Y
F001533 | Metagenome / Metatranscriptome | 676 | Y
F002272 | Metagenome / Metatranscriptome | 576 | Y
F002596 | Metagenome / Metatranscriptome | 544 | Y
F003383 | Metagenome / Metatranscriptome | 490 | Y
F006591 | Metagenome / Metatranscriptome | 369 | Y
F014145 | Metagenome / Metatranscriptome | 265 | Y
F016398 | Metagenome / Metatranscriptome | 247 | Y
F017080 | Metagenome / Metatranscriptome | 242 | N
F024561 | Metagenome / Metatranscriptome | 205 | Y
F029356 | Metagenome / Metatranscriptome | 188 | Y
F030305 | Metagenome / Metatranscriptome | 185 | N
F031504 | Metagenome / Metatranscriptome | 182 | Y
F033191 | Metagenome / Metatranscriptome | 178 | Y
F033437 | Metagenome / Metatranscriptome | 177 | Y
F034304 | Metagenome / Metatranscriptome | 175 | Y
F036494 | Metagenome / Metatranscriptome | 170 | Y
F037840 | Metagenome / Metatranscriptome | 167 | N
F047772 | Metagenome / Metatranscriptome | 149 | Y
F047783 | Metagenome / Metatranscriptome | 149 | Y
F058585 | Metagenome / Metatranscriptome | 134 | N
F059538 | Metagenome / Metatranscriptome | 133 | N
F061605 | Metagenome / Metatranscriptome | 131 | N
F068321 | Metagenome / Metatranscriptome | 124 | N
F068857 | Metagenome / Metatranscriptome | 124 | Y
F074243 | Metagenome / Metatranscriptome | 119 | Y
F082272 | Metagenome / Metatranscriptome | 113 | Y
F088037 | Metagenome / Metatranscriptome | 109 | N
F093040 | Metagenome / Metatranscriptome | 106 | N
F098746 | Metagenome / Metatranscriptome | 103 | Y
F100490 | Metagenome / Metatranscriptome | 102 | N
F103604 | Metagenome / Metatranscriptome | 101 | N

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0005454J37223_100023 | Not Available | 946
Ga0005454J37223_100031 | Not Available | 650
Ga0005454J37223_100037 | Not Available | 895
Ga0005454J37223_100186 | Not Available | 609
Ga0005454J37223_100193 | Not Available | 778
Ga0005454J37223_100227 | Not Available | 702
Ga0005454J37223_100230 | Not Available | 570
Ga0005454J37223_100298 | Not Available | 789
Ga0005454J37223_100304 | Not Available | 619
Ga0005454J37223_100354 | Not Available | 591
Ga0005454J37223_100393 | Not Available | 762
Ga0005454J37223_100457 | Not Available | 621
Ga0005454J37223_100548 | Not Available | 595
Ga0005454J37223_100698 | Not Available | 789
Ga0005454J37223_100851 | Not Available | 816
Ga0005454J37223_101232 | Not Available | 593
Ga0005454J37223_101267 | Not Available | 541
Ga0005454J37223_101297 | Not Available | 601
Ga0005454J37223_101423 | Not Available | 586
Ga0005454J37223_101565 | Not Available | 602
Ga0005454J37223_101775 | Not Available | 672
Ga0005454J37223_102587 | Not Available | 693
Ga0005454J37223_102626 | Not Available | 746
Ga0005454J37223_102796 | Not Available | 566
Ga0005454J37223_102901 | Not Available | 756
Ga0005454J37223_103066 | Not Available | 840
Ga0005454J37223_103290 | Not Available | 843
Ga0005454J37223_103383 | Not Available | 736
Ga0005454J37223_104667 | All Organisms → cellular organisms → Bacteria | 941
Ga0005454J37223_104883 | All Organisms → cellular organisms → Bacteria | 1213
Ga0005454J37223_104899 | All Organisms → cellular organisms → Bacteria | 550
Ga0005454J37223_105178 | Not Available | 537
Ga0005454J37223_105546 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium | 572
Ga0005454J37223_105807 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Burkholderiales → Burkholderiaceae → Burkholderia → Burkholderia cepacia complex → Burkholderia cepacia | 559
Ga0005454J37223_105992 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Micromonosporales → Micromonosporaceae → Micromonospora → unclassified Micromonospora → Micromonospora sp. L5 | 640
Ga0005454J37223_106757 | Not Available | 556
Ga0005454J37223_107681 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 514

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0005454J37223_100023 | Ga0005454J37223_1000231 | F003383 | MRRRVAPLPRSSRSARVASLGCPVPAPFLLSRRPNPQVAPWFRAFGCAGDGRSSCPERRMPLALLVSARLRVAPENLAFSCSACDVGLGSPLVLHLRLYRRWIIESPRCSHHSAVPTGRSSGFPKSQPFGIADDSPSELPQTLNPPAPIDGYPSYLGSHTIRFALVESPGCPGHSSLATAIDQFPGCPKSWVSHRSPILRASSLPESWFLG*
Ga0005454J37223_100031 | Ga0005454J37223_1000311 | F058585 | QYSRCCVLRLPQWPPSSFRWRCYHWLRWLSELRLASGRCTPARPVANLPARIGFVSFGSTGCKPPTCVDCSALPLVLRRSIWLAPDDPGLIGLAPDNPGSTRLAPHGSFLRLGRQPTSDSHRCRPLARLAASSGLRLMLPLPLGWLRFRFAPAASPSALTVREPLGLRLVVPSPAEPVMHSLFQLNLASPAKPSMSILYPPALASSGIFQLNNFRL
Ga0005454J37223_100037 | Ga0005454J37223_1000373 | F016398 | MEASERKQPLQARERPLRKIGSVEETDCGGPEGVDPEGEKSNLGERNGEHSVLL
Ga0005454J37223_100186 | Ga0005454J37223_1001861 | F068857 | LRVLSLEDGNGLSAANTPGGVKNAIEDGSWRPNGLEEEFRISAKR*
Ga0005454J37223_100193 | Ga0005454J37223_1001932 | F029356 | MTTLILNPRGKDGGRPPARVAGRGEAKTRKLGRLPGESPGTEVRNVSAEAKAVRLLERSLGTL
Ga0005454J37223_100227 | Ga0005454J37223_1002271 | F001418 | KKPQGILAEAEFTSSSGSVRALSVNAKKGLSDKVRKETAPSRVK*
Ga0005454J37223_100230 | Ga0005454J37223_1002301 | F014145 | LPACTALSCANKSAGRFASPLPDGRFLQPPDQCFLARRLLPLARNRLLVTAFRSPATVAASRRPPFRGQSSQPATSLPSKSVFVPVRPFGSTTASRIAPVAAVSLPVARCTATTRFGLPRLRSPLPSGTFASLGIKAFNRVCCLPVRLTNPPDFLSLPAALPD*
Ga0005454J37223_100298 | Ga0005454J37223_1002981 | F030305 | GVFRMPLQFIGSCIVPFGFLVPVPSFLFPALSDLARRCSSGRPVPRVSDRTGDEAPSCPGSSVFSAVPADGSSSRPDSRILQLGSPQIARLPRLSTSCLAVDERPGCPVRSIIWLYRRRRFRVAPNLTSFGGTVSNSPSRPGSSLLQPLPLMVLRVAPGAPSSGFAGGDSSGCPEALIPRLCRLVAFRVSPDLPPSDICRFRFSGLPQIGFLGGSMMNPRFARTLHPRSIQLTSLQVAPKFPLPAAPRMNLQTHSGLAFLPT
Ga0005454J37223_100304 | Ga0005454J37223_1003041 | F100490 | TNIGQFTRLLRSLELGAVSHRTVLLLWDQKRLLRMRDRYGGPKKHLPTPG*
Ga0005454J37223_100354 | Ga0005454J37223_1003541 | F000396 | VPSDPISQPNSPPACNGTELCSQNRRAPSLWLPAALFRKQRINASRLACQLLWPEPVSRSGLSLSRNDRPSPGYHFEVKAPDLLLRRPAVRSSCPFGFRFPHALRFAPMRAGSLPKSRCLTPVRHFQPFLGSPLPFRAFRTPKDQSVQPNSWPKSSPSEHSRFPVTPRHRFYFISADARSPLRSR*
Ga0005454J37223_100393 | Ga0005454J37223_1003932 | F033437 | RHALFPIGTMLGALAFAGLVLPDATLRGMRSSRSHGGIAPTVAGWSLLSEAFSPGSDTPCRGRHAGRGADTPASIGFHVGTVTTGTAPAFGRPFAPRYGSSLLTLRLRSLLQ*
Ga0005454J37223_100457 | Ga0005454J37223_1004571 | F003383 | MRRRVAPLPRSSRSARVASPGCPFPAPFLLSRRPNPRVAPWFRAFGCAGDGSLELPRTCMPLALLVSARLRVAPVALAFSCPACDVGLGSPLVLHLRLYRRWIIESPRCSHHSAVPTYQSSSCPKSQPFGIADDSLFELPRTLNPPVPIDGYPSYLGSRTIRFALVRSPSCPGHLPSATAIDQFPGCPKSWVSHRSPIPLASSFPE
Ga0005454J37223_100548 | Ga0005454J37223_1005482 | F059538 | PLLRPSATPAVRLQLSLPLLPLAAPVSNLRLASAALPPARPRANPPARIGVVSPGSVSGKCSAFAAYYALLIDWLLTFQLALVSGLQLGLRLLPTHIWRCPSARLVFQLPVLTGCRCNN*
Ga0005454J37223_100698 | Ga0005454J37223_1006982 | F029356 | MTTLILNPRGKDGGRPPARVAGRGEATTRKLDRLPGESPGTDVRKISAEAKAVRLLERSLGT
Ga0005454J37223_100851 | Ga0005454J37223_1008512 | F017080 | LDESVVQPVLRPLATPEAGLQLSLATSSSGCAGFEPPTCVGCSTSGSTGGQPSGSDRRSVLRLDRWQAPGFRRLHCASARPVANLPTCVGVLPPARPATNCRLTSGADPSARLVPNHRLSPAVVAAFSLRLLPLRLSSLRWLSPVCHTGGELPTRIGCYAIQLHRF*
Ga0005454J37223_101232 | Ga0005454J37223_1012321 | F088037 | VLQSSTCIAVLPGADISKGLSLASRDFLFPGCLEKVNAPGCFLQRPAEISLKPVPPTASPLNPVCPGLGGILAMSPLPDFVSALLAATVSPLPSRDFYIPLRIAAFDAACHLKAYLLELPDFPSLPAGLPLLTSGRRIIVPGPLLLTRFVCSVNL
Ga0005454J37223_101267 | Ga0005454J37223_1012671 | F068321 | HRMSLYRMLTSVSERLHSIHWLRTDSLGNLLPTWARLGPLSVNEQGAALSHRRFDRGSFRCDRAPNGVAWRFPDWTAVGFAFPSPFLGSRSLWMSVTRILVTCWARFDHDRGSLSPRLGEPLAAILTRIASGVFVADDFRVALSLAADSRRARYPLLRTDFAQKVGLRIRSLFLFRRSL
Ga0005454J37223_101297 | Ga0005454J37223_1012971 | F093040 | MRLAEAEGVKHALGVHHPKTGTDRKGPGLPTNPDLSPVTRDYPRRSRVDGQSPREASATLNLP
Ga0005454J37223_101423 | Ga0005454J37223_1014231 | F061605 | SK*PSITAKLPAGMYGTELCGQNRGTIFPSAPRLLLSTLAVQRFRARLPAFLAGADVSKRPFALPKRLPVSRPPFRGQRSRSATSTPCLSFAAPVPLSTPPRAPVCPGTREISAKNP*CNSRPALPTVPRISTPFQGLSNPSGSKRSIRFPAGKLTFRTLPIALHSPVPALFD*
Ga0005454J37223_101565 | Ga0005454J37223_1015651 | F047783 | GQQTLNQNKPRCPPHQTEMKRMNMSVNKKFWILGATSAVAVGLIGGVFGAFLSVAHPAQSSSNTPVASAPVTIEPGTSILLLNKGEAHASVKVDRNGLILLNLTTKTGQNQIALGVLGDSKLEVGVFDSAGKAKAGMEVPMKDSGRVHMLLLDKKNALGSYTHIES*
Ga0005454J37223_101775 | Ga0005454J37223_1017751 | F082272 | VKPEAKAAEGSLVAPEIAKADASRRLLKLADPEDARTGGYGILIDGSAGDARFG*
Ga0005454J37223_102587 | Ga0005454J37223_1025873 | F024561 | VALGQRTAKSGGRPSANGEEAETQSQTCLHPVRESASANAGSSEPRGLALRFSRRKALDG
Ga0005454J37223_102626 | Ga0005454J37223_1026261 | F001233 | RLSPKISRIIPGDWGKDESGWLVHPLINRAARSGSGGRIHQFLWRRSHAVGYAKRNCAVRGD*
Ga0005454J37223_102796 | Ga0005454J37223_1027961 | F098746 | VAPGKRTAKSGGRPGARWLRRGSAIHKRVLTWFASRRPQTPAEASLDASLKIQRPQGTGRPCYEADNSACRGEWRREAGSQCAPNASMGKREWRTPIP
Ga0005454J37223_102901 | Ga0005454J37223_1029011 | F033191 | KLIDGLAGESPASADCPVGTVVISGDGAGDQTVGSPDVNVLDGWT*
Ga0005454J37223_103066 | Ga0005454J37223_1030663 | F006591 | RHTLFPIHRTLALAPSQVGGRLLDASPRGNAEFSVSWQDRRRNCFRRLLGEAFQHGSHGSCRRNAPHKVMRPLQRDLVTTGRQDCPRELPPGRWPDGTGLAS*
Ga0005454J37223_103290 | Ga0005454J37223_1032901 | F031504 | MPRAFTTRKWELAARGRDYQLIPTFPRLPGIIPEDPAGRPPLNEATSAQTFR
Ga0005454J37223_103290 | Ga0005454J37223_1032903 | F006591 | VGARHTLFPIRRSLGAPPLQVGGRLLDASPRGNAEFSVSWQDRRPDCSKRLLGEAFQRGSHGPCRGSVPHKVMRPLRHGLVTAVRQVGPRELPPERWPDGTGLASYPSSDI*
Ga0005454J37223_103383 | Ga0005454J37223_1033831 | F029356 | MTTLILNPRGKDGGRPPARVAGRGEAKTRRLGRLPGENPGTDVRNVGAEAEATRLPERSL
Ga0005454J37223_104667 | Ga0005454J37223_1046672 | F002272 | MNANYWEQRTALIRDGAVRVLSLSTPEEVDYWRDQLKAHRRNSHELEVMTWGNHASTLRGRADYGHLDEIAEYVFQFIRTSEGKLLKFGTV
Ga0005454J37223_104883 | Ga0005454J37223_1048831 | F002596 | FREKEPTMKFKIKGGRLAMLLVPALALAAPAFAQVRATTLSDSQEPGSVIVFPKFIQGAISTPEGTSLPITELEIGVVCPKGVICTEHQSVKIRFHWVCGATEADEATSFVCKETDFDITATVFEKIVLTPNAETPGFYAVAGGGLPTKFAPAPECPQGGGYLIGWVINTSDQPIKFDGLVGDAHLRPGSPVPSVAGALNPFAGSPTALADYDAIPIQADPKLGNLALITTNGNGALIFDGAAGHYLAVTGQVMGDVRYTNLTTGPTFTSGALTLLTLDVKSNRPNLATFVDLDFFGGNPSAIGNENQLSTSTDFICWEEIPITSISSDLTTTQMGRKGVFVSAPAVDVTGAPVTLLGLSEVFEGGTFPPTAAWPRASFTGLFNSSNPVPTRFVPTPSPNFLP*
Ga0005454J37223_104899 | Ga0005454J37223_1048991 | F036494 | MMLAADRIAANVLTASPELRDALYRKYLGISEEWLRSHGASEAEEAQETLDSVEQIARRLVAMRLTNVRETTAHKEQRA*
Ga0005454J37223_105178 | Ga0005454J37223_1051781 | F047772 | SDGHESRSILPRGDFMNGSKGIGWCRVLASTAVLFGAIFTILLLPASAQQDVSPDWYDPAPNAAVVHPAQPAAAVHSSQPPVAAHRYQQTVKSASTASDAGKLRVKDAQVHSGHNPAQKSGGAPSGELVAIASRDPR*
Ga0005454J37223_105546 | Ga0005454J37223_1055461 | F074243 | TVTGVQIGDVRFDKTTAGAPLPNVLSKTNLTFLTLDVLSDEPNAPTFIDIDFWNESQATVAGSSSPSFEHLTSTFTEFVCWNQVPLSSLAGGNLTQAFQGTRKGVVIAGPAMKIPDGNAPADVSSPTAVTLIGLVETVEGTAANGFLERKYNFNLQTNGVPVATTFVPSPIFP*
Ga0005454J37223_105807 | Ga0005454J37223_1058071 | F037840 | EMKTLKLKIESKKETKMKMTAVVKTSLYVAALGLACMLPVTARAQSDVMPDSFAFSADEAPAARPALVAAVDDVNQTQDDFQGKVTLPYNAKCAGKNLKAGQYLVSVKLEATGRVVTIHGDGADMNIRVREVPTNRRASQSALLVRNSGHGRTLAAVSVEGLNAMFYLGTSASQALTERLPIS*
Ga0005454J37223_105992 | Ga0005454J37223_1059921 | F103604 | GEEMKLTSVALCAFLMVLAGAGSRAAYGVTSKIIIKAPDPTCPPPQGTQSISFDGLVPNADGSSVNGGSVPIPVDGSTTFGDNEFANCTGETLDVLTVTIDDIPLNQQYIVLLSGDAFDGFSTGPISNSSETLELYCDPGFFGTTCDGLSGVAGQDNGVSFTIAPEPTEAPMLLLGIGGVFLLGLRARKGRKQLRVAGVV*
Ga0005454J37223_106757 | Ga0005454J37223_1067572 | F034304 | MSGINGDKARFHRERKQRIAKRNRNRELLKSLIEQPKPAAPGSGAKAKAVAE*
Ga0005454J37223_107681 | Ga0005454J37223_1076811 | F001533 | MGYPEHMDHLGVLRETIGRLRAEIANIQELNQQFRRDSGKGTGAQVAHGQRSERLQAIQRELMQLADLGRRVVSTEQRKEQHRSRLHLFTQERAS*
Ga0005454J37223_112499 | Ga0005454J37223_1124991 | F000159 | MKFNSILRSLALAAALVSTVPLLAKPFAKTINIAQSAKIGKADLQAGEYRLMIDGNKATVQKGRQTIVESEGRWEDRSSKAANDSVLIGEDGQVKEVRFSGQTRVFVFSE*
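
Rows of the Sequences table can be turned into FASTA records for use with downstream tools. The following is a minimal Python sketch and not part of NMPFamsDB: the pattern `ROW` and the helpers `parse_row` and `to_fasta` are hypothetical names, and the ID shapes (a `Ga…J…_…` scaffold ID, a protein ID of the same shape, and a family ID of `F` plus six digits) are assumptions inferred only from the rows listed on this page. The column separators are optional in the pattern, so it accepts rows with or without ` | ` delimiters.

```python
import re

# Assumed row shape, inferred from this page only: scaffold ID, protein ID,
# family ID (F + six digits), then a sequence of uppercase letters and '*'.
ROW = re.compile(
    r"^(?P<scaffold>Ga\d+J\d+_\d+)\s*\|?\s*"
    r"(?P<protein>Ga\d+J\d+_\d+)\s*\|?\s*"
    r"(?P<family>F\d{6})\s*\|?\s*"
    r"(?P<sequence>[A-Z*]+)$"
)


def parse_row(line):
    """Return a dict of the four fields, or None if the line is not a data row."""
    match = ROW.match(line.strip())
    return match.groupdict() if match else None


def to_fasta(record, width=60):
    """Format one parsed record as a FASTA entry, wrapping the sequence."""
    seq = record["sequence"]
    header = (
        f">{record['protein']} "
        f"scaffold={record['scaffold']} family={record['family']}"
    )
    chunks = [seq[i:i + width] for i in range(0, len(seq), width)]
    return "\n".join([header, *chunks])


# One row copied from the table above.
row = (
    "Ga0005454J37223_100186 | Ga0005454J37223_1001861 | F068857 | "
    "LRVLSLEDGNGLSAANTPGGVKNAIEDGSWRPNGLEEEFRISAKR*"
)
print(to_fasta(parse_row(row)))
```

The `*` characters are kept as they appear in the table; strip them before writing the FASTA file if a downstream tool rejects them.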

© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice