NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300002662

3300002662: Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF105 (Metagenome Metatranscriptome, Counting Only)



Overview

Basic Information
IMG/M Taxon OID: 3300002662
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0085736 | Gp0056635 | Ga0005456
Sample Name: Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF105 (Metagenome Metatranscriptome, Counting Only)
Sequencing Status: Permanent Draft
Sequencing Center: DOE Joint Genome Institute (JGI)
Published?: N
Use Policy: Open

Dataset Contents
Total Genome Size: 6940556
Sequencing Scaffolds: 23
Novel Protein Genes: 26
Associated Families: 26

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
Not Available | 19
All Organisms → cellular organisms → Eukaryota → Viridiplantae → Streptophyta → Streptophytina → Embryophyta → Tracheophyta → Euphyllophyta → Spermatophyta → Magnoliopsida → Mesangiospermae → eudicotyledons → Gunneridae → Pentapetalae → asterids → lamiids → Solanales → Solanaceae → Solanoideae → Solaneae → Solanum → Solanum tuberosum | 1
All Organisms → cellular organisms → Eukaryota → Opisthokonta → Fungi → Dikarya → Basidiomycota → Agaricomycotina → Agaricomycetes → Agaricomycetes incertae sedis → Cantharellales → Botryobasidiaceae → Botryobasidium → Botryobasidium botryosum | 1
All Organisms → cellular organisms → Eukaryota → Opisthokonta → Fungi → Dikarya → Basidiomycota → Agaricomycotina → Agaricomycetes → Agaricomycetes incertae sedis → Polyporales → Fibroporiaceae → Fibroporia → Fibroporia radiculosa | 1
All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Forest Soil Microbial Communities From Harvard Forest Long Term Ecological Research (Lter) Site In Petersham, Ma, For Long-Term Soil Warming Studies
Type: Environmental
Taxonomy: Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil → Forest Soil Microbial Communities From Harvard Forest Long Term Ecological Research (Lter) Site In Petersham, Ma, For Long-Term Soil Warming Studies

Alternative Ecosystem Assignments
Environment Ontology (ENVO): forest biome | land | forest soil
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Soil (non-saline)

Location Information
Location: Harvard Forest LTER, Petersham, MA, USA
Coordinates: Lat. (°) 42.532967 | Long. (°) -72.180244 | Alt. (m) N/A | Depth (m) 0 to 0.1


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F000396 | Metagenome / Metatranscriptome | 1185 | Y
F001343 | Metagenome / Metatranscriptome | 719 | Y
F003497 | Metagenome / Metatranscriptome | 483 | Y
F007311 | Metagenome / Metatranscriptome | 353 | Y
F009043 | Metagenome / Metatranscriptome | 324 | Y
F010404 | Metagenome / Metatranscriptome | 304 | Y
F016972 | Metagenome / Metatranscriptome | 243 | Y
F017080 | Metagenome / Metatranscriptome | 242 | N
F018933 | Metagenome / Metatranscriptome | 232 | N
F024561 | Metagenome / Metatranscriptome | 205 | Y
F029356 | Metagenome / Metatranscriptome | 188 | Y
F030305 | Metagenome / Metatranscriptome | 185 | N
F038685 | Metagenome / Metatranscriptome | 165 | Y
F047066 | Metagenome / Metatranscriptome | 150 | Y
F052338 | Metagenome / Metatranscriptome | 142 | Y
F055474 | Metagenome / Metatranscriptome | 138 | Y
F058585 | Metagenome / Metatranscriptome | 134 | N
F072941 | Metagenome / Metatranscriptome | 120 | N
F078064 | Metagenome / Metatranscriptome | 116 | N
F079670 | Metagenome / Metatranscriptome | 115 | N
F080064 | Metagenome / Metatranscriptome | 115 | Y
F088984 | Metagenome / Metatranscriptome | 109 | Y
F096734 | Metagenome / Metatranscriptome | 104 | N
F098814 | Metagenome / Metatranscriptome | 103 | N
F098815 | Metagenome / Metatranscriptome | 103 | N
F102643 | Metagenome / Metatranscriptome | 101 | N

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0005456J37224_100065 | Not Available | 863
Ga0005456J37224_100184 | Not Available | 817
Ga0005456J37224_100490 | Not Available | 601
Ga0005456J37224_100502 | Not Available | 547
Ga0005456J37224_100582 | Not Available | 589
Ga0005456J37224_100613 | Not Available | 887
Ga0005456J37224_100798 | Not Available | 807
Ga0005456J37224_100863 | Not Available | 515
Ga0005456J37224_101052 | All Organisms → cellular organisms → Eukaryota → Viridiplantae → Streptophyta → Streptophytina → Embryophyta → Tracheophyta → Euphyllophyta → Spermatophyta → Magnoliopsida → Mesangiospermae → eudicotyledons → Gunneridae → Pentapetalae → asterids → lamiids → Solanales → Solanaceae → Solanoideae → Solaneae → Solanum → Solanum tuberosum | 586
Ga0005456J37224_101140 | Not Available | 898
Ga0005456J37224_101842 | Not Available | 594
Ga0005456J37224_101899 | Not Available | 699
Ga0005456J37224_102234 | Not Available | 815
Ga0005456J37224_102434 | Not Available | 628
Ga0005456J37224_102701 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Fungi → Dikarya → Basidiomycota → Agaricomycotina → Agaricomycetes → Agaricomycetes incertae sedis → Cantharellales → Botryobasidiaceae → Botryobasidium → Botryobasidium botryosum | 600
Ga0005456J37224_102970 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Fungi → Dikarya → Basidiomycota → Agaricomycotina → Agaricomycetes → Agaricomycetes incertae sedis → Polyporales → Fibroporiaceae → Fibroporia → Fibroporia radiculosa | 1675
Ga0005456J37224_103298 | Not Available | 607
Ga0005456J37224_104039 | Not Available | 739
Ga0005456J37224_104375 | Not Available | 527
Ga0005456J37224_105604 | Not Available | 781
Ga0005456J37224_105873 | Not Available | 775
Ga0005456J37224_106847 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 648
Ga0005456J37224_107348 | Not Available | 634

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0005456J37224_100065 | Ga0005456J37224_1000651 | F058585 | LWRFNLDHSQYSRCCVLRLPQWPPSSFRWRCYHWLRWLSELRLASGRCTPARPVANLPARIGFVSFGSTGCKPPTCVDCSALPLVLRRSIWLAPDDPGLIGLAPDNPGSTRLAPHGSFLRLGRQPTSDSHRCRPLARLAASSGLRLMLPLPLGWLRFQFAPAASPSALTGREPLGLRLVAPSPAEPVMHSLFPPNLASPAKPSMSIPFPLALASSGIFQLNNFRLASAFALLVRPAIPLRLAPQVSPSVRAGGHPPVLTGCPLRQPCWRSTSGLRRLSLLPAFRQPT
Ga0005456J37224_100184 | Ga0005456J37224_1001842 | F018933 | HVPSCWSQPSLLRRSRITPRSTLDSLYRPRIAPRSVLIDRCCPPIARRSTLNARRRSQIAPRSTLCARAGHGLLHVQHLESSPRVQIAPYSTLCVPRRSQIAPRSTLPALYRSRIAPLSAIDAPCRSTDISVRLHFAPCAVHGLLRAQHFKLRTAHRLLCVRRLHFVPIADCSAFLTPCLVSLADCSVLNT*
Ga0005456J37224_100469 | Ga0005456J37224_1004691 | F102643 | MGGRSHPKLNILGRPIAKKYSDGKVKRTLKRRSKVLETVKRETDGASNYPMGIQTIHFCSISQWFDRSTECPVRLLLGCYKRYLVCMFVNIKYERGINHGGRWTGHVW*
Ga0005456J37224_100490 | Ga0005456J37224_1004901 | F010404 | MQPFTLLQRRFALQQIPAAGSTLLAYIFKAALEFQLARSASRSRPRLAFFRLTELDHCESPVANFLS*
Ga0005456J37224_100502 | Ga0005456J37224_1005021 | F024561 | VALGQRIAKSGGRPSANGQDADPKSQTCLDLVRKPASANASPSELKASLNFSDRKALDGPAMRPETPLAVENGVGKLAASARMRLMSAWEREHGELPA
Ga0005456J37224_100582 | Ga0005456J37224_1005821 | F000396 | VPSDLISQPNSPPACNGTELCPQDRRVSSLQLPAAFFRTLRINAYGPTCQLSWLEPVSRSGLSLSRNDCPSPGHHFEVEAPDLLLQHPAVRSSCPFRFRFPYVLQFALVRARSRPKPRCLTPVRHSQPFFGSPLPFGAFRALKDQSVQPDSRLGSSPSERSRLPITPRHRIHFLVGLDTGSPLQAR*
Ga0005456J37224_100613 | Ga0005456J37224_1006131 | F079670 | VKRTLKKGLKVLEMVKRERNKIITIDNSLKLRTILYFFF*
Ga0005456J37224_100798 | Ga0005456J37224_1007981 | F080064 | METSSVDHGVICERCEQSLPEMEQSMTGRGSQAMSAERSAGKAGREVKGAEQSELEAEGQVSGTK*
Ga0005456J37224_100863 | Ga0005456J37224_1008631 | F055474 | MPFGIKAFGQFSLPEVHLRKTPDFLSLPVARLHINDDDRGSTFQVRYVSRGSLFSVNL
Ga0005456J37224_100881 | Ga0005456J37224_1008811 | F098814 | LPGTLPVDFPITGFAGIRINAFRRAAFLFRTGLSARNGLSLARYDLRLRGFRYEVNVPGLLLRLRILRFCHPFRLLLHCQSRFAPFSAASTLQTRCGVAVRFRRPRLQPLLPLRNITSLRIDASAVLAACQPACQGCPISVRSPQPFLLLGWLRIIVPGPLLPRKLAVPQT
Ga0005456J37224_101052 | Ga0005456J37224_1010521 | F009043 | VPSDLISQPNPPPACTALSFATRVARCCPFRSPPPSFESNGSTRPGSPAGFPAWSRYLEAAFHSPKTTVRLRTAISRSKFPTYSFDTLPNVHPARSVSDSPTRSGSPRHAQDRYQKPVARLPSGSPNRSSDLHSPLGPFGPFRIKAFNPIPGHEAHLPNSFDCLSLPGPGSILLAPMPDHRSRLASRSAACCSTD
Ga0005456J37224_101140 | Ga0005456J37224_1011401 | F030305 | RMPLQFIGSCIVPFGFLVPVPSFLFPALSDLARRCSSGRPVPRVSDRTGDEAPSCPGSSVFSAVPADGSSSRPDSRILQLGSPQIARLPRLSTSCLAVDERPGCPVRSIIWLYRRRRFRVAPNLTSFGGTVSNSPSRPGSSLLQPLPLMVLRVAPGAPSSGFAGGDSSGCPEALIPRLCRLVAFRVSPDLPPSDICRFRFSGLPQIGFLGGSMMNPRFARTLHPRSIQLTSLQVAPKFPLPAAPRMNLQTHSGLAFLPTLRCSLNLYPLFACRRTGLLRTTINQFRIACRAGLQSVSLQ
Ga0005456J37224_101842 | Ga0005456J37224_1018421 | F096734 | LQVARSSFAPRSAATILFITQRFGSSFQIRYFLPGSLSFE
Ga0005456J37224_101899 | Ga0005456J37224_1018991 | F078064 | IAGCTRDCKSRRKSKVVKLADSEDARTGGYGILIDGLPAMQGSVEVRGASKAQPEVEPQGASRVNRNR*
Ga0005456J37224_102098 | Ga0005456J37224_1020981 | F003497 | VHLLLPAAAFIRLRIGAPDSRSLFSYLLEASASRRPFTRPQRLPSFESPRSEVNAPGLSLRRNSELFFQPVRPPTPILNCVLHAAGGVHC*KPVAVFRSQNSQTSF*LSLPFRTFIPSDRSARSEAESEKLTFAYSPISLRSPKAPITF**
Ga0005456J37224_102234 | Ga0005456J37224_1022342 | F017080 | LDESVVQPVLRPLATPETSLQLALATPSSGCAGFKPPTCVGCSTSGSTGGQPSGSDRCSVLRLDRWQAPGFRRLHCASVRPVANLPTCVGVLPPARPATNCRLTSGADPSARLVPNFRLSPAVVAAFSLRLLPLRLSSLRWLSPVCHTGGELPTRIGCYAIQLHRF*
Ga0005456J37224_102434 | Ga0005456J37224_1024341 | F072941 | LEIVKSEAIESPIMINITVGEIQFISIRLLARVHFGCDEWMAFKGD*
Ga0005456J37224_102701 | Ga0005456J37224_1027011 | F098815 | SK*PIEQLALPAGMRSQSLACSVVDTSPFAPRCGFYPASDRCSRARLPFLPARSHCLGTAFPLPFGSPATTVLFRKPPRRGQSSWPIPSASIPNLSSSPFGPELPPSAPFFTSPGAFLAQNPLPAGKPETLKRPSNFRSPSGLSSLRISALGQRLNLRSLPLCSTRFSFAPRWRQSLLITTRHRIIVPDSLPFTRLSAL
Ga0005456J37224_102970 | Ga0005456J37224_1029701 | F016972 | MVRLTPLTDEIPVVNFYLTINEIYRVSFGAKTSRYGPHELGYRRAT*
Ga0005456J37224_103298 | Ga0005456J37224_1032981 | F038685 | PEKPAENTTGIIPGDRGKDEESWFPPPLPAALKTAGQARSPAPLAGVMTGQLHH*
Ga0005456J37224_104039 | Ga0005456J37224_1040392 | F047066 | VAPANAPQKTVADRSLTVKTRENHRRVFTWFASRWRNHQPKRAEKPHSIIRNRKNPDGPATRPITPLAVENGVGKPAASERSAPNVGAGKRVWRTPTP
Ga0005456J37224_104375 | Ga0005456J37224_1043751 | F007311 | VPSCWPPHFAVAPIMDCSMLDAWHSLLHADCSALHFVCPVQTTDLTRCSTLSALRLLRIAPRSTPCARAGHGSLRATDCSVVQHLELCPACGLLRHRDLAFHAALGLLLVRRFPLSVAHGLLRARRLTLHAGLRIAPYADILRPRAVYGLLHFRH*VPHRHRLLCARRLTLCAVS
Ga0005456J37224_105604 | Ga0005456J37224_1056041 | F052338 | HTFIRHTYNCVPRRLNYMANKKTGRKRPQNSVQPRRVGGFDKNDATLVHIVIGEPGISGVPSTIKGRCWIEWVPRLDATGADKRCFYSKVGVDEYDKILGESGNKLPRNGASLNPEGRKFLVLFPGTLKVWMETKKKDVGPTRAVVESIPASVNSQAVRDNVMLTMVGGGYAETIASRAPSRLDKPSGSGNSRRVS*
Ga0005456J37224_105873 | Ga0005456J37224_1058732 | F029356 | MTTLILNPRGKDGGRPPARVAGRGEATTRKLDRLPGESPGTDVRKISAEAKAVRLLE
Ga0005456J37224_106847 | Ga0005456J37224_1068472 | F001343 | MKFANTRFATVSKSLFMGLALLMATSAFAGTKASLELHYPTNVNGTKLKPGEYKLQWEGSGPNVELSIMSGKNVVAKVPAHLVDLSSPAGNTAAVVKHNDDGSADLQGVRFEGKKYSLELGDGGNGMAASK*
Ga0005456J37224_107348 | Ga0005456J37224_1073481 | F088984 | VFDRFISISLWCAQVTVTPDARRTAVLSSGTLNGLRGLIPVGGQQHPSSGVGASLLWKNAQKNAKKNRTSDVINRIIPHRKPFVT*
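
Rows of the Sequences table can be split into their four fields programmatically. The following is a minimal Python sketch, not part of NMPFamsDB. It rests on assumptions drawn only from the rows on this page: the Protein ID is the Scaffold ID followed by a gene index, the Family ID is "F" followed by digits, and the sequence uses uppercase letters plus "*". The name parse_row and the returned keys are hypothetical. Reading "*" as a translation stop is also an assumption, since the page does not define the symbol.

```python
import re

# Pattern for one Sequences row. Column separators ("|" and spaces) are
# optional, so a row with the columns run together also parses.
# Assumptions (from the rows on this page only): protein ID = scaffold ID
# plus a numeric gene index; family ID = "F" plus digits.
ROW = re.compile(
    r"^(?P<scaffold>Ga\d+J\d+_\d+?)\s*\|?\s*"
    r"(?P<protein>(?P=scaffold)\d+)\s*\|?\s*"
    r"(?P<family>F\d+)\s*\|?\s*"
    r"(?P<sequence>[A-Z*]+)$"
)


def parse_row(line):
    """Split one Sequences row into a dict of fields (hypothetical helper)."""
    match = ROW.match(line.strip())
    if match is None:
        raise ValueError("unrecognised row: " + line[:40])
    record = match.groupdict()
    seq = record["sequence"]
    # Gene index: the digits the protein ID adds to the scaffold ID.
    record["gene_index"] = record["protein"][len(record["scaffold"]):]
    # "*" is assumed to mark a translation stop; the page does not say so.
    record["ends_with_star"] = seq.endswith("*")
    record["internal_stars"] = seq.rstrip("*").count("*")
    record["residues"] = len(seq) - seq.count("*")
    return record


if __name__ == "__main__":
    row = ("Ga0005456J37224_100613 | Ga0005456J37224_1006131 | F079670 | "
           "VKRTLKKGLKVLEMVKRERNKIITIDNSLKLRTILYFFF*")
    for key, value in parse_row(row).items():
        print(key, value, sep="\t")
```

The backreference ties the Protein ID to the Scaffold ID, so a row whose two IDs disagree is rejected rather than split at the wrong place.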

© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice