NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300026751

3300026751: Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G05A2w-12 (SPAdes)



Overview

Basic Information
IMG/M Taxon OID: 3300026751
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0095510 | Gp0072082 | Ga0207594
Sample Name: Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G05A2w-12 (SPAdes)
Sequencing Status: Permanent Draft
Sequencing Center: DOE Joint Genome Institute (JGI)
Published: N
Use Policy: Open

Dataset Contents
Total Genome Size: 24285283
Sequencing Scaffolds: 33
Novel Protein Genes: 36
Associated Families: 36

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria | 3
Not Available | 12
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Methylocystaceae → unclassified Methylocystaceae → Methylocystaceae bacterium | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 3
All Organisms → cellular organisms → Bacteria | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Myxococcales → Cystobacterineae → Anaeromyxobacteraceae → Anaeromyxobacter → Anaeromyxobacter dehalogenans | 1
All Organisms → cellular organisms → Archaea | 3
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 2
All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 1
All Organisms → cellular organisms → Bacteria → Acidobacteria | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 1
All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Nitrososphaeria → Nitrososphaerales → Nitrososphaeraceae | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts)
Type: Environmental
Taxonomy: Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil → Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts)

Alternative Ecosystem Assignments
Environment Ontology (ENVO): terrestrial biome | agricultural field | agricultural soil
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Soil (non-saline)

Location Information
Location: Wisconsin, United States
Coordinates: Lat. (°) 43.3 | Long. (°) -89.38 | Alt. (m) N/A | Depth (m) 0


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F000569 | Metagenome / Metatranscriptome | 1018 | Y
F001436 | Metagenome / Metatranscriptome | 695 | Y
F002315 | Metagenome / Metatranscriptome | 572 | Y
F002568 | Metagenome / Metatranscriptome | 547 | Y
F004511 | Metagenome / Metatranscriptome | 435 | Y
F005305 | Metagenome / Metatranscriptome | 405 | N
F014011 | Metagenome / Metatranscriptome | 266 | Y
F016026 | Metagenome / Metatranscriptome | 250 | Y
F016471 | Metagenome / Metatranscriptome | 247 | Y
F017145 | Metagenome / Metatranscriptome | 242 | Y
F022675 | Metagenome / Metatranscriptome | 213 | Y
F022677 | Metagenome / Metatranscriptome | 213 | N
F023316 | Metagenome / Metatranscriptome | 210 | Y
F026346 | Metagenome / Metatranscriptome | 198 | Y
F033457 | Metagenome / Metatranscriptome | 177 | Y
F038467 | Metagenome / Metatranscriptome | 166 | N
F048397 | Metagenome / Metatranscriptome | 148 | N
F048402 | Metagenome | 148 | N
F049092 | Metagenome | 147 | N
F049708 | Metagenome / Metatranscriptome | 146 | Y
F051112 | Metagenome / Metatranscriptome | 144 | Y
F051273 | Metagenome / Metatranscriptome | 144 | N
F060311 | Metagenome / Metatranscriptome | 133 | Y
F061030 | Metagenome / Metatranscriptome | 132 | Y
F064703 | Metagenome / Metatranscriptome | 128 | Y
F066823 | Metagenome / Metatranscriptome | 126 | Y
F066861 | Metagenome / Metatranscriptome | 126 | Y
F068052 | Metagenome | 125 | Y
F071393 | Metagenome | 122 | N
F073765 | Metagenome / Metatranscriptome | 120 | N
F083534 | Metagenome / Metatranscriptome | 112 | Y
F084328 | Metagenome | 112 | N
F088429 | Metagenome | 109 | N
F089166 | Metagenome / Metatranscriptome | 109 | Y
F099400 | Metagenome / Metatranscriptome | 103 | Y
F103537 | Metagenome / Metatranscriptome | 101 | N

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0207594_100297 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae | 1717
Ga0207594_100324 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1665
Ga0207594_100754 | Not Available | 1261
Ga0207594_101165 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1070
Ga0207594_101557 | Not Available | 936
Ga0207594_101601 | Not Available | 924
Ga0207594_101893 | Not Available | 862
Ga0207594_101997 | Not Available | 843
Ga0207594_102134 | Not Available | 818
Ga0207594_102191 | Not Available | 806
Ga0207594_102292 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Methylocystaceae → unclassified Methylocystaceae → Methylocystaceae bacterium | 789
Ga0207594_102312 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 787
Ga0207594_102448 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 763
Ga0207594_102552 | All Organisms → cellular organisms → Bacteria | 748
Ga0207594_102565 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 746
Ga0207594_103007 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → Myxococcales → Cystobacterineae → Anaeromyxobacteraceae → Anaeromyxobacter → Anaeromyxobacter dehalogenans | 691
Ga0207594_103107 | Not Available | 684
Ga0207594_103132 | All Organisms → cellular organisms → Archaea | 681
Ga0207594_103147 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium | 679
Ga0207594_103153 | Not Available | 678
Ga0207594_103356 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria | 660
Ga0207594_103385 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 658
Ga0207594_103416 | Not Available | 655
Ga0207594_103486 | All Organisms → cellular organisms → Archaea | 648
Ga0207594_103914 | All Organisms → cellular organisms → Bacteria → FCB group → Gemmatimonadetes | 619
Ga0207594_105124 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 549
Ga0207594_105454 | All Organisms → cellular organisms → Archaea | 535
Ga0207594_105538 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → unclassified Actinomycetia → Actinomycetia bacterium | 531
Ga0207594_105648 | Not Available | 528
Ga0207594_105896 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 518
Ga0207594_105922 | Not Available | 517
Ga0207594_106039 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 512
Ga0207594_106086 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Nitrososphaeria → Nitrososphaerales → Nitrososphaeraceae | 511

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0207594_100086 | Ga0207594_1000861 | F099400 | SLTIVRHPCPMRILIWIVGLLALGTSLDSSLYGGAYTRAFVTTIQDMHRAFGLHLFG
Ga0207594_100297 | Ga0207594_1002971 | F026346 | SLLSTDPRPMSSLSKVTQLPPTEGGEAASRVASIVVETDKKGRCEERRFDNRTGKMVSANYVNCDARLEPERDTTPSENINRERIRAILGAFKK
Ga0207594_100324 | Ga0207594_1003244 | F084328 | KNAAPSVSDACPDWEAPAFTKLPIASRTGSGANVANDAPRAIEPEAPGQPLSKLGFSFEMSFPMSSRSEK
Ga0207594_100754 | Ga0207594_1007541 | F038467 | MRWIPALRICVAASLLGGSAAHAANDSIRIGDWILRPHFSEQKDKKQKEKRLDRCTAQLTNADKITMIYSLDTHYMWTLELSNPSWNFPSGSKFDVSFGSREGGYFRQRVAALEPQLVRVLLPDSVSSFEAIRRLFKLQMVAGGLTTQFDLAYANQVLLALTQCVTKFGMTTKSKAAITAYLKSPIGPAAEANSDPAIVQEAFSLAGAIVAEGEIAKAAALKPDEIPDGVSGDTVWKVADNLFTISVLPKDAVPAEIGDLNDLIVGGNAQKCRGDFFAGAMLDVVESTTIARAYTTCQTQQAATSTYYFAMPRKQGGGLYLTKIIATGVEVPPTIERAIKELDAKVRGVITAALARL
Ga0207594_101165 | Ga0207594_1011652 | F051112 | MKRTMKNRTMSRAFTIAAVAVAAVFVVSAAQAFTVGDANGPVGGQGYIDFDKPGAAPDRMAPVNRFGNENGQTTMKQGNSTLQFGGQQSFGQRYNTDNIFNPY
Ga0207594_101557 | Ga0207594_1015572 | F049708 | LVFALSFIPTDKILPRPIEESWELVRDILWAGMTIISIGLLEVMSPLATPFASASGLRRGDVAEILLAVILVAAAVYSALRLRRKDLTPRWRGTHTFFLLLALTLVAIVRFTLYSWSHFA
Ga0207594_101601 | Ga0207594_1016011 | F064703 | MRKGLTVLIVAVLGVFAVKSVIGPGKMSLSEAVKYPVPTYGLHVAQPVDMKSFPSDVVPL
Ga0207594_101893 | Ga0207594_1018932 | F049092 | MKFIEPAFDTGGDLWRPPLNVPDSDAHALNEKFARQSAELRLRLTEVLELHNVQQRQANELQDRYDDIDRLSQTVSALQEAVNQYKMGTAAAEDKIILLESEKAALQAQLDGALEESKMLADRMLAAEAAAKRQEENVASS
Ga0207594_101997 | Ga0207594_1019971 | F002315 | MKRVAVLSMAMLTVAFAADKKTYRYNCKGGAFTVTAAVEASGRWSKAEPVVLQIDSEPPQTLIADPDVPDADSFTNKDYEFYALKTFITLTRKSHGVVVKTYNACRAE
Ga0207594_102134 | Ga0207594_1021342 | F048402 | ARRPQREGVLLTRRRLRIYAMVALVVALGLFLTMVAALQWVAIFLVVASTLVLMWTLWTGGREVERE
Ga0207594_102191 | Ga0207594_1021911 | F033457 | MRNIVSYIPAGALVVLLLDVIAPPAGLGFRAAAWPSVERQGLAPQIVDRTRKSDQLPVPKATGRRLTPPAAPVLVGCDPVFSA
Ga0207594_102292 | Ga0207594_1022921 | F071393 | ALEESKTLADRVHAAEAASDRREATVALSIKQIEFLNTELTAAAAERFRLVAAMQGEQRRQRSVFSQQKSILEDKLQEKEALAATQGTKIKQLEGVRDELDKRVRVIEALLASEREVAERKTRRPTEILGAAG
Ga0207594_102308 | Ga0207594_1023082 | F103537 | LRFGSKLEIMPFVLRGRITKSIFAFAAIEYEFPLTVSDQIKHELLKMNDARFIDIVHCGYTEVAKQAAKVFTLNKPFINCFKTIHKLSKQIFNIVLATKFSR
Ga0207594_102312 | Ga0207594_1023122 | F001436 | MKGVVMNTVNDIAPLLELLKMAAERRPYSETGNVSQDDLFCEDHSLLEMWPEACRRTGVGTREFPPGVIKLWKESLGRSN
Ga0207594_102448 | Ga0207594_1024482 | F073765 | KGQKLMEEVYLSEQIRCLECQKTVPIGVEVVRVQKEGQAKKVLKRVFYCRSHAGDYESRARGEG
Ga0207594_102552 | Ga0207594_1025521 | F060311 | FFFARPFVVGGSMIGYITVMVSKAQRIVLVDHARLPWRFFGRLTAVQAGL
Ga0207594_102565 | Ga0207594_1025652 | F061030 | KDIFDKVACTCAVRNGGHVIPPPVGVKREGLKLRPKEEAGGTQTLDGGRVAFPKYYRREGLKFHRSRALEGYLACMRAAGRK
Ga0207594_102739 | Ga0207594_1027392 | F083534 | MGRIRCVRCEKLRWDFMARTFVLIAPVPYVLHQVSCSYETIPNAPKYYETYRNITLGSNWVDCEXPLRKIPTSLRGMNFCIKCTSSPYFAPSFMQLRNNPKCTKILQRSNGVDSVRSLQKSPTSLRGTNYSINCTNSPYFAPSFMQLRNDPKCTQTLXNTPKHEFRIQ
Ga0207594_103007 | Ga0207594_1030072 | F004511 | LADVLGTAATATLLRRAIKQTAARTAWSEPVIVARNGLDYTYRLPEAWKQPANDEAVDVLRIVAGELRVLLIELTGPVVVARLGQLAALRKWGIDFSDEARHG
Ga0207594_103107 | Ga0207594_1031071 | F089166 | AMVDALEAERNAAKKNNAGDHPAFSCEPEAEPSPAPRVQRTADEPAP
Ga0207594_103132 | Ga0207594_1031321 | F016471 | MTEADKFGAVKDATVAIGLANKDTKDVISSFGTGFFIGGEYIVSSAHIFSQCIKYNAQNKDMNNGMEGIYSAFNITTNGDQLELNTYHIIKAIRLPPVKEVTGFTGSVDLDIGIGKLDR
Ga0207594_103147 | Ga0207594_1031472 | F017145 | IPRTYVLLTWLAFGAALLIYSNDWHPSGWTALRKEATAPKPPVSVTEQYTGSIIIVPTRGEDCRQMMLDNRTGRMWDKGVVNCYEAVSRPEKGQRGGMSSLRMNAIGKAFNRRDE
Ga0207594_103153 | Ga0207594_1031531 | F023316 | KVVIRKIAPAPAATVCVGTSCSLPVATSEKLAALLLS
Ga0207594_103356 | Ga0207594_1033561 | F022675 | MRSNRRRNWFIKRIALGLAIAAFAAPVAQAKVDEGSSVQANGYQAFVTDFPSYANGVNASDYGMPRPTATDYAISRGDLIEVVRSTPNGTSSDKIEFVRTQPRSIGEPQVVAAGFDWKDAGVGAGLALALVLLGGG
Ga0207594_103385 | Ga0207594_1033852 | F016026 | GRREGQEVWARWIDGRVQGDREFLKIAGLDPGTRFEDPHAFVELIEKLLDKGSASVVLDLPQSQA
Ga0207594_103416 | Ga0207594_1034161 | F000569 | MDLHIKKVWLPGAASCLLFFGFYWVLIWLPFDKNRFQFMVIPYLVLPFVGALAAYGSRRMKGSVLERILSALFPVFAFVALFAVRIVYGLFFEGKPYTLPHFLAGFSVTLVFIVVGGLLLVLGAWPFCRNAKFKPSDSSH
Ga0207594_103486 | Ga0207594_1034861 | F005305 | EDEDFKKEFNQLSKDVIEPVMRKFESYLESKDVNSSVHIQSQIVAGKNPSIEFSLHLKLTHESRYPNIKFSSSGEKISVQEDRLMTKDEVSQDMIPEYYDKELITEEFVKVRLIRLIKSCFDKDWQSFYS
Ga0207594_103914 | Ga0207594_1039141 | F066861 | DAKITLDGGAYIGSVKSLNDSTPIAPGTLRITREDMDAIMPNLKAGMKVYFY
Ga0207594_105124 | Ga0207594_1051242 | F066823 | TYLANHLSSLRQEIAHLQNMNTRYAKKTEHSPLDDSALEMRTVRLLQIKEELESMLERPSDPKVWWDKLRKPRTAWSAK
Ga0207594_105454 | Ga0207594_1054541 | F088429 | SLNVPEPQPETNASLNVPEPQPETNASEIVTPRSIDLNITVGKDPIARSENQMVTVVALDPTTGKVLDRVFIKLEIKDPVGILVKNYTGTVGNLTRTFKIGENAIGTFIISATASQAGVQSTKSLPFQVQ
Ga0207594_105538 | Ga0207594_1055382 | F014011 | HRHGHFRGRPGFRGHRRFPSREEWLRRLEEYQRDLEQEIADLADVIKHLKSGETPAGTTA
Ga0207594_105648 | Ga0207594_1056481 | F068052 | GATPDQAKAKIDSLEGQVVSLKNREWPKLTPAAVTDFERVLASQESHVVSILPTDRDSIFFARDLVDAFKRIGWKAKRDVSMNDIPDGLSVWPQDDVARAICNALMMATGALVTVREDQHLKDQGTYAIGIGHKLN
Ga0207594_105896 | Ga0207594_1058961 | F022677 | RQKGAIMYGLANIDFSNRFGVVISELPDEIDDQDQMQDAKQGPALNDLEMAVFGAA
Ga0207594_105922 | Ga0207594_1059221 | F051273 | MRRFIPLLILLGLVFGASYASALINRVMGPWSSTAIHQDGSLSHMQFGVDLPRPEWVPVYPGAWVVGGSKITSVEHPAGFHGLDLGTRASLDEVKRFYTEQLTAA
Ga0207594_106039 | Ga0207594_1060391 | F002568 | MKNPPILVSVLGFFAALAGFGFLFFGLRVIGFDWFGAFGDLPQYNSVGLWGWLAVGTGIVWLLAALGLWALQPWARLFALIIAGFALFEAVLAFIQFPGTGIGFGMAIAGIVERRHVPCDRVADR
Ga0207594_106086 | Ga0207594_1060861 | F048397 | ILVDQCRYQYIFIVHNSTIFRAILEKNKCDKCNLTIHPNFMGNHKCDHQNCPICKNPITPSQYWPHIRSHPGHENDSPPLPKRNMYENRKNENHSGYGNYQRN

© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice