NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300027438

3300027438: Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G08A3a-11 (SPAdes)



Overview

Basic Information
IMG/M Taxon OID: 3300027438
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0095510 | Gp0072060 | Ga0207564
Sample Name: Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G08A3a-11 (SPAdes)
Sequencing Status: Permanent Draft
Sequencing Center: DOE Joint Genome Institute (JGI)
Published: N
Use Policy: Open

Dataset Contents
Total Genome Size: 31256952
Sequencing Scaffolds: 38
Novel Protein Genes: 42
Associated Families: 42

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Xanthobacteraceae → Pseudolabrys → unclassified Pseudolabrys → Pseudolabrys sp. Root1462 | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 3
Not Available | 18
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiaceae → Rhodoplanes → unclassified Rhodoplanes → Rhodoplanes sp. Z2-YC6860 | 1
All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes | 1
All Organisms → cellular organisms → Archaea | 4
All Organisms → cellular organisms → Bacteria → Proteobacteria | 2
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium | 1
All Organisms → cellular organisms → Bacteria | 2
All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1
All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 2
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium 13_2_20CM_2_64_7 | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts)
Type: Environmental
Taxonomy: Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil → Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts)

Alternative Ecosystem Assignments
Environment Ontology (ENVO): terrestrial biome, agricultural field, agricultural soil
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Soil (non-saline)

Location Information
Location: Wisconsin, United States
Coordinates: Lat. (°): 43.3, Long. (°): -89.38, Alt. (m): N/A, Depth (m): 0


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F000268 | Metagenome / Metatranscriptome | 1411 | Y
F002616 | Metagenome / Metatranscriptome | 543 | Y
F002896 | Metagenome / Metatranscriptome | 522 | N
F004478 | Metagenome / Metatranscriptome | 436 | Y
F005363 | Metagenome / Metatranscriptome | 403 | Y
F013352 | Metagenome / Metatranscriptome | 272 | Y
F013811 | Metagenome | 268 | Y
F014051 | Metagenome | 266 | Y
F015863 | Metagenome / Metatranscriptome | 251 | Y
F017166 | Metagenome / Metatranscriptome | 242 | Y
F020927 | Metagenome / Metatranscriptome | 221 | N
F023901 | Metagenome / Metatranscriptome | 208 | N
F024580 | Metagenome / Metatranscriptome | 205 | Y
F025317 | Metagenome / Metatranscriptome | 202 | Y
F028161 | Metagenome / Metatranscriptome | 192 | N
F034564 | Metagenome / Metatranscriptome | 174 | Y
F038294 | Metagenome / Metatranscriptome | 166 | Y
F038480 | Metagenome | 166 | Y
F039759 | Metagenome | 163 | Y
F040398 | Metagenome / Metatranscriptome | 162 | Y
F041750 | Metagenome / Metatranscriptome | 159 | Y
F042837 | Metagenome | 157 | N
F044132 | Metagenome / Metatranscriptome | 155 | Y
F045986 | Metagenome / Metatranscriptome | 152 | Y
F046248 | Metagenome | 151 | Y
F046445 | Metagenome / Metatranscriptome | 151 | Y
F049092 | Metagenome | 147 | N
F049708 | Metagenome / Metatranscriptome | 146 | Y
F056406 | Metagenome | 137 | N
F057709 | Metagenome | 136 | Y
F065246 | Metagenome / Metatranscriptome | 128 | Y
F068052 | Metagenome | 125 | Y
F068880 | Metagenome / Metatranscriptome | 124 | Y
F071393 | Metagenome | 122 | N
F078898 | Metagenome | 116 | N
F082749 | Metagenome / Metatranscriptome | 113 | Y
F084687 | Metagenome / Metatranscriptome | 112 | Y
F087349 | Metagenome / Metatranscriptome | 110 | Y
F087420 | Metagenome / Metatranscriptome | 110 | Y
F087926 | Metagenome / Metatranscriptome | 110 | N
F099772 | Metagenome / Metatranscriptome | 103 | N
F103567 | Metagenome | 101 | N

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0207564_100114 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Xanthobacteraceae → Pseudolabrys → unclassified Pseudolabrys → Pseudolabrys sp. Root1462 | 2131
Ga0207564_100929 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1167
Ga0207564_101010 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1129
Ga0207564_101110 | Not Available | 1093
Ga0207564_101154 | Not Available | 1079
Ga0207564_101394 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiaceae → Rhodoplanes → unclassified Rhodoplanes → Rhodoplanes sp. Z2-YC6860 | 1005
Ga0207564_101998 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes | 880
Ga0207564_102094 | All Organisms → cellular organisms → Archaea | 861
Ga0207564_102144 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 853
Ga0207564_102641 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 790
Ga0207564_102786 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium | 775
Ga0207564_102912 | All Organisms → cellular organisms → Bacteria | 762
Ga0207564_103063 | All Organisms → cellular organisms → Archaea | 747
Ga0207564_103328 | Not Available | 722
Ga0207564_103713 | All Organisms → cellular organisms → Bacteria → Bacteria incertae sedis → Bacteria candidate phyla → Candidatus Rokubacteria → unclassified Candidatus Rokubacteria → Candidatus Rokubacteria bacterium | 691
Ga0207564_103876 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 680
Ga0207564_103969 | All Organisms → cellular organisms → Archaea | 673
Ga0207564_104118 | Not Available | 663
Ga0207564_104253 | Not Available | 655
Ga0207564_104307 | All Organisms → cellular organisms → Archaea | 652
Ga0207564_104581 | Not Available | 636
Ga0207564_104617 | Not Available | 634
Ga0207564_104698 | Not Available | 629
Ga0207564_104898 | Not Available | 620
Ga0207564_105552 | All Organisms → cellular organisms → Bacteria | 591
Ga0207564_105754 | Not Available | 583
Ga0207564_106043 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 572
Ga0207564_106240 | Not Available | 565
Ga0207564_106742 | Not Available | 549
Ga0207564_106885 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 545
Ga0207564_107324 | Not Available | 534
Ga0207564_107507 | Not Available | 528
Ga0207564_107662 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → unclassified Alphaproteobacteria → Alphaproteobacteria bacterium 13_2_20CM_2_64_7 | 525
Ga0207564_107841 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 521
Ga0207564_107901 | Not Available | 519
Ga0207564_108086 | Not Available | 515
Ga0207564_108252 | Not Available | 511
Ga0207564_108454 | Not Available | 506
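
The per-lineage counts under Dataset Phylogeny appear to be a simple tally of the scaffolds listed in this table. The Python sketch below illustrates that tally on a hand-copied subset of rows (the Hyphomicrobiales, Archaea and Bacteria scaffolds only). It is not part of NMPFamsDB; the names `SCAFFOLDS` and `tally_by_taxonomy` are hypothetical, and the assumption that the site counts by exact lineage string is inferred from the numbers on this page.

```python
from collections import Counter

# Lineage strings as they appear on this page.
HYPHO = ("All Organisms → cellular organisms → Bacteria → Proteobacteria"
         " → Alphaproteobacteria → Hyphomicrobiales")
ARCHAEA = "All Organisms → cellular organisms → Archaea"
BACTERIA = "All Organisms → cellular organisms → Bacteria"

# (scaffold ID, taxonomy) pairs hand-copied from the table; a subset only.
SCAFFOLDS = [
    ("Ga0207564_100929", HYPHO),
    ("Ga0207564_101010", HYPHO),
    ("Ga0207564_102144", HYPHO),
    ("Ga0207564_102094", ARCHAEA),
    ("Ga0207564_103063", ARCHAEA),
    ("Ga0207564_103969", ARCHAEA),
    ("Ga0207564_104307", ARCHAEA),
    ("Ga0207564_102912", BACTERIA),
    ("Ga0207564_105552", BACTERIA),
]


def tally_by_taxonomy(rows):
    """Count scaffolds per exact lineage string (assumed grouping rule)."""
    return Counter(taxonomy for _scaffold_id, taxonomy in rows)


if __name__ == "__main__":
    for lineage, count in tally_by_taxonomy(SCAFFOLDS).most_common():
        # Print only the last taxon of each lineage to keep lines short.
        print(lineage.split(" → ")[-1], count)
```

For these three lineages the tally gives 3, 4 and 2 scaffolds, which matches the Dataset Phylogeny counts listed for them.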

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0207564_100102 | Ga0207564_1001021 | F082749 | PRTQPYHCIAHSAFAANATGEGRLAPAFARRGGMDGESVDAAGKLGRKRLINHAMTLDAGLSLKGVRHDIDPVVSLPARPVPGMALMLVRFINHFEVLRRESLGQLFCDEIGGSHIARLGERSLPVNGYKQVLKASPVKAHNVRS
Ga0207564_100114 | Ga0207564_1001142 | F017166 | MFIVALGAMLFIGWLYLGEMSDVRSIKAAVTAAADSTAVALSQSANPERNTDADADGIFIKHIQTSSTLEDVSVKQSVEPISAGRLRQSVKVTARARTTLSEFFNMQGAEIEITATHDFDRKK
Ga0207564_100929 | Ga0207564_1009293 | F040398 | SSAVPNFLRLCVATSGKVLSERRDDLVKFVAAEMDAYKFALANRAETIKVSQEMTHAKPDDKRAEFITDEAIKDKQIDPTLSIPLDRLDWMQNLFLKAGVIKQTVPIESIVDKSVNADAAKIAGK
Ga0207564_101010 | Ga0207564_1010103 | F044132 | QTTMKQGNSTLQFGGQQSFGQRYNTDNIFNPYARDGR
Ga0207564_101110 | Ga0207564_1011101 | F015863 | MRRFIPLLILLGLVFGASYASALINRVMGPWSSTAIHQDGSLSHMQFGVDLPRPEWVPVYPGAWVVGGSKITSVQHPAGFHGLDLGTRASLDEVKRFYTEQLTSAGFEVSDLGLMGLNPMTAAYLGVDGMLSAKRHATDDAIDVQIRTPDGIIPSRLLQIHWRKISATPG
Ga0207564_101154 | Ga0207564_1011541 | F039759 | LPPMRPLAPLATDIAVMVICFCYIMIVLCLYSAAWVASGGRVPQGAYGRRPTPLEPLRPSRLLDTFLPGHRSQDVTLWEGAVFALSSLLFVAASMAPFYGIRRVQNAFQSTFATQVQQVCPAQPPEAMIACWAQFYPWSRVAIDLGAPIVIAVICLVLANRLRHFGRQHFINRLAELKVLPAASTLFLRAFRDDQVRIRRASRNLFSSVFDLGRVPATLDELMLERLDGRGDLIAIGNPQDRKGAARQSPWGAQRLYVDDAHWQETVTMLARDADRIVLCIDASDGVRWEIAHVLQSGHAGKTLFFLNPSTDVQTRKRQLQEDFGVSAADLASIDVDRVLALRTTSTDQLILMVCDKPE
Ga0207564_101394 | Ga0207564_1013941 | F087926 | HKFALYWKGTHAFAKRQRELGDRFVARPVFTRKGSTYVGLVPLDRQKKKA
Ga0207564_101998 | Ga0207564_1019983 | F000268 | MRVVAVMLLLSAGIAAEAMSYSFVSKASGRLGGPIRFEFYRDSTTLPKTDIKSFTVSMRTADDRWKAMWSILSGRGLTQPIEYGVTPPGFTTMIQPQKLISGRVYAAFATDEHGGSSGVTFGFDKDGRM
Ga0207564_102094 | Ga0207564_1020941 | F013352 | ASQFSSLLCLSMAIAGFLFSLMASSTNAQIEVTVDENMSTQISNSTQSEDAEPRPDILYSALNKDTIVGEVLNNFSYPIELVRITATVYDKNGIIVATGDKYVNDYLIKPGSRSGFDIFLDETLPTKSKYTLTTSFEKSEDDKPEALQLSVGKNSKSSNTFRVLGEVMNQGKNDANAVKVSAIFYDEKHKVMDTDYVFTNPDIISPNKKAPFEFSFYVDNPEKIKSMAFNVQSDEFSLITDNGQNNTISQQ
Ga0207564_102144 | Ga0207564_1021442 | F034564 | SGYWYGAVCIAGLGFGLLGERRPFGQGILAHPFIVYAFVVAAGLLVIRVVRQQPVPELIPERALGLGCAAGVALFLAGNFIAAHLVGR
Ga0207564_102641 | Ga0207564_1026411 | F020927 | MWAAFVRQFAAYAITRRGKKLFALIGVLALCFGAALLIDMQFYVSASFAALLAGFAAVTYVVQHVKLKRAEHQRLLRKAEAAHQRALAAQARLERIDTAKSALRGAVTGAGRLVT
Ga0207564_102786 | Ga0207564_1027862 | F056406 | QMMLDNRTGRMWDKGVVNCYEAVSRSEKEQRGGMSSLRINAIGKAFNRRDE
Ga0207564_102912 | Ga0207564_1029122 | F028161 | TLSLAAVLTSLAIQAAQAQDNKNVREDDYVRKVPLEDFKVPIVPIIPPGSSLDLRPGRTPDSSDRIYNSTPFSRDPTTPSIGLSIKSPFDDRK
Ga0207564_103063 | Ga0207564_1030631 | F002896 | MGKMNEDTILHFYRILENSLLESDISKINEEDIDAWSQSFKKVVRESREKSGKGVFVPFLMWKLGELSPVEASKYLVNRKQDECRVSYDHNNVEYILRVMALMFMSWSVTNLKRKTQNGHCQNIDHPHGDTNPRLCKEGTIFHQDLYN
Ga0207564_103328 | Ga0207564_1033281 | F002616 | PSYDLLYSYAHLTPRMQDTPMPNSFPNWTSRIVIAGRIAEYRRPENWNESTMPGDYHLPFGRVEAQAKAARWLWQSYII
Ga0207564_103713 | Ga0207564_1037131 | F045986 | MREAISHSGLRMHQWLILFGALLYGIIYMGVLEPRGIAAWFANQPEVARAFSDPHFGRADAFILLFSTLFLGPLALLLGLMALVFLLAVFGGFLLPIVRWFRLPDWVATASVLVGVVTALWLQSEMWLPR
Ga0207564_103827 | Ga0207564_1038272 | F087349 | MTLEYGMSWSETMKGNATVASLEQVANDESVFRPTQATSWDPFEVWLTRIKQPRDRAAKSALADTSSGKGRV
Ga0207564_103876 | Ga0207564_1038761 | F068052 | GATPDQAKAKIDSLEGQVVSLKNREWPKLTPAAVTDFERVLASQESHVVSILPTDRDSIFFARDLVDAFKRIGWKAKRDTSVNDVPDGLTVWPEDDVARAICNALTMATGALVAVREDQHLKDQGTYAIGVGYKLI
Ga0207564_103969 | Ga0207564_1039691 | F068880 | AGPFMTGYLKYKGVLESADRLTFGHSNYGYRYFLNLSSPSKLLDSSSGLIPKNEFLSKIPEGYDVHFKGMLIITQKHKELYAIIFLNPKEKFDSMLNQIQPTLDSIQLSG
Ga0207564_104118 | Ga0207564_1041181 | F099772 | LLTVCAVQLMGTACADDCLRQWSTSPVHAHDQLPGSPDGQTETDPGEGEFEELALAQGEAFRIIHLARACVLDNASCPRTDFAFDWFRPPALP
Ga0207564_104253 | Ga0207564_1042532 | F046445 | MWRAAQVLIGVTVLALVTFAHPIEGNTFGFCPFSKSLLHAANDVVVAAIDVGS
Ga0207564_104307 | Ga0207564_1043072 | F023901 | FAINTKGWDAKREPTHNCHDQHKSGLIPTNLENDKAVRLRQTVKDESGKVHQIGEIDYMDGNGFHKVMDIFDSSPNPWMVDRNLYETKSYFWIRNNGSGYVTVRDVSLEILS
Ga0207564_104581 | Ga0207564_1045811 | F087420 | AKRSAAGGAIVLVLCALPHAPASAAKPPYAGCVVVTKQEYDSAKKQHMLRTRYTEYVRTGLPGRRQYWYCR
Ga0207564_104617 | Ga0207564_1046172 | F038294 | MKFVTLCLAFVVAAFGTDRAAVQERPNFSGTWIGVGPQPEIRELTIKQDGSTLSFEGQPDVTKHTFKLDGSETEMSAPDGKPLRAKAVWVGKSLVITIHFPELKQDIRRLTWVIDADGQLVMETAFLGGKPEA
Ga0207564_104698 | Ga0207564_1046981 | F013811 | VQNRPAILVADQKVSPFFSHGSGPTGFEASFKHDVRNGLGIKVDVSGYSDTFPPGPAAYCQPDGSTAGIACGTGLTFQATGRALYVTAGPEWKIRRGKRFAPFAQTLVGIVYTRSTFMMNGSDVQYTNPFTGGVLLFTSARFPPDRSIDYADAHADAGLALAIGGGFDIRLSRRIGLRAAMDYDPTFLVRPVFPDLTPDAEGRVVLRPA
Ga0207564_104898 | Ga0207564_1048981 | F049092 | MKFIEPVFETGGDLWRTLSNVPDADARALNEKLARQSAELNRRLTEILELHNVHQRQANELQDAHDEIDRLSQTVSALQEAVTQYQAGAAAAEDEIVLLESEKAALQAQLDGAFEESKTLADRVLAAEAAAKRREENIASSLKQIDFLNAELMAASSERFKV
Ga0207564_105014 | Ga0207564_1050142 | F084687 | MKLVRYRSASGEKPGLILDGEIFDLSGSFAALNPRAPTLDDIEAITAVPAKALVKVEK
Ga0207564_105552 | Ga0207564_1055522 | F004478 | QHRAVLADGRRAVAGAARERGIPFVVFDVKPYPGLDPKYVRDELRDGQRELLERESQRSGFHFLNTYSYFMDYLKQHPDVDLRTVFAVSAADGHPNTLAHSINAQALFEYLVAHRLLPLDKRSNQDH
Ga0207564_105754 | Ga0207564_1057541 | F065246 | ERRFAIRPTPVVQQSADAGGLIDRQGCKSDKSVPRNANVRILVQSGAWSSVDIDEDGNADGQVKTADLTSNLATMPPWG
Ga0207564_106043 | Ga0207564_1060431 | F038480 | MKTVLVSIGLCLAATAVHSMPLSLLNANVAQPVIAVSDQCGDRCGSSRSYVRDRRTVMAGYSGGYVLVRDPLIQRRPYCPFG
Ga0207564_106240 | Ga0207564_1062401 | F078898 | MDLSKRQLLQEALAKAEAHHAYIAVSTPSGTRILLRPDFTCYETYVEGTGADGQLLTLTY
Ga0207564_106742 | Ga0207564_1067421 | F014051 | KEVNKKALKDFLDIAFWAVKDDGLPRNKLEATASLMKKIGAIKPDKEPVAFENLVDPSVWKDANAMVK
Ga0207564_106885 | Ga0207564_1068851 | F042837 | MAMLKKKTRKAISKTLKKAINKHGPTVAEHLATGLAAGLATYLGAEGKKGRKQIKKIAKSIPGGKKIARAVTKTVPALKGAADKLPTWNDNGDNKKGRGSKKSHSAS
Ga0207564_107324 | Ga0207564_1073242 | F049708 | MRVKWVLIGFGILYLVFALSFIPTDKILPRPIEESWELVRDILWAGMTIISIGLLEVMSPLATPFASASGLRRGDVAEILLAVILVAAAVYSALRLRRKDLTPRWRGTHTF
Ga0207564_107507 | Ga0207564_1075071 | F071393 | QEAVNQYKMGTAAAEDKIILLESEKAVLQAQLDVALEESKTLADRLHAAEAASDRREATVASSIRQIEFLNTELTAAAAERFRLVATMQGEQRRQRSVFSQQKSILEDKLQEKEALAATQGTKIKQLEGVRDELDKRVRVIEALLASEREVAERKTRRPTEILGAAG
Ga0207564_107662 | Ga0207564_1076622 | F041750 | VLFVSGKPLFAPRKPVTIAAMLKAPTDADLIRLRQFLHRLESDAKDAVEDAEFCRREIDRLKNEIAYLEAARSKAFLAQIAIDFGATGSGRWNYSAQRGKPS
Ga0207564_107841 | Ga0207564_1078411 | F005363 | LATKDVDLTFRGEIDFENTKHVVVRIAGATPIFDLMSRPLDCVNKIEIAPAALPLAPVATELEFRGPLLQSGWSVSLKEEIASQFSIVSTPDAAERIFPLCFGTGPEEKTLLLGTVPRAQAALQDQPKKREKGQ
Ga0207564_107901 | Ga0207564_1079011 | F024580 | ESKESAMKTLFNRNTAIATFAAVVIVGLAGFTLDRGHDGALPKGIIEVGNPTSLAVGDTLVASLPAVEVIGSREMTLADASKHADPQG
Ga0207564_108086 | Ga0207564_1080861 | F046248 | LPILVAAVLLTAIFVDVVAAGVKAVSRRSRASYPVPDTGITVSVPNAMKAFPAELLPQ
Ga0207564_108252 | Ga0207564_1082521 | F025317 | AGGAACGMARLTERLAVLAFAALIVAAIIGIAFAVGYGVGKLLL
Ga0207564_108454 | Ga0207564_1084541 | F103567 | MLTLVSASKADNPDSFAVFYGEDCVGHIMRTQKSPPGKPWFWTIFVSDKHSSIVDRGYAATHEQAMADFEAQWLRLPRSVEKERAPGGCPPGLFTQT
Ga0207564_108454 | Ga0207564_1084542 | F057709 | MPEFDLKVALIIFVTKVIDPFAALPALVAGYFCRTWWQVVISAAVVGIFV
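
For downstream use, rows of the Sequences table can be split into fields and written out as FASTA. The following Python sketch is not part of NMPFamsDB. The names `parse_row` and `to_fasta` are hypothetical, and the ID shapes are assumptions inferred only from the rows on this page: a scaffold ID of `Ga` plus seven digits, an underscore and six digits; a protein ID equal to the scaffold ID plus a one-digit gene index; and a family ID of `F` plus six digits. The pattern accepts rows with or without ` | ` separators between fields.

```python
import re

# Assumed ID shapes, inferred from the rows on this page only.
ROW = re.compile(
    r"^(Ga\d{7}_\d{6})\s*\|?\s*"   # scaffold ID
    r"(Ga\d{7}_\d{7})\s*\|?\s*"    # protein ID (scaffold ID + gene index)
    r"(F\d{6})\s*\|?\s*"           # family ID
    r"([A-Z]+)$"                   # amino-acid sequence
)


def parse_row(line):
    """Split one Sequences-table row into its four fields."""
    match = ROW.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognised row: {line[:40]!r}")
    scaffold, protein, family, sequence = match.groups()
    if not protein.startswith(scaffold):
        raise ValueError("protein ID does not extend the scaffold ID")
    return {"scaffold": scaffold, "protein": protein,
            "family": family, "sequence": sequence}


def to_fasta(record, width=60):
    """Format a parsed row as a FASTA entry, wrapped at `width` residues."""
    header = (f">{record['protein']} family={record['family']} "
              f"scaffold={record['scaffold']}")
    seq = record["sequence"]
    lines = [seq[i:i + width] for i in range(0, len(seq), width)]
    return "\n".join([header] + lines)


if __name__ == "__main__":
    row = ("Ga0207564_101010 | Ga0207564_1010103 | F044132 | "
           "QTTMKQGNSTLQFGGQQSFGQRYNTDNIFNPYARDGR")
    print(to_fasta(parse_row(row)))
```

The header layout (`family=` and `scaffold=` tags after the protein ID) is an illustrative choice, not a convention of the database.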

© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice