NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300026939

3300026939: Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G10A5-12 (SPAdes)



Overview

Basic Information
IMG/M Taxon OID: 3300026939
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0095510 | Gp0055684 | Ga0207542
Sample Name: Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G10A5-12 (SPAdes)
Sequencing Status: Permanent Draft
Sequencing Center: DOE Joint Genome Institute (JGI)
Published: N
Use Policy: Open

Dataset Contents
Total Genome Size: 26580529
Sequencing Scaffolds: 23
Novel Protein Genes: 24
Associated Families: 23

Dataset Phylogeny
Taxonomy Group | Number of Scaffolds
All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 3
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 2
All Organisms → cellular organisms → Bacteria → Proteobacteria | 1
Not Available | 11
All Organisms → cellular organisms → Bacteria | 2
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodobacterales → Rhodobacteraceae → Rhodovulum → unclassified Rhodovulum → Rhodovulum sp. PH10 | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1
All Organisms → cellular organisms → Bacteria → Terrabacteria group | 1
All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiaceae → Methyloceanibacter → Methyloceanibacter methanicus | 1

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts)
Type: Environmental
Taxonomy: Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil → Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts)

Alternative Ecosystem Assignments
Environment Ontology (ENVO): terrestrial biome, agricultural field, agricultural soil
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Soil (non-saline)

Location Information
Location: Wisconsin, United States
Coordinates: Lat. (°) 43.2958 | Long. (°) -89.3799 | Alt. (m) N/A | Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F000268 | Metagenome / Metatranscriptome | 1411 | Y
F001033 | Metagenome / Metatranscriptome | 799 | Y
F002616 | Metagenome / Metatranscriptome | 543 | Y
F003758 | Metagenome / Metatranscriptome | 470 | Y
F005366 | Metagenome / Metatranscriptome | 403 | Y
F006477 | Metagenome / Metatranscriptome | 372 | Y
F012929 | Metagenome / Metatranscriptome | 276 | Y
F015863 | Metagenome / Metatranscriptome | 251 | Y
F017759 | Metagenome | 239 | N
F020986 | Metagenome / Metatranscriptome | 221 | Y
F024822 | Metagenome / Metatranscriptome | 204 | N
F031563 | Metagenome | 182 | Y
F046968 | Metagenome / Metatranscriptome | 150 | Y
F054151 | Metagenome / Metatranscriptome | 140 | N
F056406 | Metagenome | 137 | N
F068085 | Metagenome | 125 | Y
F068641 | Metagenome / Metatranscriptome | 124 | Y
F075090 | Metagenome / Metatranscriptome | 119 | N
F080568 | Metagenome / Metatranscriptome | 115 | Y
F081321 | Metagenome / Metatranscriptome | 114 | N
F085779 | Metagenome / Metatranscriptome | 111 | N
F087420 | Metagenome / Metatranscriptome | 110 | Y
F102094 | Metagenome / Metatranscriptome | 102 | Y

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0207542_100164 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 1373
Ga0207542_100314 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1142
Ga0207542_100489 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1022
Ga0207542_100541 | Not Available | 989
Ga0207542_100737 | All Organisms → cellular organisms → Bacteria | 913
Ga0207542_100756 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodobacterales → Rhodobacteraceae → Rhodovulum → unclassified Rhodovulum → Rhodovulum sp. PH10 | 907
Ga0207542_100998 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 835
Ga0207542_101227 | Not Available | 786
Ga0207542_101275 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 777
Ga0207542_101757 | Not Available | 699
Ga0207542_101942 | All Organisms → cellular organisms → Bacteria | 678
Ga0207542_102236 | Not Available | 642
Ga0207542_102272 | Not Available | 639
Ga0207542_102487 | Not Available | 619
Ga0207542_102493 | Not Available | 619
Ga0207542_102634 | Not Available | 608
Ga0207542_102803 | Not Available | 596
Ga0207542_103060 | Not Available | 581
Ga0207542_103347 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 564
Ga0207542_103558 | Not Available | 553
Ga0207542_104003 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 532
Ga0207542_104314 | All Organisms → cellular organisms → Bacteria → Terrabacteria group | 521
Ga0207542_104481 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiaceae → Methyloceanibacter → Methyloceanibacter methanicus | 515

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0207542_100164 | Ga0207542_1001641 | F001033 | MHLTKMVPRSARIWTRQPKIERRFMKFVSLTKSTAVPHQILGTSTNEKRELLMCGHSLIARLTIAVFVFQMLGVTSVVHAETPDSTAGTSSAGTRKLFIDPSSTSVALRGKASLIVSPLTHRDGNYVGDYQLKVRPYFFKSEKGSLLLAASDDAVRKLQAGTAINFTGQAVTHKDGRTHIVLGRATPSSRDRGSVTFSIVTDDARIVFNTSYHFPAPRP
Ga0207542_100314 | Ga0207542_1003143 | F081321 | VMCTVLRAAALISLALVLVSPTHAACRGNCEPNVEVARAAMQQIFKQTFLSPYTLVSFERLDGRSGERYGGVFYEMRIRAVLHYDGVRLRCRRPSCPELHHYLLENDAASKKATVAGWLFLANDGDGWKTVPLTLQSPQ
Ga0207542_100384 | Ga0207542_1003841 | F102094 | MYMLIVVIGVLSQGASVLPVGVTSQIVGKFKNLDECKAAAKQPHA
Ga0207542_100489 | Ga0207542_1004891 | F003758 | DPRMSPKVLRAFENAEDEVLTNTLKGAPRLNVPSLFRPQFMEAVQKEHPEYFADLPPLK
Ga0207542_100541 | Ga0207542_1005412 | F024822 | VREPFIRIAGAILCALALSGCVDSSGPLLSDAQPVLGEQLRLQFYSLRKGTADEPEQATYKWDRGAYQRTGGGMTDIGSFSVHPLARDIFVVQSAAAKRPGVFEYAIARRLVDGVYQVIAIDEADAGRVTRARFCKRASDSSCRIETRNQLYAFARATAERKKGQGGLVLRLADGVAESS
Ga0207542_100737 | Ga0207542_1007372 | F000268 | MRVVAVMLLLSAGIAAEAVSYSFVSKASGRLGGPIRFEFYHDSTTRPKTDIKSFTVSMRTADDRWKAMWSILSGRGLTQPIEYGVTPPGFTTMIQPQKLIPGRVYAGFATDGHGGTSGVTFGFDKNGRMTFPDSFDQ
Ga0207542_100756 | Ga0207542_1007561 | F075090 | PVIVQEAFSLAGAIVAEGEIAKAAALKPDEIPNGVSGDTVWKVADNLFTISVLPKDAVPAEIGDLNDLIVGGDAQKCRGDFFAGAMLDVVESLTVARAYTNCRTQQAETSTYYFAMPRKSGGLYLLKTIATGVEVTPVADRTIKELEARVRSVITAALAKL
Ga0207542_100998 | Ga0207542_1009981 | F085779 | MKNSHCLLGTSGVAAAVLLYSLGAAVFAQDEPRRPLNPGLVPNTGQINTGAAPQPWSKSAEVDNIPAPAEAWAALVQPISRQPSAGGGATTTGTGGQQPPASGTINASSEPPPSGPIGSFGQTIPAKFSKRNDILDHLPTMAIPLPLTQEQRKQIYDAVMAEKSQPVVGADALEL
Ga0207542_101227 | Ga0207542_1012272 | F002616 | RPSYDLLYSYAHLTPRMQDTPMPNSFPNWTSRIVIAGRIAEYRRPENWNESTMPGDYHLPFGRVEAQAKAARWLWQSYII
Ga0207542_101275 | Ga0207542_1012752 | F006477 | SGLAFVAYKHPNAYRVMFIFAVPVLVMGGLIVLAIKIGDLNGSIKSIYHELPNIRKYALSDQLPYQIRRLYEVGQFLKVFVIYYISGFAYLVFLLVLGGFLDLARDRHLSLRDMERK
Ga0207542_101757 | Ga0207542_1017571 | F087420 | QMGPTLGIRAKRSAAGGAIVLVLCALPHAPASAAKPPYAGCVVVTKQEYDSAKKQHMLRTRYTEYVRTGLPGRRQYWYCR
Ga0207542_101942 | Ga0207542_1019422 | F046968 | MRIISFGTQDSKFDTPRMGIILDTNGRDSRYRLDCEKLFESADRPSNPLAWFDMDGRW
Ga0207542_102236 | Ga0207542_1022362 | F054151 | MRKVDQYTLADHFRALAEGLSLRAVSERAPAQRAELQRLAECYAELAKQQSPADHF
Ga0207542_102272 | Ga0207542_1022721 | F075090 | AKAAALKPDEIPDGVSGDTVWKVADNLFTISVLPKDAVPAEIGDLNDLIVGGNAQKCRGDFFAGAMLDVVESTTIARAYTTCQTQQAATSTYYFAMPRKQGGGLYLTKIIATGVEVPPTIERAIKELDAKVRGVITAALARL
Ga0207542_102487 | Ga0207542_1024872 | F020986 | MKTRDIATELDRAFAAARVKGRMGGVVLCDTIAPLYAIHKALKATHASGELTDQQYTEKGRELLEILGNAVVSLLIQQAMSKHNH
Ga0207542_102493 | Ga0207542_1024931 | F015863 | LVFGASYASALINRVMGPWSSTAIHQDGSLSHMQFGVDLPRPEWVPVYPGAWVVGGSKITSVEHPAGFHGLDLGTRASLDEVKRFYTEQLTAAGFEVSDLGLMGLNPMTAAYLGVDGMLSAKRHATDDAIDVQIRTPDGIIPSRLLQIHWRKISATLAPPVAAHPRASEGPAAN
Ga0207542_102634 | Ga0207542_1026342 | F068085 | MIVLKYILLFTCLGIGLALCVGVISILRSPPDNGPPAWFAAAFGAMFFWG
Ga0207542_102803 | Ga0207542_1028032 | F017759 | IEAPESSLSGAVTVPGSSPVLLFGVVVGLLNSECPQQDRAYESKYGADSQHIELQGKVHGSASLVDALRLARNDPAPKAPVTRPAFPAGGIAYRTCAIDNRLIERLKKSEGPKILIVQNSCDTEFFMANLRQIPSILRGNDSTGFVAKEI
Ga0207542_103060 | Ga0207542_1030601 | F080568 | IKVVKSFTYRGQTRLFSNRYHFNGGLPPDSAHWTTLSDAIVTAEKAIYFAPQIVHTYGYAAGSEVPIFSKAYTTAGTLALGTQERCPGDCAGLIRYATTARTSKNHPVYLFNYYHGVVAVGASFDDVGAVQASAYSTYAGLWIAGFSDGATTYNRAGPNGASAVGSLVEPYITHRDLPR
Ga0207542_103347 | Ga0207542_1033471 | F068641 | MKSAVIELIQIALISATTLEVSAADTNYELKPISLPGATGTIALDYFAYDHATGKVWVPASNTGRVDVIDDATDAVSQVTGFATGEIERR
Ga0207542_103558 | Ga0207542_1035582 | F056406 | APKPPVSVTEQYTGSIIIVPTRGEDCRQMMLDNRTGRMWDKGIVNCYEAVSRPEKGQRGGMSSLRMNAIGKAFNRRDE
Ga0207542_104003 | Ga0207542_1040031 | F012929 | FAAVLVGAAVLFGLEQQFGVKLYLAIPAAIAVYFATLIVLTLAFGSGNQTK
Ga0207542_104314 | Ga0207542_1043142 | F031563 | MCGHRHHHRGLGRRGRRFPNREEWIRRLEERQRDLEQEIADLADVIKHLKSGETPEG
Ga0207542_104481 | Ga0207542_1044811 | F005366 | MYPLTLQAQIMRHMLNLAKDMASGDLLESRKKLDEYPLCGDPSVFDQIMRARAVQDRMWGREVDDTKNDPWRWSTYISQCAVRWLRDPHKWTREDTDDFYDAMIETAAICAAAAESVVRQRNTNGRTFYEPGSGKGK
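
For downstream use, the rows of the Sequences table can be turned into FASTA records. The following is a minimal sketch, not a tool provided by NMPFamsDB: the `ProteinRecord` type, the `to_fasta` function, and the header layout (`>protein_id scaffold=... family=...`) are assumptions made for illustration. The two records are copied from the table above.

```python
from typing import Iterable, NamedTuple


class ProteinRecord(NamedTuple):
    """One row of the Sequences table (hypothetical container, not a database schema)."""
    scaffold_id: str
    protein_id: str
    family_id: str
    sequence: str


def to_fasta(records: Iterable[ProteinRecord], width: int = 60) -> str:
    """Render records as FASTA text, wrapping each sequence at `width` residues."""
    lines = []
    for rec in records:
        # Header layout is an assumption: protein ID first, then scaffold and family.
        lines.append(f">{rec.protein_id} scaffold={rec.scaffold_id} family={rec.family_id}")
        for start in range(0, len(rec.sequence), width):
            lines.append(rec.sequence[start:start + width])
    return "".join(line + "\n" for line in lines)


# Two rows taken from the Sequences table of sample 3300026939.
records = [
    ProteinRecord("Ga0207542_100384", "Ga0207542_1003841", "F102094",
                  "MYMLIVVIGVLSQGASVLPVGVTSQIVGKFKNLDECKAAAKQPHA"),
    ProteinRecord("Ga0207542_104003", "Ga0207542_1040031", "F012929",
                  "FAAVLVGAAVLFGLEQQFGVKLYLAIPAAIAVYFATLIVLTLAFGSGNQTK"),
]

fasta_text = to_fasta(records)
print(fasta_text, end="")
```

Placing the protein ID first in the header means tools that read only the first token as the identifier will see it, while the scaffold and family remain available in the description.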




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice