Cytochrome P450 diversity in the tree of life

Research output: Contribution to journalArticle

31 Citations (Scopus)

Abstract

Sequencing in all areas of the tree of life has produced > 300,000 cytochrome P450 (CYP) sequences that have been mined and collected. Nomenclature has been assigned to > 41,000 CYP sequences and the majority of the remainder has been sorted by BLAST searches into clans, families and subfamilies in preparation for naming. The P450 sequence space is being systematically explored and filled in. Well-studied groups like vertebrates are covered in greater depth while new insights are being added into uncharted territories like horseshoe crab (Limulus polyphemus), tardigrades (Hypsibius dujardini), velvet worm (Euperipatoides_rowelli), and basal land plants like hornworts, liverworts and mosses. CYPs from the fungi, one of the most diverse groups, are being explored and organized as nearly 800 fungal species are now sequenced. The CYP clan structure in fungi is emerging with 805 CYP families sorting into 32 CYP clans. > 3000 bacterial sequences are named, mostly from terrestrial or freshwater sources. Of 18,379 bacterial sequences downloaded from the CYPED database, all are > 43% identical to named CYPs. Therefore, they fit in the 602 named P450 prokaryotic families. Diversity in this group is becoming saturated, however 25% of 3305 seawater bacterial P450s did not match known P450 families, indicating marine bacterial CYPs are not as well sampled as land/freshwater based bacterial CYPs. Future sequencing plans of the Genome 10 K project, i5k and GIGA (Global Invertebrate Genomics Alliance) are expected to produce more than one million cytochrome P450 sequences by 2020. This article is part of a Special Issue entitled: Cytochrome P450 biodiversity and biotechnology, edited by Erika Plettner, Gianfranco Gilardi, Luet Wong, Vlada Urlacher, Jared Goldstone.

Original languageEnglish (US)
Pages (from-to)141-154
Number of pages14
JournalBiochimica et Biophysica Acta - Proteins and Proteomics
Volume1866
Issue number1
DOIs
StatePublished - Jan 1 2018

Fingerprint

Cytochrome P-450 Enzyme System
Horseshoe Crabs
Fresh Water
Fungi
Anthocerotophyta
Hepatophyta
Embryophyta
Bryophyta
Biodiversity
Seawater
Invertebrates
Biotechnology
Terminology
Genomics
Sorting
Vertebrates
Genes
Genome
Databases

All Science Journal Classification (ASJC) codes

  • Analytical Chemistry
  • Biophysics
  • Biochemistry
  • Molecular Biology

Cite this

Cytochrome P450 diversity in the tree of life. / Nelson, David.

In: Biochimica et Biophysica Acta - Proteins and Proteomics, Vol. 1866, No. 1, 01.01.2018, p. 141-154.

Research output: Contribution to journalArticle

@article{d514c33e2aeb4e2f8411b3f69082d0e9,
title = "Cytochrome P450 diversity in the tree of life",
abstract = "Sequencing in all areas of the tree of life has produced > 300,000 cytochrome P450 (CYP) sequences that have been mined and collected. Nomenclature has been assigned to > 41,000 CYP sequences and the majority of the remainder has been sorted by BLAST searches into clans, families and subfamilies in preparation for naming. The P450 sequence space is being systematically explored and filled in. Well-studied groups like vertebrates are covered in greater depth while new insights are being added into uncharted territories like horseshoe crab (Limulus polyphemus), tardigrades (Hypsibius dujardini), velvet worm (Euperipatoides_rowelli), and basal land plants like hornworts, liverworts and mosses. CYPs from the fungi, one of the most diverse groups, are being explored and organized as nearly 800 fungal species are now sequenced. The CYP clan structure in fungi is emerging with 805 CYP families sorting into 32 CYP clans. > 3000 bacterial sequences are named, mostly from terrestrial or freshwater sources. Of 18,379 bacterial sequences downloaded from the CYPED database, all are > 43{\%} identical to named CYPs. Therefore, they fit in the 602 named P450 prokaryotic families. Diversity in this group is becoming saturated, however 25{\%} of 3305 seawater bacterial P450s did not match known P450 families, indicating marine bacterial CYPs are not as well sampled as land/freshwater based bacterial CYPs. Future sequencing plans of the Genome 10 K project, i5k and GIGA (Global Invertebrate Genomics Alliance) are expected to produce more than one million cytochrome P450 sequences by 2020. This article is part of a Special Issue entitled: Cytochrome P450 biodiversity and biotechnology, edited by Erika Plettner, Gianfranco Gilardi, Luet Wong, Vlada Urlacher, Jared Goldstone.",
author = "David Nelson",
year = "2018",
month = "1",
day = "1",
doi = "10.1016/j.bbapap.2017.05.003",
language = "English (US)",
volume = "1866",
pages = "141--154",
journal = "Biochimica et Biophysica Acta - Proteins and Proteomics",
issn = "1570-9639",
publisher = "Elsevier",
number = "1",

}

TY - JOUR

T1 - Cytochrome P450 diversity in the tree of life

AU - Nelson, David

PY - 2018/1/1

Y1 - 2018/1/1

N2 - Sequencing in all areas of the tree of life has produced > 300,000 cytochrome P450 (CYP) sequences that have been mined and collected. Nomenclature has been assigned to > 41,000 CYP sequences and the majority of the remainder has been sorted by BLAST searches into clans, families and subfamilies in preparation for naming. The P450 sequence space is being systematically explored and filled in. Well-studied groups like vertebrates are covered in greater depth while new insights are being added into uncharted territories like horseshoe crab (Limulus polyphemus), tardigrades (Hypsibius dujardini), velvet worm (Euperipatoides_rowelli), and basal land plants like hornworts, liverworts and mosses. CYPs from the fungi, one of the most diverse groups, are being explored and organized as nearly 800 fungal species are now sequenced. The CYP clan structure in fungi is emerging with 805 CYP families sorting into 32 CYP clans. > 3000 bacterial sequences are named, mostly from terrestrial or freshwater sources. Of 18,379 bacterial sequences downloaded from the CYPED database, all are > 43% identical to named CYPs. Therefore, they fit in the 602 named P450 prokaryotic families. Diversity in this group is becoming saturated, however 25% of 3305 seawater bacterial P450s did not match known P450 families, indicating marine bacterial CYPs are not as well sampled as land/freshwater based bacterial CYPs. Future sequencing plans of the Genome 10 K project, i5k and GIGA (Global Invertebrate Genomics Alliance) are expected to produce more than one million cytochrome P450 sequences by 2020. This article is part of a Special Issue entitled: Cytochrome P450 biodiversity and biotechnology, edited by Erika Plettner, Gianfranco Gilardi, Luet Wong, Vlada Urlacher, Jared Goldstone.

AB - Sequencing in all areas of the tree of life has produced > 300,000 cytochrome P450 (CYP) sequences that have been mined and collected. Nomenclature has been assigned to > 41,000 CYP sequences and the majority of the remainder has been sorted by BLAST searches into clans, families and subfamilies in preparation for naming. The P450 sequence space is being systematically explored and filled in. Well-studied groups like vertebrates are covered in greater depth while new insights are being added into uncharted territories like horseshoe crab (Limulus polyphemus), tardigrades (Hypsibius dujardini), velvet worm (Euperipatoides_rowelli), and basal land plants like hornworts, liverworts and mosses. CYPs from the fungi, one of the most diverse groups, are being explored and organized as nearly 800 fungal species are now sequenced. The CYP clan structure in fungi is emerging with 805 CYP families sorting into 32 CYP clans. > 3000 bacterial sequences are named, mostly from terrestrial or freshwater sources. Of 18,379 bacterial sequences downloaded from the CYPED database, all are > 43% identical to named CYPs. Therefore, they fit in the 602 named P450 prokaryotic families. Diversity in this group is becoming saturated, however 25% of 3305 seawater bacterial P450s did not match known P450 families, indicating marine bacterial CYPs are not as well sampled as land/freshwater based bacterial CYPs. Future sequencing plans of the Genome 10 K project, i5k and GIGA (Global Invertebrate Genomics Alliance) are expected to produce more than one million cytochrome P450 sequences by 2020. This article is part of a Special Issue entitled: Cytochrome P450 biodiversity and biotechnology, edited by Erika Plettner, Gianfranco Gilardi, Luet Wong, Vlada Urlacher, Jared Goldstone.

UR - http://www.scopus.com/inward/record.url?scp=85019862016&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85019862016&partnerID=8YFLogxK

U2 - 10.1016/j.bbapap.2017.05.003

DO - 10.1016/j.bbapap.2017.05.003

M3 - Article

C2 - 28502748

AN - SCOPUS:85019862016

VL - 1866

SP - 141

EP - 154

JO - Biochimica et Biophysica Acta - Proteins and Proteomics

JF - Biochimica et Biophysica Acta - Proteins and Proteomics

SN - 1570-9639

IS - 1

ER -