logo logo
PTMD: A Database of Human Disease-associated Post-translational Modifications. Xu Haodong,Wang Yongbo,Lin Shaofeng,Deng Wankun,Peng Di,Cui Qinghua,Xue Yu Genomics, proteomics & bioinformatics Various posttranslational modifications (PTMs) participate in nearly all aspects of biological processes by regulating protein functions, and aberrant states of PTMs are frequently implicated in human diseases. Therefore, an integral resource of PTM-disease associations (PDAs) would be a great help for both academic research and clinical use. In this work, we reported PTMD, a well-curated database containing PTMs that are associated with human diseases. We manually collected 1950 known PDAs in 749 proteins for 23 types of PTMs and 275 types of diseases from the literature. Database analyses show that phosphorylation has the largest number of disease associations, whereas neurologic diseases have the largest number of PTM associations. We classified all known PDAs into six classes according to the PTM status in diseases and demonstrated that the upregulation and presence of PTM events account for a predominant proportion of disease-associated PTM events. By reconstructing a disease-gene network, we observed that breast cancers have the largest number of associated PTMs and AKT1 has the largest number of PTMs connected to diseases. Finally, the PTMD database was developed with detailed annotations and can be a useful resource for further analyzing the relations between PTMs and human diseases. PTMD is freely accessible at http://ptmd.biocuckoo.org. 10.1016/j.gpb.2018.06.004
dbPPT: a comprehensive database of protein phosphorylation in plants. Cheng Han,Deng Wankun,Wang Yongbo,Ren Jian,Liu Zexian,Xue Yu Database : the journal of biological databases and curation As one of the most important protein post-translational modifications, the reversible phosphorylation is critical for plants in regulating a variety of biological processes such as cellular metabolism, signal transduction and responses to environmental stress. Numerous efforts especially large-scale phosphoproteome profiling studies have been contributed to dissect the phosphorylation signaling in various plants, while a large number of phosphorylation events were identified. To provide an integrated data resource for further investigations, here we present a comprehensive database of dbPPT (database of Phosphorylation site in PlanTs, at http://dbppt.biocuckoo.org), which contains experimentally identified phosphorylation sites in proteins from plants. The phosphorylation sites in dbPPT were manually curated from the literatures, whereas datasets in other public databases were also integrated. In total, there were 82,175 phosphorylation sites in 31,012 proteins from 20 plant organisms in dbPPT, presenting a larger quantity of phosphorylation sites and a higher coverage of plant species in comparison with other databases. The proportions of residue types including serine, threonine and tyrosine were 77.99, 17.81 and 4.20%, respectively. All the phosphoproteins and phosphorylation sites in the database were critically annotated. Since the phosphorylation signaling in plants attracted great attention recently, such a comprehensive resource of plant protein phosphorylation can be useful for the research community. Database URL: http://dbppt.biocuckoo.or 10.1093/database/bau121
RiceSRTFDB: a database of rice transcription factors containing comprehensive expression, cis-regulatory element and mutant information to facilitate gene function analysis. Priya Pushp,Jain Mukesh Database : the journal of biological databases and curation Rice is one of the most important crop plants, representing the staple food for more than half the world's population. However, its productivity is challenged by various stresses, including drought and salinity. Transcription factors (TFs) represent a regulatory component of the genome and are the most important targets for engineering stress tolerance. Here, we constructed a database, RiceSRTFDB, which provides comprehensive expression information for rice TFs during drought and salinity stress conditions and various stages of development. This information will be useful to identify the target TF(s) involved in stress response at a particular stage of development. The curated information for cis-regulatory elements present in their promoters has also been provided, which will be important to study the binding proteins. In addition, we have provided the available mutants and their phenotype information for rice TFs. All these information have been integrated in the database to facilitate the selection of target TFs of interest for functional analysis. This database aims to accelerate functional genomics research of rice TFs and understand the regulatory mechanisms underlying abiotic stress responses. Database URL: http://www.nipgr.res.in/RiceSRTFDB.html 10.1093/database/bat027
Stress2TF: a manually curated database of TF regulation in plant response to stress. Zhang Xiaodan,Yao Chensong,Fu Sicong,Xuan Hongdong,Wen Shuxian,Liu Chao,Li Fangdong,Liu Aiguo,Bi Shoudong,Zhang Shihua,Li Shaowen Gene Considerable studies demonstrate that plant transcription factors (TFs) play key regulatory roles in abiotic/biotic stress conditions, such as drought and pathogen attack. However, there is no effort dedicated to curate experimentally validated stress-TF regulatory relationships from these individual reports into a central database, which put an obstacle in the exploration of stress-TF regulations in plants. To address this issue, we presented a literature-curated database 'Stress2TF' that currently documented 1533 regulatory relationships between 71 abiotic/biotic stresses and 558 TFs in 47 plant species. Each entry in Stress2TF contains detailed information about a stress-TF relationship such as plant name, stress name, TF and brief description of stress-TF relationship. Stress2TF provided a user-friendly interface for entry browse, search and download. In addition, a submission page and several useful tools (e.g., BLAST, network visualization) were integrated. Stress2TF may be a valuable resource for the research of stress-TF regulatory mechanisms in plants. Stress2TF is available at http://csgenomics.ahau.edu.cn/Stress2TF. 10.1016/j.gene.2017.09.067
STIFDB-Arabidopsis Stress Responsive Transcription Factor DataBase. Shameer K,Ambika S,Varghese Susan Mary,Karaba N,Udayakumar M,Sowdhamini R International journal of plant genomics Elucidating the key players of molecular mechanism that mediate the complex stress-responses in plants system is an important step to develop improved variety of stress tolerant crops. Understanding the effects of different types of biotic and abiotic stress is a rapidly emerging domain in the area of plant research to develop better, stress tolerant plants. Information about the transcription factors, transcription factor binding sites, function annotation of proteins coded by genes expressed during abiotic stress (for example: drought, cold, salinity, excess light, abscisic acid, and oxidative stress) response will provide better understanding of this phenomenon. STIFDB is a database of abiotic stress responsive genes and their predicted abiotic transcription factor binding sites in Arabidopsis thaliana. We integrated 2269 genes upregulated in different stress related microarray experiments and surveyed their 1000 bp and 100 bp upstream regions and 5'UTR regions using the STIF algorithm and identified putative abiotic stress responsive transcription factor binding sites, which are compiled in the STIFDB database. STIFDB provides extensive information about various stress responsive genes and stress inducible transcription factors of Arabidopsis thaliana. STIFDB will be a useful resource for researchers to understand the abiotic stress regulome and transcriptome of this important model plant system. 10.1155/2009/583429
PlantPReS: A database for plant proteome response to stress. Mousavi Seyed Ahmad,Pouya Farhad Movahedi,Ghaffari Mohammad Reza,Mirzaei Mehdi,Ghaffari Akram,Alikhani Mehdi,Ghareyazie Mohammad,Komatsu Setsuko,Haynes Paul A,Salekdeh Ghasem Hosseini Journal of proteomics UNLABELLED:About 75% of plant yield potential has been estimated to be lost to environmental stresses, even in developed agricultures. To facilitate the biotechnological improvement of crop productivity, genes and proteins that control crop adaptation to a wide range of environments will need to be identified. Due to the challenges faced in text/data mining, there is a large gap between the data available to researchers and the hundreds of published plant stress proteomics articles. Plant stress proteome database (PlantPReS; www.proteome.ir) is an open online proteomic database, which currently (as of October 2015) comprises >20,413 entries from 456 manually curated articles, and contains >10,600 unique stress responsive proteins. Since every aspect of the experiments, including protein name, accession number, plant type, tissue, stress types, organelles, and developmental stage has been digitized, experimental data can be rapidly accessed and integrated. Furthermore, PlantPReS enables researchers to perform multiple analyses on the database using the filtration mode, and the results of each query indicate a series of proteins for which a set of selected criteria is met. The query results can be displayed in either text or graphical format. SIGNIFICANCE:The promise of text and data mining to facilitate and enhance research fundamentally has not yet been achieved, mainly because great numbers of stress-associated proteins are not deposited in databases. PlantPReS is a valuable database for the vast majority of researchers working in proteomics and plant stress areas. It has a user-friendly interface with a number of useful features, including a search engine, analysis tools, gene ontology, a function for cross-referencing useful external databases, and the expression pattern of stress associated proteins. 10.1016/j.jprot.2016.03.009
Databases for plant phosphoproteomics. Schulze Waltraud X,Yao Qiuming,Xu Dong Methods in molecular biology (Clifton, N.J.) Phosphorylation is the most studied posttranslational modification involved in signal transduction in stress responses, development, and growth. In the recent years large-scale phosphoproteomic studies were carried out using various model plants and several growth and stress conditions. Here we present an overview of online resources for plant phosphoproteomic databases: PhosPhAt as a resource for Arabidopsis phosphoproteins, P3DB as a resource expanding to crop plants, and Medicago PhosphoProtein Database as a resource for the model plant Medicago trunculata. 10.1007/978-1-4939-2648-0_16
P³DB 3.0: From plant phosphorylation sites to protein networks. Yao Qiuming,Ge Huangyi,Wu Shangquan,Zhang Ning,Chen Wei,Xu Chunhui,Gao Jianjiong,Thelen Jay J,Xu Dong Nucleic acids research In the past few years, the Plant Protein Phosphorylation Database (P(3)DB, http://p3db.org) has become one of the most significant in vivo data resources for studying plant phosphoproteomics. We have substantially updated P(3)DB with respect to format, new datasets and analytic tools. In the P(3)DB 3.0, there are altogether 47 923 phosphosites in 16 477 phosphoproteins curated across nine plant organisms from 32 studies, which have met our multiple quality standards for acquisition of in vivo phosphorylation site data. Centralized by these phosphorylation data, multiple related data and annotations are provided, including protein-protein interaction (PPI), gene ontology, protein tertiary structures, orthologous sequences, kinase/phosphatase classification and Kinase Client Assay (KiC Assay) data--all of which provides context for the phosphorylation event. In addition, P(3)DB 3.0 incorporates multiple network viewers for the above features, such as PPI network, kinase-substrate network, phosphatase-substrate network, and domain co-occurrence network to help study phosphorylation from a systems point of view. Furthermore, the new P(3)DB reflects a community-based design through which users can share datasets and automate data depository processes for publication purposes. Each of these new features supports the goal of making P(3)DB a comprehensive, systematic and interactive platform for phosphoproteomics research. 10.1093/nar/gkt1135
P(3)DB: An Integrated Database for Plant Protein Phosphorylation. Yao Qiuming,Bollinger Curtis,Gao Jianjiong,Xu Dong,Thelen Jay J Frontiers in plant science Protein phosphorylation is widely recognized as the most widespread, enzyme-catalyzed post-translational modification in eukaryotes. In particular, plants have appropriated this signaling mechanism as evidenced by the twofold higher frequency of protein kinases within the genome compared to other eukaryotes. While all aspects of plant protein phosphorylation research have grown in the past 10 years; phosphorylation site mapping using high-resolution mass spectrometry has grown exponentially. In Arabidopsis alone there are thousands of experimentally determined phosphorylation sites. To archive these events in a user-intuitive format we have developed P(3)DB, the Plant Protein Phosphorylation Database (p3db.org). This database is a repository for plant protein phosphorylation site data, currently hosting information on 32,963 non-redundant sites collated from 23 experimental studies from six plant species. These data can be queried for a protein-of-interest using an integrated BLAST module to query similar sequences with known phosphorylation sites among the multiple plants currently investigated. The paper demonstrates how this resource can help identify functionally conserved phosphorylation sites in plants using a multi-system approach. 10.3389/fpls.2012.00206
P3DB: a plant protein phosphorylation database. Gao Jianjiong,Agrawal Ganesh Kumar,Thelen Jay J,Xu Dong Nucleic acids research P(3)DB (http://www.p3db.org/) provides a resource of protein phosphorylation data from multiple plants. The database was initially constructed with a dataset from oilseed rape, including 14,670 nonredundant phosphorylation sites from 6382 substrate proteins, representing the largest collection of plant phosphorylation data to date. Additional protein phosphorylation data are being deposited into this database from large-scale studies of Arabidopsis thaliana and soybean. Phosphorylation data from current literature are also being integrated into the P(3)DB. With a web-based user interface, the database is browsable, downloadable and searchable by protein accession number, description and sequence. A BLAST utility was integrated and a phosphopeptide BLAST browser was implemented to allow users to query the database for phosphopeptides similar to protein sequences of their interest. With the large-scale phosphorylation data and associated web-based tools, P(3)DB will be a valuable resource for both plant and nonplant biologists in the field of protein phosphorylation. 10.1093/nar/gkn733
SysPTM 2.0: an updated systematic resource for post-translational modification. Li Jing,Jia Jia,Li Hong,Yu Jian,Sun Han,He Ying,Lv Daqing,Yang Xiaojuan,Glocker Michael O,Ma Liangxiao,Yang Jiabei,Li Ling,Li Wei,Zhang Guoqing,Liu Qian,Li Yixue,Xie Lu Database : the journal of biological databases and curation Post-translational modifications (PTMs) of proteins play essential roles in almost all cellular processes, and are closely related to physiological activity and disease development of living organisms. The development of tandem mass spectrometry (MS/MS) has resulted in a rapid increase of PTMs identified on proteins from different species. The collection and systematic ordering of PTM data should provide invaluable information for understanding cellular processes and signaling pathways regulated by PTMs. For this original purpose we developed SysPTM, a systematic resource installed with comprehensive PTM data and a suite of web tools for annotation of PTMs in 2009. Four years later, there has been a significant advance with the generation of PTM data and, consequently, more sophisticated analysis requirements have to be met. Here we submit an updated version of SysPTM 2.0 (http://lifecenter.sgst.cn/SysPTM/), with almost doubled data content, enhanced web-based analysis tools of PTMBlast, PTMPathway, PTMPhylog, PTMCluster. Moreover, a new session SysPTM-H is constructed to graphically represent the combinatorial histone PTMs and dynamic regulation of histone modifying enzymes, and a new tool PTMGO is added for functional annotation and enrichment analysis. SysPTM 2.0 not only facilitates resourceful annotation of PTM sites but allows systematic investigation of PTM functions by the user. Database URL: http://lifecenter.sgst.cn/SysPTM/. 10.1093/database/bau025
SysPTM: a systematic resource for proteomic research on post-translational modifications. Li Hong,Xing Xiaobin,Ding Guohui,Li Qingrun,Wang Chuan,Xie Lu,Zeng Rong,Li Yixue Molecular & cellular proteomics : MCP With the rapid expansion of protein post-translational modification (PTM) research based on large-scale proteomic work, there is an increasing demand for a suitable repository to analyze PTM data. Here we present a curated, web-accessible PTM data base, SysPTM. SysPTM provides a systematic and sophisticated platform for proteomic PTM research equipped not only with a knowledge base of manually curated multi-type modification data but also with four fully developed, in-depth data mining tools. Currently, SysPTM contains data detailing 117,349 experimentally determined PTM sites on 33,421 proteins involving nearly 50 PTM types, curated from public resources including five data bases and four web servers and more than one hundred peer-reviewed mass spectrometry papers. Protein annotations including Pfam domains, KEGG pathways, GO functional classification, and ortholog groups are integrated into the data base. Four online tools have been developed and incorporated, including PTMBlast, to compare a user's PTM dataset with PTM data in SysPTM; PTMPathway, to map PTM proteins to KEGG pathways; PTMPhylog, to discover potentially conserved PTM sites; and PTMCluster, to find clusters of multi-site modifications. The workflow of SysPTM was demonstrated by analyzing an in-house phosphorylation dataset identified by MS/MS. It is shown that in SysPTM, the role of single-type and multi-type modifications can be systematically investigated in a full biological context. SysPTM could be an important contribution to modificomics research. SysPTM is freely available online at www.sysbio.ac.cn/SysPTM. 10.1074/mcp.M900030-MCP200
PLMD: An updated data resource of protein lysine modifications. Xu Haodong,Zhou Jiaqi,Lin Shaofeng,Deng Wankun,Zhang Ying,Xue Yu Journal of genetics and genomics = Yi chuan xue bao Post-translational modifications (PTMs) occurring at protein lysine residues, or protein lysine modifications (PLMs), play critical roles in regulating biological processes. Due to the explosive expansion of the amount of PLM substrates and the discovery of novel PLM types, here we greatly updated our previous studies, and presented a much more integrative resource of protein lysine modification database (PLMD). In PLMD, we totally collected and integrated 284,780 modification events in 53,501 proteins across 176 eukaryotes and prokaryotes for up to 20 types of PLMs, including ubiquitination, acetylation, sumoylation, methylation, succinylation, malonylation, glutarylation, glycation, formylation, hydroxylation, butyrylation, propionylation, crotonylation, pupylation, neddylation, 2-hydroxyisobutyrylation, phosphoglycerylation, carboxylation, lipoylation and biotinylation. Using the data set, a motif-based analysis was performed for each PLM type, and the results demonstrated that different PLM types preferentially recognize distinct sequence motifs for the modifications. Moreover, various PLMs synergistically orchestrate specific cellular biological processes by mutual crosstalks with each other, and we totally found 65,297 PLM events involved in 90 types of PLM co-occurrences on the same lysine residues. Finally, various options were provided for accessing the data, while original references and other annotations were also present for each PLM substrate. Taken together, we anticipated the PLMD database can serve as a useful resource for further researches of PLMs. PLMD 3.0 was implemented in PHP + MySQL and freely available at http://plmd.biocuckoo.org. 10.1016/j.jgg.2017.03.007
EPSD: a well-annotated data resource of protein phosphorylation sites in eukaryotes. Lin Shaofeng,Wang Chenwei,Zhou Jiaqi,Shi Ying,Ruan Chen,Tu Yiran,Yao Lan,Peng Di,Xue Yu Briefings in bioinformatics As an important post-translational modification (PTM), protein phosphorylation is involved in the regulation of almost all of biological processes in eukaryotes. Due to the rapid progress in mass spectrometry-based phosphoproteomics, a large number of phosphorylation sites (p-sites) have been characterized but remain to be curated. Here, we briefly summarized the current progresses in the development of data resources for the collection, curation, integration and annotation of p-sites in eukaryotic proteins. Also, we designed the eukaryotic phosphorylation site database (EPSD), which contained 1 616 804 experimentally identified p-sites in 209 326 phosphoproteins from 68 eukaryotic species. In EPSD, we not only collected 1 451 629 newly identified p-sites from high-throughput (HTP) phosphoproteomic studies, but also integrated known p-sites from 13 additional databases. Moreover, we carefully annotated the phosphoproteins and p-sites of eight model organisms by integrating the knowledge from 100 additional resources that covered 15 aspects, including phosphorylation regulator, genetic variation and mutation, functional annotation, structural annotation, physicochemical property, functional domain, disease-associated information, protein-protein interaction, drug-target relation, orthologous information, biological pathway, transcriptional regulator, mRNA expression, protein expression/proteomics and subcellular localization. We anticipate that the EPSD can serve as a useful resource for further analysis of eukaryotic phosphorylation. With a data volume of 14.1 GB, EPSD is free for all users at http://epsd.biocuckoo.cn/. 10.1093/bib/bbz169
PhosPhAt 4.0: An Updated Arabidopsis Database for Searching Phosphorylation Sites and Kinase-Target Interactions. Xi Lin,Zhang Zhaoxia,Schulze Waltraud X Methods in molecular biology (Clifton, N.J.) The PhosPhAt 4.0 database contains information on Arabidopsis phosphorylation sites identified by mass spectrometry in large-scale experiments from different research groups. So far PhosPhAt 4.0 has been one of the most significant large-scale data resources for plant phosphorylation studies. Functionalities of the web application, besides display of phosphorylation sites, include phosphorylation site prediction and kinase-target relationships retrieval. Here, we present an overview and user instructions for the PhosPhAt 4.0 database, with strong emphasis on recent renewals regarding protein annotation by SUBA4.0 and Mapman4, and additional phosphorylation site information imported from other databases, such as UniProt. Here, we provide a user guide for the retrieval of phosphorylation motifs from the kinase-target database and how to visualize these results. The improvements incorporated into the PhosPhAt 4.0 database have produced much more functionality and user flexibility for phosphoproteomic analysis. 10.1007/978-1-0716-1625-3_14
The Plant PTM Viewer 2.0: in-depth exploration of plant protein modification landscapes. Journal of experimental botany Post-translational modifications (PTMs) greatly increase protein diversity and functionality. To help the plant research community interpret the ever-increasing number of reported PTMs, the Plant PTM Viewer (https://www.psb.ugent.be/PlantPTMViewer) provides an intuitive overview of plant protein PTMs and the tools to assess it. This update includes 62 novel PTM profiling studies, adding a total of 112 000 modified peptides reporting plant PTMs, including 14 additional PTM types and three species (moss, tomato, and soybean). Furthermore, an open modification re-analysis of a large-scale Arabidopsis thaliana mass spectrometry tissue atlas identified previously uncharted landscapes of lysine acylations predominant in seed and flower tissues and 3-phosphoglycerylation on glycolytic enzymes in plants. An extra 'Protein list analysis' tool was developed for retrieval and assessing the enrichment of PTMs in a protein list of interest. We conducted a protein list analysis on nuclear proteins, revealing a substantial number of redox modifications in the nucleus, confirming previous assumptions regarding the redox regulation of transcription. We encourage the plant research community to use PTM Viewer 2.0 for hypothesis testing and new target discovery, and also to submit new data to expand the coverage of conditions, plant species, and PTM types, thereby enriching our understanding of plant biology. 10.1093/jxb/erae270
The Plant PTM Viewer, a central resource for exploring plant protein modifications. Willems Patrick,Horne Alison,Van Parys Thomas,Goormachtig Sofie,De Smet Ive,Botzki Alexander,Van Breusegem Frank,Gevaert Kris The Plant journal : for cell and molecular biology Post-translational modifications (PTMs) of proteins are central in any kind of cellular signaling. Modern mass spectrometry technologies enable comprehensive identification and quantification of various PTMs. Given the increased numbers and types of mapped protein modifications, a database is necessary that simultaneously integrates and compares site-specific information for different PTMs, especially in plants for which the available PTM data are poorly catalogued. Here, we present the Plant PTM Viewer (http://www.psb.ugent.be/PlantPTMViewer), an integrative PTM resource that comprises approximately 370 000 PTM sites for 19 types of protein modifications in plant proteins from five different species. The Plant PTM Viewer provides the user with a protein sequence overview in which the experimentally evidenced PTMs are highlighted together with an estimate of the confidence by which the modified peptides and, if possible, the actual modification sites were identified and with functional protein domains or active site residues. The PTM sequence search tool can query PTM combinations in specific protein sequences, whereas the PTM BLAST tool searches for modified protein sequences to detect conserved PTMs in homologous sequences. Taken together, these tools help to assume the role and potential interplay of PTMs in specific proteins or within a broader systems biology context. The Plant PTM Viewer is an open repository that allows the submission of mass spectrometry-based PTM data to remain at pace with future PTM plant studies. 10.1111/tpj.14345