The Protein Data Bank Archive

Methods Mol Biol. 2021:2305:3-21. doi: 10.1007/978-1-0716-1406-8_1.

Abstract

The Protein Data Bank (PDB) is the single worldwide archive of experimentally determined macromolecular structure data. Established in 1971 as the first open-access data resource in biology, the PDB archive is managed by the worldwide Protein Data Bank (wwPDB) consortium, which has four partners: the RCSB Protein Data Bank (RCSB PDB; rcsb.org), the Protein Data Bank Japan (PDBj; pdbj.org), the Protein Data Bank in Europe (PDBe; pdbe.org), and BioMagResBank (BMRB; www.bmrb.wisc.edu). The PDB archive currently includes ~175,000 entries. The wwPDB has established a number of task forces and working groups that bring together experts from the community, who provide recommendations on data standards and data validation to improve data quality and integrity. The wwPDB members continue to develop the joint deposition, biocuration, and validation system (OneDep) to improve data quality and accommodate new data from emerging techniques such as 3DEM. Each PDB entry contains the coordinate model and associated metadata for all experimentally determined atomic structures, experimental data for the traditional structure determination techniques (X-ray crystallography and nuclear magnetic resonance (NMR) spectroscopy), validation reports, and additional information on quaternary structures. The wwPDB partners are committed to following the FAIR (Findability, Accessibility, Interoperability, and Reusability) principles and have implemented a DOI resolution mechanism that provides access to all the relevant files for a given PDB entry. On average, >250 new entries are added to the archive every week and made available by each wwPDB partner via its FTP area. The wwPDB partner sites also develop data access and analysis tools and make these available via their websites. The wwPDB continues to work with experts in the community to establish a federation of archives for structures determined using integrative/hybrid methods, in which multiple experimental techniques are combined.

Keywords: 3DEM; Deposition, biocuration, and validation system (OneDep); Integrative/hybrid methods; Macromolecular structure archive; NMR spectroscopy; PDBx/mmCIF; Protein Data Bank; Validation task forces; Worldwide Protein Data Bank (wwPDB); X-ray crystallography.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.
  • Review

MeSH terms

  • Crystallography, X-Ray
  • Data Accuracy
  • Data Curation*
  • Databases, Protein*
  • Europe
  • Japan
  • Macromolecular Substances / chemistry*
  • Models, Molecular*
  • Nuclear Magnetic Resonance, Biomolecular
  • Protein Conformation
  • Proteins / chemistry
  • Reproducibility of Results
  • User-Computer Interface

Substances

  • Macromolecular Substances
  • Proteins