Documentation

The Bio3D package for structural bioinformatics consists of sets of functions for:

Major functions are listed below with links to further documentation that includes example code and results.

Note that you can also get help on any particular function by using the command ?function or help(function) (e.g. help(read.pdb)) and directly execute the example code for a given function with the command example(function) from within R itself.

We also distribute a number of extended Bio3D vignettes that provide worked examples of using Bio3D to perform a particular type of analysis. Currently available vignettes include:

  • Installing Bio3D ( PDF | HTML)
  • Getting started with Bio3D ( PDF | HTML )
  • PDB structure manipulation and analysis with Bio3D ( PDF | HTML)
  • Beginning trajectory analysis with Bio3D ( PDF | HTML)
  • Enhanced methods for Normal Mode Analysis with Bio3D ( PDF | HTML)
  • Comparative sequence and structure analysis with Bio3D ( PDF | HTML)
  • Correlation network analysis with Bio3D ( PDF | HTML )
  • Protein structure network analysis with Bio3D ( PDF | HTML )

There is also a package manual (in PDF format) that is a concatenation of each functions documentation.

Note that for information on Bio3D development status or to report a bug, please refer to: https://bitbucket.org/Grantlab/bio3d

Help topics

Input/Output:

Read and Write Common Biomolecular Data Types

  • read.pdb(print.pdb, summary.pdb): Read PDB File
  • read.fasta: Read FASTA formated Sequences
  • read.fasta.pdb: Read Aligned Structure Data
  • read.ncdf: Read AMBER Binary netCDF files
  • read.dcd: Read CHARMM/X-PLOR/NAMD Binary DCD files
  • read.crd: Read Coordinate Data from Amber or Charmm
  • read.pqr: Read PQR File
  • read.mol2(print.mol2): Read MOL2 File
  • read.all: Read Aligned Structure Data
  • read.pdcBD: Read PQR output from pdcBD File
  • aln2html: Create a HTML Page For a Given Alignment
  • get.pdb: Download PDB Coordinate Files
  • get.seq: Download FASTA Sequence Files
  • load.enmff(ff.calpha, ff.calphax, ff.anm, ff.pfanm, ff.sdenm, ff.reach): ENM Force Field Loader
  • write.pdb: Write PDB Format Coordinate File
  • write.crd: Write CRD File
  • write.fasta: Write FASTA Formated Sequences
  • write.ncdf: Write AMBER Binary netCDF files
  • write.pqr: Write PQR Format Coordinate File
  • write.pir: Write PIR Formated Sequences
  • view.dccm: Visualization of Dynamic Cross-Correlation
  • view.cna: View CNA Protein Structure Network Community Output in VMD

Sequence Analysis:

Do Interesting Things with Protein Sequence

  • consensus: Sequence Consensus for an Alignment
  • conserv: Score Residue Conservation At Each Position in an Alignment
  • blast.pdb(get.blast, plot.blast): NCBI BLAST Sequence Search and Summary Plot of Hit Statistics
  • hmmer: HMMER Sequence Search
  • pfam: Download Pfam FASTA Sequence Alignment
  • uniprot: Fetch UniProt Entry Data.
  • entropy: Shannon Entropy Score
  • seqidentity: Percent Identity
  • motif.find: Find Sequence Motifs.
  • pdbaln: Sequence Alignment of PDB Files
  • seq2aln: Add a Sequence to an Existing Alignmnet
  • seqaln: Sequence Alignment with MUSCLE
  • seqaln.pair: Sequence Alignment of Identical Protein Sequences
  • seqbind: Combine Sequences by Rows Without Recycling

Structure Analysis:

Do Interesting Things with Protein Structure

  • angle.xyz: Calculate the Angle Between Three Atoms
  • biounit:Biological Units Construction
  • blast.pdb(get.blast, plot.blast): NCBI BLAST Sequence Search and Summary Plot of Hit Statistics
  • atom.select(atom.select.pdb, atom.select.prmtop, print.select): Atom Selection from PDB and PRMTOP Structure Objects
  • combine.select: Combine Atom Selections From PDB Structure
  • cmap(cmap.default, cmap.xyz, cmap.pdb): Contact Map
  • core.find(core.find.default, core.find.pdbs, core.find.pdb): Identification of Invariant Core Positions
  • com(com.pdb, com.xyz): Center of Mass
  • dccm: DCCM: Dynamical Cross-Correlation Matrix
  • dist.xyz: Calculate the Distances Between the Rows of Two Matrices
  • dm(dm.pdb, dm.xyz): Distance Matrix Analysis
  • dssp(dssp.pdb, dssp.pdbs, dssp.xyz, stride, print.sse): Secondary Structure Analysis with DSSP or STRIDE
  • geostas(geostas.default, geostas.xyz, geostas.nma, geostas.enma, geostas.pdb, geostas.pdbs, amsm.xyz, print.geostas): GeoStaS Domain Finder
  • mustang: Structure-based Sequence Alignment with MUSTANG
  • fit.xyz(rot.lsq): Coordinate Superposition
  • binding.site: Binding Site Residues
  • mktrj(mktrj.pca, mktrj.nma, mktrj.enma): PCA / NMA Atomic Displacement Trajectory
  • overlap: Overlap analysis
  • pca: Principal Component Analysis
  • pca.xyz(print.pca): Principal Component Analysis
  • pca.pdbs: Principal Component Analysis
  • pca.array: Principal Component Analysis of an arrary of matrices
  • pca.tor: Principal Component Analysis
  • dccm.pca: Dynamic Cross-Correlation from Principal Component Analysis
  • project.pca(z2xyz.pca, xyz2z.pca): Project Data onto Principal Components
  • pdbaln: Sequence Alignment of PDB Files
  • pdb.annotate: Get Customizable Annotations From PDB
  • pdb2aln: Align a PDB structure to an existing alignment
  • pdb2aln.ind: Mapping from alignment positions to PDB atomic indices
  • pdbfit(pdbfit.pdb, pdbfit.pdbs): PDB File Coordinate Superposition
  • chain.pdb: Find Possible PDB Chain Breaks
  • convert.pdb:Renumber and Convert Between Various PDB formats
  • rgyr: Radius of Gyration
  • rmsd: Root Mean Square Deviation
  • rmsf: Atomic RMS Fluctuations
  • rmsip(rmsip.default, rmsip.enma): Root Mean Square Inner Product
  • struct.aln: Structure Alignment Of Two PDB Files
  • torsion.pdb: Calculate Mainchain and Sidechain Torsion/Dihedral Angles
  • torsion.xyz: Calculate Torsion/Dihedral Angles
  • wrap.tor: Wrap Torsion Angle Data
  • aa2mass: Amino Acid Residues to Mass Converter
  • aa.table: Table of Relevant Amino Acids
  • atom.index: Atom Names/Types
  • atom2mass(atom2mass.default, atom2mass.pdb): Atom Names/Types to Mass Converter
  • atom2ele(atom2ele.default, atom2ele.pdb): Atom Names/Types to Atomic Symbols Converter
  • cov.nma(cov.enma): Calculate Covariance Matrix from Normal Modes
  • dccm.enma: Cross-Correlation for Ensemble NMA (eNMA)
  • dccm.nma: Dynamic Cross-Correlation from Normal Modes Analysis
  • dccm.xyz(cov2dccm): DCCM: Dynamical Cross-Correlation Matrix
  • deformation.nma: Deformation Analysis
  • fluct.nma: NMA Fluctuations
  • inner.prod: Mass-weighted Inner Product
  • load.enmff(ff.calpha, ff.calphax, ff.anm, ff.pfanm, ff.sdenm, ff.reach): ENM Force Field Loader
  • nma: Normal Mode Analysis
  • nma.pdb(build.hessian, print.nma): Normal Mode Analysis
  • nma.pdbs(print.enma): Ensemble Normal Mode Analysis
  • normalize.vector: Mass-Weighted Normalized Vector
  • pdbs2pdb: PDBs to PDB Converter
  • plot.enma: Plot eNMA Results
  • plot.nma: Plot NMA Results
  • plot.rmsip: Plot RMSIP Results
  • sdENM: Index for the sdENM ff
  • sse.bridges: SSE Backbone Hydrogen Bonding
  • view.dccm: Visualization of Dynamic Cross-Correlation
  • view.modes: Vector Field Visualization of Modes
  • var.xyz(var.pdbs): Pairwise Distance Variance in Cartesian Coordinates
  • inspect.connectivity: Check the Connectivity of Protein Structures

Trajectory Analysis:

Do Interesting Things with Simulation Data

  • angle.xyz: Calculate the Angle Between Three Atoms
  • cmap(cmap.default, cmap.xyz, cmap.pdb): Contact Map
  • core.find(core.find.default, core.find.pdbs, core.find.pdb): Identification of Invariant Core Positions
  • dccm: DCCM: Dynamical Cross-Correlation Matrix
  • dccm.pca: Dynamic Cross-Correlation from Principal Component Analysis
  • lmi: LMI: Linear Mutual Information Matrix
  • dist.xyz: Calculate the Distances Between the Rows of Two Matrices
  • dm(dm.pdb, dm.xyz): Distance Matrix Analysis
  • geostas(geostas.default, geostas.xyz, geostas.nma, geostas.enma, geostas.pdb, geostas.pdbs, amsm.xyz, print.geostas): GeoStaS Domain Finder
  • fit.xyz(rot.lsq): Coordinate Superposition
  • mktrj(mktrj.pca, mktrj.nma, mktrj.enma): PCA / NMA Atomic Displacement Trajectory
  • overlap: Overlap analysis
  • project.pca(z2xyz.pca, xyz2z.pca): Project Data onto Principal Components
  • pca.tor: Principal Component Analysis
  • pca.xyz(print.pca): Principal Component Analysis
  • pdbaln: Sequence Alignment of PDB Files
  • rgyr: Radius of Gyration
  • rmsd: Root Mean Square Deviation
  • rmsf: Atomic RMS Fluctuations
  • rmsip(rmsip.default, rmsip.enma): Root Mean Square Inner Product
  • torsion.pdb: Calculate Mainchain and Sidechain Torsion/Dihedral Angles
  • torsion.xyz: Calculate Torsion/Dihedral Angles
  • wrap.tor: Wrap Torsion Angle Data

Normal Mode Analysis:

Probe Large-Scale Protein Motions

  • aa2mass: Amino Acid Residues to Mass Converter
  • aa.table: Table of Relevant Amino Acids
  • atom.index: Atom Names/Types
  • atom2mass(atom2mass.default, atom2mass.pdb): Atom Names/Types to Mass Converter
  • atom2ele(atom2ele.default, atom2ele.pdb): Atom Names/Types to Atomic Symbols Converter
  • bhattacharyya(bhattacharyya.nma, bhattacharyya.pca, bhattacharyya.enma, bhattacharyya.array, bhattacharyya.matrix): Bhattacharyya Coefficient
  • cov.nma(cov.enma): Calculate Covariance Matrix from Normal Modes
  • covsoverlap(covsoverlap.enma, covsoverlap.nma): Covariance Overlap
  • dccm.enma: Cross-Correlation for Ensemble NMA (eNMA)
  • dccm.nma: Dynamic Cross-Correlation from Normal Modes Analysis
  • dccm.xyz(cov2dccm): DCCM: Dynamical Cross-Correlation Matrix
  • deformation.nma: Deformation Analysis
  • geostas(geostas.default, geostas.xyz, geostas.nma, geostas.enma, geostas.pdb, geostas.pdbs, amsm.xyz, print.geostas): GeoStaS Domain Finder
  • fluct.nma: NMA Fluctuations
  • inner.prod: Mass-weighted Inner Product
  • load.enmff(ff.calpha, ff.calphax, ff.anm, ff.pfanm, ff.sdenm, ff.reach): ENM Force Field Loader
  • mktrj(mktrj.pca, mktrj.nma, mktrj.enma): PCA / NMA Atomic Displacement Trajectory
  • nma: Normal Mode Analysis
  • nma.pdb(build.hessian, print.nma): Normal Mode Analysis
  • nma.pdbs(print.enma): Ensemble Normal Mode Analysis
  • normalize.vector: Mass-Weighted Normalized Vector
  • pdbs2pdb: PDBs to PDB Converter
  • plot.enma: Plot eNMA Results
  • plot.nma: Plot NMA Results
  • plot.rmsip: Plot RMSIP Results
  • sdENM: Index for the sdENM ff
  • sse.bridges: SSE Backbone Hydrogen Bonding
  • sip(sip.default, sip.nma, sip.enma): Square Inner Product
  • var.xyz(var.pdbs): Pairwise Distance Variance in Cartesian Coordinates
  • view.dccm: Visualization of Dynamic Cross-Correlation
  • view.modes: Vector Field Visualization of Modes

Correlation Network Analysis:

Network analysis of dynamic coupling

  • cna(cna.dccm, cna.ensmb): Protein Dynamic Correlation Network Construction and Community Analysis.
  • cnapath(summary.cnapath, print.cnapath, view.cnapath): Suboptimal Path Analysis for Correlation Networks
  • dccm: DCCM: Dynamical Cross-Correlation Matrix
  • lmi: LMI: Linear Mutual Information Matrix
  • cmap(cmap.default, cmap.xyz, cmap.pdb): Contact Map
  • community.tree: Reconstruction of the Girvan-Newman Community Tree for a CNA Class Object.
  • network.amendment: Amendment of a CNA Network According To A Input Community Membership Vector.
  • view.cna: View CNA Protein Structure Network Community Output in VMD
  • view.dccm: Visualization of Dynamic Cross-Correlation
  • plot.cna: Protein Structure Network Plots in 2D and 3D.
  • print.cna(summary.cna): Summarize and Print Features of a cna Network Graph
  • identify.cna: Identify Points in a CNA Protein Structure Network Plot
  • layout.cna: Protein Structure Network Layout
  • prune.cna: Prune A cna Network Object

Graphics:

Plotting and Graphic Display

Utilities:

Convert and Manipulate Data

  • aa.index: AAindex: Amino Acid Index Database
  • aa123(aa321): Convert Between 1-letter and 3-letter Aminoacid Codes
  • aa2index: Convert an Aminoacid Sequence to AAIndex Values
  • aln2html: Create a HTML Page For a Given Alignment
  • as.fasta: Alignment to FASTA object
  • as.pdb(as.pdb.mol2, as.pdb.prmtop, as.pdb.default): Convert to PDB format
  • as.select: Convert Atomic Indices to a Select Object
  • atom.select(atom.select.pdb, atom.select.prmtop, print.select): Atom Selection from PDB and PRMTOP Structure Objects
  • combine.select: Combine Atom Selections From PDB Structure
  • atom2xyz(xyz2atom): Convert Between Atom and xyz Indices
  • basename.pdb:
  • bio3d-package(bio3d): Biological Structure Analysis
  • biounit:Biological Units Construction
  • bounds: Bounds of a Numeric Vector
  • bounds.sse:Obtain A SSE Object From An SSE Sequence Vector
  • cat.pdb: Concatenate Multiple PDB Objects
  • check.utility: Check on Missing Utility Programs
  • clean.pdb:Inspect And Clean Up A PDB Object
  • chain.pdb: Find Possible PDB Chain Breaks
  • convert.pdb:Renumber and Convert Between Various PDB formats
  • diag.ind: Diagonal Indices of a Matrix
  • difference.vector: Difference Vector
  • gap.inspect: Alignment Gap Summary
  • inspect.connectivity: Check the Connectivity of Protein Structures
  • is.gap: Gap Characters
  • is.pdb(is.pdbs): Is an Object of Class
  • is.select:Is an Object of Class
  • is.xyz(as.xyz): Is an Object of Class
  • lbio3d: List all Functions in the bio3d Package
  • orient.pdb: Orient a PDB Structure
  • pairwise: Pair Indices
  • plot.bio3d(plotb3): Plots with marginal SSE annotation
  • print.core: Printing Core Positions and Returning Indices
  • print.cna(summary.cna): Summarize and Print Features of a cna Network Graph
  • print.fasta(.print.fasta.ali): Printing Sequence Alignments
  • print.xyz: Printing XYZ coordinates
  • rle2(print.rle2):Run Length Encoding with Indices
  • pdbseq: Extract The Aminoacid Sequence From A PDB Object
  • seqbind: Combine Sequences by Rows Without Recycling
  • pdbsplit: Split a PDB File Into Separate Files, One For Each Chain.
  • store.atom: Store all-atom data from a PDB object
  • unbound: Sequence Generation from a Bounds Vector
  • vec2resno: Replicate Per-residue Vector Values
  • setup.ncore: Setup for Running Bio3D Functions using Multiple CPU Cores
  • elements:Periodic Table of the Elements
  • formula2mass:Chemical Formula to Mass Converter

Example Data:

Bio3d Example Data

  • example.data(kinesin, transducin, pdbs, core, annotation, hivp):Bio3d Example Data

Other