PepX was constructed from the Protein Data Bank. We filtered for protein-peptide complexes requiring
In PepX, we define a peptide as follows:
All the protein-peptide complexes in PepX were clustered on their binding sites using hierarchical agglomerative clustering, the same algorithm used to construct BriX. The distance matrix used in the clustering contains the RMSD values between any two protein-peptide binding sites, computed with Mustang.
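To make the shape of that distance matrix concrete, here is a minimal sketch in Python. It is not the PepX pipeline: it assumes the binding sites have already been superposed and reduced to equal-length lists of matched coordinates (the step Mustang performs in PepX), and the names `rmsd`, `distance_matrix`, and the toy coordinates are hypothetical.

```python
import math

def rmsd(a, b):
    """RMSD between two equal-length lists of (x, y, z) points.

    Assumption: the two point sets are already superposed and matched
    one-to-one; no alignment is performed here.
    """
    if len(a) != len(b) or not a:
        raise ValueError("coordinate sets must be non-empty and of equal length")
    squared = sum(
        (p - q) ** 2
        for point_a, point_b in zip(a, b)
        for p, q in zip(point_a, point_b)
    )
    return math.sqrt(squared / len(a))

def distance_matrix(sites):
    """Symmetric matrix of pairwise RMSD values with a zero diagonal."""
    n = len(sites)
    matrix = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            matrix[i][j] = matrix[j][i] = rmsd(sites[i], sites[j])
    return matrix

# Toy binding sites (hypothetical coordinates in Å, two atoms each).
sites = [
    [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)],
    [(0.0, 0.0, 1.0), (1.0, 0.0, 1.0)],
    [(0.0, 3.0, 0.0), (1.0, 3.0, 0.0)],
]
matrix = distance_matrix(sites)
```

Each entry `matrix[i][j]` holds the RMSD between binding sites `i` and `j`, so the clustering step only ever needs this matrix, not the coordinates themselves.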
The Alignment value expresses the percentage of the binding site of the protein-peptide complex that is used in clustering. The higher the alignment, the more of the binding site is used in clustering, and thus the more clusters there will be.
The Threshold value is the maximum allowed root-mean-square deviation (RMSD) between two PDB structures, expressed in Ångström (Å). For tighter clustering, which generates more clusters, choose a small value (e.g. 1 Å). If you need fewer clusters, choose a higher value (e.g. 2 Å).
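The effect of the threshold can be illustrated with a small Python sketch of agglomerative clustering on a precomputed RMSD matrix. Several things here are assumptions rather than PepX details: the linkage rule is not stated above, so complete linkage is used (clusters are merged only while every pair of members stays within the threshold), and the function `cluster` and the values in `rmsd_matrix` are hypothetical.

```python
def cluster(dist, threshold):
    """Agglomerative clustering on a symmetric distance matrix.

    Assumption: complete linkage, i.e. the distance between two clusters
    is the largest pairwise distance between their members. Merging stops
    once the closest pair of clusters is farther apart than `threshold`.
    """
    clusters = [[i] for i in range(len(dist))]

    def linkage(c1, c2):
        return max(dist[i][j] for i in c1 for j in c2)

    while len(clusters) > 1:
        # Find the closest pair of clusters under complete linkage.
        best = None
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                d = linkage(clusters[a], clusters[b])
                if best is None or d < best[0]:
                    best = (d, a, b)
        if best[0] > threshold:
            break
        _, a, b = best
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b]
    return clusters

# Hypothetical pairwise RMSD values in Å for four binding sites.
rmsd_matrix = [
    [0.0, 0.8, 1.6, 4.0],
    [0.8, 0.0, 1.5, 4.2],
    [1.6, 1.5, 0.0, 3.9],
    [4.0, 4.2, 3.9, 0.0],
]

tight = cluster(rmsd_matrix, 1.0)  # [[0, 1], [2], [3]]
loose = cluster(rmsd_matrix, 2.0)  # [[0, 1, 2], [3]]
```

With a threshold of 1 Å the four sites fall into three clusters, while 2 Å merges them into two, which matches the guidance above: a smaller threshold gives tighter and more numerous clusters.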