add pdb matching function #84

@LilySnow

PDB matching is a very useful step in PDB analysis. It would be nice to add it to pdb2sql.

Expected behavior:

INPUT:

  • a reference pdb file with multiple chains
  • a set of pdb files for the same protein complex but with different numbering and chain IDs

OUTPUT:

  • chain ID mapping
  • pdb files renumbered to match the reference pdb, with chain IDs also changed to match the reference pdb

Ideally, we hope to separate pdb_matching into two functions (steps):

Step 1. pdb_match_chn_batch.py: match chain IDs of pdb files to ref.pdb. Output _newChnID.pdb files.
Note: This step can be skipped if the chain IDs of the model pdb files already match ref.pdb. This step is also error-prone when multiple chains are highly similar to each other, so a visual check by a human is necessary.
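
To make Step 1 concrete, here is a minimal sketch of how chain matching could work. It is not taken from either existing solution: the function names `chain_sequences` and `match_chains`, the `margin` parameter, and the use of `difflib.SequenceMatcher` as a stand-in for a real sequence alignment are all assumptions made for illustration. It also ignores alternate locations and HETATM records. Chains whose best and second-best scores are nearly equal are reported so that they can be checked by eye.

```python
from difflib import SequenceMatcher

THREE_TO_ONE = {
    "ALA": "A", "ARG": "R", "ASN": "N", "ASP": "D", "CYS": "C",
    "GLN": "Q", "GLU": "E", "GLY": "G", "HIS": "H", "ILE": "I",
    "LEU": "L", "LYS": "K", "MET": "M", "PHE": "F", "PRO": "P",
    "SER": "S", "THR": "T", "TRP": "W", "TYR": "Y", "VAL": "V",
}


def chain_sequences(pdb_lines):
    """Return {chain_id: one-letter sequence}, read from the CA atoms."""
    seqs = {}
    for line in pdb_lines:
        # PDB fixed columns: atom name 13-16, residue name 18-20, chain ID 22
        if line.startswith("ATOM") and line[12:16].strip() == "CA":
            seqs.setdefault(line[21], []).append(
                THREE_TO_ONE.get(line[17:20], "X"))
    return {chain: "".join(seq) for chain, seq in seqs.items()}


def match_chains(ref_seqs, model_seqs, margin=0.05):
    """Greedily map model chains to reference chains by sequence similarity.

    Returns (mapping, ambiguous): mapping is {model_chain: ref_chain} and
    ambiguous lists model chains whose two best scores differ by < margin.
    """
    score = {
        (m, r): SequenceMatcher(None, m_seq, r_seq, autojunk=False).ratio()
        for m, m_seq in model_seqs.items()
        for r, r_seq in ref_seqs.items()
    }
    mapping, used_ref = {}, set()
    # Assign the highest-scoring pairs first; each chain is used at most once.
    for (m, r), _ in sorted(score.items(), key=lambda kv: kv[1], reverse=True):
        if m not in mapping and r not in used_ref:
            mapping[m] = r
            used_ref.add(r)
    ambiguous = []
    for m in model_seqs:
        ranked = sorted((score[m, r] for r in ref_seqs), reverse=True)
        if len(ranked) > 1 and ranked[0] - ranked[1] < margin:
            ambiguous.append(m)
    return mapping, ambiguous
```

For a homodimer every model chain ends up in `ambiguous`, because sequence alone cannot tell the copies apart. This is the case where the visual check mentioned in the note is needed.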

Step 2. pdb_renum_batch.py: align and renumber pdb files to ref.pdb. Output _renum.pdb files.
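
Step 2 could be sketched roughly as follows. This is an illustration only: `residue_list`, `renumber_map` and `renumber_chain` are hypothetical names, `difflib.SequenceMatcher` stands in for a proper sequence alignment, and dropping model residues that do not align to the reference is an assumption, not a stated requirement. The sketch handles one chain at a time and rewrites the chain ID together with the residue number.

```python
from difflib import SequenceMatcher


def residue_list(pdb_lines, chain):
    """Ordered [(resSeq+iCode field, resName)] for one chain, one per residue."""
    residues = []
    for line in pdb_lines:
        # PDB fixed columns: resName 18-20, chain ID 22, resSeq 23-26, iCode 27
        if line.startswith("ATOM") and line[21] == chain:
            key = (line[22:27], line[17:20])
            if not residues or residues[-1] != key:
                residues.append(key)
    return residues


def renumber_map(ref_res, model_res):
    """Map each aligned model residue field to the reference residue field."""
    ref_names = [name for _, name in ref_res]
    model_names = [name for _, name in model_res]
    matcher = SequenceMatcher(None, model_names, ref_names, autojunk=False)
    mapping = {}
    for block in matcher.get_matching_blocks():
        for k in range(block.size):
            mapping[model_res[block.a + k][0]] = ref_res[block.b + k][0]
    return mapping


def renumber_chain(model_lines, ref_lines, model_chain, ref_chain):
    """Return model lines with one chain renamed and renumbered to the reference."""
    mapping = renumber_map(residue_list(ref_lines, ref_chain),
                           residue_list(model_lines, model_chain))
    out = []
    for line in model_lines:
        if line.startswith("ATOM") and line[21] == model_chain:
            new_field = mapping.get(line[22:27])
            if new_field is None:
                continue  # assumption: unaligned residues are dropped
            line = line[:21] + ref_chain + new_field + line[27:]
        out.append(line)
    return out
```

When two chains swap IDs (for example A to B and B to A), each chain should be processed from the original lines and the results combined afterwards. Calling `renumber_chain` repeatedly on its own output would merge the two chains.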

There are two existing solutions:

  1. https://github.com/LilySnow/PDB-matching (python + cpp)
  2. DeepRank/haddock-tools@ed9beee (python, by the haddock group)

Maybe we could use these solutions as the basis?
