Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
247 commits
Select commit Hold shift + click to select a range
c4398d7
Increase size of and font in figures
amorehead Nov 15, 2024
51a0be1
Upgrade PoseBusters, and update FASTA preparation script
amorehead Nov 16, 2024
46f06b8
Parse all cofactor inputs for all datasets
amorehead Nov 16, 2024
37ceb24
Add Dockgen support
amorehead Nov 17, 2024
81397f2
Finalize one copy extraction of primary ligands
amorehead Nov 17, 2024
353c8cf
Model ions by default
amorehead Nov 18, 2024
0833abe
Support parsing CASP15 sequences
amorehead Nov 18, 2024
5eef6cb
Switch to PyMOL-based binding site-superimposed complex alignment rou…
amorehead Nov 20, 2024
e07a312
Score primary (e.g., single) ligand(s) during inference analysis
amorehead Nov 20, 2024
53b431c
Make minor fixes
amorehead Nov 21, 2024
f58353b
Catch ValueErrors raised potentially during relaxation
amorehead Nov 22, 2024
a695036
Update mmCIF to PDB conversion script to be compatible with AF3 local…
amorehead Nov 22, 2024
455a34a
Update apo-to-holo alignment based on PyMOL algorithm
amorehead Nov 22, 2024
97d6225
Support biomolecule FASTA creation
amorehead Nov 22, 2024
0695201
Set default number of poses to sample per method to 5
amorehead Nov 23, 2024
97701a1
Standardize multi-ligand delimiter for model inputs
amorehead Nov 23, 2024
b78b1e7
Make small fixes
amorehead Nov 23, 2024
3e12cbf
Update RFAA database paths
amorehead Nov 23, 2024
5ba8ab8
Update protein sequence extraction function
amorehead Nov 23, 2024
3c2800a
Update import
amorehead Nov 23, 2024
4b295a7
Fix some inference edge cases for DiffDock
amorehead Nov 23, 2024
5ec0705
Handle for edge case in DynamicBind having missing protein outputs
amorehead Nov 23, 2024
9f28b34
Handle an edge case in relaxation
amorehead Nov 23, 2024
0692595
Fix DiffDock inference bug
amorehead Nov 23, 2024
48b28a3
Update chai-lab copy
amorehead Nov 24, 2024
33cbdf8
Finish updating Chai-1's inference code
amorehead Nov 24, 2024
f8a8b85
Update conversion and alignment scripts
amorehead Nov 24, 2024
0ff2ae1
Reenable function for ensembling only
amorehead Nov 24, 2024
a37d46e
Simplify
amorehead Nov 24, 2024
2d44890
Revert
amorehead Nov 24, 2024
57c537a
Fix order of check
amorehead Nov 25, 2024
f0d4c27
Fix CASP15 holo-aligned protein filepaths
amorehead Nov 25, 2024
57c57a9
Fix bug where relaxation could delete holo-aligned structures
amorehead Nov 25, 2024
5bb1803
Try fixing bust results output directory naming issue
amorehead Nov 25, 2024
f0a0e10
Manually reactivate PoseBench env
amorehead Nov 26, 2024
2275e51
Revert "Manually reactivate PoseBench env"
amorehead Nov 26, 2024
f8bad59
Add missing dependencies to casp15_ligand_scoring env
amorehead Nov 26, 2024
1adc70f
Catch bad mmCIFs written by Chai-1
amorehead Nov 26, 2024
1851a84
Unify binding site definitions, and update CASP15 holo-aligned struct…
amorehead Nov 27, 2024
ff4c5c2
Try to score all CASP15 targets for all methods
amorehead Nov 27, 2024
d8083c8
Standardize max_method_predictions
amorehead Nov 27, 2024
8f36ef6
Relax PoseBusters version to install it in ligand scoring env
amorehead Nov 28, 2024
8ddc54d
Subset list of CASP15 targets with which to score Chai-1, due to its …
amorehead Nov 29, 2024
f1099c7
Make CASP15 scoring more robust
amorehead Nov 30, 2024
099691c
Clean up
amorehead Nov 30, 2024
e47488d
Make resuming of inference relaxation script more robust
amorehead Dec 1, 2024
d69f175
Make molecule fragment splitting fault-tolerant
amorehead Dec 1, 2024
7867b99
Make CASP15 pre-scoring pipeline robust to Ca atom mismatches between…
amorehead Dec 3, 2024
445fac0
Timeout Vina
amorehead Dec 6, 2024
32b9348
Disable signals in Vina timeout
amorehead Dec 6, 2024
bc984b9
Timeout subprocess Vina commands
amorehead Dec 6, 2024
167a812
Check Vina return codes
amorehead Dec 6, 2024
0b8e7aa
Fix NeuralPLexer chunk size
amorehead Dec 6, 2024
dcd181f
Adjust Vina timeout
amorehead Dec 6, 2024
8b77d4e
Add FD fork
amorehead Dec 7, 2024
97eaae9
Finish integrating FD
amorehead Dec 7, 2024
98496ba
Move around code to fix FD inference error
amorehead Dec 7, 2024
6a20acf
Add chemical files
amorehead Dec 7, 2024
c574d4b
Fix variable interp for FD
amorehead Dec 7, 2024
5bc8684
Update NP CSVs
amorehead Dec 7, 2024
723e97c
Update DiffDock CSV
amorehead Dec 7, 2024
a003c2c
Update Vina documentation, and fix Vina relaxation bug
amorehead Dec 8, 2024
777b5ee
Prepare to rerun compute resource benchmarking
amorehead Dec 8, 2024
87ad39c
Make time benchmarking output directories unique
amorehead Dec 8, 2024
2c9b2ac
Update subsetting CSVs
amorehead Dec 8, 2024
115dda4
Fix dir name
amorehead Dec 8, 2024
952120e
Fix output dirs
amorehead Dec 8, 2024
2e327fd
Fix FD time benchmarking
amorehead Dec 8, 2024
e947882
Begin adding aF3
amorehead Dec 8, 2024
2b47c5d
Begin adding AF3
amorehead Dec 9, 2024
d9e8bcb
Change split path
amorehead Dec 9, 2024
f75b6b2
Update apo-to-holo and complex alignment functions for DockGen-E
amorehead Dec 9, 2024
c804ebe
Handle for PoseBusters Benchmark extraction
amorehead Dec 9, 2024
9343142
Fix AF3's output extraction using PDBFixer
amorehead Dec 10, 2024
b0f9121
Finish integrating AlphaFold 3
amorehead Dec 10, 2024
fbcbe10
Update figure
amorehead Dec 10, 2024
cab5451
Make DockGen subsetting more robust
amorehead Dec 11, 2024
ffd61ac
Fix small oversight
amorehead Dec 11, 2024
5b9812d
Update figure
amorehead Dec 11, 2024
32c99d3
Fix up DockGen parsing
amorehead Dec 11, 2024
8701c49
Fix up DockGen FASTA sequence extraction for all biomolecules
amorehead Dec 11, 2024
a8bdbd3
Fix up DockGen scoring
amorehead Dec 12, 2024
c731a8d
Add script for analyzing baseline methods' protein conformational cha…
amorehead Dec 12, 2024
2252abf
Update changelog
amorehead Dec 12, 2024
4351e99
Update CSVs and conformation analysis script
amorehead Dec 12, 2024
3e4d7a5
Update .gitignore
amorehead Dec 12, 2024
43b37f4
Update .gitignore
amorehead Dec 12, 2024
e90754a
Update RFAA output extraction for DockGen-E
amorehead Dec 13, 2024
207b6fd
Fix RFAA DockGen output extraction
amorehead Dec 13, 2024
ab207d5
Fix misplaced variable
amorehead Dec 13, 2024
bdbe817
Update ESMFold structure prediction instructions
amorehead Dec 14, 2024
031b245
Fix script name
amorehead Dec 14, 2024
2d3b6d4
Fix AF3 output extraction
amorehead Dec 14, 2024
5495c21
Update input CSVs
amorehead Dec 14, 2024
c637764
Fix file output naming
amorehead Dec 14, 2024
7e1daa8
Fix file output naming
amorehead Dec 14, 2024
7f9db86
Fork PoseCheck for PoseBench code
amorehead Dec 15, 2024
5c27431
Document function, and deprecate interaction dataframes
amorehead Dec 15, 2024
46d68f1
Refactor interaction analysis notebooks to track PDB IDs
amorehead Dec 15, 2024
dbf7e95
Add method interaction analysis notebook for each dataset separately
amorehead Dec 15, 2024
0e45306
Update mapping
amorehead Dec 15, 2024
3f05c0d
Add scripts
amorehead Dec 16, 2024
af72c00
Fix type
amorehead Dec 16, 2024
2c50a9a
Fix syntax
amorehead Dec 16, 2024
6ddd51c
Add comment
amorehead Dec 16, 2024
f786156
Catch errors in collecting baseline interactions
amorehead Dec 16, 2024
a832d8c
Fix path to PB CCD IDs file
amorehead Dec 16, 2024
fc3166d
Start adding metric
amorehead Dec 17, 2024
d9c5088
Keep developing metric
amorehead Dec 17, 2024
c53f795
Catch bad interaction entries
amorehead Dec 17, 2024
df5755a
Add script for preparing Chai-1 MSAs
amorehead Dec 17, 2024
6702aae
Make MSA prep script dataset flexible
amorehead Dec 17, 2024
1bd4732
Annotate query sequences
amorehead Dec 17, 2024
124227b
Wire in custom MSA support for Chai-1
amorehead Dec 17, 2024
1fd8ef4
Update PoseBusters plot
amorehead Dec 17, 2024
3acdd4f
Clean up figures
amorehead Dec 17, 2024
b07d072
Safeguard Chai MSA construction
amorehead Dec 17, 2024
3bfb581
Update PB inference result plotting
amorehead Dec 17, 2024
8ab777b
Remove hybrid notebook
amorehead Dec 17, 2024
840a7fe
Update interaction plots
amorehead Dec 17, 2024
5938be9
Update DockGen plots
amorehead Dec 17, 2024
aaef02b
Update CASP15 plots
amorehead Dec 17, 2024
341da74
Update Astex Diverse plots
amorehead Dec 17, 2024
f8bf3b7
Fix method name parsing
amorehead Dec 18, 2024
8cab814
Fix Chai-1 arg
amorehead Dec 18, 2024
7b84ad2
Debug with logging
amorehead Dec 18, 2024
4a078d4
Feed Chai-1 MSAs another way
amorehead Dec 18, 2024
1c4fce8
Add in Vina
amorehead Dec 18, 2024
1deebe9
Add standalone script
amorehead Dec 18, 2024
41a87a1
Run ProLIF in sequential mode
amorehead Dec 20, 2024
66f55d5
Fix bug
amorehead Dec 20, 2024
f5324a4
Update ProLIF
amorehead Dec 20, 2024
462d160
Add AF3 for CASP15 interaction analysis
amorehead Dec 20, 2024
330f02b
Make ESMFold predictor more verbose
amorehead Dec 21, 2024
a1dd5e5
Upgrade MDAnalysis and NumPy to fix segmentation fault when caching d…
amorehead Dec 21, 2024
16e26db
Update README
amorehead Dec 22, 2024
a09f1b2
Update env
amorehead Dec 22, 2024
4390d4e
Update environments
amorehead Dec 22, 2024
42b4a22
Update env
amorehead Dec 22, 2024
693ffad
Update dataset interaction plots
amorehead Dec 22, 2024
4390d54
Fix up AF3 output extraction filenames
amorehead Dec 24, 2024
65beb90
Fix alphafold3 name
amorehead Dec 24, 2024
d997c9a
Update interaction methods to analyze
amorehead Dec 28, 2024
b1ff7bc
Patch OpenFold to support more than 26 canonical chain IDs
amorehead Dec 28, 2024
d70b5ba
Streamline A2H plot names
amorehead Dec 28, 2024
a33143e
Add Chai-1 single-seq mode interaction analysis
amorehead Dec 30, 2024
9c04175
Revert NumPy to 1.24.4
amorehead Dec 30, 2024
ffbc50c
Update PLIF metric plotting
amorehead Dec 30, 2024
7bc316e
Handle invalid interactions
amorehead Dec 30, 2024
d50aa44
Rearrange notebooks
amorehead Dec 30, 2024
1dee6cc
Update plots
amorehead Dec 31, 2024
c3ed21f
Fix AF3 CASP15 output extraction
amorehead Jan 3, 2025
d1312b6
Fix AF3 CASP15 output extraction again
amorehead Jan 3, 2025
1a4fb77
Update all plots
amorehead Jan 4, 2025
9ef40a3
Clip EMD values to a max of 10 when constructing WM values
amorehead Jan 5, 2025
b8268db
Update .gitignore
amorehead Jan 9, 2025
6b51017
Update README
amorehead Jan 10, 2025
ab60553
Standardize PoseBusters package version
amorehead Jan 10, 2025
67b3c0d
Add cRMSD to plots, and update NumPy in PoseBench env
amorehead Jan 11, 2025
6aaa63c
Add initial failure mode analysis notebook
amorehead Jan 11, 2025
3a7c06c
Support DockGen for failure mode analysis
amorehead Jan 12, 2025
8343970
Analyze transferases in the PDB
amorehead Jan 12, 2025
41196dc
Update PoseBench env
amorehead Jan 12, 2025
827f51f
Further analyze AF3 performance
amorehead Jan 13, 2025
f853de9
Add sequence overlap AF3 analysis
amorehead Jan 14, 2025
88b429b
Clean up failure mode analysis
amorehead Jan 14, 2025
2c285be
Fix PLIF-WM normalization for CASP15
amorehead Jan 14, 2025
198c5cb
Adjust clipping value for PLIF-WM again
amorehead Jan 14, 2025
47a9afe
Investigate difficulty of interaction types
amorehead Jan 15, 2025
41c3e0e
Save figure
amorehead Jan 15, 2025
ac21a4c
Fix plots
amorehead Jan 15, 2025
6cbc9a6
Widen and standardize failure mode analysis
amorehead Jan 15, 2025
b5f23b3
Fix typo
amorehead Jan 16, 2025
c255b3b
Find novel protein-ligand complexes AF3 could not predict
amorehead Jan 16, 2025
8929910
Track molecule IDs during primary-ligand scoring
amorehead Jan 16, 2025
79d4ba1
Simplify failure mode analysis notebook after securing PDB IDs for al…
amorehead Jan 17, 2025
8bb2154
Fix some bugs in failure mode plotting notebook
amorehead Jan 17, 2025
9d0a502
Fix up failure mode analysis notebook
amorehead Jan 17, 2025
d5e2aca
Fix PoseBench environment again
amorehead Jan 17, 2025
ecfa0f2
Fix version of PoseBusters
amorehead Jan 18, 2025
6447c79
Document interaction and failure mode notebooks
amorehead Jan 18, 2025
f09c431
Add rescoring flag
amorehead Jan 18, 2025
ff32d9b
Differentiate bust result columns for DockGen
amorehead Jan 18, 2025
de15277
Create mol_id column more carefully
amorehead Jan 18, 2025
109d9d9
Insert mol_ids alternatively
amorehead Jan 18, 2025
057304c
Guard reference to bust_results DF
amorehead Jan 18, 2025
b049aff
Add support for directly scoring single-sequence baselines
amorehead Jan 18, 2025
6715388
Try inserting mol_ids alternatively
amorehead Jan 18, 2025
f9c216e
Add finishing touch on single-seq support
amorehead Jan 18, 2025
624dd5c
Fix parsing of path
amorehead Jan 18, 2025
feabccc
Narrow range for normalizing PLIF-WM
amorehead Jan 18, 2025
8cc8924
Track PoseBusters deposition dates
amorehead Jan 19, 2025
c5b8316
Track PoseBusters deposition dates
amorehead Jan 19, 2025
788f6bf
Filter analyses of PoseBusters results to n=130 complexes released af…
amorehead Jan 19, 2025
1a6b003
Standardize docking success criterion for failure mode analysis
amorehead Jan 20, 2025
82d7c7b
Update verbiage
amorehead Jan 21, 2025
6874bf7
Install newer version of RDKit
amorehead Jan 25, 2025
7a4b9fc
Revert "Install newer version of RDKit"
amorehead Jan 25, 2025
5145796
Support single-seq methods in complex alignment
amorehead Jan 26, 2025
cbb0874
Support single-seq methods in ensembling
amorehead Jan 26, 2025
5484677
Fix string replacement
amorehead Jan 27, 2025
db43a83
Allow rescoring of CASP15 preds
amorehead Jan 27, 2025
d64b4f7
Fix up CASP15 single-seq method support
amorehead Jan 27, 2025
98d9614
Rerun chai-1 in single-seq mode if MSA inputs cause error
amorehead Jan 28, 2025
c196691
Begin integrating NP3
amorehead Jan 28, 2025
717c73b
Add method
amorehead Jan 28, 2025
e1b0f68
Keep integrating
amorehead Jan 28, 2025
4d46aa7
Update DockGen notebook
amorehead Jan 28, 2025
117130a
Update Astex notebook
amorehead Jan 28, 2025
4d9a85f
Update CASP15 notebook
amorehead Jan 28, 2025
03b07c5
Simplify MSA naming scheme
amorehead Jan 30, 2025
633dc9f
Disable mixed MSA-singleseq mode
amorehead Jan 30, 2025
b46c1d3
Support new method
amorehead Feb 2, 2025
245962e
Timeout CASP15 method interaction analyses
amorehead Feb 2, 2025
68822ca
Report targets being analyzed
amorehead Feb 2, 2025
8ac891d
Skip problem child target for NP3 interaction analysis
amorehead Feb 2, 2025
12d51db
Timeout all analyses
amorehead Feb 2, 2025
5c8c280
Track IDs
amorehead Feb 2, 2025
0bf1a34
Fix code
amorehead Feb 2, 2025
82cc1fa
Skip another target
amorehead Feb 2, 2025
c6e2e07
Timeout without signals
amorehead Feb 2, 2025
d9e2f0f
Fix notebook
amorehead Feb 2, 2025
bbb50e8
Finish prepping notebooks
amorehead Feb 2, 2025
a1c462d
Try fixing freezing issue
amorehead Feb 2, 2025
452279e
Specify PoseCheck version
amorehead Feb 2, 2025
738108f
Fix local PoseCheck install
amorehead Feb 2, 2025
653246e
Update README
amorehead Feb 2, 2025
e7a949b
Skip specific examples
amorehead Feb 2, 2025
c629676
Skip another
amorehead Feb 2, 2025
bb21bf5
Do not skip
amorehead Feb 2, 2025
f01dc32
Update mdanalysis
amorehead Feb 2, 2025
ffaa36b
Add signal handling
amorehead Feb 2, 2025
47bbb92
Skip a specific prediction
amorehead Feb 3, 2025
80aa109
Update notebook
amorehead Feb 3, 2025
6bfb33a
Update README
amorehead Feb 3, 2025
aebb10e
Update Zenodo links
amorehead Feb 3, 2025
a70d6bd
Update docs
amorehead Feb 3, 2025
cba896f
Update .gitignore
amorehead Feb 3, 2025
be68709
Update methods
amorehead Feb 8, 2025
20b5d69
Update README
amorehead Feb 8, 2025
d2c3b70
Simplify name of RFAA
amorehead Feb 9, 2025
e0ac3b1
Fix method name
amorehead Feb 9, 2025
a3e1953
Update notebooks
amorehead Feb 9, 2025
6c858de
Update CHANGELOG
amorehead Feb 9, 2025
9dff221
Update docs
amorehead Feb 9, 2025
12c7217
Update CHANGELOG.md
amorehead Feb 9, 2025
f569076
Specify MMseqs2 version
amorehead Feb 9, 2025
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
7 changes: 6 additions & 1 deletion .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -147,6 +147,7 @@ dmypy.json
# PoseBench
configs/local/default.yaml
/*cache_dir/
/*alignment_viz/
/casp15_ligand_scoring/
/data/
/ensemble_generation_scripts/
Expand All @@ -163,6 +164,7 @@ configs/local/default.yaml

# Forks
/workdir/
/forks/alphafold3/*prediction_outputs/
/forks/chai-lab/chai-lab/
/forks/chai-lab/prediction_inputs/
/forks/chai-lab/prediction_outputs/
Expand All @@ -174,10 +176,13 @@ configs/local/default.yaml
/forks/DynamicBind/workdir/
/forks/FABind/ckpt/best_model.bin
/forks/FABind/FABind/
/forks/FlowDock/FlowDock/
/forks/FlowDock/checkpoints/
/forks/NeuralPLexer/NeuralPLexer/
/forks/NeuralPLexer/**/neuralplexermodels*
/forks/NeuralPLexer*/prediction_inputs/
/forks/P2Rank/
/forks/*/inference*/
/forks/*/*inference*/
/forks/RoseTTAFold-All-Atom/blast-2.2.26
/forks/RoseTTAFold-All-Atom/rf2aa/config/inference/*_rfaa_inference.yaml
/forks/RoseTTAFold-All-Atom/csblast-2.2.3
Expand Down
32 changes: 16 additions & 16 deletions .pre-commit-config.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -5,7 +5,7 @@ exclude: "^forks/"

repos:
- repo: https://github.com/pre-commit/pre-commit-hooks
rev: v4.4.0
rev: v5.0.0
hooks:
# list of supported hooks: https://pre-commit.com/hooks.html
- id: trailing-whitespace
Expand All @@ -18,32 +18,32 @@ repos:
- id: check-toml
- id: check-case-conflict
- id: check-added-large-files
args: ["--maxkb=15000"]
args: ["--maxkb=40000"]

# python code formatting
- repo: https://github.com/psf/black
rev: 23.1.0
rev: 24.10.0
hooks:
- id: black
args: [--line-length, "99"]

# python import sorting
- repo: https://github.com/PyCQA/isort
rev: 5.12.0
rev: 5.13.2
hooks:
- id: isort
args: ["--profile", "black", "--filter-files"]

# python upgrading syntax to newer version
- repo: https://github.com/asottile/pyupgrade
rev: v3.3.1
rev: v3.19.1
hooks:
- id: pyupgrade
args: [--py38-plus]

# python docstring formatting
- repo: https://github.com/myint/docformatter
rev: v1.7.4
rev: v1.7.5
hooks:
- id: docformatter
args:
Expand All @@ -57,7 +57,7 @@ repos:

# python docstring coverage checking
- repo: https://github.com/econchick/interrogate
rev: 1.5.0 # or master if you're bold
rev: 1.7.0 # or master if you're bold
hooks:
- id: interrogate
args:
Expand All @@ -74,7 +74,7 @@ repos:

# python check (PEP8), programming errors and code complexity
- repo: https://github.com/PyCQA/flake8
rev: 6.0.0
rev: 7.1.1
hooks:
- id: flake8
args:
Expand All @@ -88,28 +88,28 @@ repos:

# python security linter
- repo: https://github.com/PyCQA/bandit
rev: "1.7.5"
rev: "1.8.0"
hooks:
- id: bandit
args: ["-s", "B101"]

# yaml formatting
- repo: https://github.com/pre-commit/mirrors-prettier
rev: v3.0.0-alpha.6
rev: v4.0.0-alpha.8
hooks:
- id: prettier
types: [yaml]
exclude: "environment.yaml"

# shell scripts linter
- repo: https://github.com/shellcheck-py/shellcheck-py
rev: v0.9.0.2
rev: v0.10.0.1
hooks:
- id: shellcheck

# md formatting
- repo: https://github.com/executablebooks/mdformat
rev: 0.7.16
rev: 0.7.21
hooks:
- id: mdformat
args: ["--number"]
Expand All @@ -122,22 +122,22 @@ repos:

# word spelling linter
- repo: https://github.com/codespell-project/codespell
rev: v2.2.4
rev: v2.3.0
hooks:
- id: codespell
args:
- --skip=logs/**,data/**,*.ipynb,posebench/utils/data_utils.py,posebench/utils/residue_utils.py,posebench/data/components/protein_fasta_preparation.py,posebench/models/minimize_energy.py,posebench/data/components/create_casp15_ensemble_input_csv.py,posebench/analysis/casp15_ligand_scoring/casp_parser.py
- --skip=logs/**,data/**,*.ipynb,posebench/utils/data_utils.py,posebench/utils/residue_utils.py,posebench/data/components/fasta_preparation.py,posebench/models/minimize_energy.py,posebench/data/components/create_casp15_ensemble_input_csv.py,posebench/analysis/casp15_ligand_scoring/casp_parser.py,*Components-smiles-stereo-oe.smi,notebooks/pdb_reports/transferase/*
# - --ignore-words-list=abc,def

# jupyter notebook cell output clearing
- repo: https://github.com/kynan/nbstripout
rev: 0.6.1
rev: 0.8.1
hooks:
- id: nbstripout

# jupyter notebook linting
- repo: https://github.com/nbQA-dev/nbQA
rev: 1.6.3
rev: 1.9.1
hooks:
- id: nbqa-black
args: ["--line-length=99"]
Expand Down
26 changes: 26 additions & 0 deletions CHANGELOG.md
Original file line number Diff line number Diff line change
@@ -1,3 +1,29 @@
### 0.6.0 - 02/09/2025

**Additions**:

- Added new baseline methods (AlphaFold 3, Chai-1 with multiple sequence alignments (MSAs))
- Added new binding site-focused implementation of `complex_alignment.py` based on PyMOL's `align` command, which in many cases yields 3x better docking evaluation scores for baseline methods
- Added new script for analyzing baseline methods' protein conformational changes w.r.t. input (e.g., AlphaFold) protein structures and the corresponding reference (crystal) protein structures
- Added the new centroid RMSD and **PLIF-EMD/WM** metrics (n.b., see new arXiv preprint for more details)
- Added a failure mode analysis notebook (n.b., see new arXiv preprint for more details)

**Changes**:

- Introducing **DockGen-E**, a new version of the DockGen benchmark dataset featuring enhanced biomolecular context for docking and co-folding predictions - namely, now all DockGen complexes represent the first (biologically relevant) bioassembly of the corresponding PDB structure
- For the single-ligand datasets (i.e., Astex Diverse, PoseBusters Benchmark, and DockGen), now providing each baseline method with primary *and cofactor* ligand SMILES strings for prediction, to enhance the biomolecular context of these methods' predicted structures - as a result, for these single-ligand datasets, now the predicted ligand *most similar* to the primary ligand (in terms of both Tanimoto and structural similarity) is selected for scoring (which adds an additional layer of challenges for baseline methods)
- Updated Chai-1's inference code to commit `44375d5d4ea44c0b5b7204519e63f40b063e4a7c`, and ran it also with standardized (paired) MSAs
- Replaced all AlphaFold 3 server predictions of each dataset's protein structures with predictions from AlphaFold 3's local inference code

**Deprecations**:

- Pocket-only benchmarking has been deprecated

**Results**:

- With all the above changed in place, simplified, re-ran, and re-analyzed all baseline methods for each benchmark dataset, and updated the baseline predictions and datasets (now containing standardized MSAs) hosted on Zenodo
- **NOTE**: The updated arXiv preprint should be publicly available by 02/12/2025

### 0.5.0 - 09/30/2024

- Added results with AlphaFold 3 predicted structures (now the default)
Expand Down
Loading
Loading