Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Problem with creating GreenBlobList #7

Open
maikemuecke opened this issue Nov 19, 2024 · 3 comments
Open

Problem with creating GreenBlobList #7

maikemuecke opened this issue Nov 19, 2024 · 3 comments

Comments

@maikemuecke
Copy link

Hi, I have been using your package since roughtly December 2022 to check for unassigned Density between Protein Residues.
Since the release of v2.7.1 however I have been having issues with this. Density Peaks that are present in the ccp4 files (i check through coot) don't appear anymore in when I create the GreenBlobList and parse it's centroids.
One example where this problem occurs is the protein 3K9D. Between the sulfur (SG) of the cystein reside 359 & the nitrogen (NZ) of the Lysine residue 392 (both in chain A) is an apparent green density blob. With version 2.6.1 I was able to locate the blob through the following code:

analyzer=pdb_eda.densityAnalysis.fromPDBid('3k9d')
S_chain='A'
S_num=359
chain='A'
resnum=392
S_atom='SG'
N_atom='NZ'
S_coords=analyzer.biopdbObj[0][S_chain][int(S_num)][S_atom].get_coord()
print('S coords='+str(S_coords))
N_coords=analyzer.biopdbObj[0][chain][int(resnum)][N_atom].get_coord()
print('N coords='+str(N_coords))
greenBlobList = analyzer.greenBlobList
greenBlob_xyz=[i.coordCenter for i in greenBlobList]
greenBlob_xyz=np.vstack(greenBlob_xyz)
print(greenBlob_xyz.shape)
mid_point=(S_coords+N_coords)/2

if np.sum([np.linalg.norm(greenBlob_xyz-mid_point,axis=1) <=2])>0:
    idx_clossest_gree_blob=np.argmin(np.linalg.norm(greenBlob_xyz-mid_point,axis=1))
    print(greenBlobList[idx_clossest_gree_blob].volume)

In the code i simply check if within a 2 Angstrom radius of the midpoint between the SG & NZ atoms a centroid from the greenBlobList is located.
When I use version2.7.1 the greenBlobList does not contain the unassigned density peak at all. Generally the list does not contain any unassigned density peaks with negative xyz coordinates, even though part of the protein has negative structure coordinates.
I saw that from v2.6.1 to 2.7.1 the only change was the server from which the pdb files are downloaded, so I am very surprised at the issues I am facing. I was wondering if you could advise me what is going wrong.
Thank you!

@hunter-moseley
Copy link
Member

Give me a few days. Working on a grant proposal submission. But I will look at this.

@hunter-moseley
Copy link
Member

hunter-moseley commented Nov 23, 2024

Are the comparisons between version results separated by time and/or being performed in separate directories?

The only rationale I have is that the files downloaded from wwPDB have changed.
The pdb_eda.densityAnalysis.fromPDBid function will pull down the files if they are not present in the ./ccp4_data and ./pdb_data subdirectories.

If there are separate subdirectories for producing the 2.6.1 and 2.7.1 results, you should be able to diff the ccp4 and .ent.gz files to see if they are different.

@maikemuecke
Copy link
Author

maikemuecke commented Nov 29, 2024 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants