-
Notifications
You must be signed in to change notification settings - Fork 36
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Assistance Requested for Replicating CLIP Score Calculations from Your Paper #35
Comments
Dear Jiahui: |
Dear Author, Could you please provide additional insights or possibly a more detailed version of the code used in your study? Any additional parameters, configurations, or preprocessing steps that might be crucial for achieving the reported scores would be immensely helpful. Could you please provide an explanation for this scaling method and confirm whether the values in the paper should be interpreted as scaled or unscaled? Your clarification on this matter would be invaluable for ensuring the accuracy and integrity of the research based on your work. |
Hello Jiahui, |
Dear Author,
I hope this message finds you well.
Firstly, I would like to extend my sincere compliments on your remarkable work. It has greatly assisted us in our research endeavors. However, I have encountered some challenges regarding the computational method used for the values in Table 1, specifically titled "Quantitative comparisons on CLIP [55] similarity with other methods."
In my attempt to replicate the results for the GaussianDreamer using CLIP, I was unable to achieve the reported score of 27.23 ± 0.06, 41.88 ± 0.04 as presented in your paper. My approach involved generating 10 random images based on the camera angles described in your paper, post which I utilized ViT-L/14 and ViT-bigG-14 models to compute the CLIP scores. I successfully generated results for 411 out of 415 prompts provided in the Dreamfusion project for this computation.
The outcomes of my calculations are illustrated in the attached image.
Could you kindly offer any guidance or share the specific code used for computing the CLIP scores as per your study? It would be incredibly helpful in understanding how to replicate the results you have achieved in your paper.
Thank you very much for your time and consideration. I am looking forward to your valuable response.
Best regards.
The text was updated successfully, but these errors were encountered: