Is GumTree suitable for software plagiarism detection? #291
namasikanam
started this conversation in
Ideas
Replies: 1 comment 2 replies
-
Hi! I think it could be possible! Maybe it would require customizing the weight of some actions (for instance low weight for moving code around, low weight for renaming, high weight for adding or deleting code). It would be a fun experiment to assess how it compares to some baseline. Cheers! |
Beta Was this translation helpful? Give feedback.
2 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I am curious about the effectivenss of GumTree for the task of software plagiarism detection. It seems a natural idea to use the length of edit scripts or Jaccard similarity of mappings between two ASTs, so as to compute a similarity score between two source code files.
Update (2022-09-17): I will try to spare time to find a way to thoroughly and fairly evaluate it, if anyone is also interested to this, please contact me!
Beta Was this translation helpful? Give feedback.
All reactions