What should be the strategy of using automatic system to detect code plagiarism?

Revision en4, by hieu_2004, 2020-12-31 07:37:01

Recently, I have seen a lot of blogs talking about the issues of cheaters. Therefore, I am currently thinking about using automatic system to catch them.

Currently, the most well-known automatic system for assisting of detecting plagiarism is MOSS(from Stanford). At first, I asked myself, why did not Codeforces use them? However, I look at the number of participants of each contests; it turns out that the count is approximately under 30000. So, we have to compare $4.5*10^8$ pairs of source code!

Assuming that the system can check $10^4$ pairs per second, we will need $45000$ seconds, which is just more than half a day, the same length as hacking procedure of Educational Rounds. But I believe that limit is much lower (I have not used it).

Is there any assistance like that could run that fast, if not MOSS? Is there any solutions that can drop the complexity of $O(n^2 * t)$? (assuming $t$ is the time for comparing a pair of code)

#### History

Revisions

Rev. Lang. By When Δ Comment
en4 hieu_2004 2020-12-31 07:37:01 0 (published)
en3 hieu_2004 2020-12-31 07:36:11 186
en2 hieu_2004 2020-12-31 07:33:22 408
en1 hieu_2004 2020-12-31 07:24:47 474 Initial revision (saved to drafts)