What should be the strategy of using automatic system to detect code plagiarism?

→ Обратите внимание

До соревнования
Codeforces Round 940 (Div. 2) and CodeCraft-23
3 дня
Зарегистрироваться »

*есть доп. регистрация

→ Лидеры (рейтинг)

№	Пользователь	Рейтинг
1	ecnerwala	3648
2	Benq	3580
3	orzdevinwang	3570
4	cnnfls_csy	3569
5	Geothermal	3568
6	tourist	3565
7	maroonrk	3530
8	Radewoosh	3520
9	Um_nik	3481
10	jiangly	3467

Страны | Города | Организации

Всё →

→ Лидеры (вклад)

№	Пользователь	Вклад
1	maomao90	174
2	adamant	164
2	awoo	164
4	TheScrasse	160
5	nor	159
6	maroonrk	156
7	-is-this-fft-	150
8	SecondThread	147
9	orz	146
10	pajenegod	145

Всё →

→ Найти пользователя

→ Прямой эфир

Детальнее →

Блог пользователя hieu_2004

What should be the strategy of using automatic system to detect code plagiarism?

Автор hieu_2004, история, 3 года назад, По-английски

Recently, I have seen a lot of blogs talking about the issues of cheaters. Therefore, I am currently thinking about using automatic system to catch them.

Currently, the most well-known automatic system for assisting of detecting plagiarism is MOSS(from Stanford). At first, I asked myself, why did not Codeforces use them? However, I look at the number of participants of each contests; it turns out that the count is approximately under 30000. So, we have to compare $$$4.5*10^8$$$ pairs of source code!

Assuming that the system can check $$$10^4$$$ pairs per second, we will need $$$45000$$$ seconds, which is just more than half a day, the same length as hacking procedure of Educational Rounds. But I believe that limit is much lower (I have not used it).

Is there any assistance like that could run that fast, if not MOSS? Is there any solutions that can drop the complexity of $$$O(n^2 * t)$$$? (assuming $$$t$$$ is the time for comparing a pair of code)

hieu_2004
3 года назад
2

Комментарии (1)

Показать архивные | Написать комментарий?

navneet.h

3 года назад, # |

← Rev. 2 →

we can do the same thing on any random contest from any two month period, where we will decrease cutoff of similarity, so that more persons could get caught.

technically we can make relation tree of variables, like now you can do some automaton or suffix sorting kind of thing to make smaller groups, by neglecting those pairs which will definitely differ in code perspective.

also, i guess moss is system only for text based comparison, does it also compares machine level code??

→ Ответить

Соревнования по программированию 2.0

Время на сервере: 18.04.2024 22:20:51 (j3).

Десктопная версия, переключиться на мобильную.

При поддержке