We need more lower rated testers!

→ Обратите внимание

До соревнования
Educational Codeforces Round 166 (Rated for Div. 2)
35:39:06
Зарегистрироваться »

→ Трансляции

CodeChef Starters 136 Solution Discussion

aryanc403

До начала 14:04:06

CF Edu Round 166 Solution Discussion

aryanc403

До начала 37:44:06

Всё →

→ Лидеры (рейтинг)

№	Пользователь	Рейтинг
1	tourist	3757
2	jiangly	3647
3	Benq	3581
4	orzdevinwang	3570
5	Geothermal	3569
5	cnnfls_csy	3569
7	Radewoosh	3509
8	ecnerwala	3486
9	jqdai0815	3474
10	gyh20	3447

Страны | Города | Организации

Всё →

→ Лидеры (вклад)

№	Пользователь	Вклад
1	maomao90	171
2	adamant	164
3	awoo	163
4	TheScrasse	159
5	nor	155
6	maroonrk	154
7	-is-this-fft-	152
8	Petr	147
9	orz	146
10	pajenegod	145

Всё →

→ Найти пользователя

→ Прямой эфир

Детальнее →

Блог пользователя giorgosgiapis

We need more lower rated testers!

Автор giorgosgiapis, история, 2 года назад, По-английски

Perhaps this is an unpopular opinion but I think we need more testers that are on the lower side of the rating spectrum. Having only red/orange testers for Div 2 contests might (and does) result in underestimation of the difficulty of the proposed problems. Blue/cyan or even green/gray form a more representative sample for the actual contestants. I understand that there should be a few highly rated and experienced testers but having only them can result in speedforces rounds for Div 2 participants.

div.2, opinion

+162

giorgosgiapis
2 года назад
19

Комментарии (16)

Показать архивные | Написать комментарий?

Helal_Salloum

2 года назад, # |

I agree

→ Ответить

oursaco

2 года назад, # |

yes div 2s are way too hard

→ Ответить

Wind_Eagle

2 года назад, # |

← Rev. 2 →

OK, what's about Round 745? There were a plenty of expert / specialist testers, but the round was very unbalanced (sorry problemsetters).

→ Ответить

giorgosgiapis

2 года назад, # ^ |

I'm not saying blue/cyan testers will eliminate all unbalanced rounds but I think it can help.

→ Ответить

Wind_Eagle

2 года назад, # ^ |

+27

In my experience, blue/cyan testers can evaluate problem's difficulty not better than orange/red.

→ Ответить

pigmike

2 года назад, # ^ |

+15

That's nonsense. Problems that are easy for an orange/red may be way more difficult for a blue/cyan, so I don't understand why you think they will give the same evaluation. Why would I say a problem is easy if I'm unable to solve it during testing?

→ Ответить

Wind_Eagle

2 года назад, # ^ |

For example, I am able to solve D2A D2B D2C D2D. Do you say that for me they have the same difficulty?

→ Ответить

giorgosgiapis

2 года назад, # ^ |

You missed the point. When someone is highly rated then he might not be able to estimate the difficulty gap between problems and that's quite reasonable tbh.

→ Ответить

Wind_Eagle

2 года назад, # ^ |

← Rev. 2 →

Hmm... There is something. I remember preparing my 741 round. Before testing I was absolutely sure that task C was 800-900 rated (it scored 1500 at Codeforces).

But this is only the half of the truth. I've seen 1800-1900 rated problems that are hard for me, and 2900 problems that are pretty easy for me. To balance round one need many testers, no matter how they are rated.

This is because each person at each rating can understand difficulty gap (if they solved the problem), But everyone (except tourist) has weak and strong sides. You just need many testers to have combined view.

→ Ответить

giorgosgiapis

2 года назад, # ^ |

Agreed. Many testers from different rating ranges is probably the way to go. The problem is that, unlike coordinators, the number of testers, as well as who those testers are going to be, depends on the problem setter.

→ Ответить

Wind_Eagle

2 года назад, # ^ |

Sad truth. Maybe this is so: we should have a testers of each colour :)

→ Ответить

pigmike

2 года назад, # ^ |

Maybe idk how testing works, but I believe if there were more cyan/blue testers for today's round most of them won't be able to solve Div2D, and any sensible author would consider that a red flag, since most of the target audience might not be able to solve that problem. I can't say the same for orange/red testers maybe it was an easy or okay-ish problem to them, and they solved it during testing so the setter felt the problem was okay.

→ Ответить

Olympia

2 года назад, # |

I agree...I think ideally there should be 1 tester of each rating color...something like Codeforces Round #736 (authored by Agnimandur): which had 32 testers of each color range. Having a bunch of testers in different rating ranges does not guarantee a balanced round, but it certainly makes a round more balanced imo...

→ Ответить

lemongrab

2 года назад, # |

Please allow me to be a tester

→ Ответить

mukund007

2 года назад, # |

-8

Huh? Then some random testers would start selling solutions. Eventually cheaters per round would increase. End of the world incoming with this kind of decision.

→ Ответить

naman1601

2 года назад, # ^ |

+15

That's not how testing works. Testers are (almost always) people whom the authors know (fairly well) and trust, and are experienced participants who would (hopefully) not only never intentionally leak problems/solutions, but would also be careful about unintentionally leaking anything.

→ Ответить

Соревнования по программированию 2.0

Время на сервере: 29.05.2024 05:55:54 (k3).

Десктопная версия, переключиться на мобильную.

При поддержке