Some IOI 2022 Statistics

Hello CF! Following IOI 2022, I did some analysis of the scoreboard. Here are my findings.

Country Standings

This has already been posted on Codeforces here, but mine is slightly different as it uses the average score instead of the total (hence countries with fewer than 4 participants are ranked higher in mine than in a total-based ranking).

Top 5:

Rank	Country	Average Score
1	CHN	577.50
2	USA	477.14
3	JPN	467.77
4	KOR	406.11
5	CAN	376.01

Full Country Standings

Rank	Country	Average Score
1	CHN	577.50
2	USA	477.14
3	JPN	467.77
4	KOR	406.11
5	CAN	376.01
6	TWN	375.51
7	UKR	375.07
8	IRN	361.89
9	ROU	346.69
10	IOI	342.09
11	HRV	336.90
12	POL	333.36
13	SGP	322.23
14	ISR	321.50
15	AUS	321.30
16	VNM	317.43
17	BGR	274.68
18	TUR	273.28
19	BRA	263.23
20	IDN	259.53
21	IND	256.07
22	KAZ	233.96
23	MYS	226.68
24	ITA	223.44
25	HKG	218.65
26	MAC	214.24
27	BGD	210.65
28	MKD	209.07
29	SRB	206.14
30	DEU	202.77
31	FRA	194.51
32	GEO	194.37
33	SVK	190.62
34	HUN	190.34
35	GBR	189.86
36	KGZ	183.08
37	MNE	178.00
38	NLD	165.98
39	SVN	164.19
40	PHL	156.90
41	CZE	142.15
42	BEL	141.41
43	ARM	141.30
44	MEX	140.72
45	DNK	140.29
46	THA	140.13
47	CUB	139.67
48	LVA	138.81
49	CHE	137.06
50	MNG	135.92
51	CYP	134.69
52	LTU	133.44
53	EGY	129.99
54	SYR	121.70
55	ESP	118.87
56	NZL	114.83
57	BIH	112.07
58	FIN	108.18
59	SWE	107.40
60	EST	107.38
61	MDA	105.28
62	AUT	102.71
63	MAR	102.69
64	TUN	100.88
65	TJK	97.60
66	NOR	92.40
67	SAU	91.67
68	ISL	88.17
69	PSE	85.75
70	AZE	83.25
71	PER	79.34
72	PRT	77.97
73	ZAF	77.13
74	UZB	71.85
75	ARG	61.00
76	SLV	60.19
77	DOM	59.75
78	IRL	55.60
79	TKM	53.50
80	GRC	52.75
81	VEN	37.94
82	CHL	35.13
83	LKA	35.03
84	BOL	33.44
85	NGA	26.31
86	LUX	23.00
87	JOR	18.63
88	COL	15.88
89	ECU	0.00
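
A minimal sketch of how ranking by average and ranking by total can disagree when teams have different sizes. The country codes and scores below are made up for illustration and are not taken from the scoreboard.

```python
from collections import defaultdict

# Hypothetical (country, score) rows; NOT real IOI 2022 data.
rows = [
    ("AAA", 300.0), ("AAA", 250.0), ("AAA", 200.0), ("AAA", 150.0),
    ("BBB", 400.0), ("BBB", 350.0),  # a team with only 2 contestants
]

def country_rankings(rows):
    """Return (countries ordered by total, countries ordered by average)."""
    scores = defaultdict(list)
    for country, score in rows:
        scores[country].append(score)
    totals = {c: sum(s) for c, s in scores.items()}
    averages = {c: sum(s) / len(s) for c, s in scores.items()}
    by_total = sorted(totals, key=totals.get, reverse=True)
    by_average = sorted(averages, key=averages.get, reverse=True)
    return by_total, by_average

by_total, by_average = country_rankings(rows)
print(by_total)
print(by_average)
```

Here the smaller team comes first by average but second by total, which is the effect described for countries with fewer than 4 participants.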

Problem Score Distributions

Most people do better on some tasks than on others. These graphs can answer questions such as:

What medal would I get if the score was entirely based on one task?
Which was the easiest task to get x pts on?
How many people got >x pts on each task?

(You can see a higher resolution version here.)

The vertical coloured lines represent the "cutoff" score for each medal on each task, i.e. the bronze line corresponds to the median score, the silver line to the 75th percentile, and the gold line to the 91.67th percentile.
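
As a rough sketch, per-task cutoffs of this kind can be computed by taking the lowest score inside the top 1/2, 1/4 and 1/12 of the field. The function name, the rounding convention and the scores are assumptions for illustration; the plots may have used a different percentile definition.

```python
def medal_cutoffs(scores):
    """Lowest score within the top 1/12 (gold), 1/4 (silver), 1/2 (bronze).

    Assumed convention: the top n // d contestants (at least 1) qualify.
    """
    ranked = sorted(scores, reverse=True)
    n = len(ranked)
    denominators = {"gold": 12, "silver": 4, "bronze": 2}
    return {
        medal: ranked[max(1, n // d) - 1]
        for medal, d in denominators.items()
    }

# Hypothetical scores of 12 contestants on a single task.
task_scores = [100, 90, 80, 70, 60, 50, 40, 30, 20, 10, 5, 0]
cutoffs = medal_cutoffs(task_scores)
print(cutoffs)
```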

Country Score Dispersion

Following day 2, some people found it a bit "sus" that some online contestants from the same country had similar scores. This graph shows the distribution of the coefficient of variation of the scores of each country's contestants, i.e. for each country:

Take the scores of the country's contestants (usually 4).
Find their standard deviation and mean.
Find the coefficient of variation (standard deviation / mean).

Then plot the distribution of these coefficients over all countries, separately for online and onsite ones.
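
The steps above can be sketched as follows. This assumes the population standard deviation (`statistics.pstdev`); the post does not say which variant was used, and the team data here is invented.

```python
from statistics import mean, pstdev

def coefficient_of_variation(scores):
    """Standard deviation divided by mean; None if the mean is zero."""
    m = mean(scores)
    if m == 0:
        return None  # undefined, e.g. a team whose scores are all zero
    return pstdev(scores) / m

# Hypothetical teams; NOT real IOI 2022 scores.
teams = {
    "AAA": [300, 300, 300, 300],  # identical scores: no dispersion
    "BBB": [100, 200, 300, 400],
    "CCC": [0, 0, 0, 0],
}
cv = {country: coefficient_of_variation(s) for country, s in teams.items()}
print(cv)
```

A team with identical scores gets a coefficient of 0, so suspiciously similar scores show up as values close to 0.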

Note: the distribution has been estimated using a Gaussian KDE, which is why the estimated density is >0 for values of x < 0. The actual smallest coefficients are 0.0287 for onsite participants (JPN) and 0.0450 for online participants (CHN).
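
To see why a Gaussian KDE leaks below zero, here is a small hand-rolled sketch. The sample values and the bandwidth of 0.1 are assumptions chosen for illustration, not the settings used for the plot.

```python
import math

def gaussian_kde(samples, bandwidth):
    """Return a density function: the average of Gaussians centred on each sample."""
    n = len(samples)
    norm = 1.0 / (n * bandwidth * math.sqrt(2.0 * math.pi))

    def density(x):
        return norm * sum(
            math.exp(-0.5 * ((x - s) / bandwidth) ** 2) for s in samples
        )

    return density

# Hypothetical coefficients of variation, all non-negative.
cvs = [0.05, 0.10, 0.20, 0.35, 0.50]
density = gaussian_kde(cvs, bandwidth=0.1)

# Each Gaussian bump extends on both sides of its sample, so the estimate
# is positive at x < 0 even though no coefficient can be negative.
print(density(-0.05) > 0)
```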

Note that although the average variation is lower for online participants, their sample size is quite small (n=17).

Sources

These were generated using data from the scoreboard, available at https://ranking.ioi2022.id.

The lists of online and onsite participants were taken from https://status.ioi2022.id.

Comments (7)

kozliklekarsky

23 months ago, # |

didn't even add r*ssia after the "outrage". nice play

chromate00

23 months ago, # ^ |

"r*ssia" that made me laugh

2147483648

-11

A play of a true Nazi.

wait, actually, ru**ia is in the list. the thing is that they're just labeled as IOI.

-14

but it's not the r*ssia we all know and-- mainly because there is also belarus with which no one really has a problem with but they do because they are accessories to whatever this is?? unfortunately I can't fully comprehend the entire political situation and its implications so let's just comment about russia

nor

+16

Great work on the plots, they're pretty informative. This reminds me of a similar (but very brief) analysis here.

However, I feel like there's an issue with the final plot. Firstly, it doesn't make sense to take a coefficient of variation for just 4 data points (I understand you wanted to use it as a proxy for how suspicious the scores are). Even if it did, estimating it using a Gaussian kernel density estimator is quite overkill, and I don't understand why anyone would prefer it over using a histogram (which is much simpler and more accurate to the original situation) when you have a decent number of datapoints (countries) to plot a histogram for.

cfalas

I just thought that a smooth line would show a "clearer" trend, but didn't give much thought to it tbh (this is what happens when you make random graphs in an airport when going home from IOI).

This is how it looks with a histogram.

Graph

Do you suggest using something else other than coefficient of variation as a metric? I get what you are saying, but this is true for any measure of dispersion if I understand correctly.

cfalas's blog
