Blog entries - Codeforces

z4120's blog

Auto-translated Chinese national IOI training team report papers

By z4120, history, 3 years ago, In English

These are the Chinese national IOI training team report papers, translated into English using several computer tools.

I find this to be a much better way to read these PDF papers than Google Translate or Foxit Reader Translate (despite the limitations -- see below), so I think it may be useful to other people too.

Original papers download:

Auto-translated papers download:

https://nd.nl.tab.digital/s/GqoiQ5b8tpJFrXD

I've only translated some topics, but I will upload more in the future.

Update:

A better method is now used to translate the PDFs (some PDFs have been uploaded).

Details

DeepL is used. Google Translate doesn't work well for Chinese sentences longer than 70 characters, for example this one:

此处我们规定球冠不能大于半球，并不是说大的球冠无法被处理，而是因为在提出布尔运算后，我们可以使用较小球冠的补集来表示超过半球的球冠，从而无需讨论这种情况。作此规定可以为几何上的处理提供方便。
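If a translator handles long sentences badly, one possible workaround is to cut each sentence at clause punctuation so that every piece stays under the limit. The sketch below is only an illustration, not the method used for these papers: the names `split_clauses` and `chunk_sentence` are hypothetical, and the limit of 70 is taken from the observation above rather than from any documented threshold.

```python
import re

MAX_LEN = 70  # assumed limit, taken from the observation in this post


def split_clauses(sentence: str) -> list[str]:
    """Split right after Chinese clause punctuation, keeping the punctuation."""
    parts = re.split(r"(?<=[，。；：！？])", sentence)
    return [p for p in parts if p]


def chunk_sentence(sentence: str, max_len: int = MAX_LEN) -> list[str]:
    """Greedily pack consecutive clauses into chunks of at most max_len characters.

    A single clause longer than max_len is left as its own (oversized) chunk.
    """
    chunks: list[str] = []
    current = ""
    for clause in split_clauses(sentence):
        if current and len(current) + len(clause) > max_len:
            chunks.append(current)
            current = clause
        else:
            current += clause
    if current:
        chunks.append(current)
    return chunks


if __name__ == "__main__":
    sample = "此处我们规定球冠不能大于半球，并不是说大的球冠无法被处理，而是因为在提出布尔运算后，我们可以使用较小球冠的补集来表示超过半球的球冠，从而无需讨论这种情况。作此规定可以为几何上的处理提供方便。"
    for piece in chunk_sentence(sample):
        print(len(piece), piece)
```

Splitting in the middle of a sentence can cost the translator some context, so this is a trade-off rather than a fix.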

I found some more tools to do similar tasks:

Not the same task, but it also uses DeepL to translate a particular file format (pptx):

https://towardsdatascience.com/using-selenium-and-deepl-to-automate-the-translation-of-power-point-files-3c01f81f113

I uploaded the method and programs used to translate those PDFs (previously, the general method was already explained in the remark section).
I uploaded all the translated LaTeX sources and compiled PDF files for 2020 that I've done so far. Currently this is only topic 1, at 2020/1-translated.pdf (note that the PDF preview feature on the site might not work!). However, the source code, programs and methods are all available, so anyone can translate and upload more.
There are some commercial tools for converting PDF back to LaTeX source (Mathpix, for example); however, they must be paid for, and I don't know what the quality would be for Chinese. It would be easier to get the source code.

I'll update all the files if I find some better way to translate those.

I could not find any existing post that does the same thing, despite many blog posts requesting it: 1 2 3. I find it really hard to copy and paste each line into a translator program (or select each line), and translating the whole thing with Google Translate (or similar) removes the figures and formulas, so the side-by-side comparison was helpful.

Issues/possible improvements/contributions:

It's really hard to find a good site/program to translate PDF files. Does anyone know one better than this one?
The one I'm using sometimes fails badly, stretching or shrinking the text. Sample page (low resolution version). However, it's still better than the alternatives (Foxit Reader Translate, Google document translate), which require highlighting or copying each sentence, scrolling two windows in parallel, and/or overflow the page width so that horizontal scrolling is required.
I suppose that the original Chinese characters are still preserved inside the PDF; however direct copy and paste results in corrupted data.
If anyone can figure out how to extract the Chinese characters without OCR, that would improve the translation quality (because currently the OCR is not perfect, and there are some errors).
(some metadata in a PDF shows that it was made with Microsoft Word 2013 and/or Acrobat 11.0.0)
The images, math formulas and pseudo code listings are not preserved.
This is a limitation of the ABBYY OCR tool. Although it can be fixed manually, I'm not going to do that.
You can also write blog posts explaining the techniques (usually in English; however, Chinese HTML is still easier to translate than Chinese PDF).
Or find existing content (in English) that describes those techniques.
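For anyone experimenting with extracting the Chinese characters without OCR, a quick sanity check on the extracted text can tell usable output from corrupted output before it is sent to a translator. The sketch below is a rough heuristic, not part of the method used here: the function names and the 0.5 threshold are assumptions, and it only counts characters in the basic CJK Unified Ideographs block (U+4E00 to U+9FFF).

```python
def cjk_ratio(text: str) -> float:
    """Fraction of non-whitespace characters that are basic CJK ideographs."""
    chars = [c for c in text if not c.isspace()]
    if not chars:
        return 0.0
    cjk = sum(1 for c in chars if "\u4e00" <= c <= "\u9fff")
    return cjk / len(chars)


def looks_corrupted(text: str, threshold: float = 0.5) -> bool:
    """Guess that extraction failed if too few characters are CJK.

    The threshold is an arbitrary assumption; pages that are mostly
    formulas or code would need a lower value.
    """
    return cjk_ratio(text) < threshold


if __name__ == "__main__":
    print(looks_corrupted("此处我们规定球冠不能大于半球"))
    print(looks_corrupted("\x01\x02\x03 abc"))
```

If most pages of a paper pass this check, the text layer is probably usable and the OCR step could be skipped for that paper.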

Single element modification + find nearest previous element smaller than value

Lazy propagation + find nearest previous element smaller than value

2D segment tree

Segment tree with correct node order