Марафонский раунд Яндекс.Алгоритм 2017

7 лет назад, # |

+18

Java на Яндекс.Контесте похоже (опять) сломана. То, что локально работает 0.01-0.2 секунды, там занимает 0.1-1.1 секунды на Java 7 32 (и какой-то infinity на Java 8). В задачах подобных этой — такое критично.

→ Ответить

7 лет назад, # ^ |

+23

Ответа до конца контеста так и нет... А между тем, 2 часа "увлекательного" переписывания сабмита на C++ (с кучей memory leakов в результате) == ~+150 баллов... При ровно том же самом коде.

Скажите хотя бы, починится ли Java к обычным раундам? Или лучше сразу на плюсах писать?

→ Ответить

7 лет назад, # ^ |

+16

Илья, ну ты же должен понимать, что мониторить комменты на кф все 48 часов у жюри в моём лице (оповещалка здесь придёт только мне) возможности нет. Такие вопросы надо задавать через тестирующую систему, за ней следит гораздо большее количество людей.

Давай разбираться в личке. Пришли номер сабмита, который, по-твоему, работает неоправданно долго.

→ Ответить

HE_MATEMATIK

7 лет назад, # |

+89

Когда будет тестирование на полном наборе тестов?

→ Ответить

7 лет назад, # ^ |

+26

Оно будет завтра. Спокойной ночи :)

→ Ответить

HE_MATEMATIK

7 лет назад, # ^ |

+52

Когда будут известны результаты финального тестирования?

→ Ответить

ssor96

7 лет назад, # ^ |

-35

Они будут завтра. Спокойной ночи :D

Ну правда. За сутки точно справятся.

→ Ответить

7 лет назад, # ^ |

+27

Уверен?

→ Ответить

7 лет назад, # |

+26

Мои 5 копеек комментариев: было бы здорово, если бы авторы дали свой код генератора тестов. Если мне не изменяет память, на ТС так делалось. Все равно тесты генерировать так или иначе приходится, а маленькие различия в процессе генерации могут серьезно влиять на статистику тестов (например, количество тестов с разными К может серьезно зависеть от того, перегенерируются ли все параметры теста, или только источики, и т.д.). К примеру, у меня локально есть два решения дающие разницу в 50+ баллов, которые на сервере дают абсолютно одинаковый результат:(

→ Ответить

7 лет назад, # ^ |

+49

Спасибо за отзыв, для нас это была первая проба пера, и мы с большим вниманием относимся к подобным замечаниям.

→ Ответить

7 лет назад, # ^ |

← Rev. 2 →

+11

У меня еще больше разница была, аж в 200 очков. Впрочем, посмотрим, что будет на финальном.

UPD В финальном на 100 очков больше, чем в public.

→ Ответить

Um_nik

7 лет назад, # ^ |

+28

В генераторе должен быть код проверки существования решения. Это было бы огромным хинтом, имхо.

→ Ответить

MrDindows

7 лет назад, # ^ |

← Rev. 2 →

В данном случае, его можно бы было оставить функцией-пустышкой: if (!solutionExists()) ...

→ Ответить

7 лет назад, # ^ |

← Rev. 2 →

Это тоже хинт — что существование решения можно проверить быстро. Я когда условие прочитал изначально, не сразу это понял — теоретически, авторы могли какими-то эвристиками строить эти пути, и если не получилось, то перегенеривать. Когда подумал про поток, конечно, всё встало на свои места, но этот момент как-то скользко в условии описан.

→ Ответить

riadwaw

7 лет назад, # ^ |

Ну так в условии написано "генерируем вот так" и если искать эвристиками, то это противоречило бы условию

→ Ответить

7 лет назад, # ^ |

← Rev. 2 →

Ну там мутно написано, "дальнейшие действия произвести невозможно". Но вообще да, все же из условия следует, что авторы умеют за не очень долго проверять существование. Просто в марафоне обычно не ожидаешь, что существенная часть задачи имеет точное решение :)

→ Ответить

7 лет назад, # ^ |

+10

Мне кажется, это специфика конкретной задачи. Но даже в таком случае можно было просто сгенерировать 1000 тестов и положить их где-то в свободном доступе для всех участников.

→ Ответить

zeliboba

7 лет назад, # ^ |

+13

Думаю, что люди из топ-30 это и так все поняли очень быстро, задача-то известная.

→ Ответить

Um_nik

7 лет назад, # ^ |

+10

Думаю, что это неправда.

Для меня было неожиданно, что в марафоне требуются высокоуровневые ACM-алгоритмы, поэтому первый час я даже не думал в эту сторону.

→ Ответить

7 лет назад, # ^ |

+10

Поток всё же general-purpose алгоритм, который активно используется вне ACM. Вот если бы в марафонской задаче было нужно дерево палиндромов или какое-нибудь изощрённое дерево отрезков — было бы более забавно.

→ Ответить

7 лет назад, # ^ |

Уважаемые, а что вы понимаете под словом "нужно" / "требуется"? Это на то и марафонская задача, что никто из авторов не знает, каково "верное" решение. Мне кажется, что вместо этого потока можно было использовать жадное паросочетание и БФС без особых потерь...

→ Ответить

7 лет назад, # ^ |

Вполне возможно, что bfs хватило бы. Лично я под словом "нужно" понимал "напрямую применимо".

→ Ответить

EbraM96

7 лет назад, # |

I couldn't participate in the marathon round , will I be able to participate in later stages of the contest ? If yes , how would marathon round affect me?

→ Ответить

7 лет назад, # ^ |

It is not necessary to participate in all rounds, your final result is calculated only using the rounds you have participated in.

Refer to the Rules section of the competition.

→ Ответить

7 лет назад, # ^ |

+10

I just read the rules, in both languages, and didn't find anywhere a phrase about using the rounds you have participated in. Instead, I found that final score for any participant is the sum of the scores for all 4 rounds + the lowest place taken as a tiebreaker. Did I miss something?

→ Ответить

ania

7 лет назад, # ^ |

I suppose this just meant that the rounds are equally important and you don't drop out (or get any kind of penalty) for not taking part in some of them.

→ Ответить

7 лет назад, # ^ |

← Rev. 2 →

ania says everything correctly, that was the informal description. When you don't participate in a round, you get the zero points and rank infinity, so it is effectively equivalent to not considering this round for you at all. You are not penalized in any manner.

→ Ответить

Eryx

7 лет назад, # ^ |

← Rev. 6 →

I am not sure what the "lowest rank" means. If lower = worse, then your lower rank becomes infinity if you do not participate, i.e., you lose the tiebreaker. The wording "A Contestant will rank higher [...] if they have higher scores [or] their lowest rank for the four rounds [...] is higher" suggests that this is the case. Also, if the worst rank matters, it would be illogical to not consider this infinity, because then not participating in a round might be a better choice than participating ("I am second in the first round, definitely tiebreaker at least, let's not participate just to not destroy my good standing with respect to the tiebreaker" -- with two rounds and 11 advancers this would seem to be the case).

→ Ответить

eulerscheZahl

7 лет назад, # |

+13

As the round is over, here is my code, around 7500 points on the public tests. Ugly to read, not recommended.

First I generate random pairs of life and magic fountains. I try to connect these pairs along the shortest path (I also tried Hungarian method, didn't work that well). When creating pairs, close fountains have a higher probability to be grouped together. I fill the rest of the map in the following manner: when an empty field has two neighbors of the same region, that empty field gets added to that region. Next I assign fields with only one non-empty neighbor, then the remaining fields.

For the second half of the available time I try to further optimize my best solution so far: select a random border and move it. If the score gets higher, I keep that solution. I sometimes move multiple edges in the same round to exit the local optimum.

This code still times out on two of the public tests. Is anyone ranked higher willing to share his strategy?

→ Ответить

savinov

7 лет назад, # ^ |

← Rev. 3 →

+16

I did almost the same, but I generated matching by min-cost-flow with random edge weights. Also I didn't change color of multiple cells at once (was lazy about it :D), but did a lot of iterations. Got about ~7550 points on public testset.

→ Ответить

7 лет назад, # ^ |

← Rev. 4 →

+12

Here is the gist of my method (gets around 7700+, the remaining ≤ 100 points obtained by relatively small hacks)

Generate pairs by min-cost-max-flow with all edge costs of 1. Then fill remaining pixels greedily by adding a pixel each time such that the total score is as high as possible after that step.

Refine this solution by trying to "grow corners" — loop through all possibilities in random order (selecting random pixel, direction, and number of layers ≤ 10 = MAGIC to grow) and apply an update as long as it improves score or there is time left. Example of "corner growth" at the bottom. Growing corners is useful because it allows to get from a square of size N × N to the one bigger, and square is the shape with the optimal "convexity", I think.

Hack that brings biggest improvement: when the test case is from the third group, in MCMF instead of making the cost of every edge 1, make cost of using pixels (i, j) such that i or j is divisible by MAGIC=5, equal to 0. In this way, the obtained set of paths is likely to be more disperse, rather than crammed all together around the line joining the squares from which the sources are generated.

.........           .........
.........           .xxxxxxx.
..xxxxxxy           .xxxxxxxy
..x                 .xx
..x            ->   .xx
..x                 .xx
..z                 ..z

→ Ответить

7 лет назад, # ^ |

+11

Around 7700 on public tests if implemented in C++ (and 7550 — if in Java).

First, run min-cost-max-flow to match sources and targets. Add these paths to the solution, and grow them greedily, adding one cell or continuous horizontal or vertical sequence at a time (score := delta_target_function / (5 + number of cells)). Once the field is full, do a similar process of improving solution: move one cell or continuous horizontal or vertical sequence from one group to another if it increases the score (this is tricky implementation-wise, since we have to check we don't disconnect areas; I just ran full connectivity check, but maybe one can do smth smarter and faster).

Finally, there are many places where one can add randomness here: order of edges in min-cost flow, random fluctuations in greedy & final hill-climbing scores. Add all this randomness, and repeat the procedure for 2 seconds, choosing the best answer. Locally this was giving +200 points (evaluating on 1000 testcases), but on public tests Java version didn't improve much (likely because it didn't have time to do more than O(1) iterations). I didn't have time to evaluate C++ version though.

→ Ответить

MrDindows

7 лет назад, # ^ |

← Rev. 2 →

7668.32

Generate paths by min-cost-max-flow. Expand them by bfs. Then while possible to increase score do mutations of two types: 1) Take side (edge?) of some region and paint several lines near it in the same color. 2) Take side of some region and fill it with neighbour colors.
Some hacks like "we can take just part of side" or "let's check improvement after each painting" significantly increased the execution time but also helped to get up from ~7500 to 7670.
BTW, on my local tests I get ~8150.

→ Ответить