IDAO has been held since 2018. This page tells the story of the olympiad.
IDAO 2019: How was it?
The final count was 1287 teams from 78 countries.
Stage 1. On-line competition
There were two separate tracks during the online stage. From the machine learning perspective, the tracks were similar, yet the restrictions placed on the solutions were different for each track.
The first track was a traditional data science competition. Given a labeled training data set, participants were asked to make predictions for the test data and submit them to the leaderboard. In this track, participants could produce arbitrarily complex models. For those who like to use 4-level stacking or deep neural networks, this was the right track, since only test predictions had to be submitted. However, those who qualified for the finals were asked to submit the full code of the solution for validation by the judges.
In real-world problems, efficiency is as important as quality. Complex and resource-intensive solutions will not fit the strict time and space restrictions often imposed by an application. That is why, in the second competition track, the task was to solve the same problem as in track one, but with tight restrictions on the time and memory used during both learning and inference. The participants had to upload the end-to-end code for their solution: both learning and inference. The evaluation server ran training and testing for the model and reported the result. Both learning and evaluation had to fit within the time and memory constraints.
We hope that the two tracks made the olympiad fascinating for both machine learning competition experts and competitive programming masters, Kaggle winners and ACM champions, as well as everyone eager to solve real world problems with Data. Moreover, we encouraged people with different backgrounds, ML and ACM, to team up and push Data Analysis to new frontiers.
The muon research group of the LHCb experiment (LHCb Muon Group) provided the task for participants of the online qualifying round. Nikita Kazeev, co-author of the task, describes it:
“The task we gave the participants, muon identification, is important for the LHCb experiment. The majority of the physics research done at LHCb uses the output of this algorithm. I am looking forward to data science practitioners trying their hand at the problem. At the LHCb collaboration, we hope that the ideas and techniques they develop will ultimately bring us a step closer to understanding the big mysteries of the Universe. The task is also tricky from the machine learning point of view, for it contains features of variable length and negative example weights.”
Nikita Kazeev is a PhD student at HSE and the University of Rome and a researcher at the Laboratory of Methods for Big Data Analysis.
Stage 2. Finals
The following two-step procedure was used to select finalists.
First, the 15 teams with the highest scores in the second track went to the final (regardless of their scores in the first track).
Second, from all remaining teams, the 15 teams with the highest scores in the first track were selected (regardless of their scores in the second track).
These teams also went to the final.
Only submissions to the private tasks were considered.
Thus, to qualify for the final, a team could choose one of two strategies:
- to place among the 15 highest-scoring teams in the second track, where the code is needed, or
- to place among the highest-scoring teams in the first track.
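The two-step selection described above can be sketched in a few lines of Python. This is only an illustration, not the organizers' code: the function name `select_finalists`, the dictionary inputs, the assumption that a higher score is better, and the handling of ties (left in whatever order the sort produces) are all assumptions.

```python
def select_finalists(track1, track2, per_track=15):
    """Pick finalists in two steps from {team: score} dictionaries.

    Assumptions (not stated in the rules above): higher scores are better,
    and ties are broken by the order in which teams appear in the input.
    """
    # Step 1: top teams by second-track score, ignoring the first track.
    from_track2 = sorted(track2, key=track2.get, reverse=True)[:per_track]

    # Step 2: among all remaining teams, top teams by first-track score.
    already_chosen = set(from_track2)
    remaining = [team for team in track1 if team not in already_chosen]
    from_track1 = sorted(remaining, key=track1.get, reverse=True)[:per_track]

    return from_track2, from_track1


# Hypothetical scores, with two places per track instead of 15.
track1_scores = {"a": 0.9, "b": 0.8, "c": 0.7, "d": 0.6}
track2_scores = {"a": 0.95, "b": 0.5, "c": 0.6}
via_track2, via_track1 = select_finalists(track1_scores, track2_scores, per_track=2)
print(via_track2, via_track1)
```

Note that a team strong in both tracks takes a second-track slot, which frees a first-track slot for another team.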
Each of the 30 teams selected as finalists received a letter describing further steps.
First of all, finalists were asked for the source code of their solutions (for both tracks), which was reviewed and validated. Each solution had to reproduce the team's submission exactly. The jury members checked that the solutions contained no cheating and that the teams had not attempted to circumvent the rules.
The 2019 finalists table was published in February after the jury’s decision.
The second, on-site stage was held in Moscow in April 2019 at the Yandex headquarters. Over the 36 hours of competition, participants had not only to get a model working quickly, but also to create a full-fledged prototype that was tested for both accuracy and performance.
As part of the on-site round of the olympiad, international experts in machine learning and data analysis also gave talks and workshops.
To take part in the Olympiad, each team participant must register. Each team consists of 1-3 members.
The Olympiad is held in two rounds: an online qualification round hosted on the Yandex.Contest platform, and the on-site finals, held in Moscow. The team must submit its solution to the online round task to the contest system no later than 23:59 Moscow time on February 11, 2019.
Based on the results of the online round, a table with points scored by teams will be published on the IDAO site by February 18, 2019, highlighting the list of finalists.
Each team can submit only one solution.
Only participants who have reached the age of 18 before the start of the on-site finals can participate.
At the finals, participants will need to use their own computer. Use of any legal software is allowed.
Three prizes will be awarded in the final round: one for the winning team and two for the runners-up.
Employees of Yandex and members of the LHCb collaboration can only participate hors concours, since Yandex and LHCb provide tasks for IDAO 2019.
Winners of the first stage (finalists) were invited to Moscow to take part in the on-site competition.
All participants have the chance to showcase their skills to the data science community on an international scale. Possible outcomes include internships, job opportunities, and networking with passionate, like-minded individuals. Winning will also be a serious advantage for students applying to the master’s degree programs at the HSE Faculty of Computer Science.
Winners will receive valuable prizes. All members of the winning team will receive laptops. The winners will be determined by the leaderboard ranking based on the private test set.
|Team||Captain||Affiliation||First Member||Affiliation||Second Member||Affiliation|
|AR_U_KIDDIN_MI||Eugene Bobrov||Moscow State University||Vladimir Bugaevskii||Moscow State University||Denis Bibik||Moscow State University|
|BarelyBears||Hiroshi Yoshihara||The University of Tokyo||Kosaku Ono||The University of Tokyo||Naoki Maeda||The University of Tokyo|
|Columbarium||Konstantin Frolov||SKB Kontur||Grigoriy Pogorelov||MTS||Nikolay Prokoptsev||Tinkoff Bank|
|DataBroom||Pawan Kumar Singh||Myntra Design Pvt. Ltd.||Shruti Singh|
|DataScienceBois||Egor Kravchenko||Lomonosov Moscow State University||Vladislav Trifonov||Lomonosov Moscow State University||Artyom Mironov||Lomonosov Moscow State University|
|Eureka||Sandeep Singh Adhikari||Myntra||Yadunath Gupta||Myntra||Nilpa Jha||Myntra|
|FeelsBadMan||Iskander Safiulin||OKKO||Ksenia Balabaeva||ITMO||Dmitry Ivanov||Higher School of Economics|
|Gradient Boosting||Pavel Shevchuk||NRU HSE (applied mathematics)||Mikhail Diskin||NRU HSE||Dmitriy Nikulin||Samsung AI Center Moscow|
|HAL 9000 followers||Aleksandr Belov||National Research University of Electronic Technology, Applied mathematics||Andrey Gorodetsky||Bauman Moscow State Technical||Maxim Tsygankov||Bauman Moscow State Technical University|
|HardNet||Yaroslav Murzaev||MIPT||Andrey Kachetov||MIPT||Viktor Nochevkin||MIPT|
|holistic agency||Maxim Shaposhnikov||Ural Federal University||Elena Arslanova||Ural Federal University||Denis Razbitsky||Ural Federal University|
|Hunky-dory||Ihar Shulhan||Innopolis University||Almira Murtazina||Innopolis University||Ruslan Mustafin||Machine learning engineer|
|ifelse||Samir Mammadov||E-gov Development Center||Asgar Mammadli||E-gov Development Center||Umid Suleymanov||E-gov Development Center|
|ImprovY||Stanislav Sopov||SEMrush||Mikhail Alekseev||Okko||Rinat Shakbasarov||GrowFood|
|itchy mcfly||Petr Kuderov||N/A||Alex Maslov|
|John Keats||Kirill Trofimov||self-employed||Sabina Abdullaeva|
|kek ||Ranis Nigmatullin||yandex|
|Magic CIty||Sergei Arefev||Saint Petersburg State University||Artem Plotkin||Saint Petersburg State University||Roman Pyankov||Saint Petersburg State University|
|Mylen Farmer||Ilya Ivanitskiy||Avito|
|Polis||Yuriy Gavrilin||Innopolis University||Vladislav Kurenkov||Innopolis University||Andrey Kulagin||Innopolis University|
|shadd||Daniil Barysevich||BSUIR, Computer Science||Dzmitry Vabishchewich||BSUIR||Aliaksei Barysevich||BSUIR, Computer Science|
|Singularis Lab||Aleksei Alekseev||Singularis Lab||Oleg Shapovalov||Singularis Lab||Andrey Pedchenko||Mello|
|TEAM X||Andrey Kutsenko||Moscow State University||Nazar Beknazarov||Higher School of Economics||Sergey Kolomiyets||Tyumen State University|
|Team_Name||Daniil Cherniavskii||MIPT||Alexandr Valukov||MIPT|
|trtr||Denis Litvinov||Sberbank||Aleksey Buzovkin||Michail Voronov|
|Umka||Dmitrii Fedotov||PJSC Norilsk Nickel|
|Unnamed:0||Arthur Bogdanov||Innopolis University||Gcinizwe Dlamini||Innopolis University||Rufina Galieva||Innopolis University|
|wearenotgonnapasstothefinalanyways||Toghrul Rahimli||ADA University||Jalal Rasulzade||ADA University||Orkhan Bayramli||ADA University|
|Zvezdochka*||Ernest Glukhov||Innopolis University||Daria Zapekina||Innopolis University||Vyacheslav Karpov||Innopolis University|
 The team “kek” participated in the final hors concours.
1st place – Mylen Farmer
Ilya Ivanitskiy – Higher School of Economics/Avito
2nd place – Zvezdochka*
Ernest Glukhov – Innopolis University,
Daria Zapekina – Innopolis University,
Vyacheslav Karpov – Innopolis University
3rd place – TEAM X
Andrey Kutsenko – Moscow State University,
Nazar Beknazarov – Higher School of Economics,
Sergey Kolomiyets – Tyumen State University
IDAO 2019 Finals: scores in Yandex.Contest, ranks for tasks, and place by rank
|Team Name||taskA||taskB||taskC||Total Score||Place by Score||rankA||rankB||rankC||Total Rank||Place by Rank|
|kek (hors concours)||74.1||62.39||61.04||197.53||12||12.0||10.0||16.0||38.0||12|
|HAL 9000 followers||74.7||62.32||60.26||197.28||13||7.0||11.5||27.0||45.5||13|
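The rank columns in the table appear to be per-task ranks added up into a total: for “kek”, 12.0 + 10.0 + 16.0 = 38.0. A fractional rank such as 11.5 suggests that tied teams share the average of the positions they occupy. The sketch below illustrates that scheme under those assumptions. The function names and sample scores are hypothetical, and the organizers' exact ranking rules are not given on this page.

```python
from collections import defaultdict


def average_ranks(scores):
    """Rank {team: score} in descending order (rank 1 = best score).

    Tied teams share the mean of the positions they occupy, which is the
    assumed source of fractional ranks such as 11.5.
    """
    ordered = sorted(scores.items(), key=lambda item: item[1], reverse=True)
    ranks = {}
    i = 0
    while i < len(ordered):
        # Extend j to the last team with the same score as team i.
        j = i
        while j + 1 < len(ordered) and ordered[j + 1][1] == ordered[i][1]:
            j += 1
        mean_rank = ((i + 1) + (j + 1)) / 2
        for k in range(i, j + 1):
            ranks[ordered[k][0]] = mean_rank
        i = j + 1
    return ranks


def total_ranks(task_scores):
    """Sum per-task average ranks; a lower total means a better place."""
    totals = defaultdict(float)
    for scores in task_scores.values():
        for team, rank in average_ranks(scores).items():
            totals[team] += rank
    return dict(totals)


# Hypothetical scores for three teams on two tasks.
example = {
    "taskA": {"a": 3.0, "b": 2.0, "c": 2.0},  # b and c tie for places 2-3
    "taskB": {"a": 1.0, "b": 5.0, "c": 3.0},
}
print(total_ranks(example))
```

Ranking by summed ranks rather than summed scores keeps a task with a wide score spread from dominating the final standings.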
The on-site finals, in which the top 30 teams from the online round competed, were held at the Yandex office in Moscow.
- Dmitry Vetrov – Chairman of the Judiciary Commission, Research Professor at HSE, Head of the Deep Learning and Bayesian Methods Centre
- Alexander Guschin – Judge, Data Analyst at Yandex, highest overall rank in Kaggle is 5th
- Emil Kayumov – Judge, Data Analyst at Yandex.Taxi
- Matteo Palutan – Judge, Researcher at the Laboratori Nazionali di Frascati of INFN, Member of the LHCb experiment at CERN
- Barbara Sciascia – Judge, Researcher at the Laboratori Nazionali di Frascati of INFN, Team leader of Frascati LHCb group and Deputy Operation Coordinator of the experiment
- Evgeny Sokolov – Judge, Head of AI at Yandex.Zen, Deputy Head of the Big Data and Information Retrieval School
- Dmitry Ulyanov – Judge, PhD student at Skoltech University, Research Scientist at Bayesian Methods Centre
- Andrey Ustyuzhanin – Judge, Head of Methods for Big Data Analysis Lab at HSE
Yandex is a technology company that builds intelligent products and services powered by machine learning. Our goal is to help consumers and businesses better navigate the online and offline world. Since 1997, we have delivered world-class, locally relevant search and information services. Additionally, we have developed market-leading on-demand transportation services, navigation products, and other mobile applications for millions of consumers across the globe. Yandex, which has 22 offices worldwide, has been listed on the NASDAQ since 2011.
About our education initiatives: Yandex is helping shape the future of education by enhancing the learning process with machine learning technologies and teaching the next generation of data scientists to thrive in a world driven by artificial intelligence. As the leading search provider in Russia and one of Europe’s largest internet companies, we have a responsibility to help educate future generations in data science, artificial intelligence and machine learning. We are proud to help provide the education that will make this goal a reality and help future generations prepare for the jobs of tomorrow through our math and coding competitions, learning platforms, school programs, online courses, and the Yandex School of Data Analysis.
The Higher School of Economics (HSE) is one of the most renowned Russian universities. Its education is focused on economics and social sciences as well as high technologies and natural science. We combine in-depth study of fundamental disciplines with real experience at the biggest Russian companies to give our graduates the skills they need for their future careers.
The HSE Faculty of Computer Science was created in March 2014 with the goal of becoming one of the world’s top 30 faculties in training developers and researchers in the field of big data storage and processing, system and software engineering and system programming. The Faculty is active in many research areas: machine learning, computer vision, theoretical computer science, algorithms for big data, optimisation, software engineering, and bioinformatics. We publish in leading computer science journals and present our results at major conferences.
- Tamara Voznesenskaya – Organizing Committee Chair, First Deputy Dean at the Faculty of Computer Science, HSE University
- Irina Plisetskaya – Partnership Coordinator, Deputy Dean for Development, Finance and Administration at the Faculty of Computer Science, HSE University
- Sergey Karapetyan – IDAO Coordinator, Manager at the Faculty of Computer Science, HSE University
- Emil Kayumov – Problem Co-author, Data Analyst at Yandex.Taxi
- Nikita Kazeev – Problem Co-author, PhD student at HSE and the University of Rome, researcher at the Laboratory of Methods for Big Data Analysis
- Vladislav Lipyanin – Web-Site Editor, Student at HSE University
- Denis Mashkovtsev – System Administrator at HSE University
- Alexey Mitsyuk – Technical Team Lead, Research Fellow at the Faculty of Computer Science, HSE University
- Aleksey Tolstikov – Yandex.Contest Expert, Yandex School for Data Analysis
IDAO 2018: The first tournament
In 2018 the Online Round gathered 1500 participants from all over the world. It ran from January 15 to February 11, 2018. The 100 best participants took part in the On-Site Final, which was held in Moscow on April 2–3.
Stage 1. On-line competition
The event was organized by the HSE Faculty of Computer Science, Yandex, and Harbour.Space University (Barcelona) with the support of Sberbank. The task for the Online Round was provided by Yandex.Market.
Stanislav Fedotov, curator at the Yandex School of Data Analysis, and Associate Professor at the HSE Faculty of Computer Science, on the task for the Online Round: “At the online stage, the contestants solved a task for Yandex.Market. When a user enters this service with a specific purpose, the system chooses a set of options which match their query. For example, when someone looks for a kettle, Yandex.Market offers them a lot of options of kettles with various prices and options. But teaching the system to predict queries would be much more interesting, as this would mean that it would offer not what the individual is looking for at that particular moment, but something they would be likely to want in future. The participants were given a search history of notional users, and they had to predict the categories of items these individuals hadn’t looked at over the last three weeks, but would be likely to search for in a week’s time. They had to choose five users, suggest five categories of goods for each user and ‘guess’ at least one of them.”
Stage 2. Final
The task for the Final was provided by Sberbank. According to Andrey Chertok, Managing Director for Research and Development at Sberbank, the participants had to solve a real problem that the Sberbank team had worked on recently and that all banks face. The task is highly practical: it is about optimizing the cash supply for Sberbank ATMs, which number in the tens of thousands across the country. The problem is that cash delivery isn’t always performed effectively, and as a result, cash lies useless in some ATMs, while others run out of cash too quickly. ‘The bank’s losses due to excessive money just ‘lying around’ in ATMs amount to billions of roubles annually’, Andrey Chertok emphasized. ‘Our team uses data analysis more and more frequently to solve such problems. For example, the problem with cash delivery optimization and forecasting the amount of money to be cashed from a specific ATM was successfully solved with machine learning methods. We proposed a mini version of what we’ve done at Sberbank to the Olympiad participants.’ The finalists worked with real data on the locations and loading of Sberbank ATMs. During the process, the teams faced the same problems that bank data analysis teams face in real life. These include deciding whether or not the data should be cleaned, and handling so-called ‘outliers’, which relate to more intensive cash delivery on days when salaries or pensions are paid. ‘In a short period of time, all the participants were quite successful in building usable models and got some hands-on experience in solving a real banking task’, said Andrey Chertok. ‘I believe, at this Olympiad, we managed to bring together competitive spirit and applicability’.
In first place, and hailing from St. Petersburg, Magic City (Artem Plotkin, Roman Piankov and Sergey Arefev).
Coming in second place, all the way from Ukraine, is team SantiagoSeaman (Alexander Makeev).
And finally in third place and making their way from Belarus, Apex (Evgeniy Demidovich, Sergei Petrov and Konstantin Mlynarchyk).
Press about IDAO
- IDAO Olympiad. A battle of big data specialists – Yandex Academy, March 11, 2019
- The International Data Analysis Olympiad IDAO will be held for the second time – Where to Study and Work (guir.ru)
- II International Data Analysis Olympiad (IDAO) – Russian Cultural Center in Beijing
- Registration for the International Data Analysis Olympiad closes on January 28 – Phystech student portal POTOK
- On April 4–6, Moscow will host the on-site round of the International Data Analysis Olympiad IDAO. A 36-hour programming marathon awaits 31 finalist teams, including participants from Japan and even Swaziland – Student news agency “KLIK”, April 1, 2019
- University of Tokyo team takes an impressive 4th place at the data analysis olympiad IDAO: “There are monsters in Russia” – Sputnik Japan, 2019-04-12
- International Data Analysis Olympiad – Tproger.ru, January 15, 2019
- Moscow to host the final of the second International Data Analysis Olympiad IDAO – Rossotrudnichestvo, April 1, 2019
- Winners of the International Data Analysis Olympiad IDAO-2019 awarded in Moscow – Rossotrudnichestvo, April 6, 2019
- IDAO-2019 finalists on their impressions – Yandex Academy, April 12, 2019
- IDAO: results of the first round of the International Data Analysis Olympiad announced – Sputnik.Novosti, February 21, 2018