Each press article in the press test set file (press_test.csv) reports on a scientific paper in paper_test.csv. For each press article, participants must choose the 3 best-matching papers. Predictions must be ordered by probability, from highest to lowest.
Please refer to sample_submission.csv for the submission format. Submission files are in CSV format without a header row:
press_id, paper_id (1), paper_id (2), paper_id (3)
Each line of the file corresponds to one press article (press_id), followed by its 3 most relevant papers (paper_id). The three predictions on a line must be different from each other. A half-width comma must separate press_id and each paper_id.
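As an illustration of the format rules above, here is a minimal sketch in Python that turns ranked predictions into submission lines. The function name `format_submission` and the input layout (a dict from press_id to a ranked list of paper_id values) are assumptions made for this example, not part of the competition materials. It also assumes no spaces after the commas; sample_submission.csv remains the authority on the exact layout.

```python
import csv
import io


def format_submission(predictions):
    """Return submission text: one header-less CSV line per press article.

    `predictions` (hypothetical layout) maps each press_id to a list of
    paper_id values ordered from most to least probable.
    """
    buf = io.StringIO()
    writer = csv.writer(buf, lineterminator="\n")
    for press_id, ranked in predictions.items():
        # Drop repeated paper_ids while keeping rank order, then keep the top 3.
        top3 = list(dict.fromkeys(ranked))[:3]
        if len(top3) != 3:
            raise ValueError(f"{press_id}: need 3 distinct paper_ids, got {len(top3)}")
        writer.writerow([press_id, *top3])
    return buf.getvalue()


if __name__ == "__main__":
    # Made-up identifiers, for illustration only.
    print(format_submission({"press_1": ["paper_9", "paper_4", "paper_4", "paper_7"]}), end="")
```

Deduplicating before truncating keeps the three predictions on a line distinct, as required, without changing their relative order.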
Submissions are evaluated according to the Mean Average Precision @3 (MAP@3).
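The formula for the metric does not appear in this text. Assuming the organizers use the form commonly given for MAP@k, with the symbols |U|, P(k), and n as defined in this section, it would read:

```latex
\mathrm{MAP@3} = \frac{1}{|U|} \sum_{u=1}^{|U|} \sum_{k=1}^{\min(n,\,3)} P(k)
```

Under the usual reading of this form, P(k) contributes only at a rank k where the predicted paper is correct; that reading is an assumption here, not something the text states.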
In this metric, |U| is the number of press_id values in the test set, P(k) is the precision at cutoff k, and n is the number of papers predicted for a press article.
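To make the metric concrete, the following is a minimal sketch of a MAP@3 computation in Python. It assumes that each press article has exactly one correct paper and that precision is counted only at the rank where that paper appears, so a press article scores 1, 1/2, or 1/3 for a hit at rank 1, 2, or 3, and 0 otherwise. The function names and the dict-based inputs are hypothetical, and the organizers' scoring code may differ.

```python
def average_precision_at_3(true_paper, predicted):
    """Score one press article: 1/rank of the correct paper in the top 3, else 0."""
    for rank, paper in enumerate(predicted[:3], start=1):
        if paper == true_paper:
            return 1.0 / rank
    return 0.0


def map_at_3(truth, predictions):
    """Mean of the per-article scores over every press_id in `truth`.

    `truth` maps press_id -> correct paper_id (assumed to be exactly one).
    `predictions` maps press_id -> ranked list of predicted paper_ids.
    A press_id with no prediction scores 0.
    """
    total = sum(
        average_precision_at_3(true_paper, predictions.get(press_id, []))
        for press_id, true_paper in truth.items()
    )
    return total / len(truth)


if __name__ == "__main__":
    # Made-up identifiers: hits at rank 1 and rank 2, then a miss.
    truth = {"p1": "a", "p2": "b", "p3": "c"}
    predictions = {
        "p1": ["a", "x", "y"],
        "p2": ["x", "b", "y"],
        "p3": ["x", "y", "z"],
    }
    print(map_at_3(truth, predictions))  # (1 + 1/2 + 0) / 3
```

Because only the first three predictions are scored and earlier ranks earn more, ordering the predictions from most to least probable directly affects the score.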
Science of Science Data Hackathon