Center for Global Cyber Strategy (CGCS) researchers have used the data donated by the white hat groups to create anonymized profiles of the groups. CGCS sociopsychologists have identified one such profile as most likely to resemble the structure of the group that accidentally caused this internet outage. You have been asked to examine CGCS records and identify the groups that most closely resemble the identified profile.
Data for this mini-challenge consists of the following. This data is described in more detail in the download file.
Note: In the sample, data channels were each in their own file. In the full release these have been merged into a single file.
Note: Having difficulty with scale when working with the very large graph? Feel free to ask questions about approaching this challenge. Answers to the questions we receive will be posted on this page for all contestants to see.
Question
I’m having trouble downloading the large files when I click on the link in the form. How do I download the files?
Clarification
Links in the form must be opened in a new browser tab. Clicking on the links within the form may not work.
Question
There are travel records with negative weights. What does this signify?
Example:
492850 6 625756 1641600 -1 5 3 22 156 -25 -111
Clarification
There is no special significance to these values. Datasets can be messy and contain errors and unknown values.
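Since such values carry no special meaning, one option is to flag them during ingest rather than interpret them. The following is a minimal sketch in Python, not part of the official challenge materials. It assumes records are whitespace- or comma-separated and that the weight is the fifth field, which matches the -1 in the example above but should be checked against the readme. The names parse_record, has_negative_weight, and WEIGHT_INDEX are hypothetical.

```python
# Sketch only: the field layout is an assumption, not taken from the readme.
RECORD = "492850 6 625756 1641600 -1 5 3 22 156 -25 -111"
WEIGHT_INDEX = 4  # assumption: the weight is the fifth field (0-based index 4)


def parse_record(line):
    """Split one record on commas or whitespace and convert each field to a number."""
    return [float(token) for token in line.replace(",", " ").split()]


def has_negative_weight(fields, weight_index=WEIGHT_INDEX):
    """Return True when the assumed weight field is below zero."""
    return fields[weight_index] < 0


fields = parse_record(RECORD)
flagged = has_negative_weight(fields)
```

Whether flagged records are dropped, kept, or treated as unknown is an analysis decision; the sketch only marks them.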
Question
There is a mismatch in the descriptions of eType 0 and eType 1 in the PDF documentation for mini-challenge 1 (CGCS-GraphData-Readme.pdf). Which is correct?
Clarification
The table on the first page of the PDF was incorrect. Emails are designated as eType 0 and calls are designated as eType 1.
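For teams that label edges in their own tools, the corrected designation can be written as a small lookup. This is a sketch in Python that covers only the two codes confirmed in the clarification above. The names ETYPE_LABELS and etype_label are hypothetical, and every other eType code is returned as "other" here and should be taken from the readme.

```python
# Only the two codes confirmed in the clarification are listed here.
ETYPE_LABELS = {
    0: "email",
    1: "call",
}


def etype_label(etype):
    """Return a readable label for an eType code, or "other" if it is not listed."""
    return ETYPE_LABELS.get(etype, "other")
```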
Note: Links in the form must be opened in a new tab.
VAST 2020 Submission Instructions
Please provide a valid email address in order to get access to the data. Your email address may be used if we need to contact you with important information about the Challenge.
Sample data for this mini-challenge is available in order to give teams a small set to prepare their tools and work through ingesting the data.
Note: Data from multiple channels is provided in separate files in this sample.