Skip to content
This repository has been archived by the owner on Dec 6, 2021. It is now read-only.

dioptra-io/IMC18_dataset

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 

Repository files navigation

Dataset

We provide here the dataset that lead to the IMC paper "Multilevel MDA-Lite Paris Traceroute". In this dataset, we provide four different files:

The MDA traceroute survey, the Multilevel MDA-Lite traceroute survey, the MDA/MDA-Lite comparison, and the midar results.

The dataset can be found at the following URL:

ftp://venus.planet-lab.eu/

Exploring the data

To explore the MDA traceroute, which are in the mda_survey_dump.tar.gz file, you need to type the following command.

Be careful, this needs approximately 150 GB.

rethinkdb restore mda_survey_dump.tar.gz

Look at the rethinkdb restore documentation if you have memory problems. Common problems can be resolved with the following:

rethinkdb restore --temp-dir /path/to/dir/with/memory mda_survey_dump.tar.gz

To explore the MDA-Lite traceroutes, and MDA/MDA-Lite evaluation dataset, you will need to install graph-tool, as the serialized output of the MDA-Lite traceroute is a graph that is saved with the graph-tool library.

Follow the instructions in the following link to install graph-tool. As the C++ Library makes an extensive use of metaprogramming and templates, it can take a while (up to 30 minutes on a recent laptop) to compile graph-tool:

Graph-tool

Once you have graph-tool, you can use the following code to explore the data:

from graph_tool import *
g = load_graph("/path/to/serialized/graphs.xml")

# Manipulate the graph via its properties.
v_properties = g.vertex_properties
e_properties = g.edge_properties
g_properties = g.graph_properties

# Example: ip_ids, print ip_ids of an ip_address
ip_address = g.vertex_properties["ip_address"]
ip_ids = g.vertex_properties["ip_ids"]
for v in g.vertices():
    if ip_address[v] == "an_ip_of_your_choice_in_the_traceroute":
        print ip_ids[v]

Lots of other properties are available in the graph. You can find a list of it in the head of the .xml file of the serialized traceroute.