Skip to content

Commit

Permalink
Update README.md
Browse files Browse the repository at this point in the history
  • Loading branch information
hehlinge42 authored Mar 29, 2021
1 parent ad079ce commit 6bcd6bf
Showing 1 changed file with 8 additions and 3 deletions.
11 changes: 8 additions & 3 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -5,10 +5,15 @@ This project aims at developing a scalable algorithm able to find connected comp
## Architecture

This repository contains the following files:
* ```reports```: contains the original paper used as reference for this project and the report presenting our methodology
* ```notebooks```
* hllui
* uaewpor
* luhaeiu
* ```notebooks/databricks```: contains both Scala and PySpark implementations of the algorithm to be run on Databricks
* ```notebooks/local```: contains a version of the algorithm that can be run on a local machine
* ```notebooks/analysis```: contains the data vizualization notebooks that was used in the pdf report mentioned above
* ```assets```: contains the ```png``` files of data vizualization used in the report
* ```src```: contains a python script used to parse the cluster logs produced by DataBricks. More details in the report, section 4.2
* ```generator```: contains the source code of a graph generator developed in C
* ```logs```: contains the source logs produced when executing the algorithm and used for performance analysis

## Contributors

Expand Down

0 comments on commit 6bcd6bf

Please sign in to comment.