Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Remodel due to d2. #37

Open
wants to merge 92 commits into
base: main-the-big-three
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
92 commits
Select commit Hold shift + click to select a range
eff1392
Indicator filter eliminated & new scale
amarchve Jul 21, 2022
a111977
New corr added
amarchve Jul 21, 2022
9ce55a4
bug from 21/07 fixed
amarchve Jul 22, 2022
8541783
Final corr
amarchve Jul 22, 2022
dda1894
Update WDI-Complete code.ipynb
amarchve Jul 22, 2022
165b4dd
Update WDI-Complete code.ipynb
Mferna38 Jul 22, 2022
6e5995a
Merge branch 'Remodel-due-to-d2' of https://github.com/devonfw-forge/…
Mferna38 Jul 22, 2022
f3db7d9
Revert "Merge branch 'Remodel-due-to-d2' of https://github.com/devonf…
amarchve Jul 25, 2022
19c6d13
json fix
amarchve Jul 25, 2022
557cf8d
Categories ended
Mferna38 Jul 25, 2022
49b7d8d
Error correction
Mferna38 Jul 25, 2022
027325f
Inplace checked
Mferna38 Jul 25, 2022
4cc57e0
Normalization fixed
amarchve Jul 25, 2022
1120fd6
shifts
amarchve Jul 25, 2022
ce89295
Update WDI-Complete code.ipynb
masanchis Jul 26, 2022
b310703
Update WDI-Complete code.ipynb
masanchis Jul 26, 2022
1c22ae6
Update WDI-Complete code.ipynb
masanchis Jul 26, 2022
cba6f21
Update WDI-Complete code.ipynb
masanchis Jul 26, 2022
e7e0adb
U
masanchis Jul 26, 2022
b76627b
Update README.md
masanchis Jul 26, 2022
ecf5003
Update README.md
masanchis Jul 26, 2022
5606f57
Update README.md
masanchis Jul 26, 2022
75b788d
Update README.md
masanchis Jul 26, 2022
e08b9ae
Update README.md
masanchis Jul 26, 2022
764d746
Categ included
Mferna38 Jul 26, 2022
4f6a2ee
Merge branch 'Remodel-due-to-d2' of https://github.com/devonfw-forge/…
Mferna38 Jul 26, 2022
76e8b51
Review
Mferna38 Jul 26, 2022
cbf9059
Update README.md
masanchis Jul 26, 2022
366d2da
bug fixed
amarchve Jul 26, 2022
003e9cf
Merge branch 'Remodel-due-to-d2' of https://github.com/devonfw-forge/…
amarchve Jul 26, 2022
8080645
Update WDI-Complete code.ipynb
masanchis Jul 26, 2022
67dcb21
Update WDI-Complete code.ipynb
masanchis Jul 27, 2022
978bebc
Update WDI-Complete code.ipynb
masanchis Jul 27, 2022
e380048
Update WDI-Complete code.ipynb
masanchis Jul 27, 2022
614e648
Moved corr
amarchve Jul 28, 2022
c417ff9
final tables visualized
amarchve Jul 28, 2022
15a918f
Update WDI-Complete code.ipynb
amarchve Jul 28, 2022
9975cec
Marias' changes
Mferna38 Jul 28, 2022
c688856
Merge branch 'Remodel-due-to-d2' of https://github.com/devonfw-forge/…
amarchve Jul 29, 2022
c946c67
Final table
amarchve Jul 29, 2022
8082b0d
visualization
amarchve Jul 29, 2022
237ab9d
Update WDI-Complete code.ipynb
amarchve Jul 29, 2022
bfc5032
Update WDI-Complete code.ipynb
amarchve Jul 29, 2022
4e8b05c
Conclusions started
amarchve Aug 1, 2022
a0ef360
Code for conclusions done, without documentation
amarchve Aug 1, 2022
8772e66
Updated dfind
amarchve Aug 1, 2022
5e52074
Update WDI-Complete code.ipynb
amarchve Aug 1, 2022
545edb9
2 Graphs added
Mferna38 Aug 1, 2022
5f6c57a
Args[] to resolve
Mferna38 Aug 1, 2022
c7c9b25
updated final table
amarchve Aug 1, 2022
46814f2
Starting documentation
amarchve Aug 2, 2022
5fa6be1
Casuallity hypothesis finshed
amarchve Aug 2, 2022
1b6b2d0
Update WDI-Complete code.ipynb
amarchve Aug 2, 2022
521b9a2
Update WDI-Complete code.ipynb
amarchve Aug 2, 2022
ab3c3f6
photos
amarchve Aug 2, 2022
305d551
Cases updated
Mferna38 Aug 2, 2022
ff32b65
1st slider
amarchve Aug 2, 2022
b24bc53
Sliders 2
amarchve Aug 2, 2022
2a02702
Slider 3
amarchve Aug 2, 2022
ab522c4
photos updated
amarchve Aug 2, 2022
2b86044
final photos
amarchve Aug 2, 2022
232245e
Update WDI-Complete code.ipynb
amarchve Aug 2, 2022
e60f04a
documentation
amarchve Aug 2, 2022
ee66657
Update dfindicators - Copy.xlsx
amarchve Aug 3, 2022
1da536c
Documetation corr
amarchve Aug 3, 2022
d213c5e
H_0 sets
amarchve Aug 3, 2022
7d5c417
Final tables
amarchve Aug 3, 2022
73992ee
Update WDI-Complete code.ipynb
amarchve Aug 3, 2022
ac52596
Updated libraries
Mferna38 Aug 3, 2022
c3ff331
Update README.md
Mferna38 Aug 3, 2022
a39ddd1
Doc
amarchve Aug 3, 2022
4baa609
Merge branch 'Remodel-due-to-d2' of https://github.com/devonfw-forge/…
amarchve Aug 3, 2022
61987a2
Extract
amarchve Aug 3, 2022
55d63dc
Update WDI-Complete code.ipynb
amarchve Aug 3, 2022
ce172d7
itable added
amarchve Aug 3, 2022
62f32d1
Introduction title
amarchve Aug 3, 2022
5888b9d
Update README.md
masanchis Aug 3, 2022
399daec
Introduction
masanchis Aug 3, 2022
6b05a5e
describe through images
amarchve Aug 3, 2022
e32d308
TITULO & ÍNDICE
masanchis Aug 3, 2022
d729d90
Update WDI-Complete code.ipynb
masanchis Aug 3, 2022
f86d25e
Update WDI-Complete code.ipynb
amarchve Aug 3, 2022
ca48c67
Update WDI-Complete code.ipynb
amarchve Aug 3, 2022
1fda043
Revert "Update WDI-Complete code.ipynb"
amarchve Aug 3, 2022
6960cb5
Last graphs
amarchve Aug 3, 2022
265c6da
Update WDI-Complete code.ipynb
amarchve Aug 3, 2022
19ea942
Adrian sample file
Mferna38 Aug 3, 2022
55c23ec
Doc, spurious
amarchve Aug 4, 2022
bbab318
Graphs reviewed
Mferna38 Aug 4, 2022
5f89c1f
Cap logo
amarchve Aug 4, 2022
bd9e938
Divided notebooks
amarchve Aug 4, 2022
4071a70
New
amarchve Aug 4, 2022
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Binary file added Data/dfindicators - Copy.xlsx
Binary file not shown.
102,914 changes: 0 additions & 102,914 deletions Demo overall graphs.ipynb

This file was deleted.

Binary file added IQR.webp
Binary file not shown.
Binary file added Logos/chart (1).jpeg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added Logos/chart (2).jpeg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
18 changes: 15 additions & 3 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -48,6 +48,13 @@ Therefore it is being utilized to get rid of all the outliers that may come from

2- **Substitution of the NaN values**. The developed Nan values´ treatment has been a mix, between the linear interpolation and backwards filling. The linear interpolation is a form of interpolation, which involves the generation of new values based on an existing set of values. Linear interpolation is achieved by geometrically rendering a straight line between two adjacent points on a graph or plane. On the other side, the backwards filling will help us to arrive to those values which have not been fullfilled with the linear interpolation.

3- **Scaling method**. The escalation process has been done dividing each value by the initial one of an indicator (value in 1990). Considering the start point as 1 (initial value divided by itself), each result will show the growth respect to the initial data.

4- **Removing indicators**.
- Those indicators which have 20% of missing values of its total have been removed because a lack of data shows unreliable results.
- There are some indicators which represent exactly the same through different units, so, we are going to select only one type. For example, in monetary cases, indicators which are expressed with current US $ has been selected. Then, which are showed with the percentage and the total value, we have programmed to selct which ones which show a greater value.


# Run the application
## Dependencies
Dependecies are automatically managed by Poetry and there is NO need to use external dockers for running spark.
Expand All @@ -71,10 +78,18 @@ Poetry will take care of:
- From `scipy` import `stats` and `shapiro`. Scipy provides algorithms for optimization, integration, interpolation, eigenvalue problems, algebraic equations, differential equations, statistics and many other classes of problems. Extends NumPy providing additional tools for array computing and provides specialized data structures, such as sparse matrices and k-dimensional trees. Mainly used for statistical calculations.

- The `plotly.express` module (usually imported as px) contains functions that can create entire figures at once, and is referred to as Plotly Express or PX. Plotly Express is a built-in part of the plotly library, and is the recommended starting point for creating most common figures. Every Plotly Express function uses graph objects internally and returns a plotly.graph_objects.Figure instance. Throughout the plotly documentation, you will find the Plotly Express way of building figures at the top of any applicable page, followed by a section on how to use graph objects to build similar figures. Therefore it will allow for interacting graphs.

- `seaborn` is a Python data visualization library based on matplotlib. It provides a high-level interface for drawing attractive and informative statistical graphics. Used for the correlations matrix.

- The `requests` library is the de facto standard for making HTTP requests in Python. It abstracts the complexities of making requests behind a beautiful, simple API so that you can focus on interacting with services and consuming data in your application. It allows for downloading data from the websites.

- The `functools` is a module for higher-order functions: functions that act on or return other functions.

- The `ipywidgets` allows us having interactive widgets (sliders, buttons, dropdowns...) with which we can control and customize the display of our data.

- `warnings` to avoid warning messages when showing the notebook.

- `dash` and `itables`: both libraries can be used for making interactive tables designed for viewing, editing, and exploring large datasets in Python. At the begginning, we started with `dash` (which is a scratch in React.js) but as it is rendered with semantic HTML, we looked for an alternative that visualizes inside our notebook, so we found `itables`, which have a similar functionality.

## Running on local
To start the execution of our code, you can directly run the notebooks on Visual Studio Code opening the files .ipynb, or with the command `poetry run jupyter notebook`.
Expand All @@ -86,11 +101,8 @@ To start the execution of our code, you can directly run the notebooks on Visual
- The Net Migration indicator measures the difference between the number of immigrants and emigrants, so the number of people entering the country minus the number of people leaving it. As the difference is measured, it is necessary to see the original data to draw the conclusions correctly. Taking into account whether this net value is positive or negative and extract conclusions according the type of correlation.

- Something similar occurs with the indicator Direct Foreign Investment. This one shows us the difference between outflows and inflows, so, if countries invest outside more or less than which is invested in them. Again we should see which are the net values to extract conclusions correctly.

- After treating the data, we realized that the indicators: Mortality Pollution and Gender Equality do not present sufficient data for analysis. Therefore, they have been eliminated. No conclusions will be drawn between this indicator and the countries.



## Repository setup
The code is divided in several notebooks that need to be excuted following the corresponding order, which coincides with the one described above in the main steps section.

Expand Down
Loading