-
Notifications
You must be signed in to change notification settings - Fork 3
/
compositor.json
80 lines (80 loc) · 4.8 KB
/
compositor.json
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
{
"name": "josep2/Jathena",
"version": "0.1.4",
"libraries": {
"xv": "^1.1.25"
},
"title": "Jathena",
"branch": "master",
"style": {
"name": "Williamsburg",
"componentSet": {
"nav": "nav/BasicNav",
"header": "header/LightBannerHeader",
"article": "article/ReaderArticle",
"footer": "footer/BasicFooter"
},
"fontFamily": "Montserrat, sans-serif",
"heading": {
"fontWeight": 600,
"letterSpacing": "0.1em"
},
"colors": {
"text": "#346",
"background": "#fff",
"primary": "#0099e0",
"secondary": "#ab61ff",
"highlight": "#f7b",
"muted": "#fffab3",
"border": "#ccd"
}
},
"content": [
{
"component": "nav",
"links": [
{
"href": "https://github.com/josep2/Jathena",
"text": "GitHub"
}
]
},
{
"component": "header",
"heading": "Jathena",
"subhead": "Proof of concept open source Amazon Athena",
"children": [
{
"component": "ui/TweetButton",
"text": "Jathena: Proof of concept open source Amazon Athena",
"url": null
},
{
"component": "ui/GithubButton",
"user": "josep2",
"repo": "Jathena"
}
]
},
{
"component": "article",
"metadata": {
"source": "github.readme"
},
"html": "\n<h3>An Open Source Amazon Athena Proof of Concept</h3>\n<h4>Parts</h4>\n<ul>\n<li>Apache Drill</li>\n<li>Apache Zookeeper</li>\n<li>Minio</li>\n<li>Kubernetes</li>\n</ul>\n<h4>Installation</h4>\n<p><strong>Prereqs:</strong></p>\n<p><a href=\"https://kubernetes.io\">Kubernetes</a></p>\n<h4>Ordered Installation</h4>\n<p>Install Minio</p>\n<pre>//Run these three commands (Thanks: https://github.com/kubernetes/examples/blob/master/staging/storage/minio/README.md)\nkubectl <span class=\"hljs-keyword\">create</span> -f https://github.com/kubernetes/kubernetes/<span class=\"hljs-built_in\">blob</span>/<span class=\"hljs-keyword\">master</span>/examples/<span class=\"hljs-keyword\">storage</span>/minio/minio-<span class=\"hljs-keyword\">standalone</span>-pvc.yaml?<span class=\"hljs-keyword\">raw</span>=<span class=\"hljs-literal\">true</span>\nkubectl <span class=\"hljs-keyword\">create</span> -f https://github.com/kubernetes/kubernetes/<span class=\"hljs-built_in\">blob</span>/<span class=\"hljs-keyword\">master</span>/examples/<span class=\"hljs-keyword\">storage</span>/minio/minio-<span class=\"hljs-keyword\">standalone</span>-deployment.yaml?<span class=\"hljs-keyword\">raw</span>=<span class=\"hljs-literal\">true</span>\nkubectl <span class=\"hljs-keyword\">create</span> -f https://github.com/kubernetes/kubernetes/<span class=\"hljs-built_in\">blob</span>/<span class=\"hljs-keyword\">master</span>/examples/<span class=\"hljs-keyword\">storage</span>/minio/minio-<span class=\"hljs-keyword\">standalone</span>-service.yaml?<span class=\"hljs-keyword\">raw</span>=<span class=\"hljs-literal\">true</span>\n\n//You can also <span class=\"hljs-keyword\">use</span> the <span class=\"hljs-keyword\">distributed</span> <span class=\"hljs-keyword\">mode</span> <span class=\"hljs-keyword\">found</span> <span class=\"hljs-keyword\">in</span> the linked README above.</pre><p>Install Zookeeper </p>\n<pre>kubectl create <span class=\"hljs-_\">-f</span> zookeeper.yaml\n// thanks https://github.com/vitillo/minipipe</pre><p>Install Apache Drill </p>\n<pre>kubectl create <span class=\"hljs-_\">-f</span> drill.yaml</pre><h4>Final Steps</h4>\n<ol>\n<li>Add the bank-data.csv file to Minio</li>\n<li>Edit and enable s3 in Apache Drill Admin Console</li>\n<li>Query File </li>\n</ol>\n<ul>\n<li>To add the CSV to Minio, you need to port-forward your minio pod <code>9000:9000</code>, create a bucket named "data" and add the bank-data.csv file from the repo. </li>\n</ul>\n<p><img src=\"https://i.imgur.com/2Pa9hUT.gif\"></p>\n<ul>\n<li>To edit and enable s3. You must port forward your Drill pod <code>8047:8047</code> and add the s3.json config to the s3 storage service.</li>\n</ul>\n<p><img src=\"https://imgur.com/ZUuWj0z.gif\"></p>\n<ul>\n<li>Use the Drillbit interface to query the file</li>\n</ul>\n<pre><span class=\"hljs-keyword\">select</span> * <span class=\"hljs-keyword\">from</span> s3.<span class=\"hljs-symbol\">`bank-data.csv`</span>;</pre><p><img src=\"https://imgur.com/Y7sQ0pB.gif\"></p>\n<h4>Bonus</h4>\n<p>You can get the JDBC or ODBC connection for Drill by following the intrusctions <a href=\"https://drill.apache.org/docs/using-the-jdbc-driver/\">here</a>. </p>\n"
},
{
"component": "footer",
"links": [
{
"href": "https://github.com/josep2/Jathena",
"text": "GitHub"
},
{
"href": "https://github.com/josep2",
"text": "josep2"
}
]
}
]
}