Why Hadoop is a breakthrough tool, with examples of how you can use it to transform, simplify, contextualize, and organize data.

  • distributes the data

  • context (group)

  • matching (cogroup / join) *

  • coordinates to grid cells

  • group on location

  • count articles

  • wordbag

  • join wordbags to coordinates

  • sum counts
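
The steps in the outline above can be sketched on a single machine in plain Python. This is only an illustration of the data flow, not Hadoop code: the sample records, the one-degree cell size, and the helper names (`grid_cell`, `wordbag`, `group_by_cell`, `count_articles`, `cell_wordbags`) are all assumptions made for the example. On a cluster, the grouping and joining would be done by the framework's shuffle rather than by in-memory dictionaries.

```python
import math
from collections import Counter, defaultdict

# Hypothetical sample records: (article_id, longitude, latitude, text).
ARTICLES = [
    ("a1", -97.74, 30.27, "austin music festival music"),
    ("a2", -97.70, 30.30, "austin tacos"),
    ("a3", 2.35, 48.86, "paris museum"),
]


def grid_cell(lng, lat, cell_size=1.0):
    """Coordinates to grid cells: return the (column, row) index of the cell."""
    return (math.floor(lng / cell_size), math.floor(lat / cell_size))


def wordbag(text):
    """Wordbag: count how often each lowercased word occurs in the text."""
    return Counter(text.lower().split())


def group_by_cell(articles, cell_size=1.0):
    """Group on location: collect the article ids that fall in each cell."""
    groups = defaultdict(list)
    for article_id, lng, lat, _text in articles:
        groups[grid_cell(lng, lat, cell_size)].append(article_id)
    return dict(groups)


def count_articles(groups):
    """Count articles: number of articles in each cell."""
    return {cell: len(ids) for cell, ids in groups.items()}


def cell_wordbags(articles, cell_size=1.0):
    """Join wordbags to coordinates on article id, then sum counts per cell."""
    bags = {aid: wordbag(text) for aid, _lng, _lat, text in articles}
    cells = {aid: grid_cell(lng, lat, cell_size)
             for aid, lng, lat, _text in articles}
    totals = defaultdict(Counter)
    # Inner join: keep only article ids present on both sides.
    for aid in bags.keys() & cells.keys():
        totals[cells[aid]].update(bags[aid])  # Counter.update adds counts
    return dict(totals)


if __name__ == "__main__":
    print(count_articles(group_by_cell(ARTICLES)))
    for cell, bag in sorted(cell_wordbags(ARTICLES).items()):
        print(cell, dict(bag))
```

Keying every step on the grid cell is what lets the work be distributed: records for the same cell can be routed to the same worker, so the per-cell counts and sums never need to see the whole dataset at once.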