Skip to content
This repository has been archived by the owner on May 6, 2018. It is now read-only.

Latest commit

 

History

History
145 lines (103 loc) · 2.89 KB

HISTORY.md

File metadata and controls

145 lines (103 loc) · 2.89 KB

Version 2.0.1

February 22, 2015

  • Upgraded to CDH 5.3.0
  • Added HBase word count demo
  • Added graph (BFS and PageRank) integration tests

Version 2.0.0

December 11, 2014

  • Upgraded to CDH 5.1.2
  • Converted project into Maven artifact
  • Removed data structures code: added dependency to lintools-datatypes and lintools-datatypes-fastutil 1.0.0

Version 1.5.0

November 24, 2013

  • Upgraded to JDK7
  • Upgraded to CDH 4.4.0
  • Fixed minor integration/regression issues as a result of upgrade to JDK7

Version 1.4.17

July 6, 2013

  • Upgraded to CDH 4.3.0

Version 1.4.16

June 17, 2013

  • Refactoring of BFS code and added integration tests
  • Created wikipedia.graph package for Wikipedia graph manipulation classes
  • Added class for extracting anchor text from Wikipedia graph

Version 1.4.15

June 9, 2013

  • Improved Wikipedia collection handling
  • Updated Wikipedia collection APIs to new Hadoop API (org.apache.hadoop.mapreduce)

Version 1.4.14

April 30, 2013

  • Upgraded to CDH 4.2.1
  • Changed dependency to Maven artifacts for dsiutils, sux4j, fastutil, and spymemcached (as opposed to local jars)

Version 1.4.13

March 10, 2013

  • Added demo for learning Univariate Gaussian Mixture Models
  • Upgraded to CDH 4.2.0

Version 1.4.12

February 24, 2013

  • Added ability to read index and collection from HDFS in solutions to inverted indexing and boolean retrieval exercises

Version 1.4.11

February 17, 2013

  • More efficient comparators in Writable pairs
  • Refactored PageRank implementations, added proper arg parsing
  • Revamped documentation on reference implementations
  • Fixed broken integration tests

Version 1.4.10

February 6, 2013

  • Refactored cooccurrence matrix example, stripes implementation
  • Added integration test for webgraph

Version 1.4.9

February 1, 2013

  • Fixed bugs in Wikipedia classes

Version 1.4.8

January 31, 2013

  • Refactored cooccurrence matrix examples: pairs and stripes

Version 1.4.7

January 27, 2013

  • Fixed previously corrupt pom.xml

Version 1.4.6

January 25, 2013

  • Updates to Wikipedia code

Version 1.4.5

January 22, 2013

  • Updated documentation to use Bootstrap
  • Updated MapReduce exercises
  • Created word count tutorial

Version 1.4.4

January 14, 2013

  • Code fixes to Hooka
  • YARN-related fix to CombineSequenceFiles

Version 1.4.3

December 31, 2012

  • Fixes to disambiguation page identification in Wikipedia
  • Added a few integration test for sample exercises

Version 1.4.2

December 25, 2012

  • Fixed bug with disambiguation page detection in Wikipedia

Version 1.4.1

December 20, 2012

  • Fixed broken code for working with MEDLINE collection

Version 1.4.0

December 16, 2012

  • Upgrade to YARN (CDH 4.1.2)