GitHub - amod0017/InvertedIndexWithClient-Server: Inverted Index Implementation in Hadoop to make document searching for e-commerce website easier.

#Inverted Index with Hadoop Advance Operating System: Implementation of Inverted Index using HADOOP

Functional Requirement:
As a user using this software one should be able to find all the documents which are present in HDFS (HADOOP Distributed Files System) containing a particular word. User will be provided with a GUI which should contain text field where he will enter the word to be searched and search button. When search button is clicked user should be able to get the entire documents name which contains the particular word entered by the user. For searching inverted index algorithm must be used. On the server side user will be the server admin. Server admin will be able to trigger inverted index algorithm whenever needed. Also this algorithm should be run every hour in the system for the new files added and updated.
Non Functional Requirement:
• User should be able to run this software from wherever possible, that means it is not necessary that client will be on the same system where HDFS is installed.
• MapReduce should be used.
• Client should be platform independent. Hence user should able to use the software in both windows and linux based platform.
• Code should be written following the clean code principals, however JUNITS are optional and can be written if time permits.
• Every module should be separately tested before performing the integration testing.
• Software should at least work on single node cluster of HADOOP.
• A proper dataset should be for testing.
Software Requirement:
• JAVA 7
• HADOOP
• MAPREDUCE
Hardware Requirement:
• Standard Ubuntu Machine with 4GB+ RAM and i3 or above processor.

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
.metadata/.plugins		.metadata/.plugins
.recommenders		.recommenders
Client		Client
Common		Common
HadoopData		HadoopData
InvertedIndexClientServer		InvertedIndexClientServer
RemoteSystemsTempFiles		RemoteSystemsTempFiles
Server		Server
.gitignore		.gitignore
Implementation of Inverted Index using HADOOP.docx		Implementation of Inverted Index using HADOOP.docx
README.md		README.md
client.jar		client.jar
common.jar		common.jar
executiontest.jar		executiontest.jar
hadoop-client-2.2.0.jar		hadoop-client-2.2.0.jar
hadoop-common-2.6.4.jar		hadoop-common-2.6.4.jar
hadoop-core-1.2.1.jar		hadoop-core-1.2.1.jar
hadoop-mapreduce-client-core-2.2.0.jar		hadoop-mapreduce-client-core-2.2.0.jar
inverted.jar		inverted.jar
server.jar		server.jar
test.jar		test.jar

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Releases

Packages

Contributors 2

Languages

amod0017/InvertedIndexWithClient-Server

Folders and files

Latest commit

History

Repository files navigation

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages