READ CloudComputingProposal, CloudComputingReport and download Powerpoint Presentation for more detailed information.
(VIEW RAW)
Python files are the important parts of code written for the system:
main.py is for front end django application
getpage.py is polling script that listens for requests from front end
runboto.py makes requests to EMR for handling Job processes
indexmap.py is mapping part of the plagirism algorithm
indexreduce.py is reducing part of the algorithm