GitHub - nkher/YelpDataSetChallenge-2015: Measuring social impact of a friends network on itself

Yelp Data Set Challenge

Measuring the impact of a users ego network on itself and its overall impact on a business

This repository contains the code to our solution to the yelp data set challenge for 2015. We have two components in our solution which are

Personalized Page Rank Component
Feature Formation Component
Data Analysis and Building Statistical Model Component

The first component is used for calculating personalized page rank values for yelp users which we use as a feature for data analysis. We leveraged the power of the open source implementation of map reduce that is Hadoop for calculating user pageranks.

The second component are a set of standalone java files that we use for a variety of tasks which are cleaning our data, studying our data by performing preliminary analysis, and also for feature formation. We use MongoDB as the backend non relational store where our data sits.

The third component is building statistical models on our final dataset and performing some data analysis to get some cool findings. We make use of the available R platform to build our models.

More detailed information about each of the component could be found in the individual md files of each component.

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
FeatureFormationComponent		FeatureFormationComponent
PersonalizedPageRankComponent		PersonalizedPageRankComponent
StatisticalModelling&AnalysisComponent		StatisticalModelling&AnalysisComponent
Website		Website
.DS_Store		.DS_Store
README.md		README.md
Technical Paper.pdf		Technical Paper.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Yelp Data Set Challenge

Measuring the impact of a users ego network on itself and its overall impact on a business

About

Uh oh!

Releases

Packages

Languages

nkher/YelpDataSetChallenge-2015

Folders and files

Latest commit

History

Repository files navigation

Yelp Data Set Challenge

Measuring the impact of a users ego network on itself and its overall impact on a business

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages