Skip to content

mari1647iv/sentiment-analyzer

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

22 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

sentiment analyzer

Github comments sentiment anlysis via NLTK(www.text-processing.com) and GitHub API

The code also could be found on

Proccess

  1. We extract the github comments from repository via Github API.
  2. We prerocess the comments using NLTK toolkit since they have open API that is comfortable to use.
  3. NLTK returns the probablity of the comment being positive, negative or neutral. The label is assigned for the further model precision measurement.
  4. We train and tune our model on the 70% of the comments dataset and test it against the remaining 30%.
  5. We gather the sentiment metrics from the dataset with issue-structured comments and analyze the results.

Results

We analyzed ~15k comments. The summary is presented below.

Resolved vs Open issues

image image

Fraction of resoled issues within each sentiment

image image image

Time and length of discussion per sentiment

image image

About

Github comments sentiment analysis via NLTK (www.text-processing.com) and GitHub API

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors