☑️ Use raw tf-idf algorithm to apply on document-term matrix. Look at the subreddit, and check the highest words. Then apply LDA on them. Pick the number of topics interested. Included the week 3's results.
☑️ Associate subreddits with topics using threshold like 10%.
☑️ Pick the top 5% of the heavy-comment user.
☑️ Use raw tf-idf algorithm to apply on document-term matrix. Look at the subreddit, and check the highest words. Then apply LDA on them. Pick the number of topics interested. Included the week 3's results.
☑️ Associate subreddits with topics using threshold like 10%.
☑️ Pick the top 5% of the heavy-comment user.