Skip to content

IslamHisham/Domain_Sampler

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

30 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Domain_Sampler

Get a representative sample of articles for a domain that can be used for further studies like credibility assestment of the domain or any other type of analysis.

The project is made up of two stages:

  1. Size Reduction Stage: where the number of the article is reduced baed on the statistical limited population theory
  2. Topic sampling: A representative sample from each topic is taken to ensure the diversity and representativeness of the sample. We use BERTopic.

About

Get a representative sample of articles for a domain

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages