This project contains a comprehensive analysis of Undergraduate Research Experience (URE) coding data exported from NVivo. The analysis includes statistical summaries, visualizations, and detailed breakdowns of coding patterns across research articles.
analysis.ipynb- Jupyter notebook containing all analysis code and visualizationscoding matrix.csv- Original coding matrix exported from NVivo
comprehensive_coding_analysis.xlsx- Main analysis file with 4 sheets:- Articles_with_Codes: Detailed article analysis with metadata
- Codes_with_Articles: Code frequency and article associations
- Code_Matrix: Full cross-tabulation matrix
- Summary_Stats: Overall statistics
articles_with_codes_and_metadata.csv- Articles analysis with extracted metadatacodes_with_article_details.csv- Codes analysis with frequency statisticscoding_matrix_crosstab.csv- Cross-tabulation matrixarticle_summary.csv- Basic article summarycode_summary.csv- Basic code summary
coding_analysis_visualizations.png- Comprehensive 8-panel visualization dashboarddetailed_coding_analysis.png- Code co-occurrence and usage pattern analysis
- 335 articles analyzed
- 81 different codes identified
- Average 8.4 codes per article
- 2,818 total code applications
- Program type (91.94%)
- Primary Institutional Context (91.64%)
- STEM (80.0%)
- 4-Year University (63.28%)
- High frequency (>50% of articles): 4 codes
- Medium frequency (10-50% of articles): 22 codes
- Low frequency (<10% of articles): 55 codes
-
Ensure you have the required Python packages installed:
pip install pandas numpy matplotlib seaborn openpyxl
-
Open
analysis.ipynbin Jupyter Notebook or VS Code -
Run all cells to:
- Load and analyze the coding matrix
- Generate comprehensive spreadsheets
- Create visualizations
- Export summary files
- Modify code groupings in the visualization functions
- Add external metadata by following the instructions in the notebook
- Adjust visualization parameters for different outputs
- Python 3.7+
- pandas
- numpy
- matplotlib
- seaborn
- openpyxl (for Excel file generation)
Analysis conducted as part of URE research project.
Generated: September 2025