Global Research Trends, Hotspots and Collaborative Networks in Brain-Derived Extracellular Vesicles: A Multi-Database Bibliometric Analysis
Authors:
This repository contains a comprehensive Python-based Jupyter Notebook (PGI-RV-02.ipynb) designed by Adhish Mazumder for the PregaMind EV-Omics Lab at PGIMER, Chandigarh to perform advanced, multi-database bibliometric analysis. By aggregating, cleaning, and curating publication data from PubMed, Scopus, and Web of Science, this pipeline extracts deep insights into the global research trends, institutional collaborative networks, and emerging thematic hotspots within the study of Brain-Derived Extracellular Vesicles (bdEVs).
The notebook is modularized into several robust data processing and visualization pipelines:
Automates the loading of raw data from PubMed, Scopus, and Web of Science.
- Cleans, parses, and normalizes records into a single curated dataset.
Evaluates and plots annual and cumulative publication growth.
Generates geographic density maps (chloropleth) of global publication distribution using GeoPandas and pycountry.
- Constructs country collaboration networks with curved edges and strength-based color mapping.
Ranks top authors by publication count, total citations, and H-index.
- Builds co-authorship network graphs utilizing Louvain community detection clustering to group frequent collaborators.
- Generates bubble scatter plots to track top authors' production over time.
Identifies top institutions and maps their collaborations.
- Uses T-SNE dimensionality reduction to create a Density Heatmap of Institutional Hotspots.
- Creates Three-field Connection plots mapping the flow from Institutions → Research Areas → Countries.
Evaluates core journals by citation averages, records, and H-index.
- Generates Top 100 Journal Co-occurrence Networks using concentric circle layouts based on weighted degree.
Standardizes and normalizes variable keyword spellings (e.g., evs, bdevs, exosomes, miRNAs).
- Generates Burst Detection Heatmaps to identify keywords trending over time.
- Constructs a Strategic Quadrant Diagram (Motor, Niche, Emerging/Declining, and Basic Themes) leveraging centrality and density metrics.
- Plots keyword co-occurrence networks bound by convex hulls to visualize distinct thematic clusters.
Ensure you have Python 3.8+ installed. The script relies on several core data science, NLP, and network analysis libraries. You can install the required dependencies using pip:
pip install pandas numpy matplotlib seaborn plotly networkx python-louvain scikit-learn nltk wordcloud geopandas pycountry fsspec adjustText scipy
(Note: NLTK requires the punkt and stopwords datasets, which the script will automatically attempt to download upon its first run).
git clone https://github.com/AdhishMazumder/PGI-RV-02.git
cd PGI-RV-02
Ensure your raw CSV/XLS data and curated Excel files are placed in the exact directory paths referenced above.
Launch Jupyter and open the pipeline:
jupyter notebook PGI-RV-02.ipynb
Run the cells sequentially. The script will automatically compute the metrics, cluster the networks, and save the resulting tables and figures.
The script automatically exports results into the 2. Analysis/2.3. Bibliometric Analysis/ subfolders:
- Data Tables: Exported as .csv files (e.g., top countries, authors, journals, and keywords summaries).
- Visualizations: Exported in multiple publication-ready formats including PNG, SVG, TIFF, and PDF at an ultra-high resolution of 1000 DPI.
If you use this code or pipeline in your research, please cite the associated publication:
APA Format:
Mazumder, A., Dey, S. & Mitra, P. (2026). Global research trends, hotspots and collaborative networks in brain-derived extracellular vesicles: A multi-database Bibliometric Analysis. Neuroinformatics. [https://doi.org/10.1007/s12021-026-09774-5]
BibTeX:
@article{mazumder2026bdevs,
title={Global research trends, hotspots and collaborative networks in brain-derived extracellular vesicles: A multi-database bibliometric analysis},
author={Mazumder, Adhish and Dey, Shubhankhi and Mitra, Prasenjit},
journal={Neuroinformatics},
year={2026},
doi={10.1007/s12021-026-09774-5}
}