πͺπΈ Leer en EspaΓ±ol Β |Β π¬π§ Read in English
High-precision bioinformatics platform for microRNA research and biomarker discovery
Genomic Convergence Β Β·Β PubMed Evidence Β Β·Β Functional Enrichment Β Β·Β Automated Scientific Reporting
Overview Β Β·Β Analysis Flow Β Β·Β Features Β Β·Β Tech Stack Β Β·Β Databases Β Β·Β Installation Β Β·Β Roadmap
KENRYU is for Research Use Only (RUO). It is not a substitute for professional medical judgment, individualized patient evaluation, local clinical guidelines, or review by a qualified healthcare professional. Evidence comes from public databases (NCBI, miRTarBase, TargetScan, STRING-DB) and must be rigorously verified before any therapeutic or research intervention. This platform does not store or process PII or PHI.
MicroRNAs (miRNAs) are post-transcriptional regulators that control the expression of complex gene networks via targeted silencing. Identifying which genes are simultaneously regulated by multiple miRNAs is fundamental to discovering critical nodes in pathologies such as cancer, cardiovascular diseases, neurodegeneration, and metabolic disorders.
KENRYU automates the entire discovery workflow in one platform:
| Step | Action | Sources |
|---|---|---|
| 1 | π Integrate thermodynamic predictions + experimental validation | TargetScan Β· miRTarBase |
| 2 | π― Identify dynamic Venn intersection of core regulated genes | Local DB + Harmonizome |
| 3 | 𧬠Enrich targets functionally with balancing algorithm | KEGG · Reactome · WikiPathways · GO |
| 4 | π‘ Investigate each core gene across 4 sources in parallel | PubMed Β· OMIM Β· ClinVar Β· ClinicalTrials |
| 5 | π Generate editable academic report with Vancouver bibliography | PDF Β· Markdown ZIP Β· TXT |
Unlike traditional pipelines, KENRYU uses a search cascade with fallback (strict β expanded β minimal) and real-time ESβEN translation to overcome PubMed's limitations with non-English queries. Fully optimized for Hugging Face Spaces with robust NCBI eUtils rate-limit handling.
Click on each phase to explore the underlying bioinformatics logic.
| π₯ 01. INPUT | π 02. COLLECTION | π― 03. VENN CORE |
|---|---|---|
| miRNA selection | Multi-source retrieval | Genomic convergence |
𧬠STEP 01 β Input & Parameters (Click to expand)
The Start: User enters N miRNAs in standard nomenclature.
- Dynamic Filtering: Define the year window for PubMed evidence.
- Validation: Automatic name cleaning for cross-database compatibility.
π STEP 02 β Multi-Source Collection (Click to expand)
The Engine: Parallel querying of high-fidelity databases.
- TargetScan 8.0: Thermodynamic binding predictions (Local Database).
- miRTarBase: Gold-standard experimental validation via Harmonizome (Remote API).
π STEP 03 β Genomic Convergence (Click to expand)
The Logic: Mathematical intersection of regulatory networks.
- Strict Mode: Only genes common to ALL miRNAs.
- N-1 / N-2: Robust consensus for broader discoveries.
| 𧬠04. ENRICHMENT | π‘ 05. RESEARCH | π 06. REPORT |
|---|---|---|
| Pathway analysis | Multi-source evidence | Professional output |
π STEP 04 β Functional Enrichment (Click to expand)
The Context: Identifying biological impact.
- Databases: KEGG, Reactome, WikiPathways, and GO.
- Evidence: Automated PubMed cross-referencing for every significant pathway found.
π STEP 05 β Multi-Source Research (Click to expand)
The Evidence: Consolidating real-world scientific data.
- Live queries to ClinVar (variants) and OMIM (disorders).
- ClinicalTrials.gov integration for human-centric research insights.
β¨ STEP 06 β Scientific Reporting (Click to expand)
The Result: Professional-grade academic reports.
- Vancouver Style: Auto-generated citations for all integrated sources.
- Multi-Format: Export to PDF, Markdown ZIP with assets, or raw TXT.
| Mode | Description | Use Case |
|---|---|---|
| Strict | Genes regulated by ALL miRNAs in the panel | High-confidence core targets |
| N-1 | Genes regulated by at least N-1 miRNAs | Robust regulatory networks |
| N-2 | Genes regulated by at least N-2 miRNAs | Broader pathway exploration |
- Dynamic Convergence β Venn intersection of N miRNAs, no hardcoded limit on core gene count
- Balanced Functional Enrichment β fairness algorithm across KEGG, Reactome, WikiPathways, and GO Biological Process
- Volcano plot Β· Venn diagram Β· STRING-DB interactome β generated on-the-fly per analysis
| Source | What it provides |
|---|---|
| PubMed | Scientific articles per gene + biological context, year-filtered with cascade fallback |
| OMIM | Mendelian Inheritance in Man β associated hereditary diseases |
| ClinVar | Reported pathogenic / likely pathogenic variant count |
| ClinicalTrials.gov | Active clinical trials where the gene symbol appears in the title |
- 429-aware Retries β automatic NCBI eUtils rate-limit handling with exponential backoff
- Persistent Cache β
local_db/analysis_cache.jsonstores translations, PubMed results, and enrichment across sessions - ESβEN Mapping β auto-translates biological terms from Spanish presets for effective PubMed queries
- Filter Cascade β strict β expanded β minimal year window fallback
- Professional A4 Editor β WYSIWYG preview with smart height-based pagination
- Vancouver Bibliography β
[N] Authors. Title. Journal. Year. PMID: X. URL - Multi-type Citations β PMID / OMIM / ClinVar / NCT with correct identifier per source
- PDF Export β via
window.print()with optimized@media printCSS, footer anchored to each A4 page - Markdown ZIP Export β
Report.md+/assets/PNGs +README.mdβ Pandoc/Obsidian/VSCode compatible
Main interface β Volcano plot, Venn diagram, and STRING-DB interactome generated live. (Note: The clinical interface displays analytical metrics and parameter presets in Spanish for localized environments).
Bioinformatics charts β Functional enrichment visualization
Gene sidebars β Quick access to OMIM, ClinVar, and trial data
Research results β PubMed Β· OMIM Β· ClinVar Β· ClinicalTrials consolidated
Narrative report β Academic synthesis with Vancouver bibliography. The automated export functions render structured clinical narratives, cross-referenced literature insights, and citation indices.
βββββββββββββββββββββββββββββββββββ
β Frontend (HTML / JS / CSS) β
β Β· Paged A4 WYSIWYG Editor β
β Β· PDF Β· Markdown ZIP Β· TXT β
ββββββββββββββ¬βββββββββββββββββββββ
β POST /api/v1/analyze
βΌ
βββββββββββββββββββββββββββββββββββ
β FastAPI Backend β kenryu_engineβ
β Β· Venn Intersection Engine β
β Β· Functional Enrichment β
β Β· Multi-source Research β
ββββββββββββββ¬βββββββββββββββββββββ
β
ββββββββββββ¬βββββββββββββββΌββββββββββββββββ¬βββββββββββββββββββ
βΌ βΌ βΌ βΌ βΌ
ββββββββββββ ββββββββββββ βββββββββββ βββββββββββββββ ββββββββββββββββββ
βTargetScanβ βmiRTarBaseβ β Enrichr β βNCBI PubMed β β OMIM Β· ClinVar β
β v8 local β β(Harmoniz)β βKEGG/GOβ¦ β β eUtils β β ClinicalTrialsβ
ββββββββββββ ββββββββββββ βββββββββββ βββββββββββββββ ββββββββββββββββββ
β
βΌ
βββββββββββββββββββββββββββββββββββ
β Persistent Cache β
β local_db/analysis_cache.json β
βββββββββββββββββββββββββββββββββββ
|
Backend |
Frontend |
Deployment |
| Source | Function | Type |
|---|---|---|
| 𧬠TargetScanHuman v8.0 | Thermodynamic prediction of binding sites | Local (zip) |
| π¬ miRTarBase via Harmonizome | Experimental interaction validation (CLIP-seq, Luciferase, WB) | Remote API |
| π Enrichr | Functional enrichment β KEGG / Reactome / WikiPathways / GO | Remote API |
| π NCBI PubMed eUtils | Scientific articles per gene + context with cascade + retries | Remote API |
| π₯ NCBI OMIM | Associated hereditary diseases by gene symbol | Remote API |
| π§ͺ NCBI ClinVar | Reported pathogenic / likely pathogenic variants | Remote API |
| π ClinicalTrials.gov v2 | Active clinical trials filtered by gene symbol in title | Remote API |
| π STRING-DB | Protein-protein interaction network generation | Remote API |
| π MyGene.info | Clinical gene annotation | Remote API |
| π€ MyMemory API | Real-time ESβEN biological term translation | Remote API |
- Python 3.11+
- pip
- Docker (optional β for containerized deployment)
# 1. Clone the repository
git clone https://github.com/abrangel/Kenryu.git
cd Kenryu
# 2. Create virtual environment (recommended)
python3.11 -m venv venv
source venv/bin/activate # Linux / macOS
# venv\Scripts\activate # Windows
# 3. Install dependencies
pip install -r requirements.txt
# 4. Start the server
uvicorn kenryu_engine:app --host 0.0.0.0 --port 7860 --reload
# 5. Open in browser β http://localhost:7860docker build -t kenryu .
docker run -p 7860:7860 -v $(pwd)/local_db:/app/local_db kenryuThe
local_dbvolume persists the analysis cache across container restarts.
The repo is fully ready for HF Spaces β the Dockerfile exposes port 7860 and mounts local_db for persistent caching. Simply connect the Space to this GitHub repository.
- Open production Space or
http://localhost:7860 - Enter miRNAs separated by commas:
hsa-miR-33a-5p, hsa-miR-144-3p, hsa-miR-758-3p - Select year cutoff (PubMed evidence age filter)
- Select consensus mode (Strict / N-1 / N-2)
- Click Execute
curl -X POST "http://localhost:7860/api/v1/analyze" \
-H "Content-Type: application/json" \
-d '{
"mirnas": ["hsa-miR-33a-5p", "hsa-miR-144-3p", "hsa-miR-758-3p"],
"years": 10,
"mode": "strict"
}'π Example JSON Response
{
"common_genes": ["ABCA1", "KPNA3", "SCN1A"],
"gene_research": {
"ABCA1": {
"pubmed": [{"pmid": "...", "term": "...", "year_window": "10y"}],
"omim": [{"omim_id": "600046", "title": "..."}],
"clinvar": {"count": 201, "url": "..."},
"trials": [{"nct_id": "NCT01456650", "title": "..."}]
}
},
"enrichment": ["..."],
"scientific_synthesis": "...",
"venn_plot": "data:image/png;base64,...",
"volcano_plot": "data:image/png;base64,...",
"ppi_plot": "data:image/png;base64,..."
}Kenryu/
βββ kenryu_engine.py # FastAPI backend + bioinformatics logic (~1700 lines)
βββ Dockerfile # HF Spaces / Docker configuration
βββ requirements.txt
β
βββ static/
β βββ index.html # A4 editor main interface
β βββ script.js # Frontend logic (pagination, PDF/MD export)
β βββ style.css # Dark mode Β· gold/teal accents
β
βββ data/
β βββ targetscan_full.json.zip # TargetScan v8.0 indexed DB (Git LFS, ~8 MB)
β βββ hsa-miR-*.txt # Pre-processed per-miRNA example files
β
βββ local_db/ # Runtime cache (auto-created, not in repo)
βββ analysis_cache.json
- Dynamic Venn convergence β Strict / N-1 / N-2 modes
- Balanced functional enrichment β KEGG Β· Reactome Β· WikiPathways Β· GO
- Visualizations β Venn Β· Volcano Β· STRING-DB interactome
- Vancouver-style bibliography with multi-source citation types
- Multi-source research β PubMed Β· OMIM Β· ClinVar Β· ClinicalTrials
- Versioned persistent cache across sessions
- 429-aware retries with exponential backoff for NCBI eUtils
- ESβEN mapping for effective PubMed queries
- Professional Markdown ZIP export with separate image assets
- A4 PDF footer correctly anchored across all pages
- Integrated genomic evidence narrative in report body
- Support for miRNA isomers (isomiRs)
- DisGeNET integration for gene-disease association
- Structured JSON export for interoperability with other tools
- Side-by-side miRNA panel comparison
- Cross-expression heatmap (genes Γ miRNAs)
- Batch mode β process multiple panels from CSV
- TargetScan v9.0 integration when available
Contributions are welcome. To report bugs, suggest features, or submit pull requests:
- Fork the repository
- Create a branch:
git checkout -b feat/feature-name - Commit with descriptive messages following conventions below
- Push:
git push origin feat/feature-name - Open a Pull Request
Commit conventions: feat: Β· fix: Β· docs: Β· refactor: Β· chore:
When reporting bugs please include: steps to reproduce, expected vs actual behavior, browser console logs (frontend errors), and HF Space logs (backend errors).
This project is distributed under a Free Academic License for Research and Education. For commercial use, contact the author.
KENRYU integrates data from public sources (NCBI, Enrichr, OMIM, ClinVar, ClinicalTrials.gov, STRING-DB) subject to their respective terms of service. Users are responsible for complying with each database's policies.
Cesar Manzo β Clinical Bioinformatics Β· Genomic Analysis Β· Translational Medicine
- TargetScanHuman β Lewis Lab, Whitehead Institute
- miRTarBase β Chou et al., Nucleic Acids Research
- NCBI β National Center for Biotechnology Information
- Enrichr β Ma'ayan Lab, Mount Sinai
- ClinicalTrials.gov β U.S. National Library of Medicine
- STRING-DB β European Molecular Biology Laboratory