-
targeted-bioactivity-analysis: This Jupyter Notebook performs a bioactivity and drug-likeness analysis on compounds targeting the SARS coronavirus 3C-like proteinase (ChEMBL3927) using RDKit, ChEMBL API, and statistical analysis.
-
SpatialTranscriptomics-ML: This project implements machine learning models to classify spatial transcriptomics data. The code leverages popular libraries like Scanpy, Squidpy, PyTorch, and PyTorch Geometric for graph learning, aiming to explore and compare deep learning approaches on spatially resolved transcriptomics data.
-
cross-species-genomics-ml: This Jupyter Notebook implements a DNA sequence classification pipeline using k-mer tokenization and Naive Bayes models, and compares cross-species classification performance between human, chimpanzee, and dog DNA.
-
tableau-clinical-trials-dashboard: A data visualization that explores clinical trial metrics using data from ClinicalTrials.gov.
-
llm-ngs-qc-parser: This project implements a pipeline to parse Next-Generation Sequencing (NGS) quality control (QC) reports, specifically focusing on Bioanalyzer PDF outputs. It uses LLMs to extract sample information and quality metrics from PDF files and outputs structured data in JSON and CSV formats.
sumeetg23/data-sandbox
Folders and files
| Name | Name | Last commit date | ||
|---|---|---|---|---|