I’m an MSc Bioinformatics student interested in human genetics, rare disease and machine learning for genomics. Long term, I’d like to work as a bioinformatician in the pharmaceutical / biotech industry, building tools that help with target discovery, biomarker development and patient stratification.
- Human genetics and rare disease
- Single-cell and bulk RNA-seq analysis
- Variant interpretation and clinical genomics
- Applying machine learning / NLP to biological data
- Languages / tools: Python, R, Git, Linux
- Python: pandas, scikit-learn, Scanpy, matplotlib
- R / Bioconductor: DESeq2, clusterProfiler
- Domains: scRNA-seq, RNA-seq, variant analysis, phenotype–disease matching
On GitHub I tend to build small, reproducible analysis pipelines around:
- single-cell RNA-seq (cell type classification / label transfer),
- bulk RNA-seq differential expression and pathway analysis,
- phenotype–disease matching and LLM-style reasoning over clinical text,
- variant-level modelling and pathogenicity prediction.
As my MSc project grows, I’ll keep using this space to collect the kinds of workflows and experiments I enjoy working on most, especially those that sit at the intersection of bioinformatics, statistics and translational genomics in pharma.