Projects
A collection of my programming, machine learning, and research projects.
These range from early experiments with language tools to published work in computational pathology and natural language processing.
Microscopy-Based Taxol Concentration Classification (2024–2025)
- Collaborated with the University of Southern Maine Biology Department to establish experimental parameters for imaging C6 glioma cells exposed to multiple Taxol concentrations.
- Advised on data collection protocols (image resolution, labeling, file organization) to support downstream computational analysis; dataset generation conducted by the Biology team.
- Designed a ResNet-50 baseline model with CBAM attention modules and k-NN classification for the resulting dataset.
- arXiv Paper
CLEF 2023 SimpleText — Task 1 (2023)
- Built multilingual retrieval systems with keyword-based methods, web scraping, and neural rerankers with Reciprocal Rank Fusion.
- Publication PDF
arXiv Metadata Database (Fall 2023)
- Scraped and cleaned metadata from arXiv using BeautifulSoup.
- Designed ER diagrams and implemented a normalized SQLite database.
- GitHub Repo
Diabetes Prediction Pipeline (Fall 2023)
- Built an end-to-end ML workflow using NHANES health survey data.
- Applied data cleaning, visualization, and feature engineering in pandas and seaborn.
- Trained and evaluated multiple models in scikit-learn.
- GitHub Repo
Syllable Counter (2020)
- Interactive tool to count syllables in words and phrases.
- Combined Python string methods with phonetic rules to estimate syllable counts.
- Backend deployed on AWS Lambda with GitHub-hosted frontend.
- Live Demo
Haiku Generator (2020)
- Automated poetry generator that composes haikus from user prompts.
- Integrated the Datamuse API, a custom syllable counter, and English syntax rules.
- Backend deployed on AWS Lambda with GitHub-hosted frontend.
- Live Demo