Projects

A collection of my programming, machine learning, and research projects. These range from early experiments with language tools to published work in computational pathology and natural language processing.


Microscopy-Based Taxol Concentration Classification (2024–2025)

  • Collaborated with the University of Southern Maine Biology Department to establish experimental parameters for imaging C6 glioma cells exposed to multiple Taxol concentrations.
  • Advised on data collection protocols (image resolution, labeling, file organization) to support downstream computational analysis; the Biology team generated the dataset.
  • Designed a ResNet-50 baseline model with CBAM attention modules and k-NN classification for the resulting dataset.
  • arXiv Paper

CLEF 2023 SimpleText — Task 1 (2023)

  • Built multilingual retrieval systems combining keyword-based methods, web scraping, and neural rerankers, with ranked lists merged via Reciprocal Rank Fusion.
  • Publication PDF
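
For readers unfamiliar with Reciprocal Rank Fusion, here is a minimal sketch of the general technique, not the code used in the submission. Each document scores the sum of 1 / (k + rank) over every ranked list it appears in. The function name, the document IDs, and the constant k = 60 (a commonly used default) are assumptions chosen for illustration.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of document IDs into a single ranking.

    rankings: list of lists, each ordered from best to worst.
    k: smoothing constant (60 is a common default, assumed here).
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties broken by ID so output is deterministic.
    return sorted(scores, key=lambda d: (-scores[d], d))


# Hypothetical runs from a keyword system and a neural reranker.
keyword_run = ["d1", "d2", "d3"]
neural_run = ["d2", "d3", "d1"]
fused = reciprocal_rank_fusion([keyword_run, neural_run])
print(fused)
```

Because only ranks enter the formula, the fused systems do not need comparable score scales, which is convenient when mixing keyword and neural scores.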

arXiv Metadata Database (Fall 2023)

  • Scraped and cleaned metadata from arXiv using BeautifulSoup.
  • Designed ER diagrams and implemented a normalized SQLite database.
  • GitHub Repo
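
As a rough illustration of what a normalized layout for paper metadata can look like, the sketch below separates papers, authors, and the many-to-many link between them. The table names, column names, and sample rows are hypothetical and are not taken from the repository's actual schema.

```python
import sqlite3

# Hypothetical schema: one row per paper, one row per author,
# and a junction table for the many-to-many relationship.
SCHEMA = """
CREATE TABLE paper (
    arxiv_id  TEXT PRIMARY KEY,
    title     TEXT NOT NULL,
    submitted TEXT
);
CREATE TABLE author (
    author_id INTEGER PRIMARY KEY,
    name      TEXT NOT NULL UNIQUE
);
CREATE TABLE paper_author (
    arxiv_id  TEXT NOT NULL REFERENCES paper(arxiv_id),
    author_id INTEGER NOT NULL REFERENCES author(author_id),
    PRIMARY KEY (arxiv_id, author_id)
);
"""


def build_db(path=":memory:"):
    """Create the database and return an open connection."""
    conn = sqlite3.connect(path)
    conn.execute("PRAGMA foreign_keys = ON")  # SQLite leaves this off by default
    conn.executescript(SCHEMA)
    return conn


def add_paper(conn, arxiv_id, title, submitted, authors):
    """Insert a paper and link it to its authors, reusing existing author rows."""
    conn.execute(
        "INSERT INTO paper VALUES (?, ?, ?)", (arxiv_id, title, submitted)
    )
    for name in authors:
        conn.execute("INSERT OR IGNORE INTO author (name) VALUES (?)", (name,))
        (author_id,) = conn.execute(
            "SELECT author_id FROM author WHERE name = ?", (name,)
        ).fetchone()
        conn.execute(
            "INSERT INTO paper_author VALUES (?, ?)", (arxiv_id, author_id)
        )
    conn.commit()


# Placeholder records, not real arXiv entries.
conn = build_db()
add_paper(conn, "0000.00001", "Example Paper A", "2023-01-01", ["A. Author", "B. Author"])
add_paper(conn, "0000.00002", "Example Paper B", "2023-02-01", ["A. Author"])
author_count = conn.execute("SELECT COUNT(*) FROM author").fetchone()[0]
link_count = conn.execute("SELECT COUNT(*) FROM paper_author").fetchone()[0]
print(author_count, link_count)
```

Storing each author once and linking through a junction table avoids repeating author names on every paper row, which is the main point of normalizing this kind of data.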

Diabetes Prediction Pipeline (Fall 2023)

  • Built an end-to-end ML workflow using NHANES health survey data.
  • Applied data cleaning, visualization, and feature engineering in pandas and seaborn.
  • Trained and evaluated multiple models in scikit-learn.
  • GitHub Repo

Syllable Counter (2020)

  • Interactive tool to count syllables in words and phrases.
  • Combined Python string methods with phonetic rules to estimate syllable counts.
  • Backend deployed on AWS Lambda with GitHub-hosted frontend.
  • Live Demo
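
To give a sense of how a rule-based estimate of this kind can work, here is a simplified sketch that counts vowel groups and adjusts for a silent final "e". It is an illustrative approximation, not the tool's actual rule set, and the function names are hypothetical. Heuristics like this miscount some words (for example "whole" or "rhythm").

```python
import re


def estimate_syllables(word):
    """Estimate the syllable count of one English word with simple rules."""
    word = re.sub(r"[^a-z]", "", word.lower())  # drop punctuation and digits
    if not word:
        return 0
    # Each run of consecutive vowels (y included) counts as one syllable.
    count = len(re.findall(r"[aeiouy]+", word))
    # A trailing "e" is usually silent ("make"), except in "-le" ("table").
    if word.endswith("e") and not word.endswith("le") and count > 1:
        count -= 1
    return max(count, 1)


def count_phrase(phrase):
    """Sum the estimates over whitespace-separated words."""
    return sum(estimate_syllables(w) for w in phrase.split())


print(estimate_syllables("table"))
print(count_phrase("an old silent pond"))
```

A dictionary lookup would be more accurate for known words; rules like these are mainly useful as a fallback for words a dictionary does not cover.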

Haiku Generator (2020)

  • Automated poetry generator that composes haikus from user prompts.
  • Integrated the Datamuse API, a custom syllable counter, and English syntax rules.
  • Backend deployed on AWS Lambda with GitHub-hosted frontend.
  • Live Demo