Projects
Side projects, tools, and rabbit holes. Some of these are useful. Some are just interesting. See all on GitHub ↗
Dorabella Calibrated Falsification (paper)
GitHub ↗Reproducibility repo for the Dorabella paper. Ships the matched-budget simulated-annealing scripts that produce the §4 calibration result (z = −1.23, below the shuffle mean), the substitution-consistency check that falsifies Packwood (2020), pre-built English/Italian quadgram models, and the cipher transcription (Hauer 2021).
dictcollision
GitHub ↗Python library and CLI for calibrating dictionary hit rates. Given a list of short strings and a reference dictionary, separates real matches from chance collisions. Applications include decipherment, OCR validation, spell-check evaluation, password audits, and fuzzer QA.
Dictionary Collision Effect (paper)
GitHub ↗Research paper quantifying the noise introduced when computational decipherment checks decoded tokens against a large dictionary. Develops a four-category classifier that separates real signal from dictionary-induced collisions, validated across 252 experimental cells against five competing correction methods.
Visa Arbitrage
GitHub ↗Data pipeline cross-referencing WARN layoffs, H-1B petitions, SEC filings, and quarterly financials for S&P 500 companies. 308 layoff events, 607K visa records, 29K WARN notices, one SQLite database.
Voynich Manuscript Analysis
GitHub ↗Syllabary analysis pipeline with signal isolation: 77-phase computational analysis identifying Italian syllabic tachygraphy as the Voynich Manuscript's encoding mechanism.
Voynich Consonant-Skeleton Matching
GitHub ↗Letter-level pipeline matching consonant skeletons against a Latin medical dictionary, the first of two independent approaches to the Voynich Manuscript.
Knowledge Graph Issue Matcher
GitHub ↗Builds knowledge graphs from GitHub repos and employee profiles to match the right people to open issues using LightRAG.
Barry's Song Suggestion PoC
GitHub ↗Proof-of-concept app for Barry's Bootcamp. Participants suggest songs via Spotify, instructors review and approve. Built it and sent it to the CEO.
Resume Customizer
GitHub ↗AI-powered tool that tailors your resume to job descriptions. Extracts ATS keywords, suggests improvements, and tracks applications.
Global Compensation Converter
GitHub ↗Streamlit tool comparing compensation packages between Seattle and Madrid, built to negotiate my own salary for a potential relocation.