Solo ML engineer building hardware-efficient AI. I specialize in MoE architectures and deploying fast, lightweight models on consumer hardware.
LLM Diff is an open-source evaluation engine that runs a parallel behavioral battery against two models and outputs a Behavioral Fingerprint Distance. It probes 6 dimensions: Sycophancy, Refusal, Hallucination, Calibration, Reasoning, and Verbosity.