Writing
Notes from measuring on-device inference across real phones and browsers.
-
WebGPU feature detection was not enough to run small LLMs on phones
Four cases where the device said yes and the run said no.
Notes from measuring on-device inference across real phones and browsers.
Four cases where the device said yes and the run said no.