Edge AI Face Authentication
A quantized convolutional neural network for secure facial authentication, optimized for real-time inference on low-power NPU hardware in healthcare edge devices.
WHAT I BUILT
I developed a lightweight convolutional neural network model for facial recognition-based patient authentication. The model was designed from the ground up to run directly on edge healthcare devices, enabling secure identity verification without requiring cloud round-trips or external connectivity.
The authentication system includes a complete enrollment and matching pipeline. During enrollment, facial embeddings are securely captured and stored on-device. At authentication time, the model generates a new embedding from the live camera feed and performs a similarity match against stored profiles, all within milliseconds on the local hardware.
This edge-first approach was driven by the healthcare context, where network reliability cannot be guaranteed and where keeping biometric data on-device enhances both security and privacy compliance.
TECHNICAL APPROACH
I trained a CNN architecture tailored for facial feature extraction, balancing model capacity against the strict computational constraints of the target edge hardware. The architecture was designed with depthwise separable convolutions and efficient attention mechanisms to maximize accuracy within a small parameter budget.
To achieve real-time performance on the proprietary low-power NPU hardware, I applied INT8 quantization techniques. This involved quantization-aware training to minimize accuracy degradation, careful calibration of quantization parameters, and extensive validation to ensure the quantized model maintained authentication accuracy suitable for healthcare security requirements.
The deployment pipeline included secure model packaging, on-device inference optimization, and a matching algorithm tuned to balance false acceptance and false rejection rates appropriate for a healthcare identity verification context.
IMPACT
The system enabled real-time patient identity verification running entirely on edge devices, eliminating the need for cloud-based authentication and the associated latency, connectivity dependencies, and data transfer concerns.
The quantized model runs at interactive speeds on the NPU hardware, delivering inference results fast enough for a seamless user experience during patient check-in and authentication workflows. This performance was achieved while maintaining authentication accuracy that meets the security standards required in healthcare environments.
By keeping all biometric processing on-device, the system provides a privacy-preserving authentication solution that aligns with healthcare data protection requirements and reduces the attack surface for sensitive patient identity data.