Photo vs Live Face: How Verification Tells Them Apart
A research view of photo vs live face verification: why blood-flow liveness catches printed photos, screen replays, and masks that fool basic selfie checks.

A printed photo, a high-resolution screen replay, and a silicone mask share one quiet advantage over the human face: they hold still and they hold their color. For a verification system trained only to confirm that a face is present and roughly oriented toward the camera, that is often enough to pass. The gap between what a basic selfie check sees and what a living person actually does sits at the center of the photo vs live face verification problem, and it is the gap fraud teams are paying for in chargebacks, mule accounts, and synthetic onboarding. Understanding how a verification pipeline separates a real face from a reproduction of one is now a procurement question, not an academic one.
In the LivDet-Face 2024 international competition, organizers reported that the most effective presentation attacks, including high-quality replays and masks, defeated multiple submitted detectors, with several systems showing attack presentation classification error rates well above single digits against the hardest spoof categories.
What photo vs live face verification actually measures
The phrase sounds binary, but the decision is built from layered signals. A simple selfie check answers a shallow question: is there a face-shaped object in the frame? More serious liveness detection asks a harder one: is this face attached to a living body right now? Those are different tests, and most spoofs exploit the distance between them.
Presentation attacks fall into three broad families. Two-dimensional artifacts include printed photos and digital images shown on a phone or monitor. Replay attacks present recorded video of a genuine user. Three-dimensional artifacts include paper cutouts, resin or silicone masks, and wearable prosthetics. Each defeats a different assumption. A printed photo defeats systems that only check facial geometry. A replay defeats systems that ask for a blink or a head turn, because the recording already contains those motions. A mask defeats systems that rely on depth or texture alone.
Liveness detection blood flow analysis sits in a different category because it does not test for shape, motion, or texture. It uses remote photoplethysmography (rPPG) to read the faint periodic color change in skin caused by blood pulsing through capillaries with each heartbeat. A genuine face produces a coherent pulse signal across facial regions. A photo, a screen, and a mask do not, because none of them circulate blood. This is why the question of whether you can spoof a face check with a photo has a precise answer that depends entirely on what the check is measuring.
The table below compares how each detection approach responds to the most common attack types.
| Detection method | Printed photo | Screen replay | Silicone/3D mask | Deepfake injection | User friction |
|---|---|---|---|---|---|
| Face presence / geometry only | Often passes spoof | Often passes spoof | Often passes spoof | Often passes spoof | Very low |
| Active challenge (blink, turn, smile) | Usually blocked | Frequently passes | Sometimes passes | Frequently passes | High |
| Texture / moire analysis | Often blocked | Often blocked | Variable | Variable | Low |
| Depth / 3D mapping | Blocked | Blocked | Often passes | Bypasses camera entirely | Medium |
| rPPG blood-flow liveness | Blocked (no pulse) | Blocked (no pulse) | Blocked (no pulse) | Blocked (no coherent pulse) | Low (passive) |
The pattern is the signal. No single non-physiological method blocks every attack family, while a real face versus a printed photo, a replayed screen, or a mask all reduce to the same physical fact: only one of them has a heartbeat the camera can recover.
Why each spoof type breaks a different check
It helps to look at the failure modes individually rather than as one undifferentiated "fraud" bucket.
- Printed photos fail on motion and pulse. They are cheap, fast, and still dominate low-effort attacks against consumer apps. A static challenge defeats them, but so does any signal that requires life.
- Screen replays fail on optics and pulse. They can carry blinks and head turns lifted from a real session, which is why they slip past liveness tests that only ask for movement. Reflections and pixel patterns help, but lighting and high-refresh displays narrow that margin.
- Masks fail on physiology, not appearance. A well-made mask can satisfy depth and texture checks because it has real three-dimensional structure and matte skin-like surfaces. What it cannot reproduce is blood perfusion under the surface.
- Deepfakes and injection attacks bypass the lens. Here the attacker feeds synthetic video directly into the verification stream, so depth and reflection cues never apply. Anti-spoofing facial analysis based on physiological consistency becomes one of the few signals that does not assume a real camera saw a real scene.
Industry applications of blood-flow anti-spoofing
Banking and account opening
Remote account opening is the highest-value target for synthetic identities, where a fabricated face needs to clear onboarding only once to seed a long fraud lifecycle. Passive rPPG liveness fits this flow because it requires no extra user action, preserving conversion while raising the cost of a successful spoof. Fraud teams increasingly treat liveness as a control plane signal feeding the broader risk engine rather than a single pass/fail gate.
KYC and identity verification vendors
Verification vendors face a portfolio problem: they must defend against printed photos at scale and against bespoke masks and deepfakes at the high end, often within the same product. Physiological signals add a layer that generalizes across attack types instead of being tuned to one. That generalization matters because, as competition results repeatedly show, detectors tuned to known attacks degrade sharply against novel ones.
Fintech and high-risk onboarding
Crypto, lending, and gambling platforms operate under multi-accounting and bonus-abuse pressure where the same operator presents many fabricated faces. A heartbeat-based check is difficult to mass-produce, because every spoof artifact, no matter how polished, lacks the underlying perfusion signal.
Current research and evidence
Physiological liveness has moved from niche idea to active research program. Julian Fierrez and collaborators at the Universidad Autonoma de Madrid have published work on exploiting physiology for presentation attack detection, including the PAD-Phys line of research, which evaluates how heart-rate signals recovered from video separate genuine subjects from artifacts. Researchers at Hochschule Bonn-Rhein-Sieg have studied combining time-of-flight depth features with rPPG, arguing that pairing physiological and geometric cues improves robustness over either alone.
Work on mask attacks specifically has shown that vascular-weighted, motion-robust rPPG signals can detect 3D mask presentations that defeat appearance-based methods (arXiv preprint 2305.15940, 2023). Earlier foundational research at the University of Oulu helped establish rPPG correspondence features for 3D mask detection, demonstrating that pulse coherence across facial regions distinguishes living skin from synthetic surfaces.
The LivDet-Face 2024 competition provides the sober counterweight. Its results confirmed that the strongest attacks still defeat many fielded detectors, and that generalization to unknown spoof types remains the central unsolved challenge. The evidence does not claim any one method is unbeatable. It supports a narrower and more useful conclusion: physiological signals close attack surfaces that geometry, motion, and texture leave open, which is why layered systems outperform single-method ones.
The future of photo vs live face verification
Three shifts are likely to define the next phase. First, the threat is moving off the camera. As injection and deepfake attacks grow, defenses that depend on a genuine optical capture lose ground, and signals tied to biological consistency gain relative value. Second, passive methods will continue displacing active challenges, because friction directly suppresses legitimate conversion and active prompts can be replayed. Third, evaluation will harden around standards such as ISO/IEC 30107-3 and independent competitions, pushing vendors to report performance against unknown attacks rather than curated ones.
The durable principle is simple. Reproductions of a face can imitate shape, motion, and surface, but reproducing live blood flow in a fabricated artifact remains far harder than printing a photo or rendering a deepfake. That asymmetry is what gives physiological liveness its staying power as other signals erode.
Frequently asked questions
Can you spoof a face check with a photo?
A printed or on-screen photo can defeat a basic check that only confirms a face is present. It cannot defeat liveness detection that reads blood flow, because a photo has no pulse signal for the system to recover, regardless of resolution or lighting.
How does rPPG tell a real face from a printed photo or mask?
rPPG measures tiny periodic color changes in skin caused by blood circulating with each heartbeat. A real face versus a printed photo, a screen replay, or a mask differ on this one physical fact: only living skin produces a coherent pulse signal across facial regions, so the artifact fails the check even when it looks convincing.
Why are masks harder to detect than photos?
Masks have genuine three-dimensional structure and skin-like surfaces, so they can satisfy depth and texture checks that block flat photos. Anti-spoofing facial analysis based on physiology catches them because the mask material has no blood perfusion underneath, which research on vascular-weighted rPPG signals has demonstrated.
Does blood-flow liveness work against deepfakes?
Deepfakes and injection attacks often bypass the camera entirely, defeating depth and reflection cues. Physiological consistency is one of the few signals that does not assume a real lens captured a real scene, because a synthetic face lacks the coherent heartbeat signal a living subject produces.
Circadify is building toward this space with rPPG-based liveness that reads real blood flow rather than relying on a person to blink or turn on cue. Identity verification vendors, banks, and fraud teams evaluating stronger anti-spoofing can see how it performs against photos, replays, masks, and deepfakes through an enterprise security demo.
